haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-16 20:17:00 +02:00

Author	SHA1	Message	Date
Baptiste Assmann	e6baecfe23	BUILD: fix build issue without USE_OPENSSL The SSL check referenced use_ssl which only exists when USE_OPENSSL is set.	2012-10-05 11:48:04 +02:00
Willy Tarreau	6c16adc661	MEDIUM: checks: enable the PROXY protocol with health checks When health checks are configured on a server which has the send-proxy directive and no "port" nor "addr" settings, the health check connections will automatically use the PROXY protocol. If "port" or "addr" are set, the "check-send-proxy" directive may be used to force the protocol.	2012-10-05 00:33:14 +02:00
Willy Tarreau	763a95bfde	MEDIUM: checks: add the "check-ssl" server option This option forces health checks to be sent over SSL even if the address or port are not the standard ones.	2012-10-05 00:33:14 +02:00
Willy Tarreau	f150317671	MAJOR: checks: completely use the connection transport layer With this change, we now use the connection's transport layer to receive and send data during health checks. It even becomes possible to send data in multiple times, which was not possible before. The transport layer used is the same as the one used for the traffic, unless a specific address and/or port is specified for the checks using "port" or "addr", in which case the transport layer defaults to raw_sock. An option will be provided to force SSL checks on different IP/ports later. Connection errors and timeouts are still reported. Some situations where strerror() was able to report a precise error after a failed connect() in the past might not be reported with as much precision anymore, but the error message was already meaningless. During the tests, no situation was found where a message became less precise.	2012-10-05 00:33:14 +02:00
Willy Tarreau	f4288ee4ba	MEDIUM: check: add the ctrl and transport layers in the server check structure Since it's possible for the checks to use a different protocol or transport layer than the prod traffic, we need to have them referenced in the server. The SSL checks are not enabled yet, but the transport layers are completely used.	2012-10-05 00:33:14 +02:00
Willy Tarreau	1ae1b7b53c	MEDIUM: checks: use real buffers to store requests and responses Till now the request was made in the trash and sent to the network at once, and the response was read into a preallocated char[]. Now we allocate a full buffer for both the request and the response, and make use of it. Some of the operations will probably be replaced later with buffer macros but the point was to ensure we could migrate to use the data layers soon. One nice improvement caused by this change is that requests are now formed at the beginning of the check and may safely be sent in multiple chunks if needed.	2012-10-05 00:33:14 +02:00
Willy Tarreau	5b3a202f78	REORG: server: move the check-specific parts into a check subsection The health checks in the servers are becoming a real mess, move them into their own subsection. We'll soon need to have a struct buffer to replace the char * as well as check-specific protocol and transport layers.	2012-10-05 00:33:14 +02:00
Willy Tarreau	fb56aab443	MAJOR: checks: make use of the connection layer to send checks This is a first step, we now use the connection layer without the data layers (send/recv are still used by hand). The connection is established using tcp_connect_server() and raw_sock is assumed and forced for now. fdtab is not manipulated anymore and polling is managed via the connection layer. It becomes quite clear that the server needs a second ->ctrl and ->xprt dedicated to the checks.	2012-10-05 00:33:14 +02:00
Willy Tarreau	5f1504f524	MEDIUM: connection: add a new local send-proxy transport callback This callback sends a PROXY protocol line on the outgoing connection, with the local and remote endpoint information. This is used for local connections (eg: health checks) where the other end needs to have a valid address and no connection is relayed.	2012-10-05 00:32:35 +02:00
Willy Tarreau	e1e4a61e7a	REORG: connection: move the PROXY protocol management to connection.c It was previously in frontend.c but there is no reason for this anymore considering that all the information involved is in the connection itself only. Theorically this should be in the socket layer but we don't have this yet.	2012-10-05 00:32:33 +02:00
Willy Tarreau	0ffde2cc3f	MEDIUM: connection: automatically disable polling on error We absolutely want to disable FD polling after an error is detected, otherwise the data layer has to do it and it's far from being obvious at these layers. The way we did it was a bit tricky in conn_update__polling and conn__polling_changes. However it has almost no impact on performance and code size both for the fast and slow path. We'll now be able to remove some flag updates in the stream interface.	2012-10-04 22:26:11 +02:00
Willy Tarreau	665e6ee7aa	MEDIUM: connection: it's not the data layer's role to validate the connection Till now we used to perform the L4_CONN check in the data layer (eg: stream interface) but that does not make sense, because some transport layers will imply that the connection is opened (eg: SSL), and also because the complexity to check for this is higher in the data layer than in the transport layer. This is so much true that some read0 cases did not validate the connection. So as of now, the transport layer is responsible for clearing L4_CONN when it detects an activity, and the data layer may safely rely on this flag. This only impacts a minor change in raw_sock and stream_interface for now.	2012-10-04 22:26:11 +02:00
Willy Tarreau	78eaebed13	MEDIUM: connection: don't call the data->init callback upon error We don't call ->init() anymore upon error since we already call ->wake().	2012-10-04 22:26:10 +02:00
Willy Tarreau	9683e9a05f	MEDIUM: session: register a data->wake callback to process errors The connection layer will soon call ->wake() only when errors happen, and not ->init(). So make the session layer use this callback to detect errors and abort connections.	2012-10-04 22:26:10 +02:00
Willy Tarreau	2396c1c4a2	MEDIUM: connection: make it possible for data->wake to return an error Just like ->init(), ->wake() may now be used to return an error and abort the connection. Currently this is not used but will be with embryonic sessions.	2012-10-04 22:26:10 +02:00
Willy Tarreau	9e272bf95d	MEDIUM: connection: only call the data->wake callback on activity We now check the connection flags for changes in order not to call the data->wake callback when there is no activity. Activity means a change on any of the CO_FL__SH, CO_FL_ERROR, CO_FL_CONNECTED, CO_FL_WAIT_CONN flags, as well as a call to data->recv or data->send.	2012-10-04 22:26:10 +02:00
Willy Tarreau	071e137ec2	MEDIUM: connection: use a generic data-layer init() callback The generic data-layer init callback is now used after the transport layer is complete and before calling the data layer recv/send callbacks. This allows the session to switch from the embryonic session data layer to the complete stream interface data layer, by making conn_session_complete() the data layer's init callback. It sill looks awkwards that the init() callback must be used opon error, but except by adding yet another one, it does not seem to be mergeable into another function (eg: it should probably not be merged with ->wake to avoid unneeded calls during the handshake, though semantically that would make sense).	2012-10-04 22:26:10 +02:00
Willy Tarreau	5e75e2755e	MEDIUM: session: use a specific data_cb for embryonic sessions We don't want to have the recv or send callbacks in embryonic sessions, and we want the stream interface to be referenced as the connection owner only once the session is instanciated. So let's first have the embryonic session be the owner, then replaced later by the stream interface once the transport layer is ready.	2012-10-04 22:26:10 +02:00
Willy Tarreau	4aa3683b2d	MINOR: connection: provide a generic data layer wakeup callback Instead of calling conn_notify_si() from the connection handler, we now call data->wake(), which will allow us to use a different callback with health checks. Note that we still rely on a flag in order to decide whether or not to call this function. The reason is that with embryonic sessions, the callback is already initialized to si_conn_cb without the flag, and we can't call the SI notify function in the leave path before the stream interface is initialized. This issue should be addressed by involving a different data_cb for embryonic sessions and for stream interfaces, that would be changed during session_complete() for the final data_cb.	2012-10-04 22:26:10 +02:00
Willy Tarreau	74beec32a5	REORG: connection: rename app_cb "data" Now conn->data will designate the data layer which is the client for the transport layer. In practice it's the stream interface and will soon also be the health checks.	2012-10-04 22:26:10 +02:00
Willy Tarreau	f7bc57ca6e	REORG: connection: rename the data layer the "transport layer" While working on the changes required to make the health checks use the new connections, it started to become obvious that some naming was not logical at all in the connections. Specifically, it is not logical to call the "data layer" the layer which is in charge for all the handshake and which does not yet provide a data layer once established until a session has allocated all the required buffers. In fact, it's more a transport layer, which makes much more sense. The transport layer offers a medium on which data can transit, and it offers the functions to move these data when the upper layer requests this. And it is the upper layer which iterates over the transport layer's functions to move data which should be called the data layer. The use case where it's obvious is with embryonic sessions : an incoming SSL connection is accepted. Only the connection is allocated, not the buffers nor stream interface, etc... The connection handles the SSL handshake by itself. Once this handshake is complete, we can't use the data functions because the buffers and stream interface are not there yet. Hence we have to first call a specific function to complete the session initialization, after which we'll be able to use the data functions. This clearly proves that SSL here is only a transport layer and that the stream interface constitutes the data layer. A similar change will be performed to rename app_cb => data, but the two could not be in the same commit for obvious reasons.	2012-10-04 22:26:09 +02:00
Willy Tarreau	6f5d141149	MEDIUM: raw_sock: improve connection error reporting When a connection setup is pending and we receive an error without a POLL_IN flag, we're certain there will be nothing to read from it and we can safely report an error without attempting a recv() call. This will be significantly better for health checks which will avoid a useless recv() on all failed checks.	2012-10-04 22:26:09 +02:00
Willy Tarreau	c0e98868fe	MINOR: raw_sock: always report asynchronous connection errors Depending on the pollers used, a connection error may be notified with POLLOUT\|POLLERR\|POLLHUP. POLLHUP by itself is enough for the connection handler to call the read actor, which would only consider this flag as a good indication of a hangup, without considering the POLLERR flag. In order to address this, we directly jump to the read0 label if POLLERR was not set. This will be important with health checks as we don't want to believe a connection was properly established when it's not the case !	2012-10-04 22:26:09 +02:00
Willy Tarreau	c39b0d17f2	MINOR: signal: really ignore signals configured with no handler Until now, signals configured with no handler were still enabled and ignored upon signal reception. Until now it was not an issue but with SSL causing many EPIPE all the time, it becomes obvious that signal processing comes with a cost. So set the handler to SIG_IGN when the function is NULL.	2012-10-04 22:26:09 +02:00
Willy Tarreau	f8cfa447c6	BUG/MINOR: epoll: correctly disable FD polling in fd_rem() When calling fd_rem(), the polling was not correctly disabled because the ->prev state was set to zero instead of the previous value. fd_rem() is very rarely used, only just before closing a socket. The effect is that upon an error reported at the connection level, if the task assigned to the connection was too slow to be woken up because of too many other tasks in the run queue, the FD was still not disabled and caused the connection handler to be called again with the same event until the task was finally executed to close the fd. This issue only affects the epoll poller, not the sepoll variant nor any of the other ones. It was already present in 1.4 and even 1.3 with the same almost unnoticeable effects. The bug can in fact only be discovered during development where it emphasizes other bugs. It should be backported anyway.	2012-10-04 22:26:09 +02:00
Willy Tarreau	050536d582	MEDIUM: proxy: add the global frontend to the list of normal proxies Since recent changes on the global frontend, it was not possible anymore to soft-reload a process which had a stats socket because the socket would not be disabled upon reload. The only solution to this endless madness is to have the global frontend part of normal proxies. Since we don't want to get an ID that shifts all other proxies and causes trouble in deployed environments, we assign it ID #0 which other proxies can't grab, and we don't report it in the stats pages.	2012-10-04 08:58:23 +02:00
Willy Tarreau	b3fb60bdcd	BUG/MEDIUM: listener: don't pause protocols that do not support it Pausing a UNIX_STREAM socket results in a major pain because the socket does not correctly resume, it wakes poll() but return EAGAIN on accept(), resulting in a busy loop. So let's only pause protocols that support it. This issues has existed since UNIX sockets were introduced on bind lines.	2012-10-04 08:58:21 +02:00
Willy Tarreau	8113a5d78f	BUG/MINOR: config: use a copy of the file name in proxy configurations Each proxy contains a reference to the original config file and line number where it was declared. The pointer used is just a reference to the one passed to the function instead of being duplicated. The effect is that it is not valid anymore at the end of the parsing and that all proxies will be enumerated as coming from the same file on some late configuration errors. This may happen for exmaple when reporting SSL certificate issues. By copying using strdup(), we avoid this issue. 1.4 has the same issue, though no report of the proxy file name is done out of the config section. Anyway a backport is recommended to ease post-mortem analysis.	2012-10-04 08:13:32 +02:00
Willy Tarreau	d1a33e35fb	BUG/MEDIUM: proxy: must not try to stop disabled proxies upon reload Herv� Commowick reported an issue : haproxy dies in a segfault during a soft restart if it tries to pause a disabled proxy. This is because disabled proxies have no management task so we must not wake the task up. This could easily remain unnoticed since the old process was expected to go away, so having it go away faster was not really troubling. However, with sync peers, it is obvious that there is no peer sync during this reload. This issue has been introduced in 1.5-dev7 with the removal of the maintain_proxies() function. No backport is needed.	2012-10-04 00:20:55 +02:00
Willy Tarreau	8923019a1d	BUG/MINOR: ssl: report the L4 connection as established when possible If we get an SSL error during the handshake, we at least try to see if a syscall reported an error or not. In case of an error, it generally means that the connection failed. If there is no error, then the connection established successfully. The difference is important for health checks which report the precise cause to the logs and to the stats.	2012-10-02 19:54:38 +02:00
Emeric Brun	051cdab68b	BUG/MINOR: build: Fix compilation issue on openssl 0.9.6 due to missing CRL feature.	2012-10-02 19:54:38 +02:00
Emeric Brun	561e574e2f	BUG/MINOR: ssl: Fix CRL check was not enabled when crlfile was specified.	2012-10-02 16:05:51 +02:00
Emeric Brun	2d0c482682	MINOR: ssl: add statement 'no-tls-tickets' on bind to disable stateless session resumption Disables the stateless session resumption (RFC 5077 TLS Ticket extension) and force to use stateful session resumption. Stateless session resumption is more expensive in CPU usage.	2012-10-02 16:05:33 +02:00
Emeric Brun	c6678e21bb	MEDIUM: config: authorize frontend and listen without bind. This allows to easily add/remove "bind" entries to a frontend without being forced to remove it when the last entry is temporarily removed. While "disabled" may sometimes work in a frontend, it becomes trickier on "listen" sections which can also hold servers and be referenced by other frontends. Note that a "listen" section with no "bind" is equivalent to a "backend" section. Configs without any listeners are still reported as invalid and refuse to load.	2012-10-02 08:34:39 +02:00
Emeric Brun	c0ff4924c0	MINOR: ssl : add statements 'notlsv11' and 'notlsv12' and rename 'notlsv1' to 'notlsv10'. This is because "notlsv1" used to disable TLSv1.0 only and had no effect on v1.1/v1.2. so better have an option for each version. This applies both to "bind" and "server" statements.	2012-10-02 08:34:38 +02:00
Emeric Brun	9faf071acb	MINOR: ssl: add build param USE_PRIVATE_CACHE to build cache without shared memory It removes dependencies with futex or mutex but ssl performances decrease using nbproc > 1 because switching process force session renegotiation. This can be useful on small systems which never intend to run in multi-process mode.	2012-10-02 08:34:38 +02:00
Emeric Brun	4b3091e54e	MINOR: ssl: disable shared memory and locks on session cache if nbproc == 1 We don't needa to lock the memory when there is a single process. This can make a difference on small systems where locking is much more expensive than just a test.	2012-10-02 08:34:38 +02:00
Emeric Brun	f282a810b7	MINOR: ssl: add fetches and ACLs to return verify errors Add fetch 'ssl_verify_caerr': returns the first ssl verify error at depth > 0 (CA chain). Add fetch 'ssl_verify_caerr_depth': returns the first ssl verify error depth (max returns is 15 if depth > 15). Add fetch 'ssl_verify_crterr': returns the fist ssl verify error at depth == 0.	2012-10-02 08:34:37 +02:00
Emeric Brun	baf8ffb673	MINOR: ssl: add fetch and ACL 'ssl_verify_result' This fetch returns the final ssl verify error.	2012-10-02 08:34:37 +02:00
Emeric Brun	81c00f0a7a	MINOR: ssl: add ignore verify errors options Allow to ignore some verify errors and to let them pass the handshake. Add option 'crt-ignore-err <list>' Ignore verify errors at depth == 0 (client certificate) <list> is string 'all' or a comma separated list of verify error IDs (see http://www.openssl.org/docs/apps/verify.html) Add option 'ca-ignore-err <list>' Same as 'crt-ignore-err' for all depths > 0 (CA chain certs) Ex ignore all errors on CA and expired or not-yet-valid errors on client certificate: bind 0.0.0.0:443 ssl crt crt.pem verify required cafile ca.pem ca-ignore-err all crt-ignore-err 10,9	2012-10-02 08:32:50 +02:00
Emeric Brun	e64aef124a	MINOR: ssl: add fetch and ACL 'client_crt' to test a client cert is present Useful in case of 'verify optional' to know if the client sent a certificate.	2012-10-02 08:32:50 +02:00
Emeric Brun	d94b3fe98f	MEDIUM: ssl: add client certificate authentication support Add keyword 'verify' on bind: 'verify none': authentication disabled (default) 'verify optional': accept connection without certificate and process a verify if the client sent a certificate 'verify required': reject connection without certificate and process a verify if the client send a certificate Add keyword 'cafile' on bind: 'cafile <path>' path to a client CA file used to verify. 'crlfile <path>' path to a client CRL file used to verify.	2012-10-02 08:04:49 +02:00
Emeric Brun	2b58d040b6	MINOR: ssl: add elliptic curve Diffie-Hellman support for ssl key generation Add 'ecdhe' on 'bind' statement: to set named curve used to generate ECDHE keys (ex: ecdhe secp521r1)	2012-10-02 08:03:21 +02:00
Emeric Brun	a4bcd9a5a8	MINOR: ssl: try to load Diffie-Hellman parameters from cert file Feature is disabled if openssl compiled with OPENSSL_NO_DH.	2012-10-02 08:01:42 +02:00
Willy Tarreau	e603e69d18	MEDIUM: connection: make use of the owner instead of container_of This way the connection can become independant on the stream interface.	2012-09-28 00:01:23 +02:00
Willy Tarreau	82569f9158	MEDIUM: monitor: simplify handling of monitor-net and mode health We were having several different behaviours with monitor-net and "mode health" : - monitor-net on TCP connections was evaluated just after accept(), did not count a connection on the frontend and were not subject to tcp-request connection rules, and caused an immediate close(). - monitor-net in HTTP mode was evaluated once the session was accepted (eg: on top of SSL), returned "HTTP/1.0 200 OK\r\n\r\n" over the connection's data layer and instanciated a session which was responsible for closing this connection. A connection AND a session were counted for the frontend ; - "mode health" with "option httpchk" would do exactly the same as monitor-net in HTTP mode ; - "mode health" without "option httpchk" would do the same as above except that "OK" was returned instead of "HTTP/1.0 200 OK\r\n\r\n". None of them took care of cleaning the input buffer, sometimes resulting in a TCP reset to be emitted after the last packet if a request was received over the connection. Given the inconsistencies and the complexity in keeping all these features handled at the right position, we now slightly changed the way they are handled : - all of them are handled just after the "tcp-request connection" rules, so that all of them may be blocked using such rules, offering more flexibility and consistency ; - no connection handshake is performed anymore for non-TCP modes - all of them send the response as raw data over the socket, there is no more difference between TCP and HTTP mode for example (these rules were never meant to be served over SSL connections and were never documented as able to do that). - any possible pending data on the incoming socket is drained before the response is sent, in order to avoid the risk of a reset. - none of them exactly did what was documented ! This results in more consistent, more flexible and more accurate handling of monitor rules, with smaller and more robust code.	2012-09-28 00:01:22 +02:00
Willy Tarreau	b8ffd378f0	BUG/MAJOR: http: chunk parser was broken with buffer changes Since at least commit `a458b679`, msg->sov could become negative in http_parse_chunk_size() if a chunk size wrapped around the buffer. The effect is that at some point channel_forward() was called with a negative size, causing all data to be transferred without being analyzed anymore. Since haproxy does not support keep-alive with the server yet, this issue is not really noticeable, as the server closes the connection in response. Still, when tunnel mode is used or when pretent-keepalive is used, it is possible to see the problem. This issue was reported and diagnosed by William Lallemand at Exceliance.	2012-09-27 15:08:56 +02:00
Willy Tarreau	3c7a79dbb1	MINOR: cli: allow to set frontend maxconn to zero It is sometimes useful to completely disable accepting new connections on a frontend during maintenance operations. By setting a frontend's maxconn to zero, connections are not accepted anymore until the limit is increased again.	2012-09-26 21:07:15 +02:00
Willy Tarreau	a7944ad9ef	BUG: stats: fix regression introduced by commit `4348fad1` Recent commit `4348fad1` (listeners: use dual-linked lists to chain listeners with frontends) broke frontend lookup in stats sockets by using the wrong iterator in the listeners.	2012-09-26 21:03:11 +02:00
Willy Tarreau	3631d41778	CLEANUP: config: fix typo inteface => interface This was in an error message.	2012-09-25 16:31:00 +02:00
Willy Tarreau	173e7fbd94	BUG/MINOR: config: check the proper pointer to report unknown protocol Check the protocol pointer and not the socket to report an unknown family in servers or peers. This can never happen anyway, it's just to be completely clean.	2012-09-24 22:49:06 +02:00
Willy Tarreau	e92693af26	BUG: http: do not print garbage on invalid requests in debug mode Cyril Bont� reported a mangled debug output when an invalid request was sent with a faulty request line. The reason was the use of the msg->sl.rq.l offset which was not yet initialized in this case. So we change the way to report such an error so that first we initialize it to zero before parsing a message, then we use that to know whether we can trust it or not. If it's still zero, then we display the whole buffer, truncated by debug_hdr() to the first CR or LF character, which results in the first line only. The same operation was performed for the response, which was wrong too.	2012-09-24 21:16:42 +02:00
Cyril Bonté	3aaba440a2	BUILD: fix compilation error with DEBUG_FULL Recent changes in structures broke the compilation when using DEBUG_FULL. Let's update apply the changes also to the variables used in DPRINTF calls.	2012-09-24 20:36:39 +02:00
Willy Tarreau	d578120a3e	MEDIUM: stats: make use of the standard "bind" parsers to parse global socket The global stats socket statement now makes use of the standard bind parsers. This results in all UNIX socket options being set by proto_uxst and in all TCP and SSL options being inherited and usable. For example it is now possible to enable a stats socket over SSL/TCP by appending the "ssl" keyword and a certificate after "crt". The code is simplified since we don't have a special case to parse this config keyword anymore.	2012-09-24 10:53:17 +02:00
Willy Tarreau	81796be87c	MINOR: ssl: set the listeners' data layer to ssl during parsing It's better to set all listeners to ssl_sock when seeing the "ssl" keyword that to loop on all of them afterwards just for this. This also removes some #ifdefs.	2012-09-24 10:53:17 +02:00
Willy Tarreau	c53d42256d	MEDIUM: stats: remove the stats_sock struct from the global struct Now the stats socket is allocated when the 'stats socket' line is parsed, and assigned using the standard str2listener(). This has two effects : - more than one stats socket can now be declared - stats socket now support protocols other than UNIX The next step is to remove the duplicate bind config parsing.	2012-09-24 10:53:16 +02:00
Willy Tarreau	4fbb2285e2	MINOR: config: make str2listener() use memprintf() to report errors. This will make it possible to use the function for other listening sockets.	2012-09-24 10:53:16 +02:00
Willy Tarreau	eb6cead1de	MINOR: standard: make memprintf() support a NULL destination Doing so removes many checks that were systematically made because the callees don't know if the caller passed a valid pointer.	2012-09-24 10:53:16 +02:00
Willy Tarreau	ce39bfb7c4	BUG: backend: balance hdr was broken since 1.5-dev11 Alex Markham reported and diagnosed a bug appearing on 1.5-dev11, causing a crash on x86_64 when header hashing is used. The cause is a missing (int) cast causing a negative offset to appear positive and the resulting pointer to go out of bounds. The crash is not possible anymore since 1.5-dev12 because a second bug caused the negative sign to disappear so the pointer is always within range but always wrong, so balance hdr() never works anymore. This fix restores the correct behaviour and ensures the sign is correct.	2012-09-22 18:36:29 +02:00
Willy Tarreau	290e63aa87	REORG: listener: move unix perms from the listener to the bind_conf Unix permissions are per-bind configuration line and not per listener, so let's concretize this in the way the config is stored. This avoids some unneeded loops to set permissions on all listeners. The access level is not part of the unix perms so it has been moved away. Once we can use str2listener() to set all listener addresses, we'll have a bind keyword parser for this one.	2012-09-20 18:07:14 +02:00
Willy Tarreau	4348fad1c1	MAJOR: listeners: use dual-linked lists to chain listeners with frontends Navigating through listeners was very inconvenient and error-prone. Not to mention that listeners were linked in reverse order and reverted afterwards. In order to definitely get rid of these issues, we now do the following : - frontends have a dual-linked list of bind_conf - frontends have a dual-linked list of listeners - bind_conf have a dual-linked list of listeners - listeners have a pointer to their bind_conf This way we can now navigate from anywhere to anywhere and always find the proper bind_conf for a given listener, as well as find the list of listeners for a current bind_conf.	2012-09-20 16:48:07 +02:00
Willy Tarreau	81a8117b41	MINOR: config: set the bind_conf entry on listeners created from a "listen" line. Otherwise we would risk a segfault when checking the config's validity (eg: when looking for conflicts on ID assignments). Note that the same issue exists with peers_fe and the global stats_fe. All listeners should be reviewed and simplified to use a compatible declaration mode.	2012-09-18 20:56:12 +02:00
Willy Tarreau	a020fbd593	MINOR: stats: fill the file and line numbers in the stats frontend The stats frontend struct has config file and line which were not set. They're not used right now but better fill them correctly anyway.	2012-09-18 20:05:00 +02:00
Willy Tarreau	28a47d6408	MINOR: config: pass the file and line to config keyword parsers This will be needed when we need to create bind config settings.	2012-09-18 20:02:48 +02:00
Willy Tarreau	51fb7651c4	MINOR: listener: add a scope field in the bind keyword lists This scope is used to report what the keywords are used for (eg: TCP, UNIX, ...). It is now reported by bind_dump_kws().	2012-09-18 18:27:14 +02:00
Willy Tarreau	8638f4850f	MEDIUM: config: enumerate full list of registered "bind" keywords upon error When an unknown "bind" keyword is detected, dump the list of all registered keywords. Unsupported default alternatives are also reported as "not supported".	2012-09-18 18:27:14 +02:00
Willy Tarreau	d0a895d25f	MEDIUM: config: move all unix-specific bind keywords to proto_uxst.c The "mode", "uid", "gid", "user" and "group" bind options were moved to proto_uxst as they are unix-specific. Note that previous versions had a bug here, only the last listener was updated with the specified settings. However, it almost never happens that bind lines contain multiple UNIX socket paths so this is not that much of a problem anyway.	2012-09-18 18:26:08 +02:00
Willy Tarreau	3dcc341720	MEDIUM: config: move the common "bind" settings to listener.c These ones are better placed in listener.c than in cfgparse.c, by relying on the bind keyword registration subsystem.	2012-09-18 17:17:28 +02:00
Willy Tarreau	dda322dec0	MINOR: config: improve error reporting for "bind" lines We now report the bind argument, which was missing in all error reports. It is now much more convenient to spot configuration mistakes.	2012-09-18 16:34:09 +02:00
Willy Tarreau	79eeafacb4	MEDIUM: move bind SSL parsing to ssl_sock Registering new SSL bind keywords was not particularly handy as it required many #ifdef in cfgparse.c. Now the code has moved to ssl_sock.c which calls a register function for all the keywords. Error reporting was also improved by this move, because the called functions build an error message using memprintf(), which can span multiple lines if needed, and each of these errors will be displayed indented in the context of the bind line being processed. This is important when dealing with certificate directories which can report multiple errors.	2012-09-18 16:20:01 +02:00
Willy Tarreau	4479124cda	MEDIUM: config: move the "bind" TCP parameters to proto_tcp Now proto_tcp.c is responsible for the 4 settings it handles : - defer-accept - interface - mss - transparent These ones do not need to be handled in cfgparse anymore. If support for a setting is disabled by a missing build option, then cfgparse correctly reports : [ALERT] 255/232700 (2701) : parsing [echo.cfg:114] : 'bind' : 'transparent' option is not implemented in this version (check build options).	2012-09-15 22:33:16 +02:00
Willy Tarreau	269826659d	MEDIUM: listener: add a minimal framework to register "bind" keyword options With the arrival of SSL, the "bind" keyword has received even more options, all of which are processed in cfgparse in a cumbersome way. So it's time to let modules register their own bind options. This is done very similarly to the ACLs with a small difference in that we make the difference between an unknown option and a known, unimplemented option.	2012-09-15 22:33:08 +02:00
Willy Tarreau	88500de69e	CLEANUP: listener: remove unused conf->file and conf->line These ones are already in bind_conf.	2012-09-15 22:29:33 +02:00
Willy Tarreau	2a65ff014e	MEDIUM: config: replace ssl_conf by bind_conf Some settings need to be merged per-bind config line and are not necessarily SSL-specific. It becomes quite inconvenient to have this ssl_conf SSL-specific, so let's replace it with something more generic.	2012-09-15 22:29:33 +02:00
Willy Tarreau	d1d5454180	REORG: split "protocols" files into protocol and listener It was becoming confusing to have protocols and listeners in the same files, split them.	2012-09-15 22:29:32 +02:00
Willy Tarreau	21c705b0f8	MINOR: config: add a function to indent error messages Bind parsers may return multiple errors, so let's make use of a new function to re-indent multi-line error messages so that they're all reported in their context.	2012-09-15 22:29:27 +02:00
Willy Tarreau	3e394c903f	BUG/MAJOR: ssl: missing tests in ACL fetch functions Baptiste Assmann observed a crash of 1.5-dev12 occuring when the ssl_sni fetch was used with no SNI on the input connection and without a prior has_sni check. A code review revealed several issues : 1) it was possible to call the has_sni and ssl_sni fetch functions with a NULL data_ctx if the handshake fails or if the connection is aborted during the handshake. 2) when no SNI is present, strlen() was called with a NULL parameter in smp_fetch_ssl_sni().	2012-09-15 08:57:46 +02:00
Willy Tarreau	2e1dca8f52	MEDIUM: http: add "redirect scheme" to ease HTTP to HTTPS redirection For instance : redirect scheme https if !{ is_ssl }	2012-09-12 08:43:15 +02:00
Willy Tarreau	69845dfcf3	DOC: add a special acknowledgement for the stud project Really, the quality of their code deserves it, it would have been much harder to figure how to get all the things right at once without looking there from time to time !	2012-09-10 09:44:59 +02:00
Willy Tarreau	7875d0967f	MEDIUM: ssl: add sample fetches for is_ssl, ssl_has_sni, ssl_sni_* This allows SNI presence and value to be checked on incoming SSL connections. It is usable both for ACLs and stick tables.	2012-09-10 09:27:02 +02:00
Willy Tarreau	1ee0e302a1	BUILD: report openssl build settings in haproxy -vv Since it's common enough to discover that some config options are not supported due to some openssl version or build options, we report the relevant ones in "haproxy -vv".	2012-09-10 09:27:02 +02:00
Emeric Brun	fc0421fde9	MEDIUM: ssl: add support for SNI and wildcard certificates A side effect of this change is that the "ssl" keyword on "bind" lines is now just a boolean and that "crt" is needed to designate certificate files or directories. Note that much refcounting was needed to have the free() work correctly due to the number of cert aliases which can make a context be shared by multiple names.	2012-09-10 09:27:02 +02:00
Willy Tarreau	f5ae8f7637	MEDIUM: config: centralize handling of SSL config per bind line SSL config holds many parameters which are per bind line and not per listener. Let's use a per-bind line config instead of having it replicated for each listener. At the moment we only do this for the SSL part but this should probably evolved to handle more of the configuration and maybe even the state per bind line.	2012-09-08 08:31:50 +02:00
Willy Tarreau	aa52bef622	BUILD: shut a gcc warning introduced by commit `269ab31` Usual warning on unchecked write() on which no operation is possible.	2012-09-08 08:24:51 +02:00
Willy Tarreau	50acaaae5e	MINOR: config: make the tasks "nice" value configurable on "bind" lines. This is very convenient to reduce SSL processing priority compared to other traffic. This applies to CPU usage only, but has a direct impact on latency under congestion.	2012-09-06 14:28:58 +02:00
Willy Tarreau	58363cf193	MEDIUM: connection: improve error handling around the data layer Better avoid calling the data functions upon error or handshake than having to put conditions everywhere, which are too easy to forget (one check for CO_FL_ERROR was missing, but this was harmless).	2012-09-06 14:12:03 +02:00
Willy Tarreau	184636e3e7	BUG: tcp: close socket fd upon connect error When the data layer fails to initialize (eg: out of memory for SSL), we must close the socket fd we just allocated.	2012-09-06 14:04:41 +02:00
Willy Tarreau	403edff4b8	MEDIUM: config: implement maxsslconn in the global section SSL connections take a huge amount of memory, and unfortunately openssl does not check malloc() returns and easily segfaults when too many connections are used. The only solution against this is to provide a global maxsslconn setting to reject SSL connections above the limit in order to avoid reaching unsafe limits.	2012-09-06 12:10:43 +02:00
Willy Tarreau	cbaaec475c	MINOR: session: do not send an HTTP/500 error on SSL sockets If a session fails its initialization, we don't want to send HTTP/500 over the socket if it's not a raw data layer.	2012-09-06 11:32:07 +02:00
Willy Tarreau	32368ceba4	MEDIUM: config: support per-listener backlog and maxconn With SSL, connections are much more expensive, so it is important to be able to limit concurrent connections per listener in order to limit the memory usage.	2012-09-06 11:10:55 +02:00
Willy Tarreau	269ab318ef	BUG/MEDIUM: workaround an eglibc bug which truncates the pidfiles when nbproc > 1 Thomas Heil reported that when using nbproc > 1, his pidfiles were regularly truncated. The issue could be tracked down to the presence of a call to lseek(pidfile, 0, SEEK_SET) just before the close() call in the children, resulting in the file being truncated by the children while the parent was feeding it. This unexpected lseek() is transparently performed by fclose(). Since there is no way to have the file automatically closed during the fork, the only solution is to bypass the libc and use open/write/close instead of fprintf() and fclose(). The issue was observed on eglibc 2.15.	2012-09-05 15:04:20 +02:00
Willy Tarreau	ee2e3a4027	BUILD: ssl: use MAP_ANON instead of MAP_ANONYMOUS FreeBSD uses the former, Linux uses the latter but generally also defines the former as an alias of the latter. Just checked on other OSes and AIX defines both. So better use MAP_ANON which seems to be more commonly defined.	2012-09-04 15:45:21 +02:00
David BERARD	e566ecbea8	MEDIUM: ssl: add support for prefer-server-ciphers option I wrote a small path to add the SSL_OP_CIPHER_SERVER_PREFERENCE OpenSSL option to frontend, if the 'prefer-server-ciphers' keyword is set. Example : bind 10.11.12.13 ssl /etc/haproxy/ssl/cert.pem ciphers RC4:HIGH:!aNULL:!MD5 prefer-server-ciphers This option mitigate the effect of the BEAST Attack (as I understand), and it equivalent to : - Apache HTTPd SSLHonorCipherOrder option. - Nginx ssl_prefer_server_ciphers option. [WT: added a test for the support of the option]	2012-09-04 15:35:32 +02:00
Willy Tarreau	ff9f7698fc	BUILD: fix build error without SSL (ssl_cert) One last-minute optimization broke the build without SSL support. Move ssl_cert out of the #ifdef/#endif and it's OK.	2012-09-04 15:13:20 +02:00
Willy Tarreau	18b2059a75	BUILD: ssl: fix shctx build on RHEL with futex On RHEL/CentOS, linux/futex.h uses an u32 type which is never declared anywhere. Let's set it with a #define in order to fix the issue without causing conflicts with possible typedefs on other platforms.	2012-09-04 12:26:26 +02:00
Willy Tarreau	783f25800c	BUILD: http: rename error_message http_error_message to fix conflicts on RHEL Duncan Hall reported a build issue on CentOS where error_message conflicts with another system declaration when SSL is enabled. Rename the function.	2012-09-04 12:19:04 +02:00
Willy Tarreau	0573747da0	BUG: ssl: mark the connection as waiting for an SSL connection during the handshake The WAIT_L6_CONN was designed especially to ensure that the connection was not marked ready before the SSL layer was OK, but we forgot to set the flag, resulting in a rejected handshake when ssl was combined with accept-proxy because accept-proxy would validate the connection alone and the SSL handshake would then believe in a client-initiated reneg and kill it.	2012-09-04 08:03:39 +02:00
Willy Tarreau	c230b8bfb6	MEDIUM: config: add "nosslv3" and "notlsv1" on bind and server lines This is aimed at disabling SSLv3 and TLSv1 respectively. SSLv2 is always disabled. This can be used in some situations where one version looks more suitable than the other.	2012-09-03 23:55:16 +02:00
Willy Tarreau	d7aacbffcb	MEDIUM: config: add a "ciphers" keyword to set SSL cipher suites This is supported for both servers and listeners. The cipher suite simply follows the "ciphers" keyword.	2012-09-03 23:43:25 +02:00
Emeric Brun	fc32acafcd	MINOR: ssl add global setting tune.sslcachesize to set SSL session cache size. This new global setting allows the user to change the SSL cache size in number of sessions. It defaults to 20000.	2012-09-03 22:36:33 +02:00
Emeric Brun	aa35f1fad7	MEDIUM: ssl: replace OpenSSL's session cache with the shared cache OpenSSL's session cache is now totally disabled and we use our own implementation instead.	2012-09-03 22:36:33 +02:00
Emeric Brun	3e541d1c03	MEDIUM: ssl: add shared memory session cache implementation. This SSL session cache was developped at Exceliance and is the same that was proposed for stunnel and stud. It makes use of a shared memory area between the processes so that sessions can be handled by any process. It is only useful when haproxy runs with nbproc > 1, but it does not hurt performance at all with nbproc = 1. The aim is to totally replace OpenSSL's internal cache. The cache is optimized for Linux >= 2.6 and specifically for x86 platforms. On Linux/x86, it makes use of futexes for inter-process locking, with some x86 assembly for the locked instructions. On other architectures, GCC builtins are used instead, which are available starting from gcc 4.1. On other operating systems, the locks fall back to pthread mutexes so libpthread is automatically linked. It is not recommended since pthreads are much slower than futexes. The lib is only linked if SSL is enabled.	2012-09-03 22:36:33 +02:00
Willy Tarreau	fbac6638c1	MINOR: ssl: disable TCP quick-ack by default on SSL listeners Since the SSL handshake involves an immediate reply from the server to the client, there's no point responding with a quick-ack before sending the data, so disable quick-ack by default, just as it is done for HTTP. This shows a 2-2.5% transaction rate increase on a dual-core atom.	2012-09-03 22:36:27 +02:00
Emeric Brun	e1f38dbb44	MEDIUM: ssl: protect against client-initiated renegociation CVE-2009-3555 suggests that client-initiated renegociation should be prevented in the middle of data. The workaround here consists in having the SSL layer notify our callback about a handshake occurring, which in turn causes the connection to be marked in the error state if it was already considered established (which means if a previous handshake was completed). The result is that the connection with the client is immediately aborted and any pending data are dropped.	2012-09-03 22:03:17 +02:00
Emeric Brun	01f8e2f61b	MEDIUM: config: add support for the 'ssl' option on 'server' lines This option currently takes no option and simply turns SSL on for all connections going to the server. It is likely that more options will be needed in the future.	2012-09-03 22:02:21 +02:00
Emeric Brun	6e159299f1	MEDIUM: config: add the 'ssl' keyword on 'bind' lines "bind" now supports "ssl" followed by a PEM cert+key file name.	2012-09-03 20:49:14 +02:00
Emeric Brun	4659195e31	MEDIUM: ssl: add new files ssl_sock.[ch] to provide the SSL data layer This data layer supports socket-to-buffer and buffer-to-socket operations. No sock-to-pipe nor pipe-to-sock functions are provided, since splicing does not provide any benefit with data transformation. At best it could save a memcpy() and avoid keeping a buffer allocated but that does not seem very useful. An init function and a close function are provided because the SSL context needs to be allocated/freed. A data-layer shutw() function is also provided because upon successful shutdown, we want to store the SSL context in the cache in order to reuse it for future connections and avoid a new key generation. The handshake function is directly called from the connection handler. At this point it is not certain whether this will remain this way or if a new ->handshake callback will be added to the data layer so that the connection handler doesn't care about SSL. The sock-to-buf and buf-to-sock functions are all capable of enabling the SSL handshake at any time. This also implies polling in the opposite direction to what was expected. The upper layers must take that into account (it is OK right now with the stream interface).	2012-09-03 20:49:14 +02:00
Willy Tarreau	dd2f85eb3b	CLEANUP: includes: fix includes for a number of users of fd.h It appears that fd.h includes a number of unneeded files and was included from standard.h, and as such served as an intermediary to provide almost everything to everyone. By removing its useless includes, a long dependency chain broke but could easily be fixed.	2012-09-03 20:49:14 +02:00
Willy Tarreau	45dab73788	CLEANUP: fdtab: flatten the struct and merge the spec struct with the rest The "spec" sub-struct was using 8 bytes for only 5 needed. There is no reason to keep it as a struct, it doesn't bring any value. By flattening it, we can merge the single byte with the next single byte, resulting in an immediate saving of 4 bytes (20%). Interestingly, tests have shown a steady performance gain of 0.6% after this change, which can possibly be attributed to a more cache-line friendly struct.	2012-09-03 20:49:14 +02:00
Willy Tarreau	40ff59d820	CLEANUP: fd: remove fdtab->flags These flags were added for TCP_CORK. They were only set at various places but never checked by any user since TCP_CORK was replaced with MSG_MORE. Simply get rid of this now.	2012-09-03 20:49:14 +02:00
Willy Tarreau	34ffd77648	MAJOR: stream_interface: continue to update data polling flags during handshakes Since data and socket polling flags were split, it became possible to update data flags even during handshakes. In fact this is very important otherwise it is not possible to poll for writes if some data are to be forwarded during a handshake (eg: data received during an SSL connect).	2012-09-03 20:49:13 +02:00
Willy Tarreau	d9de7ca3d0	MEDIUM: connection: avoid calling handshakes when polling is required If a data handler suddenly switches to a handshake mode and detects the need for polling in either direction, we don't want to loop again through the handshake handlers because we know we won't be able to do anything. Similarly, we don't want to call again the data handlers after a loop through the handshake handlers if polling is required. No performance change was observed, it might only be observed during high rate SSL renegociation.	2012-09-03 20:47:35 +02:00
Willy Tarreau	56a77e5933	MEDIUM: connection: complete the polling cleanups I/O handlers now all use __conn_{sock,data}_{stop,poll,want}_* instead of returning dummy flags. The code has become slightly simpler because some tricks such as the MIN_RET_FOR_READ_LOOP are not needed anymore, and the data handlers which switch to a handshake handler do not need to disable themselves anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	f8deb0cfa8	MEDIUM: connection: only call tcp_connect_probe when nothing was attempted yet It was observed that after a failed send() on EAGAIN, a second connect() would still be attempted in tcp_connect_probe() because there was no way to know that a send() had failed. By checking the WANT_WR status flag, we know if a previous write attempt failed on EAGAIN, so we don't try to connect again if we know this has already failed. With this simple change, the second connect() has disappeared.	2012-09-03 20:47:35 +02:00
Willy Tarreau	e9dfa79a75	MAJOR: connection: rearrange the polling flags. Polling flags were set for data and sock layer, but while this does make sense for the ENA flag, it does not for the POL flag which translates the detection of an EAGAIN condition. So now we remove the {DATA,SOCK}_POL* flags and instead introduce two new layer-independant flags (WANT_RD and WANT_WR). These flags are only set when an EAGAIN is encountered so that polling can be enabled. In order for these flags to have any meaning they are not persistent and have to be cleared by the connection handler before calling the I/O and data callbacks. For this reason, changes detection has been slightly improved. Instead of comparing the WANT_* flags with CURR_*_POL, we only check if the ENA status changes, or if the polling appears, since we don't want to detect the useless poll to ena transition. Tests show that this has eliminated one useless call to __fd_clr(). Finally the conn_set_polling() function which was becoming complex and required complex operations from the caller was split in two and replaced its two only callers (conn_update_data_polling and conn_update_sock_polling). The two functions are now much smaller due to the less complex conditions. Note that it would be possible to re-merge them and only pass a mask but this does not appear much interesting.	2012-09-03 20:47:35 +02:00
Willy Tarreau	74172ff9c3	CLEANUP: frontend: remove the old proxy protocol decoder This one used to rely on a stream analyser which was inappropriate. It's not used anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	22cda21ad5	MAJOR: connection: make the PROXY decoder a handshake handler The PROXY protocol is now decoded in the connection before other handshakes. This means that it may be extracted from a TCP stream before SSL is decoded from this stream.	2012-09-03 20:47:35 +02:00
Willy Tarreau	2542b53b19	MAJOR: session: introduce embryonic sessions When an incoming connection request is accepted, a connection structure is needed to store its state. However we don't want to fully initialize a session until the data layer is about to be ready. As long as the connection is physically stored into the session, it's not easy to split both allocations. As such, we only initialize the minimum requirements of a session, which results in what we call an embryonic session. Then once the data layer is ready, we can complete the function's initialization. Doing so avoids buffers allocation and ensures that a session only sees ready connections. The frontend's client timeout is used as the handshake timeout. It is likely that another timeout will be used in the future.	2012-09-03 20:47:35 +02:00
Willy Tarreau	15678efc45	MEDIUM: connection: add an ->init function to data layer SSL need to initialize the data layer before proceeding with data. At the moment, this data layer is automatically initialized from itself, which will not be possible once we extract connection from sessions since we'll only create the data layer once the handshake is finished. So let's have the application layer initialize the data layer before using it.	2012-09-03 20:47:34 +02:00
Willy Tarreau	64ee491309	MINOR: tcp: replace tcp_src_to_stktable_key with addr_to_stktable_key Make it more obvious that this function does not depend on any knowledge of the session. This is important to plan for TCP rules that can run on connection without any initialized session yet.	2012-09-03 20:47:34 +02:00
Willy Tarreau	14f8e86da5	MEDIUM: proto_tcp: remove any dependence on stream_interface The last uses of the stream interfaces were in tcp_connect_server() and could easily and more appropriately be moved to its callers, si_connect() and connect_server(), making a lot more sense. Now the function should theorically be usable for health checks. It also appears more obvious that the file is split into two distinct parts : - the protocol layer used at the connection level - the tcp analysers executing tcp-* rules and their samples/acls.	2012-09-03 20:47:34 +02:00
Willy Tarreau	93b0f4f6c6	MEDIUM: stream_interface: remove CAP_SPLTCP/CAP_SPLICE flags These ones are implicitly handled by the connection's data layer, no need to rely on them anymore and reaching them maintains undesired dependences on stream-interface.	2012-09-03 20:47:34 +02:00
Willy Tarreau	986a9d2d12	MAJOR: connection: move the addr field from the stream_interface We need to have the source and destination addresses in the connection. They were lying in the stream interface so let's move them. The flags SI_FL_FROM_SET and SI_FL_TO_SET have been moved as well. It's worth noting that tcp_connect_server() almost does not use the stream interface anymore except for a few flags. It has been identified that once we detach the connection from the SI, it will probably be needed to keep a copy of the server-side addresses in the SI just for logging purposes. This has not been implemented right now though.	2012-09-03 20:47:34 +02:00
Willy Tarreau	3cefd521fa	REORG: connection: move the target pointer from si to connection The target is per connection and is directly used by the connection, so we need it there. It's not needed anymore in the SI however.	2012-09-03 20:47:34 +02:00
Willy Tarreau	8263d2b259	CLEANUP: channel: use "channel" instead of "buffer" in function names This is a massive rename of most functions which should make use of the word "channel" instead of the word "buffer" in their names. In concerns the following ones (new names) : unsigned long long channel_forward(struct channel buf, unsigned long long bytes); static inline void channel_init(struct channel buf) static inline int channel_input_closed(struct channel buf) static inline int channel_output_closed(struct channel buf) static inline void channel_check_timeouts(struct channel b) static inline void channel_erase(struct channel buf) static inline void channel_shutr_now(struct channel buf) static inline void channel_shutw_now(struct channel buf) static inline void channel_abort(struct channel buf) static inline void channel_stop_hijacker(struct channel buf) static inline void channel_auto_connect(struct channel buf) static inline void channel_dont_connect(struct channel buf) static inline void channel_auto_close(struct channel buf) static inline void channel_dont_close(struct channel buf) static inline void channel_auto_read(struct channel buf) static inline void channel_dont_read(struct channel buf) unsigned long long channel_forward(struct channel *buf, unsigned long long bytes) Some functions provided by channel.[ch] have kept their "buffer" name because they are really designed to act on the buffer according to some information gathered from the channel. They have been moved together to the same place in the file for better readability but they were not changed at all. The "buffer" memory pool was also renamed "channel".	2012-09-03 20:47:33 +02:00
Willy Tarreau	03cdb7c678	CLEANUP: channel: usr CF_/CHN_ prefixes instead of BF_/BUF_ Get rid of these confusing BF_* flags. Now channel naming should clearly be used everywhere appropriate. No code was changed, only a renaming was performed. The comments about channel operations was updated.	2012-09-03 20:47:33 +02:00
Willy Tarreau	af81935b82	REORG: channel: move buffer_{replace,insert_line}* to buffer.{c,h} These functions do not depend on the channel flags anymore thus they're much better suited to be used on plain buffers. Move them from channel to buffer.	2012-09-03 20:47:33 +02:00
Willy Tarreau	f941cf2ef2	MAJOR: channel: remove the BF_FULL flag This is similar to the recent removal of BF_OUT_EMPTY. This flag was very problematic because it relies on permanently changing information such as the to_forward value, so it had to be updated upon every change to the buffers. Previous patch already got rid of its users. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's input and output is more reliable than checking the flag's replacement.	2012-09-03 20:47:33 +02:00
Willy Tarreau	3bf1b2b816	MAJOR: channel: stop relying on BF_FULL to take action This flag is quite complex to get right and updating it everywhere is a major pain, especially since the buffer/channel split. This is the first step of getting rid of it. Instead now it's dynamically computed whenever needed.	2012-09-03 20:47:33 +02:00
Willy Tarreau	ad1cc3df9c	MINOR: channel: rename bi_full to channel_full as it checks the whole channel Since the function takes care of the forward count and involves more than buffer knowledge, rename it.	2012-09-03 20:47:32 +02:00
Willy Tarreau	a75bcef867	REORG: buffer: move buffer_flush, b_adv and b_rew to buffer.h These one now operate over real buffers, not channels anymore.	2012-09-03 20:47:32 +02:00
Willy Tarreau	8e21bb9e52	MAJOR: channel: remove the BF_OUT_EMPTY flag This flag was very problematic because it was composite in that both changes to the pipe or to the buffer had to cause this flag to be updated, which is not always simple (eg: there may not even be a channel attached to a buffer at all). There were not that many users of this flags, mostly setters. So the flag got replaced with a macro which reports whether the channel is empty or not, by checking both the pipe and the buffer. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's output size is more useful than checking the flag's replacement.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c7e4238df0	REORG: buffers: split buffers into chunk,buffer,channel Many parts of the channel definition still make use of the "buffer" word.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c578891112	CLEANUP: connection: split sock_ops into data_ops, app_cp and si_ops Some parts of the sock_ops structure were only used by the stream interface and have been moved into si_ops. Some of them were callbacks to the stream interface from the connection and have been moved into app_cp as they're the application seen from the connection (later, health-checks will need to use them). The rest has moved to data_ops. Normally at this point the connection could live without knowing about stream interfaces at all.	2012-09-03 20:47:31 +02:00
Willy Tarreau	62266dba88	MEDIUM: stream-interface: don't remove WAIT_DATA when a handshake is in progress This doesn't make sense and will only require that it's enabled again.	2012-09-03 20:47:31 +02:00
Willy Tarreau	2c052083e6	MAJOR: stream-interface: fix splice not to call chk_snd by itself In recent splice fixes we made splice call chk_snd, but this was due to inappropriate checks in conn_notify_si() which prevented the chk_snd() call from being performed. Now that this has been fixed, remove this duplicate code.	2012-09-03 20:47:31 +02:00
Willy Tarreau	f16723e4ca	MAJOR: stream-interface: don't commit polling changes in every callback It's more efficient to centralize polling changes, which is already done in the connection handler. So now all I/O callbacks just change flags and rely on the connection handler for the commit. The special case of the send loop is handled by the chk_snd() function which does an update at the end.	2012-09-03 20:47:31 +02:00
Willy Tarreau	a1a74744a4	MEDIUM: proxy-proto: don't use buffer flags in conn_si_send_proxy() These ones should only be handled by the stream interface at the end of the handshake now. Similarly a number of information are now taken at the connection level rather than at the data level (eg: shutdown). Fast polling updates have been used instead of slow ones since the function is only called by the connection handler.	2012-09-03 20:47:31 +02:00
Willy Tarreau	44b5dc6f85	MAJOR: stream-interface: make conn_notify_si() more robust This function was relying on the result of file descriptor polling which is inappropriate as it may be subject to race conditions during handshakes. Make it more robust by relying solely on buffer activity.	2012-09-03 20:47:31 +02:00
Willy Tarreau	96199b1016	MAJOR: stream-interface: restore splicing mechanism The splicing is now provided by the data-layer rcv_pipe/snd_pipe functions which in turn are called by the stream interface's recv and send callbacks. The presence of the rcv_pipe/snd_pipe functions is used to attest support for splicing at the data layer. It looks like the stream-interface's SI_FL_CAP_SPLICE flag does not make sense anymore as it's used as a proxy for the pointers above. It also appears that we call chk_snd() from the recv callback and then try to call it again in update_conn(). It is very likely that this last function will progressively slip into the recv/send callbacks in order to avoid duplicate check code. The code works right now with and without splicing. Only raw_sock provides support for it and it is automatically selected when the various splice options are set. However it looks like splice-auto doesn't enable it, which possibly means that the streamer detection code does not work anymore, or that it's only called at a time where it's too late to enable splicing (in process_session).	2012-09-03 20:47:31 +02:00
Willy Tarreau	5368d80ede	MAJOR: connection: split the send call into connection and stream interface Similar to what was done on the receive path, the data layer now provides only an snd_buf() callback that is iterated over by the stream interface's si_conn_send_loop() function. The data layer now has no knowledge about channels nor stream interfaces. The splice() code still need to be ported as it currently is disabled.	2012-09-03 20:47:31 +02:00
Willy Tarreau	ce323dea14	REORG: stream-interface: move sock_raw_read() to si_conn_recv_cb() The recv function is now generic and is usable to iterate any connection-to-buf reading function from a stream interface. So let's move it to stream-interface.	2012-09-03 20:47:30 +02:00
Willy Tarreau	1fe6bc335a	MINOR: stream-interface: add an rcv_buf callback to sock_ops This one is to be used by the read I/O handlers.	2012-09-03 20:47:30 +02:00
Willy Tarreau	af978c4170	MAJOR: raw_sock: temporarily disable splicing It's too hard to convert splicing to connection+buf for now, so let's disable it in order to make progress.	2012-09-03 20:47:30 +02:00
Willy Tarreau	2ba4465086	MAJOR: raw_sock: extract raw_sock_to_buf() from raw_sock_read() This is the start of the stream connection iterator which calls the data-layer reader. This still looks a bit tricky but is OK. Splicing is not handled at all at the moment.	2012-09-03 20:47:30 +02:00
Willy Tarreau	75bf2c925f	REORG: sock_raw: rename the files raw_sock* The "raw_sock" prefix will be more convenient for naming functions as it will be prefixed with the data layer and suffixed with the data direction. So let's rename the files now to avoid any further confusion. The #include directive was also removed from a number of files which do not need it anymore.	2012-09-02 21:54:56 +02:00
Willy Tarreau	572bf9095d	REORG/MAJOR: extract "struct buffer" from "struct channel" At the moment, the struct is still embedded into the struct channel, but all the functions have been updated to use struct buffer only when possible, otherwise struct channel. Some functions would likely need to be splitted between a buffer-layer primitive and a channel-layer function. Later the buffer should become a pointer in the struct buffer, but doing so requires a few changes to the buffer allocation calls.	2012-09-02 21:54:56 +02:00
Willy Tarreau	7421efb85f	REORG/MAJOR: use "struct channel" instead of "struct buffer" This is a massive rename. We'll then split channel and buffer. This change needs a lot of cleanups. At many locations, the parameter or variable is still called "buf" which will become ambiguous. Also, the "struct channel" is still defined in buffers.h.	2012-09-02 21:54:55 +02:00
Willy Tarreau	9bf9c14c12	MEDIUM: stream-interface: provide a generic stream_sock_read0() function This function is used by the data layer when a zero has been read over a connection. At the moment it only handles sockets and nothing else. Once the complete split is done between buffers and stream interfaces, it should become possible to work regardless on the connection type.	2012-09-02 21:54:55 +02:00
Willy Tarreau	eecf6ca68a	MEDIUM: stream-interface: provide a generic si_conn_send_cb callback The connection send() callback is supposed to be generic for a stream-interface, and consists in calling the lower layer snd_buf function. Move this function to the stream interface and remove the sock-raw and sock-ssl clones.	2012-09-02 21:54:55 +02:00
Willy Tarreau	de5722c302	MEDIUM: stream-interface: provide a generic stream_int_chk_snd_conn() function This one can be used by both sock_raw and sock_ssl instead of each having their own.	2012-09-02 21:54:55 +02:00
Willy Tarreau	fae4499e36	MEDIUM: stream-interface: add a snd_buf() callback to sock_ops This callback is used to send data from the buffer to the socket. It is the old write_loop() call of the data layer which is used both by the ->write() callback and the ->chk_snd() function. The reason for having it as a pointer is that it's the only remaining part which causes the write and chk_snd() functions to be different between raw and ssl.	2012-09-02 21:54:18 +02:00
Willy Tarreau	46a8d925c2	MEDIUM: stream-interface: offer a generic chk_rcv function for connections sock_raw and sock_ssl use a pretty generic chk_rcv function, so let's move this function to the stream_interface and remove specific functions. Later we might have a single chk_rcv function.	2012-09-02 21:54:18 +02:00
Willy Tarreau	100c467120	MEDIUM: stream_interface: offer a generic function for connection updates We need to have a generic function to be called by upper layers when buffer flags have been updated (the si->update function). At the moment, both sock_raw and sock_ssl had their own which basically was a copy-paste. Since these functions are only used to update stream interface flags, it is logical to have them handled by the stream interface code. This allowed us to remove the stream_interface-specific update function from sock_raw and sock_ssl which now use the generic code. The stream_sock_update_conn callback has also been more appropriately renamed conn_notify_si() since it's meant to be called by lower layers to notify the SI and possibly upper layers about incoming changes.	2012-09-02 21:54:18 +02:00
Willy Tarreau	26f44d1e91	MINOR: fd: get rid of FD_WAIT_* These flags were used to ease a transition which has been completed, so they're not needed anymore. Get rid of them.	2012-09-02 21:53:12 +02:00
Willy Tarreau	3267d36c84	MEDIUM: checks: don't use FD_WAIT_* anymore make use of fd_poll_* instead in preparation for a later adoption by the connection subsystem.	2012-09-02 21:53:12 +02:00
Willy Tarreau	afad0e0f80	MAJOR: make use of conn_{data\|sock}_{poll\|stop\|want}* in connection handlers This is a second attempt at getting rid of FD_WAIT_. Now the situation is much better since native I/O handlers can directly manipulate the FD using fd_{poll\|want\|stop}_ and the connection handlers manipulate connection-level flags using the conn_{data\|sock}_* equivalent. Proceeding this way ensures that the connection flags always reflect the reality even after data<->handshake switches.	2012-09-02 21:53:12 +02:00
Willy Tarreau	f9dabecd03	MEDIUM: connection: make use of the new polling functions Now the connection handler, the handshake callbacks and the I/O callbacks make use of the connection-layer polling functions to enable or disable polling on a file descriptor. Some changes still need to be done to avoid using the FD_WAIT_* constants.	2012-09-02 21:53:11 +02:00
Willy Tarreau	b5e2cbdcc8	MEDIUM: connection: add definitions for dual polling mechanisms The conflicts we're facing with polling is that handshake handlers have precedence over data handlers and may change the polling requirements regardless of what is expected by the data layer. This causes issues such as missed events. The real need is to have three polling levels : - the "current" one, which is effective at any moment - the data one, which reflects what the data layer asks for - the sock one, which reflects what the socket layer asks for Depending on whether a handshake is in progress or not, either one of the last two will replace the current one, and the change will be propagated to the lower layers. At the moment, the shutdown status is not considered, and only handshakes are used to decide which layer to chose. This will probably change.	2012-09-02 21:53:11 +02:00
Willy Tarreau	babd05a6c6	MEDIUM: fd: add fd_poll_{recv,send} for use when explicit polling is required The old EV_FD_SET() macro was confusing, as it would enable receipt but there was no way to indicate that EAGAIN was received, hence the recently added FD_WAIT_* flags. They're not enough as we're still facing a conflict between EV_FD_* and FD_WAIT_*. So let's offer I/O functions what they need to explicitly request polling.	2012-09-02 21:53:11 +02:00
Willy Tarreau	49b046dddf	MAJOR: fd: replace all EV_FD_* macros with new fd__ inline calls These functions have a more explicity meaning and will offer provisions for explicit polling. EV_FD_ISSET() has been left for now as it is still in use in checks.	2012-09-02 21:53:11 +02:00
Willy Tarreau	4a36b56909	MAJOR: stream_int: use a common stream_int_shut() functions regardless of the data layer Up to now, we had to use a shutr/shutw interface per data layer, which basically means 3 distinct functions when we include SSL : - generic stream_interface - sock_raw - sock_ssl With this change, the code located in the stream_interface manages all the stream_interface and buffer updates, and calls the data layer hooks when needed. At the moment, the socket layer hook had been implicitly considered as being a regular socket, so the si_shut() functions call the normal shutdown() and EV_FD_CLR() functions on the fd if a socket layer is defined. This may change in the future. The stream_int_shut*() functions don't call EV_FD_CLR() so that they can later be embedded in lower layers. Thus, the si->data->shutr() is not called anymore and si->data->shutw() is called to close the data layer only (eg: only for SSL). Proceeding like this is very important because it's the only way to be able not to rely on these functions when called from the connection handlers, and call the data layers' instead.	2012-09-02 21:53:10 +02:00
Willy Tarreau	3d8903fae0	MEDIUM: sock_raw: introduce a read0 callback that is different from shutr This one is supposed to be called by the lower layer upon receiving a shutr notification, which is different from the call performed by the upper layer. Specifically, this function will ultimately not call EV_FD_* but will just manipulate event flags instead. The function also does not call shutw anymore and instead performs the necessary work. Splitting it into si-specific part and data-specific parts will not be easy.	2012-09-02 21:53:10 +02:00
Willy Tarreau	8b117082bc	REORG: connection: replace si_data_close() with conn_data_close() This close function only applies to connection-specific parts and the stream-interface entry may soon disappear. Move this to the connection instead.	2012-09-02 21:53:10 +02:00
Willy Tarreau	3438f5dce1	MINOR: sock_raw: move calls to si_data_close upper Some users of si_data_close() need to have the fd still open, so we must move the call before fd_delete().	2012-09-02 21:53:10 +02:00
Willy Tarreau	3788e4c874	MEDIUM: fd: remove the EV_FD_COND_* primitives These primitives were initially introduced so that callers were able to conditionally set/disable polling on a file descriptor and check in return what the state was. It's been long since we last had an "if" on this, and all pollers' functions were the same for cond_* and their systematic counter parts, except that this required a check and a specific return value that are not always necessary. So let's simplify the FD API by removing this now unused distinction and by making all specific functions return void.	2012-09-02 21:53:10 +02:00
Willy Tarreau	c76ae33bfc	MAJOR: connection: call data layer handshakes from the handler Handshakes is not called anymore from the data handlers, they're only called from the connection handler when their flag is set. Also, this move has uncovered an issue with the stream interface notifier : it doesn't consider the FD_WAIT_* flags possibly set by the handshake handlers. This will result in a stuck handshake when no data is in the output buffer. In order to cover this, for now we'll perform the EV_FD_SET in the SSL handshake function, but this needs to be addressed separately from the stream interface operations.	2012-09-02 21:53:09 +02:00
Willy Tarreau	0b0c097a3a	MINOR: rearrange tcp_connect_probe() and fix wrong return codes Sometimes we returned the need for polling while it was not needed. Remove some of the spaghetti in the function.	2012-09-02 21:53:09 +02:00
Willy Tarreau	8f8c92fe93	MAJOR: connection: add a new CO_FL_CONNECTED flag This new flag is used to indicate that the connection was already connected. It can be used by I/O handlers to know that a connection has just completed. It is used by stream_sock_update_conn(), allowing the sock_opt handlers not to manipulate the SI timeout nor the BF_WRITE_NULL flag anymore.	2012-09-02 21:53:09 +02:00
Willy Tarreau	3c55ec2020	MEDIUM: stream_interface: centralize the SI_FL_ERR management It's better to have only stream_sock_update_conn() handle the conversion of the CO_FL_ERROR flag to SI_FL_ERR than having it in each and every I/O callback.	2012-09-02 21:53:09 +02:00
Willy Tarreau	239d7189fc	MEDIUM: stream_interface: pass connection instead of fd in sock_ops The sock_ops I/O callbacks made use of an FD till now. This has become inappropriate and the struct connection is much more useful. It also fixes the race condition introduced by previous change.	2012-09-02 21:53:08 +02:00
Willy Tarreau	fd31e53139	MAJOR: remove the stream interface and task management code from sock_* The socket data layer code must only focus on moving data between a socket and a buffer. We need a special stream interface handler to update the stream interface and the file descriptor status. At the moment the code works but suffers from a race condition caused by its API : the read/write callbacks still make use of the fd instead of using the connection. And when a double shutdown is performed, a call to ->write() after ->read() processed an error results in dereferencing a NULL fdtab[]->owner. This is only a temporary issue which doesn't need to be fixed now since this will automatically go away when the functions change to use the connection instead.	2012-09-02 21:53:08 +02:00
Willy Tarreau	076be25ab8	CLEANUP: remove the now unused fdtab direct I/O callbacks They were all left to NULL since last commit so we can safely remove them all now and remove the temporary dual polling logic in pollers.	2012-09-02 21:51:29 +02:00
Willy Tarreau	2da156fe5e	MAJOR: tcp: remove the specific I/O callbacks for TCP connection probes Use a single tcp_connect_probe() instead of tcp_connect_write() and tcp_connect_read(). We call this one only when no data layer function have been processed, so this is a fallback to test for completion of a connection attempt. With this done, we don't have the need for any direct I/O callback anymore. The function still relies on ->write() to wake the stream interface up, so it's not finished.	2012-09-02 21:51:29 +02:00
Willy Tarreau	2c6be84b3a	MEDIUM: connection: extract the send_proxy callback from proto_tcp This handshake handler must be independant, so move it away from proto_tcp. It has a dedicated connection flag. It is tested before I/O handlers and automatically removes the CO_FL_WAIT_L4_CONN flag upon success. It also sets the BF_WRITE_NULL flag on the stream interface and stops the SI timeout. However it does not perform the task_wakeup(), and relies on the data handler to do so for now. The SI wakeup will have to be moved elsewhere anyway.	2012-09-02 21:51:28 +02:00
Willy Tarreau	61ace1b2ca	MEDIUM: connection: remove the FD_POLL_* flags only once It's inappropriate to remove FD_POLL_IN and FD_POLL_OUT in the IO callback handlers, first because they shouldn't care about this, and second because it will make it harder to chain multiple callers. So let's flush these flags only once for all in the connection handler. Right now, the HUP and ERR flags are still flushed in each IO handler to avoid multiple calls. This will probably have to be fixed later.	2012-09-02 21:51:28 +02:00
Willy Tarreau	8018471f44	MINOR: fd: make fdtab->owner a connection and not a stream_interface anymore It is more convenient with a connection here and will abstract stream_interface more easily.	2012-09-02 21:51:28 +02:00
Willy Tarreau	d2274c6536	MAJOR: connection: replace direct I/O callbacks with the connection callback Almost all direct I/O callbacks have been changed to use the connection callback instead. Only the TCP connection validation remains.	2012-09-02 21:51:28 +02:00
Willy Tarreau	59f98393bb	MINOR: connection: add a handler for fd-based connections This connection handler will be used as an I/O handler for events detected on a file descriptor. It is not used yet.	2012-09-02 21:51:28 +02:00
Willy Tarreau	aece46a44d	MEDIUM: protocols: use the generic I/O callback for accept callbacks This one is used only on read events, and it was easy to convert to use the new I/O callback.	2012-09-02 21:51:27 +02:00
Willy Tarreau	20bea42a95	MEDIUM: checks: make use of fdtab->iocb instead of cb[] Use the single I/O callback to handle the checks. This should soon be replaced by the common connection handler.	2012-09-02 21:51:27 +02:00
Willy Tarreau	9845e75d23	MEDIUM: polling: prepare to call the iocb() function when defined. We will need this to centralize I/O callbacks. Nobody sets it right now so the code should have no impact.	2012-09-02 21:51:27 +02:00
Willy Tarreau	4e6049e553	MINOR: fd: add a new I/O handler to fdtab This one will eventually replace both cb[] handlers. At the moment it is not used yet.	2012-09-02 21:51:27 +02:00
Willy Tarreau	505e34a36d	MAJOR: get rid of fdtab[].state and use connection->flags instead fdtab[].state was only used to know whether a connection was in progress or an error was encountered. Instead we now use connection->flags to store a flag for both. This way, connection management will be able to update the connection status on I/O.	2012-09-02 21:51:26 +02:00
Willy Tarreau	da92e2fb61	REORG/MINOR: checks: put a struct connection into the server This will be used to handle the connection state once it goes away from fdtab. There is no functional change at the moment.	2012-09-02 21:51:26 +02:00
Willy Tarreau	ed8f614078	REORG/MEDIUM: fd: get rid of FD_STLISTEN This state was only used so that ev_sepoll did not match FD_STERROR, which changed in previous patch. We can now safely remove this state.	2012-09-02 21:51:25 +02:00
Willy Tarreau	5d526b7215	REORG/MEDIUM: fd: remove checks for FD_STERROR in ev_sepoll This test is present only in this poller as an optimization, but this optimization adds some complexity to remove fdtab[].state. Let's get rid of it for now.	2012-09-02 21:51:25 +02:00
Willy Tarreau	db3b32610f	REORG/MEDIUM: fd: remove FD_STCLOSE from struct fdtab In an attempt to get rid of fdtab[].state, and to move the relevant parts to the connection struct, we remove the FD_STCLOSE state which can easily be deduced from the <owner> pointer as there is a 1:1 match.	2012-09-02 21:51:25 +02:00
Jamie Gloudon	801a0a353a	DOC: fix name for "option independant-streams" The correct spelling is "independent", not "independant". This patch fixes the doc and the configuration parser to accept the correct form. The config parser still allows the old naming for backwards compatibility.	2012-09-02 21:51:07 +02:00
Willy Tarreau	654694e189	MEDIUM: stats/cli: add support for "set table key" to enter values This is used to enter values for stick tables. The most likely usage is to set gpc0 for a specific IP address in order to block traffic for abusers without having to reload. Since all data types are supported, other usages are possible (eg: replace a users's assigned server).	2012-09-02 21:51:07 +02:00
Willy Tarreau	dec9814e74	MINOR: stats/cli: add plans to support more stick-table actions Right now we only support show/clear on a table. In order to introduce the "set" keyword we need to get rid of the "show" boolean arg. There is no functional change up to this commit.	2012-09-02 21:51:06 +02:00
William Lallemand	1dc00efedc	BUG/MINOR: to_log erased with unique-id-format curproxy->to_log was reset to LW_INIT when using unique-id-format, so logs looked like option logasap	2012-08-09 19:18:22 +02:00
Willy Tarreau	a9fddca778	MINOR: http: add the urlp_val ACL match It's derived from other urlp_* matches, but there was no way to check for an integer value and it seems like it's significantly used.	2012-07-31 07:55:32 +02:00
Willy Tarreau	491c498d97	BUG/MINOR: polling: some events were not set in various pollers fdtab[].ev was only set in ev_sepoll. Unfortunately, some I/O handling functions now rely on this, so depending on the polling mechanism, some useless operations might have been performed, such as performing a useless recv() when a HUP was reported. This is a very old issue, the flags were only added to the fdtab and not propagated into any poller. Then they were used in ev_sepoll which needed them for the cache. It is unsure whether a backport to 1.4 is appropriate or not.	2012-07-31 07:55:31 +02:00
Willy Tarreau	dae2a8a5a5	BUG/MINOR: tarpit: fix condition to return the HTTP 500 message Commit `fa7e1025` (1.3.16-rc1) introduced a minor bug by comparing req->flags with BF_READ_ERROR instead of checking for the bit. The result is that the error message is always returned even in case of client error. This has no real impact but this must be fixed. It may be backported to 1.4 and 1.3.	2012-07-31 07:55:31 +02:00
David du Colombier	65c1796c4a	MINOR: IPv6 support for transparent proxy Set socket option IPV6_TRANSPARENT on binding to enable transparent proxy on IPv6. This option is available from Linux 2.6.37.	2012-07-31 07:53:42 +02:00
Willy Tarreau	5b88da269c	OPTIM: i386: make use of kernel-mode-linux when available If haproxy is built with support for USE_VSYSCALL_DLSYM, it's very easy to check for KML availability. So let's enable it. Tests show a small overall performance improvement around 1%. Other tests show that the syscall overhead is divided by 4 on a Geode LX using this method.	2012-07-31 07:53:42 +02:00
Willy Tarreau	a7ad50cdb1	MEDIUM: pattern: add the "base" sample fetch method This one returns the concatenation of the first Host header entry with the path. It can make content-switching rules easier, help with fighting DDoS on certain URLs and improve shared caches efficiency.	2012-07-26 19:08:38 +02:00
Willy Tarreau	6812bcfc94	MINOR: replace acl_fetch_{path,url}* with smp_fetch_* Doing so allows us to support sticking on URL, URL's IP, URL's port and path. Both fetch functions should be improved to support an optional depth allowing to stick to a server depending on just a few directory components. This would help with portals, some prefetch-capable caches and with outgoing connections using multiple internet links.	2012-07-26 19:06:40 +02:00
Willy Tarreau	e3a461118c	BUG/MINOR: ACL implicit arguments must be created with unresolved flag Commit 496aa0 fixed a design issue by adding an "unresolved" flag to the ACL arguments. Unfortunately this unresolved flag was not set when building the fake argument some ACL need when using an implicit argument pointing to the local proxy. Special thanks to Michael Kearey who reported the issue with a reproducer and the commit introducing the bug.	2012-06-15 08:02:34 +02:00
Willy Tarreau	96596aeead	MEDIUM: fd/si: move peeraddr from struct fdinfo to struct connection The destination address is purely a connection thing and not an fd thing. It's also likely that later the address will be stored into the connection and linked to by the SI. struct fdinfo only keeps the pointer to the port range and the local port for now. All of this also needs to move to the connection but before this the release of the port range must move from fd_delete() to a new function dedicated to the connection.	2012-06-08 22:59:52 +02:00
Willy Tarreau	a05903174f	BUG/MAJOR: cookie prefix doesn't support cookie-less servers Commit `827aee91` merged in 1.5-dev5 introduced a regression causing the srv pointer to be tested twice instead of srv then srv->cookie. The result is that if a server has no cookie in prefix mode, haproxy will crash when trying to modify it. Such a config is very unlikely to happen, except maybe with a backup server, which would cause haproxy to die with the last server in the farm. No backport is needed, only 1.5-dev was affected.	2012-06-06 16:07:00 +02:00
Willy Tarreau	4f8a83cb6e	MEDIUM: stats: add the ability to kill sessions from the admin interface It was not possible to kill remaining sessions from the admin interface, which is annoying especially when switching to maintenance mode. Now it's possible.	2012-06-04 00:26:23 +02:00
Willy Tarreau	d72822442d	MEDIUM: stats: add support for soft stop/soft start in the admin interface One important missing feature on the web interface is the ability to perform a soft stop/soft start. This is now possible.	2012-06-04 00:22:44 +02:00
Justin Karneges	eb2c24ae2a	MINOR: checks: add on-marked-up option This implements the feature discussed in the earlier thread of killing connections on backup servers when a non-backup server comes back up. For example, you can use this to route to a mysql master & slave and ensure clients don't stay on the slave after the master goes from down->up. I've done some minimal testing and it seems to work. [WT: added session flag & doc, moved the killing after logging the server UP, and ensured that the new server is really usable]	2012-06-03 23:48:42 +02:00
Willy Tarreau	39b0665bc7	BUG/MINOR: commit `196729ef` used wrong condition resulting in freeing constants Recent commit `196729ef` had inverted condition to free format strings. No backport is needed, it was never released.	2012-06-01 10:58:06 +02:00
Willy Tarreau	496aa0111e	BUG/MEDIUM: ensure that unresolved arguments are freed exactly once When passing arguments to ACLs and samples, some types are stored as strings then resolved later after config parsing is done. Upon exit, the arguments need to be freed only if the string was not resolved yet. At the moment we can encounter double free during deinit() because some arguments (eg: userlists) are freed once as their own type and once as a string. The solution consists in adding an "unresolved" flag to the args to say whether the value is still held in the <str> part or is final. This could be debugged thanks to a useful bug report from Sander Klein.	2012-06-01 10:40:52 +02:00
Willy Tarreau	4992dd2d30	MINOR: http: add support for "httponly" and "secure" cookie attributes httponly This option tells haproxy to add an "HttpOnly" cookie attribute when a cookie is inserted. This attribute is used so that a user agent doesn't share the cookie with non-HTTP components. Please check RFC6265 for more information on this attribute. secure This option tells haproxy to add a "Secure" cookie attribute when a cookie is inserted. This attribute is used so that a user agent never emits this cookie over non-secure channels, which means that a cookie learned with this flag will be presented only over SSL/TLS connections. Please check RFC6265 for more information on this attribute.	2012-05-31 21:02:17 +02:00
Willy Tarreau	b5ba17e3a9	BUG/MINOR: config: do not report twice the incompatibility between cookie and non-http This one was already taken care of in proxy_cfg_ensure_no_http(), so if a cookie is presented in a TCP backend, we got two warnings. This can be backported to 1.4 since it's been this way for 2 years (although not dramatic).	2012-05-31 20:47:00 +02:00
Willy Tarreau	674021329c	REORG/MINOR: use dedicated proxy flags for the cookie handling Cookies were mixed with many other options while they're not used as options. Move them to a dedicated bitmask (ck_opts). This has released 7 flags in the proxy options and leaves some room for new proxy flags.	2012-05-31 20:40:20 +02:00
Willy Tarreau	99a7ca2fa6	BUG/MINOR: log: don't report logformat errors in backends Logs have always been ignored by backends, do not report useless warnings there.	2012-05-31 19:39:23 +02:00
Willy Tarreau	196729eff8	BUG/MINOR: fix option httplog validation with TCP frontends Option httplog needs to be checked only once the proxy has been validated, so that its final mode (tcp/http) can be used. Also we need to check for httplog before checking the log format, so that we can report a warning about this specific option and not about the format it implies.	2012-05-31 19:30:26 +02:00
Willy Tarreau	743a2d3e14	BUG/MEDIUM: buffers: fix bi_putchr() to correctly advance the pointer bi_putchr() failed to move the buffer pointer forward. The only user was the peer handler which was broken, it failed to sync. Thanks to Herv� Commowick for reporting the issue.	2012-05-31 16:40:11 +02:00
Willy Tarreau	fa6bac6ec3	BUG/MEDIUM: register peer sync handler in the proper order Herv� Commowick reported a failure to resync upon restart caused by a segfault on the old process. This is due to the data_ctx of the connection being initialized after the stream interface.	2012-05-31 14:16:59 +02:00
Willy Tarreau	cde18fc1ba	BUG/MINOR: perform_http_redirect also needs to rewind the buffer Commit `d1de8af362` was incomplete, because perform_http_redirect() also needs to rewind the buffer since it's called after data are scheduled for forwarding. No backport needed.	2012-05-30 08:00:56 +02:00
Cyril Bont�	a32d275ab0	BUG/MEDIUM: option forwardfor if-none doesn't work with some configurations When "option forwardfor" is enabled in a frontend that uses backends, "if-none" ignores the header name provided in the frontend. This prevents haproxy to add the X-Forwarded-For header if the option is not used in the backend. This may introduce security issues for servers/applications that rely on the header provided by haproxy. A minimal configuration which can reproduce the bug: defaults mode http listen OK bind :9000 option forwardfor if-none server s1 127.0.0.1:80 listen BUG-frontend bind :9001 option forwardfor if-none default_backend BUG-backend backend BUG-backend server s1 127.0.0.1:80	2012-05-30 06:43:24 +02:00
Willy Tarreau	7de211c88b	MINOR: add a new function call tracer for debugging purposes This feature relies on GCC's ability to call helpers at function entry/exit points. We define these helpers to quickly dump the minimum info into a trace file that can be converted to a human readable format using a script in the contrib/trace directory. This has only been implemented in the GNU makefile for now on as it is unsure whether it's supported on all OSes. The feature is enabled by building with "TRACE=1". The performance impact is huge, so this feature should only be used when debugging. To limit the loss of performance, fprintf() has been disabled and the output is hand-crafted and emitted using fwrite(), resulting in doubling the performance. Using the TSC instead of gettimeofday() also doubles the performance. Around 1200 conns/s may be achieved on a Pentium-M 1.7 GHz which leads to around 50 MB/s of traces. The entry and exits of all functions will be dumped into a file designated by the HAPROXY_TRACE environment variable, or by default "trace.out". If the trace file name is empty or "/dev/null", then traces are disabled. If opening the trace file fails, then stderr is used. If HAPROXY_TRACE_FAST is used, then the time is taken from the global <now> variable. Last, if HAPROXY_TRACE_TSC is used, then the machine's TSC is used instead of the real time (almost twice as fast). The output format is : <sec.usec> <level> <caller_ptr> <dir> <callee_ptr> or : <tsc> <level> <caller_ptr> <dir> <callee_ptr> where <dir> is '>' when entering a function and '<' when leaving. The awk script in contrib/trace provides a nicer indented output : 6f74989e6f8 ->->-> run_poll_loop > signal_process_queue [src/haproxy.c:1097:0x804bd69] > [include/proto/signal.h:32:0x8049cd0] 6f74989eb00 run_poll_loop < signal_process_queue [src/haproxy.c:1097:0x804bd69] < [include/proto/signal.h:32:0x8049cd0] 6f74989ef44 ->->-> run_poll_loop > wake_expired_tasks [src/haproxy.c:1100:0x804bd72] > [src/task.c:123:0x8055060] 6f74989f3a6 ->->->-> wake_expired_tasks > eb32_lookup_ge [src/task.c:128:0x8055091] > [ebtree/eb32tree.c:138:0x80a8c70] 6f74989f7e9 wake_expired_tasks < eb32_lookup_ge [src/task.c:128:0x8055091] < [ebtree/eb32tree.c:138:0x80a8c70] 6f74989fc0d ->->->-> wake_expired_tasks > eb32_first [src/task.c:134:0x80550d5] > [ebtree/eb32tree.h:55:0x8054ad0] 6f7498a003d ->->->->-> eb32_first > eb_first [ebtree/eb32tree.h:56:0x8054af1] > [ebtree/ebtree.h:520:0x8054a10] 6f7498a0436 ->->->->->-> eb_first > eb_walk_down [ebtree/ebtree.h:521:0x8054a33] > [ebtree/ebtree.h:442:0x80549a0] 6f7498a0843 ->->->->->->-> eb_walk_down > eb_gettag [ebtree/ebtree.h:445:0x80549d6] > [ebtree/ebtree.h:418:0x80548e0] 6f7498a0c2b eb_walk_down < eb_gettag [ebtree/ebtree.h:445:0x80549d6] < [ebtree/ebtree.h:418:0x80548e0] 6f7498a1042 ->->->->->->-> eb_walk_down > eb_untag [ebtree/ebtree.h:447:0x80549e2] > [ebtree/ebtree.h:412:0x80548a0] 6f7498a1498 eb_walk_down < eb_untag [ebtree/ebtree.h:447:0x80549e2] < [ebtree/ebtree.h:412:0x80548a0] 6f7498a18c6 ->->->->->->-> eb_walk_down > eb_root_to_node [ebtree/ebtree.h:448:0x80549e7] > [ebtree/ebtree.h:432:0x8054960] 6f7498a1cd4 eb_walk_down < eb_root_to_node [ebtree/ebtree.h:448:0x80549e7] < [ebtree/ebtree.h:432:0x8054960] 6f7498a20c4 eb_first < eb_walk_down [ebtree/ebtree.h:521:0x8054a33] < [ebtree/ebtree.h:442:0x80549a0] 6f7498a24b4 eb32_first < eb_first [ebtree/eb32tree.h:56:0x8054af1] < [ebtree/ebtree.h:520:0x8054a10] 6f7498a289c wake_expired_tasks < eb32_first [src/task.c:134:0x80550d5] < [ebtree/eb32tree.h:55:0x8054ad0] 6f7498a2c8c run_poll_loop < wake_expired_tasks [src/haproxy.c:1100:0x804bd72] < [src/task.c:123:0x8055060] 6f7498a3095 ->->-> run_poll_loop > process_runnable_tasks [src/haproxy.c:1103:0x804bd7a] > [src/task.c:190:0x8055150] A nice improvement would possibly consist in trying to get the function's arguments in the stack and to dump a few more infor for some well-known functions (eg: the session's status for process_session).	2012-05-26 00:12:37 +02:00
Willy Tarreau	1e44a49c89	BUG/MINOR: checks: expire on timeout.check if smaller than timeout.connect It happens that haproxy doesn't displace the task in the wait queue when validating a connection, so if the check timeout is set to a smaller value than timeout.connect, it will not strike before timeout.connect. The bug is present at least in 1.4.15..1.4.21, so the fix must be backported.	2012-05-25 07:42:37 +02:00
Oskar Stolc	8dc4184c57	MINOR: balance uri: added 'whole' parameter to include query string in hash calculation This patch brings a new "whole" parameter to "balance uri" which makes the hash work over the whole uri, not just the part before the query string. Len and depth parameter are still honnored. The reason for this new feature is explained below. I have 3 backend servers, each accepting different form of HTTP queries: http://backend1.server.tld/service1.php?q=... http://backend1.server.tld/service2.php?q=... http://backend2.server.tld/index.php?query=...&subquery=... http://backend3.server.tld/image/49b8c0d9ff Each backend server returns a different response based on either: - the URI path (the left part of the URI before the question mark) - the query string (the right part of the URI after the question mark) - or the combination of both I wanted to set up a common caching cluster (using 6 Squid servers, each configured as reverse proxy for those 3 backends) and have HAProxy balance the queries among the Squid servers based on URL. I also wanted to achieve hight cache hit ration on each Squid server and send the same queries to the same Squid servers. Initially I was considering using the 'balance uri' algorithm, but that would not work as in case of backend2 all queries would go to only one Squid server. The 'balance url_param' would not work either as it would send the backend3 queries to only one Squid server. So I thought the simplest solution would be to use 'balance uri', but to calculate the hash based on the whole URI (URI path + query string), instead of just the URI path.	2012-05-22 07:56:54 +02:00
Emeric Brun	d88fd824b7	MEDIUM: protocol: add a pointer to struct sock_ops to the listener struct The listener struct is now aware of the socket layer to use upon accept(). At the moment, only sock_raw is supported so this patch should not change anything.	2012-05-21 22:22:39 +02:00
Emeric Brun	21adb02d19	MINOR: stream_interface: add a pointer to the listener for TARG_TYPE_CLIENT When the target is a client, it will be convenient to have a pointer to the original listener so that we can retrieve some configuration information at the stream interface level.	2012-05-21 22:22:39 +02:00
Willy Tarreau	1348d4ce0b	MINOR: peers: use the socket layer operations from the peer instead of sock_raw At the moment, all the peers are initialized to use sock_raw as the socket layer, so use this info in peers_session_create() instead of the hard-coded sock_raw.	2012-05-21 22:21:37 +02:00
Willy Tarreau	4da69a91a0	MEDIUM: stream_interface: call si_data_close() before releasing the si This will ensure that the data layer releases anything previously allocated.	2012-05-21 18:07:11 +02:00
Willy Tarreau	24208275d5	MINOR: stream_interface: add a data channel close function This function will be called later when splitting the shutdown in two steps. It will be needed by SSL and for remote socket operations to release unused contexts.	2012-05-21 17:59:53 +02:00
Willy Tarreau	949811319b	REORG/MEDIUM: stream_interface: move applet->state and private to connection The state and the private pointer are not specific to the applets, since SSL will require exactly both of them. Move them to the connection layer now and rename them. We also now ensure that both are NULL on first call.	2012-05-21 17:09:48 +02:00
Willy Tarreau	fb7508aefb	REORG/MINOR: stream_interface: move si->fd to struct connection The socket fd is used only when in socket mode and with a connection.	2012-05-21 16:47:54 +02:00
Willy Tarreau	73b013b070	MINOR: stream_interface: introduce a new "struct connection" type We start to move everything needed to manage a connection to a special entity "struct connection". We have the data layer operations and the control operations there. We'll also have more info in the future such as file descriptors and applet contexts, so that in the end it becomes detachable from the stream interface, which will allow connections to be reused between sessions. For now on, we start with minimal changes.	2012-05-21 16:31:45 +02:00
Willy Tarreau	fe7f1ea68e	REORG/MINOR: session: detect the TCP monitor checks at the protocol accept It does not make sense anymore to wait for a session creation to process a TCP monitor check which only closes the connection and returns. Better to process this immediately after the accept() return. It also saves us from counting a connection for monitor checks, which is much more logical.	2012-05-20 19:22:25 +02:00
Willy Tarreau	a190d591fc	REORG: move the send-proxy code to tcp_connect_write() It is much better and more efficient to consider that the send-proxy feature is part of the protocol layer than part of the data layer. Now the connection is considered established once the send-proxy line has been sent. This way the data layer doesn't have to care anymore about this specific part. The tcp_connect_write() function now automatically calls the data layer write() function once the connection is established, which saves calls to epoll_ctl/epoll_wait/process_session. It's starting to look more and more obvious that tcp_connect_read() and tcp_connect_write() are not TCP-specific but only socket-specific and as such should probably move, along with some functions from protocol.c, to a socket-specific file (eg: stream_sock). It would be nice to be able to support autonomous listeners to parse the proxy protocol before accepting a connection, so that we get rid of it at the session layer and to support using these informations in the tcp-request connection rules.	2012-05-20 18:35:19 +02:00
Willy Tarreau	8ae52cb144	BUG/MINOR: stop connect timeout when connect succeeds If the connect succeeds exactly at the same millisecond as the connect timeout is supposed to strike, the timeout is still considered while data may have already be sent. This results in a new connection attempt with no data and with the response being lost. Note that in practice the only real-world situation where this is observed is when connect timeouts are extremely low, too low for safe operations. This bug was encountered with a 1ms connect timeout. It is also present on 1.4 and needs to be fixed there too.	2012-05-20 10:38:46 +02:00
Willy Tarreau	9580d16e40	BUG/MAJOR: checks: don't call set_server_status_* when no LB algo is set David Touzeau reported that haproxy dies when a server is checked and is used in a farm with only "option transparent" and no LB algo. This is because the LB params are NULL, the functions should be checked before being called. The same bug is present in 1.4 so this patch must be backported.	2012-05-19 19:09:46 +02:00
Willy Tarreau	ea95316bf1	MEDIUM: http: msg->sov and msg->sol will never wrap These ones are offsets now, so they cannot wrap. Let's remove the useless wrapping detection and simplify the forwarding code.	2012-05-18 23:50:43 +02:00
Willy Tarreau	2692736aa3	MEDIUM: http: get rid of msg->som which is not used anymore msg->som was zero before the body and was used to carry the beginning of a chunk size for chunked-encoded messages, at a moment when msg->sol is always zero. Remove msg->som and replace it with msg->sol where needed.	2012-05-18 23:50:43 +02:00
Willy Tarreau	06a000f56e	CLEANUP: http: make it more obvious that msg->som is always null outside of chunks Since the recent buffer reorg, msg->som is redundant with buf->p but still appears at a number of places. This tiny patch allows to confirm that som follows two states : - 0 from the moment the message starts to be parsed - relative offset to ->p for start of chunk when parsing chunks During this second state, ->sol is never used, so we should probably merge the two.	2012-05-18 23:04:32 +02:00
Willy Tarreau	09d1e254c9	MAJOR: http: stop using msg->sol outside the parsers This is a left-over from the buffer changes. Msg->sol is always null at the end of the parsing, so we must not use it anymore to read headers or find the beginning of a message. As a side effect, the dump of the request in debug mode is working again because it was relying on msg->sol not being null. Maybe it will even be mergeable with another of the message pointers.	2012-05-18 22:43:55 +02:00
Willy Tarreau	d1de8af362	BUG/MAJOR: fix regression on content-based hashing and http-send-name-header The recent split between the buffers and HTTP messages in 1.5-dev9 caused a major trouble : in the past, we used to keep a pointer to HTTP data in the buffer struct itself, which was the cause of most of the pain we had to deal with buffers. Now the two are split but we lost the information about the beginning of the HTTP message once it's being forwarded. While it seems normal, it happens that several parts of the code currently rely on this ability to inspect a buffer containing old contents : - balance uri - balance url_param - balance url_param check_post - balance hdr() - balance rdp-cookie() - http-send-name-header All these happen after the data are scheduled for being forwarded, which also causes a server to be selected. So for a long time we've been relying on supposedly sent data that we still had a pointer to. Now that we don't have such a pointer anymore, we only have one possibility : when we need to inspect such data, we have to rewind the buffer so that ->p points to where it previously was. We're lucky, no data can leave the buffer before it's being connecting outside, and since no inspection can begin until it's empty, we know that the skipped data are exactly ->o. So we rewind the buffer by ->o to get headers and advance it back by the same amount. Proceeding this way is particularly important when dealing with chunked- encoded requests, because the ->som and ->sov fields may be reused by the chunk parser before the connection attempt is made, so we cannot rely on them. Also, we need to be able to come back after retries and redispatches, which might change the size of the request if http-send-name-header is set. All of this is accounted for by the output queue so in the end it does not look like a bad solution. No backport is needed.	2012-05-18 22:23:01 +02:00
Willy Tarreau	be0688c64d	MEDIUM: stream_interface: remove the si->init Calling the init() function in sess_establish was a bad idea, it is too late to allow it to fail on lack of resource and does not help at all. Remove it for now before it's used.	2012-05-18 15:15:26 +02:00
David du Colombier	7af4605ef7	BUG/MAJOR: trash must always be the size of a buffer Before it was possible to resize the buffers using global.tune.bufsize, the trash has always been the size of a buffer by design. Unfortunately, the recent buffer sizing at runtime forgot to adjust the trash, resulting in it being too short for content rewriting if buffers were enlarged from the default value. The bug was encountered in 1.4 so the fix must be backported there.	2012-05-16 14:21:55 +02:00
Willy Tarreau	7bb68abb9f	OPTIM/MEDIUM: stream_interface: add a new SI_FL_NOHALF flag This flag indicates that we're not interested in keeping half-open connections on a stream interface. It has the benefit of allowing the socket layer to cause an immediate write close when detecting an incoming read close. This releases resources much faster and saves one syscall (either a shutdown or setsockopt). This flag is only set by HTTP on the interface going to the server since we don't want to continue pushing data there when it has closed. Another benefit is that it responds with a FIN to a server's FIN instead of responding with an RST as it used to, which is much cleaner. Performance gains of 7.5% have been measured on HTTP connection rate on empty objects.	2012-05-13 14:52:22 +02:00
Willy Tarreau	dbcd47ea35	OPTIM/MAJOR: ev_sepoll: process spec events after polled events A suboptimal behaviour was appearing quite often with sepoll. When a speculative write failed after a connect(), the socket was added to the poll list using epoll_ctl(ADD). Then when epoll_wait() returned a write event, the send() was performed and write event disabled, causing it to get back to the spec list in order to be disabled later. But if some new accept() did succeed in the same run, then fd_created was not null, causing a new run of the spec list to happen. This run would then detect the old event in STOP state and would remove it from the poll list using epoll_ctl(DEL). After this, process_session() enables reading on the FD, attempting an speculative recv() which fails then adds it again using epoll_ctl(ADD) to do it again. So the total sequence of syscalls looked like this : connect(fd) = EAGAIN send(fd) = EAGAIN epoll_ctl(ADD(fd:OUT)) epoll_wait() = fd:OUT send(fd) > 0 epoll_ctl(DEL(fd)) recv(fd) = EAGAIN epoll_ctl(ADD(fd:IN)) recv(fd) > 0 In order to fix this stupid situation, we must compute the epoll_ctl() parameters at the last moment, just before doing epoll_wait(). This is what was done except that the spec events were processed just before doing that without leaving time for the tasks to adjust the FDs if needed. This is also the reason we have the re_poll_once label to try to catch new events in case of a successful accept(). The new solution consists in doing the opposite : - compute epoll_ctl() - call epoll_wait() - call spec events This significantly reduces the number of iterations on the spec events and avoids a huge number of epoll_ctl() ping/pongs. The new sequence above simply becomes : connect(fd) = EAGAIN send(fd) = EAGAIN epoll_ctl(ADD(fd:OUT)) epoll_wait() = fd:OUT send(fd) > 0 epoll_ctl(MOD(fd:IN)) recv(fd) > 0 Also, there is no need to re-run the spec events after an accept() as it will automatically be detected in the spec list after a return from polled events. The gains are important, with up to 4.5% global performance increase in connection rate on HTTP with small objects. The code is less tricky and does not need anymore to skip epoll_wait() every other call, nor to track the number of FDs newly created.	2012-05-13 09:55:07 +02:00
Willy Tarreau	93548be149	OPTIM: proto_http: don't enable quick-ack on empty buffers Commit `5e205524` was a bit overzealous by inconditionally enabling quick ack when a request is not yet in the buffer, because it also does so when nothing has been received yet, causing a useless ACK to be emitted. Improve the situation by doing this only if the input buffer is empty (indicating that nothing was sent by the client). In case of keep-alive, an empty buffer means we already have a response in flight which will serve as an ACK.	2012-05-13 08:44:16 +02:00
Willy Tarreau	b147a8382a	CLEANUP: fd: remove unused cb->b pointers in the struct fdtab These pointers were used to hold pointers to buffers in the past, but since we introduced the stream interface, they're no longer used but they were still sometimes set. Removing them shrink the struct fdtab from 32 to 24 bytes on 32-bit machines, and from 52 to 36 bytes on 64-bit machines, which is a significant saving. A quick tests shows a steady 0.5% performance gain, probably due to the better cache efficiency.	2012-05-13 00:35:44 +02:00
Willy Tarreau	ce887fd3b2	MEDIUM: session: add support for tunnel timeouts Tunnel timeouts are used when TCP connections are forwarded, or when forwarding upgraded HTTP connections (WebSocket) as well as CONNECT requests to proxies. This timeout allows long-lived sessions to be supported without having to set large timeouts to normal requests.	2012-05-12 12:50:00 +02:00
Willy Tarreau	2f5b6fc090	MINOR: session: call the socket layer init function when a session establishes In sess_establish, once we've prepared everythin, we can call the socket layer init function. We pass an argument for targets which have one (eg: servers). At the moment, the existing socket layers don't have init functions, but SSL will need one.	2012-05-12 08:09:27 +02:00
Willy Tarreau	eeda90e68c	MAJOR: fd: remove the need for the socket layer to recheck the connection Up to now, if an outgoing connection had no data to send, the socket layer had to perform a connect() again to check for establishment. This is not acceptable for SSL, and will cause problems with socketpair(). Some socket layers will also need an initializer before sending data (eg: SSL). The solution consists in moving the connect() test to the protocol layer (eg: TCP) and to make it hold the fd->write callback until the connection is validated. At this point, it will switch the write callback to the socket layer's write function. In fact we need to hold both read and write callbacks to ensure the socket layer is never called before being initialized. This intermediate callback is used only if there is a socket init function or if there are no data to send. The socket layer does not have any code to check for connection establishment anymore, which makes sense.	2012-05-11 20:18:26 +02:00
Willy Tarreau	d02394b5a1	MEDIUM: stream_interface: derive the socket operations from the target Instead of hard-coding sock_raw in connect_server(), we set this socket operation at config parsing time. Right now, only servers and peers have it. Proxies are still hard-coded as sock_raw. This will be needed for future work on SSL which requires a different socket layer.	2012-05-11 18:52:14 +02:00
Willy Tarreau	64798bd720	MINOR: stream_interface: add an init callback to sock_ops This will be needed for some socket layers such as SSL. It's not used at the moment.	2012-05-11 18:39:26 +02:00
Willy Tarreau	f873d754f8	CLEANUP: stream_interface: stop exporting socket layer functions Similarly to the previous patch, we don't need the socket-layer functions outside of stream_interface. They could even move to a file dedicated to applets, though that does not seem particularly useful at the moment.	2012-05-11 17:47:17 +02:00
Willy Tarreau	b277d6e568	CLEANUP: sock_raw: remove last references to stream_sock We also stop exporting all functions since they're not needed anymore outside of sock_raw.c.	2012-05-11 17:03:42 +02:00
Willy Tarreau	59b9479667	BUG/MEDIUM: stream_interface: restore get_src/get_dst Commit e164e7a removed get_src/get_dst setting in the stream interfaces but forgot to set it in proto_tcp. Get the feature back because we need it for logging, transparent mode, ACLs etc... We now rely on the stream interface direction to know what syscall to use. One benefit of doing it this way is that we don't use getsockopt() anymore on outgoing stream interfaces nor on UNIX sockets.	2012-05-11 16:48:10 +02:00
Willy Tarreau	1539a01645	MINOR: stream_interface: add a client target : TARG_TYPE_CLIENT This one will be used to identify the direction the SI is being used. All incoming connections have a target of type TARG_TYPE_CLIENT.	2012-05-11 14:47:34 +02:00
Willy Tarreau	c63190d429	REORG: use the name sock_raw instead of stream_sock We'll soon have an SSL socket layer, and in order to ease the difference between the two, we use the name "sock_raw" to designate the one which directly talks to the sockets without any conversion.	2012-05-11 14:23:52 +02:00
Willy Tarreau	46b39d0dc6	BUG/MEDIUM: config: don't crash at config load time on invalid userlist names Cyril Bont� reported that passing an invalid userlist name to http_auth_group() caused haproxy to crash at load. This was due to an attempt to use the unresolved userlist pointer later to resolve auth groups since we report many errors before leaving now. This issue does not exist in earlier versions since they immediately abort on the first error, so no backport is needed.	2012-05-10 23:42:22 +02:00
Willy Tarreau	b95095979c	CLEANUP: auth: make the code build again with DEBUG_AUTH Reported by Cyril Bont�, minor issue caused by recent ACL rework.	2012-05-10 23:25:35 +02:00
Willy Tarreau	4a3fd4c8df	BUG/MAJOR: acl: http_auth_group() must not accept any user from the userlist http_auth and http_auth_group used to share the same fetch function, while they're doing very different things. The first one only checks whether the supplied credentials are valid wrt a userlist while the second not only checks this but also checks group ownership from a list of patterns. Recent acl/pattern merge caused a simplification here by which the fetch function would always return a boolean, so the group match was always fine if the user:password was valid, regardless of the patterns provided with the ACL. The proper solution consists in splitting the function in two, depending on what is desired. It's also worth noting that check_user() would probably be split, one to check user:password, and the other one to check for group ownership for an already valid user:password combination. At this point it is not certain if the group mask is still useful or not considering that the passwd check is always made. This bug was reported and diagnosed by Cyril Bont�. It first appeared in 1.5-dev9 so it does not need any backporting.	2012-05-10 23:18:26 +02:00
Cyril Bont�	20a804ac6d	BUG/MINOR: stats admin: "Unexpected result" was displayed unconditionally I introduced a regression in commit `19979e176e` while reworking the admin actions results. "Unexpected result" was displayed even if the action was applied due to a misplaced initialization. This small patch should fix it. Note: no need to backport.	2012-05-10 21:51:17 +02:00
Willy Tarreau	a7fe8e527c	MINOR: http: replace http_message_realign() with buffer_slow_realign() There is no more reason for the realign function being HTTP specific, it only operates on a buffer now. Let's move it to buffers.c instead. It's likely that buffer_bounce_realign is broken (not used), this will have to be inspected. The function is worth rewriting as it can be cheaper than buffer_slow_realign() to realign large wrapping buffers.	2012-05-08 21:28:17 +02:00
Willy Tarreau	0a3dd74c9c	MEDIUM: cfgparse: use the new error reporting framework for remaining cfg_keywords All keywords registered using a cfg_kw_list now make use of the new error reporting framework. This allows easier and more precise error reporting without having to deal with complex buffer allocation issues.	2012-05-08 21:28:17 +02:00
Willy Tarreau	a93c74be5c	MEDIUM: cfgparse: make backend_parse_balance() use memprintf to report errors Using the new error reporting framework makes it easier to report complex errors.	2012-05-08 21:28:17 +02:00
Willy Tarreau	f4068b6503	MINOR: cfgparse: use a common errmsg pointer for all parsers In order to generalize the simplified error reporting mechanism, let's centralize the error pointer.	2012-05-08 21:28:17 +02:00
Willy Tarreau	bd83314ee9	BUG/MEDIUM: log: ensure that unique_id is properly initialized Last memory poisonning patch immediately made this issue appear. The unique_id field is released but not properly initialized. The feature was introduced very recently, no backport is needed.	2012-05-08 21:28:16 +02:00
Willy Tarreau	6e0644339f	MEDIUM: memory: add the ability to poison memory at run time From time to time, some bugs are discovered that are caused by non-initialized memory areas. It happens that most platforms return a zero-filled area upon first malloc() thus hiding potential bugs. This patch also replaces malloc() in pools with calloc() to ensure that all platforms exhibit the same behaviour upon startup. In order to catch these bugs more easily, add a -dM command line flag to enable memory poisonning. Optionally, passing -dM<byte> forces the poisonning byte to <byte>.	2012-05-08 21:28:16 +02:00
Willy Tarreau	63e7fe310e	BUG/MEDIUM: send_proxy: fix initialisation of send_proxy_ofs Commit `b22e55bc` introduced send_proxy_ofs but forgot to initialize it, which remained unnoticed since it's always at the same place in the stream interface. On a machine with dirty RAM returned by malloc(), some responses were holding a PROXY header, which normally is not possible. The problem goes away after properly initializing the field upon each new session_accept(). This fix does not need to be backported except if any code makes use of a backport of this feature.	2012-05-08 21:28:16 +02:00
Willy Tarreau	515393649c	MINOR: acl: add the cook_val() match to match a cookie against an integer	2012-05-08 21:28:16 +02:00
Willy Tarreau	d04b1bce69	MEDIUM: http: improve error capture reports A number of important information were missing from the error captures, so let's improve them. Now we also log source port, session flags, transaction flags, message flags, pending output bytes, expected buffer wrapping position, total bytes transferred, message chunk length, and message body length. As such, the output format has slightly evolved and the source address moved to the third line : [08/May/2012:11:14:36.341] frontend echo (#1): invalid request backend echo (#1), server <NONE> (#-1), event #1 src 127.0.0.1:40616, session #4, session flags 0x00000000 HTTP msg state 26, msg flags 0x00000000, tx flags 0x00000000 HTTP chunk len 0 bytes, HTTP body len 0 bytes buffer flags 0x00909002, out 0 bytes, total 28 bytes pending 28 bytes, wrapping at 8030, error at position 7: 00000 GET / /?t=20000 HTTP/1.1\r\n 00026 \r\n [08/May/2012:11:13:13.426] backend echo (#1) : invalid response frontend echo (#1), server local (#1), event #0 src 127.0.0.1:40615, session #1, session flags 0x0000044e HTTP msg state 32, msg flags 0x0000000e, tx flags 0x08200000 HTTP chunk len 0 bytes, HTTP body len 20 bytes buffer flags 0x00008002, out 81 bytes, total 92 bytes pending 11 bytes, wrapping at 7949, error at position 9: 00000 Foo: bar\r\r\n	2012-05-08 21:28:16 +02:00
Willy Tarreau	69d8c5d99e	BUG/MINOR: http: ensure that msg->err_pos is always relative to buf->p Since the beginning of buffer&msg changes, the error position (err_pos) had not completely been converted and some offsets still appear wrong. Now we ensure that everywhere msg->err_pos is relative to buf->p and we always report buf->i bytes starting at buf->p in all error captures, which ensures that err_pos is there. This is not exactly a bug and is specific to latest changes so no backport is needed.	2012-05-08 21:28:15 +02:00
Willy Tarreau	d6c2e8c916	BUG/MINOR: http: error snapshots are wrong if buffer wraps Commit 81f2fb added support for wrapping buffer captures, but unfortunately the code used to perform two memcpy() over the same destination, causing a loss of the start of the buffer rendering some error snapshots unusable. This bug is present in 1.4 too and must be backported.	2012-05-08 21:28:15 +02:00
Willy Tarreau	22bca61404	MEDIUM: proto_tcp: remove src6 and dst6 pattern fetch methods These methods have been superseded by src and dst which support multiple families. There is no point keeping them since they appeared in a development version anyway. For configurations using "src6", please use "src" instead. For "dst6", use "dst" instead.	2012-05-08 21:28:15 +02:00
Willy Tarreau	bbebbbff83	REORG/MEDIUM: move the default accept function from sockstream to protocols.c The previous sockstream_accept() function uses nothing from sockstream, and is totally irrelevant to stream interfaces. Move this to the protocols.c file which handles listeners and protocols, and call it listener_accept(). It now makes much more sense that the code dealing with listen() also handles accept() and passes it to upper layers.	2012-05-08 21:28:15 +02:00
Willy Tarreau	26d8c59f0b	REORG/MEDIUM: replace stream interface protocol functions by a proto pointer The stream interface now makes use of the socket protocol pointer instead of the direct functions.	2012-05-08 21:28:15 +02:00
Willy Tarreau	5c979a9c71	REORG/MEDIUM: stream_interface: initialize socket ops from descriptors	2012-05-08 21:28:14 +02:00
Willy Tarreau	1b79bdee26	REORG/MEDIUM: move protocol->{read,write} to sock_ops The protocol must not set the read and write callbacks, they're specific to the socket layer. Move them to sock_ops instead.	2012-05-08 21:28:14 +02:00
Willy Tarreau	060781fb4a	REORG: stream_interface: create a struct sock_ops to hold socket operations These operators are used regardless of the socket protocol family. Move them to a "sock_ops" struct. ->read and ->write have been moved there too as they have no reason to remain at the protocol level.	2012-05-08 21:28:14 +02:00
Willy Tarreau	ceb4ac9c34	MEDIUM: acl: support IPv6 address matching Make use of the new IPv6 pattern type so that acl_match_ip() knows how to compare pattern and sample. IPv6 may be entered in their usual form, with or without a netmask appended. Only bit counts are accepted for IPv6 netmasks. In order to avoid any risk of trouble with randomly resolved IP addresses, host names are never allowed in IPv6 patterns. HAProxy is also able to match IPv4 addresses with IPv6 addresses in the following situations : - tested address is IPv4, pattern address is IPv4, the match applies in IPv4 using the supplied mask if any. - tested address is IPv6, pattern address is IPv6, the match applies in IPv6 using the supplied mask if any. - tested address is IPv6, pattern address is IPv4, the match applies in IPv4 using the pattern's mask if the IPv6 address matches with 2002:IPV4::, ::IPV4 or ::ffff:IPV4, otherwise it fails. - tested address is IPv4, pattern address is IPv6, the IPv4 address is first converted to IPv6 by prefixing ::ffff: in front of it, then the match is applied in IPv6 using the supplied IPv6 mask.	2012-05-08 21:28:14 +02:00
Willy Tarreau	6d20e28556	MINOR: standard: add an IPv6 parsing function (str62net) str62net returns an address and a netmask in number of bits.	2012-05-08 20:57:21 +02:00
Willy Tarreau	c92ddbc37d	MINOR: acl: add types to ACL patterns We cannot currently match IPv6 addresses in ACL simply because we don't support types on the patterns. Let's introduce this notion. For now, we rely on the SMP_TYPES though it doesn't seem like it will last forever given that some types are not present there (eg: regex, meth). Still it should be enough to support mixed matchings for most types. We use the special impossible value SMP_TYPES for types that don't exist in the SMP_T_* space.	2012-05-08 20:57:21 +02:00
Willy Tarreau	cd3b094618	REORG: rename "pattern" files They're now called "sample" everywhere to match their description.	2012-05-08 20:57:21 +02:00
Willy Tarreau	1278578487	REORG: use the name "sample" instead of "pattern" to designate extracted data This is mainly a massive renaming in the code to get it in line with the calling convention. Next patch will rename a few files to complete this operation.	2012-05-08 20:57:20 +02:00
Willy Tarreau	7dcb6480db	MEDIUM: acl: extend the pattern parsers to report meaningful errors By passing the error pointer to all ACL parsers, we can make them report useful errors and not simply fail.	2012-05-08 20:57:20 +02:00
Willy Tarreau	08ad0b38c4	MINOR: acl: report errors encountered when loading patterns from files This happens in acl_read_patterns_from_file(). Errors are still incomplete, parsing functions must be improved to report parsing errors.	2012-05-08 20:57:20 +02:00
Willy Tarreau	4e6336fdfd	MINOR: arg: improve error reporting on invalid arguments It's important to report the faulty argument position and to distinguish between empty arguments and wrong ones. Integers were not properly tested either, now their parsing has been improved to report use of incorrect characters.	2012-05-08 20:57:20 +02:00
Willy Tarreau	b7451bb660	MEDIUM: acl: report parsing errors to the caller All parsing errors were known but impossible to return. Now by making use of memprintf(), we're able to build meaningful error messages that the caller can display.	2012-05-08 20:57:20 +02:00
Willy Tarreau	28376d62cb	MEDIUM: http: merge ACL and pattern cookie fetches into a single one It's easy to merge pattern and ACL fetches of cookies. It allows us to remove two distinct fetch functions. The new function internally uses an occurrence number to serve both purposes, but it didn't appear worth exposing it outside so there is no keyword argument to set it. However one of the benefits is that the "cookie" fetch for stick tables now automatically adapts to requests and responses, so there is no more need for set-cookie().	2012-05-08 20:57:19 +02:00
Willy Tarreau	185b5c4a7b	MEDIUM: http: merge acl and pattern header fetch functions HTTP header fetch is now done using smp_fetch_hdr() for both ACLs and patterns. This one also supports an occurrence number, making it possible to specify explicit occurrences for ACLs and patterns.	2012-05-08 20:57:19 +02:00
Willy Tarreau	0d5fe144a1	MINOR: proto_tcp: validate arguments of payload and payload_lv ACLs Now it's possible to control arguments, so let's do it.	2012-05-08 20:57:19 +02:00
Willy Tarreau	ae52f06da3	MINOR: acl: add a val_args field to keywords This will make it possible to delegate argument validating to functions shared with smp_fetch_*.	2012-05-08 20:57:19 +02:00
Willy Tarreau	7a777edbdf	MINOR: acl: set SMP_OPT_ITERATE on fetch functions This way, fetch functions will be able to tell if they're called for a single request or as part of a loop. This is important for instance when we use hdr(foo), because in an ACL this means that all hdr(foo) occurrences must be checked while in a pattern it means only one of them (eg: last one).	2012-05-08 20:57:18 +02:00
Willy Tarreau	d6281ae046	MEDIUM: pattern: use smp_fetch_rdp_cookie instead of the pattern specific version pattern_fetch_rdp_cookie() is useless now since it only used to add controls on top of smp_fetch_rdp_cookie() which have now been integrated into the pattern subsystem. Let's remove it.	2012-05-08 20:57:18 +02:00
Willy Tarreau	40aebd9239	MINOR: pattern: centralize handling of unstable data in pattern_process() Pattern fetch functions currently check for unstable data and return 0 when SMP_F_MAY_CHANGE is set. Instead of doing this everywhere and having to support specific fetch functions, better do that in pattern_process() which is the one interested in having stable data.	2012-05-08 20:57:18 +02:00
Willy Tarreau	7fc1c6eefb	MINOR: stick_table: centralize the handling of empty keys Right now, it's up to each pattern fetch method to return NULL when an empty string is returned, which is neither correct nor desirable as it is only stick tables which need to ignore empty patterns. Let's perform this check in stktable_fetch_key() instead.	2012-05-08 20:57:18 +02:00
Willy Tarreau	82ea800b0f	CLEANUP: pattern: ensure that payload and payload_lv always stay in the buffer A test was already performed which worked by pure luck due to integer types, otherwise it would have been possible to start checking for an offset out of the buffer's bounds if the buffer size was large enough to allow an integer wrap. Let's perform explicit checks and use unsigned ints for offsets instead of risking being hit later.	2012-05-08 20:57:18 +02:00
Willy Tarreau	0ce3aa0c66	MEDIUM: acl: implement payload and payload_lv These ones were easy to adapt to ACL usage and may really be useful, so let's make them available right now. It's likely that some extension such as regex, string-to-IP and raw IP matching will be implemented in the near future.	2012-05-08 20:57:17 +02:00
Willy Tarreau	4a12981c68	MEDIUM: acl/pattern: factor out the src/dst address fetches Since pattern_process() is able to automatically cast returned types into expected types, we can safely use the sample functions to fetch addresses whatever their family. The lowest castable type must be declared with the keyword so that config checks pass. Right now this means that src/dst use the same fetch function for ACLs and patterns. src6/dst6 have been kept so that configs which explicitly rely on v6 are properly checked.	2012-05-08 20:57:17 +02:00
Willy Tarreau	12e5011a76	MEDIUM: pattern: ensure that sample types always cast into other types. We want to ensure that a dynamically returned type will always have a cast before calling the cast function. This is done in pattern_process() and in stktable_fetch_key().	2012-05-08 20:57:17 +02:00
Willy Tarreau	25c1ebc0c9	MEDIUM: acl/pattern: start merging common sample fetch functions src_port, dst_port and url_param have converged between ACLs and patterns. This means that src_port is now available in patterns and that urlp_* has been added to ACLs. Some code has moved to accommodate for static function definitions, but there were little changes.	2012-05-08 20:57:17 +02:00
Willy Tarreau	32a6f2e572	MEDIUM: acl/pattern: use the same direction scheme Patterns were using a bitmask to indicate if request or response was desired in fetch functions and keywords. ACLs were using a bitmask in fetch keywords and a single bit in fetch functions. ACLs were also using an ACL_PARTIAL bit in fetch functions indicating that a non-final fetch was performed, which was an abuse of the existing direction flag. The change now consists in using : - a capabilities field for fetch keywords => SMP_CAP_REQ/RES to indicate if a keyword supports requests, responses, both, etc... - an option field for fetch functions to indicate what the caller expects (request/response, final/non-final) The ACL_PARTIAL bit was reversed to get SMP_OPT_FINAL as it's more explicit to know we're working on a final buffer than on a non-final one. ACL_DIR_* were removed, as well as PATTERN_FETCH_*. L4 fetches were improved to support being called on responses too since they're still available. The <dir> field of all fetch functions was changed to <opt> which is now unsigned. The patch is large but mostly made of cosmetic changes to accomodate this, as almost no logic change happened.	2012-05-08 20:57:17 +02:00
Willy Tarreau	9fb4bc7f43	MINOR: tcp: replace acl_fetch_rdp_cookie with smp_fetch_rdp_cookie The former was only a wrapper to the second, let's remove it now that the calling convention is exactly the same. This is the first function to be unified between ACLs and samples.	2012-05-08 20:57:16 +02:00
Willy Tarreau	24e32d8c6b	MEDIUM: acl: replace acl_expr with args in acl fetch_* functions Having the args everywhere will make it easier to share fetch functions between patterns and ACLs. The only place where we could have needed the expr was in the http_prefetch function which can do well without.	2012-05-08 20:57:16 +02:00
Willy Tarreau	32389b7d04	MEDIUM: acl/pattern: switch rdp_cookie functions stack up-down Previously, both pattern, backend and persist_rdp_cookie would build fake ACL expressions to fetch an RDP cookie by calling acl_fetch_rdp_cookie(). Now we switch roles. The RDP cookie fetch function is provided as a sample fetch function that all others rely on, including ACL. The code is exactly the same, only the args handling moved from expr->args to args. The code was moved to proto_tcp.c, but probably that a dedicated file would be more suited to content handling.	2012-05-08 20:57:16 +02:00
Willy Tarreau	b8c8f1f611	MEDIUM: pattern: retrieve the sample type in the sample, not in the keyword description We need the pattern fetchers and converters to correctly set the output type so that they can be used by ACL fetchers. By using the sample type instead of the keyword type, we also open the possibility to create some multi-type pattern fetch methods later (eg: "src" being v4/v6). Right now the type in the keyword is used to validate the configuration.	2012-05-08 20:57:16 +02:00
Willy Tarreau	342acb4775	MEDIUM: pattern: integrate pattern_data into sample and use sample everywhere Now there is no more reference to union pattern_data. All pattern fetch and conversion functions now make use of the common sample type. Note: none of them adjust the type right now so it's important to do it next otherwise we would risk sharing such functions with ACLs and seeing them fail.	2012-05-08 20:57:15 +02:00
Willy Tarreau	b4a88f0672	MINOR: pattern: replace struct pattern with struct sample This change is pretty minor. Struct pattern is only used for pattern_process() now so changing it to use the common type is quite obvious. It's worth noting that the last argument of pattern_process() is never used so the function is self-sufficient. Note that pattern_process() does not initialize the pattern at all before calling fetch->process(), and that minimal initialization will be required when we later change the argument for the sample.	2012-05-08 20:57:15 +02:00
Willy Tarreau	21e5b0e3cb	MEDIUM: get rid of SMP_F_READ_ONLY and SMP_F_MUST_FREE These ones were either unused or improperly used. Some integers were marked read-only, which does not make much sense. Buffers are not read-only, they're "constant" in that they must be kept intact after any possible change.	2012-05-08 20:57:15 +02:00
Willy Tarreau	197e10aaae	MEDIUM: acl: get rid of the SET_RES flags We now simply rely on a boolean result from a fetch to declare a match. Booleans are not compared against patterns, they fix the result.	2012-05-08 20:57:15 +02:00
Willy Tarreau	f853c46bc3	MEDIUM: pattern/acl: get rid of temp_pattern in ACLs This one is not needed anymore as we can return the data and its type in the sample provided by the caller. ACLs now always return the proper type. BOOL is already returned when the result is expected to be processed as a boolean. temp_pattern has been unexported now.	2012-05-08 20:57:14 +02:00
Willy Tarreau	3740635b88	MAJOR: acl: make use of the new sample struct and get rid of acl_test This change is invasive in lines of code but not much in terms of functionalities as it's mainly a replacement of struct acl_test with struct sample.	2012-05-08 20:57:14 +02:00
Willy Tarreau	422aa0792d	MEDIUM: pattern: add new sample types to replace pattern types The new sample types are necessary for the acl-pattern convergence. These types are boolean and signed int. Some types were renamed for less ambiguity (ip->ipv4, integer->uint).	2012-05-08 20:57:14 +02:00
Willy Tarreau	8f7406e9b4	MEDIUM: acl: remove the ACL_TEST_F_NULL_MATCH flag This flag was used to force a boolean match even if there was no pattern to match. It was used only by http_auth() and designed only for this one. It's easier and cleaner to make the fetch function perform the test and report the boolean result as a few other functions already do. It simplifies the acl_exec_cond() logic and will help merging ACLs and patterns.	2012-05-08 20:57:13 +02:00
Willy Tarreau	b27c0d35dd	MEDIUM: pattern: report the precise argument parsing error when known. The argument parser knows what exact error it has faced, and the pattern parser is able to report errors, so let's make use of it. From now on, it becomes possible to detect such things : $ ./haproxy -db -f echo5.cfg [ALERT] 110/160344 (4791) : parsing [echo5.cfg:38] : 'stick': invalid arg 2 in fetch method 'payload' : Missing arguments (got 1/2), type 'unsigned integer' expected. [ALERT] 110/160344 (4791) : parsing [echo5.cfg:39] : 'stick': invalid args in fetch method 'payload' : payload length must be > 0. [ALERT] 110/160344 (4791) : parsing [echo5.cfg:40] : 'stick': invalid arg 3 in fetch method 'payload_lv' : Failed to parse 'x' as type 'signed integer'. [ALERT] 110/160344 (4791) : parsing [echo5.cfg:41] : 'stick': invalid arg 4 in fetch method 'payload_lv' : End of arguments expected at ',13'. [ALERT] 110/160344 (4791) : Error(s) found in configuration file : echo5.cfg [ALERT] 110/160344 (4791) : Fatal errors found in configuration.	2012-05-08 20:57:13 +02:00
Willy Tarreau	21d68a6895	MEDIUM: pattern: add an argument validation callback to pattern descriptors This is used to validate that arguments are coherent. For instance, payload_lv expects that the last arg (if any) is not more negative than the sum of the first two. The error is reported if any.	2012-05-08 20:57:13 +02:00
Willy Tarreau	9fcb984b17	MEDIUM: pattern: use the standard arg parser We don't need the pattern-specific args parsers anymore, make use of the common parser instead. We still need to improve this by adding a validation function to report abnormal argument values or combinations. We don't report precise parsing errors yet but this was not previously done either.	2012-05-08 20:57:13 +02:00
Willy Tarreau	f995410355	MEDIUM: pattern: get rid of arg_i in all functions making use of arguments arg_i was almost unused, and since we migrated to use struct arg everywhere, the rare cases where arg_i was needed could be replaced by switching to arg->type = ARGT_STOP.	2012-05-08 20:57:12 +02:00
Willy Tarreau	ecfb8e8ff9	MEDIUM: pattern: replace type pattern_arg with type arg arg is more complete than pattern_arg since it also covers ACL args, so let's use this one instead.	2012-05-08 20:57:12 +02:00
Willy Tarreau	0146c2e873	MEDIUM: acl: remove unused tests for missing args when args are mandatory A number of ACL fetch methods use mandatory arguments (eg: proxy names) so it's pointless to test for the presence of this argument now.	2012-05-08 20:57:12 +02:00
Willy Tarreau	fc2c1fd449	MAJOR: acl: ensure that implicit table and proxies are valid A large number of ACLs make use of frontend, backend or table names in their arguments, and fall back to the current proxy when no argument is passed. If the expected capability is not available, the ACL silently fails at runtime. Now we make all those names mandatory in the parser and we rely on acl_find_targets() to replace the missing names with the holding proxy, then to perform the appropriate tests, and to reject errors at parsing time. It is possible that some faulty configurations will get rejected from now on, while they used to silently fail till now. This is the reason why this change is marked as MAJOR.	2012-05-08 20:57:12 +02:00
Willy Tarreau	d28c353fc5	MAJOR: acl: make acl_find_targets also resolve proxy names at config time Proxy names are now resolved when the config is parsed and not at runtime. This means that errors will be caught for real instead of having an ACL silently never match. Another benefit is that the fetch will be much faster since the lookup will not have to be performed anymore, eg for all ACLs based on explicitly named stick-tables. However some buggy configurations which used to silently fail in the past will now refuse to load, hence the MAJOR tag.	2012-05-08 20:57:11 +02:00
Willy Tarreau	63364eed75	MEDIUM: acl: acl_find_target() now resolves arguments based on their types This function does not rely on the keyword anymore but just on its type. It's much cleaner and much safer. It should be extended to do the same for all PRX type arguments.	2012-05-08 20:57:11 +02:00
Willy Tarreau	61612d49a7	MAJOR: acl: store the ACL argument types in the ACL keyword declaration The types and minimal number of ACL keyword arguments are now stored in their declaration. This will allow many more fantasies if some ACL use several arguments or types. Doing so required to rework all ACL keyword declarations to add two parameters. So this was a good opportunity for a general cleanup and to sort all entries in alphabetical order. We still have two pending issues : - parse_acl_expr() checks for errors but has no way to report them to the user ; - the types of some arguments are still not resolved and kept as strings (eg: ARGT_FE/BE/TAB) for compatibility reasons, which must be resolved in acl_find_targets()	2012-05-08 20:57:11 +02:00
Willy Tarreau	34db108423	MAJOR: acl: make use of the new argument parsing framework The ACL parser now uses the argument parser to build a typed argument list. Right now arguments are all strings and only one argument is supported since this is what ACLs currently support.	2012-05-08 20:57:11 +02:00
Willy Tarreau	2ac5718dbd	MEDIUM: add a new typed argument list parsing framework make_arg_list() builds an array of typed arguments with their values, that the caller describes how to parse. This will be used to support multiple arguments for ACLs and patterns, which is currently problematic and prevents ACLs and patterns from being merged. Up to 7 arguments types may be enumerated in a single 32-bit word, including their number of mandatory parts. At the moment, these files are not used yet, they're only built. Note that the 4-bit encoding for the type has left only one unused type!	2012-05-08 20:57:10 +02:00
Willy Tarreau	d53e2428d1	MEDIUM: http/acl: make acl_fetch_hdr_{ip,val} rely on acl_fetch_hdr() These two functions will now exploit the return of acl_fetch_hdr() instead of doing the same work again and again.	2012-05-08 20:57:10 +02:00
Willy Tarreau	e333ec9f24	MEDIUM: http/acl: merge all request and response ACL fetches of headers and cookies Latest changes have made it possible to remove all differences between request and response processing, making it worth merging request and response ACL fetch functions to reduce code size. Most likely with minor adaptation it will be possible to use the same hdr_* functions to match in the response path, and cook_* for the response cookie too.	2012-05-08 20:57:10 +02:00
Willy Tarreau	7744e0cf43	BUG/MINOR: http_auth: ACLs are volatile, not permanent ACLs are volatile since they require a fetch of request buffer data which is then copied to a temporary shared place. The issue is minor though since auth is generally checked very early.	2012-05-08 20:57:10 +02:00
Willy Tarreau	c0239e0425	MEDIUM: http: make all ACL fetch function use acl_prefetch_http() All ACLs which need to process HTTP contents first call this function which performs all the preliminary tests and also triggers the request parsing if needed. A macro was written to simplify the code. As a side effect, it's not required anymore to check for the HTTP ACL before checking for HTTP contents.	2012-05-08 20:57:10 +02:00
Willy Tarreau	14174bc4c0	MEDIUM: http: add a prefetch function for ACL pattern fetch This function will be called by all ACL fetch functions. Right now all ACL fetch functions have to perform the exact same tests to check whether data are available. Also, only one of them is able to actually parse an HTTP request. Using the prefetch function, it will be possible to try to parse a request on the fly and to avoid the fetch if some data are missing. This will significantly reduce the amount of tests in all ACL fetch functions.	2012-05-08 20:57:09 +02:00
Willy Tarreau	9dab5fc4d4	MEDIUM: buffers: rename a number of buffer management functions The following renaming took place : 1) buffer input functions buffer_put_block => bi_putblk buffer_put_char => bi_putchr buffer_put_string => bi_putstr buffer_put_chunk => bi_putchk buffer_feed => bi_putstr buffer_feed_chunk => bi_putchk buffer_cut_tail => bi_erase buffer_ignore => bi_fast_delete 2) buffer output functions buffer_get_char => bo_getchr buffer_get_line => bo_getline buffer_get_block => bo_getblk buffer_skip => bo_skip buffer_write => bo_inject 3) buffer input avail/full functions were introduced : bi_avail bi_full	2012-05-08 20:56:56 +02:00
Willy Tarreau	328582c3f9	MEDIUM: buffers: implement b_adv() to advance a buffer's pointer This is more convenient and efficient than buf->p = b_ptr(buf, n); It simply advances the buffer's pointer by <n> and trasfers that amount of bytes from <in> to <out>. The BF_OUT_EMPTY flag is updated accordingly. A few occurrences of such computations in buffers.c and stream_sock.c were updated to use b_adv(), which resulted in a small code shrink.	2012-05-08 12:28:14 +02:00
Willy Tarreau	cc5cfcbcce	MEDIUM: buffers: add new pointer wrappers and get rid of almost all buffer_wrap_add calls buffer_wrap_add was convenient for the migration but is not handy at all. Let's have new wrappers that report input begin/end and output begin/end instead. It looks like we'll also need a b_adv(ofs) to advance a buffer's pointer.	2012-05-08 12:28:14 +02:00
Willy Tarreau	ec1bc82a1d	MEDIUM: buffers: fix unsafe use of buffer_ignore at some places buffer_ignore may only be used when the output of a buffer is empty, but it's not granted it is always the case when sending HTTP error responses. Better use buffer_cut_tail() instead, and use buffer_ignore only on non-wrapping data.	2012-05-08 12:28:14 +02:00
Willy Tarreau	8b1323e4cb	MINOR: http: remove useless wrapping checks in http_msg_analyzer The message cannot wrap here.	2012-05-08 12:28:14 +02:00
Willy Tarreau	4baf44ba67	MEDIUM: http: remove buffer arg in chunk parsing functions The buffer pointer is now taken from the http_msg in the following functions : http_parse_chunk_size http_forward_trailers http_skip_chunk_crlf Most internal pointers were converted to const as the result of the operation.	2012-05-08 12:28:14 +02:00
Willy Tarreau	21710ffa35	MEDIUM: http: remove buffer arg in http_buffer_heavy_realign The buffer pointer is now taken from the http_msg. The function has also been renamed "http_message_realign".	2012-05-08 12:28:13 +02:00
Willy Tarreau	418bfcc794	MEDIUM: http: remove buffer arg in http_upgrade_v09_to_v10 The buffer and http_msg pointers are now taken from the transaction.	2012-05-08 12:28:13 +02:00
Willy Tarreau	a560c21f54	MEDIUM: http: remove buffer arg in http_msg_analyzer The buffer pointer is now taken from the http_msg.	2012-05-08 12:28:13 +02:00
Willy Tarreau	8a0cef2dad	MEDIUM: http: remove buffer arg in http_capture_bad_message The buffer pointer is now taken from the http_msg.	2012-05-08 12:28:13 +02:00
Willy Tarreau	6acf7c9179	MEDIUM: http: remove buffer arg in a few header manipulation functions The buffer pointer is now taken from the http_msg in the following functions : - http_remove_header2 - http_header_add_tail - http_header_add_tail2 - http_parse_connection_header - http_change_connection_header	2012-05-08 12:28:12 +02:00
Willy Tarreau	45c0d98769	MEDIUM: http: http_send_name_header: remove references to msg and buffer They can be deduced from txn.	2012-05-08 12:28:12 +02:00
Willy Tarreau	3a215bedba	MAJOR: http: make http_msg->sol relative to buffer's origin msg->sol is now a relative pointer just like all other ones. There is no more absolute references to the buffer outside the struct buffer itself. Next two cleanups should include removing buffer references to functions which already have an msg, and removal of wrapping detection in request and response parsing which cannot wrap by definition.	2012-05-08 12:28:12 +02:00
Willy Tarreau	62f791ea6f	MEDIUM: http: add a pointer to the buffer in http_msg ACLs and patterns only rely on a struct http_msg and don't know the pointer to the actual data. struct http_msg will soon only hold relative references so that's not possible. We need http_msg to hold a reference to the struct buffer before having relative pointers everywhere. It is likely that doing so will also result in opportunities to simplify a number of functions arguments. The following functions are already candidate : http_buffer_heavy_realign http_capture_bad_message http_change_connection_header http_forward_trailers http_header_add_tail http_header_add_tail2 http_msg_analyzer http_parse_chunk_size http_parse_connection_header http_remove_header2 http_send_name_header http_skip_chunk_crlf http_upgrade_v09_to_v10	2012-05-08 12:28:12 +02:00
Willy Tarreau	12e48b36dd	MAJOR: http: turn http_msg->eol to a buffer-relative offset It was an absolute pointer to the buffer's data, now it's a pointer relative to the buffer's origin.	2012-05-08 12:28:12 +02:00
Willy Tarreau	fa4a03ca08	CLEANUP: http: remove unused http_msg->col The <col> element of the struct http_msg has not been used for a long time now, remove it.	2012-05-08 12:28:11 +02:00
Willy Tarreau	ea1175a687	MAJOR: http: change msg->{som,col,sov,eoh} to be relative to buffer origin These offsets were relative to the buffer itself. Now they're relative to the buffer's origin (buf->p) which normally corresponds to the start of current message. This saves a big dependency between the HTTP message struct and the buffers. It appeared during this change that ->col is not used anymore (it will have to be removed). Next step is to turn ->eol and ->sol from absolute to relative.	2012-05-08 12:28:11 +02:00
Willy Tarreau	a458b67965	MAJOR: http: move buffer->lr to http_msg->next The buffer's pointer <lr> was only used by HTTP parsers which also use a struct http_msg to keep track of the parser's state. We've reached a point where it makes no sense to keep ->lr in the buffer, as the split between buffer and msg is only arbitrary for historical reasons. This change ensures that touching buffers will not impact HTTP messages anymore, making the buffers more content-agnostic. However, it becomes very important not to forget to update msg->next when some data get forwarded or moved (and in general each time buf->p is updated). The new pointer in http_msg becomes relative to buffer->p so that parsing multiple messages becomes easier. It is possible that at one point ->som and ->next will be merged. Note: http_parse_reqline() and http_parse_stsline() have been temporarily modified to know the message starting point in the buffer (->p).	2012-05-08 12:28:11 +02:00
Willy Tarreau	363a5bb152	MAJOR: buffers: replace buf->r with buf->p + buf->i This change gets rid of buf->r which is always equal to buf->p + buf->i. It removed some wrapping detection at a number of places, but required addition of new relative offset computations at other locations. A large number of places can be simplified now with extreme care, since most of the time, either the pointer has to be computed once or we need a difference between the old ->w and old ->r to compute free space. The cleanup will probably happen with the rewrite of the buffer_input_* and buffer_output_* functions anyway. buf->lr still has to move to the struct http_msg and be relative to buf->p for the rework to be complete.	2012-05-08 12:28:11 +02:00
Willy Tarreau	89fa706d39	MAJOR: buffers: replace buf->w with buf->p - buf->o This change introduces the buffer's base pointer, which is the limit between incoming and outgoing data. It's the point where the parsing should start from. A number of computations have already been greatly simplified, but more simplifications are expected to come from the removal of buf->r. The changes appear good and have revealed occasional improper use of some pointers. It is possible that this patch has introduced bugs or revealed some, although preliminary testings tend to indicate that everything still works as it should.	2012-05-08 12:28:10 +02:00
Willy Tarreau	02d6cfc1d7	MAJOR: buffer: replace buf->l with buf->{o+i} We don't have buf->l anymore. We have buf->i for pending data and the total length is retrieved by adding buf->o. Some computation already become simpler. Despite extreme care, bugs are not excluded. It's worth noting that msg->err_pos as set by HTTP request/response analysers becomes relative to pending data and not to the beginning of the buffer. This has not been completed yet so differences might occur when outgoing data are left in the buffer.	2012-05-08 12:28:10 +02:00
Willy Tarreau	2e046c6017	MAJOR: buffer rework: replace ->send_max with ->o This is the first minor step of the buffer rework. It's only renaming, it should have no impact.	2012-04-30 11:57:00 +02:00
Willy Tarreau	a36fc4d7ed	MEDIUM: move message-related flags from transaction to message Too many flags are stored in the transaction structure. Some flags are clearly message-specific and exist in two versions (request and response). Move them to a new "flags" field in the http_message struct instead.	2012-04-30 11:57:00 +02:00
Willy Tarreau	21337825c0	CLEANUP: remove a few warning about unchecked return values in debug code There were a few unchecked write() calls in the debug code that cause gcc 4.x to emit warnings on recent libc. We don't want to check them as we can't make anything from the result, let's simply surround them with an empty if statement. Note that one of the warnings was for chdir("/") which normally cannot fail since it follows a successful chroot (which means the perms are necessarily there). Anyway let's move the call uppe to protect it too.	2012-04-30 11:56:30 +02:00
Willy Tarreau	9a7bea52b1	MINOR: standard: add a memprintf() function to build formatted error messages memprintf() is just like snprintf() except that it always returns a properly sized allocated string that the caller is responsible for freeing. NULL is returned on serious errors. It also supports stackable calls over the same pointer since it offers support for automatically freeing a previous one : memprintf(&err, "invalid argument: '%s'", arg); ... memprintf(&err, "keyword parser said: <%s>", err); ... memprintf(&err, "line parser said: %s\n", err); ... free(*err);	2012-04-30 11:55:35 +02:00
Willy Tarreau	b56928a74c	CLEANUP: http: message parser must ignore HTTP_MSG_ERROR The issue only happens when DEBUG_FULL is enabled, which causes http_msg_analyzer() to complain if it's called twice with an invalid message, for instance because of two consecutive ACLs using req_proto_http. The code is commented out when DEBUG_FULL is disabled, so this is not a bug, just an annoyance for the developer.	2012-04-30 11:51:59 +02:00
Willy Tarreau	46787ed700	BUILD: http: stop gcc-4.1.2 from complaining about possibly uninitialized values The three warnings below are totally wrong since the variables depend on another one which is only turned on when the variables are initialized. Still this gcc-4.1.2 isn't able to see this and prefers to complain wrongly. So let's initialize the variables to shut it up since we're not in the fast path. src/proto_http.c: In function 'acl_fetch_any_cookie_cnt': src/proto_http.c:8393: warning: 'val_end' may be used uninitialized in this function src/proto_http.c: In function 'http_process_req_stat_post': src/proto_http.c:2577: warning: 'st_next_param' may be used uninitialized in this function src/proto_http.c:2577: warning: 'st_cur_param' may be used uninitialized in this function	2012-04-30 00:19:31 +02:00
Willy Tarreau	3fb818c014	BUILD: http: make extract_cookie_value() return an int not size_t It's very annoying that we have to deal with the crappy size_t and with ints at some places because these ones don't mix well. Patch 6f61b2 changed the chunk len to int but its size remains size_t and some functions are having trouble being used by several callers depending on the type of their arguments. Let's turn extract_cookie_value() to int for now on, and plan a massive cleanup later to remove all size_t.	2012-04-30 00:19:28 +02:00
Daniel Schultze	90690c7aca	MINOR: cli: display the 4 IP addresses and ports on "show sess XXX" I have modified dumpstats.c to show additional information for the show session <id> command on the statistics socket. This will dump the public, frontend, backend, and server ip/tcp addresses and port. We found it useful to have this information available in real time and could not find another way of getting this information.	2012-04-09 20:51:06 +02:00
Willy Tarreau	d017f113c0	BUG/MINOR: acl: req_ssl_sni would randomly fail if a session ID is present The wrong byte was checked for the session_id length in the payload. This used to work when the session ID was absent because zero was found there, but when a session ID is present, there is 1/256 chance that the inspected data contains 0x20 (the actual session ID length), so it fails. Thanks to Emmanuel B�zagu for reporting this bug. This bug does not need backporting, it is 1.5 specific.	2012-04-09 09:24:11 +02:00
Willy Tarreau	9b061e3320	MEDIUM: stream_sock: add a get_src and get_dst callback and remove SN_FRT_ADDR_SET These callbacks are used to retrieve the source and destination address of a socket. The address flags are not hold on the stream interface and not on the session anymore. The addresses are collected when needed. This still needs to be improved to store the IP and port separately so that it is not needed to perform a getsockname() when only the IP address is desired for outgoing traffic.	2012-04-07 18:03:52 +02:00
William Lallemand	5e19a2866f	MINOR: log: log-format: usable without httplog and tcplog Options httplog and tcplog aren't mandatory anymore for the log-format. The LW_ flags are now set during the log-format string parsing.	2012-04-07 16:25:26 +02:00
William Lallemand	a73203e3dc	MEDIUM: log: Unique ID The Unique ID, is an ID generated with several informations. You can use a log-format string to customize it, with the "unique-id-format" keyword, and insert it in the request header, with the "unique-id-header" keyword.	2012-04-07 16:25:26 +02:00
William Lallemand	5f2324019d	MEDIUM: log: New format-log flags: %Fi %Fp %Si %Sp %Ts %rt %H %pid %Fi: Frontend IP %Fp: Frontend Port %Si: Server IP %Sp: Server Port %Ts: Timestamp %rt: HTTP request counter %H: hostname %pid: PID +X: Hexadecimal represenation The +X mode in logformat displays hexadecimal for the following flags %Ci %Cp %Fi %Fp %Bi %Bp %Si %Sp %Ts %ct %pid rename logformat_write_string() to lf_text() Optimize size computation	2012-04-07 16:05:39 +02:00
William Lallemand	1d7055675e	MEDIUM: log: split of log_format generation * logformat functions now take a format linked list as argument * build_logline() build a logline using a format linked list * rename LOG_* by LOG_FMT_* in enum * improve error management in build_logline()	2012-04-07 16:05:02 +02:00
Aman Gupta	d94991d236	CLEANUP: Fix some minor whitespace issues	2012-04-07 09:56:14 +02:00
Aman Gupta	0bc0c2426c	MINOR: Add TO/FROM_SET flags to struct stream_interface [WT: it will make sense to remove SN_FRT_ADDR_SET and to use these flags everywhere instead ]	2012-04-07 09:17:26 +02:00
Willy Tarreau	64559c565f	CLEANUP: lb_first: add reference to a paper describing the original idea The original idea behind this implementation has been published in the paper below : http://reports-archive.adm.cs.cmu.edu/anon/2012/CMU-CS-12-109.pdf	2012-04-07 09:08:45 +02:00
Willy Tarreau	04aa6a9ce8	MEDIUM: http: add cookie and scookie ACLs The ACL matches rely on the extract_cookie_value() function as used for for patterns. This permits ACLs to match cookie values based on the cookie name instead of having to perform substring matching on the cookie header.	2012-04-07 08:47:26 +02:00
Willy Tarreau	4573af939c	MEDIUM: http: make extract_cookie_value() iterate over cookie values This will make the function usable for ACLs.	2012-04-06 18:20:06 +02:00
Willy Tarreau	c89ccb6221	MEDIUM: log: add a new cookie flag 'U' to report situations where cookie is not used This happens when a "use-server" rule sets the server instead.	2012-04-05 21:18:22 +02:00
Willy Tarreau	4a5cadea40	MEDIUM: session: implement the "use-server" directive Sometimes it is desirable to forward a particular request to a specific server without having to declare a dedicated backend for this server. This can be achieved using the "use-server" rules. These rules are evaluated after the "redirect" rules and before evaluating cookies, and they have precedence on them. There may be as many "use-server" rules as desired. All of these rules are evaluated in their declaration order, and the first one which matches will assign the server.	2012-04-05 21:14:10 +02:00
Aman Gupta	ceafb4aa92	CLEANUP: Fix some minor typos	2012-04-05 10:39:45 +02:00
Aman Gupta	9a13e84cc2	MINOR: Add release callback to si_applet	2012-04-05 10:39:20 +02:00
Cyril Bonté	19979e176e	MINOR: stats admin: reduce memcmp()/strcmp() calls on status codes memcmp()/strcmp() calls were needed in different parts of code to determine the status code. Each new status code introduces new calls, which can become inefficient and source of bugs. This patch reorganizes the code to rely on a numeric status code internally and to be hopefully more generic.	2012-04-05 09:58:27 +02:00
Cyril Bonté	aa0a45d2ed	MINOR: stats admin: use the backend id instead of its name in the form Proxy ids are unique whereas names can be used several times in the configuration. In order to prevent the ambiguity, the HTML form now provides the backend id instead of its name (the name can still be provided in the POST data).	2012-04-05 09:58:26 +02:00
Cyril Bonté	0bb519e6e5	CLEANUP: fix typo in findserver() log message There was a typo in the findserver() log message : "found" was written "fould".	2012-04-05 09:58:25 +02:00
Cyril Bonté	cf8d9ae3cd	MINOR: stats admin: allow unordered parameters in POST requests Previously, the stats admin page required POST parameters to be provided exactly in the same order as the HTML form. This patch allows to handle those parameters in any orders. Also, note that haproxy won't alter server states anymore if backend or server names are ambiguous (duplicated names in the configuration) to prevent unexpected results (the same should probably be applied to the stats socket).	2012-04-05 09:58:25 +02:00
Willy Tarreau	5dd7fa1f6b	BUG/MEDIUM: balance source did not properly hash IPv6 addresses The hash of IPv6 addresses was not properly aligned and resulted in the last quarter of the address not being hashed. In practice, this is rarely detected since MAC addresses are used in the second half. But this becomes very visible with IPv6-mapped IPv4 addresses such as ::FFFF:1.2.3.4 where the IPv4 part is never hashed. This bug has been there forever, since introduction of "balance source" in v1.2.11. The fix must then be backported to all stable versions. Thanks to Alex Markham for reporting this issue to the list !	2012-03-31 19:53:37 +02:00
William Lallemand	51b5dcae85	BUG/MAJOR: log: possible segfault with logformat Possible zero-pointer deference in sess_log(). Checks of return values in sess_log() fix the issue. Fix bad computation in logformat_write_string(). This issue is 1.5-specific and was introduced just before 1.5-dev8. No backport is needed.	2012-03-27 19:42:43 +02:00
Willy Tarreau	9eeb57bd7f	[RELEASE] Released version 1.5-dev8 Released version 1.5-dev8 with the following main changes : - MINOR: patch for minor typo (ressources/resources) - MEDIUM: http: add support for sending the server's name in the outgoing request - DOC: mention that default checks are TCP connections - BUG/MINOR: fix options forwardfor if-none when an alternative header name is specified - CLEANUP: Make check_statuses, analyze_statuses and process_chk static - CLEANUP: Fix HCHK spelling errors - BUG/MINOR: fix typo in processing of http-send-name-header - MEDIUM: log: Use linked lists for loggers - BUILD: fix declaration inside a scope block - REORG: log: split send_log function - MINOR: config: Parse the string of the log-format config keyword - MINOR: add ultoa, ulltoa, ltoa, lltoa implementations - MINOR: Date and time fonctions that don't use snprintf - MEDIUM: log: make http_sess_log use log_format - DOC: log-format documentation - MEDIUM: log: use log_format for mode tcplog - MEDIUM: log-format: backend source address %Bi %Bp - BUG/MINOR: log-format: fix %o flag - BUG/MEDIUM: bad length in log_format and __send_log - MINOR: logformat %st is signed - BUILD/MINOR: fix the source URL in the spec file - DOC: acl is http_first_req, not http_req_first - BUG/MEDIUM: don't trim last spaces from headers consisting only of spaces - MINOR: acl: add new matches for header/path/url length - BUILD: halog: make halog build on solaris - BUG/MINOR: don't use a wrong port when connecting to a server with mapped ports - MINOR: remove the client/server side distinction in SI addresses - MINOR: halog: add support for matching queued requests - DOC: indicate that cookie "prefix" and "indirect" should not be mixed - OPTIM/MINOR: move struct sockaddr_storage to the tail of structs - OPTIM/MINOR: make it possible to change pipe size (tune.pipesize) - BUILD/MINOR: silent a build warning in src/pipe.c (fcntl) - OPTIM/MINOR: move the hdr_idx pools out of the proxy struct - MEDIUM: tune.http.maxhdr makes it possible to configure the maximum number of HTTP headers - BUG/MINOR: fix a segfault when parsing a config with undeclared peers - CLEANUP: rename possibly confusing struct field "tracked" - BUG/MEDIUM: checks: fix slowstart behaviour when server tracking is in use - MINOR: config: tolerate server "cookie" setting in non-HTTP mode - MEDIUM: buffers: add some new primitives and rework existing ones - BUG: buffers: don't return a negative value on buffer_total_space_res() - MINOR: buffers: make buffer_pointer() support negative pointers too - CLEANUP: kill buffer_replace() and use an inline instead - BUG: tcp: option nolinger does not work on backends - CLEANUP: ebtree: remove a few annoying signedness warnings - CLEANUP: ebtree: clarify licence and update to 6.0.6 - CLEANUP: ebtree: remove 4-year old harmless typo in duplicates insertion code - CLEANUP: ebtree: remove another typo, a wrong initialization in insertion code - BUG: ebtree: ebst_lookup() could return the wrong entry - OPTIM: stream_sock: reduce the amount of in-flight spliced data - OPTIM: stream_sock: save a failed recv syscall when splice returns EAGAIN - MINOR: acl: add support for TLS server name matching using SNI - BUG: http: re-enable TCP quick-ack upon incomplete HTTP requests - BUG: proto_tcp: don't try to bind to a foreign address if sin_family is unknown - MINOR: pattern: export the global temporary pattern - CLEANUP: patterns: get rid of pattern_data_setstring() - MEDIUM: acl: use temp_pattern to store fetched information in the "method" match - MINOR: acl: include pattern.h to make pattern migration more transparent - MEDIUM: pattern: change the pattern data integer from unsigned to signed - MEDIUM: acl: use temp_pattern to store any integer-type information - MEDIUM: acl: use temp_pattern to store any address-type information - CLEANUP: acl: integer part of acl_test is not used anymore - MEDIUM: acl: use temp_pattern to store any string-type information - CLEANUP: acl: remove last data fields from the acl_test struct - MEDIUM: http: replace get_ip_from_hdr2() with http_get_hdr() - MEDIUM: patterns: the hdr() pattern is now of type string - DOC: add minimal documentation on how ACLs work internally - DOC: add a coding-style file - OPTIM: halog: keep a fast path for the lines-count only - CLEANUP: silence a warning when building on sparc - BUG: http: tighten the list of allowed characters in a URI - MEDIUM: http: block non-ASCII characters in URIs by default - DOC: add some documentation from RFC3986 about URI format - BUG/MINOR: cli: correctly remove the whole table on "clear table" - BUG/MEDIUM: correctly disable servers tracking another disabled servers. - BUG/MEDIUM: zero-weight servers must not dequeue requests from the backend - MINOR: halog: add some help on the command line - BUILD: fix build error on FreeBSD - BUG: fix double free in peers config error path - MEDIUM: improve config check return codes - BUILD: make it possible to look for pcre in the default system paths - MINOR: config: emit a warning when 'default_backend' masks servers - MINOR: backend: rework the LC definition to support other connection-based algos - MEDIUM: backend: add the 'first' balancing algorithm - BUG: fix httplog trailing LF - MEDIUM: increase chunk-size limit to 2GB-1 - BUG: queue: fix dequeueing sequence on HTTP keep-alive sessions - BUG: http: disable TCP delayed ACKs when forwarding content-length data - BUG: checks: fix server maintenance exit sequence - BUG/MINOR: stream_sock: don't remove BF_EXPECT_MORE and BF_SEND_DONTWAIT on partial writes - DOC: enumerate valid status codes for "observe layer7" - MINOR: buffer: switch a number of buffer args to const - CLEANUP: silence signedness warning in acl.c - BUG: stream_sock: si->release was not called upon shutw() - MINOR: log: use "%ts" to log term status only and "%tsc" to log with cookie - BUG/CRITICAL: log: fix risk of crash in development snapshot - BUG/MAJOR: possible crash when using capture headers on TCP frontends - MINOR: config: disable header captures in TCP mode and complain	2012-03-26 06:16:43 +02:00
Simon Horman	63a4a822c1	CLEANUP: Make check_statuses, analyze_statuses and process_chk static These symbols are only used inside src/checks.c	2012-03-24 21:54:19 +01:00
Willy Tarreau	9a54e13788	MINOR: config: disable header captures in TCP mode and complain In order to help users fix their configs, report a warning when a capture has been set on a non-HTTP frontend. This should be backported to 1.4.	2012-03-24 08:35:37 +01:00
Willy Tarreau	42f7d89156	BUG/MAJOR: possible crash when using capture headers on TCP frontends Olufemi Omojola provided a config and a core showing a possible crash when captures are configured on a TCP-mode frontend which branches to an HTTP backend. The reason is that being in TCP mode, the frontend does not allocate capture pools for the request, but the HTTP backend tries to use them and dies on the NULL. While such a config has long been unlikely to happen, it looks like people using websocket tend to do this more often now. Change the control to use the pointer instead of the number of captures to know when to log. This bug was reported in 1.4.20, so it must be backported there.	2012-03-24 08:35:36 +01:00
William Lallemand	7f25debbd2	MINOR: logformat %st is signed replace ultoa by ltoa for HTTP status code (can be -1)	2012-03-22 17:23:23 +01:00
Adrian Bridgett	afdb6e57f7	MINOR: patch for minor typo (ressources/resources) The main stats page says "ressources" (French spelling) rather than "resources" (English spelling). One little patch attached (against v1.4.20). Many thanks, Adrian	2012-03-21 07:54:41 +01:00
William Lallemand	bfb099c3b3	BUG/MEDIUM: bad length in log_format and __send_log __send_log(): the size of the buffer sent is wrong when the facility is lower than 3 digits. logformat_write_string(): computation of size is wrong Note: this was introduced after 1.5-dev7, no backport needed.	2012-03-19 17:15:13 +01:00
Willy Tarreau	b1a2faf7c9	BUG/CRITICAL: log: fix risk of crash in development snapshot Commit a1cc38 introduced a regression which was easy to trigger till `ad4cd58` (snapshots 20120222 to 20120311 included). The bug was still present after that but harder to trigger. The bug is caused by the use of two distinct log buffers due to intermediary changes. The issue happens when an HTTP request is logged just after a TCP request during the same second and the HTTP request is too large for the buffer. In this case, it happens that the HTTP request is logged into the TCP buffer instead and that length controls can't detect anything. Starting with bddd4f, the issue is still possible when logging too large an HTTP request just after a send_log() call (typically a server status change). We owe a big thanks to Sander Klein for testing several snapshots and more specifically for taking significant risks in production by letting the buggy version crash several times in order to provide an exploitable core ! The bug could not have been found without this precious help. Thank you Sander ! This fix does not need to be backported, it did not affect any released version.	2012-03-19 17:09:30 +01:00
Willy Tarreau	6580c06ba3	MINOR: log: use "%ts" to log term status only and "%tsc" to log with cookie The difference could be seen when logging a request in HTTP mode with option tcplog, as it would keep emitting 4 chars. Better use two distinct flags to clear the confusion.	2012-03-12 15:50:53 +01:00
William Lallemand	81f5117a24	BUG/MINOR: log-format: fix %o flag The %o flag was not working at all.	2012-03-12 15:50:53 +01:00
William Lallemand	b7ff6a3a36	MEDIUM: log-format: backend source address %Bi %Bp %Bi return the backend source IP %Bp return the backend source port Add a function pointer in logformat_type to do additional configuration during the log-format variable parsing.	2012-03-12 15:50:52 +01:00
William Lallemand	bddd4fd93b	MEDIUM: log: use log_format for mode tcplog Merge http_sess_log() and tcp_sess_log() to sess_log() and move it to log.c A new field in logformat_type define if you can use a logformat variable in TCP or HTTP mode. doc: log-format in tcp mode Note that due to the way log buffer allocation currently works, trying to log an HTTP request without "option httplog" is still not possible. This will change in the near future.	2012-03-12 15:47:13 +01:00
Willy Tarreau	ad4cd58986	BUG: stream_sock: si->release was not called upon shutw() The ->release function of the stream interface is never called upon a shutw() because it's placed after a return statement. It is possible that it has impacted inter-process stick-table replication by preventing a full resync after certain sequences of connection breakage. Since this bug has been present since the introduction of the ->release() callback, it cannot have caused regressions, just possibly non-working situations. This was detected at Exceliance by Emeric Brun during a code review. It is 1.5-specific.	2012-03-10 13:42:32 +01:00
Willy Tarreau	62e7c7146e	CLEANUP: silence signedness warning in acl.c The recent SNI patch introduced a trivial warning in acl.c.	2012-03-10 09:05:30 +01:00
Willy Tarreau	f17810e4fa	BUG/MINOR: stream_sock: don't remove BF_EXPECT_MORE and BF_SEND_DONTWAIT on partial writes The flags are one-shot but should be maintained over all send() operations as long as send_max is not flushed. The flags were incidentely cleared once a complete send() was performed, regardless of the fact that the send() might have been on the first half of a buffer before a wrapping. The result is that on wrapping data (eg: which happens often with chunked encoding), many incomplete segments are transmitted instead of being aggregated. The fix consists in only flushing the flags only once send_max is empty, which was the expected behaviour. This fix should be backported to 1.4 though it is not critical, just sub-optimal.	2012-03-09 18:10:44 +01:00
Willy Tarreau	4544678490	BUG: checks: fix server maintenance exit sequence Recent commit 62c3be broke maintenance mode by fixing srv_is_usable(). Enabling a disabled server would not re-introduce it into the farm. The reason is that in set_server_up(), the SRV_MAINTAIN flag is still present when recounting the servers. The flag was removed late only to adjust a log message. Keep a copy of the old flag instead and update SRV_MAINTAIN earlier. This fix must also be backported to 1.4 (but no release got the regression).	2012-03-09 17:19:43 +01:00
Willy Tarreau	869fc1edc2	BUG: http: disable TCP delayed ACKs when forwarding content-length data Commits 5c6209 and 072930 were aimed at avoiding undesirable PUSH flags when forwarding chunked data, but had the undesired effect of causing data advertised by content-length to be affected by the delayed ACK too. This can happen when the data to be forwarded are small enough to fit into a single send() call, otherwise the BF_EXPECT_MORE flag would be removed. Content-length data don't need the BF_EXPECT_MORE flag since the low-level forwarder already knows it can safely rely on bf->to_forward to set the appropriate TCP flags. Note that the issue is only observed in requests at the moment, though the later introduction of server-side keep-alive could trigger the issue on the response path too. Special thanks to Randy Shults for reporting this issue with a lot of details helping to reproduce it. The fix must be backported to 1.4.	2012-03-05 08:46:34 +01:00
Willy Tarreau	2d5cd479bc	BUG: queue: fix dequeueing sequence on HTTP keep-alive sessions When a request completes on a server and the server connection is closed while the client connection stays open, the HTTP engine releases all server connection slots and scans the queues to offer the connection slot to another pending request. An issue happens when the released connection allows other requests to be dequeued : may_dequeue_tasks() relies on srv->served which is only decremented by sess_change_server() which itself is only called after may_dequeue_tasks(). This results in no connection being woken up until another connection terminates so that may_dequeue_tasks() is called again. This fix is minimalist and only moves sess_change_server() earlier (which is safe). It should be reworked and the code factored out so that the same occurrence in session.c shares the same code. This bug has been there since the introduction of option-http-server-close and the fix must be backported to 1.4.	2012-03-01 23:49:20 +01:00
Willy Tarreau	431946e961	MEDIUM: increase chunk-size limit to 2GB-1 Since commit `115acb97`, chunk size was limited to 256MB. There is no reason for such a limit and the comment on the code suggests a missing zero. However, increasing the limit past 2 GB causes trouble due to some 32-bit subtracts in various computations becoming negative (eg: buffer_max_len). So let's limit the chunk size to 2 GB - 1 max.	2012-02-27 09:51:52 +01:00
Willy Tarreau	53bf6af3f9	BUG: fix httplog trailing LF commit `a1cc3811` introduced an undesirable \0\n ending on HTTP log messages. This is because of an extra character count passed to __send_log() which causes the LF to be appended past the \0. Some syslog daemons thus log an extra empty line. The fix is obvious. Fix the function comments to remind what they expect on their input. This is past 1.5-dev7 regression so there's no backport needed.	2012-02-24 11:48:42 +01:00
Willy Tarreau	f09c6603d3	MEDIUM: backend: add the 'first' balancing algorithm The principle behind this load balancing algorithm was first imagined and modeled by Steen Larsen then iteratively refined through several work sessions until it would totally address its original goal. The purpose of this algorithm is to always use the smallest number of servers so that extra servers can be powered off during non-intensive hours. Additional tools may be used to do that work, possibly by locally monitoring the servers' activity. The first server with available connection slots receives the connection. The servers are choosen from the lowest numeric identifier to the highest (see server parameter "id"), which defaults to the server's position in the farm. Once a server reaches its maxconn value, the next server is used. It does not make sense to use this algorithm without setting maxconn. Note that it can however make sense to use minconn so that servers are not used at full load before starting new servers, and so that introduction of new servers requires a progressively increasing load (the number of servers would more or less follow the square root of the load until maxconn is reached). This algorithm ignores the server weight, and is more beneficial to long sessions such as RDP or IMAP than HTTP, though it can be useful there too.	2012-02-21 22:27:27 +01:00
Willy Tarreau	3ebb1163ba	MINOR: backend: rework the LC definition to support other connection-based algos The leastconn algorithm should be of kind "connection-based", not "leastconn" if we want to later support other connection-based LB algos.	2012-02-13 17:02:31 +01:00
Willy Tarreau	ff67813f58	MINOR: config: emit a warning when 'default_backend' masks servers When a "listen" instance uses a "default_backned" rule and has servers, the servers will never be used. Report it so that users don't get trapped.	2012-02-13 14:32:34 +01:00
William Lallemand	a1cc381151	MEDIUM: log: make http_sess_log use log_format http_sess_log now use the logformat linked list to make the log string, snprintf is not used for speed issue. CLF mode also uses logformat. NOTE: as of now, empty fields in CLF now are "" not "-" anymore.	2012-02-09 17:03:28 +01:00
William Lallemand	421f5b5882	MINOR: Date and time fonctions that don't use snprintf Also move human_time() to standard.c since it's not related to timeval calculations.	2012-02-09 17:03:28 +01:00
William Lallemand	e7340ec111	MINOR: add ultoa, ulltoa, ltoa, lltoa implementations Implementations that write result from left to right	2012-02-09 17:03:28 +01:00
William Lallemand	723b73ad75	MINOR: config: Parse the string of the log-format config keyword parse_logformat_string: parse the string, detect the type: text, separator or variable parse_logformat_var: dectect variable name parse_logformat_var_args: parse arguments and flags add_to_logformat_list: add to the logformat linked list	2012-02-09 17:03:24 +01:00
William Lallemand	2a4a44f0f9	REORG: log: split send_log function send_log function is now splited in 3 functions * hdr_log: generate the syslog header * send_log: send a syslog message with a printf format string * __send_log: send a syslog message	2012-02-09 15:54:43 +01:00
William Lallemand	d9e9066e71	BUILD: fix declaration inside a scope block	2012-02-06 09:46:16 +01:00
Willy Tarreau	8b15ba19c3	MEDIUM: improve config check return codes When checking a configuration file using "-c -f xxx", sometimes it is reported that a config is valid while it will later fail (eg: no enabled listener). Instead, let's improve the return values : - return 0 if config is 100% OK - return 1 if config has errors - return 2 if config is OK but no listener nor peer is enabled	2012-02-02 17:53:37 +01:00
Willy Tarreau	6f9b003c2b	BUG: fix double free in peers config error path If the local host is not found as a peer in a "peers" section, we have a double free, and possibly a use-after-free because the peers section is freed since it's aliased as the table's name.	2012-02-02 17:53:37 +01:00
Willy Tarreau	b05405a3a8	BUILD: fix build error on FreeBSD Marcello Gorlani reported that commit `5e205524ad` (BUG: http: re-enable TCP quick-ack upon incomplete HTTP requests) broke build on FreeBSD. Moving the include lower fixes the issue. This must be backported to 1.4 too.	2012-01-23 15:35:52 +01:00
Willy Tarreau	f8e8b76ed3	BUG/MEDIUM: zero-weight servers must not dequeue requests from the backend It was reported that a server configured with a zero weight would sometimes still take connections from the backend queue. This issue is real, it happens this way : 1) the disabled server accepts a request with a cookie 2) many cookie-less requests accumulate in the backend queue 3) when the disabled server completes its request, it checks its own queue and the backend's queue 4) the server takes a pending request from the backend queue and processes it. In response, the server's cookie is assigned to the client, which ensures that some requests will continue to be served by this server, leading back to point 1 above. The fix consists in preventing a zero-weight server from dequeuing pending requests from the backend. Making use of srv_is_usable() in such tests makes the tests more robust against future changes. This fix must be backported to 1.4 and 1.3.	2012-01-20 16:18:53 +01:00
Willy Tarreau	62c3be28ed	BUG/MEDIUM: correctly disable servers tracking another disabled servers. In a config where server "s1" is marked disabled and "s2" tracks "s1", s2 appears disabled on the stats but is still inserted into the LB farm because the tracking is resolved too late in the configuration process. We now resolve tracked servers before building LB maps and we also mark the tracking server in maintenance mode, which previously was not done, causing half of the issue. Last point is that we also protect srv_is_usable() against electing a server marked for maintenance. This is not absolutely needed but is a safe choice and makes a lot of sense. This fix must be backported to 1.4.	2012-01-20 16:18:30 +01:00
Stathis Voukelatos	09a030a9a4	BUG/MINOR: fix typo in processing of http-send-name-header I downloaded version 1.4.19 this morning. While merging the code changes to a custom build that we have here for our project I noticed a typo in 'session.c', in the new code for inserting the server name in the HTTP header. The fix that I did is shown in the patch below. [WT: the bug is harmless, it is only suboptimal]	2012-01-09 14:27:13 +01:00
Willy Tarreau	8fa52f4e0e	BUG/MINOR: cli: correctly remove the whole table on "clear table" Joe Price reported that "clear table xxx" sent on the CLI would only clear the last entry. This is true, some code was missing to remove an entry from within the loop, and only the final condition was able to remove an entry. The fix is obvious. No backport is needed.	2012-01-09 11:53:09 +01:00
Willy Tarreau	422246eb26	MEDIUM: http: block non-ASCII characters in URIs by default These ones are invalid and blocked unless "option accept-invalid-http-request" is specified in the frontend. In any case, the faulty request is logged. Note that some of the remaining invalid chars are still not checked against, those are the invalid ones between 32 and 127 : 34 ('"'), 60 ('<'), 62 ('>'), 92 ('\'), 94 ('^'), 96 ('`'), 123 ('{'), 124 ('\|'), 125 ('}') Using a lookup table might be better at some point.	2012-01-07 23:55:20 +01:00
Willy Tarreau	2e9506d771	BUG: http: tighten the list of allowed characters in a URI The HTTP request parser was considering that any non-LWS char was par of the URI. Unfortunately, this allows control chars to be sent in the URI, sometimes resulting in backend servers misbehaving, for instance when they interprete \0 as an end of string and respond with plain HTTP/0.9 without headers, that haproxy blocks as invalid responses. RFC3986 clearly states the list of allowed characters in a URI. Even non-ASCII chars are not allowed. Unfortunately, after having run 10 years with these chars allowed, we can't block them right now without an optional workaround. So the first step consists in only blocking control chars. A later patch will allow non-ASCII only when an appropriate option is enabled in the frontend. Control chars are 0..31 and 127, with the exception of 9, 10 and 13 (\t, \n, \r).	2012-01-07 23:22:31 +01:00
Willy Tarreau	7b77c9fd6d	CLEANUP: silence a warning when building on sparc On Solaris/sparc, getpid() returns pid_t which is not an int : src/peers.c: In function `peer_io_handler': src/peers.c:508: warning: int format, pid_t arg (arg 6)	2012-01-07 22:52:12 +01:00
Mark Lamourine	c2247f0b8d	MEDIUM: http: add support for sending the server's name in the outgoing request New option "http-send-name-header" specifies the name of a header which will hold the server name in outgoing requests. This is the name of the server the connection is really sent to, which means that upon redispatches, the header's value is updated so that it always matches the server's name.	2012-01-05 15:17:31 +01:00
Willy Tarreau	e428fb7b4e	MEDIUM: patterns: the hdr() pattern is now of type string This pattern previously was limited to type IP. With the new header extraction function, it becomes possible to extract strings, so that the header can be returned as a string. This will not change anything to existing configs, as string will automatically be converted to IP when needed. However, new configs will be able to use IPv6 addresses from headers in stick-tables, as well as stick on any non-IP header (eg: host, user-agent, ...).	2011-12-30 17:33:27 +01:00
Willy Tarreau	294c473756	MEDIUM: http: replace get_ip_from_hdr2() with http_get_hdr() The new function does not return IP addresses but header values instead, so that the caller is free to make what it want of them. The conversion is not quite clean yet, as the previous test which considered that address 0.0.0.0 meant "no address" is still used. A different IP parsing function should be used to take this into account.	2011-12-30 17:33:26 +01:00
Willy Tarreau	664092ccc1	MEDIUM: acl: use temp_pattern to store any string-type information Now strings and data blocks are stored in the temp_pattern's chunk and matched against this one. The rdp_cookie currently makes extensive use of acl_fetch_rdp_cookie() and will be a good candidate for the initial rework so that ACLs use the patterns framework and not the other way around.	2011-12-30 17:33:26 +01:00
Willy Tarreau	f4362b3e3b	MEDIUM: acl: use temp_pattern to store any address-type information IPv4 and IPv6 addresses are now stored into temp_pattern instead of the dirty hack consisting into storing them into the consumer's target address. Some refactoring should now be possible since the methods used to fetch source and destination addresses are similar between patterns and ACLs.	2011-12-30 17:33:26 +01:00
Willy Tarreau	a5e375646c	MEDIUM: acl: use temp_pattern to store any integer-type information All ACL fetches which return integer value now store the result into the temporary pattern struct. All ACL matches which rely on integer also get their value there. Note: the pattern data types are not set right now.	2011-12-30 17:33:26 +01:00
Willy Tarreau	8e5e955c50	MEDIUM: acl: use temp_pattern to store fetched information in the "method" match This match was using both the int and ptr part of the acl_test struct. Let's change this to be able to store it into a chunk with a special encoding.	2011-12-30 17:33:25 +01:00
Willy Tarreau	1ded605ad5	CLEANUP: patterns: get rid of pattern_data_setstring() This function was only used to call chunk_init_len() from another chunk, which in the end consists in simply assigning the source chunk to the destination chunk. Let's remove this indirection to make the code clearer. Anyway it was the only place such a function was used.	2011-12-30 17:33:25 +01:00
Willy Tarreau	5e6cc4aad8	MINOR: pattern: export the global temporary pattern The global pattern is used for pattern conversions. Export it under the name "temp_pattern" so that it can later be used by ACLs.	2011-12-30 17:33:25 +01:00
Willy Tarreau	5dc1e98905	BUG: proto_tcp: don't try to bind to a foreign address if sin_family is unknown This is 1.5-specific. It causes issues with transparent source binding involving hdr_ip. We must not try to bind() to a foreign address when the family is not set, and we must set the family when an address is set.	2011-12-30 17:33:24 +01:00
Willy Tarreau	5e205524ad	BUG: http: re-enable TCP quick-ack upon incomplete HTTP requests By default we disable TCP quick-acking on HTTP requests so that we avoid sending a pure ACK immediately followed by the HTTP response. However, if the client sends an incomplete request in a short packet, its TCP stack might wait for this packet to be ACKed before sending the rest of the request, delaying incoming requests by up to 40-200ms. We can detect this undesirable situation when parsing the request : - if an incomplete request is received - if a full request is received and uses chunked encoding or advertises a content-length larger than the data available in the buffer In these situations, we re-enable TCP quick-ack if we had previously disabled it.	2011-12-17 16:45:29 +01:00
Willy Tarreau	b6672b547a	MINOR: acl: add support for TLS server name matching using SNI Server Name Indication (SNI) is a TLS extension which makes a client present the name of the server it is connecting to in the client hello. It allows a transparent proxy to take a decision based on the beginning of an SSL/TLS stream without deciphering it. The new ACL "req_ssl_sni" matches the name extracted from the TLS handshake against a list of names which may be loaded from a file if needed.	2011-12-12 17:26:23 +01:00
Willy Tarreau	82a04566ec	OPTIM: stream_sock: save a failed recv syscall when splice returns EAGAIN When splice() returns EAGAIN, on old kernels it could be caused by a read shutdown which was not detected. Due to this behaviour, we had to fall back to recv(), which in turn says if it's a real EAGAIN or a shutdown. Since this behaviour was fixed in 2.6.27.14, on more recent kernels we'd prefer to avoid the fallback to recv() when possible. For this, we set a variable the first time splice() detects a shutdown, to indicate that it works. We can then rely on this variable to adjust our behaviour. Doing this alone increased the overall performance by about 1% on medium sized objects.	2011-12-12 00:03:55 +01:00
Willy Tarreau	eb9fd5178e	OPTIM: stream_sock: reduce the amount of in-flight spliced data First, it's a waste not to call chk_snd() when spliced data are available, because the pipe can almost always be transferred into the outgoing socket buffers. Starting from now, when we splice data in, we immediately try to send them. This results in less pipes used, and possibly less kernel memory in use at once. Second, if a pipe cannot be transferred into the outgoing socket buffers, it means this buffer is full. There's no point trying again then, as space will almost never be available, resulting in a useless syscall returning EAGAIN.	2011-12-12 00:03:55 +01:00
Willy Tarreau	f6f8225390	BUG: tcp: option nolinger does not work on backends Daniel Rankov reported that "option nolinger" is inefficient on backends. The reason is that it is set on the file descriptor only, which does not prevent haproxy from performing a clean shutdown() before closing. We must set the flag on the stream_interface instead if we want an RST to be emitted upon active close.	2011-11-30 18:06:23 +01:00
Willy Tarreau	19ae56b2b6	CLEANUP: kill buffer_replace() and use an inline instead This function is never used, only its buffer_replace2() alternative is used. Replace the former with an inline which calls the later.	2011-11-28 21:01:28 +01:00
Willy Tarreau	4b517ca93a	MEDIUM: buffers: add some new primitives and rework existing ones A number of primitives were missing for buffer management, and some of them were particularly awkward to use. Specifically, the functions used to compute free space could not always be used depending what was wrapping in the buffers. Some documentation has been added about how the buffers work and their properties. Some functions are still missing such as a buffer replacement which would support wrapping buffers.	2011-11-25 21:57:29 +01:00
William Lallemand	0f99e34978	MEDIUM: log: Use linked lists for loggers This patch settles the 2 loggers limitation. Loggers are now stored in linked lists. Using "global log", the global loggers list content is added at the end of the current proxy list. Each "log" entries are added at the end of the proxy list. "no log" flush a logger list.	2011-10-31 14:09:19 +01:00
Willy Tarreau	0cec331a0e	MINOR: config: tolerate server "cookie" setting in non-HTTP mode Up to now, if a cookie value was specified on a server when the proxy was in TCP mode, it would cause a fatal error. Now we only report a warning, since the cookie will be ignored. This makes it easier to generate configs from scripts.	2011-10-31 14:09:13 +01:00
Willy Tarreau	2e99390faf	BUG/MEDIUM: checks: fix slowstart behaviour when server tracking is in use Ludovic Levesque reported and diagnosed an annoying bug. When a server is configured to track another one and has a slowstart interval set, it's assigned a minimal weight when the tracked server goes back up but keeps this weight forever. This is because the throttling during the warmup phase is only computed in the health checking function. After several attempts to resolve the issue, the only real solution is to split the check processing task in two tasks, one for the checks and one for the warmup. Each server with a slowstart setting has a warmum task which is responsible for updating the server's weight after a down to up transition. The task does not run in othe situations. In the end, the fix is neither complex nor long and should be backported to 1.4 since the issue was detected there first.	2011-10-31 11:53:20 +01:00
Willy Tarreau	4426770013	CLEANUP: rename possibly confusing struct field "tracked" When reading the code, the "tracked" member of a server makes one think the server is tracked while it's the opposite, it's a pointer to the server being tracked. This is particularly true in constructs such as : if (srv->tracked) { Since it's the second time I get caught misunderstanding it, let's rename it to "track" to avoid the confusion.	2011-10-28 15:35:33 +02:00
Willy Tarreau	d66bf96d5b	BUG/MINOR: fix a segfault when parsing a config with undeclared peers Baptiste Assmann reported that a config where a non-existing peers section is referenced by a stick-table causes a segfault after displaying the error. This is caused by the freeing of the peers. Setting it to NULL after displaying the error fixes the issue.	2011-10-28 14:16:49 +02:00
Willy Tarreau	ac1932da3e	MEDIUM: tune.http.maxhdr makes it possible to configure the maximum number of HTTP headers For a long time, the max number of headers was taken as a part of the buffer size. Since the header size can be configured at runtime, it does not make much sense anymore. Nothing was making it necessary to have a static value, so let's turn this into a tunable with a default value of 101 which equals what was previously used.	2011-10-24 19:14:41 +02:00
Willy Tarreau	34eb671f24	OPTIM/MINOR: move the hdr_idx pools out of the proxy struct It makes no sense to have one pointer to the hdr_idx pool in each proxy struct since these pools do not depend on the proxy. Let's have a common pool instead as it is already the case for other types.	2011-10-24 18:15:04 +02:00
Willy Tarreau	9ed560e964	BUILD/MINOR: silent a build warning in src/pipe.c (fcntl)	2011-10-24 17:09:22 +02:00
Willy Tarreau	bd9a0a7781	OPTIM/MINOR: make it possible to change pipe size (tune.pipesize) By default, pipes are the default size for the system. But sometimes when using TCP splicing, it can improve performance to increase pipe sizes, especially if it is suspected that pipes are not filled and that many calls to splice() are performed. This has an impact on the kernel's memory footprint, so this must not be changed if impacts are not understood.	2011-10-23 21:15:38 +02:00
Sagi Bashari	1611e2d4a1	BUG/MINOR: fix options forwardfor if-none when an alternative header name is specified	2011-10-09 08:10:30 +02:00
Willy Tarreau	6471afb43d	MINOR: remove the client/server side distinction in SI addresses Stream interfaces used to distinguish between client and server addresses because they were previously of different types (sockaddr_storage for the client, sockaddr_in for the server). This is not the case anymore, and this distinction is confusing at best and has caused a number of regressions to be introduced in the process of converting everything to full-ipv6. We can now remove this and have a much cleaner code.	2011-09-23 10:54:59 +02:00
Willy Tarreau	dd164d0240	BUG/MINOR: don't use a wrong port when connecting to a server with mapped ports Nick Chalk reported that a connection to a server which has no port specified used twice the port number. The reason is that the port number was taken from the wrong part of the address, the client's destination address was used as the base port instead of the server's configured address. Thanks to Nick for his helpful diagnostic.	2011-09-23 10:27:12 +02:00
Willy Tarreau	0e69854ed4	MINOR: acl: add new matches for header/path/url length This patch introduces hdr_len, path_len and url_len for matching these respective parts lengths against integers. This can be used to detect abuse or empty headers.	2011-09-16 08:32:32 +02:00
Willy Tarreau	275600b6c7	BUG/MEDIUM: don't trim last spaces from headers consisting only of spaces Commit 588bd4 fixed header parsing so that trailing spaces were not part of the returned string. Unfortunately, if a header only had spaces, the last spaces were trimmed past the beginning of the value, causing a negative length to be returned. A quick code review shows that there should be no impact since the only places where the vlen is used are either compared to a specific value or with explicit contents (eg: digits). This must be backported to 1.4.	2011-09-16 08:11:26 +02:00
Willy Tarreau	eabea0763b	[MINOR] stats: report the number of requests intercepted by the frontend These requests are mainly monitor requests, as well as stats requests when the stats are processed by the frontend. Having this counter helps explain the difference in number of sessions that is sometimes observed between a frontend and a backend.	2011-09-10 23:32:41 +02:00
Willy Tarreau	69c0117ac2	[BUILD] stats: stdint is not present on solaris It was added with commit `cec9a227` to use uint32_t, though it does not exist on solaris 8 but is not needed either.	2011-09-10 20:38:15 +02:00
Willy Tarreau	98c6121ee5	[OPTIM] task: don't scan the run queue if we know it's empty It happens quite often in fact, so let's save those precious cycles.	2011-09-10 20:08:49 +02:00
Willy Tarreau	576132e533	[MINOR] startup: add an option to change to a new directory Passing -C <dir> causes haproxy to chdir to <dir> before loading any file. The argument may be passed anywhere on the command line. A typical use case is : $ haproxy -C /etc/haproxy -f global.cfg -f haproxy.cfg	2011-09-10 19:26:56 +02:00
Willy Tarreau	3bafcdc07e	[CLEANUP] startup: report only the basename in the usage message Don't write the full path to the program, just the program name.	2011-09-10 19:20:23 +02:00
Willy Tarreau	45a1251515	[MEDIUM] poll: add a measurement of idle vs work time We now measure the work and idle times in order to report the idle time in the stats. It's expected that we'll be able to use it at other places later.	2011-09-10 18:01:41 +02:00
Finn Arne Gangstad	e8c7ecc2dd	[MINOR] http: _dom matching header functions now also split on ":" _dom is mostly used for matching Host headers, and host headers may include port numbers. To avoid having to create multiple rules with and without :<port-number> in hdr_dom rules, change the *_dom matching functions to also handle : as a delimiter. Typically there are rules like this in haproxy.cfg: acl is_foo hdr_dom(host) www.foo.com Most clients send "Host: www.foo.com" in their HTTP header, but some send "Host: www.foo.com:80" (which is allowed), and the above rule will now work for those clients as well. [Note: patch was edited before merge, any unexpected bug is mine /willy]	2011-09-09 16:10:12 +02:00
Willy Tarreau	b0f7532a30	[MINOR] frontend: ensure debug message length is always initialized If the socket family ever changes from AF_INET*/AF_UNIX, we'd have a problem.	2011-09-09 11:21:06 +02:00
Willy Tarreau	52b2d228ed	[MEDIUM] stats: offer the possibility to kill sessions by server It's now possible to issue "shutdown sessions server <back/srv>" and have all this server's sessions immediately killed.	2011-09-07 23:56:16 +02:00
Willy Tarreau	d52c41ea2d	[CLEANUP] stats: centralize tests for backend/server inputs on the CLI The tests were repeated many times. Let's put them at one single place.	2011-09-07 23:56:16 +02:00
Willy Tarreau	a295edc51c	[MEDIUM] stats: offer the possibility to kill a session from the CLI It's now possible to issue "shutdown session 0xXXXXXXXX" and have this session immediately killed. Useful for long-running fantoms.	2011-09-07 23:56:16 +02:00
Willy Tarreau	a2a64e9689	[MEDIUM] session: make session_shutdown() an independant function We already had the ability to kill a connection, but it was only for the checks. Now we can do this for any session, and for this we add a specific flag "K" to the logs.	2011-09-07 23:01:56 +02:00
Willy Tarreau	532a450ebc	[MEDIUM] stats: add the ability to enable/disable/shutdown a frontend at runtime The stats socket now allows the admin to disable, enable or shutdown a frontend. This can be used when a bug is discovered in a configuration and it's desirable to fix it but the rules in place don't allow to change a running config. Thus it becomes possible to kill the frontend to release the port and start a new one in a separate process. This can also be used to temporarily make haproxy return TCP resets to incoming requests to pretend the service is not bound. For instance, this may be useful to quickly flush a very deep SYN backlog. The frontend check and lookup code was factored with the "set maxconn" usage.	2011-09-07 22:50:52 +02:00
Willy Tarreau	c03ebbfca4	[BUG] peers: ensure the peers are resumed if they were paused Upon an incoming soft restart request, we first pause all frontends and peers. If the caller changes its mind and asks us to resume (eg: failed binding), we must resume all the frontends and peers. Unfortunately the peers were not resumed. The code was arranged to avoid code duplication (which used to hide the issue till now).	2011-09-07 22:47:43 +02:00
Willy Tarreau	122541c06a	[BUG] peers: don't keep a peers section which has a NULL frontend If a peers section has no peer named as the local peer, we must destroy it, otherwise a NULL peer frontend remains in the lists and a segfault can happen upon a soft restart. We also now report the missing peer name in order to help troubleshooting.	2011-09-07 22:47:43 +02:00
Willy Tarreau	ce8fe259b5	[CLEANUP] proxy: make pause_proxy() perform the required controls and emit the logs It avoids duplicated code in the caller.	2011-09-07 22:47:43 +02:00
Willy Tarreau	b249e8454c	[BUG] peers: the peer frontend must not emit any log Peers' frontends must have logging disabled by default, which was not the case, so logs were randomly emitted upon restart, sometimes causing a new process to fail to replace the old one.	2011-09-07 22:47:43 +02:00
Willy Tarreau	3c63fd828a	[MEDIUM] don't limit peers nor stats socket to maxconn nor maxconnrate The peers and the stats socket are control sockets, they must not be limited by traffic rules.	2011-09-07 22:47:42 +02:00
Willy Tarreau	3ae65a16b9	[BUG] peers: don't pre-allocate 65000 connections to each peer This made sense a long time ago but since the maxconn is dynamically computed from the tracking tables, it does not make any sense anymore and will harm future changes.	2011-09-07 22:47:42 +02:00
Willy Tarreau	f5b22875cd	[MEDIUM] stats: add the ability to adjust the global maxconnrate Using "set rate-limit connections global <xxx>" on the CLI, we can now adjust the per-process connection rate limiting (equal to global.maxconnrate).	2011-09-07 22:47:42 +02:00
Willy Tarreau	9cd552d8f4	[MINOR] stats: report the current and max global connection rates The HTML page reports the current process connection rate, and the "show info" command on the stats socket also reports the conn rate limit and the max conn rate that was once reached. Note that the max value can be cleared using "clear counters".	2011-09-07 22:47:42 +02:00
Willy Tarreau	81c25d0ee6	[MEDIUM] add support for global.maxconnrate to limit the per-process conn rate. This one enforces a per-process connection rate limit, regardless of what may be set per frontend. It can be a way to limit the CPU usage of a process being severely attacked. The side effect is that the global process connection rate is now measured for each incoming connection, so it will be possible to report it.	2011-09-07 22:47:42 +02:00
Willy Tarreau	91886b692a	[MEDIUM] stats: add the "set maxconn" setting to the command line interface This option permits to change the global maxconn setting within the limit that was set by the initial value, which is now reported as the hard maxconn value. This allows to immediately accept more concurrent connections or to stop accepting new ones until the value passes below the indicated setting. The main use of this option is on systems where many haproxy instances are loaded and admins need to re-adjust resource sharing at run time to regain a bit of fairness between processes.	2011-09-07 22:47:41 +02:00
Willy Tarreau	abacc2cfd1	[CLEANUP] remove a useless test in manage_global_listener_queue() The test for the empty list was done twice.	2011-09-07 18:09:27 +02:00
Willy Tarreau	c2adf8b906	[MEDIUM] stats: disable complex socket reservation for stats socket The way the unix socket is initialized is awkward. Some of the settings are put in the sockets itself, other ones in the backend. And more importantly the global.maxsock value is adjusted so that the stats socket evades the global maxconn value. This complexifies maxsock computations for nothing, since the stats socket is not supposed to receive hundreds of concurrent connections when the global maxconn is very low. What is needed however is to ensure that there are always connections left for the stats socket even when traffic sockets are saturated, but this guarantee is not offered anymore by current code. So as of now, the stats socket is subject to the global maxconn limitation just as any other socket until a reservation mechanism is implemented.	2011-09-07 18:05:48 +02:00
Willy Tarreau	46fa8355c0	[CLEANUP] remove dirty left-over of a debugging message This debug message was added in commit `e9b2602a` and not noticed once committed.	2011-09-07 11:55:40 +02:00
Willy Tarreau	b48f958e05	[CLEANUP] cfgparse: fix reported options for the "bind" keyword	2011-09-05 01:17:06 +02:00
Willy Tarreau	ad14f753ea	[MINOR] http: take a capture of bad content-lengths. Sometimes a bad content-length header is encountered and this causes an abort. It's hard to debug without a trace, so let's take a capture of the contents when this happens.	2011-09-05 00:54:57 +02:00
Willy Tarreau	3b8c08a174	[MINOR] http: take a capture of truncated responses If a server starts to respond but stops before the body, then we capture the truncated response. We don't do this on the request because it would happen too often upon stupid attacks.	2011-09-05 00:54:56 +02:00
Willy Tarreau	fec4d89b24	[MINOR] http: take a capture of too large requests and responses It's hard to prove a request or response is too large if there is no capture, so let's take a snapshot of those too.	2011-09-05 00:54:56 +02:00
Willy Tarreau	509433391a	[MINOR] stats: display "<NONE>" instead of the frontend name when unknown "show sess" should display "<NONE>" instead of the frontend's name as the backend's.	2011-09-05 00:54:56 +02:00
Willy Tarreau	588bd4f813	[BUG] http: trailing white spaces must also be trimmed after headers Trailing spaces after headers were not trimmed, only the leading ones were. An issue was detected today with a content-length value which was padded with spaces and which was rejected. Recent updates to the http-bis draft made it a lot more clear that such spaces must be ignored, so this is what this patch does. It should be backported to 1.4.	2011-09-05 00:54:56 +02:00
Willy Tarreau	631f01c2f1	[MINOR] make use of addr_to_str() and get_host_port() to replace many inet_ntop() Many inet_ntop calls were partially right, which was hard to detect given the complex combinations. Some of them were relying on the listener's proto instead of the address itself, which could have been different when dealing with an accept-proxy connection. The new addr_to_str() function does the dirty job and returns the family, which makes it particularly suited to calls from switch/case statements. A large number of if/else statements were removed and the stats output could even be cleaned up in the case of session dump. As a side effect of doing this, the resulting code is smaller by almost 1kB. All changed parts have been tested and provided expected output.	2011-09-05 00:54:36 +02:00
Willy Tarreau	86ad42c5b7	[MINOR] make use of set_host_port() and get_host_port() to get rid of family mismatches This also simplifies the code and makes it more auditable.	2011-09-05 00:54:35 +02:00
Willy Tarreau	542a31d6c3	[BUG] backend: risk of picking a wrong port when mapping is used with crossed families A similar issue as the previous one causes port mapping to fail in some combinations of client and server address families. Using the macros fixes the issue.	2011-08-27 12:07:49 +02:00
Willy Tarreau	48da04a6af	[BUG] checks: use the correct destination port for sending checks In the number of switch/case statements added for IPv6 changes, one was wrong and caused the check port to be ignored for outgoing connection because the socket's family was not taken at the right place. Use the set_host_port() macro instead to fix the issue. The same cleanup could be performed at a number of other places and should follow shortly. Special thanks to Stephane Bakhos of Techboom for reporting a detailed analysis of this bug.	2011-08-27 11:51:36 +02:00
Willy Tarreau	e17a8d02d9	[BUG] possible crash in 'show table' on stats socket Patch `d5b9fd95` was missing an initialisation of "ctx.table.target", which caused "show table" to segfault if it was issued after a "show errors" (target pointer == -1).	2011-08-24 08:23:34 +02:00
Willy Tarreau	c9ebc446b8	[CLEANUP] update the year in the copyright banner It was still 2010 !	2011-08-23 00:23:54 +02:00
Willy Tarreau	43d8fb2d3a	[REORG] build: move syscall redefinition to specific places Some older libc don't define splice() and and don't define _syscall() either, which causes build errors if splicing is enabled. To solve this, we now split the syscall redefinition into two layers : - one file per syscall (epoll, splice) - one common file to declare the _syscall() macros The code is cleaner because files using the syscalls just have to include their respective file. It's not adviced to merge multiple syscall families into a same file if all are not intended to be used simultaneously, because defining unused static functions causes warnings to be emitted during build. As a result, the new USE_MY_SPLICE parameter was added in order to be able to define the splice() syscall separately.	2011-08-23 00:11:25 +02:00
Willy Tarreau	87cf51406c	[MEDIUM] http: make x-forwarded-for addition conditional If "option forwardfor" has the "if-none" argument, then the header is only added when the request did not already have one. This option has security implications, and should not be set blindly.	2011-08-19 22:57:24 +02:00
Willy Tarreau	1ee51a6581	[BUG] check: http-check expect + regex would crash in defaults section Manoj Kumar reported a case where haproxy would crash upon start-up. The cause was an "http-check expect" statement declared in the defaults section, which caused a NULL regex to be used during the check. This statement is not allowed in defaults sections precisely because this requires saving a copy of the regex in the default proxy. But the check was not made to prevent it from being declared there, hence the issue. Instead of adding code to detect its abnormal use, we decided to implement it. It was not that much complex because the expect_str part was not used with regexes, so it could hold the string form of the regex in order to compile it again for every backend (there's no way to clone regexes). This patch has been tested and works. So it's both a bugfix and a minor feature enhancement. It should be backported to 1.4 though it's not critical since the config was not supposed to be supported.	2011-08-19 20:14:01 +02:00
Simon Horman	8effd3de5b	[MINOR] Use DPRINTF in assign_server() Use DPRINTF in assign_server() rather than open-coding its logic.	2011-08-18 23:52:36 +02:00
Simon Horman	7abd00d7eb	[MINOR] Fix build error in stream_int_register_handler() There is no parameter or variable fct in stream_int_register_handler() so the build fails when DPRINTF is active.	2011-08-18 23:52:36 +02:00
Simon Horman	d281eedc07	[MEDIUM] Correct ipmask() logic The netmask applied to table entries as configured using ipmask() is stored in arg_p->data.ip not arg_i (which will be 1 if the netmask is set).	2011-08-18 23:52:35 +02:00
Simon Horman	8b7b05a92d	[MEDIUM] Fix stick-table replication on soft-restart "[MINOR] session: add a pointer to the new target into the session" (`664beb8`) introduced a regression by changing the type of a peer's target from TARG_TYPE_PROXY to TARG_TYPE_NONE. The effect of this is that during a soft-restart the new process no longer tries to connect to the old process to replicate its stick tables. This patch sets the type of a peer's target as TARG_TYPE_PROXY and replication on soft-restart works once again.	2011-08-18 23:52:35 +02:00
Willy Tarreau	f73cd1198f	[MINOR] session-counters: add the ability to clear the counters Sometimes it can be useful to reset a counter : one condition increments it and another one resets it. It can be used to better detect abuses.	2011-08-13 01:45:16 +02:00
Willy Tarreau	1620ec39a7	[MEDIUM] checks: group health checks methods by values and save option bits Adding health checks has become a real pain, with cross-references to all checks everywhere because they're all a single bit. Since they're all exclusive, let's change this to have a check number only. We reserve 4 bits allowing up to 16 checks (15+tcp), only 7 of which are currently used. The code has shrunk by almost 1kB and we saved a few option bits. The "dispatch" option has been moved to px->options, making a few tests a bit cleaner.	2011-08-06 17:08:40 +02:00
Herv� COMMOWICK	ec032d63a6	[MINOR] check: add redis check support This patch provides a new "option redis-check" statement to enable server health checks based on redis PING request (http://www.redis.io/commands/ping).	2011-08-06 15:52:47 +02:00
Herv� COMMOWICK	daa824e513	[MINOR] acl: add srv_conn acl to count connections on a specific backend server These ACLs are used to check the number of active connections on the specified server in the specified backend.	2011-08-06 15:52:27 +02:00
Willy Tarreau	2a0f4d27a4	[MEDIUM] stats: add support for changing frontend's maxconn at runtime The new "set maxconn frontend XXX" statement on the stats socket allows the admin to change a frontend's maxconn value. If some connections are queued, they will immediately be accepted up to the new limit. If the limit is lowered, new connections acceptation might be delayed. This can be used to temporarily reduce or increase the impact of a specific frontend's traffic on the whole process.	2011-08-02 11:49:05 +02:00
Willy Tarreau	bc216c4ad0	[MINOR] proxy: make findproxy() return proxies from numeric IDs too Sometimes it's useful to be able to search a proxy by its numeric ID, so let's add support for names such as #<id>.	2011-08-02 11:25:54 +02:00
Willy Tarreau	e9b2602ac5	[MEDIUM] listeners: add a global listener management task This global task is used to periodically check for end of resource shortage and to try to enable queued listeners again. This is important in case some temporary system-wide shortage is encountered, so that we don't have to wait for an existing connection to be released before checking the queue again. For situations where listeners are queued due to the global maxconn being reached, the task is woken up at least every second. For situations where a system resource shortage is detected (memory, sockets, ...) the task is woken up at least every 100 ms. That way, recovery from severe events can still be achieved under acceptable conditions.	2011-08-01 20:57:55 +02:00
Willy Tarreau	237250cc0d	[BUG] proxy: stats frontend and peers were missing many initializers This was revealed with one of the very latest patches which caused the listener_queue not to be initialized on the stats socket frontend. And in fact a number of other ones were missing too. This is getting so boring that now we'll always make use of the same function to initialize any proxy. Doing so has even saved about 500 bytes on the binary due to the avoided code redundancy. No backport is needed.	2011-07-29 02:00:19 +02:00
Willy Tarreau	918ff608f8	[MAJOR] proxy: finally get rid of maintain_proxies() This function is finally not needed anymore, as it has been replaced with a per-proxy task that is scheduled when some limits are encountered on incoming connections or when the process is stopping. The savings should be noticeable on configs with a large number of proxies. The most important point is that the rate limiting is now enforced in a clean and solid way.	2011-07-25 16:33:49 +02:00
Willy Tarreau	d634e7c673	[CLEANUP] proxy: merge maintain_proxies() operation inside a single loop This will help transforming the processing into per-proxy tasks.	2011-07-25 11:54:17 +02:00
Willy Tarreau	bbe11b1e3c	[BUG] proxy: peers must only be stopped once, not upon every call to maintain_proxies Peers were stopped on every call to maintain_proxies when stopping=1, while they should only be stopped once upon call to soft_stop(). This bug has little impact, mostly increased CPU usage. It's not needed to backport it.	2011-07-25 11:16:24 +02:00
Willy Tarreau	b32907b6c7	[MINOR] sessions: only wake waiting listeners up if rate limit is OK Instead of waking a listener up then making it sleep, we only wake them up if we know their rate limit is fine. In the future we could improve on top of that by deciding to wake a proxy-specific task in XX milliseconds to take care of enabling the listeners again.	2011-07-25 08:37:44 +02:00
Willy Tarreau	d408bd40f3	[MINOR] proxy: make session rate-limit more accurate Patch `d9bbe17b` used to limit the rate-limit to off-by-one to avoid a busy loop when the limit is reached. Now that the listeners are automatically disabled and queued when a limit is reached, we don't need this workaround anymore and can bring back the most accurate computation.	2011-07-25 08:30:51 +02:00
Willy Tarreau	a17c2d9361	[MINOR] stats: report a "WAITING" state for sockets waiting for resource This is useful when enabling socket-stats to know that a socket is being waiting for some resource (RAM, global connections, etc...).	2011-07-25 08:18:47 +02:00
Willy Tarreau	562515cac1	[CLEANUP] proxy: rename a few proxy states (PR_STIDLE and PR_STRUN) Those states have been replaced with PR_STFULL and PR_STREADY respectively, as it is what matches them the best now. Also, two occurrences of PR_STIDLE in peers.c have been removed as this did not provide any form of error recovery anyway.	2011-07-25 08:11:52 +02:00
Willy Tarreau	f3f8c70bd6	[MEDIUM] listeners: don't change listeners states anymore in maintain_proxies Now maintain_proxies() only changes proxies states and does not affect their listeners anymore since they are autonomous. A proxy will switch between the PR_STIDLE and PR_STRUN states depending whether it's saturated or not. Next step will consist in renaming PR_STIDLE to PR_STFULL. This state is now only used to report the proxy state in the stats.	2011-07-25 07:37:28 +02:00
Willy Tarreau	2242649b3a	[MEDIUM] listeners: don't stop proxies when global maxconn is reached Now we don't have to stop proxies anymore since their listeners will be queued if they attempt to accept a connection past the global limits.	2011-07-25 07:08:45 +02:00
Willy Tarreau	07687c171e	[MEDIUM] listeners: queue proxy-bound listeners at the proxy's All listeners that are limited by a proxy-specific resource are now queued at the proxy's and not globally. This allows finer-grained wakeups when releasing resource.	2011-07-24 23:55:06 +02:00
Willy Tarreau	08ceb1012b	[MEDIUM] listeners: put listeners in queue upon resource shortage When an accept() fails because of a connection limit or a memory shortage, we now disable it and queue it so that it's dequeued only when a connection is released. This has improved the behaviour of the process near the fd limit as now a listener with a no connection (eg: stats) will not loop forever trying to get its connection accepted. The solution is still not 100% perfect, as we'd like to have this used when proxy limits are reached (use a per-proxy list) and for safety, we'd need to have dedicated tasks to periodically re-enable them (eg: to overcome temporary system-wide resource limitations when no connection is released).	2011-07-24 22:58:00 +02:00
Willy Tarreau	e6ca1fcd84	[MINOR] listeners: add support for queueing resource limited listeners When a listeners encounters a resource shortage, it currently stops until one re-enables it. This is far from being perfect as it does not yet handle the case where the single connection from the listener is rejected (eg: the stats page). Now we'll have a special status for resource limited listeners and we'll queue them into one or multiple lists. That way, each time we have to stop a listener because of a resource shortage, we can enqueue it and change its state, so that it is dequeued once more resources are available. This patch currently does not change any existing behaviour, it only adds the basic building blocks for doing that.	2011-07-24 22:03:52 +02:00
Willy Tarreau	627937158f	[MINOR] listeners: add listen_full() to mark a listener full This is just a cleanup which removes calls to EV_FD_CLR() and state setting everywhere in the code.	2011-07-24 19:25:28 +02:00
Willy Tarreau	ff45b8ccc6	[BUG] stream_sock: ensure orphan listeners don't accept too many connections For listeners that are not bound to a frontend, the limit on the number of accepted connections is tested at the end of the accept() loop, but we don't break out of the loop, meaning that if more connections than what the listener allows are available and if this is less than the proxy's limits and within the size of a batch, then they could be accepted. In practice, this problem currently cannot appear since all listeners are bound to a frontend, and it's a very minor issue anyway. 1.4 has the same issue (which cannot happen there either), but there is some code after it, so it's the code cleanup which revealed it.	2011-07-24 19:16:52 +02:00
Willy Tarreau	be58c38264	[MEDIUM] proxy: add a PAUSED state to listeners and move socket tricks out of proxy.c Managing listeners state is difficult because they have their own state and can at the same time have theirs dictated by their proxy. The pause is not done properly, as the proxy code is fiddling with sockets. By introducing new functions such as pause_listener()/resume_listener(), we make it a bit more obvious how/when they're supposed to be used. The listen_proxies() function was also renamed to resume_proxies() since it's only used for pause/resume. This patch is the first in a series aiming at getting rid of the maintain_proxies mess. In the end, proxies should not call enable_listener()/disable_listener() anymore.	2011-07-24 19:09:37 +02:00
Willy Tarreau	100298749b	[BUG] stream_sock: disable listener when system resources are exhausted When an accept() returns -1 ENFILE du to system limits, it leaves the connection pending in the backlog and epoll() comes back immediately afterwards trying to make it accept it again. This causes haproxy to remain at 100% CPU until something makes an accept() possible again. Now upon such resource shortage, we mark the listener FULL so that we only enable it again once at least one connection has been released. In fact we only do that if there are some active connections on this proxy, so that it has a chance to be marked not full again. This makes haproxy remain idle when all resources are used, which helps a lot releasing those resource as fast as possible. Backport to 1.4 might be desirable but difficult and tricky.	2011-07-24 16:16:14 +02:00
Willy Tarreau	4827fd2a7e	[OPTIM] stream_sock: reduce the default number of accepted connections at once By default on a single process, we accept 100 connections at once. This is too much on recent CPUs where the cache is constantly thrashing, because we visit all those connections several times. We should batch the processing slightly less so that all the accepted session may remain in cache during their initial processing. Lowering the batch size from 100 to 32 has changed the connection rate for concurrencies between 5-10k from 67 kcps to 94 kcps on a Core i5 660 (4M L3), and forward rates from 30k to 39.5k. Tests on this hardware show that values between 10 and 30 seem to do the job fine.	2011-07-24 16:12:27 +02:00
Willy Tarreau	2b15492a75	[MINOR] session: try to emit a 500 response on memory allocation errors When we fail to create a session because of memory shortage, let's at least try to send a 500 message directly on the socket. Even if we don't have any buffers left, the kernel's orphans management will take care of delivering the message as long as there are socket buffers left.	2011-07-24 16:12:25 +02:00
Willy Tarreau	9bd0d744ef	[BUG] session: risk of crash on out of memory (1.5-dev regression) Patch af5149 introduced an issue which can be detected only on out of memory conditions : a LIST_DEL() may be performed on an uninitialized struct member instead of a LIST_INIT() during the accept() phase, causing crashes and memory corruption to occur. This issue was detected and diagnosed by the Exceliance R&D team. This is 1.5-specific and very recent, so no existing deployment should be impacted.	2011-07-20 00:22:54 +02:00
Simon Horman	6fb8259014	[MINOR] Free stick rules on denint() The motivation for this is that when soft-restart is merged it will be come more important to free all relevant memory in deinit() Discovered using valgrind.	2011-07-18 10:21:24 +02:00
Simon Horman	b08584ac71	[MINOR] Free stick table pool on denint() The motivation for this is that when soft-restart is merged it will be come more important to free all relevant memory in deinit() Discovered using valgrind.	2011-07-18 10:21:24 +02:00
Simon Horman	ac8214260e	[MINOR] Free tcp rules on denint() The motivation for this is that when soft-restart is merged it will be come more important to free all relevant memory in deinit() Discovered using valgrind.	2011-07-18 10:21:23 +02:00
Simon Horman	a31c7f716b	[MINOR] Free rdp_cookie_name on denint() The motivation for this is that when soft-restart is merged it will be come more important to free all relevant memory in deinit() Discovered using valgrind.	2011-07-18 10:21:23 +02:00
Simon Horman	5e55f5dadc	[MINOR] Consistently free expr on error in cfg_parse_listen() It seems to me that without this change cfg_parse_listen() may leak memory.	2011-07-18 10:21:23 +02:00
Simon Horman	6c54d8b63b	[MINOR] Consistently use error in tcp_parse_tcp_req() It seems to me that without this change tcp_parse_tcp_req() may leak memory.	2011-07-18 10:21:23 +02:00
Willy Tarreau	b3eb221e78	[MEDIUM] http: add support for 'cookie' and 'set-cookie' patterns This is used to perform cookie-based stickiness with table replication between multiple masters and across restarts. This partially overrides some of the appsession capabilities.	2011-07-01 16:16:17 +02:00
Simon Horman	fa46168c8f	[MINOR] Add non-stick server option Never add connections allocated to this sever to a stick-table. This may be used in conjunction with backup to ensure that stick-table persistence is disabled for backup servers.	2011-06-25 21:14:17 +02:00
Simon Horman	de072bd8ff	[CLEANUP] Remove unnecessary casts There is no need to cast when going to or from void *	2011-06-25 21:14:10 +02:00
Simon Horman	ab814e0a6b	[MINOR] Add rdp_cookie pattern fetch function This pattern fetch function extracts the value of the rdp cookie <name> as a string and uses this value to match. This enables implementation of persistence based on the mstshash cookie. This is typically done if there is no msts cookie present. This differs from "balance rdp-cookie" in that any balancing algorithm may be used and thus the distribution of clients to backend servers is not linked to a hash of the RDP cookie. It is envisaged that using a balancing algorithm such as "balance roundrobin" or "balance leastconnect" will lead to a more even distribution of clients to backend servers than the hash used by "balance rdp-cookie". Example : listen tse-farm bind 0.0.0.0:3389 # wait up to 5s for an RDP cookie in the request tcp-request inspect-delay 5s tcp-request content accept if RDP_COOKIE # apply RDP cookie persistence persist rdp-cookie # Persist based on the mstshash cookie # This is only useful makes sense if # balance rdp-cookie is not used stick-table type string size 204800 stick on rdp_cookie(mstshash) server srv1 1.1.1.1:3389 server srv1 1.1.1.2:3389	2011-06-25 21:07:02 +02:00
Simon Horman	e869176486	[MINOR] Make appsess{,ion}_refresh static apsession_refresh() and apsess_refressh are only used inside apsession.c and thus can be made static. The only use of apsession_refresh() is appsession_task_init(). These functions have been re-ordered to avoid the need for a forward-declaration of apsession_refresh().	2011-06-25 21:07:01 +02:00
Simon Horman	752dc4ab2d	[MINOR] Add down termination condition If a connection is closed by because the backend became unavailable then log 'D' as the termination condition. Signed-off-by: Simon Horman <horms@verge.net.au>	2011-06-21 22:10:56 +02:00
Simon Horman	e0d1bfb4c1	[MINOR] Allow shutdown of sessions when a server becomes unavailable This adds the "on-marked-down shutdown-sessions" statement on "server" lines, which causes all sessions established on a server to be killed at once when the server goes down. The task's priority is reniced to the highest value (1024) so that servers holding many tasks don't cause a massive slowdown due to the wakeup storm.	2011-06-21 22:00:21 +02:00
Simon Horman	af51495397	[MINOR] Add active connection list to server The motivation for this is to allow iteration of all the connections of a server without the expense of iterating over the global list of connections. The first use of this will be to implement an option to close connections associated with a server when is is marked as being down or in maintenance mode.	2011-06-21 22:00:12 +02:00
Simon Horman	dec5be4ed4	[CLEANUP] session.c: Make functions static where possible	2011-06-18 20:27:19 +02:00
Simon Horman	96553775a0	[CLEANUP] peers.h: fix declarations * The declaration of peer_session_create() does not match its definition. As it is only used inside of peers.c make it static. * Make the declaration of peers_register_table() match its definition. * Also, make all functions in peers.c that are not also in peers.h static	2011-06-18 20:27:19 +02:00
Simon Horman	70735c98f7	[CLEANUP] Remove assigned but unused variables gcc (Debian 4.6.0-2) 4.6.1 20110329 (prerelease) Copyright (C) 2011 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. ... src/proto_http.c:3029:14: warning: variable ‘del_cl’ set but not used [-Wunused-but-set-variable] In file included from ebtree/eb64tree.c:23:0: ebtree/eb64tree.h: In function ‘__eb64_lookup’: ebtree/eb64tree.h:128:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] ebtree/eb64tree.h: In function ‘__eb64i_lookup’: ebtree/eb64tree.h:180:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] In file included from ebtree/ebpttree.h:26:0, from ebtree/ebimtree.c:23: ebtree/eb64tree.h: In function ‘__eb64_lookup’: ebtree/eb64tree.h:128:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] ebtree/eb64tree.h: In function ‘__eb64i_lookup’: ebtree/eb64tree.h:180:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] In file included from ebtree/ebpttree.h:26:0, from ebtree/ebistree.h:25, from ebtree/ebistree.c:23: ebtree/eb64tree.h: In function ‘__eb64_lookup’: ebtree/eb64tree.h:128:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable] ebtree/eb64tree.h: In function ‘__eb64i_lookup’: ebtree/eb64tree.h:180:6: warning: variable ‘node_bit’ set but not used [-Wunused-but-set-variable]	2011-06-18 20:21:33 +02:00
Simon Horman	619e3cc245	[MINOR] Allow showing and clearing by key of string stick tables	2011-06-17 11:39:30 +02:00
Simon Horman	cec9a22780	[MINOR] Allow showing and clearing by key of integer stick tables	2011-06-17 11:39:30 +02:00
Simon Horman	c5b89f6495	[MINOR] Allow showing and clearing by key of ipv6 stick tables	2011-06-17 11:39:30 +02:00
Simon Horman	c88b887d8d	[MINOR] More flexible clearing of stick table * Allow clearing of all entries of a table * Allow clearing of all entries of a table that match a data filter	2011-06-17 11:39:29 +02:00
Simon Horman	d5b9fd9591	[MINOR] Break out all stick table socat command parsing This will allow reuse for clearing table entries other than by key	2011-06-17 11:39:29 +02:00
Simon Horman	17bce34a20	[MINOR] Allow listing of stick table by key	2011-06-17 11:39:29 +02:00
Simon Horman	121f305f3b	[MINOR] Break out processing of clear table This will allow the code to be reused for showing a table filtered by key	2011-06-17 11:39:29 +02:00
Simon Horman	d936658b7b	[MINOR] Break out dumping table This will allow the code to be reused for showing a table filtered by key Signed-off-by: Simon Horman <horms@verge.net.au>	2011-06-17 11:39:28 +02:00
Simon Horman	9bd2c73916	[CLEANUP] dumpstats: make symbols static where possible	2011-06-17 11:39:28 +02:00
Herv� COMMOWICK	212f778d6a	[BUG] checks: fix support of Mysqld >= 5.5 for mysql-check mysqld >= 5.5 want the client to announce 4.1+ authentication support, even if we have no password, so we do this. I also check on a debian potato mysqld 3.22 and it works too so i assume we are good from 3.22 to 5.5. [WT: this must be backported to 1.4]	2011-06-17 11:18:52 +02:00
Willy Tarreau	ab1a3e97a4	[CLEANUP] config: remove some left-over printf debugging code from previous patch Last patch fbb784 unexpectedly left some debugging printf messages which can be seen in debug mode.	2011-06-14 07:49:12 +02:00
Willy Tarreau	fbb78421d4	[MINOR] config: automatically compute a default fullconn value The fullconn value is not easy to get right when doing dynamic regulation, as it should depend on the maxconns of the frontends that can reach a backend. Since the parameter is mandatory, many configs are found with an inappropriate default value. Instead of rejecting configs without a fullconn value, we now set it to 10% of the sum of the configured maxconns of all the frontends which are susceptible to branch to the backend. That way if new frontends are added, the backend's fullconn automatically adjusts itself.	2011-06-05 15:43:27 +02:00
Willy Tarreau	bf9c2fcd93	[BUG] stats: support url-encoded forms Bashkim Kasa reported that the stats admin page did not work when colons were used in server or backend names. This was caused by url-encoding resulting in ':' being sent as '%3A'. Now we systematically decode the field names and values to fix this issue.	2011-05-31 22:44:28 +02:00
Willy Tarreau	44702af019	[MINOR] config: make it possible to specify a cookie even without a server Since version 1.0.0, it's forbidden to have a cookie specified without at least one server. This test is useless and makes it complex to write APIs to iteratively generate working configurations. Remove the test.	2011-05-31 06:44:04 +02:00
Willy Tarreau	14acc7072e	[OPTIM] stream_sock: don't use splice on too small payloads It's more expensive to call splice() on short payloads than to use recv()+send(). One of the reasons is that doing a splice() involves allocating a pipe. One other reason is that the kernel will have to copy itself if we try to splice less than a page. So let's fix a short offset of 4kB below which we don't splice. A quick test shows that on chunked encoded data, with splice we had 6826 syscalls (1715 splice, 3461 recv, 1650 send) while with this patch, the same transfer resulted in 5793 syscalls (3896 recv, 1897 send).	2011-05-30 18:42:41 +02:00
Willy Tarreau	22be90b8db	[OPTIM] stream_sock: avoid fast-forwarding of partial data Fast-forwarding between file descriptors is nice but can be counter-productive when only one part of the buffer is forwarded, because it can result in doubling the number of send() syscalls. This is what happens on HTTP chunking, because the chunk data are sent, then the CRLF + next chunk size are parsed and immediately scheduled for forwarding. This results in two send() for the same block while a single one would have done it.	2011-05-30 18:42:41 +02:00
Willy Tarreau	0729303fb0	[OPTIM] http: optimize chunking again in non-interactive mode Now that we support the http-no-delay mode, we can optimize HTTP chunking again by always waiting for more data to come until the last chunk is met. This patch may or may not be backported to 1.4, it's not a big deal, it will mainly help for chunks which are aligned with the buffer size.	2011-05-30 18:42:41 +02:00
Willy Tarreau	96e312139a	[MEDIUM] http: add support for "http-no-delay" There are some very rare server-to-server applications that abuse the HTTP protocol and expect the payload phase to be highly interactive, with many interleaved data chunks in both directions within a single request. This is absolutely not supported by the HTTP specification and will not work across most proxies or servers. When such applications attempt to do this through haproxy, it works but they will experience high delays due to the network optimizations which favor performance by instructing the system to wait for enough data to be available in order to only send full packets. Typical delays are around 200 ms per round trip. Note that this only happens with abnormal uses. Normal uses such as CONNECT requests nor WebSockets are not affected. When "option http-no-delay" is present in either the frontend or the backend used by a connection, all such optimizations will be disabled in order to make the exchanges as fast as possible. Of course this offers no guarantee on the functionality, as it may break at any other place. But if it works via HAProxy, it will work as fast as possible. This option should never be used by default, and should never be used at all unless such a buggy application is discovered. The impact of using this option is an increase of bandwidth usage and CPU usage, which may significantly lower performance in high latency environments. This change should be backported to 1.4 since the first report of such a misuse was in 1.4. Next patch will also be needed.	2011-05-30 18:42:41 +02:00
Willy Tarreau	f8ca19bcd9	[CLEANUP] stream_sock: remove unneeded FL_TCP and factor out test The FL_TCP flag was a leftover from the old days we were using TCP_CORK. With MSG_MORE it's not needed anymore so we can remove the condition and sensibly simplify the test.	2011-05-30 18:42:40 +02:00
Willy Tarreau	8f8b492295	[MINOR] stream_sock: always clear BF_EXPECT_MORE upon complete transfer When sending is complete, it's preferred to systematically clear the flags that were set for that transfer. What could happen is that the to_forward counter had caused the MSG_MORE flag to be set and BF_EXPECT_MORE not to be cleared, resulting in this flag being unexpectedly maintained for next round. The code has taken extreme care of not doing this till now, but it's not acceptable that the caller has to know these precise semantics. So let's unconditionnally clear the flag instead. For the sake of safety, this fix should be backported to 1.4.	2011-05-30 18:42:40 +02:00
Willy Tarreau	5c62092ca1	[MINOR] http: partially revert the chunking optimization for now Commit 57f5c1 used to provide a nice improvement on chunked encoding since it ensured that we did not set a PUSH flag for every chunk or buffer data part of a chunked transfer. Some applications appear to erroneously abuse HTTP chunking in order to get interactive exchanges between a user agent and an origin server with very small chunks. While it happens to work through haproxy, it's terribly slow due to the latency added after passing each chunk to the system, who could wait up to 200ms before pushing them onto the wire. So we need an interactive mode for such usages. In the mean time, step back on the optim, but not completely, so that we still keep the flag as long as we know we're not finished with the current chunk. This change should be backported to 1.4 too as the issue was discovered with it.	2011-05-11 20:17:42 +02:00
Willy Tarreau	ae94d4df8f	[MINOR] http: make the "HTTP 200" status code configurable. This status code is used in response to requests matching "monitor-uri". Some users need to adjust it to fit their needs (eg: make some strings appear there). As it's already defined as a chunked string and used exactly like other status codes, it makes sense to make it configurable with the usual "errorfile", "errorloc", ...	2011-05-11 16:31:43 +02:00
Willy Tarreau	436d9ed808	[REORG] http: move HTTP error codes back to proto_http.h This one was left isolated in its own file. It probably is a leftover from the 1.2->1.3 split.	2011-05-11 16:31:43 +02:00
Willy Tarreau	027a85bb03	[MINOR] http: don't report the "haproxy" word on the monitoring response Some people like to make the monitoring URL testable from unsafe locations. Reporting haproxy's existence there can sometimes be problematic. This patch should not be backported to 1.4 because it is possible, eventhough unlikely, that some scripts rely on this word to appear there.	2011-05-11 16:31:43 +02:00
Cyril Bont�	7c51a732f7	[BUG] fix binary stick-tables As reported by Lauri-Alo Adamson, version 1.5-dev6 doesn't support stick-tables with a binary type. This issue was introduced in the commit `4f92d32` where a line was erroneously deleted, and is 1.5-specific.	2011-05-09 23:30:58 +02:00
Willy Tarreau	96dd079b49	[BUG] proto_tcp: fix address binding on remote source Mark Brooks reported that commit 1b4b7c broke tproxy in 1.5-dev6. Nick Chalk tracked the issue down to a missing address family setting in tcp_bind_socket() which resulted in a failure to use get_addr_len(). This issue is 1.5-specific.	2011-04-19 07:20:57 +02:00
Willy Tarreau	a164fb5721	[BUG] checks: http-check expect could fail a check on multi-packet responses Christopher Blencowe reported that the httpchk_expect() function was lacking a test for incomplete responses : if the server sends only the headers in the first packet and the body in a subsequent one, there is a risk that the check fails without waiting for more data. A failure rate of about 1% was reported. This fix must be backported to 1.4.	2011-04-13 09:32:41 +02:00
Willy Tarreau	1fc1f45618	[CRITICAL] fix risk of crash when dealing with space in response cookies When doing fix `24581bae02` to correctly handle response cookies, an unfortunate typo was inserted in the less likely code path, resulting in a risk of crash when cookie-based persistence is enabled and the server emits a cookie with several spaces around the equal sign. This bug was noticed during a code backport. Its effects were never reported because this situation is very unlikely to appear, but it can be provoked on purpose by the server. This patch must be backported to 1.4 versions which contain the fix above (anything > 1.4.8), and to similar 1.3 versions > 1.3.25. 1.5-dev versions after 1.5-dev2 are affected too.	2011-04-08 00:50:36 +02:00
Willy Tarreau	442452034e	[BUG] stick-tables did not work when converting IPv6 to IPv4 A stick-table of type IPv6 would store a wrong IPv4 address as the result of an IPv6 to IPv4 conversion. This bug was introduced in 1.5-dev5.	2011-04-07 10:53:30 +02:00
Willy Tarreau	1b4b7ce6dd	[BUG] stream_sock: use get_addr_len() instead of sizeof() on sockaddr_storage John Helliwell reported a runtime issue on Solaris since 1.5-dev5. Traces show that connect() returns EINVAL, which means the socket length is not appropriate for the family. Solaris does not like being called with sizeof and needs the address family's size on sockaddr_storage. The fix consists in adding a get_addr_len() function which returns the socket's address length based on its family. Tests show that this works for both IPv4 and IPv6 addresses.	2011-04-05 16:56:50 +02:00
David du Colombier	4f92d32004	[MEDIUM] IPv6 support for stick-tables Since IPv6 is a different type than IPv4, the pattern fetch functions src6 and dst6 were added. IPv6 stick-tables can also fetch IPv4 addresses with src and dst. In this case, the IPv4 addresses are mapped to their IPv6 counterpart, according to RFC 4291.	2011-03-29 01:09:14 +02:00
Willy Tarreau	c735a0728e	[MINOR] acl: add support for table_cnt and table_avl matches Those trivial matches respectively return the number of entries used in a stick-table and the number of entries still available in a table.	2011-03-29 00:57:02 +02:00
Willy Tarreau	68f49da972	[BUG] stream_sock: fix handling for server side PROXY protocol Patch `5ab04ec47c` was incomplete, because if the first send() fails on an empty buffer, we fail to rearm the polling and we can't establish the connection anymore. The issue was reported by Ben Timby who provided large amounts of traces of various tests helping to reliably reproduce the issue.	2011-03-28 23:17:54 +02:00
David du Colombier	11bcb6c4f5	[MEDIUM] IPv6 support for syslog	2011-03-28 18:45:15 +02:00
Willy Tarreau	0bc3493d2c	[OPTIM] buffers: uninline buffer_forward() Since the latest additions to buffer_forward(), it became too large for inlining, so let's uninline it. The code size drops by 3kB. Should be backported to 1.4 too.	2011-03-28 16:25:58 +02:00
Willy Tarreau	d8ee85a0a3	[BUG] http: fix content-length handling on 32-bit platforms Despite much care around handling the content-length as a 64-bit integer, forwarding was broken on 32-bit platforms due to the 32-bit nature of the ->to_forward member of the "buffer" struct. The issue is that this member is declared as a long, so while it works OK on 64-bit platforms, 32-bit truncate the content-length to the lower 32-bits. One solution could consist in turning to_forward to a long long, but it is used a lot in the critical path, so it's not acceptable to perform all buffer size computations on 64-bit there. The fix consists in changing the to_forward member to a strict 32-bit integer and ensure in buffer_forward() that only the amount of bytes that can fit into it is considered. Callers of buffer_forward() are responsible for checking that their data were taken into account. We arbitrarily ensure we never consider more than 2G at once. That's the way it was intended to work on 32-bit platforms except that it did not. This issue was tracked down hard at Exosec with Bertrand Jacquin, Thierry Fournier and Julien Thomas. It remained undetected for a long time because files larger than 4G are almost always transferred in chunked-encoded format, and most platforms dealing with huge contents these days run on 64-bit. The bug affects all 1.5 and 1.4 versions, and must be backported.	2011-03-28 16:25:16 +02:00
Willy Tarreau	26f0f17200	[BUG] http: fix possible incorrect forwarded wrapping chunk size (take 2) Fix `acd20f80` was incomplete, the computed "bytes" value was not used. This fix must be backported to 1.4.	2011-03-27 20:00:03 +02:00
Willy Tarreau	7b7a8e9d83	[BUG] log: retrieve the target from the session, not the SI Since we now have the copy of the target in the session, use it instead of relying on the SI for it. The SI drops the target upon unregister() so applets such as stats were logged as "NOSRV".	2011-03-27 19:53:06 +02:00
Willy Tarreau	0b3a411543	[BUG] session: conn_retries was not always initialized Johannes Smith reported some wrong retries count in logs associated with bad requests. The cause was that the conn_retries field in the stream interface was only initialized when attempting to connect, but is used when logging, possibly with an uninitialized value holding last connection's conn_retries. This could have been avoided by making use of a stream interface initializer. This bug is 1.5-specific.	2011-03-27 19:16:56 +02:00
David du Colombier	d5f4328efd	[MEDIUM] use getaddrinfo to resolve names if gethostbyname fail Function gethostbyname is deprecated since IEEE Std 1003.1-2008 and was replaced by getaddrinfo (available since IEEE Std 1003.1-2004). Contrary to gethostbyname, getaddrinfo is specified to support both IPv4 and IPv4 addresses. Since some libc doesn't handle getaddrinfo properly, constant USE_GETADDRINFO must be defined at compile time to enable use of getaddrinfo.	2011-03-23 22:49:55 +01:00
Willy Tarreau	2dff0c28e8	[MINOR] cfgparse: better report wrong listening addresses and make use of str2sa_range It's always been a mess to debug wrong listening addresses because the parsing function does not indicate the file and line number. Now it does. Since the code was almost a duplicate of str2sa_range, it now makes use of it and has been sensibly reduced.	2011-03-23 22:49:55 +01:00
David du Colombier	9842ff1ae6	[MINOR] update comment about IPv6 support for server	2011-03-23 22:49:55 +01:00
Willy Tarreau	fab5a43726	[MEDIUM] config: rework the IPv4/IPv6 address parser to support host-only addresses The parser now distinguishes between pure addresses and address:port. This is useful for some config items where only an address is required. Raw IPv6 addresses are now parsed, but IPv6 host name resolution is still not handled (gethostbyname does not resolve IPv6 names to addresses).	2011-03-23 19:01:18 +01:00
Willy Tarreau	6f831b446c	[BUILD] proto_tcp: fix build issue with CTTPROXY Recent sockaddr_storage changes broke the almost unused cttproxy code. Fix is obvious.	2011-03-20 14:03:54 +01:00
Willy Tarreau	5ab04ec47c	[MEDIUM] server: add support for the "send-proxy" option This option enables use of the PROXY protocol with the server, which allows haproxy to transport original client's address across multiple architecture layers.	2011-03-20 11:53:50 +01:00
Willy Tarreau	b22e55bc8f	[MEDIUM] stream_sock: add support for sending the proxy protocol header line Upon connection establishment, stream_sock is now able to send a PROXY line before sending any data. Since it's possible that the buffer is already full, and we don't want to allocate a block for that line, we compute it on-the-fly when we need it. We just store the offset from which to (re-)send from the end of the line, since it's assumed that multiple outputs of the same proxy line will be strictly equivalent. In practice, one call is enough. We just make sure to handle the case where the first send() would indicate an incomplete output, eventhough it's very unlikely to ever happen.	2011-03-20 10:16:46 +01:00
Willy Tarreau	a73fcaf424	[MINOR] frontend: add a make_proxy_line function This function will build a PROXY protocol line header from two addresses (IPv4 or IPv6). AF_UNIX family will be reported as UNKNOWN.	2011-03-20 10:15:22 +01:00
Willy Tarreau	1b6e608c11	[BUG] session: src_conn_cur was returning src_conn_cnt instead Issue reported by Cory Forsyth and diagnosed by Cyril Bont�. Just a plain stupid copy-paste of the wrong fetch function call.	2011-03-16 06:56:57 +01:00
Willy Tarreau	d11ad78c26	[MINOR] checks: report it if checks fail due to socket creation error If the check fails for a low-level socket error (eg: address family not supportd), we currently ignore the status. We must report the error and declare a failed health check in this case. The only real reason for this would be when an IPv6 check is required on an IPv4-only system.	2011-03-13 22:12:54 +01:00
Willy Tarreau	6da0f6d6dd	[BUG] http: stats were not incremented on http-request deny A counter increase was missing here. This should be backported to 1.4 with care, as the code has changed a bit.	2011-03-13 22:00:24 +01:00
Willy Tarreau	ff011f26e9	[REORG] http: move the http-request rules to proto_http And also rename "req_acl_rule" "http_req_rule". At the beginning that was a bit confusing to me, especially the "req_acl" list which in fact holds what we call rules. After some digging, it appeared that some part of the code is 100% HTTP and not just related to authentication anymore, so let's move that part to HTTP and keep the auth-only code in auth.c.	2011-03-13 22:00:24 +01:00
Willy Tarreau	f68a15a951	[MEDIUM] http: always evaluate http-request rules before stats http-request Right now, http-request rules are not evaluated if the URL matches the stats request. This is quite unexpected. For instance, in the config below, an abuser present in the abusers list will not be prevented access to the stats. listen pub bind :8181 acl abuser src -f abusers.lst http-request deny if abuser stats uri /stats It is not a big deal but it's not documented as such either. For 1.5, let's have both lists be evaluated in turn, until one blocks. For 1.4 we'll simply update the doc to indicate that. Also instead of duplicating the code, the patch factors out the list walking code. The HTTP auth has been moved slightly earlier, because it was set after the header addition code, but we don't need to add headers to a request we're dropping.	2011-03-13 22:00:24 +01:00
Willy Tarreau	7d0aaf39d1	[MEDIUM] stats: split frontend and backend stats It's very annoying that frontend and backend stats are merged because we don't know what we're observing. For instance, if a "listen" instance makes use of a distinct backend, it's impossible to know what the bytes_out means. Some points take care of not updating counters twice if the backend points to the frontend, indicating a "listen" instance. The thing becomes more complex when we try to add support for server side keep-alive, because we have to maintain a pointer to the backend used for last request, and to update its stats. But we can't perform such comparisons anymore because the counters will not match anymore. So in order to get rid of this situation, let's have both frontend AND backend stats in the "struct proxy". We simply update the relevant ones during activity. Some of them are only accounted for in the backend, while others are just for frontend. Maybe we can improve a bit on that later, but the essential part is that those counters now reflect what they really mean.	2011-03-13 22:00:23 +01:00
David du Colombier	6f5ccb1589	[MEDIUM] add internal support for IPv6 server addresses This patch turns internal server addresses to sockaddr_storage to store IPv6 addresses, and makes the connect() function use it. This code already works but some caveats with getaddrinfo/gethostbyname still need to be sorted out while the changes had to be merged at this stage of internal architecture changes. So for now the config parser will not emit an IPv6 address yet so that user experience remains unchanged. This change should have absolutely zero user-visible effect, otherwise it's a bug introduced during the merge, that should be reported ASAP.	2011-03-13 22:00:12 +01:00
Willy Tarreau	827aee913f	[MAJOR] session: remove the ->srv pointer from struct session This one has been removed and is now totally superseded by ->target. To get the server, one must use target_srv(&s->target) instead of s->srv now. The function ensures that non-server targets still return NULL.	2011-03-10 23:32:17 +01:00
Willy Tarreau	9e000c6ec8	[CLEANUP] stream_interface: use inline functions to manipulate targets The connection target involves a type and a union of pointers, let's make the code cleaner using simple wrappers.	2011-03-10 23:32:17 +01:00
Willy Tarreau	3d80d911aa	[MEDIUM] session: remove s->prev_srv which is not needed anymore s->prev_srv is used by assign_server() only, but all code paths leading to it now take s->prev_srv from the existing s->srv. So assign_server() can do that copy into its own stack. If at one point a different srv is needed, we still have a copy of the last server on which we failed a connection attempt in s->target.	2011-03-10 23:32:16 +01:00
Willy Tarreau	664beb8610	[MINOR] session: add a pointer to the new target into the session When dealing with HTTP keep-alive, we'll have to know if we can reuse an existing connection. For that, we'll have to check if the current connection was made on the exact same target (referenced in the stream interface). Thus, we need to first assign the next target to the session, then copy it to the stream interface upon connect(). Later we'll check for equivalence between those two operations.	2011-03-10 23:32:16 +01:00
Willy Tarreau	d6cc532ca1	[MINOR] cfgparse: only keep one of dispatch, transparent, http_proxy Since all of them are defined as proxy options, it's better to ensure that at most one of them is enabled at once. The priority has been set according to what is already performed in the backend : 1) dispatch 2) http_proxy 3) transparent	2011-03-10 23:32:16 +01:00
Willy Tarreau	f5ab69aad9	[MINOR] proxy: add PR_O2_DISPATCH to detect dispatch mode Till now we used the fact that the dispatch address was not null to use the dispatch mode. This is very unconvenient, so let's have a dedicated option.	2011-03-10 23:32:16 +01:00
Willy Tarreau	295a837726	[REORG] session: move the data_ctx struct to the stream interface's applet This is in fact where those parts belong to. The old data_state was replaced by applet.state and is now initialized when the applet is registered. It's worth noting that the applet does not need to know the session nor the buffer anymore since everything is brought by the stream interface. It is possible that having a separate applet struct would simplify the code but that's not a big deal.	2011-03-10 23:32:16 +01:00
Willy Tarreau	5ec29ffa42	[CLEANUP] stats: make all dump functions only rely on the stream interface This will be needed to move the applet-specific data out of the session.	2011-03-10 23:32:16 +01:00
Willy Tarreau	75581aebb0	[CLEANUP] session: remove data_source from struct session This one was only used for logging purposes, it's not needed anymore.	2011-03-10 23:32:15 +01:00
Willy Tarreau	71904a4ee8	[MEDIUM] log: take the logged server name from the stream interface With HTTP keep-alive, logging the right server name will be quite complex because the assigned server will possibly change before we log. Also, when we want to log accesses to an applet, it's not easy because the applet becomes NULL again before logging. The logged server's name is now taken from the target stored in the stream interface. That way we can log an applet, a server name, or we could even log a proxy or anything else if we wanted to. Ideally the session should contain a desired target which is the one which should be logged.	2011-03-10 23:32:15 +01:00
Willy Tarreau	7c0a151a2e	[CLEANUP] stream_interface: remove the applet.handler pointer Now that we have the target pointer and type in the stream interface, we don't need the applet.handler pointer anymore. That makes the code somewhat cleaner because we know we're dealing with an applet by checking its type instead of checking the pointer is not null.	2011-03-10 23:32:15 +01:00
Willy Tarreau	ac82540c35	[MEDIUM] stream_interface: store the target pointer and type When doing a connect() on a stream interface, some information is needed from the server and from the backend. In some situations, we don't have a server and only a backend (eg: peers). In other cases, we know we have an applet and we don't want to connect to anything, but we'd still like to have the info about the applet being used. For this, we now store a pointer to the "target" into the stream interface. The target describes what's on the other side before trying to connect. It can be a server, a proxy or an applet for now. Later we'll probably have descriptors for multiple-stage chains so that the final information may still be found. This will help removing many specific cases in the code. It already made it possible to remove the "srv" and "be" parameters to tcpv4_connect_server().	2011-03-10 23:32:15 +01:00
Willy Tarreau	f153686a71	[REORG] tcp: make tcpv4_connect_server() take the target address from the SI The address is now available in the stream interface, no need to pass it by argument.	2011-03-10 23:32:15 +01:00
Willy Tarreau	957c0a5845	[REORG] session: move client and server address to the stream interface This will be needed very soon for the keep-alive.	2011-03-10 23:32:14 +01:00
Willy Tarreau	bc4af0573c	[REORG] stream_interface: move the st0, st1 and private members to the applet Those fields are only used by the applets, so let's move them to the struct.	2011-03-10 23:32:14 +01:00
Willy Tarreau	b24281b0ff	[MINOR] stream_interface: make use of an applet descriptor for IO handlers I/O handlers are still delicate to manipulate. They have no type, they're just raw functions which have no knowledge of themselves. Let's have them declared as applets once for all. That way we can have multiple applets share the same handler functions and we can store their names there. When we later need to add more parameters (eg: usage stats), we'll be able to do so in the applets themselves. The CLI functions has been prefixed with "cli" instead of "stats" as it's clearly what is going on there. The applet descriptor in the stream interface should get all the applet specific data (st0, ...) but this will be done in the next patch so that we don't pollute this one too much.	2011-03-10 23:32:14 +01:00
Willy Tarreau	dfd7fca26c	[BUG] config: don't crash on empty pattern files. Both Hank A. Paulson and Rob at pixsense reported a crash when loading ACLs from a pattern file which contains empty lines. From the tests, it appears that only files that contain nothing but empty lines are causing that (in the past they would have had their line feeds loaded as patterns). The crash happens in the free_pattern() call which doesn't like to be called with a NULL pattern. Let's make it accept it so that it's more in line with the standard uses of free() which ignores NULLs.	2011-03-09 10:22:30 +01:00
Cyril Bonté	1e2a170cf8	[BUG] stats: admin web interface must check the proxy state Similar to the stats socket bug, we must check that the proxy is not disabled before trying to enable/disable a server. Even if a disabled proxy is not displayed, someone can inject a faulty proxy name in the POST parameters. So, we must ensure that no disabled proxy can be used.	2011-03-04 10:01:40 +01:00
Cyril Bonté	613f0df88b	[BUG] stats: admin commands must check the proxy state As reported by Bryan Talbot, enabling and disabling a server in a disabled proxy causes a segfault. Changing the weight can also cause a similar segfault.	2011-03-04 10:01:39 +01:00
Willy Tarreau	61a21a34da	[BUG] http: balance url_param did not work with first parameters on POST Bryan Talbot reported that POST requests with a query string were not correctly processed if the hash parameter was the first one, because the delimiter that was looked for to trigger the parsing was '&' instead of '?'. Also, while checking the code, it became apparent that it was enough for a query string to be present in the request for POST parameters to be ignored, even if the url_param was in the body and not in the URL. The code has then been fixed like this : 1) look for URL param. If found, return it. 2) if no URL param was found and method is POST, then look it up into the body The code now seems to pass all request combinations. This patch must be backported to 1.4 since 1.4 is equally broken right now.	2011-03-01 20:42:20 +01:00
Willy Tarreau	124d99181c	[BUG] http: fix computation of message body length after forwarding has started Till now, the forwarding code was making use of the hdr_content_len member to hold the size of the last chunk parsed. As such, it was reset after being scheduled for forwarding. The issue is that this entry was reset before the data could be viewed by backend.c in order to parse a POST body, so the "balance url_param check_post" did not work anymore. In order to fix this, we need two things : - the chunk size (reset upon every forward) - the total body size (not reset) hdr_content_len was thus replaced by the former (hence the size of the patch) as it makes more sense to have it stored that way than the way around. This patch should be backported to 1.4 with care, considering that it affects the forwarding code.	2011-03-01 20:30:48 +01:00
Willy Tarreau	acd20f80c1	[BUG] http: fix possible incorrect forwarded wrapping chunk size It seems like if a response message is chunked and the chunk size wraps at the end of the buffer and the crlf sequence is incomplete, then we can forward a wrong chunk size due to incorrect handling of the wrapped size. It seems extremely unlikely to occur on real traffic (no reason to have half of the CRLF after a chunk) but nothing prevents it from being possible. This fix must be backported to 1.4.	2011-03-01 20:04:36 +01:00
Willy Tarreau	6a8097f034	[BUG] acl: fd leak when reading patterns from file The fd is not closed after patterns have successfully been read from a file. Bug reported by Bertrand Jacquin. Should be backported to 1.4.	2011-02-26 15:14:15 +01:00
Willy Tarreau	9d9ed0113b	[MINOR] config: warn if response-only conditions are used in "redirect" rules	2011-02-23 15:32:21 +01:00
Willy Tarreau	4a0d828546	[MINOR] acl: srv_id is only valid in responses	2011-02-23 15:32:21 +01:00
Willy Tarreau	17af419a01	[BUG] acl: srv_id must return no match when the server is NULL Reported by Herv� Commowick, causes crashes when the server is not known.	2011-02-23 15:32:15 +01:00
Willy Tarreau	dc23a92ee7	[BUG] startup: set the rlimits before binding ports, not after. As reported by the Loadbalancer.org team, it was not possible to bind more than 1024 ports. This is because the process' limits were set after trying to bind the sockets, which defeats their purpose. This fix must be backported to 1.4 and 1.3.	2011-02-16 11:14:30 +01:00
Willy Tarreau	c8b11090b0	[BUG] cfgparse: correctly count one socket per port in ranges We used to only count one socket instead of one per listener. This makes the socket count wrong, preventing from automatically computing the proper number of sockets to bind. This fix must be backported to 1.4 and 1.3.	2011-02-16 11:14:29 +01:00
Willy Tarreau	910ef306bc	[BUG] http: use correct ACL pointer when evaluating authentication req_acl was used instead of req_acl_final. As a matter of luck, both happen to be the same at this point, but this is not granted in the future. This fix should be backported to 1.4.	2011-02-13 12:18:22 +01:00
Cyril Bont�	23b39d9859	[MINOR] stats: add support for several packets in stats admin Some browsers send POST requests in several packets, which was not supported by the "stats admin" function. This patch allows to wait for more data when they are not fully received (we are still limited to a certain size defined by the buffer size minus its reserved space). It also adds support for the "Expect: 100-Continue" header.	2011-02-12 13:10:18 +01:00
Willy Tarreau	5c4784f4b8	[BUG] http: update the header list's tail when removing the last header Stefan Behte reported a strange case where depending on the position of the Connection header in the header list, some headers added after it were or were not usable in "balance hdr()". The reason is that when the last header is removed, the list's tail was not updated, so any header added after that one was not visible from the list. This fix must be backported to 1.4 and possibly 1.3.	2011-02-12 13:07:35 +01:00
Andreas Kohn	16171e234b	[MINOR] cfgparse: Check whether the path given for the stats socket actually fits into the sockaddr_un structure to avoid truncation. while working further on the changes to allow for dynamic adding/removing of backend servers we noticed a potential problem: the path given for the 'stats socket' global option may get truncated when copying it into the sockaddr_un.sun_path field. Attached patch checks the length, and reports an error if truncation would happen. This issue was noticed by Joerg Sonnenberger <joerg@NetBSD.org>.	2011-01-23 07:26:05 +01:00
Willy Tarreau	7d286a0f63	[BUILD] frontend: shut a warning with TCP_MAXSEG src/frontend.c: In function 'frontend_accept': src/frontend.c:110: warning: pointer targets in passing argument 5 of 'getsockopt' differ in signedness The argument should be socklen_t and not int.	2011-01-05 19:35:41 +01:00
Rauf Kuliyev	38b4156a69	[MINOR] checks: add PostgreSQL health check I have written a small patch to enable a correct PostgreSQL health check It works similar to mysql-check with the very same parameters. E.g.: listen pgsql 127.0.0.1:5432 mode tcp option pgsql-check user pgsql server masterdb pgsql.server.com:5432 check inter 10000	2011-01-04 15:14:13 +01:00
Willy Tarreau	0013433b09	[MINOR] http: improve url_param pattern extraction to ignore empty values It's better to avoid sticking on empty parameter values, as this almost always indicates a missing parameter. Otherwise it's easy to enter a situation where all new visitors stick to the same server.	2011-01-04 14:57:34 +01:00
Willy Tarreau	a0e5861302	[REVERT] undo the stick-table string key lookup fixes Revert commits `035da6d1b0` and `f18b5f21ba`. These fixes were wrong. They worked but they were fixing the symptom instead of the root cause of the problem. The real issue was in the ebtree lookup code and it has been fixed now so these patches are not needed anymore. It's better not to copy memory blocks when we don't need to, so let's revert them.	2011-01-04 14:50:49 +01:00
Willy Tarreau	f18b5f21ba	[BUG] stick-table: use the private buffer when padding strings Commit `035da6d1b0` was incorrect as it could modify a live buffer. We must first ensure that we're on the private buffer or perform a copy before modifying the data.	2011-01-04 06:29:44 +01:00
Willy Tarreau	5109196275	[BUG] acl: fix handling of empty lines in pattern files Gabriel Sosa reported that haproxy unexpectedly reports an error when a pattern file loaded by an ACL contains an empty line. The test was present but inefficient as it did not consider the '\n' as the end of the line. This fix relies on the line length instead. It should be backported to 1.4.	2011-01-03 21:06:32 +01:00
David Cournapeau	16023eef0b	[MINOR] http: add pattern extraction method to stick on query string parameter This is an updated version of my patch for url parameter extraction on stick table. It adds "url_param(name)" as a possible stick method.	2011-01-03 13:26:02 +01:00
Willy Tarreau	035da6d1b0	[BUG] stick-table: correctly terminate string keys during lookups If a key to be looked up is extracted from data without being padded and if it matches the beginning of another stored key, it is not found in subsequent lookups because it does not end with a zero. This bug was discovered and diagnosed by David Cournapeau.	2011-01-02 20:12:10 +01:00
Kevinm	48936af9a2	[MINOR] log: ability to override the syslog tag One of the requirements we have is to run multiple instances of haproxy on a single host; this is so that we can split the responsibilities (and change permissions) between product teams. An issue we ran up against is how we would distinguish between the logs generated by each instance. The solution we came up with (please let me know if there is a better way) is to override the application tag written to syslog. We can then configure syslog to write these to different files. I have attached a patch adding a global option 'log-tag' to override the default syslog tag 'haproxy' (actually defaults to argv[0]).	2010-12-30 11:43:36 +01:00
Willy Tarreau	48a7e72c5d	[MINOR] tcp: add support for dynamic MSS setting By passing a negative value to the "mss" argument of "bind" lines, it becomes possible to subtract this value to the MSS advertised by the client, which results in segments smaller than advertised. The effect is useful with some TCP stacks which ACK less often when segments are not full, because they only ACK every other full segment as suggested by RFC1122. NOTE: currently this has no effect on Linux kernel 2.6, a kernel patch is still required to change the MSS of established connections.	2010-12-30 09:50:23 +01:00
Joe Williams	df5b38fac1	[MINOR] log: add support for passing the forwarded hostname Haproxy does not include the hostname rather the IP of the machine in the syslog headers it sends. Unfortunately this means that for each log line rsyslog does a reverse dns on the client IP and in the case of non-routable IPs one gets the public hostname not the internal one. While this is valid according to RFC3164 as one might imagine this is troublsome if you have some machines with public IPs, internal IPs, no reverse DNS entries, etc and you want a standardized hostname based log directory structure. The rfc says the preferred value is the hostname. This patch adds a global "log-send-hostname" statement which accepts an optional string to force the host name. If unset, the local host name is used.	2010-12-29 17:05:48 +01:00
Cyril Bonté	9ea2b9ac75	[BUG] http: fix http-pretend-keepalive and httpclose/tunnel mode Since haproxy 1.4.9, combining option httpclose and option http-pretend-keepalive can leave the connections opened until the backend keep-alive timeout is reached, providing bad performances. The same can occur when the proxy is in tunnel mode. This patch ensures that the server side connection is closed after the response and ignore http-pretend-keepalive in tunnel mode.	2010-12-29 15:24:48 +01:00
Willy Tarreau	b89cfca494	[BUG] session: release slot before processing pending connections When a connection error is encountered on a server and the server's connection pool is full, pending connections are not woken up because the current connection is still accounted for on the server, so it still appears full. This becomes visible on a server which has "maxconn 1" because the pending connections will only be able to expire in the queue. Now we take care of releasing our current connection before trying to offer it to another pending request, so that the server can accept a next connection. This patch should be backported to 1.4.	2010-12-29 14:38:29 +01:00
Willy Tarreau	32d3ee99ee	[CRITICAL] session: correctly leave turn-around and queue states on abort When a client connection aborts while the server-side connection is in turn-around after a failed connection attempt, the turn-around timeout is reset in shutw() but the state is not changed. The session then remains stuck in this state forever. Change the QUE and TAR states to DIS just as we do for CER to fix this. This patch should be backported to 1.4.	2010-12-29 14:38:15 +01:00
Willy Tarreau	ed2fd2daea	[BUG] http: fix incorrect error reporting during data transfers We've had several issues related to data transfers. First, if a client aborted an upload before the server started to respond, it would get a 502 followed by a 400. The same was true (in the other way around) if the server suddenly aborted while the client was uploading the data. The flags reported in the logs were misleading. Request errors could be reported while the transfer was stopped during the data phase. The status codes could also be overwritten by a 400 eventhough the start of the response was transferred to the client. The stats were also wrong in case of data aborts. The server or the client could sometimes be miscredited for being the author of the abort depending on where the abort was detected. Some client aborts could also be accounted as request errors and some server aborts as response errors. Now it seems like all such issues are fixed. Since we don't have a specific state for data flowing from the client to the server before the server responds, we're still counting the client aborted transfers as "CH", and they become "CD" when the server starts to respond. Ideally a "P" state would be desired. This patch should be backported to 1.4.	2010-12-29 13:55:32 +01:00
Willy Tarreau	9c3bc229ec	[CLEANUP] frontend: only apply TCP-specific settings to TCP/TCP6 sockets It's useless to apply keep-alive or lingering to non-TCP sockets.	2010-12-24 14:49:37 +01:00
Willy Tarreau	0499e3575c	[BUG] http: analyser optimizations broke pipelining HTTP pipelining currently needs to monitor the response buffer to wait for some free space to be able to send a response. It was not possible for the HTTP analyser to be called based on response buffer activity. Now we introduce a new buffer flag BF_WAKE_ONCE which is set when the HTTP request analyser is set on the response buffer and some activity is detected. This is not clean at all but once of the only ways to fix the issue before we make it possible to register events for analysers. Also it appeared that one realign condition did not cover all cases.	2010-12-17 07:15:57 +01:00
Herv� COMMOWICK	35ed8019e3	[MINOR] acl: add be_id/srv_id to match backend's and server's id These ones can be useful in responses.	2010-12-15 23:36:59 +01:00
Cyril Bont�	02ff8ef677	[MINOR] add warnings on features not compatible with multi-process mode Using haproxy in multi-process mode (nbproc > 1), some features can be not fully compatible or not work at all. haproxy will now display a warning on startup for : - appsession - sticking rules - stats / stats admin - stats socket - peers (fatal error in that case)	2010-12-15 07:28:11 +01:00
Willy Tarreau	10479e4bac	[MINOR] stats: add global event ID and count This counter will help quickly spot whether there are new errors or not. It is also assigned to each capture so that a script can keep trace of which capture was taken when.	2010-12-12 14:00:34 +01:00
Willy Tarreau	e1582eb7f6	[MINOR] http: capture incorrectly chunked message bodies It is possible to block on incorrectly chunked requests or responses, but this becomes very hard to debug when it happens once in a while. This patch adds the ability to also capture incorrectly chunked requests and responses. The chunk will appear in the error buffer and will be verifiable with the usual "show errors". The incorrect byte will match the error location.	2010-12-12 13:10:11 +01:00
Willy Tarreau	81f2fb97fe	[MINOR] http: support wrapping messages in error captures Error captures did only support contiguous messages. This is annoying for capturing chunking errors, so let's ensure the function is able to copy wrapped messages.	2010-12-12 13:09:08 +01:00
Willy Tarreau	798e128a4d	[BUG] stream_interface: truncate buffers when sending error messages When an error message is returned to a client, all buffer contents were left intact. Since the analysers were removed, the potentially invalid data that were read had a chance to be sent too. Now we ensure we only keep the already scheduled data in the buffer and we truncate it after that. That means that responses with data that must be blocked will really be blocked, and that incorrectly chunked data will be stopped at the point where the chunking fails.	2010-12-12 13:06:00 +01:00
Willy Tarreau	3fe693b4d6	[BUG] http chunking: don't report a parsing error on connection errors When haproxy parses chunk-encoded data that are scheduled to be sent, it is possible that the other end is closed (mainly due to a client abort returning as an error). The message state thus changes to HTTP_MSG_ERROR and the error is reported as a chunk parsing error ("PD--") while it is not. Detect this case before setting the flags and set the appropriate flag in this case.	2010-12-12 12:50:05 +01:00
Willy Tarreau	078272e115	[MINOR] stats: report HTTP message state and buffer flags in error dumps Debugging parsing errors can be greatly improved if we know what the parser state was and what the buffer flags were (especially for closed inputs/outputs and full buffers). Let's add that to the error snapshots.	2010-12-12 12:46:33 +01:00
Willy Tarreau	57f5c12c04	[OPTIM] http: don't send each chunk in a separate packet When forwarding chunk-encoded data, each chunk gets a TCP PUSH flag when going onto the wire simply because the send() function does not know that some data remain after it (next chunk). Now we set the BF_EXPECT_MORE flag on the buffer if the chunk size is not null. That way we can reduce the number of packets sent, which is particularly noticeable when forwarding compressed data, especially as it requires less ACKs from the client.	2010-12-02 00:39:33 +01:00
Willy Tarreau	342b11c4d4	[BUG] http: do not re-enable the PROXY analyser on keep-alive The PROXY analyser is connection-oriented and must only be set once. When an HTTP transaction is done, we must not re-enable it.	2010-11-29 07:32:02 +01:00
Willy Tarreau	798a39cdc9	[MEDIUM] hash: add support for an 'avalanche' hash-type When the number of servers is a multiple of the size of the input set, map-based hash can be inefficient. This typically happens with 64 servers when doing URI hashing. The "avalanche" hash-type applies an avalanche hash before performing a map lookup in order to smooth the distribution. The result is slightly less smooth than the map for small numbers of servers, but still better than the consistent hashing.	2010-11-29 07:28:16 +01:00
Willy Tarreau	4c14eaa0d4	[CLEANUP] hash: move the avalanche hash code globally available We'll use this hash at other places, let's make it globally available. The function has also been renamed because its "chash_hash" name was not appropriate.	2010-11-29 07:28:16 +01:00
Willy Tarreau	26db59ea6b	[BUG] http: correctly update the header list when removing two consecutive headers When a header is removed, the previous header's next pointer is updated to reflect the next of the current header. However, when cycling through the loop, we update the prev pointer to point to the deleted header, which means that if we delete another header, it's the deleted header's next pointer that will be updated, leaving the deleted header in the list with a null length, which is forbidden. We must just not update the prev pointer after a removal. This bug was present when either "reqdel" and "rspdel" removed two consecutive headers. It could also occur when removing cookies in either requests or responses, but since headers were the last header processing, the issue remained unnoticed. Issue reported by Hank A. Paulson. This fix must be ported to 1.4 and possibly 1.3.	2010-11-28 07:06:23 +01:00
Willy Tarreau	b810554f8f	[CRITICAL] cookies: mixing cookies in indirect mode and appsession can crash the process Cookies in indirect mode are removed from the cookie header. Three pointers ought to be updated when appsession cookies are processed next, but were not. The result is that a memcpy() can be called with a negative value causing the process to crash. It is not sure whether this can be remotely exploited or not. (cherry picked from commit c5f3749aa3ccfdebc4992854ea79823d26f66213)	2010-11-28 07:06:22 +01:00
Willy Tarreau	77eb9b8a2d	[BUG] appsession: fix possible double free in case of out of memory In out of memory conditions, the ->destroy function would free all possibly allocated pools from the current appsession, including those that were not yet allocated nor assigned, which used to point to a previous allocation, obviously resulting in a segfault. (cherry picked from commit 75eae485921d3a6ce197915c769673834ecbfa5c)	2010-11-19 13:25:11 +01:00
Willy Tarreau	f70fc75296	[BUG] capture: do not capture a cookie if there is no memory left In case of out of memory, it was possible to write to a null pointer when capturing response cookies due to a missing "else" block. The request handling was fine though. (cherry picked from commit 62e3604d7dd27741c0b4c9e27d9e7c73495dfc32)	2010-11-19 13:25:11 +01:00
Willy Tarreau	e79c3b24fb	[BUG] debug: report the correct poller list in verbose mode When running with -vv or -V -d, the list of usable polling systems is reported. The final selection did not take into account the possible failures during the tests, which is misleading and could make one think that a non-working poller will be used, while it is not the case. Fix that to really report the correct ones. (cherry picked from commit 6d0e354e0171f08b7b3868ad2882c3663bd068a7)	2010-11-19 13:25:10 +01:00
Cyril Bont�	1f5848a460	[CLEANUP] unix sockets : move create_uxst_socket() in uxst_bind_listener() The code of create_uxst_socket() is moved in uxst_bind_listener() so that we don't need to pass a lot of parameters, as it was only called there.	2010-11-14 17:21:44 +01:00
Cyril Bont�	e4cbbe2a0e	[MINOR] unix sockets : inherits the backlog size from the listener Since unix sockets are supported for bind, the default backlog size was not enough to accept the traffic. The size is now inherited from the listener to behave like the tcp listeners. This also affects the "stats socket" backlog, which is now determined by "stats maxconn".	2010-11-14 17:21:31 +01:00
Willy Tarreau	48d84c10b5	[OPTIM] linux: add support for bypassing libc to force using vsyscalls Some distros' libc are built for CPUs earlier than i686 and as such do not offer support for Linux kernel's faster vsyscalls. This code adds a new build option USE_VSYSCALLS to bypass libc for most commonly used system calls. A net gain of about 10% can be observed with this change alone. It only works when /proc/sys/abi/vsyscall32 equals exactly 2. When it's set to 1, the VDSO is randomized and cannot be used.	2010-11-14 17:09:33 +01:00
Willy Tarreau	11f49408f2	[OPTIM] stream_sock: don't clear FDs that are already cleared We can on average two calls to __fd_clr() per session by avoiding to call it unnecessarily.	2010-11-11 23:08:17 +01:00
Willy Tarreau	2f976e18b8	[OPTIM] session: don't recheck analysers when buffer flags have not changed Analysers were re-evaluated when some flags were still present in the buffers, even if they had not changed since previous pass, resulting in a waste of CPU cycles. Ensuring that the flags have changed has saved some useless calls : function min calls per session (before -> after) http_request_forward_body 5 -> 4 http_response_forward_body 3 -> 2 http_sync_req_state 10 -> 8 http_sync_res_state 8 -> 6 http_resync_states 8 -> 6	2010-11-11 14:28:47 +01:00
Willy Tarreau	abe8ea5c1d	[BUG] accept: don't close twice upon error The stream_sock's accept() used to close the FD upon error, but this was also sometimes performed by the frontend's accept() called via the session's accept(). Those interlaced calls were also responsible for the spaghetti-looking error unrolling code in session.c and stream_sock.c. Now the frontend must not close the FD anymore, the session is responsible for that. It also takes care of just closing the FD or also removing from the FD lists, depending on its state. The socket-level accept() does not have to care about that anymore.	2010-11-11 11:05:20 +01:00
Willy Tarreau	bd55e3167b	[BUILD] peers: shut a printf format warning (key_size is a size_t) Also fix a few misleading comments.	2010-11-11 11:05:04 +01:00
Willy Tarreau	fffe1325df	[CLEANUP] accept: replace some inappropriate Alert() calls with send_log() Some Alert() messages were remaining in the accept() path, which they would have no chance to be detected. Remove some of them (the impossible ones) and replace the relevant ones with send_log() so that the admin has a chance to catch them.	2010-11-11 09:51:38 +01:00
Emeric Brun	5a8c0a9f52	[MEDIUM] Manage soft stop on peers proxy	2010-11-11 09:29:08 +01:00
Emeric Brun	32da3c40db	[MEDIUM] Manage peers section parsing and stick table registration on peers.	2010-11-11 09:29:08 +01:00
Emeric Brun	2b920a1af1	[MAJOR] Add new files src/peer.c, include/proto/peers.h and include/types/peers.h for sync stick table management Add cmdline option -L to configure local peer name	2010-11-11 09:29:08 +01:00
Emeric Brun	85e77c7f0d	[MEDIUM] Create updates tree on stick table to manage sync.	2010-11-11 09:29:08 +01:00
Emeric Brun	1e029aa965	[MINOR] Manage all types (ip, integer, string, binary) on cli "show table" command	2010-11-11 09:29:07 +01:00
Emeric	f2d7caedd1	[MINOR] Add pattern's fetchs payload and payload_lv	2010-11-11 09:29:07 +01:00
Emeric Brun	485479d8e9	[MEDIUM] Create new protected pattern types CONSTSTRING and CONSTDATA to force memcpy if data from protected areas need to be manipulated. Enhance pattern convs and fetch argument parsing, now fetchs and convs callbacks used typed args. Add more details on error messages on parsing pattern expression function. Update existing pattern convs and fetchs to new proto. Create stick table key type "binary". Manage Truncation and padding if pattern's fetch-converted result don't match table key size.	2010-11-11 09:29:07 +01:00
Emeric Brun	38e7176961	[MINOR] new acls fetch req_ssl_hello_type and rep_ssl_hello_type	2010-11-11 09:28:55 +01:00
Emeric Brun	97679e7901	[MEDIUM] Implement tcp inspect response rules	2010-11-11 09:28:18 +01:00
Emeric Brun	fbce6d0215	[BUG] stick table purge failure if size less than 255 If table size is lower than 256, we can't force to purge old entries. This patch should be backported to 1.4.	2010-11-11 09:28:18 +01:00
Willy Tarreau	da4d9fe5a4	[BUG] session: don't stop forwarding of data upon last packet If a read shutdown is encountered on the first packet of a connection right after the data and the last analyser is unplugged at the same time, then that last data chunk may never be forwarded. In practice, right now it cannot happen on requests due to the way they're scheduled, nor can it happen on responses due to the way their analysers work. But this behaviour has been observed with new response analysers being developped. The reason is that when the read shutdown is encountered and an analyser is present, data cannot be forwarded but the BF_SHUTW_NOW flag is set. After that, the analyser gets called and unplugs itself, hoping that process_session() will automatically forward the data. This does not happen due to BF_SHUTW_NOW. Simply removing the test on this flag is not enough because then aborted requests still get forwarded, due to the forwarding code undoing the abort. The solution here consists in checking BF_SHUTR_NOW instead of BF_SHUTW_NOW. BF_SHUTR_NOW is only set on aborts and remains set until ->shutr() is called. This is enough to catch recent aborts but not prevent forwarding in other cases. Maybe a new special buffer flag "BF_ABORT" might be desirable in the future. This patch does not need to be backported because older versions don't have the analyser which make the problem appear.	2010-11-11 09:26:29 +01:00
Cyril Bont�	62846b2674	[MINOR] config: detect options not supported due to compilation options Some options depends on the target architecture or compilation options. When such an option is used on a compiled version that doesn't support it, it's probably better to identify it as an unsupported option due to compilation options instead of an unknown option. Edit: better check on the empty capability than on the option bits. -Willy	2010-11-11 09:26:28 +01:00
Cyril Bont�	acd7d63ff9	[CLEANUP] Remove unneeded chars allocation Some arrays used to log addresses add some more bytes for ports but this space is never used.	2010-11-11 09:26:28 +01:00
Willy Tarreau	b40dc94a9a	[MEDIUM] unix sockets: cleanup the error reporting path There were a lot of snprintf() everywhere in the UNIX bind code. Now we proceed as for tcp and indicate the socket path at the end between square brackets. The code is smaller and more readable.	2010-11-11 09:26:28 +01:00
Cyril Bont�	43ba1b331c	[MINOR] startup: print the proxy socket which caused an error Add the address and port to the error message of the proxy socket that caused the error. This can be helpful when several listening addresses are used in a proxy. Edit: since we now also support unix sockets (which already report their path), better move the address reporting to proto_tcp.c by analogy. -Willy	2010-11-11 09:26:28 +01:00
Willy Tarreau	17f449b214	[MINOR] move MAXPATHLEN definition to compat.h MAXPATHLEN may be used at other places, it's unconvenient to have it redefined in a few files. Also, since checking it requires including sys/param.h, some versions of it cause a macro declaration conflict with MIN/MAX which are defined in tools.h. The solution consists in including sys/param.h in both files so that we ensure it's loaded before the macros are defined and MAXPATHLEN is checked.	2010-11-11 09:21:53 +01:00
Willy Tarreau	d55c3feca6	[MINOR] cfgparse: report support of <path> for the 'bind' statements "bind" now supports unix sockets, so report that in the error message.	2010-11-09 15:59:42 +01:00
Emeric Brun	ed76092e10	[MEDIUM] Add supports of bind on unix sockets.	2010-11-09 15:59:42 +01:00
Emeric Brun	5bd86a8ff5	[MINOR] Support listener's sockets unix on http logs. Enhance controls of sockets family on X-Forwarded-For and X-Original-To insert	2010-11-09 15:59:42 +01:00
Emeric Brun	f769f51af6	[MINOR] Enhance controls of socket's family on acls and pattern fetch	2010-11-09 15:59:42 +01:00
Emeric Brun	0aaccf88f9	[MINOR] Manage socket type unix for some logs	2010-11-09 15:59:41 +01:00
Emeric Brun	ec810d1dc7	[MINOR] Add some tests on sockets family for port remapping and mode transparent.	2010-11-09 15:59:41 +01:00
Emeric Brun	ab844ea9e1	[MINOR] Support of unix listener sockets for debug and log event messages on frontend.c	2010-11-09 15:57:37 +01:00
Emeric Brun	837ca52de3	[MINOR] Manage unix socket source field on session dump on sock stats	2010-11-05 10:34:07 +01:00
Emeric Brun	4ab9262894	[MINOR] Manage unix socket source field on logs	2010-11-05 10:34:07 +01:00
Emeric Brun	cf20bf1c1c	[MEDIUM] Enhance message errors management on binds	2010-11-05 10:34:07 +01:00
Emeric Brun	861ccff9ca	[MINOR] frontend: add tcpv6 support on accept-proxy bind	2010-10-30 19:04:38 +02:00
Emeric Brun	f4711a3221	[MINOR] frontend: improve accept-proxy header parsing The accept-proxy now automatically fails as soon as a character does not match the expected syntax.	2010-10-30 19:04:38 +02:00
Willy Tarreau	3041b9fcc3	[MEDIUM] session: call the frontend_decode_proxy analyser on proxied connections This analyser must absolutely be the earliest one to process contents, given the nature of the protocol.	2010-10-30 19:04:38 +02:00
Willy Tarreau	8b0cbf9969	[MINOR] frontend: add a new analyser to parse a proxied connection The introduction of a new PROXY protocol for proxied connections requires an early analyser to decode the incoming connection and set the session flags accordingly. Some more work is needed, among which setting a flag on the session to indicate it's proxied, and copying the original parameters for later comparisons with new ACLs (eg: real_src, ...).	2010-10-30 19:04:38 +02:00
Willy Tarreau	74172757c7	[MINOR] standard: change arg type from const char* to char* inetaddr_host_lim_ret() used to make use of const char for some args, but that make it impossible ot use char due to the way controls are made by gcc. So let's change that.	2010-10-30 19:04:37 +02:00
Willy Tarreau	4ec83cd939	[MINOR] standard: add read_uint() to parse a delimited unsigned integer This function parses an integer and returns it along with the pointer to the next char not part of the number.	2010-10-30 19:04:37 +02:00
Willy Tarreau	8a95691ae8	[MINOR] listener: add the "accept-proxy" option to the "bind" keyword This option will enable the AN_REQ_DECODE_PROXY analyser on the requests that come from those listeners.	2010-10-30 19:04:37 +02:00
Willy Tarreau	ba4c5be880	[MINOR] cookie: add support for the "preserve" option This option makes haproxy preserve any persistence cookie emitted by the server, which allows the server to change it or to unset it, for instance, after a logout request. (cherry picked from commit 52e6d75374c7900c1fe691c5633b4ae029cae8d5)	2010-10-30 19:04:36 +02:00
Willy Tarreau	c63d4bbff9	[BUG] cookie: correctly unset default cookie parameters When a backend defines a new cookie, it forgot to unset any params that could have been set in a defaults section, resulting in configs that would sometimes refuse to load or not work as expected. (cherry picked from commit f80bf174ed905a29a3ed8ee91fcd528da6df174f)	2010-10-30 19:04:36 +02:00
Willy Tarreau	7f18e52b13	[MINOR] acl: add the http_req_first match This match returns true when the request calling it is the first one of a connection. (cherry picked from commit 922ca979c50653c415852531f36fe409190ad76b)	2010-10-30 19:04:35 +02:00
emeric	8aa6b3762c	[BUG] proto_tcp: potential bug on pattern fetch dst and dport Pattern fetches relying on destination address must first fetch the address if it has not been done yet. (cherry picked from commit 21abf441feb318b2ccd7df590fd89e9e824627f6)	2010-10-30 19:04:35 +02:00
Herv� COMMOWICK	8776f1b3a0	[MINOR] add better support to "mysql-check" The MySQL check has been revamped to be able to send real MySQL data, and to avoid Aborted connects on MySQL side. It is however backward compatible with older version, but it is highly recommended to use the new mode, by adding "user <username>" on the "mysql-check" line. The new check consists in sending two MySQL packet, one Client Authentication packet, with "haproxy" username (by default), and one QUIT packet, to correctly close MySQL session. We then parse the Mysql Handshake Initialisation packet and/or Error packet. It is a basic but useful test which does not produce error nor aborted connect on the server. (cherry picked from commit a1e4dcfe5718311b7653d7dabfad65c005d0439b)	2010-10-30 19:04:35 +02:00
Willy Tarreau	aa2f389cbb	[MINOR] checks: ensure that we can inherit binary checks from the defaults section Health checks were all pure ASCII, but we're going to have to support some binary checks (eg: SQL). When they're inherited from the default section, they will be truncated to the first \0 due to strdup(). Let's fix that with a simple malloc. (cherry picked from commit 98fc04a766bcff80f57db2b1cd865c91761b131b)	2010-10-30 19:04:35 +02:00
Willy Tarreau	53621e0eb6	[BUG] config: report correct keywords for "observe" Keywords were changed just before the commit but not in the help message. Spotted by Hank A. Paulson. (cherry picked from commit fdd46a0766dccec704aa1bd5acb0ac99a801c549)	2010-10-30 19:04:34 +02:00
Willy Tarreau	70461308fe	[MEDIUM] checks: set server state to one state from failure when leaving maintenance When we're enabling a server again (unix CLI or stats interface), we must not mark it completely up because it can take a while before a failure is detected. So we mark it one step above failure, which means it's up but will be marked down upon first failure. (cherry picked from commit 83c3e06452457ed5660fc814cbda5bf878bf19a2)	2010-10-30 19:04:34 +02:00
Cyril Bont�	474be415af	[MEDIUM] stats: add an admin level The stats web interface must be read-only by default to prevent security holes. As it is now allowed to enable/disable servers, a new keyword "stats admin" is introduced to activate this admin level, conditioned by ACLs. (cherry picked from commit 5334bab92ca7debe36df69983c19c21b6dc63f78)	2010-10-30 19:04:34 +02:00
Cyril Bont�	70be45dbdf	[MEDIUM] enable/disable servers from the stats web interface Based on a patch provided by Judd Montgomery, it is now possible to enable/disable servers from the stats web interface. This allows to select several servers in a backend and apply the action to them at the same time. Currently, there are 2 known limitations : - The POST data are limited to one packet (don't alter too many servers at a time). - Expect: 100-continue is not supported. (cherry picked from commit 7693948766cb5647ac03b48e782cfee2b1f14491)	2010-10-30 19:04:34 +02:00
Willy Tarreau	d64d225e52	[BUG] checks: don't log backend down for all zero-weight servers In a down backend, when a zero-weight server is lost, a new "backend down" message was emitted and the down transition of that backend was wrongly increased. This change ensures that we don't count that transition again. This patch should be backported to 1.3. (cherry picked from commit 60efc5f745b5fa70d811f977727592e47e32a281)	2010-10-30 19:04:34 +02:00
Willy Tarreau	ef4f391cc4	[MEDIUM] cookie: set the date in the cookie if needed If a maxidle or maxlife parameter is set on the persistence cookie in insert mode and the client did not provide a recent enough cookie, then we emit a new cookie with a new last_seen date and the same first_seen (if maxlife is set). Recent enough here designates a cookie that would be rounded to the same date. That way, we can refresh a cookie when required without doing it in all responses. If the request did not contain such parameters, they are set anyway. This means that a monitoring request that is forced to a server will get an expiration date anyway, but this should not be a problem given that the client is able to set its cookie in this case. This also permits to force an expiration date on visitors who previously did not have one. If a request comes with a dated cookie while no date check is performed, then a new cookie is emitted with no date, so that we don't risk dropping the user too fast due to a very old date when we re-enable the date check. All requests that were targetting the correct server and which had their expiration date added/updated/removed in the response cookie are logged with the 'U' ("updated") flag instead of the 'I' ("inserted"). So very often we'll see "VU" instead of "VN". (cherry picked from commit 8b3c6ecab6d37be5f3655bc3a2d2c0f9f37325eb)	2010-10-30 19:04:33 +02:00
Willy Tarreau	f64d1410fc	[MEDIUM] cookie: check for maxidle and maxlife for incoming dated cookies If a cookie comes in with a first or last date, and they are configured on the backend, they're checked. If a date is expired or too far in the future, then the cookie is ignored and the specific reason appears in the cookie field of the logs. (cherry picked from commit faa3019107eabe6b3ab76ffec9754f2f31aa24c6)	2010-10-30 19:04:33 +02:00
Willy Tarreau	c01062bead	[MINOR] add encode/decode function for 30-bit integers from/to base64 These functions only require 5 chars to encode 30 bits, and don't expect any padding. They will be used to encode dates in cookies. (cherry picked from commit a7e2b5fc4612994c7b13bcb103a4a2c3ecd6438a)	2010-10-30 19:04:33 +02:00
Willy Tarreau	f1348310e8	[MEDIUM] cookie: reassign set-cookie status flags to store more states The set-cookie status flags were not very handy and limited. Reorder them to save some room for additional values and add the "U" flags (for Updated expiration date) that will be used with expirable cookies in insert mode. (cherry picked from commit 5bab52f821bb0fa99fc48ad1b400769e66196ece)	2010-10-30 19:04:33 +02:00
Willy Tarreau	b761ec4c94	[MINOR] cookie: add the expired (E) and old (O) flags for request cookies These flags will indicate the cookie status when an expiration date is set. (cherry picked from commit 3f0f0e4583a432d34b75bc7b9dd2c756b4e181a7)	2010-10-30 19:04:33 +02:00
Willy Tarreau	bca9969daf	[MEDIUM] cookie: support client cookies with some contents appended to their value In all cookie persistence modes but prefix, we now support cookies whose value is suffixed with some contents after a vertical bar ('\|'). This will be used to pass an optional expiration date. So as of now we only consider the part of the cookie value which is used before the vertical bar. (cherry picked from commit a4486bf4e5b03b5a980d03fef799f6407b2c992d)	2010-10-30 19:04:32 +02:00
Willy Tarreau	3193685865	[MINOR] cookie: add options "maxidle" and "maxlife" Add two new arguments to the "cookie" keyword, to be able to fix a max idle and max life on them. Right now only the parameter parsing is implemented. (cherry picked from commit 9ad5dec4c3bb8f29129f292cb22d3fc495fcc98a)	2010-10-30 19:04:32 +02:00
Willy Tarreau	43961d523f	[MINOR] global: add "tune.chksize" to change the default check buffer size HTTP content-based health checks will be involved in searching text in pages. Some pages may not fit in the default buffer (16kB) and sometimes it might be desired to have larger buffers in order to find patterns. Running checks on smaller URIs is always preferred of course. (cherry picked from commit 043f44aeb835f3d0b57626c4276581a73600b6b1)	2010-10-30 19:04:32 +02:00
Willy Tarreau	bd741540d2	[MEDIUM] checks: add support for HTTP contents lookup This patch adds the "http-check expect [r]{string,status}" statements which enable health checks based on whether the response status or body to an HTTP request contains a string or matches a regex. This probably is one of the oldest patches that remained unmerged. Over the time, several people have contributed to it, among which FinalBSD (first and second implementations), Nick Chalk (port to 1.4), Anze Skerlavaj (tests and fixes), Cyril Bont� (general fixes), and of course myself for the final fixes and doc during integration. Some people already use an old version of this patch which has several issues, among which the inability to search for a plain string that is not at the beginning of the data, and the inability to look for response contents that are provided in a second and subsequent recv() calls. But since some configs are already deployed, it was quite important to ensure a 100% compatible behaviour on the working cases. Thus, that patch fixes the issues while maintaining config compatibility with already deployed versions. (cherry picked from commit b507c43a3ce9a8e8e4b770e52e4edc20cba4c37f)	2010-10-30 19:04:31 +02:00
Gabor Lekeny	b4c81e4c81	[MINOR] checks: add support for LDAPv3 health checks This patch provides a new "option ldap-check" statement to enable server health checks based on LDAPv3 bind requests. (cherry picked from commit b76b44c6fed8a7ba6f0f565dd72a9cb77aaeca7c)	2010-10-30 19:04:31 +02:00
Willy Tarreau	b824b002cd	[MEDIUM] tcp-request : don't wait for inspect-delay to expire when the buffer is full If a request buffer is full, there's no point waiting for the timeout to expire, the contents will not change.	2010-10-30 19:04:31 +02:00
Willy Tarreau	22a9534213	[MEDIUM] make it possible to combine http-pretend-keepalived with httpclose Some configs may involve httpclose in a frontend and http-pretend-keepalive in a backend. httpclose used to take priority over keepalive, thus voiding its effect. This change ensures that when both are combined, keepalive is still announced to the server while close is announced to the client. (cherry picked from commit 2be7ec90fa9caf66294f446423bbab2d00db9004)	2010-10-30 19:04:31 +02:00
Willy Tarreau	e3f284aa7b	[BUILD] proto_http: eliminate some build warnings with gcc-2.95 gcc-2.95 does not like labels before the first case in a switch statement. (cherry picked from commit e1c51a861ba0c389d31dfb010e8b188f5f43313a)	2010-10-30 19:04:31 +02:00
Krzysztof Piotr Oledzki	3bb057170c	[BUG] Restore info about available active/backup servers Bug introduced by `5f5b7d2c1d` This bug was reported by Guido Krause. (cherry picked from commit 0c801d1f5ffdc2fe3d036c1e5203d617507c55c3)	2010-10-30 19:04:30 +02:00
Willy Tarreau	58bd8fd46d	[BUG] stream_sock: try to flush any extra pending request data after a POST Some broken browsers still happen to send a CRLF after a POST. Those which send a CRLF in a second packet have it queued into the system's buffers, which causes an RST to be emitted by some systems upon close of the response (eg: Linux). The client may then receive the RST without the last response segments, resulting in a truncated response. This change leaves request polling enabled on a POST so that we can flush any late data from the request buffers. A more complete workaround would consist in reading from the request for a long time, until we get confirmation that the close has been ACKed. This is much more complex and should only be studied for newer versions. (cherry picked from commit 12e316af4f0245fde12dbc224ebe33c8fea806b2)	2010-10-30 19:04:30 +02:00
Willy Tarreau	fe598a7779	[BUILD] stream_sock: previous fix lacked the #include, causing a warning.	2010-09-21 21:48:23 +02:00
Willy Tarreau	e9f32dbf5c	[BUG] stream_sock: cleanly disable the listener in case of resource shortage Jozsef R.Nagy reported a reliability issue on FreeBSD. Sometimes an error would be emitted, reporting the inability to switch a socket to non-blocking mode and the listener would definitely not accept anything. Cyril Bont� narrowed this bug down to the call to EV_FD_CLR(l->fd, DIR_RD). He was right because this call is wrong. It only disables input events on the listening socket, without setting the listener to the LI_LISTEN state, so any subsequent call to enable_listener() from maintain_proxies() is ignored ! The correct fix consists in calling disable_listener() instead. It is discutable whether we should keep such error path or just ignore the event. The goal in earlier versions was to temporarily disable new activity in order to let the system recover while releasing resources.	2010-09-21 21:14:29 +02:00
Willy Tarreau	74b08c9ab7	[MEDIUM] buffers: rework the functions to exchange between SI and buffers There was no consistency between all the functions used to exchange data between a buffer and a stream interface. Also, the functions used to send data to a buffer did not consider the possibility that the buffer was shutdown for read. Now the functions are called buffer_{put,get}_{char,block,chunk,string}. The old buffer_feed* functions have been left available for existing code but marked deprecated.	2010-09-08 17:04:31 +02:00
Willy Tarreau	d8ccffe0f6	[BUG] stream_interface: only call si->release when both dirs are closed si->release() was called each time we closed one direction of a stream interface, while it should only have been called when both sides are closed. This bug is specific to 1.5 and only affects embedded tasks.	2010-09-07 16:16:50 +02:00
Willy Tarreau	f6e2cc79d8	[BUG] deinit: unbind listeners before freeing them In deinit(), it is possible that we first free the listeners, then unbind them all. Right now this situation can't happen because the only way to call deinit() is to pass via a soft-stop which will already unbind all protocols. But later this might become a problem.	2010-09-03 10:38:17 +02:00
Willy Tarreau	24581bae02	[MEDIUM] http: fix space handling in the response cookie parser This patch addresses exactly the same issues as the previous one, but for responses this time. It also introduces implicit support for the Set-Cookie2 header, for which there's almost nothing specific to do since it is a clean header. This one allows multiple cookies in a same header, by respecting the HTTP messaging semantics. The new parser has been tested with insertion, rewrite, passive, removal, prefixing and captures, and it looks OK. It's still able to rewrite (or delete) multiple cookies at once. Just as with the request parser, it tries hard to fix formating of the cookies it displaces. This patch too should be backported to 1.4 and possibly to 1.3.	2010-09-01 00:02:44 +02:00
Willy Tarreau	eb7b0a2b56	[MEDIUM] http: fix space handling in the request cookie parser The request cookie parser did not allow spaces to appear in cookie values nor around the equal sign. The various RFCs on the subject say different things, some suggesting that a space is allowed after the equal sign and being worded in a way that lets one believe it is allowed before too. Some spaces may appear inside values and be part of the values. The quotes allow delimiters to be embedded in values. The spaces before and after attributes should be trimmed. The new parser addresses all those points and has been carefully tested. It fixes misplaced spaces around equal signs before processing the cookies or forwarding them. It also tries its best to perform clean removals by always keeping the delimiter after the value being removed and leaving one space after it. The variable inside the parser have been renamed to make the code a lot more understandable, and one multi-function pointer has been eliminated. Since this patch fixes real possible issues, it should be backported to 1.4 and possibly 1.3, since one (single) case of wrong spaces has been reported in 1.3. The code handling the Set-Cookie has not been touched yet.	2010-09-01 00:02:21 +02:00
Willy Tarreau	af7ad00a99	[MINOR] support a global jobs counter This counter is incremented for each incoming connection and each active listener, and is used to prevent haproxy from stopping upon SIGUSR1. It will thus be possible for some tasks in increment this counter in order to prevent haproxy from dying until they have completed their job.	2010-08-31 15:39:26 +02:00
Willy Tarreau	0f7f51fbe0	[BUG] http: don't consider commas as a header delimitor within quotes The header parser has a bug which causes commas to be matched within quotes while it was not expected. The way the code was written could make one think it was OK. The resulting effect is that the following config would use the second IP address instead of the third when facing this request : source 0.0.0.0 usesrc hdr_ip(X-Forwarded-For,2) GET / HTTP/1.0 X-Forwarded-for: "127.0.0.1, 127.0.0.2", 127.0.0.3 This fix must be backported to 1.4 and 1.3.	2010-08-30 11:06:34 +02:00
Willy Tarreau	92aa1fac0a	[BUG] http: don't set auto_close if more data are expected Fix `4fe4190278` was a bit too strong. It has caused some chunked-encoded responses to be truncated when a recv() call could return multiple chunks followed by a close. The reason is that when a chunk is parsed, only its contents are scheduled to be forwarded. Thus, the reader sees auto_close+shutr and sets shutw_now. The sender in turn sends the last scheduled data and does shutw(). Another nasty effect is that it has reduced the keep-alive rate. If a response did not completely fit into the buffer, then the auto_close bit was left on and the sender would close upon completion. The fix consists in not making use of auto_close when chunked encoding is used nor when keep-alive is used, which makes sense. However it is maintained on error processing. Thanks to Cyril Bont� for reporting the issue early.	2010-08-28 19:06:28 +02:00
Willy Tarreau	d0807c3c60	[MEDIUM] signals: support redistribution of signal zero when stopping Signal zero is never delivered by the system. However having a signal to which functions and tasks can subscribe to be notified of a stopping event is useful. So this patch does two things : 1) allow signal zero to be delivered from any function of signal handler 2) make soft_stop() deliver this signal so that tasks can be notified of a stopping condition.	2010-08-27 18:26:11 +02:00
Willy Tarreau	24f4efa670	[MEDIUM] signals: add support for registering functions and tasks The two new functions below make it possible to register any number of functions or tasks to a system signal. They will be called in the registration order when the signal is received. struct sig_handler signal_register_fct(int sig, void (fct)(struct sig_handler ), int arg); struct sig_handler signal_register_task(int sig, struct task *task, int reason);	2010-08-27 18:00:40 +02:00
Willy Tarreau	bb545b4cfc	[MINOR] startup: don't wait for nothing when no old pid remains In case of binding failure during startup, we wait for some time sending signals to old pids so that they release the ports we need. But if there aren't any old pids anymore, it's useless to wait, we prefer to fail fast. Along with this change, we now have the number of old pids really found in the nb_oldpids variable.	2010-08-25 12:58:59 +02:00
Willy Tarreau	d137dd3151	[MINOR] startup: release unused structs after forking Don't keep the old pid list or chroot place after startup, they won't be used anymore.	2010-08-25 12:52:29 +02:00
Willy Tarreau	fb024dc1c9	[BUG] conf: add tcp-request content rules to the correct list Due to the change in commit 68c03, the tcp-request content rules were unfortunately being added to the request rules.	2010-08-20 13:35:41 +02:00
Willy Tarreau	07e9e64a34	[BUG] stats: global stats timeout may be specified before stats socket. If the global stats timeout statement was found before the stats socket (or without), the parser would crash because the stats frontend was not initialized. Now we have an allocation function which solves the issue. This bug was introduced with 1.4 so it does not need backporting. (was commit 1c5819d2498ae3643c3880507847f948a53d2773 in 1.4)	2010-08-17 21:55:54 +02:00
Willy Tarreau	d132f746f2	[BUG] queue: don't dequeue proxy-global requests on disabled servers If a server is disabled or tracking a disabled server, it must not dequeue requests pending in the proxy queue, it must only dequeue its own ones. The problem that was caused is that if a backend always had requests in its queue, a disabled server would continue to take traffic forever. (was commit 09d02aaf02d1f21c0c02672888f3a36a14bdd299 in 1.4)	2010-08-17 21:39:07 +02:00
Cyril Bont�	4d179ebd21	[BUG] stats: session rate limit gets garbaged in the stats The statistics page (the HTML one) displays a garbage value on frontends using "rate-limit session" in HTTP mode. This is due to the usage of the same buffer for the macros converting the max session rate and the limit. Steps to reproduce : Configuration file example : listen bug :80 mode http rate-limit sessions stats enable Then start refreshing the statistics page. This bug was introduced just before the release of haproxy 1.4.0. (was commit 6cfaf9e91969c87a9eab1d58a15d2d0a3f346c9b in 1.4)	2010-08-17 21:38:25 +02:00
Willy Tarreau	5c54c71463	[MEDIUM] http: forward client's close when abortonclose is set While it's usually desired to wait for a server response even when the client closes its request channel, it can be problematic with long polling requests. In order to let the server decide what to do in such a case, if option abortonclose is set, we simply forward the shutdown to the server. That way, it can decide to take the appropriate action. Most servers will still process the request, while some will probably want to abort. Obviously, this only works as long as the client has not sent another pipelined request over the same connection. (was commit 0e25d86da49827ff6aa3c94132c01292b5ba4854 in 1.4)	2010-08-17 21:37:51 +02:00
Willy Tarreau	df39e955c0	[CLEANUP] stats: use stksess_kill() to remove table entries Using it will be more reliable in the long term as we'll only have to modify stksess_kill() if we want to extend the tables.	2010-08-10 18:04:16 +02:00
Willy Tarreau	0a4838cd31	[MEDIUM] session-counters: correctly unbind the counters tracked by the backend In case of HTTP keepalive processing, we want to release the counters tracked by the backend. Till now only the second set of counters was released, while it could have been assigned by the frontend, or the backend could also have assigned the first set. Now we reuse to unused bits of the session flags to mark which stick counters were assigned by the backend and to release them as appropriate.	2010-08-10 18:04:16 +02:00
Willy Tarreau	56123282ef	[MINOR] session-counters: use "track-sc{1,2}" instead of "track-{fe,be}-counters" The assumption that there was a 1:1 relation between tracked counters and the frontend/backend role was wrong. It is perfectly possible to track the track-fe-counters from the backend and the track-be-counters from the frontend. Thus, in order to reduce confusion, let's remove this useless {fe,be} reference and simply use {1,2} instead. The keywords have also been renamed in order to limit confusion. The ACL rule action now becomes "track-sc{1,2}". The ACLs are now "sc{1,2}_" instead of "trk{fe,be}_". That means that we can reasonably document "sc1" and "sc2" (sticky counters 1 and 2) as sort of patterns that are available during the whole session's life and use them just like any other pattern.	2010-08-10 18:04:15 +02:00
Willy Tarreau	9e9879a263	[MEDIUM] session-counters: make it possible to count connections from frontend In case a "track-be-counters" rule is referenced in the frontend, count it so that the connection counts are correct.	2010-08-10 18:04:15 +02:00
Willy Tarreau	68c03aba9e	[MEDIUM] config: replace 'tcp-request <action>' with "tcp-request connection" It began to be problematic to have "tcp-request" followed by an immediate action, as sometimes it was a keyword indicating a hook or setting ("content" or "inspect-delay") and sometimes it was an action. Now the prefix for connection-level tcp-requests is "tcp-request connection" and the ones processing contents remain "tcp-request contents". This has allowed a nice simplification of the config parser and to clean up the doc a bit. Also now it's a bit more clear why tcp-request connection are not allowed in backends.	2010-08-10 18:04:15 +02:00
Willy Tarreau	f6efda1189	[MEDIUM] session counters: automatically remove expired entries. When a ref_cnt goes down to zero and the entry is expired, remove it.	2010-08-10 18:04:15 +02:00
Willy Tarreau	d1f9652d90	[MEDIUM] tcp: accept the "track-counters" in "tcp-request content" rules Doing so allows us to track counters from backends or depending on contents. For instance, it now becomes possible to decide to track a connection based on a Host header if enough time is granted to parse the HTTP request. It is also possible to just track frontend counters in the frontend and unconditionally track backend counters in the backend without having to write complex rules. The first track-fe-counters rule executed is used to track counters for the frontend, and the first track-be-counters rule executed is used to track counters for the backend. Nothing prevents a frontend from setting a track-be rule nor a backend from setting a track-fe rule. In fact these rules are arbitrarily split between FE and BE with no dependencies.	2010-08-10 18:04:15 +02:00
Willy Tarreau	f059a0f63a	[MAJOR] session-counters: split FE and BE track counters Having a single tracking pointer for both frontend and backend counters does not work. Instead let's have one for each. The keyword has changed to "track-be-counters" and "track-fe-counters", and the ACL "trk_" changed to "trkfe_" and "trkbe_*".	2010-08-10 18:04:15 +02:00
Willy Tarreau	4f3f01fa39	[MEDIUM] stats: add the ability to dump table entries matching criteria It is now possible to dump some select table entries based on criteria which apply to the stored data. This is enabled by appending the following options to the end of the "show table" statement : data.<data_type> {eq\|ne\|lt\|gt\|le\|ge} <value> For intance : show table http_proxy data.conn_rate gt 5 show table http_proxy data.gpc0 ne 0 The compare applies to the integer value as it would be displayed, and operates on signed long long integers.	2010-08-10 18:04:14 +02:00
Willy Tarreau	603861ed9d	[MINOR] stats: correctly report errors on "show table" and "clear table" "show table XXX" did not report that the table did not exist, and errors produced by "clear table" missed the trailing "\n".	2010-08-10 18:04:14 +02:00
Willy Tarreau	3b9c6e053e	[MEDIUM] stick-table: make use of generic types for stored data It's a bit cumbersome to have to know all possible storable types from the stats interface. Instead, let's have generic types for all data, which will facilitate their manipulation.	2010-08-10 18:04:14 +02:00
Willy Tarreau	88ee39758a	[MEDIUM] stats: add "clear table <name> key <value>" to clear table entries This feature will be required at some point, when the stick tables are used to enforce security measures. For instance, some visitors may be incorrectly flagged as abusers and would ask the site admins to remove their entry from the table.	2010-08-10 18:04:14 +02:00
Willy Tarreau	69f58c8058	[MEDIUM] stats: add "show table [<name>]" to dump a stick-table It is now possible to dump a table's contents with keys, expire, use count, and various data using the command above on the stats socket. "show table" only shows main table stats, while "show table <name>" dumps table contents, only if the socket level is admin.	2010-08-10 18:04:14 +02:00
Willy Tarreau	da7ff64aa9	[MEDIUM] session-counters: add HTTP req/err tracking This patch adds support for the following session counters : - http_req_cnt : HTTP request count - http_req_rate: HTTP request rate - http_err_cnt : HTTP request error count - http_err_rate: HTTP request error rate The equivalent ACLs have been added to check the tracked counters for the current session or the counters of the current source.	2010-08-10 18:04:14 +02:00
Willy Tarreau	c3bd972cda	[MINOR] session-counters: add a general purpose counter (gpc0) This counter may be used to track anything. Two sets of ACLs are available to manage it, one gets its value, and the other one increments its value and returns it. In the second case, the entry is created if it did not exist. Thus it is possible for example to mark a source as being an abuser and to keep it marked as long as it does not wait for the entry to expire : # The rules below use gpc0 to track abusers, and reject them if # a source has been marked as such. The track-counters statement # automatically refreshes the entry which will not expire until a # 1-minute silence is respected from the source. The second rule # evaluates the second part if the first one is true, so GPC0 will # be increased once the conn_rate is above 100/5s. stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request track-counters src tcp-request reject if { trk_get_gpc0 gt 0 } tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} Alternatively, it is possible to let the entry expire even in presence of traffic by swapping the check for gpc0 and the track-counters statement : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} It is also possible not to track counters at all, but entry lookups will then be performed more often : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request reject if { src_conn_rate gt 100 } { src_inc_gpc0 gt 0} The '0' at the end of the counter name is there because if we find that more counters may be useful, other ones will be added.	2010-08-10 18:04:14 +02:00
Willy Tarreau	1f7e925d6a	[MINOR] stktable: add a stktable_update_key() function This function looks up a key, updates its expiration date, or creates it if it was not found. acl_fetch_src_updt_conn_cnt() was updated to make use of it.	2010-08-10 18:04:14 +02:00
Willy Tarreau	6c59e0a942	[MEDIUM] session counters: add bytes_in_rate and bytes_out_rate counters These counters maintain incoming and outgoing byte rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain bandwidth limits per source address for instance, and block if the rate is passed over. Since 32-bit counters are used to compute the rates, it is important not to use too long periods so that we don't have to deal with rates above 4 GB per period. Example : # block if more than 5 Megs retrieved in 30 seconds from a source. stick-table type ip size 200k expire 1m store bytes_out_rate(30s) tcp-request track-counters src tcp-request reject if { trk_bytes_out_rate gt 5000000 } # cause a 15 seconds pause to requests from sources in excess of 2 megs/30s tcp-request inspect-delay 15s tcp-request content accept if { trk_bytes_out_rate gt 2000000 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	91c43d7fe4	[MEDIUM] session counters: add conn_rate and sess_rate counters These counters maintain incoming connection rates and session rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain accept rate per source address for instance, and block if the rate is passed over. Example : # block if more than 50 requests per 5 seconds from a source. stick-table type ip size 200k expire 1m store conn_rate(5s),sess_rate(5s) tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 50 } # cause a 3 seconds pause to requests from sources in excess of 20 requests/5s tcp-request inspect-delay 3s tcp-request content accept if { trk_sess_rate gt 20 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	ac78288eaf	[MEDIUM] stick-tables: add stored data argument type checking We're now able to return errors based on the validity of an argument passed to a stick-table store data type. We also support ARG_T_DELAY to pass delays to stored data types (eg: for rate counters).	2010-08-10 18:04:13 +02:00
Willy Tarreau	888617dc3b	[MEDIUM] stick-tables: add support for arguments to data_types Some data types will require arguments (eg: period for a rate counter). This patch adds support for such arguments between parenthesis in the "store" directive of the stick-table statement. Right now only integers are supported.	2010-08-10 18:04:13 +02:00
Willy Tarreau	b084e9ccb9	[MINOR] config: support a comma-separated list of store data types in stick-table Sometimes we need to store many data types in stick-tables. Let's support a comma-separated list instead of repeating "store" with each keyword.	2010-08-10 18:04:13 +02:00
Willy Tarreau	f4d17d9071	[MEDIUM] session: add a counter on the cumulated number of sessions Sessions are like connections but they have been accepted by L4 rules and really became sessions.	2010-08-10 18:04:13 +02:00
Willy Tarreau	1aa006fe7a	[MINOR] session: add trk_kbytes_* ACL keywords to track data size These one apply to the entry being tracked by current session.	2010-08-10 18:04:13 +02:00
Willy Tarreau	9b0ddcfd84	[MINOR] session: add the trk_conn_cur ACL keyword to track concurrent connection This one applies to the entry being tracked by current session.	2010-08-10 18:04:13 +02:00
Willy Tarreau	9a3f849371	[MINOR] session: add the trk_conn_cnt ACL keyword to track connection counts Most of the time we'll want to check the connection count of the criterion we're currently tracking. So instead of duplicating the src* tests, let's add trk_conn_cnt to report the total number of connections from the stick table entry currently being tracked. A nice part of the code was factored, and we should do the same for the other criteria.	2010-08-10 18:04:12 +02:00
Willy Tarreau	855e4bbcc7	[MEDIUM] session: add data in and out volume counters The new "bytes_in_cnt" and "bytes_out_cnt" session counters have been added. They're automatically updated when session counters are updated. They can be matched with the "src_kbytes_in" and "src_kbytes_out" ACLs which apply to the volume per source address. This can be used to deny access to service abusers.	2010-08-10 18:04:12 +02:00
Willy Tarreau	38285c18f4	[MEDIUM] session: add concurrent connections counter The new "conn_cur" session counter has been added. It is automatically updated upon "track XXX" directives, and the entry is touched at the moment we increment the value so that we don't consider further counter updates as real updates, otherwise we would end up updating upon completion, which may not be desired. Probably that some other event counters (eg: HTTP requests) will have to be updated upon each event though. This counter can be matched against current session's source address using the "src_conn_cur" ACL.	2010-08-10 18:04:12 +02:00
Willy Tarreau	8b22a71a4d	[MEDIUM] session: move counter ACL fetches from proto_tcp It was not normal to have counter fetches in proto_tcp.c. The only reason was that the key based on the source address was fetched there, but now we have split the key extraction and data processing, we must move that to a more appropriate place. Session seems OK since the counters are all manipulated from here. Also, since we're precisely counting number of connections with these ACLs, we rename them src_conn_cnt and src_updt_conn_cnt. This is not a problem right now since no version was emitted with these keywords.	2010-08-10 18:04:12 +02:00
Willy Tarreau	8fb12c4b61	[MINOR] stick-table: use suffix "_cnt" for cumulated counts The "_cnt" suffix is already used by ACLs to count various data, so it makes sense to use the same one in "conn_cnt" instead of "conn_cum" to count cumulated connections. This is not a problem because no version was emitted with those keywords. Thus we'll try to stick to the following rules : xxxx_cnt : cumulated event count for criterion xxxx xxxx_cur : current number of concurrent entries for criterion xxxx xxxx_rate: event rate for criterion xxxx	2010-08-10 18:04:12 +02:00
Willy Tarreau	4a0347add0	[MINOR] stick-table: provide a table lookup function We'll often need to lookup a table by its name. This will change in the future once we can resolve these names on startup.	2010-08-10 18:04:12 +02:00
Willy Tarreau	9ba2dcc86c	[MAJOR] session: add track-counters to track counters related to the session This patch adds the ability to set a pointer in the session to an entry in a stick table which holds various counters related to a specific pattern. Right now the syntax matches the target syntax and only the "src" pattern can be specified, to track counters related to the session's IPv4 source address. There is a special function to extract it and convert it to a key. But the goal is to be able to later support as many patterns as for the stick rules, and get rid of the specific function. The "track-counters" directive may only be set in a "tcp-request" statement right now. Only the first one applies. Probably that later we'll support multi-criteria tracking for a single session and that we'll have to name tracking pointers. No counter is updated right now, only the refcount is. Some subsequent patches will have to bring that feature.	2010-08-10 18:04:12 +02:00
Willy Tarreau	171819b5d7	[MINOR] tcp: src_count acl does not have a permanent result This ACL's count can change along the session's life because it depends on other sessions' activity. Switch it to volatile since any session could appear while evaluating the ACLs.	2010-08-10 18:04:11 +02:00
Willy Tarreau	591fedc2c3	[MEDIUM] buffer: make buffer_feed* support writing non-contiguous chunks The buffer_feed* functions that are used to send data to buffers did only support sending contiguous chunks while they're relying on memcpy(). This patch improves on this by making them able to write in two chunks if needed. Thus, the buffer_almost_full() function has been improved to really consider the remaining space and not just what can be written at once.	2010-08-10 17:48:57 +02:00
Willy Tarreau	3488e2548f	[MAJOR] stream_interface: fix the wakeup conditions for embedded iohandlers Now we stop relying on BF_READ_DONTWAIT, which is unrelated to the wakeups, and only consider activity to decide whether to wake the task up instead of considering the other side's activity. It is worth noting that the local stream interface's flags were not updated consecutively to a call to chk_snd(), which could possibly result in hung tasks from time to time. This fix will avoid possible loops and uncaught events.	2010-08-10 17:47:17 +02:00
Willy Tarreau	fb35620e87	[MEDIUM] session: support "tcp-request content" rules in backends Sometimes it's necessary to be able to perform some "layer 6" analysis in the backend. TCP request rules were not available till now, although documented in the diagram. Enable them in backend now.	2010-08-10 14:10:58 +02:00
Willy Tarreau	6df7a0e7d3	[MINOR] http: reset analysers to listener's, not frontend's When resetting a session's request analysers, we must take them from the listener, not from the frontend. At the moment there is no difference but this might change.	2010-08-10 14:04:42 +02:00
Willy Tarreau	815a9b2039	[BUG] session: analysers must be checked when SI state changes Since the BF_READ_ATTACHED bug was fixed, a new issue surfaced. When a connection closes on the return path in tunnel mode while the request input is already closed, the request analyser which is waiting for a state change never gets woken up so it never closes the request output. This causes stuck sessions to remain indefinitely. One way to reliably reproduce the issue is the following (note that the client expects a keep-alive but not the server) : server: printf "HTTP/1.0 303\r\n\r\n" \| nc -lp8080 client: printf "GET / HTTP/1.1\r\n\r\n" \| nc 127.1 2500 The reason for the issue is that we don't wake the analysers up on stream interface state changes. So the least intrusive and most reliable thing to do is to consider stream interface state changes to call the analysers. We just need to remember what state each series of analysers have seen and check for the differences. In practice, that works. A later improvement later could consist in being able to let analysers state what they're interested to monitor : - left SI's state - right SI's state - request buffer flags - response buffer flags That could help having only one set of analysers and call them once status changes.	2010-08-10 14:04:28 +02:00
Willy Tarreau	5af1fa1df0	[MAJOR] stream_sock: better wakeup conditions on read() After a read, there was a condition to mandatorily wake the task up if the BF_READ_DONTWAIT flag was set. This was wrong because the wakeup condition in this case can be deduced from the other ones. Another condition was put on the other side not being in SI_ST_EST state. It is not appropriate to do this because it causes a useless wakeup at the beginning of every first request in case of speculative polling, due to the fact that we don't read anything and that the other side is still in SI_ST_INI. Also, the wakeup was performed whenever to_forward was null, which causes an unexpected wakeup upon the first read for the same reason. However, those two conditions are valid if and only if at least one read was performed. Also, the BF_SHUTR flag was tested as part of the wakeup condition, while this one can only be set if BF_READ_NULL is set too. So let's simplify this ambiguous test by removing the BF_SHUTR part from the condition to only process events. Last, the BF_READ_DONTWAIT flag was unconditionally cleared, while sometimes there would have been no I/O. Now we only clear it once the I/O operation has been performed, which maintains its validity until the I/O occurs. Finally, those fixes saved approximately 16% of the per-session wakeups and 20% of the epoll_ctl() calls, which translates into slightly less under high load due to the request often being ready when the read() occurs. A performance increase between 2 and 5% is expected depending on the workload. It does not seem necessary to backport this change to 1.4, eventhough it fixes some performance issues. It may later be backported if required to fix something else because the risk of regression seems very low due to the fact that we're more in line with the documented semantics.	2010-08-10 14:04:09 +02:00
Willy Tarreau	1c7cc5bf95	[MEDIUM] acl: make use of get_std_op() to parse intger ranges Using the common operator parser for the ACLs saves about 1.5 kB of code.	2010-08-10 14:03:40 +02:00
Willy Tarreau	5b18020201	[MINOR] tools: add a get_std_op() function to parse operators We already have several places where we use operators to compare values. Each time the parsing is done again. Let's have a central function for this.	2010-08-10 14:03:25 +02:00
Willy Tarreau	bb695393da	[BUG] http: denied requests must not be counted as denied resps in listeners Socket stats had a wrong counter. This harmless bugfix must be backported to 1.4.	2010-08-10 14:02:54 +02:00
Willy Tarreau	2970b0bedf	[MINOR] freq_ctr: add new types and functions for periods different from 1s Some freq counters will have to work on periods different from 1 second. The original freq counters rely on the period to be exactly one second. The new ones (freq_ctr_period) let the user define the period in ticks, and all computations are operated over that period. When reading a value, it indicates the amount of events over that period too.	2010-08-10 14:01:09 +02:00
Willy Tarreau	7a20aa6e6b	[MEDIUM] session: make it possible to call an I/O handler on both SI This will be used when an I/O handler running in a stream interface needs to establish a connection somewhere. We want the session processor to evaluate both I/O handlers, depending on which side has one. Doing so also requires that stream_int_update_embedded() wakes the session up only when the other side is established or has closed, for instance in order to handle connection errors without looping indefinitely during the connection setup time. The session processor still relies on BF_READ_ATTACHED being set, though we must do whatever is required to remove this dependency.	2010-07-13 16:34:26 +02:00
Willy Tarreau	0bd05eaf24	[MEDIUM] stream-interface: add a ->release callback When a connection is closed on a stream interface, some iohandlers will need to be informed in order to release some resources. This normally happens upon a shutr+shutw. It is the equivalent of the fd_delete() call which is done for real sockets, except that this time we release internal resources. It can also be used with real sockets because it does not cost anything else and might one day be useful.	2010-07-13 16:06:23 +02:00
Willy Tarreau	e8f6338c5d	[BUG] stick-table: correctly refresh expiration timers The store operation did not correctly refresh the expiration timer on the stick entry. It did so on the temporary one instead.	2010-07-13 15:20:24 +02:00
Willy Tarreau	d669a4f72b	[MEDIUM] backend: support servers on 0.0.0.0 Till now when a server was configured with address 0.0.0.0, the connection was forwarded to this address which generally is intercepted by the system as a local address, so this was completely useless. One sometimes useful feature for outgoing transparent proxies is to be able to forward the connection to the same address the client requested. This patch fixes the meaning of 0.0.0.0 precisely to ensure that the connection will be forwarded to the initial client's destination address.	2010-07-13 14:57:52 +02:00
Willy Tarreau	2a164ee549	[BUG] stick_table: the fix for the memory leak caused a regression (cherry picked from commit 61ba936e6858dfcf9964d25870726621d8188fb9) [ note: the bug was finally not present in 1.5-dev but at least we have to reset store_count to be compatible with 1.4 ] Commit d6e9e3b5e320b957e6c491bd92d91afad30ba638 caused recently created entries to be removed as soon as they were created, breaking stickiness. It is not clear whether a use-after-free was possible or not in this case. This bug was reported by Ben Congleton and narrowed down by Herv� Commowick, both of whom also tested the fix. Thanks to them !	2010-06-18 09:57:45 +02:00
Willy Tarreau	acf9577350	[MINOR] config: provide a function to quote args in a more friendly way The quote_arg() function can be used to quote an argument or indicate "end of line" if it's null or empty. It should be useful to more precisely report location of problems in the configuration.	2010-06-14 19:09:21 +02:00
Willy Tarreau	f535683123	[BUG] config: report the correct proxy type in tcp-request errors A copy-paste typo caused a wrong proxy's type to be reported in case of parsing errors.	2010-06-14 18:40:26 +02:00
Willy Tarreau	6a984fa7c1	[CLEANUP] proto_tcp: make the config parser a little bit more flexible We'll need to let the tcp-request parser able to delegate parsing of track-counters to a commun function, let's prepare it.	2010-06-14 16:44:27 +02:00
Willy Tarreau	5214be1b22	[MINOR] session: add a pointer to the tracked counters for the source We'll have to keep counters of various criteria specific to the session's source. When we get one, keep a pointer to it in the session.	2010-06-14 15:32:18 +02:00
Willy Tarreau	e7f3d7ab9f	[MEDIUM] stick-tables: add a reference counter to each entry We'll soon have to maintain links from sessions to entries, so let's add a refcount in entries to avoid purging them if it's not null.	2010-06-14 15:10:26 +02:00
Willy Tarreau	cb18364ca7	[MEDIUM] stick_table: separate storage and update of session entries When an entry already exists, we just need to update its expiration timer. Let's have a dedicated function for that instead of spreading open code everywhere. This change also ensures that an update of an existing sticky session really leads to an update of its expiration timer, which was apparently not the case till now. This point needs to be checked in 1.4.	2010-06-14 15:10:26 +02:00
Willy Tarreau	a975b8f381	[MINOR] tcp: add per-source connection rate limiting This change makes use of the stick-tables to keep track of any source address activity. Two ACLs make it possible to check the count of an entry or update it and act accordingly. The typical usage will be to reject a TCP request upon match of an excess value.	2010-06-14 15:10:25 +02:00
Willy Tarreau	41883e2041	[MINOR] stick_table: export the stick_table_key This one is huge and will be needed by other portions of code for various data lookups. Let's not have them allocate it in the stack.	2010-06-14 15:10:25 +02:00
Willy Tarreau	c00cdc2eb0	[MINOR] stick_table: enable it for frontends too A frontend may very well host a stick-table. In fact it will be useful with connection throttling.	2010-06-14 15:10:25 +02:00
Willy Tarreau	13c29dee21	[MEDIUM] stick_table: move the server ID to a generic data type The server ID is now stored just as any other data type. It is only allocated if needed and is manipulated just like the other ones.	2010-06-14 15:10:25 +02:00
Willy Tarreau	68129b90eb	[MINOR] stick_table: provide functions to return stksess data from a type This function does the indirection job in the table to find the pointer to the real data matching the requested type.	2010-06-14 15:10:25 +02:00
Willy Tarreau	056f5683e3	[MINOR] config: initialize stick tables after all the parsing We'll be able to add data types to stick tables while parsing their users, so let's initialize them at the end.	2010-06-14 15:10:24 +02:00
Willy Tarreau	f16d2b8c1b	[MEDIUM] stick_table: don't overwrite data when storing an entry Till now sticky sessions only held server IDs. Now there are other data types so it is not acceptable anymore to overwrite the server ID when writing something. The server ID must then only be written from the caller when appropriate. Doing this has also led to separate lookup and storage.	2010-06-14 15:10:24 +02:00

... 14 15 16 17 18 ...

2725 Commits