haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 16:17:09 +02:00

Author	SHA1	Message	Date
Willy Tarreau	b025325274	[MINOR] stream_sock_data_finish() should not expose fd stream_sock_data_finish was still using a file descriptor as only argument, while a stream interface is preferred. This is now fixed.	2008-11-30 21:37:12 +01:00
Willy Tarreau	42ffbf248b	[CLEANUP] session.c: removed some migration left-overs in sess_establish() A few obsolete fd manipulations were left in sess_establish. Obviously they must go away.	2008-11-30 21:13:54 +01:00
Willy Tarreau	0cac36f415	[MEDIUM] make the http server error function a pointer in the session It was a bit awkward to have session.c call return_srv_error() for HTTP error messages related to servers. The function has been adapted to be passed a pointer to the faulty stream interface, and is now a pointer in the session. It is possible that in the future, it will become a callback in the stream interface itself.	2008-11-30 20:44:17 +01:00
Willy Tarreau	2d3d94cf23	[MINOR] replace srv_close_with_err() with http_server_error() The new function looks like the previous one except that it operates at the stream interface level and assumes an already closed SI. Also remove some old unused occurrences of srv_close_with_err().	2008-11-30 20:28:57 +01:00
Willy Tarreau	dded32defa	[MINOR] replace client_retnclose() with stream_int_retnclose() This makes more sense to return a message to a stream interface than to a session. senddata.{c,h} have been removed.	2008-11-30 19:48:07 +01:00
Willy Tarreau	81acfab4fd	[MINOR] replace the ambiguous client_return function by stream_int_return This one applies to a stream interface, which makes more sense.	2008-11-30 19:22:53 +01:00
Willy Tarreau	a5555ec68a	[MINOR] call session->do_log() for logging In order to avoid having to call per-protocol logging function directly from session.c, it's better to assign the logging function when the session is created. This also eliminates a test when the function is needed, and opens the way to more complete logging functions.	2008-11-30 19:02:32 +01:00
Willy Tarreau	55a8d0e1bb	[CLEANUP] move the session-related functions to session.c proto_http.c was not suitable for session-related processing, it was just convenient for the transformation. Some more splitting must occur: process_request/response in proto_http.c must be split again per protocol, and the caller must run a list. Some functions should be directly attached to the session or the buffer (eg: perform_http_redirect, return_srv_error, http_sess_log).	2008-11-30 18:47:21 +01:00
Willy Tarreau	fe3718ab79	[MAJOR] complete layer4/7 separation All the processing has now completely been split in layers. As of now, everything is still in process_session() which is not the right place, but the code sequence works. Timeouts, retries, errors, all work. The shutdown sequence has been strictly applied: BF_SHUTR/BF_SHUTW are only assigned by lower layers. Upper layers can only indicate their wish to close using BF_SHUTR_NOW and BF_SHUTW_NOW. When a shutdown is performed on a stream interface, the buffer flags are updated accordingly and re-checked by upper layers. A lot of care has been taken to ensure that aborts during intermediate connection setups are correctly handled and shutdowns correctly propagated to both buffers. A future evolution would consist in ensuring that BF_SHUT?_NOW may be set at any time, and applies only when the buffer is empty. This might help with error messages, but might complicate the processing of data remaining in buffers. Some useless buffer flag combinations have been removed. Stat counters are still broken (eg: per-server total number of sessions). Error messages should be delayed to the close instant and be produced by protocol. Many functions must now move to proper locations.	2008-11-30 18:14:12 +01:00
Willy Tarreau	99126c35c1	[MEDIUM] make the stream interface control the SHUT{R,W} bits It's better that the stream interface controls the BF_SHUT* bits so that they always reflect the real state of the interface.	2008-11-27 22:32:14 +01:00
Willy Tarreau	8bfa426cad	[MEDIUM] process shutw during connection attempt It sometimes happens that a connection is aborted at the exact same moment it establishes. We have to close the socket and not only to shut it down for writes. Some corner cases remain. We have to handle the shutr/shutw at the stream interface and only report the status to the buffer, not the opposite.	2008-11-27 09:25:45 +01:00
Willy Tarreau	b38903cf3c	[BUG] shutw must imply close during a connect The sessions which remained stuck were still connecting to the server when they received a shutw, which caused them to partially stop. A shutw() during a connect() must imply a close().	2008-11-23 21:33:29 +01:00
Willy Tarreau	f54f8bdd8d	[MINOR] maintain a global session list in order to ease debugging Now the global variable 'sessions' will be a dual-linked list of all known sessions. The list element is set at the beginning of the session so that it's easier to follow them all with gdb.	2008-11-23 19:53:55 +01:00
Willy Tarreau	0a5d5ddeb9	[MEDIUM] remove stream_sock_update_data() Two new functions are used instead : buffer_check_{shutr,shutw}. It is indeed more adequate to check for new closures only when the buffer reports them. Several remaining unclosed connections were detected after a test, even before this patch, so a bug remains. To reproduce, try the following during 30 seconds : inject30l4 -n 20000 -l -t 1000 -P 10 -o 4 -u 100 -s 100 -G 127.0.0.1:8000/	2008-11-23 19:31:35 +01:00
Willy Tarreau	74ab2ac7b0	[MEDIUM] stream_interface: added a DISconnected state between CON/EST and CLO There were rare situations where it was not easy to detect that a failed session attempt had occurred and needed some server cleanup. In particular, client aborts sometimes lead to session leaks on the server side. A new state "SI_ST_DIS" (disconnected) has been introduced for this. When a session has been closed at a stream interface but the server cleanup has not occurred, this state is entered instead of CLO. The cleanup is then performed there and the state goes to CLO. A new diagram has been added to show possible stream_interface state transitions that can occur in a stream-sock. It makes debugging easier.	2008-11-23 17:23:07 +01:00
Willy Tarreau	4351b3a4ca	[MEDIUM] continue layering cleanups. The server sessions are now only decremented when entering SI_ST_CER and SI_ST_CLO states. A state is clearly missing between EST and CLO, or after CLO (eg: END), because many cleanups are performed upon CLO and must rely on tricks to ensure being done only once. The goal of next changes will be to improve what has been started. Ideally, the FD should only notify the SI about the change, which should itself only notify the session when it has some news or when it needs help (eg: redispatch). The buffer's error processing should not change the FD's status immediately, otherwise we risk race conds between a pending connect and a shutw (for instance). Also, the new connect attempt should only be made after layer 7 and all the crap above buffers.	2008-11-12 01:51:41 +01:00
Willy Tarreau	1e62de615b	[MEDIUM] add the SN_CURR_SESS flag to the session to track open sessions It is quite hard to track when the current session has already been counted or discounted from the server's total number of established sessions. For this reason, we introduce a new session flag, SN_CURR_SESS, which indicates if the current session is one of those reported by the server or not. It simplifies session accounting and makes it far more robust. It also makes it possible to perform a last-minute cleanup during session_free(). Right now, with this fix and a few more buffer transitions fixes, no sessions were found to remain after a test.	2008-11-11 20:26:58 +01:00
Willy Tarreau	cff6411f9a	[MAJOR] add a connection error state to the stream_interface Tracking connection status changes was hard, and some code was redundant. A new SI_ST_CER state was added to the stream interface to indicate a past connection error, and an SI_FL_ERR flag was added to report past I/O error. The stream_sock code does not set the connection to SI_ST_CLO anymore in case of I/O error, it's the upper layer which does it. This makes it possible to know exactly when the file descriptors are allocated. The new SI_ST_CER state permitted to split tcp_connection_status() in two parts, one processing SI_ST_CON and the other one SI_ST_CER. Synchronous connection errors now make use of this last state, hence eliminating duplicate code. Some ib<->ob copy paste errors were found and fixed, and all entities setting SI_ST_CLO also shut the buffers down. Some of these stream_interface specific functions and structures have migrated to a new stream_interface.c file. Some types of errors are still not detected by the buffers. For instance, let's assume the following scenario in one single pass of process_session: a connection sits in SI_ST_TAR state during a retry. At TAR expiration, a new connection attempt is made, the connection is obtained and srv->cur_sess is increased. Then the buffer timeout fires and everything is cleared, the new state becomes SI_ST_CLO. The cleaning code checks that previous state was either SI_ST_CON or SI_ST_EST to release the connection. But that's wrong because last state is still SI_ST_TAR. So the server's connection count does not get decreased. This means that prev_state must not be used, and must be replaced by some transition detection instead of level detection. The following debugging line was useful to track state changes : fprintf(stderr, "%s:%d: cs=%d ss=%d(%d) rqf=0x%08x rpf=0x%08x\n", __FUNCTION__, __LINE__, s->si[0].state, s->si[1].state, s->si[1].err_type, s->req->flags, s->rep->flags);	2008-11-03 06:26:53 +01:00
Willy Tarreau	efb453c259	[MAJOR] migrate the connection logic to stream interface The connection setup code has been refactored in order to make it run only on low level (stream interface). Several complicated functions have been removed from backend.c, and we now have sess_update_stream_int() to manage an assigned connection, sess_prepare_conn_req() to assign a server to a connection request, perform_http_redirect() to redirect instead of connecting to server, and return_srv_error() to return connection error status messages. The stream_interface status changes are checked before adjusting buffer flags, so that the buffers can be informed about this lower level update. A new connection is initiated by changing si->state from SI_ST_INI to SI_ST_REQ. The code seems to work but is awfully dirty. Some functions need to be moved, and the layering is not yet quite clear. A lot of dead old code has simply been removed.	2008-11-02 10:19:10 +01:00
Willy Tarreau	d7704b5343	[MINOR] add an expiration flag to the stream_sock_interface This expiration flag is used to indicate that the timer has expired without having to check it everywhere.	2008-11-02 10:19:10 +01:00
Willy Tarreau	3c6ab2e28d	[MEDIUM] use buffer_check_timeouts instead of stream_sock_check_timeouts() It's more appropriate to use buffer_check_timeouts() to check for buffer timeouts and si->shutw/shutr to shutdown the stream interfaces.	2008-11-02 10:19:10 +01:00
Willy Tarreau	3537467679	[MEDIUM] move QUEUE and TAR timers to stream interfaces It was not practical to have QUEUE and TAR timers in buffers, as they caused triggering of the timeout flags. Move them to the stream interface where they belong.	2008-11-02 10:19:09 +01:00
Willy Tarreau	a37095b96f	[CLEANUP] process_session: move debug outputs out of the critical loop The if(debug&closed) printfs have moved outside of the loop. It also permitted to merge several of them.	2008-11-02 10:19:09 +01:00
Willy Tarreau	4ffd51a848	[MEDIUM] process_session: make use of the new buffer flags Now we have almost two distinct parts between tcp and http. Only the connection establishment code still requires some resynchronization, the rest does not.	2008-11-02 10:19:09 +01:00
Willy Tarreau	9a2d15429d	[MEDIUM] buffers: add BF_READ_ATTACHED and BF_ANA_TIMEOUT Those two flags will be used to wake up analysers only when needed.	2008-11-02 10:19:09 +01:00
Willy Tarreau	48adac5db9	[MEDIUM] stream interface: add the ->shutw method as well as in and out buffers Those entries were really needed for cleaner and better code. Using them has permitted to automatically close a file descriptor during a shut write, reducing by 20% the number of calls to process_session() and derived functions. Process_session() does not need to know the file descriptor anymore, though it still remains very complicated due to the special case for the connect mode.	2008-11-02 10:19:08 +01:00
Willy Tarreau	e5ed406715	[MAJOR] make stream sockets aware of the stream interface As of now, a stream socket does not directly wake up the task but it does contact the stream interface which itself knows the task. This allows us to perform a few cleanups upon errors and shutdowns, which reduces the number of calls to data_update() from 8 per session to 2 per session, and make all the functions called in the process_session() loop completely swappable. Some improvements are required. We need to provide a shutw() function on stream interfaces so that one side which closes its read part on an empty buffer can propagate the close to the remote side.	2008-11-02 10:19:08 +01:00
Willy Tarreau	eabf313df2	[MINOR] change type of fdtab[]->owner to void* The owner of an fd was initially a task but this was sometimes cast to a (struct listener *). We'll soon need more types, so void* is more appropriate.	2008-11-02 10:19:08 +01:00
Willy Tarreau	fdccded0e8	[MEDIUM] indicate a reason for a task wakeup It's very frequent to require some information about the reason why a task is running. Some flags have been added so that a task now knows if it got woken up due to I/O completion, timeout, etc...	2008-11-02 10:19:08 +01:00
Willy Tarreau	4df8206832	[OPTIM] reduce the number of calls to task_wakeup() A test has shown that more than 16% of the calls to task_wakeup() could be avoided because the task is already woken up. So make it inline and move the test to the inline part.	2008-11-02 10:19:07 +01:00
Willy Tarreau	cb651251f9	[OPTIM] ev_sepoll: detect newly created FDs and check them once When an accept() creates a new FD, it is already marked as set for reads. But the task will be woken up without first checking if the socket could be read. The speculative I/O gives us a chance to either read the FD if there are data pending on it, or immediately mark it for poll mode if nothing is pending. Simply doing this reduces the number of calls to process_session from 6 to 5 per session, 2 to 1 calls to process_request, 10% less calls to epoll_ctl, fd_clr, fd_set, stream_sock_data_update, 20% less eb32_insert/eb_delete, etc... General performance increase seems to be around 3%.	2008-11-02 10:19:07 +01:00
Willy Tarreau	21e1be8152	[MINOR] do not check for BF_SHUTR when computing write timeout This check was useless as !BF_SHUTR is already implied by tick_isset(rex).	2008-11-02 10:19:07 +01:00
Willy Tarreau	3da77c5abd	[MINOR] re-arrange buffer flags and rename some of them The buffer flags became a big bazaar. Re-arrange them so that their names are more explicit and so that they are more easily readable in hex form. Some aggregates have also been adjusted.	2008-11-02 10:19:07 +01:00
Willy Tarreau	72b179a53c	[MEDIUM] reintroduce BF_HIJACK with produce_content The stats dumps are back. Even very large config files with 5000 servers work fast and well. The SN_SELF_GEN flag has completely been removed.	2008-11-02 10:19:06 +01:00
Willy Tarreau	36e6a41bc8	[MINOR] only call flow analysers when their read side is connected. It's useless to call flow analysers when their read side has not seen a connection yet.	2008-11-02 10:19:06 +01:00
Willy Tarreau	2bea3a1155	[OPTIM] stream_sock_read must check for null-reads more often With small HTTP messages, stream_sock_read() tends to wake the task up for a message read without indicating that it may be the last one. The reason is that level-triggered pollers generally don't report HUP with data, but only afterwards, so stream_sock_read has no chance to detect this condition and needs a respin. So now we return on incomplete buffers only when the buffer is known as a streamer, because here it generally makes sense. The net result is that the number of calls in a single HTTP session has dropped from 5 to 3, with one less wake up and several less calls to stream_sock_data_update().	2008-11-02 10:19:06 +01:00
Willy Tarreau	3a16b2c9cd	[MEDIUM] split stream_sock_process_data It was a waste to constantly update the file descriptor's status and timeouts during a flags update. So stream_sock_process_data has been split in two parts: stream_sock_data_update(), which computes updated flags, and stream_sock_data_finish(), which computes timeouts. Only the first one is called during flag updates. The second one is only called upon completion. The number of calls to fd_set/fd_clr has now significantly dropped. Also, it's useless to check for errors and timeouts in the process_session() loop, it's enough to check for them at the beginning.	2008-11-02 10:19:06 +01:00
Willy Tarreau	f9839bdffe	[MAJOR] make the client side use stream_sock_process_data() The client side now relies on stream_sock_process_data(). One part has not yet been re-implemented, it concerns the calls to produce_content(). process_session() has been adjusted to correctly check for changing bits in order not to call useless functions too many times. It already appears that stream_sock_process_data() should be split so that the timeout computations are only performed at the exit of process_session().	2008-11-02 10:19:06 +01:00
Willy Tarreau	2d2127989c	[MEDIUM] stream_sock_process_data moved to stream_sock.c The old temporary process_srv_data function moved to stream_sock.c.	2008-11-02 10:19:05 +01:00
Willy Tarreau	8a8188301b	[MEDIUM] process_srv_data: ensure that we always correctly re-arm timeouts We really want to ensure that we don't miss a timeout update and do not update them for nothing. So the code takes care of updating the timeout in the two following circumstances: it was not set, or some I/O has been performed. Maybe we'll be able to remove that from stream_sock_{read|write}, or we'll find a way to ensure that we never have to re-enable this.	2008-11-02 10:19:05 +01:00
Willy Tarreau	2ac679d9aa	[MEDIUM] third cleanup and optimization of process_srv_data() Some repeated tests were factored out. Now the code makes sense and is fully understandable.	2008-11-02 10:19:05 +01:00
Willy Tarreau	8fbd3b4ce7	[MEDIUM] second level of code cleanup for process_srv_data Now the function is 100% server-independent. Next step will consist in using the same function for the client side too.	2008-11-02 10:19:05 +01:00
Willy Tarreau	376580a873	[MEDIUM] massive cleanup of process_srv() Server-specific calls were extracted and moved to the caller. The function is now nearly server-agnostic.	2008-11-02 10:19:05 +01:00
Willy Tarreau	8b46aa01ac	[OPTIM] remove useless fd_set(read) upon shutdown(write) Those old tricks are no longer needed and are overwritten anyway. Remove them.	2008-11-02 10:19:05 +01:00
Willy Tarreau	fa7e10251d	[MAJOR] rework of the server FSM srv_state has been removed from HTTP state machines, and states have been split in either TCP states or analyzers. For instance, the TARPIT state has just become a simple analyzer. New flags have been added to the struct buffer to compensate this. The high-level stream processors sometimes need to force a disconnection without touching a file-descriptor (eg: report an error). But if they touched BF_SHUTW or BF_SHUTR, the file descriptor would not be closed. Thus, the two SHUT?_NOW flags have been added so that an application can request a forced close which the stream interface will be forced to obey. During this change, a new BF_HIJACK flag was added. It will be used for data generation, eg during a stats dump. It prevents the producer on a buffer from sending data into it. BF_SHUTR_NOW /* the producer must shut down for reads ASAP */ BF_SHUTW_NOW /* the consumer must shut down for writes ASAP */ BF_HIJACK /* the producer is temporarily replaced */ BF_SHUTW_NOW has precedence over BF_HIJACK. BF_HIJACK has precedence over BF_MAY_FORWARD (so that it does not need it). New functions buffer_shutr_now(), buffer_shutw_now(), buffer_abort() are provided to manipulate BF_SHUT* flags. A new type "stream_interface" has been added to describe both sides of a buffer. A stream interface has states and error reporting. The session now has two stream interfaces (one per side). Each buffer has stream_interface pointers to both consumer and producer sides. The server-side file descriptor has moved to its stream interface, so that even the buffer has access to it. process_srv() has been split into three parts: tcp_get_connection() obtains a connection to the server; tcp_connection_failed() tests if a previously attempted connection has succeeded or not; process_srv_data() only manages the data phase, and in this sense should be roughly equivalent to process_cli. Little code has been removed, and a lot of old code has been left in comments for now.	2008-11-02 10:19:04 +01:00
Willy Tarreau	41f40ede3b	[MEDIUM] make it possible for analysers to follow the whole session Some analysers will need to remain present after connection is established. Change the way BF_MAY_FORWARD is set to allow this.	2008-11-02 10:19:04 +01:00
Willy Tarreau	788e284d93	[BUG] fix harmless but wrong fd insertion sequence In backend.c, we had an EV_FD_SET() called before fd_insert(). This is wrong because fd_insert updates maxfd which might be used by some of the pollers during EV_FD_SET(), although this is not currently the case.	2008-08-26 13:25:39 +02:00
Willy Tarreau	79f5fe82f8	[BUG] Fix empty X-Forwarded-For header name when set in defaults section The following patch introduced a minor bug: "[MINOR] permit renaming of x-forwarded-for header". If "option forwardfor" is declared in a defaults section, the header name is never set and we see an empty header name before the value. Also, the header name was not reset between two defaults sections.	2008-08-26 13:22:19 +02:00
Willy Tarreau	c52164a1a8	[BUG] process_request: HTTP body analysis must return zero if missing data This missing return and timeout check caused an infinite loop too.	2008-08-17 19:27:11 +02:00
Willy Tarreau	2500981dc1	[BUG] process_cli/process_srv: don't call shutdown when already done A few missing checks of BF_SHUTR and BF_SHUTW caused busy loops upon some error paths.	2008-08-17 18:16:38 +02:00
Willy Tarreau	ffab5b4ab0	[MEDIUM] merge inspect_exp and txn->exp into request buffer Since we may have several analysers on a buffer, it's more convenient to have the analyser timeout attached to the buffer itself.	2008-08-17 18:03:28 +02:00
Willy Tarreau	c7e961e5f7	[BUILD] fix warning in proto_tcp.c with gcc >= 4 signedness issues.	2008-08-17 17:13:47 +02:00
Willy Tarreau	6d2889ba3d	[OPTIM] process_cli/process_srv: reduce the number of tests We can skip a number of tests by simply checking a few flags, it saves a few CPU cycles in the fast path.	2008-08-17 16:25:06 +02:00
Willy Tarreau	2df28e8110	[MEDIUM] session: move the analysis bit field to the buffer It makes more sense to store the list of analysers in the buffer than in the session since they are precisely plugged onto one buffer.	2008-08-17 15:20:19 +02:00
Willy Tarreau	f495ddf9d4	[MINOR] ensure the termination flags are set by process_xxx When any processing remains on a buffer, it must be up to the processing functions to set the termination flags, because they are the only ones who know about higher levels.	2008-08-17 14:38:41 +02:00
Willy Tarreau	507385d0e1	[MEDIUM] centralize buffer timeout checks at the top of process_session It's more efficient and easier to check all the timeouts at once and always rely on the buffer flags than to check them everywhere.	2008-08-17 13:04:25 +02:00
Willy Tarreau	26ed74dadc	[MEDIUM] use buffer->wex instead of buffer->cex for connect timeout It's a shame not to use buffer->wex for connection timeouts since by definition it cannot be used until the connection is established. Using it instead of ->cex also makes the buffer processing more symmetric.	2008-08-17 12:11:14 +02:00
Willy Tarreau	dafde43410	[MAJOR] process_session: rely only on buffer flags Instead of calling all functions in a loop, process_session now calls them according to buffer flags changes. This ensures that we almost never call functions for nothing. The flags settings are still quite coarse, but the average number of function calls per session has dropped from 31 to 18 (the calls to process_srv dropped from 13 to 7 and the calls to process_cli dropped from 13 to 8). This could still be improved by memorizing which flags each function uses, but that would add a level of complexity which is not desirable and maybe even not worth the small gain.	2008-08-17 01:15:41 +02:00
Willy Tarreau	e393fe224b	[MEDIUM] buffers: add BF_EMPTY and BF_FULL to remove dependency on req/rep->l It is not always convenient to run checks on req->l in functions to check if a buffer is empty or full. Now the stream_sock functions set flags BF_EMPTY and BF_FULL according to the buffer contents. Of course, functions which touch the buffer contents adjust the flags too.	2008-08-16 22:18:07 +02:00
Willy Tarreau	ba392cecf9	[CLEANUP] get rid of BF_SHUT*_PENDING BF_SHUTR_PENDING and BF_SHUTW_PENDING were poor ideas because BF_SHUTR is the pending of BF_SHUTW_DONE and BF_SHUTW is the pending of BF_SHUTR_DONE. Remove those two useless and confusing "pending" versions and rename buffer_shut{r,w}_* functions.	2008-08-16 21:13:23 +02:00
Willy Tarreau	d5382b4aaa	[BUG] maintain_proxies must not disable backends maintain_proxies could disable backends (p->maxconn == 0) which is wrong (but apparently harmless). Add a check for p->maxconn == 0.	2008-08-16 18:41:13 +02:00
Willy Tarreau	a7c52761b4	[BUG] process_response: do not touch srv_state process_response is not allowed to touch srv_state (this is an incident which has survived the code migration). This bug was causing connection exhaustion on frontend due to some closed sockets marked SV_STDATA again.	2008-08-16 18:40:18 +02:00
Willy Tarreau	d9f483646d	[BUG] buffers: remove BF_MAY_CONNECT and fix forwarding issue It wasn't really wise to separate BF_MAY_CONNECT and BF_MAY_FORWARD, as it caused trouble in TCP mode because the connection was allowed but not the forwarding. Remove BF_MAY_CONNECT.	2008-08-16 16:39:26 +02:00
Willy Tarreau	9a8c5de375	[BUG] process_response must not enable the read FD Since the separation of TCP and HTTP state machines, the HTTP code must not play anymore with the file descriptor status without checking if they are closed. Remains of such practice have caused busy loops under some circumstances (mainly when client closed during headers response).	2008-08-16 16:11:07 +02:00
Willy Tarreau	7a52a5c468	[BUG] ev_sepoll: closed file descriptors could persist in the spec list If __fd_clo() was called on a file descriptor which was previously disabled, it was not removed from the spec list. This apparently could not happen on previous code because the TCP states prevented this, but now it happens regularly. The effects are spec entries stuck populated, leading to busy loops.	2008-08-16 16:06:02 +02:00
Willy Tarreau	f853320b44	[MINOR] term_trace: add better instrumentations to trace the code A new member has been added to the struct session. It keeps a trace of what block of code performs a close or a shutdown on a socket, and in what sequence. This is extremely convenient for post-mortem analysis where flag combinations and states seem impossible. A new ABORT_NOW() macro has also been added to make the code immediately segfault where called.	2008-08-16 14:55:08 +02:00
Willy Tarreau	1ae3a057df	[MEDIUM] remove unused references to {CL|SV}_STSHUT* All references to CL_STSHUT* and SV_STSHUT* were removed where possible. Some of them could not be removed because they are still in use by the unix sockets. A bug remains at this stage. Injecting with a very short timeout sometimes leads to a client in close state and a server in data state with all buffer flags indicating a shutdown but the server fd still enabled, thus causing a busy loop.	2008-08-16 10:56:30 +02:00
Willy Tarreau	461f662846	[MAJOR] clearly separate HTTP response processing from TCP server state The HTTP response is now processed in its own function, regardless of the TCP state. All FSMs have become somewhat simpler and must still be improved by removing useless CL_STSHUT* and SV_STSHUT* (still used by proto_uxst). The number of calls to process_* is still huge though. Next steps consist in: removing useless assignments of CL_STSHUT* and SV_STSHUT*; adding a BF_EMPTY flag to buffers to indicate an empty buffer; returning smarter values in process_* so that each callee may explicitly indicate who needs to be called after it; unifying read and write timeouts for a same side, since the way it is now is too complicated and error-prone; auditing code for regression testing. We're close to getting something which works rather better now.	2008-08-15 23:43:19 +02:00
Willy Tarreau	cebf57e0bf	[MAJOR] better separation of response processing and server state TCP timeouts are not managed anymore by the response FSM. Warning, the FORCE_CLOSE state does not work anymore for now. All remaining bugs causing stale connections have been swept.	2008-08-15 18:16:37 +02:00
Willy Tarreau	f5483bf639	[MAJOR] get rid of the SV_STHEADERS state The HTTP response code has been moved to a specific function called "process_response" and the SV_STHEADERS state has been removed and replaced with the flag AN_RTR_HTTP_HDR.	2008-08-14 18:35:40 +02:00
Willy Tarreau	e46ab5524f	[BUG] fix recently introduced loop when client closes early Due to a recent change in the FSMs, if the client closes with buffer full, then the server loops waiting for headers. We can safely ignore this case since the server FSM will have to be reworked too. Let's fix the root cause for now.	2008-08-14 00:18:39 +02:00
Willy Tarreau	c65a3ba3d4	[MAJOR] completely separate HTTP and TCP states on the request path For the first time, HTTP and TCP are not merged anymore. All request processing has moved to process_request while the TCP processing of the frontend remains in process_cli. The code is a lot cleaner, simpler, smaller (1%) and slightly faster (1% too). Right now, the HTTP state machine cannot easily command the TCP state machine, but it does not cause that many difficulties. The response processing has not yet been extracted, and the unix-stream state machines have to be broken down that way too. The CL_STDATA, CL_STSHUTR and CL_STSHUTW states still exist and are exactly the same. They will have to be all merged into CL_STDATA once the work has stabilized. It is also possible that this single state will disappear in favor of just buffer flags.	2008-08-14 00:18:39 +02:00
Willy Tarreau	7f875f6c8f	[MEDIUM] simplify and centralize request timeout cancellation and request forwarding Instead of playing with req->flags and request timeout everywhere, tweak them only at precise locations.	2008-08-14 00:18:38 +02:00
Willy Tarreau	adfb8569f7	[MAJOR] get rid of SV_STANALYZE (step 2) The SV_STANALYZE state was installed on the server side but was really meant to be processed with the rest of the request on the client side. It suffered from several issues, mostly related to the way timeouts were handled while waiting for data. All known issues related to timeouts during a request - and specifically a request involving body processing - have been raised and fixed. At this point, the code is a bit dirty but works fine, so next steps might be cleanups with an ability to come back to the current state in case of trouble.	2008-08-14 00:18:38 +02:00
Willy Tarreau	67f0eead22	[MAJOR] kill CL_STINSPECT and CL_STHEADERS (step 1) This is a first attempt at separating data processing from the TCP state machine. Those two states have been replaced with flags in the session indicating what needs to be analyzed. The corresponding code is still called before and in lieu of TCP states. Next change should get rid of the specific SV_STANALYZE which is in fact a client state. Then next change should consist in making it possible to analyze TCP contents while being in CL_STDATA (or CL_STSHUT*).	2008-08-14 00:18:38 +02:00
Aleksandar Lazic	697bbb0106	[PATCH] appsessions: cleanup DEBUG_HASH and initialize request_counter This patch cleans up the -DDEBUG=DEBUG_HASH output setting and initializes the request_counter for the appsessions.	2008-08-13 23:43:26 +02:00
Willy Tarreau	9f1f24bb7f	[BUG] client timeout incorrectly rearmed while waiting for server Client timeout could be refreshed in stream_sock_*, but this is undesired when the timeout is already set to eternity. The effect is that a session could still be aborted if client timeout was smaller than server timeout. A second effect is that sessions expired on the server side would expire with "cD" flags. The fix consists in not updating it if it was not previously set. A cleaner method might consist in updating the buffer timeout. This is probably what will be done later when the state machines only deal with the buffers.	2008-08-11 11:34:18 +02:00
Willy Tarreau	ce09c52187	[BUG] server timeout was not considered in some circumstances Due to a copy-paste typo, the client timeout was refreshed instead of the server's when waiting for server response. This means that the server's timeout remained eternity.	2008-08-11 11:34:16 +02:00
Willy Tarreau	fb0528bd56	[BUG] fix segfault with url_param + check_post If an HTTP/0.9-like POST request is sent to haproxy while configured with url_param + check_post, it will crash. The reason is that the total buffer length was computed based on req->total (which equals the number of bytes read) and not req->l (number of bytes in the buffer), thus leading to wrong size calculations when calling memchr(). The affected code does not look like it could have been exploited to run arbitrary code, only reads were performed at wrong locations.	2008-08-11 11:34:01 +02:00
Willy Tarreau	718f0ef129	[MEDIUM] process_cli: don't rely at all on server state A new buffer flag BF_MAY_FORWARD has been added so that the client FSM can check whether it is allowed to forward the response to the client. The client FSM does not have to monitor the server state anymore.	2008-08-10 16:21:32 +02:00
Willy Tarreau	dc0a6a0dea	[MEDIUM] process_srv: don't rely at all on client state A new buffer flag BF_MAY_CONNECT has been added so that the server FSM can check whether it is allowed to establish a connection or not. That way, the client FSM only has to move this flag and the server side does not need to monitor client state anymore.	2008-08-03 22:47:10 +02:00
Willy Tarreau	6468d924ea	[MEDIUM] process_srv: rely on buffer flags for client shutdown The open/close nature of each half of the client side is known to the buffer, so let the server state machine rely on this instead of checking the client state for CL_STSHUT* or CL_STCLOSE.	2008-08-03 20:48:51 +02:00
Willy Tarreau	89edf5e629	[MEDIUM] buffers: ensure buffer_shut* are properly called upon shutdowns It is important that buffer states reflect the state of both sides so that we can remove client and server state inter-dependencies.	2008-08-03 20:48:50 +02:00
Willy Tarreau	48d63db7a8	[MEDIUM] memory: update pool_free2() to support NULL pointers In order to make pool usage more convenient, let pool_free2() support NULL pointers by doing nothing, just like the standard free(3) call does. The various call places have been updated to remove the now useless checks.	2008-08-03 20:48:50 +02:00
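The NULL-tolerant release described in the commit above can be pictured with a minimal sketch. It assumes a simple free-list pool; `struct pool_sketch`, `pool_alloc_sketch()` and `pool_free_sketch()` are hypothetical names for illustration and are not HAProxy's actual pool code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical free-list pool, for illustration only. */
struct pool_sketch {
	void *free_list;  /* singly linked list of released objects */
	size_t size;      /* object size in bytes */
	int used;         /* objects currently handed out */
};

/* Take an object from the free list, or fall back to malloc(). */
void *pool_alloc_sketch(struct pool_sketch *pool)
{
	void *p = pool->free_list;

	if (p) {
		pool->free_list = *(void **)p;
	} else {
		/* an object must be able to hold the list link */
		size_t sz = pool->size < sizeof(void *) ? sizeof(void *) : pool->size;
		p = malloc(sz);
	}
	if (p)
		pool->used++;
	return p;
}

/* Release an object. Like free(3), a NULL pointer is silently ignored,
 * so callers no longer need "if (ptr)" before every release. */
void pool_free_sketch(struct pool_sketch *pool, void *ptr)
{
	if (!ptr)
		return;
	*(void **)ptr = pool->free_list;
	pool->free_list = ptr;
	pool->used--;
}
```

With this shape, cleanup paths can release every member of a partially initialized structure unconditionally.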
Willy Tarreau	a534fea478	[CLEANUP] remove 65 useless NULL checks before free C specification clearly states that free(NULL) is a no-op. So remove useless checks before calling free.	2008-08-03 20:48:50 +02:00
Ross West	af72a1d8ec	[MINOR] permit renaming of x-forwarded-for header Because I needed it in my situation - here's a quick patch to allow changing of the "x-forwarded-for" header by using a suboption to "option forwardfor". Suboption "header XYZ" will set the header from "x-forwarded-for" to "XYZ". Default is still "x-forwarded-for" if the header value isn't defined. Also the suboption 'except a.b.c.d/z' still works on the same line. So it's now: option forwardfor [except a.b.c.d[/z]] [header XYZ]	2008-08-03 10:51:45 +02:00
Willy Tarreau	dd64f8d394	[MEDIUM] acl: when possible, report the name and requirements of ACLs in warnings When an ACL is referenced at a wrong place (eg: response during request, layer7 during layer4), try to indicate precisely the name and requirements of this ACL. Only the first faulty ACL is returned. A small change consisting in iterating that way may improve reports : cap = ACL_USE_any_unexpected while ((acl=cond_find_require(cond, cap))) { warning() cap &= ~acl->requires; } This will report the first ACL of each unsupported type. But doing so will mangle the error reporting a lot, so we need to rework error reports first.	2008-08-03 09:41:05 +02:00
Willy Tarreau	0ceba5af74	[MEDIUM] acl: set types on all currently known ACL verbs All currently known ACL verbs have been assigned a type which makes it possible to detect inconsistencies, such as response values used in request rules.	2008-07-25 19:31:03 +02:00
Willy Tarreau	a9802633d8	[MEDIUM] acl: enforce ACL type checking ACL now hold information on the availability of the data they rely on. They can indicate which parts of the requests/responses they require, and the rules parser may now report inconsistencies. As an example, switching rules are now checked for response-specific ACLs, though those are not still set. A warning is reported in case of mismatch. ACLs keyword restrictions will now have to be specifically set wherever a better control is expected. The line number where an ACL condition is declared has been added to the conditions in order to be able to report the faulty line number during post-loading checks.	2008-07-25 19:13:19 +02:00
Willy Tarreau	b6fb420c7e	[MINOR] acl: add the "wait_end" acl verb The new "wait_end" acl delays evaluation of the rule (and the next ones) to the end of the analysis period. This is intended to be used with TCP content analysis. A rule referencing such an ACL will not match until the delay is over. An equivalent default ACL "WAIT_END" has been created.	2008-07-20 11:18:28 +02:00
Willy Tarreau	58393e103f	[MEDIUM] acl: get rid of dummy values in always_true/always_false make use of last change in order to get rid of dummy values in always_true/always_false.	2008-07-20 10:39:22 +02:00
Willy Tarreau	a79534fce1	[MEDIUM] acl: permit fetch() functions to set the result themselves For protocol analysis, it's not always convenient to have to run through a fetch then a match against dummy values. It's easier to let the fetch() function set the result itself. This obviously works only for boolean values.	2008-07-20 10:17:20 +02:00
Willy Tarreau	c6317703ce	[MINOR] acl: add REQ_CONTENT to the list of default acls With content inspection, checking the presence of data in the request buffer is very important. It's getting boring to always add such an ACL, so let's add it by default.	2008-07-20 09:29:50 +02:00
Willy Tarreau	177e2b0127	[CLEANUP] remove dependency on obsolete INTBITS macro The INTBITS macro was found to be already defined on some platforms, and to equal 32 (while INTBITS was 5 here). Due to pure luck, there was no declaration conflict, but it's nonetheless a problem to fix. Looking at the code showed that this macro was only used for left shifts and nothing else anymore. So the replacement is obvious. The new macro, BITS_PER_INT is more obviously correct.	2008-07-16 10:30:44 +02:00
Willy Tarreau	ec6c5df018	[CLEANUP] remove many #include <types/xxx> from C files It should be stated as a rule that a C file should never include types/xxx.h when proto/xxx.h exists, as it gives less exposure to declaration conflicts (one of which was caught and fixed here) and it complicates the file headers for nothing. Only types/global.h, types/capture.h and types/polling.h have been found to be valid includes from C files.	2008-07-16 10:30:42 +02:00
Willy Tarreau	284648e079	[CLEANUP] remove unused include/types/client.h This file is not used anymore.	2008-07-16 10:30:40 +02:00
Willy Tarreau	655e26af24	[MINOR] acl: add req_ssl_ver in TCP, to match an SSL version This new keyword matches a dotted version mapped into an integer. It permits matching an SSL message protocol version just as if it was an integer, so that it is easy to map ranges, like this : acl obsolete_ssl req_ssl_ver lt 3 acl correct_ssl req_ssl_ver 3.0-3.1 acl invalid_ssl req_ssl_ver gt 3.1 Both SSLv2 hello messages and SSLv3 messages are supported. The test tries to be strict enough to avoid being easily fooled. In particular, it waits for as many bytes as announced in the message header if this header looks valid (bound to the buffer size). The same decoder will be usable with minor changes to check the response messages.	2008-07-16 10:30:06 +02:00
Willy Tarreau	4a26d2f2fa	[MINOR] acl: add a new parsing function: parse_dotted_ver This new function supports one major and one minor and makes an int of them. It is very convenient to compare versions (eg: SSL) just as if they were plain integers, as the comparison functions will still be based on integers.	2008-07-16 10:29:51 +02:00
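As an illustration of the idea in the commit above, here is a minimal sketch of turning "major.minor" into a single integer so that versions compare as plain integers. The function name `parse_dotted_ver_sketch()` and the packing (major in the upper 16 bits, minor in the lower 16) are assumptions made for this example; the commit message does not state how the two numbers are combined.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical sketch: parse "major[.minor]" into one int.
 * Assumed packing: (major << 16) | minor, so that 3.0 < 3.1 < 4.0
 * when compared as integers.
 * Returns 1 on success and stores the result in *out, 0 on error. */
int parse_dotted_ver_sketch(const char *text, int *out)
{
	char *end;
	long major, minor = 0;

	if (*text < '0' || *text > '9')
		return 0;               /* must start with a digit */
	major = strtol(text, &end, 10);

	if (*end == '.') {
		const char *p = end + 1;

		if (*p < '0' || *p > '9')
			return 0;       /* "3." or "3.x" is rejected */
		minor = strtol(p, &end, 10);
	}

	/* reject trailing garbage and values that do not fit the packing */
	if (*end != '\0' || major > 0x7FFF || minor > 0xFFFF)
		return 0;

	*out = (int)((major << 16) | minor);
	return 1;
}
```

Because the result is an ordinary integer, range and ordering tests such as "lt 3" or "3.0-3.1" need no version-specific comparison code.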
Willy Tarreau	b686644ad8	[MAJOR] implement tcp request content inspection Some people need to inspect contents of TCP requests before deciding to forward a connection or not. A future extension of this demand might consist in selecting a server farm depending on the protocol detected in the request. For this reason, a new state CL_STINSPECT has been added on the client side. It is immediately entered upon accept() if the statement "tcp-request inspect-delay <xxx>" is found in the frontend configuration. Haproxy will then wait up to this amount of time trying to find a matching ACL, and will either accept or reject the connection depending on the "tcp-request content <action> {if|unless}" rules, where <action> is either "accept" or "reject". Note that it only waits that long if no definitive verdict can be found earlier. That generally implies calling a fetch() function which does not have enough information to decode some contents, or a match() function which only finds the beginning of what it's looking for. It is only at the ACL level that partial data may be processed as such, because we need to distinguish between MISS and FAIL before applying the term negation. Thus it is enough to add "| ACL_PARTIAL" to the last argument when calling acl_exec_cond() to indicate that we expect ACL_PAT_MISS to be returned if some data is missing (for fetch() or match()). This is the only case we may return this value. For this reason, the ACL check in process_cli() has become a lot simpler. A new ACL "req_len" of type "int" has been added. Right now it is already possible to drop requests which talk too early (eg: for SMTP) or which don't talk at all (eg: HTTP/SSL). Also, the acl fetch() functions have been extended in order to permit reporting of missing data in case of fetch failure, using the ACL_TEST_F_MAY_CHANGE flag. The default behaviour is unchanged, and if no rule matches, the request is accepted. As a side effect, all layer 7 fetching functions have been cleaned up so that they now check for the validity of the layer 7 pointer before dereferencing it.	2008-07-16 10:29:07 +02:00
Willy Tarreau	9de1bbd004	[MEDIUM] modularize the "timeout" keyword configuration parser The "timeout" keyword already relied on an external parser, let's make use of the new keyword registration mechanism.	2008-07-09 20:34:27 +02:00
Willy Tarreau	39f23b6c7e	[MINOR] cfgparse: add support for warnings in external functions Some parsers will need to report warnings in some cases. Let's use positive values for that.	2008-07-09 20:23:15 +02:00
Willy Tarreau	10522fd113	[MEDIUM] modularize the global "stats" keyword configuration parser The "stats" keyword already relied on an external parser, let's make use of the new keyword registration mechanism.	2008-07-09 20:12:41 +02:00
Willy Tarreau	5b2c33683b	[MEDIUM] add support for configuration keyword registration Any module which needs configuration keywords may now dynamically register a keyword in a given section, and associate it with a configuration parsing function using cfg_register_keywords() from a constructor function. This makes the configuration parser more modular because it is not required anymore to touch cfg_parse.c. Example : static int parse_global_blah(char **args, int section_type, struct proxy *curpx, struct proxy *defpx, char *err, int errlen) { printf("parsing blah in global section\n"); return 0; } static int parse_listen_blah(char **args, int section_type, struct proxy *curpx, struct proxy *defpx, char *err, int errlen) { printf("parsing blah in listen section\n"); if (!*args[1]) { snprintf(err, errlen, "missing arg for listen_blah!!!"); return -1; } return 0; } static struct cfg_kw_list cfg_kws = {{ },{ { CFG_GLOBAL, "blah", parse_global_blah }, { CFG_LISTEN, "blah", parse_listen_blah }, { 0, NULL, NULL }, }}; __attribute__((constructor)) static void __module_init(void) { cfg_register_keywords(&cfg_kws); }	2008-07-09 19:44:58 +02:00
Willy Tarreau	11382813a1	[TESTS] added test-acl.cfg to test some ACL combinations various rules constructions can be tested with this test case.	2008-07-09 16:18:21 +02:00
Willy Tarreau	a8cfa34a9c	[BUG] use_backend would not correctly consider "unless" A copy-paste typo made use_backend not correctly consider the "unless" case, depending on the previous "block" rule.	2008-07-09 11:23:31 +02:00
Willy Tarreau	0c303eec87	[MAJOR] convert all expiration timers from timeval to ticks This is the first attempt at moving all internal parts from using struct timeval to integer ticks. These provide simpler and faster code due to simplified operations, and this change also saved about 64 bytes per session. A new header file has been added : include/common/ticks.h. It is possible that some functions should finally not be inlined because they're used quite a lot (eg: tick_first, tick_add_ifset and tick_is_expired). More measurements are required in order to decide whether this is interesting or not. Some function and variable names are still subject to change for a better overall logic.	2008-07-07 00:09:58 +02:00
Willy Tarreau	ce44f12c1e	[OPTIM] task_queue: assume most consecutive timers are equal When queuing a timer, it's very likely that an expiration date is equal to that of the previously queued timer, due to time rounding to the millisecond. Optimizing for this case provides a noticeable 1% performance boost.	2008-07-05 18:16:19 +02:00
Willy Tarreau	91e99931b7	[MEDIUM] introduce task->nice and boost access to statistics The run queue scheduler now considers task->nice to queue a task and to pick a task out of the queue. This makes it possible to boost the access to statistics (both via HTTP and UNIX socket). The UNIX socket receives twice as much of a boost as the HTTP socket because it is more sensitive.	2008-06-30 07:51:00 +02:00
Willy Tarreau	58b458d8ba	[MAJOR] use an ebtree instead of a list for the run queue We now insert tasks in a certain sequence in the run queue. The sorting key currently is the arrival order. It will now be possible to apply a "nice" value to any task so that it goes forwards or backwards in the run queue. The calls to wake_expired_tasks() and maintain_proxies() have been moved to the main run_poll_loop(), because they had nothing to do in process_runnable_tasks(). The task_wakeup() function is not inlined anymore, as it was only used at one place. The qlist member of the task structure has been removed now. The run_queue list has been replaced for an integer indicating the number of tasks in the run queue.	2008-06-29 22:40:23 +02:00
Willy Tarreau	af754fc88f	[OPTIM] shrink wake_expired_tasks() by using task_wakeup() It's not worth duplicating task_wakeup() in wake_expired_tasks(). Calling it reduces code size and slightly improves performance.	2008-06-29 19:25:52 +02:00
Willy Tarreau	69e989ccbc	[BUILD] change declaration of base64tab to fix build with Intel C++ I got a report that Intel C++ complains about the size of the base64tab in base64.c. Setting it to 65 chars to allow for the trailing zero fixes the problem.	2008-06-29 17:17:38 +02:00
Willy Tarreau	28c41a4041	[MEDIUM] rework the wait queue mechanism The wait queues now rely on 4 trees for past, present and future timers. The computations are cleaner and more reliable. The wake_expired_tasks function has become simpler. Also, a bug previously introduced in task_queue() by the first introduction of eb_trees has been fixed (the eb->key was never updated).	2008-06-29 17:00:59 +02:00
Willy Tarreau	284c7b3195	[BUG] disable buffer read timeout when reading stats The buffer read timeouts were not reset when stats were produced. This caused unneeded wakeups.	2008-06-29 16:38:43 +02:00
Willy Tarreau	e6313a37d6	[MINOR] introduce now_ms, the current date in milliseconds This new time value will be used to compute timeouts and wait queue positions. The operation is made once for all when time is retrieved. A future improvement might consist in having it in ticks of 1/1024 second and to convert all timeouts into ticks.	2008-06-29 13:47:25 +02:00
Willy Tarreau	e62bdd4026	[BUG] wqueue: perform proper timeout comparisons with wrapping values With wrapping keys, we cannot simply do "if (key > now)", but we must at least do "if ((signed)(key-now) > 0)".	2008-06-29 10:32:02 +02:00
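The comparison rule quoted in the commit above can be shown with a minimal sketch. It assumes a 32-bit millisecond key; the helper names `key_is_after()` and `key_is_after_naive()` are hypothetical and chosen only for this illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: nonzero if 'key' lies strictly after 'now' on a
 * wrapping 32-bit millisecond clock. The unsigned subtraction wraps
 * modulo 2^32; reading the difference as a signed value gives the right
 * ordering as long as both dates are less than 2^31 ms (about 24 days)
 * apart. The unsigned-to-signed conversion relies on the usual
 * two's-complement behaviour of common platforms. */
int key_is_after(uint32_t key, uint32_t now)
{
	return (int32_t)(key - now) > 0;
}

/* The naive comparison, shown only for contrast: it gives the wrong
 * answer whenever the counter wraps between 'now' and 'key'. */
int key_is_after_naive(uint32_t key, uint32_t now)
{
	return key > now;
}
```

For instance, with `now` at 0xFFFFFFF0 and a timer due 21 ms later at 0x00000005, the signed form reports the timer as still in the future, while the naive form treats it as already expired.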
Willy Tarreau	accc4e1e86	[BUG] we could segfault during exit while freeing uri_auths The following config makes haproxy segfault on exit : defaults mode http balance roundrobin listen no-stats bind :8001 listen stats bind :8002 stats uri /stats The simple fix is to ensure that p->uri_auth is not NULL before dereferencing it.	2008-06-24 11:14:45 +02:00
Willy Tarreau	9789f7bd68	[MAJOR] replace ultree with ebtree in wait-queues The ultree code has been removed in favor of a simpler and cleaner ebtree implementation. The eternity queue does not need to exist anymore, and the pool_tree64 has been removed. The ebtree node is stored in the task itself. The qlist list header is still used by the run-queue, but will be able to disappear once the run-queue uses ebtree too.	2008-06-24 08:17:16 +02:00
Willy Tarreau	b0b37bcd65	[MEDIUM] further improve monotonic clock by checking forward jumps The first implementation of the monotonic clock did not verify forward jumps. The consequence is that a fast changing time may expire a lot of tasks. While it does seem minor, in fact it is problematic because most machines which boot with a wrong date are in the past and suddenly see their time jump by several years in the future. The solution is to check if we spent more apparent time in a poller than allowed (with a margin applied). The margin is currently set to 1000 ms. It should be large enough for any poll() to complete. Tests with randomly jumping clock show that the result is quite accurate (error less than 1 second at every change of more than one second).	2008-06-23 14:00:57 +02:00
Willy Tarreau	b7f694f20e	[MEDIUM] implement a monotonic internal clock If the system date is set backwards while haproxy is running, some scheduled events are delayed by the amount of time the clock went backwards. This is particularly problematic on systems where the date is set at boot, because it seldom happens that health-checks do not get sent for a few hours. Before switching to use clock_gettime() on systems which provide it, we can at least ensure that the clock is not going backwards and maintain two clocks : the "date" which represents what the user wants to see (mostly for logs), and an internal date stored in "now", used for scheduled events.	2008-06-22 17:18:02 +02:00
Willy Tarreau	7c669d7e0f	[BUG] fix the dequeuing logic to ensure that all requests get served The dequeuing logic was completely wrong. First, a task was assigned to all servers to process the queue, but this task was never scheduled and was only woken up on session free. Second, there was no reservation of server entries when a task was assigned a server. This means that as long as the task was not connected to the server, its presence was not accounted for. This was causing trouble when detecting whether or not a server had reached maxconn. Third, during a redispatch, a session could lose its place at the server's and get blocked because another session at the same moment would have stolen the entry. Fourth, the redispatch option did not work when maxqueue was reached for a server, and it was not possible to do so without indefinitely hanging a session. The root cause of all those problems was the lack of pre-reservation of connections at the server's, and the lack of tracking of servers during a redispatch. Everything relied on combinations of flags which could appear similarly in quite distinct situations. This patch is a major rework but there was no other solution, as the internal logic was deeply flawed. The resulting code is cleaner, more understandable, uses less magics and is overall more robust. As an added bonus, "option redispatch" now works when maxqueue has been reached on a server.	2008-06-20 15:08:06 +02:00
Willy Tarreau	7a63abd84f	[BUG] log: reported queue position was off by one The reported queue position in the logs was 0 for the first pending request in the queue, which is wrong because it means that one request will have to be completed before the queued one may execute. It caused the undesired side effect that 0/0 was reported when either 0 or 1 request was pending in the queue. Thus, we have to increment the queue size before reporting the value.	2008-06-20 15:08:04 +02:00
Willy Tarreau	7008987813	[BUG] queue management: wake oldest request in queues When a server terminates a connection, the next session in its own queue was immediately processed. Because of this, if all server queues are always filled, then no new anonymous request will be processed. Consider oldest request between global and server queues to choose from which to pick the request. An improvement over this will consist in adding a configurable offset when comparing expiration dates, so that cookie-less requests can get either less or more priority.	2008-06-20 15:07:40 +02:00
Willy Tarreau	3a6281199a	[BUG] event pollers must not wait if a task exists in the run queue Under some circumstances, a task may already lie in the run queue (eg: inter-task wakeup). It is disastrous to wait for an event in this case because some processing gets delayed.	2008-06-20 15:05:56 +02:00
Willy Tarreau	b463dfb2de	[MEDIUM] add support for conditional HTTP redirection A new "redirect" keyword adds the ability to send an HTTP 301/302/303 redirection to either an absolute location or to a prefix followed by the original URI. The redirection is conditioned by ACL rules, so it becomes very easy to move parts of a site to another site using this. This work was almost entirely done at Exceliance by Emeric Brun. A test-case has been added in the tests/ directory.	2008-06-07 23:08:56 +02:00
Krzysztof Piotr Oledzki	8001d6162e	[MEDIUM] Fix memory freeing at exit, part 2 - free oldpids - call free(exp->preg), not only regfree(exp->preg): req_exp, rsp_exp - build a list of unique uri_auths and eventually free it - prune_acl_cond/free for switching_rules - add a callback pointer to free ptr from acl_pattern (used for regexs) and execute it ==1180== malloc/free: in use at exit: 0 bytes in 0 blocks. ==1180== malloc/free: 5,599 allocs, 5,599 frees, 4,220,556 bytes allocated. ==1180== All heap blocks were freed -- no leaks are possible.	2008-06-07 11:06:14 +02:00
Krzysztof Piotr Oledzki	a643baf091	[MEDIUM] Fix memory freeing at exit New functions implemented: - deinit_pollers: called at the end of deinit()) - prune_acl: called via list_for_each_entry_safe Add missing pool_destroy2 calls: - p->hdr_idx_pool - pool2_tree64 Implement all task stopping: - health-check: needs new "struct task" in the struct server - queue processing: queue_mgt - appsess_refresh: appsession_refresh before (idle system): ==6079== LEAK SUMMARY: ==6079== definitely lost: 1,112 bytes in 75 blocks. ==6079== indirectly lost: 53,356 bytes in 2,090 blocks. ==6079== possibly lost: 52 bytes in 1 blocks. ==6079== still reachable: 150,996 bytes in 504 blocks. ==6079== suppressed: 0 bytes in 0 blocks. after (idle system): ==6945== LEAK SUMMARY: ==6945== definitely lost: 7,644 bytes in 137 blocks. ==6945== indirectly lost: 9,913 bytes in 587 blocks. ==6945== possibly lost: 0 bytes in 0 blocks. ==6945== still reachable: 0 bytes in 0 blocks. ==6945== suppressed: 0 bytes in 0 blocks. before (running system for ~2m): ==9343== LEAK SUMMARY: ==9343== definitely lost: 1,112 bytes in 75 blocks. ==9343== indirectly lost: 54,199 bytes in 2,122 blocks. ==9343== possibly lost: 52 bytes in 1 blocks. ==9343== still reachable: 151,128 bytes in 509 blocks. ==9343== suppressed: 0 bytes in 0 blocks. after (running system for ~2m): ==11616== LEAK SUMMARY: ==11616== definitely lost: 7,644 bytes in 137 blocks. ==11616== indirectly lost: 9,981 bytes in 591 blocks. ==11616== possibly lost: 0 bytes in 0 blocks. ==11616== still reachable: 4 bytes in 1 blocks. ==11616== suppressed: 0 bytes in 0 blocks. Still not perfect but significant improvement.	2008-05-30 07:07:19 +02:00
Krzysztof Piotr Oledzki	1acf217366	[BUG/CLEANUP] cookiedomain -> cookie_domain rename + free(p->cookie_domain) Rename cookiedomain -> cookie_domain to be consistent with current naming scheme. Also make sure cookie_domain is deallocated at deinit()	2008-05-30 07:03:22 +02:00
Willy Tarreau	8a7af60312	[MEDIUM] detect streaming buffers and tag them as such Add the ability to detect streaming buffers, and set a flag indicating it. It will later serve us in order to dynamically resize them, and to prioritize file descriptors during polls.	2008-05-25 10:41:12 +02:00
Willy Tarreau	f2e8ee2b46	[MEDIUM] reduce risk of event starvation in ev_sepoll If too many events are set for spec I/O, those ones can starve the polled events. Experiments show that when polled events starve, they quickly turn into spec I/O, making the situation even worse. While we can reduce the number of polled events processed at once, we cannot do this on speculative events because most of them are new ones (avg 2/3 new - 1/3 old from experiments). The solution against this problem relies on those two factors : 1) one FD registered as a spec event cannot be polled at the same time 2) even during very high loads, we will almost never be interested in simultaneous read and write streaming on the same FD. The first point implies that during starvation, we will not have more than half of our FDs in the poll list, otherwise it means there is less than that in the spec list, implying there is no starvation. The second point implies that we're statistically only interested in half of the maximum number of file descriptors at once, because we will unlikely have simultaneous reads and writes for the same buffer during long periods. So, if we make it possible to drain maxsock/2/2 during peak loads, then we can ensure that there will be no starvation effect. This means that we must always allocate maxsock/4 events for the poller. Last, sepoll uses an optimization consisting in reducing the number of calls to epoll_wait() to once every two polls. However, when dealing with many spec events, we can wait very long and skipping epoll_wait() every second time increases latency. For this reason, we try to detect if we are beyond a reasonable limit and stop doing so at this stage.	2008-05-25 10:39:02 +02:00
Krzysztof Piotr Oledzki	efe3b6f524	[MINOR] Allow to specify a domain for a cookie This patch allows specifying the domain used when inserting a cookie that provides session stickiness. This is useful, for example, with wildcard domains. The patch adds one new variable to the struct proxy: cookiedomain. When set, the domain is appended to a Set-Cookie header. The domain name is validated using the new invalid_domainchar() function. It is basically invalid_char() limited to [A-Za-z0-9_.-]. Yes, the test is too trivial and does not cover all wrong situations, but the main purpose is to detect the most common mistakes, not intentional abuses. The underscore ("_") character is not RFC-valid, but as it is often (mis)used, I decided to allow it.	2008-05-25 10:09:02 +02:00
Marek Majkowski	9c30fc161f	[MEDIUM] add support for URI hash depth and length limits This patch adds two optional arguments "len" and "depth" to "balance uri". They are used to limit the length in characters of the analysis, as well as the number of directory components it applies to.	2008-04-28 00:43:55 +02:00
Krzysztof Piotr Oledzki	8e4b21d5eb	[BUG] Flush buffers also where there are exactly 0 bytes left I noticed it was possible to get truncated http/csv stats. Sometimes. Usually the problem disappeared as fast as it appeared, but once it happened that my http-stats page was truncated for about one hour. It was quite weird as it happened independently for csv and http output and it took me some time to track & fix this bug. Both buffer_write & buffer_write_chunk used to return 0 in two situations: in case of success or when there were exactly 0 bytes left. The first one is intentional but I believe the second one is not, as it was not possible to distinguish between a successful write and an unsuccessful one, which means that if the buffer was 100% filled, it was never flushed and it was not possible to write more data. This patch fixes this problem.	2008-04-21 07:22:33 +02:00
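The ambiguity described in the commit above comes from one return value (0) carrying two meanings. A minimal sketch of one way to separate them follows. The convention used here (a negative value for success, the number of free bytes otherwise) is an assumption for illustration, and `struct buf_sketch` and `buf_write_sketch()` are hypothetical names rather than the actual HAProxy functions.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical fixed-size output buffer, for illustration only. */
struct buf_sketch {
	char data[16];
	size_t len;     /* bytes currently stored */
};

/* Append 'len' bytes of 'msg' to the buffer.
 * Returns -1 on success. On failure nothing is copied and the number of
 * free bytes is returned, so 0 now always means "buffer completely
 * full, flush it first" and can no longer be mistaken for success. */
int buf_write_sketch(struct buf_sketch *b, const char *msg, size_t len)
{
	size_t room = sizeof(b->data) - b->len;

	if (len > room)
		return (int)room;
	memcpy(b->data + b->len, msg, len);
	b->len += len;
	return -1;
}
```

A caller can then test for a non-negative return value to decide that the buffer must be flushed before retrying, including when exactly 0 bytes are left.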
Willy Tarreau	7b4c5aee55	[RELEASE] Released version 1.3.15 Released version 1.3.15 with the following main changes : - [BUILD] Added support for 'make install' - [BUILD] Added 'install-man' make target for installing the man page - [BUILD] Added 'install-bin' make target - [BUILD] Added 'install-doc' make target - [BUILD] Removed "/" after '$(DESTDIR)' in install targets - [BUILD] Changed 'install' target to install the binaries first - [BUILD] Replace hardcoded 'LD = gcc' with 'LD = $(CC)' - [MEDIUM]: Inversion for options - [MEDIUM]: Count retries and redispatches also for servers, fix redistribute_pending, extend logs, %d->%u cleanup - [BUG]: Restore clearing t->logs.bytes - [MEDIUM]: rework checks handling - [DOC] Update a "contrib" file with a hint about a scheme used for formathing subjects - [MEDIUM] Implement "track [<backend>/]<server>" - [MINOR] Implement persistent id for proxies and servers - [BUG] Don't increment server connections too much + fix retries - [MEDIUM]: Prevent redispatcher from selecting the same server, version #3 - [MAJOR] proto_uxst rework -> SNMP support - [BUG] appsession lookup in URL does not work - [BUG] transparent proxy address was ignored in backend - [BUG] hot reconfiguration failed because of a wrong error check - [DOC] big update to the configuration manual - [DOC] large update to the configuration manual - [DOC] document more options - [BUILD] major rework of the GNU Makefile - [STATS] add support for "show info" on the unix socket - [DOC] document options forwardfor to logasap - [MINOR] add support for the "backlog" parameter - [OPTIM] introduce global parameter "tune.maxaccept" - [MEDIUM] introduce "timeout http-request" in frontends - [MINOR] tarpit timeout is also allowed in backends - [BUG] increment server connections for each connect() - [MEDIUM] add a turn-around state of one second after a connection failure - [BUG] fix typo in redispatched connection - [DOC] document options nolinger to ssl-hello-chk - [DOC] added 
documentation for "option tcplog" to "use_backend" - [BUG] connect_server: server might not exist when sending error report - [MEDIUM] support fully transparent proxy on Linux (USE_LINUX_TPROXY) - [MEDIUM] add non-local bind to connect() on Linux - [MINOR] add transparent proxy support for balabit's Tproxy v4 - [BUG] use backend's source and not server's source with tproxy - [BUG] fix overlapping server flags - [MEDIUM] fix server health checks source address selection - [BUG] build failed on CONFIG_HAP_LINUX_TPROXY without CONFIG_HAP_CTTPROXY - [DOC] added "server", "source" and "stats" keywords - [DOC] all server parameters have been documented - [DOC] document all req* and rsp* keywords. - [DOC] added documentation about HTTP header manipulations - [BUG] log response byte count, not request - [BUILD] code did not build in full debug mode - [BUG] fix truncated responses with sepoll - [MINOR] use s->frt_addr as the server's address in transparent proxy - [MINOR] fix configuration hint about timeouts - [DOC] minor cleanup of the doc and notice to contributors - [MINOR] report correct section type for unknown keywords. - [BUILD] update MacOS Makefile to build on newer versions - [DOC] fix erroneous "useallbackups" option in the doc - [DOC] applied small fixes from early readers - [MINOR] add configuration support for "redir" server keyword - [MEDIUM] completely implement the server redirection method - [TESTS] add a test case for the server redirection mechanism - [DOC] add a configuration entry for "server ... redir <prefix>" - [BUILD] backend.c and checks.c did not build without tproxy ! - Revert "[BUILD] backend.c and checks.c did not build without tproxy !" - [BUILD] backend.c and checks.c did not build without tproxy ! - [OPTIM] used unsigned ints for HTTP state and message offsets - [OPTIM] GCC4's builtin_expect() is suboptimal - [BUG] failed conns were sometimes incremented in the frontend! 
- [BUG] timeout.check was not pre-set to eternity - [TESTS] add test-pollers.cfg to easily report pollers in use - [BUG] do not apply timeout.connect in checks if unset - [BUILD] ensure that makefile understands USE_DLMALLOC=1 - [MINOR] silent gcc for a wrong warning - [CLEANUP] update .gitignore to ignore more temporary files - [CLEANUP] report dlmalloc's source path only if explictly specified - [BUG] str2sun could leak a small buffer in case of error during parsing - [BUG] option allbackups was not working anymore in roundrobin mode - [MAJOR] implementation of the "leastconn" load balancing algorithm - [BUILD] ensure that users don't build without setting the target anymore. - [DOC] document the leastconn LB algo - [MEDIUM] fix stats socket limitation to 16 kB - [DOC] fix unescaped space in httpchk example. - [BUG] fix double-decrement of server connections - [TESTS] add a test case for port mapping - [TESTS] add a benchmark for integer hashing - [TESTS] add new methods in ip-hash test file - [MAJOR] implement parameter hashing for POST requests	2008-04-19 21:25:12 +02:00
Willy Tarreau	192ee3e630	[BUILD] fix build of POST analysis code with gcc < 3 move variable declarations at beginning of blocks.	2008-04-19 21:24:56 +02:00
matt.farnsworth@nokia.com	1c2ab96be5	[MAJOR] implement parameter hashing for POST requests This patch extends the "url_param" load balancing method by introducing the "check_post" option. Using this option enables analysis of the beginning of POST requests to search for the specified URL parameter. The patch also fixes a few minor typos in comments that were discovered during code review.	2008-04-15 15:30:41 +02:00
Willy Tarreau	f899b94e63	[BUG] fix double-decrement of server connections If a client does a sudden dirty close (CL_STCLOSE) during a server connect turn-around, then the number of server connections is decremented twice. This causes huge problems on the affected server because when its connection number becomes negative, it overflows and prevents the server from accepting new connections due to an apparent saturation. The fix consists in not decrementing the counter if the server is in a turn-around state.	2008-03-28 18:19:05 +01:00
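The overflow described in the commit above can be illustrated with a minimal sketch. The struct, field names and helpers below are hypothetical stand-ins and not HAProxy's actual code; the sketch only assumes that the connection counter is unsigned and that the fix amounts to skipping the decrement during turn-around.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified server state; not HAProxy's struct server. */
struct demo_server {
    unsigned int cur_sess;   /* connections currently counted on the server */
    unsigned int maxconn;    /* threshold above which the server looks full */
};

/* Release one connection slot, except during a turn-around, where the
 * slot is assumed to have been released already by the failed connect. */
static void demo_release_conn(struct demo_server *srv, bool in_turnaround)
{
    assert(srv != NULL);
    if (in_turnaround)
        return;              /* skipping this avoids the double decrement */
    srv->cur_sess--;
}

/* A wrapped-around counter is huge, so the server appears saturated. */
static bool demo_is_saturated(const struct demo_server *srv)
{
    assert(srv != NULL);
    return srv->cur_sess >= srv->maxconn;
}
```

Decrementing an unsigned counter that is already zero wraps it to a very large value, which is why the affected server looked saturated.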
Willy Tarreau	39f7e6d516	[MEDIUM] fix stats socket limitation to 16 kB Due to the way the stats socket works, it was not possible to maintain the information related to the command entered, so after filling a whole buffer, the request was lost and it was considered that there was nothing to write anymore. The major reason was that some flags were passed directly during the first call to stats_dump_raw() instead of being stored persistently in the session. To definitely fix this problem, flags were added to the stats member of the session structure. A second problem appeared. When the stats were produced, a first call to client_retnclose() was performed, then one or multiple subsequent calls to buffer_write_chunks() were done. But once the stats buffer was full and a reschedule occurred, the buffer was flushed, the write flag cleared from the buffer and nothing was done to re-arm it. For this reason, a check was added in the proto_uxst_stats() function in order to re-call the client FSM when data were added by stats_dump_raw(). Finally, the whole unix stats dump FSM was rewritten to avoid all the magic it depended on. It is now simpler and looks more like the HTTP one.	2008-03-17 22:08:01 +01:00
Willy Tarreau	51406233bb	[MAJOR] implementation of the "leastconn" load balancing algorithm The new "leastconn" LB algorithm selects the server which has the least established or pending connections. The weights are considered, so that a server with a weight of 20 will get twice as many connections as the server with a weight of 10. The algorithm respects the minconn/maxconn settings, as well as the slowstart since it is a dynamic algorithm. It also correctly supports backup servers (one and all). It is generally suited for protocols with long sessions (such as remote terminals and databases), as it will ensure that upon restart, a server with no connection will take all new ones until its load is balanced with others. A test configuration has been added in order to ease regression testing.	2008-03-10 22:04:30 +01:00
Willy Tarreau	f4cca45b5e	[BUG] option allbackups was not working anymore in roundrobin mode Commit `3168223a7b` broke option "allbackups" in roundrobin mode due to an erroneous structure member replacement in backend.c. The PR_O_USE_ALL_BK flag was not tested in the right member anymore. This bug uncovered another one, by which all backup servers would be used whatever the option's value, if all of them had been seen as simultaneously failed at one moment. This patch fixes the two stupid errors. Correctness has been tested using the test-fwrr.cfg config example.	2008-03-08 21:42:54 +01:00
Willy Tarreau	caf720d3ff	[BUG] str2sun could leak a small buffer in case of error during parsing Matt Farnsworth reported a memory leak in str2sun() in case a too large socket path is passed. The bug is very minor because it only happens once during config parsing, but has to be fixed nevertheless. The patch Matt provided could even be improved by completely removing the useless strdup() in this function.	2008-03-07 10:07:04 +01:00
Krzysztof Piotr Oledzki	2c6962c3c0	[MAJOR] proto_uxst rework -> SNMP support Currently there is a ~16KB limit for a data size passed via unix socket. It is caused by a trivial bug that is going to be fixed soon, however in most cases there is no need to dump full stats. This patch makes it possible to select a scope of dumped data by extending current "show stat" to "show stat [<iid> <type> <sid>]": - iid is a proxy id, -1 to dump all proxies - type selects type of dumpable objects: 1 for frontend, 2 for backend, 4 for server, -1 for all types. Values can be ORed, for example: 1+2=3 -> frontend+backend. 1+2+4=7 -> frontend+backend+server. - sid is a service id, -1 to dump everything from the selected proxy. To do this I implemented a new session flag (SN_STAT_BOUND), added three variables in data_ctx.stats (iid, type, sid), modified dumpstats.c and completely reworked the process_uxst_stats: now it waits for a "\n" terminated string, splits args and uses them. BTW: It should be quite easy to add new commands, for example to enable/disable servers, the only problem I can see is a not very lucky config name (stats socket). :| During the work I also fixed two bugs: - s->flags were not initialized for proto_uxst - missing comma if throttling not enabled (caused by a stupid change in "Implement persistent id for proxies and servers") Other changes: - No more magic type values, use STATS_TYPE_FE/STATS_TYPE_BE/STATS_TYPE_SV - Don't memset full s->data_ctx (it was clearing s->data_ctx.stats.{iid/type/sid}); instead initialize stats.sv & stats.sv_st (stats.px and stats.px_st were already initialized) With all these changes it was extremely easy to write a short perl plugin for a perl-enabled net-snmp (also included in this patch). 29385 is my PEN (Private Enterprise Number) and I'm willing to donate the SNMPv2-SMI::enterprises.29385.106.* OIDs for HAProxy if there is nothing assigned already.	2008-03-04 06:32:16 +01:00
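The ORed <type> argument of "show stat" described above can be sketched as a plain bitmask test. The constant names and the helper are hypothetical; only the values 1, 2, 4 and -1 come from the commit message, and the real STATS_TYPE_* constants may be defined differently.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical filter bits mirroring the values given in the message. */
enum {
    DEMO_TYPE_FE = 1,   /* frontend */
    DEMO_TYPE_BE = 2,   /* backend */
    DEMO_TYPE_SV = 4    /* server */
};

/* Return true if an object of the given type passes the filter.
 * A filter of -1 selects every type. */
static bool demo_type_selected(int filter, int type)
{
    assert(type == DEMO_TYPE_FE || type == DEMO_TYPE_BE || type == DEMO_TYPE_SV);
    if (filter == -1)
        return true;
    return (filter & type) != 0;
}
```

With this scheme a filter of 3 dumps frontends and backends but skips servers, while 7 or -1 dumps all three kinds.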
Krzysztof Piotr Oledzki	5a329cf017	[MEDIUM]: Prevent redispatcher from selecting the same server, version #3 When haproxy decides that a session needs to be redispatched it chooses a server, but there is no guarantee for it to be a different one. So, it often happens that the selected server is exactly the same as it was previously, so a client ends up with a 503 error anyway, especially when one server has a much bigger weight than the others. Changes from the previous version: - drop stupid and unnecessary SN_DIRECT changes - assign_server(): use srvtoavoid to keep the old server and clear s->srv so SRV_STATUS_NOSRV guarantees that t->srv == NULL (again) and get_server_rr_with_conns has chances to work (previously we were passing a NULL here) - srv_redispatch_connect(): remove t->srv->cum_sess and t->srv->failed_conns incrementing as t->srv was guaranteed to be NULL - add avoididx to get_server_rr_with_conns. I hope I correctly understand this code. - fix http_flush_cookie_flags() and move it to assign_server_and_queue() directly. The code here was supposed to set CK_DOWN and clear CK_VALID, but: (TX_CK_VALID | TX_CK_DOWN) == TX_CK_VALID == TX_CK_MASK so: if ((txn->flags & TX_CK_MASK) == TX_CK_VALID) txn->flags ^= (TX_CK_VALID | TX_CK_DOWN); was really a: if ((txn->flags & TX_CK_MASK) == TX_CK_VALID) txn->flags &= TX_CK_VALID Now haproxy logs "--DI" after redispatching connection. - defer srv->redispatches++ and s->be->redispatches++ so they are called only if a connection was redispatched, not only supposed to. 
- don't increment lbconn if redispatcher selected the same server - don't count unsuccessfully redispatched connections as redispatched connections - don't count redispatched connections as errors, so: - the number of connections effectively served by a server is: srv->cum_sess - srv->failed_conns - srv->retries - srv->redispatches and SUM(servers->failed_conns) == be->failed_conns - requires the "Don't increment server connections too much + fix retries" patch - needs little more testing and probably some discussion so reverting to the RFC state Tests #1: retries 4 redispatch i) 1 server(s): b (wght=1, down) b) sessions=5, lbtot=1, err_conn=1, retr=4, redis=0 -> request failed ii) server(s): b (wght=1, down), u (wght=1, down) b) sessions=4, lbtot=1, err_conn=0, retr=3, redis=1 u) sessions=1, lbtot=1, err_conn=1, retr=0, redis=0 -> request FAILED iii) 2 server(s): b (wght=1, down), u (wght=1, up) b) sessions=4, lbtot=1, err_conn=0, retr=3, redis=1 u) sessions=1, lbtot=1, err_conn=0, retr=0, redis=0 -> request OK iv) 2 server(s): b (wght=100, down), u (wght=1, up) b) sessions=4, lbtot=1, err_conn=0, retr=3, redis=1 u) sessions=1, lbtot=1, err_conn=0, retr=0, redis=0 -> request OK v) 1 server(s): b (down for first 4 SYNS) b) sessions=5, lbtot=1, err_conn=0, retr=4, redis=0 -> request OK Tests #2: retries 4 i) 1 server(s): b (down) b) sessions=5, lbtot=1, err_conn=1, retr=4, redis=0 -> request FAILED	2008-03-04 06:16:37 +01:00
Krzysztof Piotr Oledzki	626a19b66f	[BUG] Don't increment server connections too much + fix retries Commit `98937b8757` while fixing one bug introduced another one. With "retries 4" and "option redispatch" haproxy tries to connect 4 times to one server and 1 time to a second one. However logs showed 5 connections to the first server (the last one was counted twice) and 2 to the second. This patch also fixes srv->retries and be->retries increments. Now I get: 3 retries and 1 error in a first server (4 cum_sess) and 1 error in a second server (1 cum_sess) with: retries 4 option redispatch and: 4 retries and 1 error (5 cum_sess) with: retries 4 So, the number of connections effectively served by a server is: srv->cum_sess - srv->failed_conns - srv->retries	2008-03-04 06:11:17 +01:00
Krzysztof Piotr Oledzki	f58a962247	[MINOR] Implement persistent id for proxies and servers This patch adds a possibility to set a persistent id for a proxy/server. Now, even if some proxies/servers are inserted/deleted/moved, iids and sids can still be used reliably. Some people add servers with tricky names (BACKEND or FRONTEND for example). So I also added one more field ('type') to distinguish between a backend (0), frontend (1) and server (2) without complicated logic: if name==BACKEND and sid==0 then type is BACKEND else type is SERVER, etc for a FRONTEND. It also makes it possible to have one frontend with more than one IP (a patch coming soon) with independent stats - for example to differentiate between remote and local traffic. Finally, I added documentation about the CSV format. This patch depends on '[MEDIUM] Implement "track [<backend>/]<server>"'	2008-02-28 17:23:59 +01:00
Krzysztof Piotr Oledzki	c8b16fc948	[MEDIUM] Implement "track [<backend>/]<server>" This patch implements the ability to set the current state of one server by tracking another one. It: - adds two variables: tracknext, tracked to struct server - implements findserver(), similar to findproxy() - adds "track" keyword accepting both "proxy/server" and "server" (assuming current proxy) - verifies that checks and tracking are not both enabled at the same time - changes set_server_down() to notify tracking server - creates set_server_up(), set_server_disabled(), set_server_enabled() by moving the code from process_chk() and adding notifications - changes stats to show a name of tracked server instead of Chk/Dwn/Dwntime(html) or by adding new variable (csv) Changes from the previous version: - it is possible to track independently of the declaration order - one extra comma bug is fixed - new condition to check if there is no disable-on-404 inconsistency	2008-02-27 10:39:53 +01:00
Willy Tarreau	6054819a70	[BUG] do not apply timeout.connect in checks if unset tv_bound() does not consider infinite timeouts, so we must check that timeout.connect is set before applying it to the checks.	2008-02-17 11:34:10 +01:00
Ryan Warnick	6d0b1fac23	[BUG] appsession lookup in URL does not work We've been trying to use the latest release (1.3.14.2) of haproxy to do sticky sessions. Cookie insertion is not an option for us, although we would much rather use it, as we are trying to work around a problem where cookies are unreliable. The appsession functionality only partially worked (it wouldn't read the session id out of a query string) until we made the following code change to the get_srv_from_appsession function in proto_http.c.	2008-02-17 11:24:35 +01:00
Willy Tarreau	3a70f94991	[BUG] timeout.check was not pre-set to eternity If timeout.check was not set, checks were using 0 as the timeout, causing odd behaviours.	2008-02-15 11:15:34 +01:00
Willy Tarreau	50fd1e1e3b	[BUG] failed conns were sometimes incremented in the frontend!	2008-02-15 10:09:15 +01:00
Willy Tarreau	70bcfb77a7	[OPTIM] GCC4's builtin_expect() is suboptimal GCC4 is stupid (unbelievable news!). When some code uses __builtin_expect(x != 0, 1), it really performs the check of x != 0 then tests that the result is not zero! This is a double check when only one was expected. Some performance drops of 10% in the HTTP parser code have been observed due to this bug. GCC 3.4 is fine though. A solution consists in expecting that the tested value is 1. In this case, it emits the correct code, but it's still not optimal it seems. Finally the best solution is to ignore likely() and to pray for the compiler to emit correct code. However, we still have to fix unlikely() to remove the test there too, and to fix all code which passed pointers over there to pass integers instead.	2008-02-14 23:14:33 +01:00
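The workaround described above might look roughly like the following sketch. The demo_ macro names are hypothetical and this is not claimed to be the exact definition used in the tree; it only follows the message: likely() is reduced to its argument, and unlikely() hands the integer value straight to __builtin_expect() without an extra "!= 0" test.

```c
#include <assert.h>
#include <stddef.h>

#if defined(__GNUC__)
/* No hint at all: leave the decision to the compiler. */
#define demo_likely(x)   (x)
/* Pass the integer condition directly, without adding "!= 0". */
#define demo_unlikely(x) (__builtin_expect((long)(x), 0))
#else
#define demo_likely(x)   (x)
#define demo_unlikely(x) (x)
#endif

/* Count non-zero entries; callers pass integer conditions, not pointers. */
static int demo_count_nonzero(const int *v, size_t n)
{
    int count = 0;

    assert(v != NULL);
    for (size_t i = 0; i < n; i++) {
        if (demo_unlikely(v[i] == 0))
            continue;
        count++;
    }
    return count;
}
```

Because the macro argument is cast to long, a pointer must first be turned into an integer condition such as `ptr == NULL` at the call site, which matches the last sentence of the message.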
Willy Tarreau	e69eada057	[OPTIM] used unsigned ints for HTTP state and message offsets State and offsets within http_msg were incorrectly set to signed int. Turning them into unsigned slightly improved performance while reducing code size.	2008-02-14 23:14:30 +01:00
Willy Tarreau	cf1d572f2a	[BUILD] backend.c and checks.c did not build without tproxy ! missing #ifdefs. The right patch this time!	2008-02-14 20:28:18 +01:00
Willy Tarreau	21d2af3e9f	Revert "[BUILD] backend.c and checks.c did not build without tproxy !" This reverts commit `3c3c0122f8`. This commit was buggy as it also removed previous tproxy changes !	2008-02-14 20:25:24 +01:00
Willy Tarreau	3c3c0122f8	[BUILD] backend.c and checks.c did not build without tproxy ! missing #ifdefs.	2008-02-13 22:22:56 +01:00
Willy Tarreau	9c33612f53	[MEDIUM] completely implement the server redirection method Now when a server has "redir <prefix>" on its config line, any HEAD or GET request addressing it will lead to a 302 with Location set to "<prefix>" immediately followed by the relative URI of the incoming request. This makes it very easy to send redirect to browsers to check remote static servers, as well as to provide redirection for remote sites when the local one is down.	2008-02-13 00:55:49 +01:00
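As a rough illustration of the behaviour described above, the Location value is the configured prefix immediately followed by the relative URI of the request. The helper below is a hypothetical sketch, not the function used in proto_http.c.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Build "<prefix><relative URI>" into out. Returns the resulting length,
 * or -1 if the result (plus the trailing NUL) does not fit in size bytes. */
static int demo_build_location(char *out, size_t size,
                               const char *prefix, const char *rel_uri)
{
    size_t plen, ulen;

    assert(out != NULL && prefix != NULL && rel_uri != NULL);
    plen = strlen(prefix);
    ulen = strlen(rel_uri);
    if (plen + ulen >= size)
        return -1;
    memcpy(out, prefix, plen);
    memcpy(out + plen, rel_uri, ulen + 1);   /* also copies the NUL */
    return (int)(plen + ulen);
}
```

For a server configured with the made-up prefix "http://static.example.com", a GET for "/img/logo.png" would thus be answered with a 302 whose Location is "http://static.example.com/img/logo.png".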
Willy Tarreau	7a58a72e85	[MINOR] add configuration support for "redir" server keyword The servers now support the "redir" keyword, making it possible to return a 302 with the specified prefix in front of the request instead of connecting to them. This is generally useful for multi-site load balancing but may also serve in order to achieve very high traffic rate. The keyword has only been added to the config parser and to structures, it's not used yet.	2008-02-13 00:55:49 +01:00
Willy Tarreau	6daf34352f	[MINOR] report correct section type for unknown keywords. An unknown keyword was always reported in section "listen" for any section type (defaults, listen, frontend, backend, ...).	2008-01-22 16:44:08 +01:00
Krzysztof Piotr Oledzki	5259dfedd1	[MEDIUM]: rework checks handling This patch adds two new variables: fastinter and downinter. When server state is: - non-transitionally UP -> inter (no change) - transitionally UP (going down), unchecked or transitionally DOWN (going up) -> fastinter - down -> downinter It allows setting something like: server sr6 127.0.51.61:80 cookie s6 check inter 10000 downinter 20000 fastinter 500 fall 3 weight 40 In the above example haproxy uses 10000ms between checks but as soon as one check fails fastinter (500ms) is used. If server is down downinter (20000) is used or fastinter (500ms) if one check passes. Fastinter is also used when haproxy starts. New "timeout.check" variable was added, if set haproxy uses it as an additional read timeout, but only after a connection has been already established. I was thinking about using "timeout.server" here but most people set this with an additional reserve but still want checks to kick out laggy servers. Please also note that in most cases check request is much simpler and faster to handle than normal requests so this timeout should be smaller. I also changed the timeout used for check connections establishing. Changes from the previous version: - use tv_isset() to check if the timeout is set, - use min("timeout connect", "inter") but only if "timeout check" is set as this min alone may be too short for full (connect + read) check, - debug code (fprintf) commented/removed - documentation Compile tested only (sorry!) as I'm currently traveling but changes are rather small and trivial.	2008-01-22 11:29:06 +01:00
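The state-to-interval mapping listed above can be sketched as a small selection function. The enum, struct and function names are hypothetical; only the mapping itself and the example values (10000, 20000, 500) come from the commit message.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical check states mirroring the cases listed in the message. */
enum demo_check_state {
    DEMO_UP,          /* non-transitionally UP */
    DEMO_GOING_DOWN,  /* transitionally UP */
    DEMO_UNCHECKED,   /* no check result yet, e.g. at startup */
    DEMO_GOING_UP,    /* transitionally DOWN */
    DEMO_DOWN         /* down */
};

struct demo_check_cfg {
    unsigned int inter;      /* ms between checks when stable UP */
    unsigned int fastinter;  /* ms between checks during transitions */
    unsigned int downinter;  /* ms between checks when DOWN */
};

/* Pick the delay before the next health check. */
static unsigned int demo_next_interval(const struct demo_check_cfg *cfg,
                                       enum demo_check_state st)
{
    assert(cfg != NULL);
    switch (st) {
    case DEMO_UP:
        return cfg->inter;
    case DEMO_DOWN:
        return cfg->downinter;
    default:
        return cfg->fastinter;
    }
}
```

With the values from the example server line, a stable server is checked every 10000 ms, a server in transition every 500 ms, and a down server every 20000 ms.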
Krzysztof Piotr Oledzki	f1e1cb463f	[BUG]: Restore clearing t->logs.bytes Commit `8b3977ffe3` removed "t->logs.bytes_in = 0;" but instead it should change it into "t->logs.bytes_out = 0;" as since `583bc96606` counters are incremented not set. It should be incremented in session_process_counters while sending data to a client: bytes = s->rep->total - s->logs.bytes_out; s->logs.bytes_out = s->rep->total; However, if we increment (set) s->logs.bytes_out while processing "logasap", statistics get wrong values added for headers: 0 or even negative if haproxy adds some headers itself. To test it, please enable logasap and download one empty file and look at stats. Without my fix information available on that page are invalid, for example: # pxname,svname,qcur,qmax,scur,smax,slim,stot,bin,bout,dreq,dresp,ereq,econ,eresp,wretr,wredis,status,weight,act,bck,chkfail,chkdown,lastchg,downtime,qlimit,pid,iid,sid,throttle,lbtot, www,b,0,0,0,1,,1,24,-92,,0,,0,0,0,,UP,1,1,0,0,0,3121,0,,1,2,1,,1, www,BACKEND,0,0,0,1,0,1,24,-92,0,0,,0,0,0,0,UP,1,1,0,,0,3121,0,,1,2,0,,1,	2008-01-22 10:30:26 +01:00
Willy Tarreau	0f68eaca1a	[MINOR] fix configuration hint about timeouts Do not talk about "clitimeout", "contimeout" or "srvtimeout" anymore.	2008-01-20 23:25:06 +01:00
Willy Tarreau	bd41428fee	[MINOR] use s->frt_addr as the server's address in transparent proxy There's no point trying to check original dest addr with only one method when doing transparent proxy as in full transparent mode, the real destination address is required. Let's copy the one from the frontend.	2008-01-19 13:46:35 +01:00
Willy Tarreau	d6f087ea1c	[BUG] fix truncated responses with sepoll Due to the way Linux delivers EPOLLIN and EPOLLHUP, a closed connection received after some server data sometimes results in truncated responses if the client disconnects before server starts to respond. The reason is that the EPOLLHUP flag is processed as an indication of end of transfer while some data may remain in the system's socket buffers. This problem could only be triggered with sepoll, although nothing should prevent it from happening with normal epoll. In fact, the work factoring performed by sepoll increases the risk that this bug appears. The fix consists in making FD_POLL_HUP and FD_POLL_ERR sticky and that they are only checked if FD_POLL_IN is not set, meaning that we have read all pending data. That way, the problem is definitely fixed and sepoll still remains about 17% faster than epoll since it can take into account all information returned by the kernel.	2008-01-18 17:20:13 +01:00
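The fix described above can be sketched as follows: hang-up and error indications are remembered across polling rounds, but they are only acted upon once no readable data is reported. The flag values and helper names are hypothetical; only the flag roles (FD_POLL_IN, FD_POLL_HUP, FD_POLL_ERR) come from the commit message.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical flag values; the real FD_POLL_* constants may differ. */
enum {
    DEMO_POLL_IN  = 0x01,
    DEMO_POLL_ERR = 0x08,
    DEMO_POLL_HUP = 0x10
};

#define DEMO_POLL_STICKY (DEMO_POLL_ERR | DEMO_POLL_HUP)

/* Merge freshly reported events into the stored state: HUP and ERR are
 * sticky, while IN always reflects the latest report. */
static unsigned int demo_merge_events(unsigned int state, unsigned int events)
{
    assert((events & ~(unsigned int)(DEMO_POLL_IN | DEMO_POLL_STICKY)) == 0);
    return (state & DEMO_POLL_STICKY) | events;
}

/* Only treat HUP/ERR as end of transfer once no input is pending. */
static bool demo_should_close(unsigned int state)
{
    if (state & DEMO_POLL_IN)
        return false;
    return (state & DEMO_POLL_STICKY) != 0;
}
```

When IN and HUP arrive together, the reader first drains the socket; the remembered HUP is honoured on the next round, once IN is no longer reported.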
Willy Tarreau	b881608e57	[BUILD] code did not build in full debug mode	2008-01-18 12:18:15 +01:00
Willy Tarreau	8b3977ffe3	[BUG] log response byte count, not request Due to a shameless copy-paste typo, the number of bytes logged was from the request and not the response. This bug has been present for a long time.	2008-01-18 11:16:32 +01:00
Willy Tarreau	e8c66afd41	[MEDIUM] fix server health checks source address selection The source address selection for health checks did not consider the new transparent proxy method. Rely on the same unified function as the other connect() calls. This patch also fixes a bug by which the proxy's source address was ignored if cttproxy was used.	2008-01-13 18:40:14 +01:00
Willy Tarreau	786d1915b0	[BUG] use backend's source and not server's source with tproxy copy-paste typo.	2008-01-13 18:10:06 +01:00
Willy Tarreau	0a45989de3	[MINOR] add transparent proxy support for balabit's Tproxy v4 Balabit's TPROXY version 4 which replaces CTTPROXY provides a similar API to the previous proxy, but relies on IP_FREEBIND instead of IP_TRANSPARENT. Let's add it.	2008-01-13 17:37:16 +01:00
Willy Tarreau	5b6995c31b	[MEDIUM] add non-local bind to connect() on Linux Using some Linux kernel patches which add the IP_TRANSPARENT SOL_IP option, it is possible to bind to a non-local address without having to resort to any sort of NAT, thus causing no performance degradation. This is by far faster and cleaner than the previous CTTPROXY method. The code has been slightly changed in order to remain compatible with CTTPROXY as a fallback for the new method when it does not work. It is not needed anymore to specify the outgoing source address for connect, it can remain 0.0.0.0.	2008-01-13 16:31:17 +01:00
Willy Tarreau	b1e52e8c44	[MEDIUM] support fully transparent proxy on Linux (USE_LINUX_TPROXY) Using some Linux kernel patches, it is possible to redirect non-local traffic to local sockets when IP forwarding is enabled. In order to enable this option, we introduce the "transparent" option keyword on the "bind" command line. It will make the socket reachable by remote sources even if the destination address does not belong to the machine.	2008-01-13 14:49:51 +01:00
Willy Tarreau	fe10a0619d	[BUG] connect_server: server might not exist when sending error report In connect_server(), we may send an alert with the server name while the server might not exist, eg in dispatch mode.	2008-01-12 22:22:34 +01:00
Willy Tarreau	00559e7117	[BUG] fix typo in redispatched connection A copy-paste typo was present in the reconnection code responsible for redispatching. The client's FSM would not be re-evaluated if an error occurred. It looks harmless but better fix it.	2008-01-06 23:46:19 +01:00
Willy Tarreau	541b5c24ca	[MEDIUM] add a turn-around state of one second after a connection failure Several users have complained that when haproxy gets a connection failure due to an active reject from a server, it immediately retries, often leading to the same situation being repeated until the retry counter reaches zero. Now if a connection error shows up, a turn-around state of 1 second is applied before retrying. This is performed by faking a connection timeout in order not to touch much code. However, a cleaner method would involve an extra state.	2008-01-06 23:34:21 +01:00
Krzysztof Piotr Oledzki	25b501a6b1	[MEDIUM]: Count retries and redispatches also for servers, fix redistribute_pending, extend logs, %d->%u cleanup This patch extends a little previously added functionality to also count retries and redispatches for servers. Now it is possible to know which server causes redispatches as it is not always the same that takes most retries. While working with the code I found that redistribute_pending() does not increment srv->redispatches && be->redispatches. I don't know how to test it but I think the fix is correct. If not I can withdraw it. I also extended logs to show how many retries were done and if redispatching was necessary ('+'). I'm using an additional session flag SN_REDISP to match redispatched connections. I had to rearrange all defines in session.h to make more room for it. The documentation about logs was also fixed a little (sorry, english only), as current version uses totally different format. BTW: examples are still outdated, maybe next time... Finally, I changed %d -> %u for retries/redispatches as those variables are declared as unsigned.	2008-01-06 16:43:05 +01:00
Willy Tarreau	98937b8757	[BUG] increment server connections for each connect() It was abnormal to see more connect errors than connect attempts. This was caused by the fact that the server's connection count was not incremented for failed connect() attempts. Now the per-server connections are correctly incremented for each connect() attempt. This includes the retries too. The number of connections effectively served by a server will then be : srv->cum_sess - srv->errors - srv->warnings	2008-01-06 15:43:38 +01:00
Willy Tarreau	51c9bde060	[MINOR] tarpit timeout is also allowed in backends Since the tarpit action may be set in backends too, its timeout must be configurable there.	2008-01-06 13:40:03 +01:00
Willy Tarreau	036fae0ec9	[MEDIUM] introduce "timeout http-request" in frontends In order to offer DoS protection, it may be required to lower the maximum accepted time to receive a complete HTTP request without affecting the client timeout. This helps protecting against established connections on which nothing is sent. The client timeout cannot offer a good protection against this abuse because it is an inactivity timeout, which means that if the attacker sends one character every now and then, the timeout will not trigger. With the HTTP request timeout, no matter what speed the client types, the request will be aborted if it does not complete in time.	2008-01-06 13:24:40 +01:00
Willy Tarreau	a0250ba38d	[OPTIM] introduce global parameter "tune.maxaccept" This new parameter makes it possible to override the default number of consecutive incoming connections which can be accepted on a socket. By default it is not limited on single process mode, and limited to 8 in multi-process mode.	2008-01-06 11:22:57 +01:00
Willy Tarreau	c73ce2b111	[MINOR] add support for the "backlog" parameter Add the "backlog" parameter to frontends, to give hints to the system about the approximate listen backlog desired size. In order to protect against SYN flood attacks, one solution is to increase the system's SYN backlog size. Depending on the system, sometimes it is just tunable via a system parameter, sometimes it is not adjustable at all, and sometimes the system relies on hints given by the application at the time of the listen() syscall. By default, HAProxy passes the frontend's maxconn value to the listen() syscall. On systems which can make use of this value, it can sometimes be useful to be able to specify a different value, hence this backlog parameter.	2008-01-06 10:55:10 +01:00
Willy Tarreau	a8efd362b2	[STATS] add support for "show info" on the unix socket It is sometimes required to know some information such as the process uptime when consulting statistics. This patch adds the "show info" command to query this information on the UNIX socket.	2008-01-03 10:19:15 +01:00
Willy Tarreau	9f2b73064b	[BUILD] major rework of the GNU Makefile The build process was getting annoying under some conditions, especially on platforms which are used to set CFLAGS, as well as those which set a lot of complex defines. The new Makefile takes care of this situation by not mixing TARGET, CPU and user values, and by favoring the pre-setting of common variables with the ability to override them. Now CFLAGS and LDFLAGS are set by default and may be overridden without the risk of breaking useful defines. Options are better dealt with, and as a bonus, it was possible to merge the FreeBSD and OpenBSD targets into the common GNU Makefile. The report of build options by "haproxy -vv" has been slightly adapted to the new mode. Options implied by architecture are not reported, only user-specified options are. It is also possible to add options which will not be reported in order not to mangle the output when specifying dirty information such as URLs... The Makefile was copiously documented and it should be easier to build for any target now. Backwards compatibility with older build processes was kept, and warnings are emitted for deprecated build options.	2008-01-02 20:48:34 +01:00
Krzysztof Oledzki	336d475d13	[MEDIUM]: Inversion for options This patch adds a possibility to invert most of the available options by introducing the "no" keyword, available as an additional prefix. If it is found, arguments are shifted left and an additional flag (inv) is set. It allows using all options from a current defaults section, except the selected ones, for example: -- cut here -- defaults contimeout 4200 clitimeout 50000 srvtimeout 40000 option contstats listen stats 1.2.3.4:80 no option contstats -- cut here -- Currently inversion works only with the "option" keyword. The patch also moves last_checks calculation at the end of the readcfgfile() function and changes "PR_O_FORCE_CLO | PR_O_HTTP_CLOSE" into "PR_O_FORCE_CLO" in cfg_opts so it is possible to invert forceclose without breaking httpclose (and vice versa) and to invert tcpsplice in one proxy but to keep a proper last_checks value when tcpsplice is used in another proxy. Now, the code checks for PR_O_FORCE_CLO everywhere it checks for PR_O_HTTP_CLOSE. I also decided to deprecate "redisp" and "redispatch" keywords as it is IMHO better to use "option redispatch" which can be inverted. Some useful documentation was added and at the same time I sorted (alphabetically) all valid options both in the code and the documentation.	2007-12-27 11:52:06 +01:00
Willy Tarreau	e13e9251a6	[BUG] hot reconfiguration failed because of a wrong error check The error check in return of start_proxies checked for exact ERR_RETRYABLE but did not consider the return as a bit field. The function returned both ERR_RETRYABLE and ERR_ALERT, hence the problem.	2007-12-20 23:09:54 +01:00
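The class of bug described above, comparing a bit field for exact equality, can be shown with a minimal sketch. The flag values and function names are hypothetical; only the names ERR_RETRYABLE and ERR_ALERT and the nature of the mistake come from the commit message.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical values; the real ERR_* constants may differ. */
enum {
    DEMO_ERR_NONE      = 0x00,
    DEMO_ERR_RETRYABLE = 0x01,
    DEMO_ERR_ALERT     = 0x08
};

/* Buggy form: only matches when RETRYABLE is the sole bit set. */
static bool demo_retryable_exact(int err)
{
    assert(err >= 0);
    return err == DEMO_ERR_RETRYABLE;
}

/* Fixed form: treats the return value as a bit field. */
static bool demo_retryable(int err)
{
    assert(err >= 0);
    return (err & DEMO_ERR_RETRYABLE) != 0;
}
```

When the function returns both flags at once, the exact comparison misses the retryable condition while the masked test still sees it.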
Willy Tarreau	4009f016c2	[BUG] transparent proxy address was ignored in backend When the "source x.x.x.x usesrc y.y.y.y" statement was present in a backend, the y.y.y.y address was fetched from the server instead of the backend.	2007-12-14 19:54:43 +01:00
Willy Tarreau	127f966f4b	[BUILD] fix build on Solaris due to recent log changes Solaris, as well as many other unixes, doesn't know about sun_len for UNIX domain sockets. It does not honor the __SOCKADDR_COMMON macro either. After looking at MacOS-X man (which is the same as BSD man), OpenBSD man, and examples on the net, it appears that those which support sun_len do not actually use it, or at least ignore it as long as it's zero. Since all the sockaddr structures are zeroed prior to being filled, it causes no problem not to set sun_len, and this fixes build on other platforms. Another problem on Solaris was that the "sun" name is already defined as a macro returning a number, so it was necessary to rename it.	2007-12-06 00:53:51 +01:00
Willy Tarreau	019767b546	[BUILD] fix build on AIX due to recent log changes	2007-12-05 11:11:55 +01:00
Robert Tsai	81ae1953bf	[MEDIUM] add support for logging via a UNIX socket The code in haproxy-1.3.13.1 only supports syslogging to an internet address. The attached patch: - Adds support for syslogging to a UNIX domain socket (e.g., /dev/log). If the address field begins with '/' (absolute file path), then AF_UNIX is used to construct the socket. Otherwise, AF_INET is used. - Achieves clean single-source build on both Mac OS X and Linux (sockaddr_in.sin_len and sockaddr_un.sun_len field aren't always present). For handling sendto() failures in send_log(), it appears that the existing code is fine (no need to close/recreate socket) for both UDP and UNIX-domain syslog server. So I left things alone (did not close/recreate socket). Closing/recreating socket after each failure would also work, but would lead to increased amount of unnecessary socket creation/destruction if syslog is temporarily unavailable for some reason (especially for verbose loggers). Please consider this patch for inclusion into the upstream haproxy codebase.	2007-12-05 10:47:29 +01:00
Willy Tarreau	ddbb82ff47	[STATS] report the number of times each server was selected One user reported that an indicator was missing in the statistics: the number of times each server was selected by load balancing. It is in fact the total number of sessions assigned to a server by the load balancing algorithm. It should directly reflect the weight for "fair" algorithms such as round-robin, since it will not account for persistent connections. It should help a lot tuning each server's weight depending on the load it receives.	2007-12-05 10:34:49 +01:00
Willy Tarreau	5542af65dc	[MEDIUM] slowstart: ensure we don't start with a null weight Because of a divide, it was possible to have a null weight during a slowstart, which is pretty annoying, especially with a single server and a long slowstart. Also, fix the way we report the values in the stats page to avoid confusion.	2007-12-03 02:04:00 +01:00
Willy Tarreau	3259e3369e	[BUG] slowstart is in ms, not seconds	2007-12-03 01:51:45 +01:00
Willy Tarreau	d7c30f9a8c	[CLEANUP] grouped all timeouts in one structure All known timeouts in a proxy have been grouped into a "timeout" sub-structure.	2007-12-03 01:38:36 +01:00
Willy Tarreau	e219db7a46	[MEDIUM] introduce the "timeout" keyword A new "timeout" keyword replaces old "{con|cli|srv}timeout", and provides the ability to independently set the following timeouts : - client - tarpit - queue - connect - server - appsession Additionally, the "clitimeout", "contimeout" and "srvtimeout" values are supported but deprecated. No warning is emitted yet when they are used since the option is very new. Other timeouts should follow soon now.	2007-12-03 01:30:13 +01:00
Willy Tarreau	1fa3126ec4	[MEDIUM] introduce separation between contimeout, and tarpit + queue Now the connect timeout, tarpit timeout and queue timeout are distinct. In order to retain compatibility with older versions, if either queue or tarpit is left unset both in the proxy and in the default proxy, then it is inherited from the connect timeout as before.	2007-12-03 00:36:16 +01:00
Willy Tarreau	b3f32f5f8a	[MEDIUM] add support for time units in the configuration It is not always handy to manipulate large values expressed in milliseconds for timeouts. Also, some values are entered in seconds (such as the stats refresh interval). This patch adds support for time units. It knows about 'us', 'ms', 's', 'm', 'h', and 'd'. It automatically converts each value into the caller's expected unit. Unit-less values are still passed unchanged. The unit must be passed as a suffix to the number. For instance: clitimeout 15m If any character is not understood, an error is returned.	2007-12-02 22:15:14 +01:00
Willy Tarreau	a0d37b69ef	[MINOR] implement a time parsing function This new function accepts inputs in various default units, from the microsecond to the day. It detects suffixes after numbers and performs the appropriate conversions between the user's unit and the program's unit, considering a unit-less number in the default unit.	2007-12-02 22:00:35 +01:00
Willy Tarreau	2e74c3f202	[MEDIUM] restrict the set of allowed characters for identifiers In order to avoid issues in the future, we want to restrict the set of allowed characters for identifiers. Starting from now, only A-Z, a-z, 0-9, '-', '_', '.' and ':' will be allowed for a proxy, a server or an ACL name. A test file has been added to check the restriction.	2007-12-02 18:45:09 +01:00
Willy Tarreau	7b066db3bf	[MINOR] store the build options to report with -vv Sometimes it is useful to find out how a given binary version was built. The build compiler and options are now provided for this, and it's possible to get them with the -vv option.	2007-12-02 11:28:59 +01:00
Willy Tarreau	b698f0f4a2	[CLEANUP] fwrr: ensure that we never overflow in placements Now we can compute the max place depending on the number of servers, maximum weight and weight scale. The formula has been stored as a comment so that it's easy to choose between smooth weight ramp up and high number of servers. The default scale has been set to 16, which permits 4000 servers with a granularity of 6% in the worst case (weight=1).	2007-12-02 11:01:23 +01:00
Willy Tarreau	d1cd276456	[CLEANUP] remove a warning from gcc due to htons() in standard.c Due to the fact that htons is defined as a macro, it's dangerous to call it with auto-incremented arguments such as htons(f(++x)) : src/standard.c: In function 'url2sa': src/standard.c:291: warning: operation on 'curr' may be undefined The solution is simply to store the intermediate result and pass it to htons() at once.	2007-12-02 10:55:56 +01:00
Willy Tarreau	b80c230f41	[MEDIUM] add the "fail" condition to monitor requests Under certain circumstances, it is very useful to be able to fail some monitor requests. One specific case is when the number of servers in the backend falls below a certain level. The new "monitor fail" construct followed by either "if"/"unless" <condition> makes it possible to specify ACL-based conditions which will make the monitor return 503 instead of 200. Any number of conditions can be passed. Another use may be to limit the requests to local networks only.	2007-11-30 20:51:32 +01:00
Willy Tarreau	a9d3c1e6a3	[MEDIUM] add the "nbsrv" ACL verb The new "nbsrv" ACL verb matches the number of active servers in a backend. By default, it applies to the backend where it is declared, but optionally it can receive the name of another backend as an argument in parenthesis. It counts the number of enabled active servers first, then the number of enabled backup servers.	2007-11-30 20:48:53 +01:00
Willy Tarreau	c8f24f8ec1	[BUILD] fix 2 minor issues on AIX AIX does not know about MSG_DONTWAIT. Fortunately, nearly all sockets are already set to O_NONBLOCK, so it's not even required to change the code. It was only necessary to add this fcntl to the log socket which lacked it. The MSG_DONTWAIT value has been defined to zero when unset in order to make the code cleaner and more portable. Also, on AIX, "hz" is defined, which causes a problem with one function parameter in time.c. It's enough to rename the parameter there. Last, fix a missing #include <string.h> in proxy.c.	2007-11-30 18:38:35 +01:00
Willy Tarreau	4bab24d955	[MINOR] stats: report the server warm up status in a "throttle" column A new "throttle" column has been added to HTML and RAW stats to indicate in percent, the level of throttling due to server warmup. The column is empty at 100%.	2007-11-30 18:16:29 +01:00
Willy Tarreau	9909fc13f1	[MEDIUM] implement the slowstart parameter for servers The new 'slowstart' parameter for a server accepts a value in milliseconds which indicates after how long a server which has just come back up will run at full speed. The speed grows linearly from 0 to 100% during this time. The limitation applies to two parameters : - maxconn: the number of connections accepted by the server will grow from 1 to 100% of the usual dynamic limit defined by (minconn,maxconn,fullconn). - weight: when the backend uses a dynamic weighted algorithm, the weight grows linearly from 1 to 100%. In this case, the weight is updated at every health-check. For this reason, it is important that the 'inter' parameter is smaller than the 'slowstart', in order to maximize the number of steps. The slowstart never applies when haproxy starts, otherwise it would cause trouble to running servers. It only applies when a server has been previously seen as failed.	2007-11-30 17:42:05 +01:00
Willy Tarreau	df36614b97	[CLEANUP] use distinct bits per load-balancing algorithm type It's useful to be able to check against an LB algorithm type by testing just one bit.	2007-11-30 16:23:20 +01:00
Willy Tarreau	8293658170	[MINOR] http-check disable-on-404 is not limited to HTTP mode This option is for health-checks, do not limit it to HTTP proxies.	2007-11-30 15:20:09 +01:00
Willy Tarreau	2ea81930e7	[MEDIUM] report disabled servers as "NOLB" when they are still UP It's important to be able to distinguish between servers which are UP and those which are UP but disabled via a 404 response. For this reason, the status entries report "NOLB" instead of "UP", and the HTML page uses darker colors. As a complement, write "DOWN" in bold red on the backend if it has no server left for load balancing.	2007-11-30 12:04:38 +01:00
Willy Tarreau	0ebe106ef1	[MEDIUM] secure the calling conditions of ->set_server_status_{up,down} It's not always obvious for the callers of set_server_status_{up,down} whether the new state really is up or down. Some flags as well as the effective weight have to be considered. Let's ensure that those functions perform the necessary check themselves so that if the state transition cannot be performed, at least everything is updated as required.	2007-11-30 11:11:02 +01:00
Willy Tarreau	48494c0c5c	[MEDIUM] implement "http-check disable-on-404" for graceful shutdown When an HTTP server returns "404 not found", it indicates that at least part of it is still running. For this reason, it can be convenient for application administrators to be able to consider code 404 as valid, but for a server which does not want to participate in load balancing anymore. This is useful to seamlessly exclude a server from a farm without acting on the load balancer. For instance, let's consider that haproxy checks for the "/alive" file. To enable load balancing on a server, the admin would simply do : # touch /var/www/alive And to disable the server, he would simply do : # rm /var/www/alive Another immediate gain from doing this is that it is now possible to send NOTICE messages instead of ALERT messages when a server is first disabled, then goes down. This provides a graceful shutdown method. To enable this behaviour, specify "http-check disable-on-404" in the backend.	2007-11-30 10:41:39 +01:00
Willy Tarreau	c7dd71ae5b	[MEDIUM] change server check result to a bit field A server check currently returns either -1 or 1. This is not very convenient to enhance the health-checks system. Let's use flags instead.	2007-11-30 08:33:21 +01:00
Alexandre Cassen	5eb1a9033a	[MEDIUM] New option http_proxy Hello, You will find attached an updated release of previously submitted patch. It polish some part and extend ACL engine to match IP and PORT parsed in HTTP request. (and take care of comments made by Willy ! ;)) Best regards, Alexandre	2007-11-29 15:43:32 +01:00
Willy Tarreau	3168223a7b	[MINOR] move the load balancing algorithm to be->lbprm.algo The number of possible options for a proxy has already reached 32, which is the current limit due to the fact that they are each represented as a bit in a 32-bit word. It's possible to move the load balancing algorithms to another place. It will also save some space for future algorithms.	2007-11-29 15:38:04 +01:00
Willy Tarreau	b625a085d8	[MAJOR] implement the Fast Weighted Round Robin (FWRR) algo This round robin algorithm was written from trees, so that we do not have to recompute any table when changing server weights. This solution allows on-the-fly weight adjustments with immediate effect on the load distribution. There is still a limitation due to 32-bit computations, to about 2000 servers at full scale (weight 255), or more servers with lower weights. Basically, sum(srv.weight)*4096 must be below 2^31. Test configurations and an example program used to develop the tree will be added next. Many changes have been brought to the weights computations and variables in order to accommodate the possibility of a server running but disabled from load balancing due to a null weight.	2007-11-28 14:23:17 +01:00
Willy Tarreau	5dc2fa660c	[MINOR] add a weight divisor to the struct proxy Under some circumstances, it will be useful to be able to have a server's effective weight bigger than the user weight, and this is particularly true for dynamic weight-based algorithms. In order to support this, we add a "wdiv" member to the lbprm structure which will always be used to divide the weights before reporting them.	2007-11-28 14:23:13 +01:00
Willy Tarreau	2069704492	[MEDIUM] differentiate between generic LB params and map-specific ones Since the introduction of server weights, all load balancing algorithms relied on a pre-computed map. Incidentally, quite a bunch of map-specific parameters were used at random places in order to get the number of servers or their total weight. It was not architecturally acceptable that optimizations for the map computation had impact on external parts. For instance, during this cleanup it was found that a backend weight was seen as 1 when only the first backup server is used, whatever its weight. This cleanup consists in differentiating between LB-generic parameters, such as total weights, number of servers, etc... and map-specific ones. The struct proxy has been enhanced in order to make it easier to later support other algorithms. The recount_servers() function now also updates generic values such as total weights so that it's not needed anymore to call recalc_server_map() when weights are needed. This permitted to simplify some code which does not need to know about map internals anymore.	2007-11-28 14:23:10 +01:00
Willy Tarreau	e6d2e4dbdf	[MINOR] merge ebtree version 3.0 Version 3.0 of ebtree has been merged in but is not used yet.	2007-11-28 14:20:44 +01:00
Willy Tarreau	30e7101137	[OPTIM] small optimization on session_process_counters() It was possible to slightly reduce the size and the number of operations in session_process_counters(). Two 64 bit comparisons were removed, reducing the code by 98 bytes on x86 due to the lack of registers. The net observed performance gain is almost 2%, which cannot be attributed to those optimizations, but more likely to induced changes in code alignment in other functions.	2007-11-26 20:22:47 +01:00
Krzysztof Piotr Oledzki	583bc96606	[MEDIUM] continuous statistics By default, counters used for statistics calculation are incremented only when a session finishes. It works quite well when serving small objects, but with big ones (for example large images or archives) or with A/V streaming, a graph generated from haproxy counters looks like a hedgehog. This patch implements a contstats (continuous statistics) option. When set, counters get incremented continuously, during a whole session. Recounting touches a hotpath directly so it is not enabled by default, as it has small performance impact (~0.5%).	2007-11-26 20:21:47 +01:00
Willy Tarreau	5df518788d	[BUG] fix missing parenthesis in check_response_for_cacheability Parenthesis were missed when code was moved to this function. This results in non-cacheable transactions not being ignored.	2007-11-26 20:16:53 +01:00
Willy Tarreau	1fbe4932fc	[BUG] missing header names in raw stats output qlimit, pid, iid and sid were missing from the raw stats output	2007-11-26 16:15:35 +01:00
Willy Tarreau	2815664277	[BUG] relative_pid was not initialized	2007-11-26 16:13:36 +01:00
Willy Tarreau	dcd4771b3d	[MINOR] stats: report numerical process ID, proxy ID and server ID It is very convenient for SNMP monitoring to have unique process ID, proxy ID and server ID. Those have been added to the CSV outputs. The numbers start at 1. 0 is reserved. For servers, 0 means that the reported name is not a server name but half a proxy (FRONTEND/BACKEND). A remaining hidden "-" in the CSV output has been eliminated too.	2007-11-04 23:35:08 +01:00
Willy Tarreau	e6b989479c	[MAJOR] create proto_tcp and move initialization of proxy listeners Proxy listeners were very special and not very easy to manipulate. A proto_tcp file has been created with all that is required to manage TCPv4/TCPv6 as raw protocols, and provide generic listeners. The code of start_proxies() and maintain_proxies() now looks less like spaghetti. Also, event_accept will need a serious lifting in order to use more of the information provided by the listener.	2007-11-04 22:42:49 +01:00
Willy Tarreau	3acf8c3da8	[MINOR] add a generic unbind_all_listeners() primitive Most protocols will be able to share a single unbind_all_listeners() primitive. Provide it in protocols.c.	2007-11-04 22:42:49 +01:00
Willy Tarreau	1a64d16720	[MINOR] add a generic delete_listener() primitive Most protocols will be able to share a single delete_listener() primitive. Provide it in protocols.c, and remove the specific version from proto_uxst.	2007-11-04 22:42:49 +01:00
Willy Tarreau	b648d6383b	[MINOR] add a generic unbind_listener() primitive Most protocols will be able to share a single unbind_listener() primitive. Provide it in protocols.c.	2007-11-04 22:42:49 +01:00
Willy Tarreau	8eebe5ea40	[MEDIUM] unbind_listener() must use fd_delete() and not close() It is important that unbind_listener() calls fd_delete() to remove a file descriptor, because only this one can update the fdtab and the maxfd entries.	2007-11-04 22:42:48 +01:00
Willy Tarreau	dabf2e2647	[MAJOR] added a new state to listeners There was a missing state for listeners, when they are not listening but still attached to the protocol. The LI_ASSIGNED state was added for this purpose. This permitted to clean up the assignment/release workflow quite a bit. Generic enable/enable_all/disable/disable_all primitives were added, and a disable_all entry was added to the struct protocol.	2007-11-04 22:42:48 +01:00
Willy Tarreau	6fb42e0694	[MINOR] add an options field to the listeners	2007-11-04 22:42:48 +01:00
Willy Tarreau	8ced9a4b91	[MEDIUM] simplify error path in event_accept() The error path in event_accept() was complicated by many code duplications. Use the classical unrolling with the gotos. This fix alone reduced the code by 2.5 kB.	2007-11-04 22:41:52 +01:00
Willy Tarreau	396d2c6782	[MINOR] avoid calling some layer7 functions if not needed Small optimization: in some cases, it's not interesting to call functions which are dedicated to checking the cache headers or cookies. Avoid calling them when not necessary.	2007-11-04 22:41:49 +01:00
Willy Tarreau	816eb54e9b	[MINOR] adjust error messages about conflicting proxies It's not easy to report useful information to help the user quickly fix a configuration. This patch : - removes the word "listener" in favor of "proxy" as it has been used since the beginning ; - ensures that the same function (hence the same words) will be used to report capabilities of a proxy being declared and an existing proxy ; - avoid the term "conflicting capabilities" in favor of "overlapping capabilities" which is more exact. - just report that the same name is reused in case of warnings	2007-11-04 08:14:25 +01:00
Krzysztof Piotr Oledzki	6eb730ded9	[MEDIUM] Implement and use generic findproxy and relax duplicated proxy check This patch: - adds proxy_mode_str() similar to proxy_type_str() - adds a generic findproxy function used with default_backend/setbe/use_backend - rewrites default_backend/setbe/use_backend to use introduced findproxy() - relaxes duplicated proxy check - changes capabilities displaying from "%X" to "%s" with a call to proxy_type_str()	2007-11-04 08:14:20 +01:00
Willy Tarreau	a7e76142a1	[MEDIUM] make default_backend work in TCP mode too The default_backend did not work in TCP mode since there was no header state to assign the backend. This causes much trouble when configs are created by copy-paste. The solution was to fix the way the backend is assigned upon accept(). A wrong contimeout assignment was fixed too.	2007-11-03 14:28:39 +01:00
Willy Tarreau	0173280bfa	[MEDIUM] introduce the "url_param" balance method Some applications do not have a strict persistence requirement, yet it is still desirable for performance considerations, due to local caches on the servers. For some reasons, there are some applications which cannot rely on cookies, and for which the last resort is to use a parameter passed in the URL. The new 'url_param' balance method is there to solve this issue. It accepts a parameter name which is looked up from the URL and which is then hashed to select a server. If the parameter is not found, then the round robin algorithm is used in order to provide a normal load balancing across the servers for the first requests. It would have been possible to use a source IP hash instead, but since such applications are generally buried behind multiple levels of reverse-proxies, it would not provide a good balance. The doc has been updated, and two regression testing configurations have been added.	2007-11-01 23:05:09 +01:00
Willy Tarreau	a0cbda61a7	[MINOR] externalize the "balance" option parser to backend.c A new function "backend_parse_balance" has been created in backend.c, which is dedicated to the parsing of the "balance" keyword. It will provide easier methods for adding new algorithms.	2007-11-01 23:04:55 +01:00
Willy Tarreau	1a20a5d1b2	[CLEANUP] group PR_O_BALANCE_* bits into a checkable value In preparation for newer balance algorithms, group the sparse PR_O_BALANCE_* values into layer4 and layer7-based algorithms. This will ease addition of newer algorithms.	2007-11-01 23:01:49 +01:00
Krzysztof Piotr Oledzki	e6bbd74690	[MEDIUM] Handle long lines properly Currently, there is a hidden line length limit in the haproxy, set to 256-1 chars. With large acls (for example many hdr(host) matches) it may be not enough and which is even worse, error message may be totally confusing as everything above this limit is treated as a next line: echo -ne "frontend aqq 1.2.3.4:80\nmode http\nacl e hdr(host) -i X X X X X X X www.xx.example.com stats\n"| sed s/X/www.some-host-name.example.com/g > ha.cfg && haproxy -c -f ./ha.cfg [WARNING] 300/163906 (11342) : parsing [./ha.cfg:4] : 'stats' ignored because frontend 'aqq' has no backend capability. Recently I hit a similar problem and it took me a while to find why requests for "stats" are not handled properly. This patch: - makes the limit configurable (LINESIZE) - increases default line length limit from 256 to 2048 - increases MAX_LINE_ARGS from 40 to 64 - fixes hidden assignment in fgets() - moves arg/end/args/line inside the loop, making code auditing easier - adds a check that shows error if the limit is reached - changes "line++ = 0;" to "line++ = '\0';" (cosmetics) With this patch, when LINESIZE is defined to 256, above example produces: [ALERT] 300/164724 (27364) : parsing [/tmp/ha.cfg:3]: line too long, limit: 255. [ALERT] 300/164724 (27364) : Error reading configuration file : /tmp/ha.cfg	2007-11-01 23:00:51 +01:00
Krzysztof Oledzki	0259419f41	[PATCH] use backends only with use_backend directive Hello, As it is possible to use the same name for two proxies, make sure that use_backed & friends does not match wrong proxy when used with use_backend/ default_backend/setbe. For example, without this patch, when there is a backend and frontend with the same name (first backend and then frontend trying to use specific backend), the application will likely try to use frontend instead of backend, complaining loudly about a loop. Best regards, Krzysztof Oledzki	2007-11-01 23:00:46 +01:00
Willy Tarreau	106bf274c4	[MINOR] add socket address length to the protocols The protocol struct can be more useful if it also provides address lengths. Add sock_addrlen, as used by bind(), as well as l3_addrlen for hashes.	2007-10-28 12:09:45 +01:00
Willy Tarreau	d740babd0e	[MINOR] move error codes to common/errors.h It's useful to be able to share error codes between C files, so move the codes currently only used in protocols to a generic file.	2007-10-28 11:14:07 +01:00
Elijah Epifanov	acafc5f88c	[MEDIUM] add support for "maxqueue" to limit server queue overload This patch adds the "maxqueue" parameter to the server. This allows new sessions to be immediately rebalanced when the server's queue is filled. It's useful when session stickiness is just a performance boost (even a huge one) but not a requirement. This should only be used if session affinity isn't a hard functional requirement but provides performance boost by keeping server-local caches hot and compact). Absence of 'maxqueue' option means unlimited queue. When queue gets filled up to 'maxqueue' client session is moved from server-local queue to a global one.	2007-10-25 20:15:38 +02:00
Willy Tarreau	91092e5739	[MINOR] provide easy-to-use limit_r and LIM2A* macros This is in fact the same as ultoa() except that it's possible to pass the string to be returned in case the value is NULL. This is useful to report limits in printf calls.	2007-10-25 16:59:40 +02:00
Willy Tarreau	72d759c9c1	[MINOR] provide easier-to-use ultoa_* functions Current ultoa() function is limited to one use per expression or function call. Sometimes this is limiting. Change this in favor of an array of 10 return values and shorter macros U2A0..U2A9 which respectively call the function with the 10 different buffers.	2007-10-25 16:59:40 +02:00
Willy Tarreau	fe94460d53	[BUG] fix calls to localtime() localtime() was called with pointers to tv_sec, which is time_t on some platforms and long on others. A problem was encountered on Sparc64 under OpenBSD where tv_sec is long (64 bits) and time_t is 32 bits. Since this architecture is big-endian, it exhibited the bug because localtime() always worked with the high part of the value which is always zero. This problem was identified and debugged by Thierry Fournier. The correct solution is to pass the date by value and not by pointer, through an intermediate function. The use of localtime_r() instead of localtime() also made it possible to get rid of the first call to localtime() since it does not need to allocate memory anymore.	2007-10-25 10:34:16 +02:00
Willy Tarreau	3f0c976135	[BUG] fix error checking in strl2ic/strl2uic() The strl2ic() and strl2uic() primitives used to convert string to integers could return 10 times the value read if they stopped on non-digit because of a mis-placed loop exit.	2007-10-25 09:42:24 +02:00
Krzysztof Oledzki	85130941e7	[MEDIUM] stats: report server and backend cumulated downtime Hello, This patch implements new statistics for SLA calculation by adding new field 'Dwntime' with total down time since restart (both HTTP/CSV) and extending status field (HTTP) or inserting a new one (CSV) with time showing how long each server/backend is in a current state. Additionally, down transitions are also calculated and displayed for backends, so it is possible to know how many times selected backend was down, generating "No server is available to handle this request." error. New information is presented in two different ways: - for HTTP: a "human readable form", one of "100000d 23h", "23h 59m" or "59m 59s" - for CSV: seconds I believe that seconds resolution is enough. As there are more columns in the status page I decided to shrink some names to make more space: - Weight -> Wght - Check -> Chk - Down -> Dwn Making described changes I also made some improvements and fixed some small bugs: - don't increment s->health above 's->rise + s->fall - 1'. Previously it was incremented and then (re)set to 's->rise + s->fall - 1'. - do not set server down if it is down already - do not set server up if it is up already - fix colspan in multiple places (mostly introduced by my previous patch) - add missing "status" header to CSV - fix order of retries/redispatches in server (CSV) - s/Tthen/Then/ - s/server/backend/ in DATA_ST_PX_BE (dumpstats.c) Changes from previous version: - deal with negative time intervals - don't rely on s->state (SRV_RUNNING) - little reworked human_time + compacted format (no spaces). If needed it can be used in the future for other purposes by optionally making "cnt" as an argument - leave set_server_down mostly unchanged - only little reworked "process_chk: 9" - additional fields in CSV are appended to the right - fix "SEC" macro - named arguments (human_time, be_downtime, srv_downtime) Hope it is OK. If there are only cosmetic changes needed please feel free to correct it, however if there are some bigger changes required I would like to discuss it first or at least to know what exactly was changed especially since I already put this patch into my production server. :) Thank you, Best regards, Krzysztof Oledzki	2007-10-22 21:36:23 +02:00
Krzysztof Oledzki	365d1cd84c	[PATCH]: Check for duplicated conflicting proxies Currently haproxy accepts a config with duplicated proxies (listen/frontend/backend/ruleset). This patch fixes this, so the application will complain when there is an error. With this modification it is still possible to use the same name for two proxies (for example frontend&backend) as long as there is no conflict: listen backend frontend ruleset listen - - - - backend - - OK - frontend - OK - - ruleset - - - - Best regards, Krzysztof Oledzki	2007-10-21 10:16:27 +02:00
Willy Tarreau	d4e1b5ffa5	[MINOR] stats: update the width of the table to 22 columns Unfortunately, we forgot to increase the table from 20 to 22 cols when we added retries and redisp.	2007-10-19 06:23:19 +02:00
Krzysztof Oledzki	1cf36ba3ae	[MEDIUM] stats: count server retries and redispatches It is important to know how your installation performs. Haproxy masks connection errors, which is extremely good for a client but it is bad for an administrator (except people believing that "ignorance is bliss"). Attached patch adds retries and redispatches counters, so now haproxy: 1. For server: - counts retried connections (masked or not) 2. For backends: - counts retried connections (masked or not) that happened to a slave server - counts redispatched connections - does not count successfully redispatched connections as backend errors. Errors are increased only when client does not get a valid response, in other words: with failed redispatch or when this function is not enabled. 3. For statistics: - display Retr (retries) and Redis (redispatches) as a "Warning" information.	2007-10-18 19:12:30 +02:00
Willy Tarreau	9edd161554	[MINOR] use nolinger on health-checks if backend is set to nolinger If the administrator finds it useful to disable lingering on the backend, let's disable lingering on health-checks too.	2007-10-18 18:07:48 +02:00
Willy Tarreau	1388a3a8e8	[BUG] scope "." must match the backend and not the frontend	2007-10-18 16:38:37 +02:00
Willy Tarreau	10ae548052	[BUG] fix off-by-one in path length in destroy_uxst_socket() An off-by-one error was left in the computation of the unix socket path.	2007-10-18 16:15:52 +02:00
Willy Tarreau	03f6d67c48	[BUILD] fix build of global section with older gcc versions The way the global section was initialized was not correct, which made older versions of GCC complain.	2007-10-18 15:15:57 +02:00
Willy Tarreau	fbee71331d	[MEDIUM] introduce the "stats" keyword in global section Removed old unused MODE_LOG and MODE_STATS, and replaced the "stats" keyword in the global section. The new "stats" keyword in the global section is used to create a UNIX socket on which the statistics will be accessed. The client must issue a "show stat\n" command in order to get a CSV-formatted output similar to the output on the HTTP socket in CSV mode.	2007-10-18 14:16:11 +02:00
Willy Tarreau	3e76e728ce	[MEDIUM] implement the statistics output on a unix socket A unix socket can now access the statistics. It currently only recognizes the "show stat\n" command at the beginning of the input, then returns the statistics in CSV format.	2007-10-18 14:13:13 +02:00
Willy Tarreau	5031e6adf5	[MINOR] add a link to the CSV export on the stats page.	2007-10-18 14:12:30 +02:00
Willy Tarreau	55bb8450c0	[MEDIUM] implement the CSV output for the statistics It is now possible to get CSV output from the statistics by simply appending ";csv" to the HTTP request sent to get the stats. The fields keep the same ordering as in the HTML page, and a field "pxname" has been prepended at the beginning of the line.	2007-10-18 14:12:28 +02:00
Willy Tarreau	9186126e1c	[MEDIUM] moved stats and buffer generic functions to new files Neither the primitives used to write data to a buffer, nor the stats dump functions are HTTP-specific anymore. Move them to dedicated files	2007-10-18 14:12:21 +02:00
Willy Tarreau	e6ad2b165e	[MINOR] make it possible to set unix socket permissions Under most systems, it is possible to set permissions on unix sockets. This has been added to the listeners and to unix sockets.	2007-10-18 14:11:55 +02:00
Willy Tarreau	92fb9836ee	[MAJOR] implemented client-side support for PF_UNIX sockets A new file, proto_uxst.c, implements support of PF_UNIX sockets of type SOCK_STREAM. It relies on generic stream_sock_read/write and uses its own accept primitive which also tries to be generic. Right now it only implements an echo service, with a view to general support for stats dumping via unix socket. The echo code is more of a proof of concept than useful code.	2007-10-18 14:11:15 +02:00
Willy Tarreau	dd81598553	[MAJOR] added generic protocol support A new generic protocol mechanism has been added. It provides an easy method to implement new protocols with different listeners (eg: unix sockets). The listeners are automatically started at the right moment and enabled after the possible fork().	2007-10-18 14:11:12 +02:00
Willy Tarreau	d680371064	[BUG] remove condition for exit() under fork() failure This must come from a copy-paste typo: in the unlikely event that fork() would fail, the parent process would only exit(1) if there were old pids. That's nonsense.	2007-10-16 07:44:56 +02:00
Willy Tarreau	d95dcb51a8	[BUG] fix wrong timeout computation in event_accept() In case the incoming socket is set for write and not for read (very unlikely, except in HEALTH mode), the timeout may remain eternity due to a copy-paste typo.	2007-10-16 07:41:52 +02:00
Willy Tarreau	177a16a8d1	[BUG] fix segfault on exit in new appsession code The new appsession code didn't like it when appsession_hash_destroy() was called with an empty hash table. Simply add the check.	2007-10-15 20:08:16 +02:00
Willy Tarreau	f223cc0b5c	[MEDIUM] fixed call to chroot() during startup It wasn't very wise to chroot() early during the startup. Also, the exit() was missing if the chroot() failed.	2007-10-15 18:57:08 +02:00
Willy Tarreau	e94ebd0e37	[MEDIUM] moved the sockaddr pointer to the fdtab structure The stream_sock_* functions had to know about sessions just in order to get the server's address for a connect() operation. This is not desirable, particularly for non-IP protocols (eg: PF_UNIX). Put a pointer to the peer's sockaddr_storage or sockaddr address in the fdtab structure so that we never need to look further. With this small change, the stream_sock.c file is now 100% protocol-independent.	2007-10-15 17:14:01 +02:00
Krzysztof Oledzki	d9db9274fe	[MINOR] report haproxy's version by default on the stats page For people who manage many haproxies, it is sometimes convenient to be informed of their version. This patch adds this, with the option to disable this report by specifying "stats hide-version". Also, the feature may be permanently disabled by setting the STATS_VERSION_STRING to "" (empty string), or the format can simply be adjusted.	2007-10-15 10:05:11 +02:00
Willy Tarreau	44ec0f003d	[MINOR] spread checks also when the server is OK. Initial patch only managed to spread the checks when the checks failed. The randomization code needs to be added also in the path where the server is going fine.	2007-10-15 09:33:17 +02:00
Willy Tarreau	2c43a1e2f0	[MEDIUM] only consider slow checks when looking for the common interval When one server in one backend has a very low check interval, it imposes its value as the minimal interval, causing all other servers to start their checks close to each other, thus partially voiding the benefits of the spread checks. The solution consists in ignoring intervals lower than a given value (SRV_CHK_INTER_THRES = 1000 ms) when computing the minimal interval, and then assigning them a start date relative to their own interval and not the global one. With this change, the checks distribution clearly looks better.	2007-10-15 09:33:14 +02:00
Krzysztof Oledzki	b304dc7fd7	[MEDIUM] Spread health checks even more When one server appears at the same position in multiple backends, it receives all the checks from all the backends exactly at the same time because the health-checks are only spread within a backend but not globally. Attached patch implements per-server start delay in a different way. Checks are now spread globally - not locally to one backend. It also makes them start faster - IMHO there is no need to add a 'server->inter' when calculating first execution. Calculations were moved from cfgparse.c to checks.c. There is a new function start_checks() and now it is not called when haproxy is started in MODE_CHECK. With this patch it is also possible to set a global 'spread-checks' parameter. It takes a percentage value (1..50, probably something near 5..10 is a good idea) so haproxy adds or removes that many percent to the original interval after each check. My test shows that with 18 backends, 54 servers total and 10000ms/5% it takes about 45m to mix them completely. I decided to use rand/srand pseudo-random number generator. I am aware it is not recommended for a good randomness but a) we do not need a good random generator here b) it is probably the most portable one.	2007-10-15 09:33:10 +02:00
Alexandre Cassen	87ea548313	[MINOR] add the "nolinger" option to disable data lingering The following patch will give the ability to tweak socket linger mode. You can use this option with "option nolinger" inside frontend or backend configuration declaration. This will help in environments where lots of FIN_WAIT sockets are encountered.	2007-10-15 09:33:06 +02:00
Krzysztof Oledzki	9198ab5e7c	[MEDIUM] do not add a cache-control: header when on non-cacheable responses I noticed that haproxy, with "cookie (...) nocache" option, always adds "Cache-control: private" at the end of a header list received from this server: Cache-Control: no-cache (...) Set-Cookie: SERVERID=s6; path=/ Cache-control: private or: Set-Cookie: ASPSESSIONIDCSRCTSSB=HCCBGGACGBHDHMMKIOILPHNG; path=/ Cache-control: private Set-Cookie: SERVERID=s5; path=/ Cache-control: private It may be just redundant (two "Cache-control: private"), but sometimes it may be quite confusing as we may end up with two different, more and less restrictive directives (no-cache & private) and even quite conflicting directives (eg. public & private). So, I added and rearranged some code, so now haproxy adds a "Cache-control: private" header only when there is not already the same (private) or a more restrictive (no-cache) one. It was done in three steps: 1. Use check_response_for_cacheability to check if response is not cacheable. I simply moved this call before http_header_add_tail2. 2. Use TX_CACHEABLE (not TX_CACHE_COOK - apache <= 1.3.26) to check if we need to add a Cache-control header. If we add it, clear TX_CACHEABLE and TX_CACHE_COOK. 3. Check cacheability not only with PR_O_CHK_CACHE but also with PR_O_COOK_NOC, so: - unlikely(t->be->options & PR_O_CHK_CACHE)) + (t->be->options & (PR_O_CHK_CACHE|PR_O_COOK_NOC))) txn->flags |= TX_CACHEABLE | TX_CACHE_COOK; I removed this unlikely since I believe that now it is not so unlikely. The patch is definitely not perfect, proxy should probably also remove "Cache-control: public". Unfortunately, I do not know the code well enough to do it myself, yet. ;) Anyway, I think that even now, it should be very useful.	2007-10-15 09:33:02 +02:00
Krzysztof Oledzki	6b3f8b4b8f	[MINOR] prevent the system from sending an RST when closing health-checks On Sat, 22 Sep 2007, Willy Tarreau wrote: > On Sun, Sep 23, 2007 at 03:23:38AM +0200, Krzysztof Oledzki wrote: > > I noticed that with httpchk, haproxy generates TCP RST at end of a check. > > IMHO, it would be more polite to send FIN to a server, especially that > > each TCP RST found by a tcpdump makes me concerned that something is > > wrong, as it is hard to distinguish between a RST from a httpchk and from > > a normal request, forwarded for a client. > > I have also noticed it very recently. In fact, it's never the > application (here haproxy) which decides to send an RST, it's the > system. It does so because the server returns data on a terminated > socket. I guess it's because the health-check code does not read much > of the response. In fact, we just need to read enough to process common > responses. If people are dumb enough to check with something like "GET > /image.iso", they should expect to get an RST after a few kbytes > instead of reading the whole file! Right, that was easy. Attached patch changed what you described. Now haproxy finishes http checks with FIN.	2007-10-15 09:32:58 +02:00
Krzysztof Oledzki	56f1e8b368	[BUG] fix double-free during clean exit This patch fixes a nasty bug reported by both glibc and valgrind, which leads to a problem where haproxy does not exit when a new instance starts up (-sf/-st). ==9299== Invalid free() / delete / delete[] ==9299== at 0x401D095: free (in /usr/lib/valgrind/x86-linux/vgpreload_memcheck.so) ==9299== by 0x804A377: deinit (haproxy.c:721) ==9299== by 0x804A883: main (haproxy.c:1014) ==9299== Address 0x41859E0 is 0 bytes inside a block of size 21 free'd ==9299== at 0x401D095: free (in /usr/lib/valgrind/x86-linux/vgpreload_memcheck.so) ==9299== by 0x804A84B: main (haproxy.c:985) ==9299== 6542 open("/dev/tty", O_RDWR|O_NONBLOCK|O_NOCTTY) = -1 ENOENT (No such file or directory) 6542 writev(2, [{"*** glibc detected *** ", 23}, {"corrupted double-linked list", 28}, {": 0x", 4}, {"6ff91878", 8}, {" ***\n", 5}], 5) = -1 EBADF (Bad file descriptor) I found this bug trying to find why, after one week with many restarts, I finished with >100 haproxy processes running. ;)	2007-10-15 09:32:54 +02:00
Willy Tarreau	6e4261ee2f	[MAJOR] timeouts and retries could be ignored when switching backend When switching from a frontend to a backend, the "retries" parameter was not kept, resulting in the impossibility to reconnect after the first connection failure. This problem was reported and analyzed by Krzysztof Oledzki. While fixing the code, it appeared that some of the backend's timeouts were not updated in the session when using "use_backend" or "default_backend". It seems this had no impact but just in case, it's better to set them as they should have been.	2007-10-15 09:32:19 +02:00
Willy Tarreau	5fcc8f1ed9	[MINOR] fix the SIGHUP message not to alert on server-less proxies The SIGHUP message was designed long before it was possible to have no server in a proxy. Remove the alert in case there's no server.	2007-10-15 09:32:15 +02:00
Willy Tarreau	fdd0f5568a	[MEDIUM] pre-initialize timeouts to infinity, not zero Since the timers have been changed, the timeouts for the default instance have not been adjusted. This results in unspecified timeouts becoming zero instead of infinite.	2007-10-15 09:32:11 +02:00
Willy Tarreau	3d08953ce0	[MINOR] set the log socket receive window to zero bytes The syslog UDP socket may receive data, which is not cool because those data accumulate in the system buffers up to the receive socket buffer size. To prevent this, we set the receive window to zero and try to shutdown(SHUT_RD) the socket.	2007-10-15 09:32:07 +02:00
Willy Tarreau	193cf93ec0	[MEDIUM] fix configuration sanity checks for TCP listeners A long chain of if/else prevented many sanity checks from being performed on TCP listeners, resulting in dangerous configs being accepted. Removed the offending 'else'.	2007-10-15 09:32:02 +02:00
Willy Tarreau	51041c737c	[MAJOR] remove files distributed under an obscure license src/chtbl.c, src/hashpjw.c and src/list.c are distributed under an obscure license. While Aleks and I believe that this license is OK for haproxy, other people think it is not compatible with the GPL. Whether it is or not is not the problem. The fact that it raises a doubt is sufficient for this problem to be addressed. Arnaud Cornet rewrote the unclear parts with clean GPLv2 and LGPL code. The hash algorithm has changed too and the code has been slightly simplified in the process. A lot of care has been taken in order to respect the original API as much as possible, including the LGPL for the exportable parts. The new code has not been thoroughly tested but it looks OK now.	2007-09-09 21:56:53 +02:00
Willy Tarreau	4eac209555	[MAJOR] spec I/O: fix allocations of spec entries for an FD Under some circumstances, it was possible with speculative I/O to reallocate multiple entries for the same FD if an fd_{set,clr,set} or fd_{clr,set,clr} sequence was performed before a schedule. Fix this by keeping an allocation flag for each fd.	2007-09-09 21:09:29 +02:00
Willy Tarreau	e7150cdcfa	[MEDIUM] stats page: added links for 'refresh' and 'hide down' The stats page now supports an option to hide servers which are DOWN and to enable/disable automatic refresh. It is also possible to ask for an immediate refresh.	2007-09-09 21:09:29 +02:00
Willy Tarreau	dceaa0894b	[MEDIUM] ensure we never overflow in chunk_printf() The result of the vsnprintf() called in chunk_printf() must be checked, and should be added only if lower than the requested size. We simply return zero if we cannot write the chunk.	2007-09-09 21:09:28 +02:00
Willy Tarreau	bbd42123e1	[MINOR] add support for "stats refresh <interval>" Sometimes it may be desirable to automatically refresh the stats page. Most browsers support the "Refresh:" header with an interval in seconds. Specifying "stats refresh xxx" will automatically add this header.	2007-09-09 21:09:28 +02:00
Willy Tarreau	4b946c8564	[MINOR] fix backend's weight in the stats page. The GCD used when computing the servers' weights causes the total weight of the backend to appear lower than expected because it is divided by the GCD. Easy solution consists in recomputing the GCD from the first server and apply it to the global weight.	2007-09-09 21:09:28 +02:00
Willy Tarreau	5af3a694f5	[MEDIUM] improve behaviour with large number of servers per proxy When a very large number of servers is configured (thousands), shutting down many of them at once could lead to large number of calls to recalc_server_map() which already takes some time. This would result in an O(N^3) computation time, leading to noticeable pauses on slow embedded CPUs on test platforms. Instead, mark the map as dirty and recalc it only when needed.	2007-09-09 21:09:28 +02:00
Willy Tarreau	632f5a7b6f	[MEDIUM] fade out memory usage when stopping proxies Now we try to free as many pools as possible when a proxy is stopping. The reason is that we want to ease the process replacement when applying a new configuration, without keeping too much unused memory allocated.	2007-07-11 10:42:35 +02:00
Willy Tarreau	8f8e645066	[CLEANUP] shut warnings 'is*' macros from ctype.h on solaris Solaris visibly uses an array for is*, which returns warnings about the use of signed chars as indexes. Good opportunity to put casts everywhere.	2007-06-17 21:51:38 +02:00
Willy Tarreau	a590983fe5	[MEDIUM] acl: added the TRUE and FALSE ACLs. Those ACLs are sometimes useful for troubleshooting. Two ACL subjects "always_true" and "always_false" have been added too. They return what their subject says for every pattern. Also, acl_match_pst() has been removed.	2007-06-17 20:40:25 +02:00
Willy Tarreau	55ea7579d7	[MAJOR] added the 'use_backend' keyword for full content-switching The new "use_backend" keyword permits full content switching by the use of ACLs. Its usage is simple : use_backend <backend_name> {if|unless} <acl_cond>	2007-06-17 19:56:27 +02:00
Willy Tarreau	c11416f22f	[MEDIUM] acl: distinguish between request and response headers hdr(x) will now still be used for request headers, and shdr(x) for server headers (response).	2007-06-17 16:58:38 +02:00
Willy Tarreau	16fbe82bfc	[MEDIUM] provide default ACLs The following ACLs are predefined : LOCALHOST = src 127.0.0.1/8 HTTP_1.0 = req_ver 1.0 HTTP_1.1 = req_ver 1.1 METH_CONNECT = method CONNECT METH_GET = method GET HEAD METH_HEAD = method HEAD METH_OPTIONS = method OPTIONS METH_POST = method POST METH_TRACE = method TRACE HTTP_URL_ABS = url_reg ^[^/:]*:// HTTP_URL_SLASH = url_beg / HTTP_URL_STAR = url * HTTP_CONTENT = hdr_val(content-length) gt 0	2007-06-17 11:54:31 +02:00
Willy Tarreau	8aeae4af23	[BUG] str2net() must not change the const char * str2net needs to put \0 in a const char *. Use strdup() for that.	2007-06-17 11:42:08 +02:00
Willy Tarreau	c8d7c96b26	[MEDIUM] acl: support '-i' to ignore case when matching Implemented the "-i" option on ACLs to state that the matching will have to be performed for all patterns ignoring case. The usage is : acl <aclname> <aclsubject> -i pattern1 ... If a pattern must begin with "-", either it must not be the first one, or the "--" option should be specified first.	2007-06-17 08:20:33 +02:00
Willy Tarreau	0fc45a7e83	[MINOR] improve memory freeing upon exit The deinit() function is specialized in memory area freeing. There was a ton of information that was not released at exit time, which made valgrind complain. Now, most of the entries are freed. However, it seems like regfree() does not completely free a regex (12 bytes lost per regex).	2007-06-17 00:36:03 +02:00
Willy Tarreau	dae4aa8c4a	[BUG] fix segfault at exit when using captures since pools v2, the way pools were destroyed at exit is incorrect because it ought to account for users of those pools before freeing them. This test also ensures there is no double free.	2007-06-16 23:19:53 +02:00
Willy Tarreau	74b98a8c22	[BUG] negation in ACL conds was not cleared between terms The exclamation mark (!) in front of an ACL condition was propagated to the whole line instead of being flushed after parsing an acl name.	2007-06-16 19:35:18 +02:00
Willy Tarreau	3f49b30284	[MEDIUM] errorfile: use a local file to feed error messages It is now possible to read error messages from local files, using the 'errorfile' keyword. Those files are read during parsing, so there's no I/O involved. They make it possible to return custom error messages with custom status and headers.	2007-06-11 00:29:26 +02:00
Willy Tarreau	1ad7c6dd85	[MINOR] acl: permit to return any header when no name specified Having the ability to match on hdr_xxx in addition to hdr_xxx(yyy) makes it possible to match any value or to count the headers easily.	2007-06-10 21:42:55 +02:00
Willy Tarreau	737b0c12a6	[MEDIUM] acl: support matching on 'path' component 'path', 'path_reg', 'path_beg', 'path_end', 'path_sub', 'path_dir' and 'path_dom' have been implemented to process the path component of the URI. It starts after the host part, and stops before the question mark.	2007-06-10 21:28:46 +02:00
Willy Tarreau	33a7e6901f	[MEDIUM] acl: implement matching on header values hdr(x), hdr_reg(x), hdr_beg(x), hdr_end(x), hdr_sub(x), hdr_dir(x), hdr_dom(x), hdr_cnt(x) and hdr_val(x) have been implemented. They apply to any of the possibly multiple values of header <x>. Right now, hdr_val() is limited to integer matching, but it should reasonably be upgraded to match long long ints.	2007-06-10 19:45:56 +02:00
Willy Tarreau	97be145991	[MINOR] acl: provide a reference to the expr to fetch() The fetch() functions may need to access the full expr to get their args. Turn the void arg into a struct acl_expr expr.	2007-06-10 11:47:14 +02:00
Willy Tarreau	bb76891d0f	[MINOR] acl: provide the argument length for fetch functions Some fetch() functions will require an argument (eg: header). It's wise to provide the argument size to optimize string comparisons.	2007-06-10 11:17:01 +02:00
Willy Tarreau	d41f8d85e8	[MINOR] acl: specify the direction during fetches Some fetches such as 'line' or 'hdr' need to know the direction of the test (request or response). A new 'dir' parameter is now propagated from the caller to achieve this.	2007-06-10 10:06:18 +02:00
Willy Tarreau	ae8b796722	[MEDIUM] smarter integer comparison support in ACLs ACLs now support operators such as 'eq', 'le', 'lt', 'ge' and 'gt' in order to give more flexibility to the language. Because of this change, the 'dst_limit' keyword changed to 'dst_conn' and now requires either a range or a test such as 'dst_conn lt 1000' which is more understandable.	2007-06-09 23:10:04 +02:00
Willy Tarreau	1db37710dc	[MEDIUM] limit the number of events returned by poll By default, epoll/kqueue used to return as many events as possible. This could sometimes cause huge latencies (latencies of up to 400 ms have been observed with many thousands of fds at once). Limiting the number of events returned also reduces the latency by avoiding too many blind processing. The value is set to 200 by default and can be changed in the global section using the tune.maxpollevents parameter.	2007-06-03 17:16:49 +02:00
Willy Tarreau	fb8983f21b	[BUG] the epoll FD must not be shared between processes Recreate the epoll file descriptor after a fork(). It will ensure that all processes will not share their epoll_fd. Some side effects were encountered because of this, such as epoll_wait() returning an FD which was previously deleted, in multi-process mode.	2007-06-03 16:40:44 +02:00
Willy Tarreau	ab3e1d313c	[MEDIUM] optimize I/O by detecting system starvation Compare the results of recv/send with the parameter passed and detect whether the system has no free buffer space for send() or has no data anymore for recv(). This dramatically reduces the number of syscalls (by about 23%).	2007-06-03 16:05:39 +02:00
Willy Tarreau	fa64558402	[BUG] do not re-arm read timeout after writing data A second occurrence of read-timeout rearming was present in stream_sock.c. To fix the problem, it was necessary to put the shutdown information in the buffer (already planned).	2007-06-03 16:03:49 +02:00
Willy Tarreau	33014d0d8d	[BUG] do not re-arm read timeout in SHUTR state ! There is a long-time bug causing busy loops when either client-side or server-side enters a SHUTR state. When writing data to the FD, it was possible to re-arm the read side if the write had been paused.	2007-06-03 16:03:45 +02:00
Willy Tarreau	ee99136992	[BUG] pre-initialize timeouts with tv_eternity during parsing ETERNITY is not 0 anymore, so all timeouts will not be initialized to ETERNITY by a simple calloc(). We have to explicitly assign them. This bug caused random session aborts.	2007-05-14 14:37:50 +02:00
Willy Tarreau	8eee9c8457	[BUG] fix broken health-checks since switch to timeval Health-checks were broken because of a return which was unexpectedly removed.	2007-05-14 03:40:11 +02:00
Willy Tarreau	d9b744104e	[MINOR] allow null timeouts for past events in select	2007-05-14 03:16:06 +02:00
Willy Tarreau	79b8a62ff6	[BUG] ev_kqueue was forgotten during the switch to timeval	2007-05-14 03:15:46 +02:00
Willy Tarreau	315bff5183	Merge branch 'pools' into merge-pools	2007-05-14 02:11:56 +02:00
Willy Tarreau	1209033e46	[MINOR] disable useless hint in wake_expired_tasks wake_expired_tasks() used a hint to avoid scanning the tree in most cases, but it looks like the hint is more expensive than reaching the first node in the tree. Disable it for now.	2007-05-14 02:11:39 +02:00
Willy Tarreau	fbfc053e34	[BUG] fix buggy timeout computation in wake_expired_tasks Wake_expired_tasks is supposed to return a date, not an interval. It was causing busy loops in pollers.	2007-05-14 02:03:47 +02:00
Willy Tarreau	bdefc513a0	[BUG] fix null timeouts in poll-based pollers Introduction of timeval timers broke poll-based pollers, because the call to tv_ms_remain may return 0 while the event is not elapsed yet. Now we carefully check for those cases and round the result up by 1 ms.	2007-05-14 02:02:04 +02:00
Willy Tarreau	4d2d098ea3	[MAJOR] call garbage collector when doing soft stop When we're interrupted by another instance, it is very likely that the other one will need some memory. Now we know how to free what is not used, so let's do it. Also only free non-null pointers. Previously, pool_destroy() did implicitly check for this case which was incidentally needed.	2007-05-14 00:39:29 +02:00
Willy Tarreau	7dcd46d471	[MEDIUM] enhance behaviour of mempools v2 - keep the number of users of each pool - call the garbage collector on out of memory conditions - sort the pools by size for faster creation - force the alignment size to 16 bytes instead of 4*sizeof(void *)	2007-05-14 00:16:13 +02:00
Willy Tarreau	1d4154a7c0	[MAJOR] convert the header indexes to use mempool v2	2007-05-13 22:57:02 +02:00
Willy Tarreau	cf7f320f9d	[MAJOR] last bunch of capture changes for mempool v2 The header captures had lots of pools. They have all been transformed.	2007-05-13 22:46:04 +02:00
Willy Tarreau	086b3b4c9f	[MAJOR] ported the captures to use the new mempool v2 The "capture.c" file has also been removed since it was empty.	2007-05-13 21:45:51 +02:00
Willy Tarreau	332f8bfc5b	[MAJOR] ported requri to use mempools v2	2007-05-13 21:36:56 +02:00
Willy Tarreau	63963c62e7	[MAJOR] ported appsession to use mempools v2 Also during this process, a bug was found in appsession_refresh(). It would not automatically requeue the task in the queue, so the old sessions would not vanish.	2007-05-13 21:29:55 +02:00
Willy Tarreau	e4d7e55061	[MAJOR] ported pendconn to mempools v2 A pool_destroy() was also missing in deinit()	2007-05-13 20:19:55 +02:00
Willy Tarreau	7341d94c5d	[MAJOR] switched buffers to mempools v2	2007-05-13 19:56:02 +02:00
Willy Tarreau	c6ca1a02aa	[MAJOR] migrated task, tree64 and session to pool2 task and tree64 are already very close in size and are merged together. Overall performance gained slightly by this simple change.	2007-05-13 19:43:47 +02:00
Willy Tarreau	e6ce59deb7	[MEDIUM] add new memory management functions Implement pool_destroy2, pool_flush2, pool_gc2. It is safe to call pool_gc2 to free whatever memory possible.	2007-05-13 19:38:49 +02:00
Willy Tarreau	50e608d721	[MEDIUM] implement memory pools version 2 The new pools know about their size and usage. Malloc is not used anymore, instead a dedicated function to refill the entries is used.	2007-05-13 18:26:08 +02:00
Willy Tarreau	aff694f3b6	Merge branch 'timers' into merge-timers	2007-05-13 16:10:04 +02:00
Willy Tarreau	a8b55e33da	[MINOR] use non-inline tv_* functions in many locations The __tv_* functions were abused. They are not that small and it is not always worth using them.	2007-05-13 16:08:19 +02:00
Willy Tarreau	c64e5397f6	[MINOR] avoid inlining in task.c The task management functions used to call __tv_* which is not really optimal given the size of the functions.	2007-05-13 16:07:06 +02:00
Willy Tarreau	0481c20e66	[MINOR] add new tv_* functions The most useful, tv_add_ifset only adds the increment if it is set. It is designed for use in expiration computation.	2007-05-13 16:03:27 +02:00
Willy Tarreau	01ba1c909d	Merge branch 'master' into timers	2007-05-13 14:52:43 +02:00
Willy Tarreau	6653d17b8d	[BUG] fix ev_sepoll again, this time with a new state machine It was possible in ev_sepoll() to ignore certain events if all speculative events had been processed at once, because the epoll_wait() timeout was not cleared, thus delaying the events delivery. The state machine was complicated, it has been rewritten. It seems faster and more correct right now.	2007-05-13 01:52:05 +02:00
Willy Tarreau	d825eef9c5	[MAJOR] replaced all timeouts with struct timeval The timeout functions were difficult to manipulate because they were rounding results to the millisecond. Thus, it was difficult to compare and to check what expired and what did not. Also, the comparison functions were heavy with multiplies and divides by 1000. Now, all timeouts are stored in timevals, reducing the number of operations for updates and leading to cleaner and more efficient code.	2007-05-12 22:35:00 +02:00
Willy Tarreau	dc246a7f3e	[BUG] two missing states in sepoll transition matrix Two states were missing in the speculative epoll state transition matrix. This could cause some timeouts and unhandled events. The problem showed up in TCP mode with a fast server at high session rates, but could in theory also affect HTTP mode.	2007-05-09 21:57:51 +02:00
Willy Tarreau	7317eb5a1d	[MAJOR] fixed some expiration dates on tasks The time subsystem really needs fixing. It was still possible that some tasks with expiration date below the millisecond in the future caused busy loop around poll() waiting for the timeout to happen.	2007-05-09 00:54:10 +02:00
Willy Tarreau	23677908dd	[MEDIUM] implement SMTP health checks Peter van Dijk contributed this patch which implements the "smtpchk" option, which is to SMTP what "httpchk" is to HTTP. By default, it sends "HELO localhost" to the servers, and waits for the 250 message, but it can also send a specific request.	2007-05-08 23:50:35 +02:00
Willy Tarreau	f3d259868b	[MINOR] ACL regex matching on the URI ; uri_reg The URI can be matched on regexen now. The upcase/lowcase flag can not be set yet and will soon have to.	2007-05-08 23:24:51 +02:00
Willy Tarreau	662b2d8d18	[MINOR] implement the ACL keywords 'dst' and 'dport' The file client.c now provides acl_fetch_dip and acl_fetch_dport to be able to check the client's destination address and port. The corresponding ACL keywords 'dst' and 'dport' have been added.	2007-05-08 23:24:51 +02:00
Willy Tarreau	a67fad9d68	[MINOR] implement acl_parse_ip and acl_match_ip The ACL can now compare IP addresses. The client's IP address can be checked.	2007-05-08 23:24:51 +02:00
Willy Tarreau	5c8e3e09e9	[MEDIUM] added the 'block' keyword to the config language The new 'block' keyword makes it possible to block a request based on ACL test results. Block accepts two optional arguments : 'if' <cond> and 'unless' <cond>. The request will be blocked with a 403 response if the condition is validated (if) or if it is not (unless). Do not rely on this one too much, as it's more of a proof of concept helping in developing other matches.	2007-05-08 23:24:51 +02:00
Willy Tarreau	8797c06327	[MEDIUM] added several ACL criteria and matches Many ACL criteria have been added. Some others are still commented out because some functions are still missing.	2007-05-08 23:24:50 +02:00
Willy Tarreau	eb0c614f0e	[MEDIUM] add the 'acl' keyword to the config language The 'acl' keyword allows one to declare a new ACL. It is an important part of the ACL framework.	2007-05-08 23:24:50 +02:00
Willy Tarreau	a84d374367	[MAJOR] new framework for generic ACL support This framework offers all other subsystems the ability to register ACL matching criteria. Some generic matching functions are already provided. Others will come soon and the framework shall evolve.	2007-05-08 23:24:50 +02:00
Willy Tarreau	14c8aac63b	[MEDIUM] store the original destination address in the session There are multiple places where the client's destination address is required. Let's store it in the session when needed, and add a flag to inform that it has been retrieved.	2007-05-08 23:24:20 +02:00
Willy Tarreau	d077a8e67c	[MINOR] fixed useless memory allocation in str2net() It was not necessary anymore to allocate memory in str2net(). Moreover, some calls to free() were missing in case of errors.	2007-05-08 23:23:38 +02:00
Willy Tarreau	c9b654b48b	[BUG] fix early server close after client close Problem reported by Andy Smith. If a client sends TCP data and quickly closes the connection before the server connection is established, AND the whole buffer can be sent at once when the connection establishes, then the server side believes that it can simply abort the connection because the buffer is empty, without checking that some work was performed. Fix: ensure that nothing was written before closing.	2007-05-08 14:46:53 +02:00
Willy Tarreau	540abe406d	[MEDIUM] ensure that we always have a null word in config It is important when parsing configuration file to ensure that at least one word is empty to mark the end of the line. This will be required with ACLs in order to avoid reading past the end of line.	2007-05-08 14:12:06 +02:00
Willy Tarreau	2fcb500481	[MEDIUM] implement the URI hash algorithm Guillaume Dallaire contributed the URI hashing algorithm for use with proxy-caches. It provides the advantage of optimizing the cache hit rate.	2007-05-08 14:05:27 +02:00
Willy Tarreau	9cdde230a5	[MEDIUM] always have msg->sol point to beginning of message Since the 'data' pointer is not stored in message structures, it is useful to have such a pointer to it when the message has been fully parsed.	2007-05-08 14:05:14 +02:00
Willy Tarreau	e33aecefa6	[MINOR] uninline task_wakeup task_wakeup has become bigger since we used the trees. Let's not inline it anymore.	2007-04-30 14:38:03 +02:00
Willy Tarreau	8bb46f4015	[MINOR] ev_sepoll: refine flags management. Ensure that we don't call the event handlers if the FD is already marked FD_STERROR, and ensure that we properly catch HUP and ERR.	2007-04-30 14:38:00 +02:00
Willy Tarreau	6996e15e16	[BUG] fixed connection establishment detection Since the introduction of speculative I/O, it was not always possible to correctly detect a connection establishment. Particularly, in TCP mode, there is no data to send and getsockopt() returns no error. The solution consists in trying a connect() again to get its diagnostic.	2007-04-30 14:37:43 +02:00
Willy Tarreau	c2c078362a	[MINOR] remove wait_time nullification in ev_sepoll in ev_sepoll(), wait_time is forced to zero if at least one speculative event is converted to a real event. This is completely wrong.	2007-04-29 21:49:00 +02:00
Willy Tarreau	5465e111fd	[MINOR] pre-compute t->expire in event_accept At the end of event_accept(), t->expire is computed with tv_min between two exclusive values. Let's simply assign it at the same time.	2007-04-29 19:09:47 +02:00
Willy Tarreau	42aae5c7cf	[MEDIUM] many cleanups in the time functions Now, functions whose name begins with '__tv_' are inlined. Also, 'tv_ms' is used as a prefix for functions using milliseconds.	2007-04-29 17:43:56 +02:00
Willy Tarreau	f41d4b15ee	[MINOR] tell the compiler that debug mode is unlikely to happen In process_session(), add unlikely() around debug code.	2007-04-29 13:44:48 +02:00
Willy Tarreau	8d7d1497e0	[MEDIUM] implement and use tv_cmp2_le instead of tv_cmp2_ms tv_cmp2_ms handles multiple combinations of tv1 and tv2, but only one form is used: (tv1 <= tv2). So it is overkill to use it everywhere. A new function designed to do exactly this has been written for that purpose: tv_cmp2_le. Also, removed old unused tv_* functions.	2007-04-29 13:44:43 +02:00
Willy Tarreau	a6a6a93e56	[MAJOR] changed TV_ETERNITY to ~0 instead of 0 The fact that TV_ETERNITY was 0 was very awkward because it required that comparison functions handled the special case. Now it is ~0 and all comparisons are performed on unsigned values, so that it is naturally greater than any other value. A performance gain of about 2-5% has been noticed.	2007-04-29 13:44:24 +02:00
Willy Tarreau	96bcfd75aa	[MAJOR] replaced rbtree with ul2tree. The rbtree-based wait queue consumes a lot of CPU. Use the ul2tree instead. Lots of cleanups and code reorganizations made it possible to reduce the task struct and simplify the code a bit.	2007-04-29 13:43:53 +02:00
Willy Tarreau	de99e99ecf	[MAJOR] introduced speculative I/O with epoll() The principle behind speculative I/O is to speculatively try to perform I/O before registering the events in the system. This considerably reduces the number of calls to epoll_ctl() and sometimes even epoll_wait(), and manages to increase overall performance by about 10%. The new poller has been called "sepoll". It is used by default on Linux when it works. A corresponding option "nosepoll" and the command line argument "-ds" allow to disable it.	2007-04-16 00:53:59 +02:00
Willy Tarreau	ef1d1f859b	[MAJOR] auto-registering of pollers at load time Gcc provides __attribute__((constructor)) which is very convenient to execute functions at startup right before main(). All the pollers have been converted to have their register() function declared like this, so that it is not necessary anymore to call them from a centralized file.	2007-04-16 00:25:25 +02:00
Willy Tarreau	b40d42006c	[BUILD] declare epoll_* as static when using our own functions We will have to share this code among several implementations.	2007-04-15 23:57:41 +02:00
Willy Tarreau	9f195293de	[MAJOR] remove useless calls to shutdown(SHUT_RD) shutdown(SHUT_RD) is useless on data TCP sockets. It does nothing and consumes one syscall. Remove it.	2007-04-15 21:26:58 +02:00
Willy Tarreau	8374918cce	[MAJOR] implemented support for speculative I/O processing The pollers will now be able to speculatively call the I/O processing functions and decide whether or not they want to poll on those FDs. The changes primarily consist in teaching those functions how to pass the info that they got an EAGAIN.	2007-04-15 20:56:27 +02:00
Willy Tarreau	3d32d3a849	[MINOR] add support for the polling results in fdtab Now fdtab can contain the FD_POLL_* events so that the pollers which can fill them can give useful information to readers and writers about the precise condition of wakeup.	2007-04-15 11:31:05 +02:00
Willy Tarreau	7a9664872e	[MINOR] recompute maxfd before touching fdtab It may be dangerous to play with fdtab before doing fd_insert() because this last one is responsible for growing maxfd as needed. Call fd_insert() before instead.	2007-04-15 10:58:02 +02:00
Willy Tarreau	69cad1a338	[MINOR] copy-paste typo when checking for -dk to disable kqueue.	2007-04-10 22:45:11 +02:00
Willy Tarreau	258696f5d8	[MAJOR] missing tv_now in kqueue_poll() blocking timeouts a missing call to tv_now(&now) just after kevent() prevented the timeouts from expiring.	2007-04-10 02:31:54 +02:00
Willy Tarreau	58094f2fd9	[MAJOR] ev_epoll: do not rely on fd_sets anymore The new epoll-based poller uses a list of changes in order to process only the fds which have changed.	2007-04-10 01:43:43 +02:00
Willy Tarreau	40562cb00c	[MINOR] kqueue: use fd_clo() to close the fd fd_clo() does not call kevent() which is not needed during a close(). This one will be faster.	2007-04-09 20:38:57 +02:00
Willy Tarreau	2ff7622c0c	[MAJOR] delay registering of listener sockets at startup Some pollers such as kqueue lose their FD across fork(), meaning that the registered file descriptors are lost too. Now when the proxies are started by start_proxies(), the file descriptors are not registered yet, leaving enough time for the fork() to take place and to get a new pollfd. It will be the first call to maintain_proxies that will register them.	2007-04-09 19:29:56 +02:00
Willy Tarreau	8755285486	[MEDIUM] kqueue: do not manually remove fds FDs attached to a kevent are automatically removed after close(). Also, do not mark the FDs as EV_CLEAR. We want to stay informed about readiness.	2007-04-09 17:16:07 +02:00
Willy Tarreau	cd5ce2a514	[MAJOR] kqueue bug in handling infinite timeouts Calls to kevent() need to pass NULL when there is no timeout.	2007-04-09 16:25:46 +02:00
Willy Tarreau	e1a7a2f0d8	[MAJOR] kqueue was not initialized during startup	2007-04-09 16:11:49 +02:00
Willy Tarreau	a8cff1d6a7	[BUILD] fixed a warning on OpenBSD : MIN/MAX redefined	2007-04-09 16:10:57 +02:00
Willy Tarreau	63455a9be5	[MINOR] use 'is_set' instead of 'isset' in struct poller 'isset' was defined as a macro in /usr/include/sys/param.h, and it breaks build on at least OpenBSD.	2007-04-09 15:34:49 +02:00
Willy Tarreau	69801b8e77	[MINOR] removed proto/polling.h which was not used anymore	2007-04-09 15:28:51 +02:00
Willy Tarreau	1e63130a37	[MAJOR] implemented support for FreeBSD's kqueue() polling mechanism It has not been tested yet, but at least it builds.	2007-04-09 12:03:06 +02:00
Willy Tarreau	e54e9176a3	[MINOR] ev_* : moved the poll function closer to fd_*	2007-04-09 09:23:31 +02:00
Willy Tarreau	97129b5408	[MINOR] changed fd_set/fd_clr functions to return ints The fd_* functions now return ints so that they can be factored when appropriate.	2007-04-09 00:54:46 +02:00
Willy Tarreau	28d86862bc	[MEDIUM] pollers: store the events in arrays Instead of managing StaticReadEvent/StaticWriteEvent, use evts[dir]	2007-04-08 17:42:27 +02:00
Willy Tarreau	663193882a	[MEDIUM] factor FD_ISSET/FD_CLR and !FD_ISSET/FD_SET Use the new FD_COND_C/FD_COND_S macros to reduce the number of operations during tests and sets.	2007-04-08 17:17:37 +02:00
Willy Tarreau	f161a34fb3	[MEDIUM] updated all files to use EV_FD_* Removed the temporary dirty hack.	2007-04-08 16:59:42 +02:00
Willy Tarreau	4f60f16dd3	[MAJOR] modularize the polling mechanisms select, poll and epoll now have their dedicated functions and have been split into distinct files. Several FD manipulation primitives have been provided with each poller. The rest of the code needs to be cleaned to remove traces of StaticReadEvent/StaticWriteEvent. A trick involving a macro has temporarily been used right now. Some work needs to be done to factorize tests and sets everywhere.	2007-04-08 16:39:58 +02:00
Willy Tarreau	b3107b9383	[MINOR] pollers should not use MY_FD_*	2007-04-08 09:32:47 +02:00
Willy Tarreau	a9bd19853e	[BUG] initialize msg->sol before parsing first line Before calling http_parse_{sts,req}line(), it is necessary to make msg->sol point to the beginning of the line. This was not done, resulting in the proxy sometimes crashing when URI rewriting or result rewriting was used.	2007-04-03 20:03:18 +02:00
Willy Tarreau	02785764a4	[BUG] Status line in HTTP response could not be rewritten Typo implied use of HTTP_MSG_RQMETH state instead of HTTP_MSG_RPVER.	2007-04-03 14:45:44 +02:00
Willy Tarreau	422505801f	[MEDIUM] split logs into two versions : TCP and HTTP logs are handled better with dedicated functions. The HTTP implementation moved to proto_http.c. It has been cleaned up a bit. Now a frontend with option httplog and no log will not call the function anymore.	2007-04-01 01:30:43 +02:00
Willy Tarreau	e2e27a5c8d	[MEDIUM] removed now unused fiprm and beprm from proxies The fiprm and beprm were added to ease the transition between a single listener mode to frontends+backends. They are no longer needed and make the code a bit more complicated. Remove them.	2007-04-01 00:01:37 +02:00
Willy Tarreau	f2f0ee81ad	[BUG] fix reqadd when no option httpclose is used. Due to a code indentation mismatch, the rspadd headers were only added if option httpclose was not set.	2007-03-30 12:02:43 +02:00
Willy Tarreau	0b4ed90de4	[BUILD] cfgparse requires errno.h on OpenBSD.	2007-03-26 00:18:40 +02:00
Willy Tarreau	2807efdb02	[MEDIUM] do not add Connection: close in HTTP/1.0 mode If we already are in HTTP/1.0 and if no connection: has been seen, it is not necessary to add Connection: close.	2007-03-25 23:47:23 +02:00
Willy Tarreau	f2b74c26c5	[CLEANUP] added a few missing newlines to the HTML report Sometimes it is preferable that the HTML output can be parsed. Ensure better use of the newlines for this.	2007-03-25 22:44:08 +02:00
Willy Tarreau	417fae0e60	[MINOR] changed server weight storage from char to unsigned int This change does not affect memory usage much, but it simplifies the code a lot by removing many +1/-1 operations on weights.	2007-03-25 21:16:40 +02:00
Willy Tarreau	0f03c6f60b	[MINOR] cleaned up the check_addr patch a bit removed useless set_check_addr entry and rely on check_addr itself.	2007-03-25 20:46:19 +02:00
Willy Tarreau	2ea3abb7bf	[MEDIUM] add support for health-checks on other addresses Patch from Fabrice Dulaunoy. Explanation below, and script merged in examples/. This patch allows a different address to be set in the check part for each server (and not only a specific port). I need this feature because I have a complex setup where, when a specific farm goes down, I need to switch a set of other farms even if these other farms behave perfectly well. For that purpose, I've made a small Perl daemon with some regex or port tests which allows me to test a bunch of things.	2007-03-25 16:45:16 +02:00
Willy Tarreau	7ac51f61f5	[MEDIUM] add the "except" keyword to the "forwardfor" option Patch from Bryan Germann for 1.2.17. In some circumstances, it is useful not to add the X-Forwarded-For header, for instance when the client is another reverse-proxy or stunnel running on the same machine and which already adds it. This patch adds the "except" keyword to the "forwardfor" option, allowing to specify an address or network which will not be added to this header.	2007-03-25 16:00:04 +02:00
Willy Tarreau	95c20aca35	[MEDIUM] add user/groupname support Patch from Marcus Rueckert for 1.2.17 : "I added the attached patch to haproxy. I don't have a static uid/gid for haproxy so i need to specify the username/groupname to run it as non root user."	2007-03-25 15:39:23 +02:00
Willy Tarreau	b38651a435	[MEDIUM] check for cttproxy support when required Previously, use of the "usesrc" keyword could silently fail if either the module was not loaded, or the user did not have enough permissions. Now the errors are better diagnosed and more appropriate advice is given.	2007-03-24 17:24:39 +01:00
Willy Tarreau	8d9246d282	[MINOR] more friendly reports of wrong uses of the usesrc keyword It was difficult to find how to enter the "usesrc" keyword. Now the configuration checker is a bit more friendly and tries to identify most mistakes and gives some hints back.	2007-03-24 12:47:24 +01:00
Willy Tarreau	9641e8f6ee	[MINOR] read optimizations based on the MSS Generally, if a recv() returns less bytes than the MSS, it means that there is nothing left in the system's buffers, and that it's not worth trying to read again because we are very likely to get nothing. A default read low limit has been set to 1460 bytes below which we stop reading. This has brought a little speed boost on small objects while maintaining the same speed on large objects.	2007-03-23 23:02:09 +01:00
Willy Tarreau	b8949f1ed0	[MEDIUM] re-implemented the multiple read polling Multiple read polling was temporarily disabled, which had the side effect of burning huge amounts of CPU on large objects. It has now been re-implemented with a limit of 8 calls per wake-up, which seems to provide best results at least on Linux.	2007-03-23 22:39:59 +01:00
Willy Tarreau	042cc79e59	[BUG] fix pointer initializations for TCP connections. Very recent changes consisting in moving some pointers to the transaction instead of the session have led to a bug because those pointers were only initialized if the protocol was HTTP, but they were freed based on their value. In some cases, it was possible to cause double frees.	2007-03-19 16:20:06 +01:00
Willy Tarreau	aa9dce3bd6	[MINOR] added new function http_header_match2() HTTP header matching is now made easier with http_header_match2(). Various locations have been adapted to use it. A small bug was also fixed causing empty headers to be matched till next one.	2007-03-18 23:50:16 +01:00
Willy Tarreau	4af6f3a9ea	[MINOR] HTTP: factorize all the header insertions Two new functions http_header_add_tail() and http_header_add_tail2() make it easier to append headers, and also reduce the number of sprintf() calls and perform stricter checks.	2007-03-18 22:36:26 +01:00
Willy Tarreau	a5e65754e6	[MINOR] used http_flush_cookie_flags() instead of a dirty code block	2007-03-18 20:53:22 +01:00
Willy Tarreau	3d300596bb	[MINOR] move some flags from session.h to proto_http.h Some session flags were clearly related to HTTP transactions. A new 'flags' field has been added to http_txn, and the associated flags moved to proto_http.h.	2007-03-18 18:34:41 +01:00
Willy Tarreau	3bac9ffe20	[CLEANUP] move http_txn out of session.h The http_txn structure definitions moved to proto_http.h	2007-03-18 17:31:28 +01:00
Willy Tarreau	5416b36b43	[CLEANUP] removed useless includes from streamsock.c	2007-03-18 17:03:19 +01:00
Willy Tarreau	e09e0cef62	[MINOR] removed the ->h member in struct buffer The buffer does not need the header pointer anymore, it has been removed everywhere.	2007-03-18 16:31:29 +01:00
Willy Tarreau	b49871738e	[MINOR] fix accounting for response bytes A remaining reference to rep->h was replaced.	2007-03-18 16:28:03 +01:00
Willy Tarreau	a15645d435	[MAJOR] completed the HTTP response processing. Now the response is correctly processed in the backend first then in the frontend. It has followed intensive tests to catch regressions, and everything seems OK now, but the code is young anyway.	2007-03-18 16:22:39 +01:00
Willy Tarreau	117f59e282	[MINOR] code factoring : capture_headers() serves requests and responses Both request and response captures will have to parse headers following the same methods. It's better to factorize the code, hence the new capture_headers() function.	2007-03-04 18:17:17 +01:00
Willy Tarreau	4b89ad4358	[MINOR] implement http_is_ver_token to fix response parsing This new character map improves accuracy when parsing HTTP version, which helps inspecting requests, and fixes response handling.	2007-03-04 18:13:58 +01:00
Willy Tarreau	6911fa484c	[MINOR] added new str2i* functions Those functions provide faster and more flexible alternatives to atoi(), some of which are able to work on sub-strings.	2007-03-04 18:06:08 +01:00
Willy Tarreau	bb046ac8c5	[MINOR] option forwardfor is for frontends too Finally, if the "option forwardfor" is specified in the frontend and not in the backend, apply it.	2007-03-03 20:54:01 +01:00
Willy Tarreau	c2168d3ccb	[CLEANUP] replaced occurrences of 'hreq' with 'txn' (bis) Did the same in client.c	2007-03-03 20:53:23 +01:00
Willy Tarreau	4dbc4a2ee4	[CLEANUP] replaced occurrences of 'hreq' with 'txn' In many places, the variable "hreq" designated a transaction more than a request. This has been changed to avoid confusion.	2007-03-03 16:23:22 +01:00
Willy Tarreau	b326fcc46a	[CLEANUP] renamed several HTTP structures Some parts of HTTP processing were incorrectly called "request" while they are messages or transactions. The following structure members have changed : http_msg.hdr_state => msg_state ; http_msg.sor => som ; http_req.req_state => removed ; http_req => http_txn	2007-03-03 13:54:32 +01:00
Willy Tarreau	5e8f066961	[MINOR] slightly optimize time calculation for rbtree The new rbtree-based scheduler makes heavy use of tv_cmp2(), and this function becomes a huge CPU eater. Refine it a little bit in order to slightly reduce CPU usage.	2007-02-12 00:59:08 +01:00
Willy Tarreau	b1b8272a54	[MINOR] uninline rb_insert_task_queue() rb_insert_task_queue() was inlined and is quite large. Uninlining it reduces code size by about 2 kB and slightly improves performance.	2007-02-11 13:52:16 +01:00
Willy Tarreau	92f2ab1b1f	[BUG] fix crash when no cookie is set on server In cookie prefix or rewrite modes, if the elected server had no cookie, a NULL pointer was passed to the rewrite function, causing a SIGSEGV.	2007-02-02 22:14:47 +01:00
Willy Tarreau	4266a36c5a	[BUG] segfault on some erroneous configurations If captures were configured in a TCP-only listener, and the logs were enabled, the proxy could segfault when trying to scan the capture buffer which was NULL. Such an erroneous configuration will not be possible anymore soon, but let's avoid the problem for now by detecting the NULL condition.	2007-02-01 23:15:45 +01:00
Willy Tarreau	b9ebf70a3a	[CRITICAL] an empty header may lead to a crash A missing pointer assignment in case of an empty header will result in this header's length being 65535, causing a SEGV when accessing the next header. It should not be possible to exploit this problem to run arbitrary code because the crash occurs while reading the data.	2007-01-26 23:39:38 +01:00
Willy Tarreau	f0d058e8ab	[BUG] hdr_idx might be left uninitialized in some cases When a request is invalid during RQ_BEFORE AND the debug mode is active, the hdr_idx might be used uninitialized. Let's initialize it right after the accept() for now.	2007-01-25 12:03:42 +01:00
Willy Tarreau	83969f42ba	[MAJOR] invalid header offset broke cookies and authentication Since the request is no longer part of the headers, cookies and authentication did not work anymore. Obvious fix is to add the request offset to the start pointer.	2007-01-22 08:55:47 +01:00
Willy Tarreau	49e1ee83be	[RELEASE] Released 1.3.6 with the following changes : - stats now support the HEAD method too - extracted http request from the session - huge rework of the HTTP parser which is now a 28-state FSM. - linux-style likely/unlikely macros for optimization hints - do not create a server socket when there's no server	2007-01-22 00:56:46 +01:00
Willy Tarreau	8973c70f7d	[MEDIUM] implemented the status-line parser in http_msg_analyzer(). The status line parser has been written. With it, it should not be too hard to replace the response parser to benefit from the new header facilities.	2007-01-21 23:58:29 +01:00
Willy Tarreau	362b34d05c	[MINOR] move the response headers to the http_req	2007-01-21 20:49:31 +01:00
Willy Tarreau	8d5d7f20b9	[MAJOR] huge rework of the HTTP request FSM The HTTP parser has been rewritten for better compliance to RFC2616. The same parser is now usable for both requests and responses, and it now supports HTTP/0.9 as well as multi-line headers. It has also been improved for speed ; a typical HTTP request is parsed in about 2 microseconds on a 1 GHz processor. The monitor-uri check has been moved so that the requests are not logged. The httpclose option now tries to change as little as possible in the request, and does not affect the first header if it is already set to 'close'. HTTP/0.9 requests are converted to HTTP/1.0 before being forwarded. Headers and request transformations are now distinct. The headers list is updated after each insertion/removal/transformation. The request is re-parsed and checked after each transformation. It is not possible anymore to remove a request, and requests which lead to invalid request lines are now rejected.	2007-01-21 19:16:41 +01:00
Willy Tarreau	5d65bbb2aa	[BUG] last backend change broke server assignment Due to a change in the if/else paths, s->flags did not receive the SN_ASSIGNED value anymore.	2007-01-21 12:47:26 +01:00
Willy Tarreau	1a1158b0bd	[MINOR] do not create a socket if there is no server Since the distinction of backends and frontends, it has become possible that some requests reach a frontend which has no backend parameters. We must not create a socket on the backend side just to destroy it later in such a case. The real problem comes from the dispatch mode not being explicitly stated.	2007-01-20 11:07:46 +01:00
Willy Tarreau	0637fa0671	[MINOR] add the end of line pointer in each HTTP header	2007-01-13 23:07:22 +01:00
Willy Tarreau	0f7562b8d3	[MEDIUM] separate the http request from the session (step 1) A struct http_req has been created to collect every information related to an HTTP request being processed. Right now, it is still in the struct session but the frontier is clear now.	2007-01-07 15:46:13 +01:00
Willy Tarreau	0214c3a307	[MEDIUM] Stats: add support for the HEAD method There are browsers which sometimes send HEAD requests to the stats page, but it was not handled so it returned a 503 server error or was simply sent to the default backend servers. Now with a HEAD request, the stats return the headers and finish there. Normally, other methods should be blocked so that the stats page really catches the whole URI. Other methods would need to cause a 405 Method not allowed to be returned.	2007-01-07 13:47:30 +01:00
Willy Tarreau	ef00b50011	[MINOR] try to guess server check port when unset When a server has no port specified and there is a check enabled on it, the check is disabled because the port is unknown. However, people expect the "listen" line to set the check port just like it sets the server's port. Now, if a port is specified in the listen or in the first bind and nowhere else, it will be used for the checks as well.	2007-01-07 02:40:09 +01:00
Willy Tarreau	86efac8411	Merge branch 'rbtree'	2007-01-07 02:17:18 +01:00
Willy Tarreau	733fef4add	Merge branch 'tcpsplice'	2007-01-07 02:16:59 +01:00
Willy Tarreau	964c936b04	[MAJOR] replace the wait-queue linked list with an rbtree. This patch from Sin Yu makes use of an rbtree for the wait queue, which will solve the slowdown problem encountered when timeouts are heterogenous in the configuration. The next step will be to turn maintain_proxies() into a per-proxy task so that we won't have to scan them all after each poll() loop.	2007-01-07 02:14:23 +01:00
Willy Tarreau	d59d22e20a	[MINOR] imported the rbtree function from Linux kernel Those rbtree functions will be used by Sin Yu's new rbtree scheduler.	2007-01-07 02:12:57 +01:00
Willy Tarreau	368e96ad88	[MINOR] [STATS] swap color sets for active and backup servers colors had accidentally been swapped during the stats page rewrite. Thanks to Sin Yu for noticing it.	2007-01-07 02:08:18 +01:00
Willy Tarreau	6d1a9884f9	[MAJOR] complete support and doc for tcp-splicing The tcp-splicing code has been merged, and a doc has been written. A configuration example has been derived from the previous content switching sample.	2007-01-07 02:03:04 +01:00
Willy Tarreau	8f922fcc3c	[MINOR] added the "tcpsplice" option it does nothing yet except set the minimal options.	2007-01-06 23:45:24 +01:00
Willy Tarreau	4fee4e9d32	[MINOR] the options table now sets the prerequisite checks Some options will need some checks (or initializations) to be performed before starting everything. The cfg_opts table has been extended to allow storing of option-dependent checks.	2007-01-06 21:09:17 +01:00
Willy Tarreau	35d66b0c28	[MINOR] added byte count to sessions and statistics. Now the stats page reports the IN and OUT byte counts per FE, BE and SRV.	2007-01-02 00:28:21 +01:00
Willy Tarreau	41dff82b54	[CRITICAL] fixed memory leak in session_free() Since the introduction of hdr_idx, session_free() had not been updated to free the header ! It implied a consumption of about 400 bytes per new session.	2007-01-01 23:32:30 +01:00
Willy Tarreau	5fdfb911a0	[MEDIUM] implemented the "default_backend" keyword The "default_backend" keyword used in a frontend sets the default backend which will be used if no setbe rule matches.	2007-01-01 23:11:07 +01:00
Willy Tarreau	128e954663	[MINOR] stats: factorize many chunk_printf() Improve code size, speed and readability by factoring many calls to chunk_printf().	2007-01-01 22:01:43 +01:00
Willy Tarreau	c0dde7a8ed	[MAJOR] updated the stats page to clearly distinguish FEs and BEs The stats page could not tell the difference between a FE and a BE. It has been revamped to indicate all relevant information. The font is also slightly smaller in order for all the info to fit into small screens. The data output path has been greatly simplified to use string chunks.	2007-01-01 21:38:07 +01:00
Willy Tarreau	2b5652f9fa	[MINOR] indicate the proxy type in the logs after a loss of servers When the last server goes down in a backend, indicate 'backend' or 'listener' in the log message depending on the type of the backend.	2006-12-31 17:46:05 +01:00
Willy Tarreau	13943abbd2	[MEDIUM] use an array to store most common options Most common options are now stored in an array which eases the parsing and which also permits reporting of ignored options depending on the proxy's capabilities (back/front).	2006-12-31 00:24:10 +01:00
Willy Tarreau	e01954f45e	[MINOR] option httpclose is now checked both in FE and BE The "httpclose" option affects both frontend and backend, so it was logical to check for its presence at both places. A request which traverses either a frontend or a backend with this option set will have a "Connection: close" header appended.	2006-12-30 23:43:54 +01:00
Willy Tarreau	ebd6160dd3	[MEDIUM] updated log format to report frontend and backend The log format has been slightly updated to separately report the name of the frontend and the name of the backend. The accept date has been enhanced to report the millisecond. The number of remaining connections has also been updated and their order reversed, to include the number of connections on the frontend. The new log format is now : - $1: IP:port - $2: accept date in this format : [dd/mm/YYYY:HH:MM:SS.ttt] - $3: frontend name - $4: backend name '/' server name - $5: req time '/' queue time '/' conn time '/' header time '/' total time - $6: HTTP status code - $7: number of bytes returned - $8: captures (request) - $9: captures (response) - $10: completion flags - $11: remaining conns on process '/' frontend '/' backend '/' server - $12: srv queue size '/' backend queue size - $13..: '"' full request '"'	2006-12-30 11:54:15 +01:00
Willy Tarreau	977b8e41ba	[MAJOR] distinguish between frontend, backend, ruleset and listen The notion of capabilities has been added to the proxy so that we know whether a proxy supports frontend, backend, or rulesets. Given this, some parameters are optional, some are ignored with a warning and others are forbidden. It is now possible to write valid two level configs without binding to dummy address/ports.	2006-12-29 14:19:17 +01:00
Willy Tarreau	8603431822	[MEDIUM] split fe->maxconn into fe->maxconn and be->fullconn The maxconn argument is used only for the listeners, and the fullconn is used only for the backends. If unset, it inherits maxconn's value which itself can inherit the default or the global value (we might need to change this).	2006-12-29 00:10:33 +01:00
Willy Tarreau	97de624c17	[MEDIUM] session logging is now defined by the frontend To solve the logging maze, it has been decided that the frontend and nothing else will define how a session will be logged. It might change in the future but at least this choice allows all sort of fantasies.	2006-12-27 17:18:38 +01:00
Willy Tarreau	8058743d7a	[MEDIUM] errorloc now checked first from backend then from frontend It is now possible to define an errorloc in the backend as well as in the frontend. The backend's will be used first, and if undefined, then the frontend's will be used instead. If none is used, then the original error messages will be used.	2006-12-24 17:47:20 +01:00
Willy Tarreau	0f77253a22	[MINOR] store HTTP error messages into a chunk array HTTP error messages were all specific cases handled by an IF. Now they are all in an array so that it will be easier to add new ones. Also, the return functions now use chunks as inputs so that it should be easier to provide alternative return messages if needed.	2006-12-23 20:51:41 +01:00
Willy Tarreau	a496b6042b	[MAJOR] merged the 'setbe' actions to switch the backend on a regex Sin Yu's patch to permit to change the proxy from a regex was merged with little changes : - req_cap/rsp_cap are not reassigned to the new proxy, they stay attached to the frontend - the actions have been renamed "reqsetbe" and "reqisetbe" for "set BackEnd". - the buffer is not reset after the switch, instead, the headers are parsed again by the backend - in Sin's patch, it was theoretically possible to switch multiple times, but the switching track was lost, making it impossible to apply server responses in the reverse order. Now switching is limited to 1 action (separation between frontend and backend) but the filters remain. Now it will be extremely easy to add other switching conditions, such as host matching, URI matching, etc... There is still hard work to be done on the logs and stats.	2006-12-17 23:15:24 +01:00
Willy Tarreau	ddb358d932	[MEDIUM] tried to clean the logs up a little bit The logs have become a real mess. It is now very hard to tell which frontend/backend will impose its configuration for the logs. This needs a complete rework but at least it should work.	2006-12-17 22:55:52 +01:00
Willy Tarreau	f1221aa19f	[MEDIUM] separated nbconn into feconn and beconn The nbconn attribute in the proxies was not relevant anymore because a frontend A may use backend B and both of them must account for their respective connections. For this reason, there now are two separate counters for frontend and backend connections. The stats page has been updated to reflect the backend, but a separate line entry for the frontend with error counts would be good. Note that as of now, beconn may be higher than maxconn, because maxconn applies to the frontend, while beconn may be increased due to sessions passed from another frontend.	2006-12-17 22:14:12 +01:00
Willy Tarreau	830ff458de	[MAJOR] reworked ->be, ->fe and ->fi in sessions There was a confusion about the way to find filters and backend parameters from sessions. The chaining has been changed between the session and the proxy. Now, a session knows only two proxies : one frontend (->fe) and one backend (->be). Each proxy has a link to the proxy providing filters and to the proxy providing backend parameters (both self by default). The captures (cookies and headers) have been attached to the frontend's filters for now. The uri_auth and the statistics are attached to the backend's filters so that the uri can depend on a hostname for instance.	2006-12-17 19:31:23 +01:00
Willy Tarreau	97a738f32c	[MINOR] add the fiprm and beprm indirections to struct proxy A proxy will be able to borrow parameters from another one. In particular, the filters will be inheritable from another proxy, and the backend parameters too.	2006-12-17 18:02:30 +01:00
Willy Tarreau	b251390f7e	[MEDIUM] moved uri_auth check to a separate function The check of uri_auth is now in a separate function which is checked after every backend switch, so that it will be possible to have an uri_auth for the frontend and another one for the backend.	2006-12-17 14:52:38 +01:00
Willy Tarreau	921d7c0a70	[MINOR] removed the SN_POST flag and string checks on method Now that hreq.meth is known, use it everywhere a method is required.	2006-12-17 13:50:27 +01:00
Willy Tarreau	53b6c74d06	[MEDIUM] check the HTTP method after all filters have been applied The HTTP method is now checked and saved into hreq.meth. It will be usable at several places instead of those dirty string comparisons.	2006-12-17 13:37:46 +01:00
Willy Tarreau	230fd0bfdf	[MEDIUM] optimized the request parser a bit more Some while() constructs are not very efficient with gcc, yet they are used to scan all the text in the start line and the headers. Replacing them with more efficient (but ugly) loops provides a global gain of about 2%, which is not bad at all !	2006-12-17 12:05:00 +01:00
Willy Tarreau	976f1ee561	[MINOR] reorganized the request parser states to improve speed The most commonly branched states have been grouped in the first ifs.	2006-12-17 10:06:03 +01:00
Willy Tarreau	06619265b1	[MEDIUM] reorganized request handling to prepare for content-switching The filters are now iterated for FE, FI, BE. Some grey areas remain : - uri_auth has been propagated to the backend, but in fact it should be checked at every level (fe, fi, be), depending where it is declared, and before the filters. - the HTTP method and URI should be stored and propagated everywhere they are used. For this, we would need to first apply filters to be aware of filter changes which affect them. - there seems to be no need anymore for hdr_idx[0] being empty. It may contain the start line, which will slightly improve performance and make the code easier to read.	2006-12-17 08:37:22 +01:00
Willy Tarreau	45e73e3cd9	[MEDIUM] move all HTTP Request-related session material to struct hreq The req_cap, hdr_state, hdr_idx, auth_hdr and req_line have been moved to a dedicated hreq structure in the session. It makes it easier to add HTTP-specific fields such as SOR (start of request) and EOF (end of headers). It also made it possible to fix two bugs introduced by last commit : - end of headers not correctly detected - hdr_idx not freed upon one specific error during session creation When the backend side will be reworked, it should rely on a similar structure.	2006-12-17 00:05:15 +01:00
Willy Tarreau	a4cd1f50cc	[MEDIUM] make process_cli() not depend on req->h anymore Local variables now keep the start and end of line at any moment. req->h has been removed and will soon be removable from the buffer.	2006-12-16 19:57:26 +01:00
Willy Tarreau	f224273df3	[BUILD] last commit did not build	2006-12-16 19:00:29 +01:00
Willy Tarreau	e15d9132df	[MEDIUM] reference and index appended request headers When headers are appended to the end of a request, they must be indexed.	2006-12-14 22:26:42 +01:00
Willy Tarreau	2a32428926	[MAJOR] finished replacement of the client-side HTTP parser with a new one The code is working again, but not as clean as it could be. Many blocks should still move to dedicated functions. req->h must be removed everywhere and updated every time needed. A few functions or macros should take care of the headers during header insertion/deletion/change.	2006-12-05 00:05:46 +01:00
Willy Tarreau	58f10d7478	[MAJOR] replaced the client-side HTTP parser with a new one The new parser uses an FSM to strictly follow RFC2616. Headers are indexed and parsed only once they're all available. That way, complex regexes make more sense. HTTP processing is now performed in several phases by calling multiple functions, making the code cleaner and easier to read. Note that req[i]pass does not work anymore because it would require that we mark a header to be ignored. What is really needed is to have the ability to add an exception to a matching (match xx except yy). Several bugs have been fixed in appsession during the conversion to the new FSM (method length and recovery on malloc errors). The code does build and work with the debug examples, but is not usable yet to connect to anything as it does not forward the requests yet.	2006-12-04 02:26:12 +01:00
Willy Tarreau	b7eba10304	[BUG] files were missing for hdr_idx in previous commit	2006-12-04 02:20:02 +01:00
Willy Tarreau	e5f20dcea8	[MEDIUM] added the hdr_idx structure for future HTTP header indexing This structure will consume 4 bytes per header to keep track of headers within a request or a response without having to parse the whole request for each regex. As it's not possible to allocate only 4 bytes, we define a max number of HTTP headers. We set it to (BUFSIZE+79)/80 so that 8kB buffers can contain 100 headers (like Apache), resulting in 400 bytes dedicated to indexation, or about 400/(2*8kB) ~= 2.4% of the memory usage.	2006-12-03 15:21:35 +01:00
Willy Tarreau	09536952b3	Merge branch 'rfc2616' into switch	2006-12-02 20:13:39 +01:00
Willy Tarreau	669e6da163	[BUG] implemented support for multi-line headers as required by RFC2616. This patch was added in 1.2.9 but was then accidentally reverted by manipulation error when merging next patch (enforce max number of conns). It's now merged again.	2006-12-02 20:12:55 +01:00
Willy Tarreau	73de9899a6	[MAJOR] separate sess->proxy into sess->{fe,fi,be} The references to the proxy from the session have been turned into Frontend (fe), Filters (fi) and Backend (be). This should ease the migration to the L7 switching features. Next step will be to kill the struct proxy and have 3 independent structs instead, each referenced from entities called listener, frontend, filters and backend.	2006-11-30 11:40:23 +01:00
Willy Tarreau	163c53253c	[MEDIUM] use tproxy address as source of health checks If a tproxy address is defined, then use it for health checks too.	2006-11-14 16:18:41 +01:00
Willy Tarreau	f19cf37031	[BUILD] remove a warning in backend.c include <string.h> to remove a warning on memset	2006-11-14 15:40:51 +01:00
Willy Tarreau	77074d548b	[MAJOR] support for source binding via cttproxy Using the cttproxy kernel patch, it's possible to bind to any source address. It is highly recommended to use the 03-natdel patch with the other ones. A new keyword appears as a complement to the "source" keyword : "usesrc". The source address is mandatory and must be valid on the interface which will see the packets. The "usesrc" option supports "client" (for full client_ip:client_port spoofing), "client_ip" (for client_ip spoofing) and any 'IP[:port]' combination to pretend to be another machine. Right now, the source binding is missing from server health-checks if set to another address. It must be implemented (think restricted firewalls). The doc is still missing too.	2006-11-12 23:57:19 +01:00
Willy Tarreau	1001b949ee	[CLEANUP] fd.c : regparm was hardcoded too.	2006-10-15 23:10:10 +02:00
Willy Tarreau	bf73613543	[CLEANUP] added the correct cast to call localtime() Calling localtime() with a timeval.tv_sec causes a warning on OpenBSD where the tv_sec is declared long.	2006-10-15 22:54:47 +02:00
Willy Tarreau	fb278677e2	[MEDIUM] use regparm on a few tv_* functions Some of the tv_* functions are called very often. Passing their arguments in registers is noticeably faster. This can be disabled by setting CONFIG_HAP_DISABLE_REGPARM.	2006-10-15 15:38:50 +02:00
Willy Tarreau	2b35c95d6c	[MEDIUM] remove useless calls to gettimeofday() send_log(), Alert() and Warning() used gettimeofday() while using <now> should have been preferred.	2006-10-15 15:25:48 +02:00
Willy Tarreau	b17916e89b	[CLEANUP] add a few "const char *" where appropriate As suggested by Markus Elfring, a few "const char *" have replaced some "char *" declarations where a function is not expected to modify a value. It does not change the code but it helps detecting coding errors.	2006-10-15 15:17:57 +02:00
Willy Tarreau	c642348ce4	[CLEANUP] add a few checks for functions return values Markus Elfring suggested adding a few checks which were missing after a bunch of getsockopt() and 2 strdup(). While those are unlikely to fail where they are used, it makes the code cleaner.	2006-10-15 14:59:03 +02:00
Willy Tarreau	2a429503e0	[MINOR] turn every FD_* into functions On recent CPUs, functions are about twice as fast as inline FD_*, so there is now a #define CONFIG_HAP_INLINE_FD_SET to choose between the two modes.	2006-10-15 14:53:07 +02:00
Willy Tarreau	0bbc3cf157	[MEDIUM] fix broken redispatch option Since the connection queueing was introduced, the "redispatch" option could not cover the cases where a connection has been refused by the server after having been marked "in progress". The fix consists in doing a redispatch in the delayed connection handling code. Problem reported by Konrad Rzentarzewski.	2006-10-15 14:26:02 +02:00
Willy Tarreau	08fa2e37fd	[MINOR] tarpit: close the connection if the client closes. There's no point at maintaining an open tarpitted connection if the client has left.	2006-09-03 10:47:37 +02:00
Willy Tarreau	b8750a82a2	[MEDIUM] added the "reqtarpit" and "reqitarpit" features It is now possible to tarpit connections based on regex matches. The tarpit timeout is equal to the contimeout. A 500 server error response is faked, and the logs show the status flags as "PT" which indicate the connection has been tarpitted.	2006-09-03 09:56:00 +02:00
Willy Tarreau	f8306d5391	[MEDIUM] got rid of event_{cli,srv}_write() in favor of stream_sock_write() The timeouts, expiration timers and results are now stored in the buffers. The timers will have to change a bit to become more flexible, and when the I/O completion functions will be written, the connect_complete() will have to be extracted from the write() function.	2006-07-29 19:01:31 +02:00
Willy Tarreau	d797128d6e	[MEDIUM] got rid of event_{cli,srv}_read() in favor of stream_sock_read()	2006-07-29 18:36:34 +02:00
Willy Tarreau	0f9f5056f9	[MEDIUM] removed all res_* and RES_* The read-, write-, end- and error- status are now stored in the buffer.	2006-07-29 17:39:25 +02:00
Willy Tarreau	5446940e37	[MEDIUM] started the changes towards I/O completion callbacks Now the event_* functions find their buffer in the fdtab itself.	2006-07-29 16:59:06 +02:00
Willy Tarreau	1c47f85292	[MEDIUM] implemented the 'monitor-uri' keyword. It is used to test haproxy's status with an HTTP request to which it will reply with HTTP/1.0 200 OK.	2006-07-09 17:01:40 +02:00
Willy Tarreau	f3c692090e	[MEDIUM] implement 'option ssl-hello-chk' to use CLIENT HELLO health checks. This makes it possible to relay SSL connections in pure TCP instances while ensuring the remote end really receives our data even though intermediate agents (firewalls, proxies, ...) might acknowledge the connection.	2006-07-09 16:42:34 +02:00
Willy Tarreau	2738a14941	[MEDIUM] now upon startup, haproxy will warn about missing timeouts. Too many problem reports were caused by missing timeouts. While there has never been any default value since version 1.0, having no timeout is abnormal in networked environments, and will lead to various problems such as CLOSE_WAIT sockets accumulating and nasty things like this. For this reason, it's better to annoy the users until they fix their configs than letting them run buggy configurations.	2006-07-09 16:22:41 +02:00
Willy Tarreau	791d66d363	[MINOR] added lots of Content-Type: text/html to HTML responses and stats. This suggestion from Cameron Simpson is perfectly valid and should have been implemented from the beginning.	2006-07-09 16:13:17 +02:00
Willy Tarreau	e3ba5f0aaa	[CLEANUP] included common/version.h everywhere	2006-06-29 18:54:54 +02:00
Willy Tarreau	2dd0d4799e	[CLEANUP] renamed include/haproxy to include/common	2006-06-29 17:53:05 +02:00
Willy Tarreau	baaee00406	[BIGMOVE] exploded the monolithic haproxy.c file into multiple files. The files are now stored under : - include/haproxy for the generic includes - include/types.h for the structures needed within prototypes - include/proto.h for function prototypes and inline functions - src/*.c for the C files Most include files are now covered by LGPL. A last move still needs to be done to put inline functions under GPL and not LGPL. Version has been set to 1.3.0 in the code but some control still needs to be done before releasing.	2006-06-26 02:48:02 +02:00
willy tarreau	1f431b5851	[MEDIUM] the stats dump FSM was buggy and looped on dispatch instances. It has been rewritten and now supports an initialization state. It now also prevents from dumping stopped(disabled) listeners and it is possible to specify a scope with a list of proxies that are allowed to be dumped from the one being configured ('.' meaning "this one"). The 'stats' entry can be configured from the 'defaults' instance and it is correctly flushed from proxies which redefine it.	2006-05-21 14:46:15 +02:00
willy tarreau	9e1388671a	[MEDIUM] added the new 'stats' keyword with user authentication subsystem. Right now it only validates the user/passwd according to a specified list, and lets the user pass through the proxy if the authentication is OK, and it refuses any invalid access with a 401 Unauthorized response.	2006-05-14 23:06:28 +02:00
willy tarreau	598da41537	* released 1.2.5-pre1 * build fixes for appsession * documentation for appsession	2005-12-18 01:07:29 +01:00
willy tarreau	12350155a4	* released 1.2.4 * merged Alexander Lazic's and Klaus Wagner's work on application cookie-based persistence. Since this is the first merge, this version is not intended for general use and reports are more than welcome. Some documentation is really needed though.	2005-12-18 01:03:27 +01:00
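
The sizing rule quoted in commit e5f20dcea8 (4 bytes of index per header, at most (BUFSIZE+79)/80 headers) can be checked with a few lines of C. This is only a sketch of the arithmetic described in that commit message: the function and macro names below are hypothetical and are not taken from the haproxy sources.

```c
#include <assert.h>
#include <stddef.h>

/* Index cost per header, as stated in the commit message. */
#define HDR_IDX_ENTRY_BYTES 4

/* Hypothetical helper: maximum number of indexed headers for a given
 * buffer size, using the (BUFSIZE+79)/80 rule from the commit message,
 * i.e. one header per 80 bytes of buffer, rounded up. */
size_t max_hdr_count(size_t bufsize)
{
    assert(bufsize > 0);
    return (bufsize + 79) / 80;
}

/* Hypothetical helper: bytes dedicated to the header index. */
size_t hdr_idx_bytes(size_t bufsize)
{
    return max_hdr_count(bufsize) * HDR_IDX_ENTRY_BYTES;
}
```

With an 8000-byte buffer this gives exactly 100 headers and 400 bytes of index, which matches the round figures in the commit message; with 8192 bytes it gives 103 headers and 412 bytes. Dividing 400 bytes by two 8 kB buffers (16384 bytes) yields roughly 2.4%, the overhead quoted in the message.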

16712 Commits