haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 08:07:10 +02:00

Author	SHA1	Message	Date
Willy Tarreau	c38f71cfcd	MINOR: session: introduce session_new() This one creates a new session and does the minimum initialization.	2015-04-06 11:37:33 +02:00
Willy Tarreau	9903f0e1a2	REORG: session: move the session parts out of stream.c This concerns everythins related to accepting a new session and expiring the embryonic session. There's still a hard-coded call to stream_accept_session() which could be set somewhere in the frontend, but for now it's not a problem.	2015-04-06 11:37:32 +02:00
Willy Tarreau	bb2ef12a60	MEDIUM: session: update the session's stick counters upon session_free() Whenever session_free() is called, any possible stick counter stored in the session will be synchronized.	2015-04-06 11:37:31 +02:00
Willy Tarreau	11c3624c32	MINOR: session: implement session_free() and use it everywhere We want to call this one everywhere we have to kill a session so that future parts we move to the session can be released from there.	2015-04-06 11:37:30 +02:00
Willy Tarreau	b1ec8c4a59	MINOR: session: start to reintroduce struct session There is now a pointer to the session in the stream, which is NULL for now. The session pool is created as well. Some parts will move from the stream to the session now.	2015-04-06 11:23:57 +02:00
Willy Tarreau	87b09668be	REORG/MAJOR: session: rename the "session" entity to "stream" With HTTP/2, we'll have to support multiplexed streams. A stream is in fact the largest part of what we currently call a session, it has buffers, logs, etc. In order to catch any error, this commit removes any reference to the struct session and tries to rename most "session" occurrences in function names to "stream" and "sess" to "strm" when that's related to a session. The files stream.{c,h} were added and session.{c,h} removed. The session will be reintroduced later and a few parts of the stream will progressively be moved overthere. It will more or less contain only what we need in an embryonic session. Sample fetch functions and converters will have to change a bit so that they'll use an L5 (session) instead of what's currently called "L4" which is in fact L6 for now. Once all changes are completed, we should see approximately this : L7 - http_txn L6 - stream L5 - session L4 - connection \| applet There will be at most one http_txn per stream, and a same session will possibly be referenced by multiple streams. A connection will point to a session and to a stream. The session will hold all the information we need to keep even when we don't yet have a stream. Some more cleanup is needed because some code was already far from being clean. The server queue management still refers to sessions at many places while comments talk about connections. This will have to be cleaned up once we have a server-side connection pool manager. Stream flags "SN_*" still need to be renamed, it doesn't seem like any of them will need to move to the session.	2015-04-06 11:23:56 +02:00
Willy Tarreau	10b688f2b4	MEDIUM: listener: store the default target per listener This will be useful later to state that some listeners have to use certain decoders (typically an HTTP/2 decoder) regardless of the regular processing applied to other listeners. For now it simply defaults to the frontend's default target, and it is used by the session.	2015-03-13 16:45:37 +01:00
Willy Tarreau	f87ab94e3b	MINOR: proxy: store the default target into the frontend's configuration Some services such as peers and CLI pre-set the target applet immediately during accept(), and for this reason they're forced to have a dedicated accept() function which does not even properly follow everything the regular one does (eg: sndbuf/rcvbuf/linger/nodelay are not set, etc). Let's store the default target when known into the frontend's config so that it's session_accept() which automatically sets it.	2015-03-13 16:23:00 +01:00
Willy Tarreau	78955f4c8b	MEDIUM: session: simplify receive buffer allocator to only use the channel Now that we can get the session from the channel, let's simplify the prototype of session_alloc_recv_buffer() to only require the channel. Both the caller and the function are now simplified.	2015-03-11 20:41:47 +01:00
Willy Tarreau	103197d597	CLEANUP: session: don't use si_{ic,oc} when we know the session. During the connection establishment, we needlessly rely on pointer dereferences.	2015-03-11 20:41:47 +01:00
Willy Tarreau	7b8c4f9661	CLEANUP: session: don't needlessly pass a pointer to the stream-int All functions dealing with connection establishment currently use a pointer to the stream interface. Now we know it cannot change and is always s->si[1].	2015-03-11 20:41:47 +01:00
Willy Tarreau	8f128b41ec	CLEANUP: session: use local variables to access channels / stream ints In process_session, we had around 300 accesses to channels and stream-ints from the session. Not only this inflates the code due to the large offsets from the original pointer, but readability can be improved. Let's have 4 local variables for the channels and stream-ints.	2015-03-11 20:41:47 +01:00
Willy Tarreau	350f487300	CLEANUP: session: simplify references to chn_{prod,cons}(&s->{req,res}) These 4 combinations are needlessly complicated since the session already has direct access to the associated stream interfaces without having to check an indirect pointer.	2015-03-11 20:41:47 +01:00
Willy Tarreau	81cd90069a	MEDIUM: channel: remove now unused ->prod and ->cons pointers Nothing uses them anymore.	2015-03-11 20:41:47 +01:00
Willy Tarreau	ef573c0f22	MEDIUM: channel: add a new flag "CF_ISRESP" for the response channel This flag designates the response channel. This will be used to know what channel we're seeing and finding our way back to the session.	2015-03-11 20:41:47 +01:00
Willy Tarreau	73796535a9	REORG/MEDIUM: channel: only use chn_prod / chn_cons to find stream-interfaces The purpose of these two macros will be to pass via the session to find the relevant stream interfaces so that we don't need to store the ->cons nor ->prod pointers anymore. Currently they're only defined so that all references could be removed. Note that many places need a second pass of clean up so that we don't have any chn_prod(&s->req) anymore and only &s->si[0] instead, and conversely for the 3 other cases.	2015-03-11 20:41:47 +01:00
Willy Tarreau	819d332dfd	MEDIUM: stream-int: remove any reference to the owner si->owner is not used anymore now, so let's remove any reference to it.	2015-03-11 20:41:46 +01:00
Willy Tarreau	07373b8660	MEDIUM: stream-int: use si_task() to retrieve the task from the stream int We go back to the session to get the owner. Here again it's very easy and is just a matter of relative offsets. Since the owner always exists and always points to the session's task, we can remove some unneeded tests.	2015-03-11 20:41:46 +01:00
Willy Tarreau	a2df3fa251	MEDIUM: stream-interface: remove now unused pointers to channels Everyone must now use si_ic() / si_oc() to find the relevant channels, the points have been totally removed.	2015-03-11 20:41:46 +01:00
Willy Tarreau	a5f5d8dc69	MEDIUM: stream-int: add a flag indicating which side the SI is on This new flag "SI_FL_ISBACK" is set only on the back SI and is cleared on the front SI. That way it's possible only by looking at the SI to know what side it is.	2015-03-11 20:41:46 +01:00
Willy Tarreau	2bb4a96f8f	REORG/MEDIUM: stream-int: introduce si_ic/si_oc to access channels We'll soon remove direct references to the channels from the stream interface since everything belongs to the same session, so let's first not dereference si->ib / si->ob anymore and use macros instead.	2015-03-11 20:41:46 +01:00
Willy Tarreau	a27dc19eda	CLEANUP: remove now unused channel pool The channels are now part of the struct session. Their pool is not needed anymore.	2015-03-11 20:41:46 +01:00
Willy Tarreau	22ec1eadd0	REORG/MAJOR: move session's req and resp channels back into the session The channels were pointers to outside structs and this is not needed anymore since the buffers have moved, but this complicates operations. Move them back into the session so that both channels and stream interfaces are always allocated for a session. Some places (some early sample fetch functions) used to validate that a channel was NULL prior to dereferencing it. Now instead we check if chn->buf is NULL and we force it to remain NULL until the channel is initialized.	2015-03-11 20:41:46 +01:00
Thierry FOURNIER	a718b29b6d	MINOR: lua: remove some #define The #define compilation directives are centralized in the hlua include files. This permits to remove ome #ifdef from the haproxy main code.	2015-03-04 17:58:52 +01:00
Thierry FOURNIER	05ac42455f	MEDIUM: lua: Lua initialisation "on demand" Actually, the Lua context is always initilized in each session, even if the session doesn't use Lua. This behavior cause 5% performances loss. This patch initilize the Lua only if it is use by the session. The initialization is now on demand.	2015-02-28 23:12:37 +01:00
Thierry FOURNIER	65f34c6367	MINOR: lua: txn: create class TXN associated with the transaction. This class of functions permit to access to all the functions associated with the transaction like http header, HAProxy internal fetches, etc ... This patch puts the skeleton of this class. The class will be enhanced later.	2015-02-28 23:12:34 +01:00
Thierry FOURNIER	bc4c1ac6ad	MEDIUM: http/tcp: permit to resume http and tcp custom actions Later, the processing of some actions needs to be interrupted and resumed later. This patch permit to resume the actions. The actions that needs to run with the resume mode are not yet avalaible. It will be soon with Lua patches. So the code added by this patch is untestable for the moment. The list of "tcp_exec_req_rules" cannot resme because is called by the unresumable function "accept_session".	2015-02-28 23:12:33 +01:00
Thierry FOURNIER	f41a809dc9	MINOR: sample: add private argument to the struct sample_fetch The add of this private argument is to prepare the integration of the lua fetchs.	2015-02-28 23:12:31 +01:00
Thierry FOURNIER	b83862dd74	MEDIUM: channel: wake up any request analyzer on response activity This behavior is already existing for the "WAIT_HTTP" analyzer, this patch just extends the system to any analyzer that would be waked up on response activity.	2015-02-28 23:12:31 +01:00
Thierry FOURNIER	2e05a8c742	MEDIUM: task: call session analyzers if the task is woken by a message. When a task used to receive a message from another one, its analysers were not called if there was no I/O activity.	2015-02-28 23:12:30 +01:00
Willy Tarreau	a24adf0795	MAJOR: session: only wake up as many sessions as available buffers permit We've already experimented with three wake up algorithms when releasing buffers : the first naive one used to wake up far too many sessions, causing many of them not to get any buffer. The second approach which was still in use prior to this patch consisted in waking up either 1 or 2 sessions depending on the number of FDs we had released. And this was still inaccurate. The third one tried to cover the accuracy issues of the second and took into consideration the number of FDs the sessions would be willing to use, but most of the time we ended up waking up too many of them for nothing, or deadlocking by lack of buffers. This patch completely removes the need to allocate two buffers at once. Instead it splits allocations into critical and non-critical ones and implements a reserve in the pool for this. The deadlock situation happens when all buffers are be allocated for requests pending in a maxconn-limited server queue, because then there's no more way to allocate buffers for responses, and these responses are critical to release the servers's connection in order to release the pending requests. In fact maxconn on a server creates a dependence between sessions and particularly between oldest session's responses and latest session's requests. Thus, it is mandatory to get a free buffer for a response in order to release a server connection which will permit to release a request buffer. Since we definitely have non-symmetrical buffers, we need to implement this logic in the buffer allocation mechanism. What this commit does is implement a reserve of buffers which can only be allocated for responses and that will never be allocated for requests. This is made possible by the requester indicating how much margin it wants to leave after the allocation succeeds. Thus it is a cooperative allocation mechanism : the requester (process_session() in general) prefers not to get a buffer in order to respect other's need for response buffers. The session management code always knows if a buffer will be used for requests or responses, so that is not difficult : - either there's an applet on the initiator side and we really need the request buffer (since currently the applet is called in the context of the session) - or we have a connection and we really need the response buffer (in order to support building and sending an error message back) This reserve ensures that we don't take all allocatable buffers for requests waiting in a queue. The downside is that all the extra buffers are really allocated to ensure they can be allocated. But with small values it is not an issue. With this change, we don't observe any more deadlocks even when running with maxconn 1 on a server under severely constrained memory conditions. The code becomes a bit tricky, it relies on the scheduler's run queue to estimate how many sessions are already expected to run so that it doesn't wake up everyone with too few resources. A better solution would probably consist in having two queues, one for urgent requests and one for normal requests. A failed allocation for a session dealing with an error, a connection event, or the need for a response (or request when there's an applet on the left) would go to the urgent request queue, while other requests would go to the other queue. Urgent requests would be served from 1 entry in the pool, while the regular ones would be served only according to the reserve. Despite not yet having this, it works remarkably well. This mechanism is quite efficient, we don't perform too many wake up calls anymore. For 1 million sessions elapsed during massive memory contention, we observe about 4.5M calls to process_session() compared to 4.0M without memory constraints. Previously we used to observe up to 16M calls, which rougly means 12M failures. During a test run under high memory constraints (limit enforced to 27 MB instead of the 58 MB normally needed), performance used to drop by 53% prior to this patch. Now with this patch instead it increases by about 1.5%. The best effect of this change is that by limiting the memory usage to about 2/3 to 3/4 of what is needed by default, it's possible to increase performance by up to about 18% mainly due to the fact that pools are reused more often and remain hot in the CPU cache (observed on regular HTTP traffic with 20k objects, buffers.limit = maxconn/10, buffers.reserve = limit/2). Below is an example of scenario which used to cause a deadlock previously : - connection is received - two buffers are allocated in process_session() then released - one is allocated when receiving an HTTP request - the second buffer is allocated then released in process_session() for request parsing then connection establishment. - poll() says we can send, so the request buffer is sent and released - process session gets notified that the connection is now established and allocates two buffers then releases them - all other sessions do the same till one cannot get the request buffer without hitting the margin - and now the server responds. stream_interface allocates the response buffer and manages to get it since it's higher priority being for a response. - but process_session() cannot allocate the request buffer anymore => We could end up with all buffers used by responses so that none may be allocated for a request in process_session(). When the applet processing leaves the session context, the test will have to be changed so that we always allocate a response buffer regardless of the left side (eg: H2->H1 gateway). A final improvement would consists in being able to only retry the failed I/O operation without waking up a task, but to date all experiments to achieve this have proven not to be reliable enough.	2014-12-24 23:47:33 +01:00
Willy Tarreau	10fc09e872	MAJOR: session: only allocate buffers when needed A session doesn't need buffers all the time, especially when they're empty. With this patch, we don't allocate buffers anymore when the session is initialized, we only allocate them in two cases : - during process_session() - during I/O operations During process_session(), we try hard to allocate both buffers at once so that we know for sure that a started operation can complete. Indeed, a previous version of this patch used to allocate one buffer at a time, but it can result in a deadlock when all buffers are allocated for requests for example, and there's no buffer left to emit error responses. Here, if any of the buffers cannot be allocated, the whole operation is cancelled and the session is added at the tail of the buffer wait queue. At the end of process_session(), a call to session_release_buffers() is done so that we can offer unused buffers to other sessions waiting for them. For I/O operations, we only need to allocate a buffer on the Rx path. For this, we only allocate a single buffer but ensure that at least two are available to avoid the deadlock situation. In case buffers are not available, SI_FL_WAIT_ROOM is set on the stream interface and the session is queued. Unused buffers resulting either from a successful send() or from an unused read buffer are offered to pending sessions during the ->wake() callback.	2014-12-24 23:47:33 +01:00
Willy Tarreau	bf883e0aa7	MAJOR: session: implement a wait-queue for sessions who need a buffer When a session_alloc_buffers() fails to allocate one or two buffers, it subscribes the session to buffer_wq, and waits for another session to release buffers. It's then removed from the queue and woken up with TASK_WAKE_RES, and can attempt its allocation again. We decide to try to wake as many waiters as we release buffers so that if we release 2 and two waiters need only once, they both have their chance. We must never come to the situation where we don't wake enough tasks up. It's common to release buffers after the completion of an I/O callback, which can happen even if the I/O could not be performed due to half a failure on memory allocation. In this situation, we don't want to move out of the wait queue the session that was just added, otherwise it will never get any buffer. Thus, we only force ourselves out of the queue when freeing the session. Note: at the moment, since session_alloc_buffers() is not used, no task is subscribed to the wait queue.	2014-12-24 23:47:33 +01:00
Willy Tarreau	656859d478	MEDIUM: session: implement a basic atomic buffer allocator This patch introduces session_alloc_recv_buffer(), session_alloc_buffers() and session_release_buffers() whose purpose will be to allocate missing buffers and release unneeded ones around the process_session() and during I/O operations. I/O callbacks only need a single buffer for recv operations and none for send. However we still want to ensure that we don't pick the last buffer. That's what session_alloc_recv_buffer() is for. This allocator is atomic in that it always ensures we can get 2 buffers or fails. Here, if any of the buffers is not ready and cannot be allocated, the operation is cancelled. The purpose is to guarantee that we don't enter into the deadlock where all buffers are allocated by the same size of all sessions. A queue will have to be implemented for failed allocations. For now they're just reported as failures.	2014-12-24 23:47:32 +01:00
Willy Tarreau	909e267be0	MINOR: session: group buffer allocations together We'll soon want to release buffers together upon failure so we need to allocate them after the channels. Let's change this now. There's no impact on the behaviour, only the error path is unrolled slightly differently. The same was done in peers.	2014-12-24 23:47:32 +01:00
Willy Tarreau	7dfca9daec	MINOR: buffer: only use b_free to release buffers We don't call pool_free2(pool2_buffers) anymore, we only call b_free() to do the job. This ensures that we can start to centralize the releasing of buffers.	2014-12-24 23:47:32 +01:00
Willy Tarreau	696a2910a0	MINOR: buffer: move buffer initialization after channel initialization It's not clean to initialize the buffer before the channel since it dereferences one pointer in the channel. Also we'll want to let the channel pre-initialize the buffer, so let's ensure that the channel is always initialized prior to the buffers.	2014-12-24 23:47:32 +01:00
Willy Tarreau	e583ea583a	MEDIUM: buffer: use b_alloc() to allocate and initialize a buffer b_alloc() now allocates a buffer and initializes it to the size specified in the pool minus the size of the struct buffer itself. This ensures that callers do not need to care about buffer details anymore. Also this never applies memory poisonning, which is slow and useless on buffers.	2014-12-24 23:47:32 +01:00
Willy Tarreau	474cf54a97	MINOR: buffer: reset a buffer in b_reset() and not channel_init() We'll soon need to be able to switch buffers without touching the channel, so let's move buffer initialization out of channel_init(). We had the same in compressoin.c.	2014-12-24 23:47:31 +01:00
Willy Tarreau	3b24641745	BUG/MAJOR: sessions: unlink session from list on out of memory Since embryonic sessions were introduced in 1.5-dev12 with commit `2542b53` ("MAJOR: session: introduce embryonic sessions"), a major bug remained present. If haproxy cannot allocate memory during session_complete() (for example, no more buffers), it will not unlink the new session from the sessions list. This will cause memory corruptions if the memory area from the session is reused for anything else, and may also cause bogus output on "show sess" on the CLI. This fix must be backported to 1.5.	2014-11-25 22:09:05 +01:00
KOVACS Krisztian	b3e54fe387	MAJOR: namespace: add Linux network namespace support This patch makes it possible to create binds and servers in separate namespaces. This can be used to proxy between multiple completely independent virtual networks (with possibly overlapping IP addresses) and a non-namespace-aware proxy implementation that supports the proxy protocol (v2). The setup is something like this: net1 on VLAN 1 (namespace 1) -\ net2 on VLAN 2 (namespace 2) -- haproxy ==== proxy (namespace 0) net3 on VLAN 3 (namespace 3) -/ The proxy is configured to make server connections through haproxy and sending the expected source/target addresses to haproxy using the proxy protocol. The network namespace setup on the haproxy node is something like this: = 8< = $ cat setup.sh ip netns add 1 ip link add link eth1 type vlan id 1 ip link set eth1.1 netns 1 ip netns exec 1 ip addr add 192.168.91.2/24 dev eth1.1 ip netns exec 1 ip link set eth1.$id up ... = 8< = = 8< = $ cat haproxy.cfg frontend clients bind 127.0.0.1:50022 namespace 1 transparent default_backend scb backend server mode tcp server server1 192.168.122.4:2222 namespace 2 send-proxy-v2 = 8< = A bind line creates the listener in the specified namespace, and connections originating from that listener also have their network namespace set to that of the listener. A server line either forces the connection to be made in a specified namespace or may use the namespace from the client-side connection if that was set. For more documentation please read the documentation included in the patch itself. Signed-off-by: KOVACS Tamas <ktamas@balabit.com> Signed-off-by: Sarkozi Laszlo <laszlo.sarkozi@balabit.com> Signed-off-by: KOVACS Krisztian <hidden@balabit.com>	2014-11-21 07:51:57 +01:00
Willy Tarreau	3a5e060bf6	MINOR: session: release a few other pools when stopping We currently release all pools when a proxy is stopped, except the connection, pendconn, and pipe pools. Doing so can improve further reduce memory usage of old processes, eventhough the connection struct is quite small, but there are a lot and they can participate to memory fragmentation. The pipe pool is very small and limited, and not exported so it's not done here.	2014-11-13 16:56:12 +01:00
Willy Tarreau	e12704bfc7	MINOR: session: export the function 'smp_fetch_sc_stkctr' This one is sometimes useful outside of this file.	2014-07-15 19:09:56 +02:00
Willy Tarreau	b5975defba	MINOR: stick-table: make stktable_fetch_key() indicate why it failed stktable_fetch_key() does not indicate whether it returns NULL because the input sample was not found or because it's unstable. It causes trouble with track-sc* rules. Just like with sample_fetch_string(), we want it to be able to give more information to the caller about what it found. Thus, now we use the pointer to a sample passed by the caller, and fill it with the information we have about the sample. That way, even if we return NULL, the caller has the ability to check whether a sample was found and if it is still changing or not.	2014-06-25 17:17:53 +02:00
Willy Tarreau	6f0a7bac28	BUG/MAJOR: session: revert all the crappy client-side timeout changes This is the 3rd regression caused by the changes below. The latest to date was reported by Finn Arne Gangstad. If a server responds with no content-length and the client's FIN is never received, either we leak the client-side FD or we spin at 100% CPU if timeout client-fin is set. Enough is enough. The amount of tricks needed to cover these side-effects starts to look like used toilet paper stacked over a chocolate cake. I don't want to eat that cake anymore! All this to avoid reporting a server-side timeout when a client stops uploading data and haproxy expires faster than the server... A lot of "ifs" resulting in a technically valid log that doesn't always please users, and whose alternative causes that many issues for all others users. So let's revert this crap merged since 1.5-dev25 : Revert "CLEANUP: http: don't clear CF_READ_NOEXP twice" This reverts commit `1592d1e72a`. Revert "BUG/MEDIUM: http: clear CF_READ_NOEXP when preparing a new transaction" This reverts commit `77d29029af`. Revert "BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are not called" This reverts commit `0943757a21`. Revert "BUG/MEDIUM: http: disable server-side expiration until client has sent the body" This reverts commit `3bed5e9337`. Revert "BUG/MEDIUM: http: correctly report request body timeouts" This reverts commit `b9edf8fbec`. Revert "BUG/MEDIUM: http/session: disable client-side expiration only after body" This reverts commit `b1982e27aa`. If a cleaner AND SAFER way to do something equivalent in 1.6-dev, we might consider backporting it to 1.5, but given the vicious bugs that have surfaced since, I doubt it will happen any time soon. Fortunately, that crap never made it into 1.4 so no backport is needed.	2014-06-23 15:47:00 +02:00
Willy Tarreau	4bfc580dd3	MEDIUM: session: maintain per-backend and per-server time statistics Using the last rate counters, we now compute the queue, connect, response and total times per server and per backend with a 95% accuracy over the last 1024 samples. The operation is cheap so we don't need to condition it.	2014-06-17 17:15:56 +02:00
Willy Tarreau	33a14e515b	MEDIUM: session: redispatch earlier when possible As discussed with Dmitry Sivachenko, is a server farm has more than one active server, uses a guaranteed non-determinist algorithm (round robin), and a connection was initiated from a non-persistent connection, there's no point insisting to reconnect to the same server after a connect failure, better redispatch upon the very first retry instead of insisting on the same server multiple times.	2014-06-13 17:53:55 +02:00
Willy Tarreau	db6d012270	MEDIUM: session: don't apply the retry delay when redispatching The retry delay is only useful when sticking to a same server. During a redispatch, it's useless and counter-productive if we're sure to switch to another server, which is almost guaranteed when there's more than one server and the balancing algorithm is round robin, so better not pass via the turn-around state in this case. It could be done as well for leastconn, but there's a risk of always killing the delay after the recovery of a server in a farm where it's almost guaranteed to take most incoming traffic. So better only kill the delay when using round robin.	2014-06-13 17:48:45 +02:00
Willy Tarreau	b02906659b	MEDIUM: session: allow shorter retry delay if timeout connect is small As discussed with Dmitry Sivachenko, the default 1-second connect retry delay can be large for situations where the connect timeout is much smaller, because it means that an active connection reject will take more time to be retried than a silent drop, and that does not make sense. This patch changes this so that the retry delay is the minimum of 1 second and the connect timeout. That way people running with sub-second connect timeout will benefit from the shorter reconnect.	2014-06-13 17:04:44 +02:00
Willy Tarreau	892337c8e1	MAJOR: server: use states instead of flags to store the server state Servers used to have 3 flags to store a state, now they have 4 states instead. This avoids lots of confusion for the 4 remaining undefined states. The encoding from the previous to the new states can be represented this way : SRV_STF_RUNNING \| SRV_STF_GOINGDOWN \| \| SRV_STF_WARMINGUP \| \| \| 0 x x SRV_ST_STOPPED 1 0 0 SRV_ST_RUNNING 1 0 1 SRV_ST_STARTING 1 1 x SRV_ST_STOPPING Note that the case where all bits were set used to exist and was randomly dealt with. For example, the task was not stopped, the throttle value was still updated and reported in the stats and in the http_server_state header. It was the same if the server was stopped by the agent or for maintenance. It's worth noting that the internal function names are still quite confusing.	2014-05-22 11:27:00 +02:00
Willy Tarreau	c93cd16b6c	REORG/MEDIUM: server: split server state and flags in two different variables Till now, the server's state and flags were all saved as a single bit field. It causes some difficulties because we'd like to have an enum for the state and separate flags. This commit starts by splitting them in two distinct fields. The first one is srv->state (with its counter-part srv->prev_state) which are now enums, but which still contain bits (SRV_STF_*). The flags now lie in their own field (srv->flags). The function srv_is_usable() was updated to use the enum as input, since it already used to deal only with the state. Note that currently, the maintenance mode is still in the state for simplicity, but it must move as well.	2014-05-22 11:27:00 +02:00
Willy Tarreau	0943757a21	BUG/MEDIUM: session: don't clear CF_READ_NOEXP if analysers are not called As more or less suspected, commit `b1982e2` ("BUG/MEDIUM: http/session: disable client-side expiration only after body") was hazardous. It introduced a regression causing client side timeout to expire during connection retries if it's lower than the time needed to cover the amount of retries, so clients get a 408 when the connection to the server fails to establish fast enough. The reason is that the CF_READ_NOEXP flag is set after the MSG_DONE state is reached, which protects the timeout from being re-armed, then during the retries, process_session() clears the flag without calling the analyser (since there's no activity for it), so the timeouts are rearmed. Ideally, these one-shot flags should be per-analyser, and the analyser which sets them would be responsible for clearing them, or they would automatically be cleared when switching to another analyser. Unfortunately this is not really possible currently. What can be done however is to only clear them in the following situations : - we're going to call analysers - analysers have all been unsubscribed This method seems reliable enough and approaches the ideal case well enough. No backport is needed, this bug was introduced in 1.5-dev25.	2014-05-21 16:58:17 +02:00
Willy Tarreau	05cdd9655d	MEDIUM: session: implement half-closed timeouts (client-fin and server-fin) Long-lived sessions are often subject to half-closed sessions resulting in a lot of sessions appearing in FIN_WAIT state in the system tables, and no way for haproxy to get rid of them. This typically happens because clients suddenly disconnect without sending any packet (eg: FIN or RST was lost in the path), and while the server detects this using an applicative heart beat, haproxy does not close the connection. This patch adds two new timeouts : "timeout client-fin" and "timeout server-fin". The former allows one to override the client-facing timeout when a FIN has been received or sent. The latter does the same for server-facing connections, which is less useful.	2014-05-10 15:14:05 +02:00
Willy Tarreau	b4f98098aa	BUG/MAJOR: session: recover the correct connection pointer in half-initialized sessions John-Paul Bader reported a nasty segv which happens after a few hours when SSL is enabled under a high load. Fortunately he could catch a stack trace, systematically looking like this one : (gdb) bt full level = 6 conn = (struct connection ) 0x0 err_msg = <value optimized out> s = (struct session ) 0x80337f800 conn = <value optimized out> flags = 41997063 new_updt = <value optimized out> old_updt = 1 e = <value optimized out> status = 0 fd = 53999616 nbfd = 279 wait_time = <value optimized out> updt_idx = <value optimized out> en = <value optimized out> eo = <value optimized out> count = 78 sr = <value optimized out> sw = <value optimized out> rn = <value optimized out> wn = <value optimized out> The variable "flags" in conn_fd_handler() holds a copy of connection->flags when entering the function. These flags indicate 41997063 = 0x0280d307 : - {SOCK,DATA,CURR}_RD_ENA=1 => it's a handshake, waiting for reading - {SOCK,DATA,CURR}_WR_ENA=0 => no need for writing - CTRL_READY=1 => FD is still allocated - XPRT_READY=1 => transport layer is initialized - ADDR_FROM_SET=1, ADDR_TO_SET=0 => clearly it's a frontend connection - INIT_DATA=1, WAKE_DATA=1 => processing a handshake (ssl I guess) - {DATA,SOCK}_{RD,WR}_SH=0 => no shutdown - ERROR=0, CONNECTED=0 => handshake not completed yet - WAIT_L4_CONN=0 => normal - WAIT_L6_CONN=1 => waiting for an L6 handshake to complete - SSL_WAIT_HS=1 => the pending handshake is an SSL handshake So this is a handshake is in progress. And the only way to reach line 88 is for the handshake to complete without error. So we know for sure that ssl_sock_handshake() was called and completed the handshake then removed the CO_FL_SSL_WAIT_HS flag from the connection. With these flags, ssl_sock_handshake() does only call SSL_do_handshake() and retruns. So that means that the problem is necessarily in data->init(). The fd is wrong as reported but is simply mis-decoded as it's the lower half of the last function pointer. What happens in practice is that there's an issue with the way we deal with embryonic sessions during their conversion to regular sessions. Since they have no stream interface at the beginning, the pointer to the connection is temporarily stored into s->target. Then during their conversion, the first stream interface is properly initialized and the connection is attached to it, then s->target is set to NULL. The problem is that if anything fails in session_complete(), the session is left in this intermediate state where s->target is NULL, and kill_mini_session() is called afterwards to perform the cleanup. It needs the connection, that it finds in s->target which is NULL, dereferences it and dies. The only reasons for dying here are a problem on the TCP connection when doing the setsockopt(TCP_NODELAY) or a memory allocation issue. This patch implements a solution consisting in restoring s->target in session_complete() on the error path. That way embryonic sessions that were valid before calling it are still valid after. The bug was introduced in 1.5-dev20 by commit `f8a49ea` ("MEDIUM: session: attach incoming connection to target on embryonic sessions"). No backport is needed. Special thanks to John for his numerous tests and traces.	2014-05-08 22:46:32 +02:00
Willy Tarreau	b1982e27aa	BUG/MEDIUM: http/session: disable client-side expiration only after body For a very long time, back in the v1.3 days, we used to rely on a trick to avoid expiring the client side while transferring a payload to the server. The problem was that if a client was able to quickly fill the buffers, and these buffers took some time to reach the server, the client should not expire while not sending anything. In order to cover this situation, the client-side timeout was disabled once the connection to the server was OK, since it implied that we would at least expire on the server if required. But there is a drawback to this : if a client stops uploading data before the end, its timeout is not enforced and we only expire on the server's timeout, so the logs report a 504. Since 1.4, we have message body analysers which ensure that we know whether all the expected data was received or not (HTTP_MSG_DATA or HTTP_MSG_DONE). So we can fix this problem by disabling the client-side or server-side timeout at the end of the transfer for the respective side instead of having it unconditionally in session.c during all the transfer. With this, the logs now report the correct side for the timeout. Note that this patch is not enough, because another issue remains : the HTTP body forwarders do not abort upon timeout, they simply rely on the generic handling from session.c. So for now, the session is still aborted when reaching the server timeout, but the culprit is properly reported. A subsequent patch will address this specific point. This bug was tagged MEDIUM because of the changes performed. The issue it fixes is minor however. After some cooling down, it may be backported to 1.4. It was reported by and discussed with Rachel Chavez and Patrick Hemmer on the mailing list.	2014-05-07 14:21:47 +02:00
Willy Tarreau	644c101e2d	BUG/MAJOR: http: connection setup may stall on balance url_param On the mailing list, seri0528@naver.com reported an issue when using balance url_param or balance uri. The request would sometimes stall forever. Cyril Bont� managed to reproduce it with the configuration below : listen test :80 mode http balance url_param q hash-type consistent server s demo.1wt.eu:80 and found it appeared with this commit : `80a92c0` ("BUG/MEDIUM: http: don't start to forward request data before the connect"). The bug is subtle but real. The problem is that the HTTP request forwarding analyzer refrains from starting to parse the request body when some LB algorithms might need the body contents, in order to preserve the data pointer and avoid moving things around during analysis in case a redispatch is later needed. And in order to detect that the connection establishes, it watches the response channel's CF_READ_ATTACHED flag. The problem is that a request analyzer is not subscribed to a response channel, so it will only see changes when woken for other (generally correlated) reasons, such as the fact that part of the request could be sent. And since the CF_READ_ATTACHED flag is cleared once leaving process_session(), it is important not to miss it. It simply happens that sometimes the server starts to respond in a sequence that validates the connection in the middle of process_session(), that it is detected after the analysers, and that the newly assigned CF_READ_ATTACHED is not used to detect that the request analysers need to be called again, then the flag is lost. The CF_WAKE_WRITE flag doesn't work either because it's cleared upon entry into process_session(), ie if we spend more than one call not connecting. Thus we need a new flag to tell the connection initiator that we are specifically interested in being notified about connection establishment. This new flag is CF_WAKE_CONNECT. It is set by the requester, and is cleared once the connection succeeds, where CF_WAKE_ONCE is set instead, causing the request analysers to be scanned again. For future versions, some better options will have to be considered : - let all analysers subscribe to both request and response events ; - let analysers subscribe to stream interface events (reduces number of useless calls) - change CF_WAKE_WRITE's semantics to persist across calls to process_session(), but that is different from validating a connection establishment (eg: no data sent, or no data to send) The bug was introduced in 1.5-dev23, no backport is needed.	2014-04-30 20:02:02 +02:00
Willy Tarreau	f51658dac4	MEDIUM: config: relax use_backend check to make the condition optional Since it became possible to use log-format expressions in use_backend, having a mandatory condition becomes annoying because configurations are full of "if TRUE". Let's relax the check to accept no condition like many other keywords (eg: redirect).	2014-04-23 01:21:56 +02:00
Willy Tarreau	b9a551e6aa	BUG/MINOR: stats: last session was not always set Cyril Bont� reported that the "lastsess" field of a stats-only backend was never updated. In fact the same is true for any applet and anything not a server. Also, lastsess was not updated for a server reusing its connection for a new request. Since the goal of this field is to report recent activity, it's better to ensure that all accesses are reported. The call has been moved to the code validating the session establishment instead, since everything passes there.	2014-04-23 00:35:17 +02:00
Willy Tarreau	5a8f947f4f	CLEANUP: http: rename http_process_request_body() This function does not process anything, it just waits for the beginning of the request body. Let's rename it http_wait_for_request_body().	2014-04-22 23:15:27 +02:00
Thierry FOURNIER	d988f21589	BUG/MAJOR: session: fix a possible crash with src_tracked Since commit `4d4149c` ("MEDIUM: counters: support passing the counter number as a fetch argument"), the sample fetch sc_tracked(num) became equivalent to sc[0-9]_tracked, by using the same smp_fetch_sc_tracked() function. This was theorically made possible after the series of changes starting with commit `a65536ca` ("MINOR: counters: provide a generic function to retrieve a stkctr for sc* and src."). Unfortunately, while all other functions were changed to use the generic primitive smp_fetch_sc_stkctr(), smp_fetch_sc_tracked() was forgotten and is not able to differentiate between sc_tracked, src_tracked and sc[0-9]_tracked. The resulting mess is that if sc_tracked is used, the counter number is assumed to be 47 because that's what remains after subtracting "0" from char "_". Fix this by simply relying on the generic function as should have been done. The bug was introduced in 1.5-dev20. No backport is needed.	2014-04-15 11:09:49 +02:00
Thierry FOURNIER	74c219dc04	BUG/MEDIUM: stick-table: fix IPv4-to-IPv6 conversion in src_* fetches The function addr_to_stktable_key doesn't consider the expected type of key. If the stick table key is based on IPv6 addresses and the input is IPv4, the returned key is IPv4 adddress and his length is 4 bytes, while is expected 16 bytes key. This patch considers the expected key and try to convert IPv4 to IPv6 and IPv6 to IPv4 according with the expected key. This fixes the bug reported by Apollon Oikonomopoulos. This bug was introduced somewhere in the 1.5-dev process.	2014-04-14 18:22:57 +02:00
Willy Tarreau	6a0b6bd648	BUG/MAJOR: counters: check for null-deref when looking up an alternate table Constructions such as sc0_get_gpc0(foo) allow to look up the same key as the current key but in an alternate table. A check was missing to ensure we already have a key, resulting in a crash if this lookup is performed before the associated track-sc rule. This bug was reported on the mailing list by Neil@iamafreeman and narrowed down further by Lukas Tribus and Thierry Fournier. This bug was introduced in 1.5-dev20 by commit "0f791d4 MEDIUM: counters: support looking up a key in an alternate table".	2014-04-09 13:32:11 +02:00
Bertrand Jacquin	702d44f2ff	MEDIUM: proxy: support use_backend with dynamic names We have a use case where we look up a customer ID in an HTTP header and direct it to the corresponding server. This can easily be done using ACLs and use_backend rules, but the configuration becomes painful to maintain when the number of customers grows to a few tens or even a several hundreds. We realized it would be nice if we could make the use_backend resolve its name at run time instead of config parsing time, and use a similar expression as http-request add-header to decide on the proper backend to use. This permits the use of prefixes or even complex names in backend expressions. If no name matches, then the default backend is used. Doing so allowed us to get rid of all the use_backend rules. Since there are some config checks on the use_backend rules to see if the referenced backend exists, we want to keep them to detect config errors in normal config. So this patch does not modify the default behaviour and proceeds this way : - if the backend name in the use_backend directive parses as a log format rule, it's used as-is and is resolved at run time ; - otherwise it's a static name which must be valid at config time. There was the possibility of doing this with the use-server directive instead of use_backend, but it seems like use_backend is more suited to this task, as it can be used for other purposes. For example, it becomes easy to serve a customer-specific proxy.pac file based on the customer ID by abusing the errorfile primitive : use_backend bk_cust_%[hdr(X-Cust-Id)] if { hdr(X-Cust-Id) -m found } default_backend bk_err_404 backend bk_cust_1 errorfile 200 /etc/haproxy/static/proxy.pac.cust1 Signed-off-by: Bertrand Jacquin <bjacquin@exosec.fr>	2014-03-31 10:18:30 +02:00
Thierry FOURNIER	a47a94fb13	MINOR: session: don't always assume there's a listener For outgoing connections initiated from an applet, there might not be any listener. It's the case with peers, which resort to a hack consisting in making the session's listener point to the peer. This listener is only used for statistics now so it's much easier to check for its presence now.	2014-03-28 13:16:32 +01:00
Willy Tarreau	7519560767	MINOR: http: release compression context only in http_end_txn() Currently there are two places where the compression context is released, one in session_free() and another one in http_end_txn_clean_session(). Both of them call http_end_txn(), either directly or via http_reset_txn(), and this function is made for this exact purpose. So let's centralize the call there instead.	2014-03-14 19:26:20 +01:00
Bhaskar Maddala	a20cb85eba	MINOR: stats: Enhancement to stats page to provide information of last session time. Summary: Track and report last session time on the stats page for each server in every backend, as well as the backend. This attempts to address the requirement in the ROADMAP - add a last activity date for each server (req/resp) that will be displayed in the stats. It will be useful with soft stop. The stats page reports this as time elapsed since last session. This change does not adequately address the requirement for long running session (websocket, RDP... etc).	2014-02-08 01:19:58 +01:00
Willy Tarreau	a23ee3a2ea	MINOR: session: clean up the connection free code Use conn_free() instead of pool_free2(conn...). This makes the code more auditable.	2014-02-05 00:18:47 +01:00
Willy Tarreau	818dca5098	BUG/MEDIUM: listener: improve detection of non-working accept4() On ARM, glibc does not implement accept4() and simply returns ENOSYS which was not caught as a reason to fall back to accept(), resulting in a spinning process since poll() would call again. Let's change the error detection mechanism to save the broken status of the syscall into a local variable that is used to fall back to the legacy accept(). In addition to this, since the code was becoming a bit messy, the accept4() was removed, so now the fallback code and the legacy code are the same. This will also increase bug report accuracy if needed. This is 1.5-specific, no backport is needed.	2014-01-31 19:40:19 +01:00
Willy Tarreau	cc08d2c9ff	MEDIUM: counters: stop relying on session flags at all Till now, we had one flag per stick counter to indicate if it was tracked in a backend or in a frontend. We just had to add another flag per stick-counter to indicate if it relies on contents or just connection. These flags are quite painful to maintain and tend to easily conflict with other flags if their number is changed. The correct solution consists in moving the flags to the stkctr struct itself, but currently this struct is made of 2 pointers, so adding a new entry there to store only two bits will cause at least 16 more bytes to be eaten per counter due to alignment issues, and we definitely don't want to waste tens to hundreds of bytes per session just for things that most users don't use. Since we only need to store two bits per counter, an intermediate solution consists in replacing the entry pointer with a composite value made of the original entry pointer and the two flags in the 2 unused lower bits. If later a need for other flags arises, we'll have to store them in the struct. A few inline functions have been added to abstract the retrieval and assignment of the pointers and flags, resulting in very few changes. That way there is no more dependence on the number of stick-counters and their position in the session flags.	2014-01-28 23:34:45 +01:00
Willy Tarreau	e9101695ef	BUG/MEDIUM: counters: fix stick-table entry leak when using track-sc2 in connection In 1.5-dev19, commit `e25c917` ("MEDIUM: counters: add support for tracking a third counter") introduced the third track counter. However, there was a hard-coded test in the accept() error path to release only sc0 and sc1. So it seems that if tracking sc2 at the connection level and deciding to reject once the track-sc2 has been done, there could be some leaking of stick-table entries which remain marked used forever, thus which can never be purged nor expired. There's no memory leak though, it's just that entries are unexpirable forever. The simple solution consists in removing the test and always calling the inline function which iterates over all entries.	2014-01-28 23:32:50 +01:00
Willy Tarreau	1f0da2485e	BUG/MEDIUM: unique_id: HTTP request counter is not stable Patrick Hemmer reported that using unique_id_format and logs did not report the same unique ID counter since commit `9f09521` ("BUG/MEDIUM: unique_id: HTTP request counter must be unique!"). This is because the increment was done while producing the log message, so it was performed twice. A better solution consists in fetching a new value once per request and saving it in the request or session context for all of this request's life. It happens that sessions already have a unique ID field which is used for debugging and reporting errors, and which differs from the one sent in logs and unique_id header. So let's change this to reuse this field to have coherent IDs everywhere. As of now, a session gets a new unique ID once it is instanciated. This means that TCP sessions will also benefit from a unique ID that can be logged. And this ID is renewed for each extra HTTP request received on an existing session. Thus, all TCP sessions and HTTP requests will have distinct IDs that will be stable along all their life, and coherent between all places where they're used (logs, unique_id header, "show sess", "show errors"). This feature is 1.5-specific, no backport to 1.4 is needed.	2014-01-25 11:07:06 +01:00
Willy Tarreau	2b028dd828	OPTIM: session: put unlikely() around the freewheeling code The code which enables tunnel mode or TCP transfers is rarely used and at most once per session. Putting it in an unlikely() clause reduces the length of the hot path of process_session() which is already quite long, and also slightly reduces its overall size. Some measurements show a steady gain of about 0.2% thanks to this.	2013-12-31 23:56:46 +01:00
Willy Tarreau	9e5a3aacf4	MEDIUM: stream-int: make si_connect() return an established state when possible si_connect() used to only return SI_ST_CON. But it already detect the connection reuse and is the function which avoids calling connect(). So it already knows the connection is valid and reuse. Thus we make it return SI_ST_EST when a connection is reused. This means that connect_server() can return this state and sess_update_stream_int() as well. Thanks to this change, we don't need to leave process_session() in SI_ST_CON state to immediately enter it again to switch to SI_ST_EST. Implementing this removes one call to process_session() per request in keep-alive mode. We're now at 2 calls per request, which is the minimum (one for the request and another one for the response). The number of calls to http_wait_for_response() has also dropped from 2 to one. Tests indicate a performance gain of about 2.6% in request rate in keep-alive mode. There should be no gain in http-server-close() since we don't use this faster path.	2013-12-31 23:32:12 +01:00
Willy Tarreau	b44c873d61	MEDIUM: session: prepare to support earlier transitions to the established state At the moment it is possible in sess_prepare_conn_req() to switch to the established state when the target is an applet. But sess_update_stream_int() will soon also have the ability to set the established state via connect_server() when a connection is reused, leading to a synchronous connect. So prepare the code to handle this SI_ST_ASS -> SI_ST_EST transition, which really matches what's done in the lower layers.	2013-12-31 23:16:50 +01:00
Willy Tarreau	0e37f1c40e	MINOR: session: factor out the connect time measurement Currently there are 3 places in the code where t_connect is set after switching to state SI_ST_EST, and a fourth one will soon come. Since all these places lead to an immediate call to sess_establish() to complete the session establishment, better move that measurement there.	2013-12-31 23:06:46 +01:00
Willy Tarreau	d81ca04051	OPTIM: session: set the READ_DONTWAIT flag when connecting As soon as we connect to the server, we want to limit the number of recvfrom() on the response path because most of the time a single call will retrieve enough information. At the moment this is only done in the HTTP response parser, after some reads have already failed, which is too late. We need to do that at the earliest possible instant. It was already done for the request side by frontend_accept() for the first request, and by http_reset_txn() for the next requests. Thanks to this change, there are no more failed recvfrom() calls in keep-alive mode.	2013-12-31 22:39:26 +01:00
Willy Tarreau	d7ad9f5b0d	MAJOR: channel: add a new flag CF_WAKE_WRITE to notify the task of writes Since commit `6b66f3e` ([MAJOR] implement autonomous inter-socket forwarding) introduced in 1.3.16-rc1, we've been relying on a stupid mechanism to wake up the task after a write, which was an exact copy-paste of the reader side. The principle was that if we empty a buffer and there's no forwarding scheduled or if the producer is not in a connected state, then we wake the task up. That does not make any sense. It happens to wake up too late sometimes (eg, when the request analyser waits for some room in the buffer to start to work), and leads to unneeded wakeups in client-side keep-alive, because the task is woken up when the response is sent, while the analysers are simply waiting for a new request. In order to fix this, we introduce a new channel flag : CF_WAKE_WRITE. It is designed so that an analyser can explicitly request being notified when some data were written. It is used only when the HTTP request or response analysers need to wait for more room in the buffers. It is automatically cleared upon wake up. The flag is also automatically set by the functions which try to write into a buffer from an applet when they fail (bi_putblk() etc...). That allows us to remove the stupid condition above and avoid some wakeups. In http-server-close and in http-keep-alive modes, this reduces from 4 to 3 the average number of wakeups per request, and increases the overall performance by about 1.5%.	2013-12-31 18:37:36 +01:00
Willy Tarreau	068621e4ad	MINOR: http: try to stick to same server after status 401/407 In HTTP keep-alive mode, if we receive a 401, we still have a chance of being able to send the visitor again to the same server over the same connection. This is required by some broken protocols such as NTLM, and anyway whenever there is an opportunity for sending the challenge to the proper place, it's better to do it (at least it helps with debugging).	2013-12-23 15:12:44 +01:00
Willy Tarreau	2cff2f7bb8	MINOR: session: remove debugging code The memset() was put here to corrupt memory for a debugging test, it's not needed anymore and was unfortunately committed. It does not harm anyway, it probably just slightly affects performance.	2013-12-16 10:12:54 +01:00
Willy Tarreau	59e3ff4549	BUG/MAJOR: session: repair tcp-request connection rules Since recent commit `f79c817` (MAJOR: connection: add two new flags to indicate readiness of control/transport) and the surrounding commits, the session initialization has been slightly delayed and the control layer of the connection is not yet initialized when processing the rules. We need to move that minimal initialization a bit above. The bug was introduced with latest changes, no backport is needed.	2013-12-16 02:23:50 +01:00
Willy Tarreau	89efaed6b6	BUILD: definitely silence some stupid GCC warnings It's becoming increasingly difficult to ignore unwanted function returns in debug code with gcc. Now even when you try to work around it, it suggests a way to write your code differently. For example : src/frontend.c:187:65: warning: if statement has empty body [-Wempty-body] if (write(1, trash.str, trash.len) < 0) /* shut gcc warning */; ^ src/frontend.c:187:65: note: put the semicolon on a separate line to silence this warning 1 warning generated. This is totally unacceptable, this code already had to be written this way to shut it up in earlier versions. And now it comments the form ? What's the purpose of the C language if you can't write anymore the code that does what you want ? Emeric proposed to just keep a global variable to drain such useless results so that gcc stops complaining all the time it believes people who write code are monkeys. The solution is acceptable because the useless assignment is done only in debug code so it will not impact performance. This patch implements this, until gcc becomes even "smarter" to detect that we tried to cheat.	2013-12-13 15:21:36 +01:00
Willy Tarreau	6bbb2f68cd	MINOR: session: report lack of resources using the new stream-interface's error code Let's now use SI_ET_CONN_RES to report lack of resources instead of SO_ET_CONN_OTHER with a handcrafted code.	2013-12-09 17:14:23 +01:00
Willy Tarreau	2d400bb931	MINOR: stream_interface: add reporting of ressouce allocation errors SSL and keep-alive will need to be able to fail on allocation errors, and the stream interface did not allow to report such a cause. The flag will then be "RC" as already documented.	2013-12-09 17:12:18 +01:00
Willy Tarreau	4384ddfc84	MEDIUM: session: automatically register the applet designated by the target Some applet users don't need to initialize their applet, they just want to route the traffic there just as if it were a server. Since applets are now connected to from session.c, let's simply ensure that when connecting, the applet in si->end matches the target, and allocate one there if it's not already done. In case of error, we force the status code to resource and connection so that it's clear that it happens because of a memory shortage.	2013-12-09 15:40:23 +01:00
Willy Tarreau	32e3c6a607	MAJOR: stream interface: dynamically allocate the outgoing connection The outgoing connection is now allocated dynamically upon the first attempt to touch the connection's source or destination address. If this allocation fails, we fail on SN_ERR_RESOURCE. As we didn't use si->conn anymore, it was removed. The endpoints are released upon session_free(), on the error path, and upon a new transaction. That way we are able to carry the existing server's address across retries. The stream interfaces are not initialized anymore before session_complete(), so we could even think about allocating them dynamically as well, though that would not provide much savings. The session initialization now makes use of conn_new()/conn_free(). This slightly simplifies the code and makes it more logical. The connection initialization code is now shorter by about 120 bytes because it's done at once, allowing the compiler to remove all redundant initializations. The si_attach_applet() function now takes care of first detaching the existing endpoint, and it is called from stream_int_register_handler(), so we can safely remove the calls to si_release_endpoint() in the application code around this call. A call to si_detach() was made upon stream_int_unregister_handler() to ensure we always free the allocated connection if one was allocated in parallel to setting an applet (eg: detect HTTP proxy while proceeding with stats maybe).	2013-12-09 15:40:23 +01:00
Willy Tarreau	2a6e8802c0	MEDIUM: stream-interface: introduce si_attach_conn to replace si_prepare_conn si_prepare_conn() is not appropriate in our case as it both initializes and attaches the connection to the stream interface. Due to the asymmetry between accept() and connect(), it causes some fields such as the control and transport layers to be reinitialized. Now that we can separately initialize these fields using conn_prepare(), let's break this function to only attach the connection to the stream interface. Also, by analogy, si_prepare_none() was renamed si_detach(), and si_prepare_applet() was renamed si_attach_applet().	2013-12-09 15:40:23 +01:00
Willy Tarreau	7abddb5c67	MINOR: connection: replace conn_assign with conn_attach We don't want to assign the control nor transport layers anymore at the same time as the data layer, because it prevents one from keeping existing settings when reattaching a connection to an existing stream interface. Let's have conn_attach() replace conn_assign() for this purpose. Thus, conn_prepare() + conn_attach() do exactly the same as the previous conn_assign().	2013-12-09 15:40:23 +01:00
Willy Tarreau	910c6aa5b7	MINOR: connection: reintroduce conn_prepare to set the protocol and transport Now that we can assign conn->xprt regardless of the initialization state, we can reintroduce conn_prepare() to set only the protocol, the transport layer and initialize the transport layer's state.	2013-12-09 15:40:23 +01:00
Willy Tarreau	3ed35ef05b	MINOR: stream-interface: introduce si_reset() and si_set_state() The first function is used to (re)initialize a stream interface and the second to force it into a known state. These are intended for cleaning up the stream interface initialization code in session.c and peers.c and avoiding future issues with missing initializations.	2013-12-09 15:40:23 +01:00
Willy Tarreau	f79c8171b2	MAJOR: connection: add two new flags to indicate readiness of control/transport Currently the control and transport layers of a connection are supposed to be initialized when their respective pointers are not NULL. This will not work anymore when we plan to reuse connections, because there is an asymmetry between the accept() side and the connect() side : - on accept() side, the fd is set first, then the ctrl layer then the transport layer ; upon error, they must be undone in the reverse order, then the FD must be closed. The FD must not be deleted if the control layer was not yet initialized ; - on the connect() side, the fd is set last and there is no reliable way to know if it has been initialized or not. In practice it's initialized to -1 first but this is hackish and supposes that local FDs only will be used forever. Also, there are even less solutions for keeping trace of the transport layer's state. Also it is possible to support delayed close() when something (eg: logs) tracks some information requiring the transport and/or control layers, making it even more difficult to clean them. So the proposed solution is to add two flags to the connection : - CO_FL_CTRL_READY is set when the control layer is initialized (fd_insert) and cleared after it's released (fd_delete). - CO_FL_XPRT_READY is set when the control layer is initialized (xprt->init) and cleared after it's released (xprt->close). The functions have been adapted to rely on this and not on the pointers anymore. conn_xprt_close() was unused and dangerous : it did not close the control layer (eg: the socket itself) but still marks the transport layer as closed, preventing any future call to conn_full_close() from finishing the job. The problem comes from conn_full_close() in fact. It needs to close the xprt and ctrl layers independantly. After that we're still having an issue : we don't know based on ->ctrl alone whether the fd was registered or not. For this we use the two new flags CO_FL_XPRT_READY and CO_FL_CTRL_READY. We now rely on this and not on conn->xprt nor conn->ctrl anymore to decide what remains to be done on the connection. In order not to miss some flag assignments, we introduce conn_ctrl_init() to initialize the control layer, register the fd using fd_insert() and set the flag, and conn_ctrl_close() which unregisters the fd and removes the flag, but only if the transport layer was closed. Similarly, at the transport layer, conn_xprt_init() calls ->init and sets the flag, while conn_xprt_close() checks the flag, calls ->close and clears the flag, regardless xprt_ctx or xprt_st. This also ensures that the ->init and the ->close functions are called only once each and in the correct order. Note that conn_xprt_close() does nothing if the transport layer is still tracked. conn_full_close() now simply calls conn_xprt_close() then conn_full_close() in turn, which do nothing if CO_FL_XPRT_TRACKED is set. In order to handle the error path, we also provide conn_force_close() which ignores CO_FL_XPRT_TRACKED and closes the transport and the control layers in turns. All relevant instances of fd_delete() have been replaced with conn_force_close(). Now we always know what state the connection is in and we can expect to split its initialization.	2013-12-09 15:40:23 +01:00
Willy Tarreau	c10aec299f	MINOR: get rid of si_takeover_conn() Since last commit, this function is an exact copy of si_prepare_conn().	2013-12-09 15:40:23 +01:00
Willy Tarreau	37213433a8	MEDIUM: connection: replace conn_prepare with conn_assign Everywhere conn_prepare() is used, the call to conn_init() has already been done. We can now safely replace all instances of conn_prepare() with conn_assign() which does not reset the transport layer, and remove conn_prepare().	2013-12-09 15:40:23 +01:00
Willy Tarreau	f826c2205b	MINOR: session: use conn_init() to initialize the connections Proceeding like this is safer to ensure we don't forget a field.	2013-12-09 15:40:23 +01:00
Willy Tarreau	f8a49eab4f	MEDIUM: session: attach incoming connection to target on embryonic sessions In order to reduce the dependency over stream-interfaces, we now attach the incoming connection to the embryonic session's target instead of the stream-interface's connection. This means we won't need to initialize stream interfaces anymore after we implement dynamic connection allocation. The session's target is reset to NULL after the session has been converted to a complete session.	2013-12-09 15:40:22 +01:00
Willy Tarreau	b363a1f469	MAJOR: stream-int: stop using si->conn and use si->end instead The connection will only remain there as a pre-allocated entity whose goal is to be placed in ->end when establishing an outgoing connection. All connection initialization can be made on this connection, but all information retrieved should be applied to the end point only. This change is huge because there were many users of si->conn. Now the only users are those who initialize the new connection. The difficulty appears in a few places such as backend.c, proto_http.c, peers.c where si->conn is used to hold the connection's target address before assigning the connection to the stream interface. This is why we have to keep si->conn for now. A future improvement might consist in dynamically allocating the connection when it is needed.	2013-12-09 15:40:22 +01:00
Willy Tarreau	691b1f429e	CLEANUP: stream-int: remove obsolete si_ctrl function This function makes no sense anymore and will cause trouble to convert the remains of connection/applet to end points. Let's replace it now with its contents.	2013-12-09 15:40:22 +01:00
Willy Tarreau	cf644ed37a	MEDIUM: stream-int: make ->end point to the connection or the appctx The long-term goal is to have a context for applets as an alternative to the connection and not as a complement. At the moment, the context is still stored into the stream interface, and we only put a pointer to the applet's context in si->end, initialize the context with object type OBJ_TYPE_APPCTX, and this allows us not to allocate an entry when deciding to switch to an applet. A special care is taken to never dereference si->conn anymore when dealing with an applet. That's why it's important that si->end is always set to the proper type : si->end == NULL => not connected to anything si->end == OBJ_TYPE_APPCTX => connected to an applet si->end == OBJ_TYPE_CONN => real connection (server, proxy, ...) The session management code used to check the applet from the connection's target. Now it uses the stream interface's end point and does not touch the connection at all. Similarly, we stop checking the connection's addresses and file descriptors when reporting the applet's status in the stats dump.	2013-12-09 15:40:22 +01:00
Willy Tarreau	4a59f2f954	MAJOR: stream interface: remove the ->release function pointer Since last commit, we now have a pointer to the applet in the applet context. So we don't need the si->release function pointer anymore, it can be extracted from applet->applet.release. At many places, the ->release function was still tested for real connections while it is only limited to applets, so most of them were simply removed. For the remaining valid uses, a new inline function si_applet_release() was added to simplify the check and the call.	2013-12-09 15:40:22 +01:00
Willy Tarreau	372d6708fb	MINOR: stream-int: split si_prepare_embedded into si_prepare_none and si_prepare_applet si_prepare_embedded() was used both to attach an applet and to detach anything from a stream interface. Split it into si_prepare_none() to detach and si_prepare_applet() to attach an applet. si->conn->target is now assigned from within these two functions instead of their respective callers.	2013-12-09 15:40:22 +01:00
Willy Tarreau	1e6902fd6a	MINOR: connection: always initialize conn->objt_type to OBJ_TYPE_CONN We do this everywhere we prepare a connection so that we can safely switch to objt_conn() next.	2013-12-09 15:40:22 +01:00
Willy Tarreau	fac4bd1492	MAJOR: session: pass applet return traffic through the response analysers Now that applets work like real connections, there is no reason for them to evade the response analysers. The stats applet emits valid HTTP responses, it can flow through the HTTP response analyser just fine. This now allows http-response/rsprep/rspadd rules to be applied on top of stats. Cookie insertion does nothing since applets are not servers and thus do not have a cookie. We can imagine compression to be applied later if the stats output is emitted in chunks and in HTTP/1.1. A minor visible effect of this change is that there is no more "-1" in the timers presented in the logs when viewing the stats, all timers are real.	2013-12-09 15:40:22 +01:00
Willy Tarreau	d84fb5e60f	MAJOR: session: check for a connection to an applet in sess_prepare_conn_req() Instead of having applets bypass the whole connection process, we now follow the common path through sess_prepare_conn_req(). It is this function which detects an applet an sets the output state so SI_ST_EST instead of initiating a connection to a server. It is made possible because we now have s->target pointing to the applet.	2013-12-09 15:40:22 +01:00
Willy Tarreau	7584b27956	MEDIUM: session: detect applets from the session by using s->target We used to rely on the stream interface's target to detect an applet from within the session while trying to process the connection request, but this is incorrect, as this target is the one currently connected and not the next one to process. This will make a difference when we later support keep-alive. The only "official" value indicating where we want to connect is the session's target, which can be : - &applet : connect to this applet - NULL : connect using the normal LB algos - anything else : direct connection to some entity Since we're interested in detecting the specific case of applets, it's OK to make use of s->target then. Also, applets are being isolated from connections, and as such there will not be any ->connect method available when an applet is running, so we can get rid of this test as well.	2013-12-09 15:40:22 +01:00
Willy Tarreau	08382955fe	CLEANUP: stream_interface: remove unused field err_loc This field was still fed with a pointer to the server that caught an error but was not used anymore. Let's remove it.	2013-12-09 15:40:21 +01:00
Willy Tarreau	9667a80676	BUG/MEDIUM: stick-tables: complete the latest fix about store-responses The commit `37e340c` (BUG/MEDIUM: stick: completely remove the unused flag from the store entries) was incomplete. We also need to ensure that only the first store-response for a table is applied and that it may coexist with a possible store-request that was already done on this table. This patch with the previous one should be backported to 1.4.	2013-12-09 15:29:25 +01:00
Willy Tarreau	37e340ce4b	BUG/MEDIUM: stick: completely remove the unused flag from the store entries The store[] array in the session holds a flag which probably aimed to differenciate store entries learned from the request from those learned from the response, and allowing responses to overwrite only the request ones (eg: have a server set a response cookie which overwrites the request one). But this flag is set when a response data is stored, and is never cleared. So in practice, haproxy always runs with this flag set, meaning that responses prevent themselves from overriding the request data. It is desirable anyway to keep the ability not to override data, because the override is performed only based on the table and not on the key, so that would mean that it would be impossible to retrieve two different keys to store into a same table. For example, if a client sets a cookie and a server another one, both need to be updated in the table in the proper order. This is especially true when multiple keys may be tracked on each side into the same table (eg: list of IP addresses in a header). So the correct fix which also maintains the current behaviour consists in simply removing this flag and never try to optimize for the overwrite case. This fix also has the benefit of significantly reducing the session size, by 64 bytes due to alignment issues caused by this flag! The bug has been there forever (since 1.4-dev7), so a backport to 1.4 would be appropriate.	2013-12-06 23:14:53 +01:00
Willy Tarreau	38d5892634	OPTIM/MINOR: mark the source address as already known on accept() Commit `986a9d2d12` moved the source address from the stream interface to the session, but it did not set the flag on the connection to report that the source address is known. Thus when logs are enabled, we had a call to getpeername() which is redundant with the result from accept(). This patch simply sets the flag.	2013-11-16 00:17:59 +01:00
Willy Tarreau	05bf5e1c36	BUG/MEDIUM: session: risk of crash on out of memory conditions In session_accept(), if we face a memory allocation error, we try to emit an HTTP 500 error message in HTTP mode. The problem is that we must not use http_error_message() for this since it dereferences the session which can be NULL in this case. We don't need the session to build the error message anyway since this function only uses it to retrieve the backend and frontend to get the most suited error message. Let's pick it ourselves, we're at the beginning of the session, only the frontend is relevant. This bug is 1.5-specific.	2013-10-30 07:59:03 +01:00
Willy Tarreau	0f791d42b6	MEDIUM: counters: support looking up a key in an alternate table sc_* sample fetches now take an optional parameter which allows to look the key in an alternate table. This is convenient to pass multiple information for the same key at once (eg: have multiple gpc0 for the same key, or support being fed complementary information from the CLI). Example : listen front bind :8000 tcp-request content track-sc0 src table local-ip http-response set-header src-id %[sc0_get_gpc0]+%[sc0_get_gpc0(global-ip)] server dummy 127.0.0.1:8001 backend local-ip stick-table size 1k type ip store gpc0 backend global-ip stick-table size 1k type ip store gpc0	2013-08-01 21:17:14 +02:00
Willy Tarreau	4d4149cf3e	MEDIUM: counters: support passing the counter number as a fetch argument One very annoying issue when trying to extend the sticky counters beyond the current 3 counters is that it requires a massive copy-paste of fetch functions (we don't have to copy-paste code anymore), just so that the fetch names exist. So let's have an alternate form like "sc_*(num)" to allow passing the counter number as an argument without having to redefine new fetch names. The MAX_SESS_STKCTR macro defines the number of usable sticky counters, which defaults to 3.	2013-08-01 21:17:14 +02:00
Willy Tarreau	b4c8493a9f	MINOR: session: make the number of stick counter entries more configurable In preparation of more flexibility in the stick counters, make their number configurable. It still defaults to 3 which is the minimum accepted value. Changing the value alone is not sufficient to get more counters, some bitfields still need to be updated and the TCP actions need to be updated as well, but this update tries to be easier, which is nice for experimentation purposes.	2013-08-01 21:17:14 +02:00
Willy Tarreau	563eef4e30	MEDIUM: counters: factor out smp_fetch_sc*_trackers smp_fetch_sc0_trackers, smp_fetch_sc1_trackers and smp_fetch_sc2_trackers were merged into a single function which relies on the fetch name to decide what to return. This is also a bug fix for this feature which has never worked till its bogus introduction by commit "2406db4 MEDIUM: counters: add sc1_trackers/sc2_trackers" (1.5-dev10). Instead of returning the value in the sample, it was returned as the fetch result! There is no need to backport this fix anyway since it's 1.5-specific and nobody uses the feature.	2013-08-01 21:17:14 +02:00
Willy Tarreau	a0b68eddef	MEDIUM: counters: factor out smp_fetch_sc*_bytes_out_rate smp_fetch_sc0_bytes_out_rate, smp_fetch_sc1_bytes_out_rate, smp_fetch_sc2_bytes_out_rate, smp_fetch_src_bytes_out_rate and smp_fetch_bytes_out_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	53aea10fe9	MEDIUM: counters: factor out smp_fetch_sc*_kbytes_out smp_fetch_sc0_kbytes_out, smp_fetch_sc1_kbytes_out, smp_fetch_sc2_kbytes_out, smp_fetch_src_kbytes_out and smp_fetch_kbytes_out were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	613fe99cda	MEDIUM: counters: factor out smp_fetch_sc*_bytes_in_rate smp_fetch_sc0_bytes_in_rate, smp_fetch_sc1_bytes_in_rate, smp_fetch_sc2_bytes_in_rate, smp_fetch_src_bytes_in_rate and smp_fetch_bytes_in_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	5077d4b261	MEDIUM: counters: factor out smp_fetch_sc*_kbytes_in smp_fetch_sc0_kbytes_in, smp_fetch_sc1_kbytes_in, smp_fetch_sc2_kbytes_in, smp_fetch_src_kbytes_in and smp_fetch_kbytes_in were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	9daf262c88	MEDIUM: counters: factor out smp_fetch_sc*_http_err_rate smp_fetch_sc0_http_err_rate, smp_fetch_sc1_http_err_rate, smp_fetch_sc2_http_err_rate, smp_fetch_src_http_err_rate and smp_fetch_http_err_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	30d07c3b8e	MEDIUM: counters: factor out smp_fetch_sc*_http_err_cnt smp_fetch_sc0_http_err_cnt, smp_fetch_sc1_http_err_cnt, smp_fetch_sc2_http_err_cnt, smp_fetch_src_http_err_cnt and smp_fetch_http_err_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	cf47763c92	MEDIUM: counters: factor out smp_fetch_sc*_http_req_rate smp_fetch_sc0_http_req_rate, smp_fetch_sc1_http_req_rate, smp_fetch_sc2_http_req_rate, smp_fetch_src_http_req_rate and smp_fetch_http_req_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	91200da197	MEDIUM: counters: factor out smp_fetch_sc*_http_req_cnt smp_fetch_sc0_http_req_cnt, smp_fetch_sc1_http_req_cnt, smp_fetch_sc2_http_req_cnt, smp_fetch_src_http_req_cnt and smp_fetch_http_req_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	3a96f3f274	MEDIUM: counters: factor out smp_fetch_sc*_sess_rate smp_fetch_sc0_sess_rate, smp_fetch_sc1_sess_rate, smp_fetch_sc2_sess_rate, smp_fetch_src_sess_rate and smp_fetch_sess_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	20843087f5	MEDIUM: counters: factor out smp_fetch_sc*_sess_cnt smp_fetch_sc0_sess_cnt, smp_fetch_sc1_sess_cnt, smp_fetch_sc2_sess_cnt, smp_fetch_src_sess_cnt and smp_fetch_sess_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	f44a553476	MEDIUM: counters: factor out smp_fetch_sc*_conn_cur smp_fetch_sc0_conn_cur, smp_fetch_sc1_conn_cur, smp_fetch_sc2_conn_cur, smp_fetch_src_conn_cur and smp_fetch_conn_cur were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	c8c65700de	MEDIUM: counters: factor out smp_fetch_sc*_conn_rate smp_fetch_sc0_conn_rate, smp_fetch_sc1_conn_rate, smp_fetch_sc2_conn_rate, smp_fetch_src_conn_rate and smp_fetch_conn_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	3b46c5c47d	MEDIUM: counters: factor out smp_fetch_sc*_conn_cnt smp_fetch_sc0_conn_cnt, smp_fetch_sc1_conn_cnt, smp_fetch_sc2_conn_cnt, smp_fetch_src_conn_cnt and smp_fetch_conn_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	b9f441d2c0	MEDIUM: counters: factor out smp_fetch_sc*_clr_gpc0 smp_fetch_sc0_clr_gpc0, smp_fetch_sc1_clr_gpc0, smp_fetch_sc2_clr_gpc0, smp_fetch_src_clr_gpc0 and smp_fetch_clr_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	710d38cea5	MEDIUM: counters: factor out smp_fetch_sc*_inc_gpc0 smp_fetch_sc0_inc_gpc0, smp_fetch_sc1_inc_gpc0, smp_fetch_sc2_inc_gpc0, smp_fetch_src_inc_gpc0 and smp_fetch_inc_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	b5e0af0b6b	MEDIUM: counters: factor out smp_fetch_sc*_gpc0_rate smp_fetch_sc0_gpc0, smp_fetch_sc1_gpc0, smp_fetch_sc2_gpc0, smp_fetch_src_gpc0 and smp_fetch_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	30b2046dfe	MEDIUM: counters: factor out smp_fetch_sc*_get_gpc0 smp_fetch_sc0_get_gpc0, smp_fetch_sc1_get_gpc0, smp_fetch_sc2_get_gpc0, smp_fetch_src_get_gpc0 and smp_fetch_get_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	a65536ca4e	MINOR: counters: provide a generic function to retrieve a stkctr for sc* and src. This function aims at simplifying the prefetching of the table and entry when using any of the session counters fetches. The principle is that the src_* variant produces a stkctr that is used instead of the one from the session. That way we can call the same function from all session counter fetch functions and always have a single function to support sc[0-9]_/src_.	2013-08-01 21:17:13 +02:00
Willy Tarreau	88821241d4	MINOR: counters: factor out smp_fetch_sc*_tracked The new function makes use of the sc# in the keyword to get the counter ID.	2013-08-01 21:17:13 +02:00
Willy Tarreau	ef38c39287	MEDIUM: sample: systematically pass the keyword pointer to the keyword We're having a lot of duplicate code just because of minor variants between fetch functions that could be dealt with if the functions had the pointer to the original keyword, so let's pass it as the last argument. An earlier version used to pass a pointer to the sample_fetch element, but this is not the best solution for two reasons : - fetch functions will solely rely on the keyword string - some other smp_fetch_* users do not have the pointer to the original keyword and were forced to pass NULL. So finally we're passing a pointer to the keyword as a const char *, which perfectly fits the original purpose.	2013-08-01 21:17:13 +02:00
Willy Tarreau	0fc36e3ae9	BUG/MAJOR: http: don't emit the send-name-header when no server is available Lukas Benes reported that http-send-name-header causes a segfault if no server is available because we're dereferencing the session's target which is NULL. The tiniest reproducer looks like this : listen foo bind :1234 mode http http-send-name-header srv This obvious fix must be backported to 1.4 which is affected as well.	2013-07-04 11:44:27 +02:00
Willy Tarreau	7af7d5957d	BUG: counters: third counter was not stored if others unset Commit `e25c917a` introduced a third tracking counter bug forgot to check it when storing values at the end of the session. The impact is that if neither the first nor the second one are changed, none of them are saved.	2013-07-01 18:08:41 +02:00
Willy Tarreau	dc13c11c1e	BUG/MEDIUM: prevent gcc from moving empty keywords lists into BSS Benoit Dolez reported a failure to start haproxy 1.5-dev19. The process would immediately report an internal error with missing fetches from some crap instead of ACL names. The cause is that some versions of gcc seem to trim static structs containing a variable array when moving them to BSS, and only keep the fixed size, which is just a list head for all ACL and sample fetch keywords. This was confirmed at least with gcc 3.4.6. And we can't move these structs to const because they contain a list element which is needed to link all of them together during the parsing. The bug indeed appeared with 1.5-dev19 because it's the first one to have some empty ACL keyword lists. One solution is to impose -fno-zero-initialized-in-bss to everyone but this is not really nice. Another solution consists in ensuring the struct is never empty so that it does not move there. The easy solution consists in having a non-null list head since it's not yet initialized. A new "ILH" list head type was thus created for this purpose : create an Initialized List Head so that gcc cannot move the struct to BSS. This fixes the issue for this version of gcc and does not create any burden for the declarations.	2013-06-21 23:29:02 +02:00
Willy Tarreau	8615c2af67	MEDIUM: session: disable lingering on the server when the client aborts When abortonclose is used and an error is detected on the client side, better force an RST to the server. That way we propagate to the server the same vision we got from the client, and we ensure that we won't keep TIME_WAITs.	2013-06-21 08:20:19 +02:00
Willy Tarreau	be4a3eff34	MEDIUM: counters: use sc0/sc1/sc2 instead of sc1/sc2/sc3 It was a bit inconsistent to have gpc start at 0 and sc start at 1, so make sc start at zero like gpc. No previous release was issued with sc3 anyway, so no existing setup should be affected.	2013-06-17 15:04:07 +02:00
Willy Tarreau	6d4e4e8dd2	MEDIUM: acl: remove a lot of useless ACLs that are equivalent to their fetches The following 116 ACLs were removed because they're redundant with their fetch function since last commit which allows the fetch function to be used instead for types BOOL, INT and IP. Most places are now left with an empty ACL keyword list that was not removed so that it's easier to add other ACLs later. always_false, always_true, avg_queue, be_conn, be_id, be_sess_rate, connslots, nbsrv, queue, srv_conn, srv_id, srv_is_up, srv_sess_rate, res.comp, fe_conn, fe_id, fe_sess_rate, dst_conn, so_id, wait_end, http_auth, http_first_req, status, dst, dst_port, src, src_port, sc1_bytes_in_rate, sc1_bytes_out_rate, sc1_clr_gpc0, sc1_conn_cnt, sc1_conn_cur, sc1_conn_rate, sc1_get_gpc0, sc1_gpc0_rate, sc1_http_err_cnt, sc1_http_err_rate, sc1_http_req_cnt, sc1_http_req_rate, sc1_inc_gpc0, sc1_kbytes_in, sc1_kbytes_out, sc1_sess_cnt, sc1_sess_rate, sc1_tracked, sc1_trackers, sc2_bytes_in_rate, sc2_bytes_out_rate, sc2_clr_gpc0, sc2_conn_cnt, sc2_conn_cur, sc2_conn_rate, sc2_get_gpc0, sc2_gpc0_rate, sc2_http_err_cnt, sc2_http_err_rate, sc2_http_req_cnt, sc2_http_req_rate, sc2_inc_gpc0, sc2_kbytes_in, sc2_kbytes_out, sc2_sess_cnt, sc2_sess_rate, sc2_tracked, sc2_trackers, sc3_bytes_in_rate, sc3_bytes_out_rate, sc3_clr_gpc0, sc3_conn_cnt, sc3_conn_cur, sc3_conn_rate, sc3_get_gpc0, sc3_gpc0_rate, sc3_http_err_cnt, sc3_http_err_rate, sc3_http_req_cnt, sc3_http_req_rate, sc3_inc_gpc0, sc3_kbytes_in, sc3_kbytes_out, sc3_sess_cnt, sc3_sess_rate, sc3_tracked, sc3_trackers, src_bytes_in_rate, src_bytes_out_rate, src_clr_gpc0, src_conn_cnt, src_conn_cur, src_conn_rate, src_get_gpc0, src_gpc0_rate, src_http_err_cnt, src_http_err_rate, src_http_req_cnt, src_http_req_rate, src_inc_gpc0, src_kbytes_in, src_kbytes_out, src_sess_cnt, src_sess_rate, src_updt_conn_cnt, table_avl, table_cnt, ssl_c_ca_err, ssl_c_ca_err_depth, ssl_c_err, ssl_c_used, ssl_c_verify, ssl_c_version, ssl_f_version, ssl_fc, ssl_fc_alg_keysize, ssl_fc_has_crt, ssl_fc_has_sni, ssl_fc_use_keysize,	2013-06-11 21:22:58 +02:00
Willy Tarreau	9a355ec257	MEDIUM: http: add support for action "set-log-level" in http-request/http-response Some users want to disable logging for certain non-important requests such as stats requests or health-checks coming from another equipment. Other users want to log with a higher importance (eg: notice) some special traffic (POST requests, authenticated requests, requests coming from suspicious IPs) or some abnormally large responses. This patch responds to all these needs at once by adding a "set-log-level" action to http-request/http-response. The 8 syslog levels are supported, as well as "silent" to disable logging.	2013-06-11 17:50:26 +02:00
Willy Tarreau	2b57cb8f30	MEDIUM: protocol: implement a "drain" function in protocol layers Since commit `cfd97c6f` was merged into 1.5-dev14 (BUG/MEDIUM: checks: prevent TIME_WAITs from appearing also on timeouts), some valid health checks sometimes used to show some TCP resets. For example, this HTTP health check sent to a local server : 19:55:15.742818 IP 127.0.0.1.16568 > 127.0.0.1.8000: S 3355859679:3355859679(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742841 IP 127.0.0.1.8000 > 127.0.0.1.16568: S 1060952566:1060952566(0) ack 3355859680 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742863 IP 127.0.0.1.16568 > 127.0.0.1.8000: . ack 1 win 257 19:55:15.745402 IP 127.0.0.1.16568 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:15.745488 IP 127.0.0.1.8000 > 127.0.0.1.16568: FP 1:146(145) ack 23 win 257 19:55:15.747109 IP 127.0.0.1.16568 > 127.0.0.1.8000: R 23:23(0) ack 147 win 257 After some discussion with Chris Huang-Leaver, it appeared clear that what we want is to only send the RST when we have no other choice, which means when the server has not closed. So we still keep SYN/SYN-ACK/RST for pure TCP checks, but don't want to see an RST emitted as above when the server has already sent the FIN. The solution against this consists in implementing a "drain" function at the protocol layer, which, when defined, causes as much as possible of the input socket buffer to be flushed to make recv() return zero so that we know that the server's FIN was received and ACKed. On Linux, we can make use of MSG_TRUNC on TCP sockets, which has the benefit of draining everything at once without even copying data. On other platforms, we read up to one buffer of data before the close. If recv() manages to get the final zero, we don't disable lingering. Same for hard errors. Otherwise we do. In practice, on HTTP health checks we generally find that the close was pending and is returned upon first recv() call. The network trace becomes cleaner : 19:55:23.650621 IP 127.0.0.1.16561 > 127.0.0.1.8000: S 3982804816:3982804816(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650644 IP 127.0.0.1.8000 > 127.0.0.1.16561: S 4082139313:4082139313(0) ack 3982804817 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650666 IP 127.0.0.1.16561 > 127.0.0.1.8000: . ack 1 win 257 19:55:23.651615 IP 127.0.0.1.16561 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:23.651696 IP 127.0.0.1.8000 > 127.0.0.1.16561: FP 1:146(145) ack 23 win 257 19:55:23.652628 IP 127.0.0.1.16561 > 127.0.0.1.8000: F 23:23(0) ack 147 win 257 19:55:23.652655 IP 127.0.0.1.8000 > 127.0.0.1.16561: . ack 24 win 257 This change should be backported to 1.4 which is where Chris encountered this issue. The code is different, so probably the tcp_drain() function will have to be put in the checks only.	2013-06-10 20:33:23 +02:00
Willy Tarreau	6f1615f596	MINOR: counters: add fetch/acl sc*_tracked to indicate whether a counter is tracked Sometimes we'd like to know if a counter is being tracked before adding a header to an outgoing request. These ones do that.	2013-06-10 10:30:09 +02:00
Willy Tarreau	ba2ffd18b5	MEDIUM: counters: add a new "gpc0_rate" counter in stick-tables This counter is special in that instead of reporting the gpc0 cumulative count, it returns its increase rate over the configured period.	2013-05-29 15:54:14 +02:00
Willy Tarreau	e25c917af8	MEDIUM: counters: add support for tracking a third counter We're often missin a third counter to track base, src and base+src at the same time. Here we introduce track_sc3 to have this third counter. It would be wise not to add much more counters because that slightly increases the session size and processing time though the real issue is more the declaration of the keywords in the code and in the doc.	2013-05-29 00:37:16 +02:00
Willy Tarreau	d5ca9abb0d	MINOR: counters: make it easier to extend the amount of tracked counters By properly affecting the flags and values, it becomes easier to add more tracked counters, for example for experimentation. It also slightly reduces the code and the number of tests. No counters were added with this patch.	2013-05-28 17:43:40 +02:00
Willy Tarreau	1e5dfdad77	MINOR: session: only call http_send_name_header() when changing the server Till now we used to call the function until the connection established, which means that the header rewriting was performed for nothing upon each even (eg: uploaded contents) until the server responded or timed out. Now we only call the function when we assign the server.	2013-04-11 18:18:01 +02:00
Willy Tarreau	d86e29d2a1	CLEANUP: acl: remove unused references to ACL_USE_* Now that acl->requires is not used anymore, we can remove all references to it as well as all ACL_USE_* flags.	2013-04-03 02:13:00 +02:00
Willy Tarreau	c48c90dfa5	MAJOR: acl: remove the arg_mask from the ACL definition and use the sample fetch's Now that ACLs solely rely on sample fetch functions, make them use the same arg mask. All inconsistencies have been fixed separately prior to this patch, so this patch almost only adds a new pointer indirection and removes all references to ARG*() in the definitions. The parsing is still performed by the ACL code though.	2013-04-03 02:12:58 +02:00
Willy Tarreau	8ed669b12a	MAJOR: acl: make all ACLs reference the fetch function via a sample. ACL fetch functions used to directly reference a fetch function. Now that all ACL fetches have their sample fetches equivalent, we can make ACLs reference a sample fetch keyword instead. In order to simplify the code, a sample keyword name may be NULL if it is the same as the ACL's, which is the most common case. A minor change appeared, http_auth always expects one argument though the ACL allowed it to be missing and reported as such afterwards, so fix the ACL to match this. This is not really a bug.	2013-04-03 02:12:58 +02:00
Willy Tarreau	281c799e25	MINOR: session: rename sample fetch functions and declare the sample keywords The following sample fetch functions were only usable by ACLs but are now usable by sample fetches too : sc1_bytes_in_rate, sc1_bytes_out_rate, sc1_clr_gpc0, sc1_conn_cnt, sc1_conn_cur, sc1_conn_rate, sc1_get_gpc0, sc1_http_err_cnt, sc1_http_err_rate, sc1_http_req_cnt, sc1_http_req_rate, sc1_inc_gpc0, sc1_kbytes_in, sc1_kbytes_out, sc1_sess_cnt, sc1_sess_rate, sc1_trackers, sc2_bytes_in_rate, sc2_bytes_out_rate, sc2_clr_gpc0, sc2_conn_cnt, sc2_conn_cur, sc2_conn_rate, sc2_get_gpc0, sc2_http_err_cnt, sc2_http_err_rate, sc2_http_req_cnt, sc2_http_req_rate, sc2_inc_gpc0, sc2_kbytes_in, sc2_kbytes_out, sc2_sess_cnt, sc2_sess_rate, sc2_trackers, src_bytes_in_rate, src_bytes_out_rate, src_clr_gpc0, src_conn_cnt, src_conn_cur, src_conn_rate, src_get_gpc0, src_http_err_cnt, src_http_err_rate, src_http_req_cnt, src_http_req_rate, src_inc_gpc0, src_kbytes_in, src_kbytes_out, src_sess_cnt, src_sess_rate, src_updt_conn_cnt, table_avl, table_cnt, The fetch functions have been renamed "smp_fetch_*".	2013-04-03 02:12:58 +02:00
Willy Tarreau	a7a7ebc382	BUG/MINOR: http: don't process abortonclose when request was sent option abortonclose may cause a valid connection to be aborted just after the request has been sent. This is because we check for it during the session establishment sequence before checking for write activity. So if the abort and the connect complete at the same time, the abort is still considered. Let's check for an explicity partial write before aborting. This fix should be backported to 1.4 too.	2012-12-30 00:50:35 +01:00
Willy Tarreau	71241abfd3	MINOR: http: move redirect rule processing to its own function We now have http_apply_redirect_rule() which does all the redirect-specific job instead of having this inside http_process_req_common(). Also one of the benefit gained from uniformizing this code is that both keep-alive and close response do emit the PR-- flags. The fix for the flags could probably be backported to 1.4 though it's very minor. The previous function http_perform_redirect() was becoming confusing so it was renamed http_perform_server_redirect() since it only applies to server-based redirection.	2012-12-28 14:47:19 +01:00
Willy Tarreau	d79a3b248e	BUG/MINOR: log: make log-format, unique-id-format and add-header more independant It happens that all of them call parse_logformat_line() which sets proxy->to_log with a number of flags affecting the line format for all three users. For example, having a unique-id specified disables the default log-format since fe->to_log is tested when the session is established. Similarly, having "option logasap" will cause "+" to be inserted in unique-id or headers referencing some of the fields depending on LW_BYTES. This patch first removes most of the dependency on fe->to_log whenever possible. The first possible cleanup is to stop checking fe->to_log for being null, considering that it always contains at least LW_INIT when any such usage is made of the log-format! Also, some checks are wrong. s->logs.logwait cannot be nulled by "logwait &= ~LW_" since LW_INIT is always there. This results in getting the wrong log at the end of a request or session when a unique-id or add-header is set, because logwait is still not null but the log-format is not checked. Further cleanups are required. Most LW_ flags should be removed or at least replaced with what they really mean (eg: depend on client-side connection, depend on server-side connection, etc...) and this should only affect logging, not other mechanisms. This patch fixes the default log-format and tries to limit interferences between the log formats, but does not pretend to do more for the moment, since it's the most visible breakage.	2012-12-28 09:51:00 +01:00
Willy Tarreau	20d46a5a95	CLEANUP: session: use an array for the stick counters The stick counters were in two distinct sets of struct members, causing some code to be duplicated. Now we use an array, which enables some processing to be performed in loops. This allowed the code to be shrunk by 700 bytes.	2012-12-09 15:57:16 +01:00
Willy Tarreau	2406db4b39	MEDIUM: counters: add sc1_trackers/sc2_trackers Returns the current amount of concurrent connections tracking the same tracked counters. This number is automatically incremented when tracking begins and decremented when tracking stops. It differs from sc1_conn_cur in that it does not rely on any stored information but on the table's reference count (the "use" value which is returned by "show table" on the CLI). This may sometimes be more suited for layer7 tracking.	2012-12-09 14:08:47 +01:00
Willy Tarreau	5d5b5d8eaf	MEDIUM: proto_tcp: add support for tracking L7 information Until now it was only possible to use track-sc1/sc2 with "src" which is the IPv4 source address. Now we can use track-sc1/sc2 with any fetch as well as any transformation type. It works just like the "stick" directive. Samples are automatically converted to the correct types for the table. Only "tcp-request content" rules may use L7 information, and such information must already be present when the tracking is set up. For example it becomes possible to track the IP address passed in the X-Forwarded-For header. HTTP request processing now also considers tracking from backend rules because we want to be able to update the counters even when the request was already parsed and tracked. Some more controls need to be performed (eg: samples do not distinguish between L4 and L6).	2012-12-09 14:08:47 +01:00
Willy Tarreau	0ede5a3318	BUG/MEDIUM: session: fix FD leak when transport layer logging is enabled Commit `2b199c9a` attempted to fix all places where the transport layer is improperly closed, but it missed one place in session_free(). If SSL ciphers are logged, the close() is delayed post-log and performed in session_free(). However, conn_xprt_close() only closes the transport layer but not the file descriptor, resulting in a slow FD leak which is hardly noticeable until the process cannot accept any new connection. A workaround consisted in disabling %sslv/%sslc in log-format. So use conn_full_close() instead of conn_xprt_close() to fix this there too. A similar pending issue existed in the close during outgoing connection failure, though on this side, the transport layer is never tracked at the moment.	2012-12-08 08:48:04 +01:00
Willy Tarreau	20879a0233	MEDIUM: connection: add error reporting for the SSL Get a bit more info in the logs when client-side SSL handshakes fail.	2012-12-03 17:21:52 +01:00
Willy Tarreau	8e3bf699db	MEDIUM: connection: add error reporting for the PROXY protocol header When the PROXY protocol header is expected and fails, leading to an abort of the incoming connection, we now emit a log message. If option dontlognull is set and it was just a port probe, then nothing is logged.	2012-12-03 17:21:51 +01:00
Willy Tarreau	0af2912fd1	MEDIUM: connection: add minimal error reporting in logs for incomplete connections Since the introduction of SSL, it became quite annoying not to get any useful info in logs about handshake failures. Let's improve reporting for embryonic sessions by checking a per-connection error code and reporting it into the logs if an error happens before the session is completely instanciated. The "dontlognull" option is supported in that if a connection does not talk before being aborted, nothing will be emitted. At the moment, only timeouts are considered for SSL and the PROXY protocol, but next patches will handle more errors.	2012-12-03 15:38:23 +01:00
Willy Tarreau	14cba4b0b1	MEDIUM: connection: add an error code in connections This will be needed to improve error reporting, especially for SSL.	2012-12-03 14:22:13 +01:00
Willy Tarreau	8139b9959f	MINOR: compression: make the stats a bit more robust To ensure that we only count when a response was compressed, we also check for the SN_COMP_READY flag which indicates that the compression was effectively initialized. Comp_algo alone is meaningless.	2012-11-27 09:34:00 +01:00
Willy Tarreau	5e16cbc3bd	MINOR: stats: report the total number of compressed responses per front/back Depending on the content-types and accept-encoding fields, some responses might or might not be compressed. Let's have a counter of the number of compressed responses and report it in the stats to help improve compression usage. Some cosmetic issues were fixed in the CSV output too (missing commas at the end).	2012-11-24 14:54:13 +01:00
Willy Tarreau	2b199c9ac3	MEDIUM: connection: provide a common conn_full_close() function Several places got the connection close sequence wrong because it was not obvious. In practice we always need the same sequence when aborting, so let's have a common function for this.	2012-11-23 17:32:21 +01:00
Willy Tarreau	543db62e1f	BUG/MEDIUM: compression: release the zlib pools between keep-alive requests There was a possible memory leak in the zlib code when the first response of a keep-alive session was compressed, because the next request would reset the compression algo, preventing a later call to session_free() from releasing it. The reason is that it is necessary to release the assigned resources in http_end_txn_clean_session().	2012-11-15 16:41:22 +01:00
William Lallemand	ec3e3890f0	BUG/MINOR: compression: deinit zlib only when required The zlib stream was deinitialized even when the init failed.	2012-11-15 15:42:17 +01:00
Willy Tarreau	3fdb366885	MAJOR: connection: replace struct target with a pointer to an enum Instead of storing a couple of (int, ptr) in the struct connection and the struct session, we use a different method : we only store a pointer to an integer which is stored inside the target object and which contains a unique type identifier. That way, the pointer allows us to retrieve the object type (by dereferencing it) and the object's address (by computing the displacement in the target structure). The NULL pointer always corresponds to OBJ_TYPE_NONE. This reduces the size of the connection and session structs. It also simplifies target assignment and compare. In order to improve the generated code, we try to put the obj_type element at the beginning of all the structs (listener, server, proxy, si_applet), so that the original and target pointers are always equal. A lot of code was touched by massive replaces, but the changes are not that important.	2012-11-12 00:42:33 +01:00
Willy Tarreau	b31c971bef	CLEANUP: channel: remove any reference of the hijackers Hijackers were functions designed to inject data into channels in the distant past. They became unused around 1.3.16, and since there has not been any user of this mechanism to date, it's uncertain whether the mechanism still works (and it's not really useful anymore). So better remove it as well as the pointer it uses in the channel struct.	2012-11-11 23:05:39 +01:00
Willy Tarreau	7f7ad91056	BUILD: stream_interface: remove si_fd() and its references si_fd() is not used a lot, and breaks builds on OpenBSD 5.2 which defines this name for its own purpose. It's easy enough to remove this one-liner function, so let's do it.	2012-11-11 20:53:29 +01:00
Willy Tarreau	815f5ecffa	BUG/MINOR: session: mark the handshake as complete earlier There is a small waste of CPU cycles when no handshake is required on an accepted connection, because we had to perform one call to conn_fd_handler() to mark the connection CONNECTED and to call process_session() again to say that nothing happened. By marking the connection CONNECTED when there is no pending handshake, we avoid this extra call to process_session().	2012-11-09 22:09:08 +01:00
Willy Tarreau	798f4325fa	OPTIM: session: don't process the whole session when only timers need a refresh Having a global expiration timer for a task means that the tasks are regularly woken up (at least after each expiration timer). It's totally useless and counter productive to process the whole session upon each such wakeup, and it's fairly easy to detect such wakeups, so let's just update the task's timer and return to sleep when this happens. For 100k concurrent connections with 10s of timeouts, this can save 10k wakeups per second, which is not bad.	2012-11-08 16:55:07 +01:00
William Lallemand	1c2d622d82	CLEANUP: use struct comp_ctx instead of union Replace union comp_ctx by struct comp_ctx. Use struct comp_ctx * in the init/add_data/flush/reset/end prototypes of compression.h functions.	2012-11-05 10:23:16 +01:00
Willy Tarreau	e3224e870f	BUG/MINOR: session: ensure that we don't retry connection if some data were sent With extra-large buffers, it is possible that a lot of data are sent upon connection establishment before the session is notified. The issue is how to handle a send() error after some data were actually sent. At the moment, only a connection error is reported, causing a new connection attempt and send() to restart after the last data. We absolutely don't want to retry the connect() if at least one byte was sent, because those data are lost. The solution consists in reporting exactly what happens, which is : - a successful connection attempt - a read/write error on the channel That way we go on with sess_establish(), the response analysers are called and report the appropriate connection state for the error (typically a server abort while waiting for a response). This mechanism also guarantees that we won't retry since it's a success. The logs also report the correct connect time. Note that 1.4 is not directly affected because it only attempts one send(), so it cannot detect a send() failure here and distinguish it form a failed connection attempt. So no backport is needed. Also, this is just a safe belt we're taking, since this issue should not happen anymore since previous commit.	2012-10-29 23:31:04 +01:00
Willy Tarreau	19d14ef104	MEDIUM: make the trash be a chunk instead of a char * The trash is used everywhere to store the results of temporary strings built out of s(n)printf, or as a storage for a chunk when chunks are needed. Using global.tune.bufsize is not the most convenient thing either. So let's replace trash with a chunk and directly use it as such. We can then use trash.size as the natural way to get its size, and get rid of many intermediary chunks that were previously used. The patch is huge because it touches many areas but it makes the code a lot more clear and even outlines places where trash was used without being that obvious.	2012-10-29 16:57:30 +01:00
Willy Tarreau	f2943dccd0	MAJOR: session: detach the connections from the stream interfaces We will need to be able to switch server connections on a session and to keep idle connections. In order to achieve this, the preliminary requirement is that the connections can survive the session and be detached from them. Right now they're still allocated at exactly the same place, so when there is a session, there are always 2 connections. We could soon improve on this by allocating the outgoing connection only during a connect(). This current patch touches a lot of code and intentionally does not change any functionnality. Performance tests show no regression (even a very minor improvement). The doc has not yet been updated.	2012-10-26 20:15:20 +02:00
Willy Tarreau	c919dc66a3	CLEANUP: remove trashlen trashlen is a copy of global.tune.bufsize, so let's stop using it as a duplicate, fall back to the original bufsize, it's less confusing this way.	2012-10-26 20:04:27 +02:00
William Lallemand	82fe75c1a7	MEDIUM: HTTP compression (zlib library support) This commit introduces HTTP compression using the zlib library. http_response_forward_body has been modified to call the compression functions. This feature includes 3 algorithms: identity, gzip and deflate: * identity: this is mostly for debugging, and it was useful for developping the compression feature. With Content-Length in input, it is making each chunk with the data available in the current buffer. With chunks in input, it is rechunking, the output chunks will be bigger or smaller depending of the size of the input chunk and the size of the buffer. Identity does not apply any change on data. * gzip: same as identity, but applying a gzip compression. The data are deflated using the Z_NO_FLUSH flag in zlib. When there is no more data in the input buffer, it flushes the data in the output buffer (Z_SYNC_FLUSH). At the end of data, when it receives the last chunk in input, or when there is no more data to read, it writes the end of data with Z_FINISH and the ending chunk. * deflate: same as gzip, but with deflate algorithm and zlib format. Note that this algorithm has ambiguous support on many browsers and no support at all from recent ones. It is strongly recommended not to use it for anything else than experimentation. You can't choose the compression ratio at the moment, it will be set to Z_BEST_SPEED (1), as tests have shown very little benefit in terms of compression ration when going above for HTML contents, at the cost of a massive CPU impact. Compression will be activated depending of the Accept-Encoding request header. With identity, it does not take care of that header. To build HAProxy with zlib support, use USE_ZLIB=1 in the make parameters. This work was initially started by David Du Colombier at Exceliance.	2012-10-26 02:30:48 +02:00
Willy Tarreau	c93f7959e5	CLEANUP: session: remove term_trace which is not used anymore This field was used to trace precisely where a session was terminated but it did not survive code rearchitecture and was not used at all anymore. Let's get rid of it.	2012-10-13 11:10:30 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	394db379eb	REORG: http: rename msg->buf to msg->chn since it's a channel It's extremely confusing to have all those msg->buf->buf everywhere after the extraction of the buffer from the channel. Let's clean this up.	2012-10-12 22:40:39 +02:00
Willy Tarreau	93dbc2bc0e	MEDIUM: log: add a new LW_XPRT flag to pin the transport layer This flag will have to be set on log tags which require transport layer information. They will prevent the conn_xprt_close() call from releasing the transport layer too early.	2012-10-12 20:30:51 +02:00
Willy Tarreau	1e954913de	MEDIUM: connection: add a flag to hold the transport layer When we start logging SSL information, we need the SSL struct to be present even past the conn_xprt_close() call. In order to achieve this, we should use refcounting on the connection and the transport layer. At the moment it's not worth using plain refcounting as only the logs require this, so instead of real refcounting we just use a flag which will be set by the log subsystem when SSL data need to be logged. What happens then is that the xprt->close() call is ignored and the transport layer is closed again during session_free(), after the log line is emitted.	2012-10-12 20:30:50 +02:00
Willy Tarreau	91083f5c8f	BUG/MEDIUM: session: enable the conn_session_update() callback This callback was introduced by commit `9683e9a0` but never enabled because the CO_FL_WAKE_DATA flag was not set. The result is that this function is never called when an SSL handshake fails, so the connection is only closed on timeout.	2012-10-12 20:30:38 +02:00
Willy Tarreau	e9909f4e50	BUG/MINOR: session: fix some leftover from debug code Commit `82569f91` moved the health and monitor-net checks to session.c but a debug test introduced 0& to disable MSG_DONTWAIT in the recv() call and this debug code remained there. Since the socket is marked non-blocking, there should be no effect but it's dangerous to keep such a thing here.	2012-10-12 17:36:40 +02:00
Willy Tarreau	1bc4aab290	MEDIUM: listener: add support for linux's accept4() syscall On Linux, accept4() does the same as accept() except that it allows the caller to specify some flags to set on the resulting socket. We use this to set the O_NONBLOCK flag and thus to save one fcntl() call in each connection. The effect is a small performance gain of around 1%. The option is automatically enabled when target linux2628 is set, or when the USE_ACCEPT4 Makefile variable is set. If the libc is too old to provide the equivalent function, this is automatically detected and our own function is used instead. In any case it is possible to force the use of our implementation with USE_MY_ACCEPT4.	2012-10-08 20:11:03 +02:00
Willy Tarreau	9683e9a05f	MEDIUM: session: register a data->wake callback to process errors The connection layer will soon call ->wake() only when errors happen, and not ->init(). So make the session layer use this callback to detect errors and abort connections.	2012-10-04 22:26:10 +02:00
Willy Tarreau	071e137ec2	MEDIUM: connection: use a generic data-layer init() callback The generic data-layer init callback is now used after the transport layer is complete and before calling the data layer recv/send callbacks. This allows the session to switch from the embryonic session data layer to the complete stream interface data layer, by making conn_session_complete() the data layer's init callback. It sill looks awkwards that the init() callback must be used opon error, but except by adding yet another one, it does not seem to be mergeable into another function (eg: it should probably not be merged with ->wake to avoid unneeded calls during the handshake, though semantically that would make sense).	2012-10-04 22:26:10 +02:00
Willy Tarreau	5e75e2755e	MEDIUM: session: use a specific data_cb for embryonic sessions We don't want to have the recv or send callbacks in embryonic sessions, and we want the stream interface to be referenced as the connection owner only once the session is instanciated. So let's first have the embryonic session be the owner, then replaced later by the stream interface once the transport layer is ready.	2012-10-04 22:26:10 +02:00
Willy Tarreau	4aa3683b2d	MINOR: connection: provide a generic data layer wakeup callback Instead of calling conn_notify_si() from the connection handler, we now call data->wake(), which will allow us to use a different callback with health checks. Note that we still rely on a flag in order to decide whether or not to call this function. The reason is that with embryonic sessions, the callback is already initialized to si_conn_cb without the flag, and we can't call the SI notify function in the leave path before the stream interface is initialized. This issue should be addressed by involving a different data_cb for embryonic sessions and for stream interfaces, that would be changed during session_complete() for the final data_cb.	2012-10-04 22:26:10 +02:00
Willy Tarreau	f7bc57ca6e	REORG: connection: rename the data layer the "transport layer" While working on the changes required to make the health checks use the new connections, it started to become obvious that some naming was not logical at all in the connections. Specifically, it is not logical to call the "data layer" the layer which is in charge for all the handshake and which does not yet provide a data layer once established until a session has allocated all the required buffers. In fact, it's more a transport layer, which makes much more sense. The transport layer offers a medium on which data can transit, and it offers the functions to move these data when the upper layer requests this. And it is the upper layer which iterates over the transport layer's functions to move data which should be called the data layer. The use case where it's obvious is with embryonic sessions : an incoming SSL connection is accepted. Only the connection is allocated, not the buffers nor stream interface, etc... The connection handles the SSL handshake by itself. Once this handshake is complete, we can't use the data functions because the buffers and stream interface are not there yet. Hence we have to first call a specific function to complete the session initialization, after which we'll be able to use the data functions. This clearly proves that SSL here is only a transport layer and that the stream interface constitutes the data layer. A similar change will be performed to rename app_cb => data, but the two could not be in the same commit for obvious reasons.	2012-10-04 22:26:09 +02:00
Willy Tarreau	e603e69d18	MEDIUM: connection: make use of the owner instead of container_of This way the connection can become independant on the stream interface.	2012-09-28 00:01:23 +02:00
Willy Tarreau	82569f9158	MEDIUM: monitor: simplify handling of monitor-net and mode health We were having several different behaviours with monitor-net and "mode health" : - monitor-net on TCP connections was evaluated just after accept(), did not count a connection on the frontend and were not subject to tcp-request connection rules, and caused an immediate close(). - monitor-net in HTTP mode was evaluated once the session was accepted (eg: on top of SSL), returned "HTTP/1.0 200 OK\r\n\r\n" over the connection's data layer and instanciated a session which was responsible for closing this connection. A connection AND a session were counted for the frontend ; - "mode health" with "option httpchk" would do exactly the same as monitor-net in HTTP mode ; - "mode health" without "option httpchk" would do the same as above except that "OK" was returned instead of "HTTP/1.0 200 OK\r\n\r\n". None of them took care of cleaning the input buffer, sometimes resulting in a TCP reset to be emitted after the last packet if a request was received over the connection. Given the inconsistencies and the complexity in keeping all these features handled at the right position, we now slightly changed the way they are handled : - all of them are handled just after the "tcp-request connection" rules, so that all of them may be blocked using such rules, offering more flexibility and consistency ; - no connection handshake is performed anymore for non-TCP modes - all of them send the response as raw data over the socket, there is no more difference between TCP and HTTP mode for example (these rules were never meant to be served over SSL connections and were never documented as able to do that). - any possible pending data on the incoming socket is drained before the response is sent, in order to avoid the risk of a reset. - none of them exactly did what was documented ! This results in more consistent, more flexible and more accurate handling of monitor rules, with smaller and more robust code.	2012-09-28 00:01:22 +02:00
Cyril Bonté	3aaba440a2	BUILD: fix compilation error with DEBUG_FULL Recent changes in structures broke the compilation when using DEBUG_FULL. Let's update apply the changes also to the variables used in DPRINTF calls.	2012-09-24 20:36:39 +02:00
Willy Tarreau	d1d5454180	REORG: split "protocols" files into protocol and listener It was becoming confusing to have protocols and listeners in the same files, split them.	2012-09-15 22:29:32 +02:00
Willy Tarreau	cbaaec475c	MINOR: session: do not send an HTTP/500 error on SSL sockets If a session fails its initialization, we don't want to send HTTP/500 over the socket if it's not a raw data layer.	2012-09-06 11:32:07 +02:00
Willy Tarreau	783f25800c	BUILD: http: rename error_message http_error_message to fix conflicts on RHEL Duncan Hall reported a build issue on CentOS where error_message conflicts with another system declaration when SSL is enabled. Rename the function.	2012-09-04 12:19:04 +02:00
Willy Tarreau	dd2f85eb3b	CLEANUP: includes: fix includes for a number of users of fd.h It appears that fd.h includes a number of unneeded files and was included from standard.h, and as such served as an intermediary to provide almost everything to everyone. By removing its useless includes, a long dependency chain broke but could easily be fixed.	2012-09-03 20:49:14 +02:00
Willy Tarreau	40ff59d820	CLEANUP: fd: remove fdtab->flags These flags were added for TCP_CORK. They were only set at various places but never checked by any user since TCP_CORK was replaced with MSG_MORE. Simply get rid of this now.	2012-09-03 20:49:14 +02:00
Willy Tarreau	74172ff9c3	CLEANUP: frontend: remove the old proxy protocol decoder This one used to rely on a stream analyser which was inappropriate. It's not used anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	22cda21ad5	MAJOR: connection: make the PROXY decoder a handshake handler The PROXY protocol is now decoded in the connection before other handshakes. This means that it may be extracted from a TCP stream before SSL is decoded from this stream.	2012-09-03 20:47:35 +02:00
Willy Tarreau	2542b53b19	MAJOR: session: introduce embryonic sessions When an incoming connection request is accepted, a connection structure is needed to store its state. However we don't want to fully initialize a session until the data layer is about to be ready. As long as the connection is physically stored into the session, it's not easy to split both allocations. As such, we only initialize the minimum requirements of a session, which results in what we call an embryonic session. Then once the data layer is ready, we can complete the function's initialization. Doing so avoids buffers allocation and ensures that a session only sees ready connections. The frontend's client timeout is used as the handshake timeout. It is likely that another timeout will be used in the future.	2012-09-03 20:47:35 +02:00

... 2 3 4 5 6 ...

622 Commits