haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-09 16:47:18 +02:00

Author	SHA1	Message	Date
Christopher Faulet	a73e59b690	BUG/MAJOR: Fix how the list of entities waiting for a buffer is handled When an entity tries to get a buffer and it cannot be allocated, for example because the number of buffers which may be allocated per process is limited, this entity is added to a list (called <buffer_wq>) and waits for an available buffer. Historically, the <buffer_wq> list was logically attached to streams because they were the only entities likely to be added to it. Now, applets can also be waiting for a free buffer. And with filters, we could imagine having even more entities waiting for a buffer. So it makes sense to have a generic list. Anyway, with the current design there is a bug. When an applet fails to get a buffer, it will wait. But we add the stream attached to the applet to <buffer_wq>, instead of the applet itself. So when a buffer is available, we wake up the stream and not the waiting applet. So it is possible to have waiting applets that are never awakened. So, now, <buffer_wq> is independent from streams. And we really add the waiting entity to <buffer_wq>. To be generic, the entity is responsible for defining the callback used to awaken it. In addition, applets will still request an input buffer when they become active. But they will no longer be put to sleep if no buffer is available. So it is the responsibility of the applet I/O handler to check whether this buffer is allocated or not. This way, an applet can decide whether this buffer is required or not and can do additional processing if not. [wt: backport to 1.7 and 1.6]	2016-12-12 19:11:04 +01:00
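The generic wait list described in a73e59b690 can be pictured with a minimal sketch in C. It assumes a plain singly linked FIFO and uses hypothetical names (`buffer_wait_sketch`, `buffer_wq_sketch`, `buffer_wq_add`, `buffer_wq_offer`, `count_wakeup`); it is not HAProxy's actual data structure or API. It only illustrates that the waiting entity itself is queued together with its own wake-up callback.

```c
#include <stddef.h>

/* Hypothetical types: one queue entry per waiting entity. */
struct buffer_wait_sketch {
    void *target;                     /* the waiting entity (stream, applet, ...) */
    int (*wakeup_cb)(void *target);   /* returns 1 if the entity accepted the buffer */
    struct buffer_wait_sketch *next;
};

struct buffer_wq_sketch {
    struct buffer_wait_sketch *head;
    struct buffer_wait_sketch *tail;
};

/* Append a waiter at the tail so that entities are served in FIFO order. */
static void buffer_wq_add(struct buffer_wq_sketch *wq, struct buffer_wait_sketch *w)
{
    w->next = NULL;
    if (wq->tail)
        wq->tail->next = w;
    else
        wq->head = w;
    wq->tail = w;
}

/* Offer one freed buffer: dequeue waiters in order and call each one's own
 * callback until one of them accepts. Returns 1 if a waiter took the buffer,
 * 0 if the queue was exhausted. */
static int buffer_wq_offer(struct buffer_wq_sketch *wq)
{
    while (wq->head) {
        struct buffer_wait_sketch *w = wq->head;

        wq->head = w->next;
        if (!wq->head)
            wq->tail = NULL;
        w->next = NULL;
        if (w->wakeup_cb(w->target))
            return 1;
    }
    return 0;
}

/* Example callback: counts wake-ups in the int pointed to by <target>. */
static int count_wakeup(void *target)
{
    ++*(int *)target;
    return 1;
}
```

Because the callback travels with the entry, the queue does not need to know whether the waiter is a stream, an applet or a filter.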
Christopher Faulet	9d810cae11	BUG/MEDIUM: stream: Save unprocessed events for a stream A stream can be awakened for different reasons. During its processing, it can be stopped early if no buffer is available. In this situation, the reason why the stream was awakened is lost, because we rely on the task state, which is reset after each processing loop. In many cases, that's not a big deal. But it can be useful to accumulate the task states if the stream processing is interrupted, especially if some filters need to be called. To be clearer, here is a simple example: 1) A stream is awakened with the reason TASK_WOKEN_MSG. 2) Because no buffer is available, the processing is interrupted and the stream goes back to sleep. And the task state is reset. 3) Some buffers become available, so the stream is awakened with the reason TASK_WOKEN_RES. At this step, the previous reason (TASK_WOKEN_MSG) is lost. Now, the task states are saved for a stream and reset only when the stream processing is not interrupted. The corresponding bitfield represents the pending events for a stream. And we use this one instead of the task state during the stream processing. Note that TASK_WOKEN_TIMER and TASK_WOKEN_RES are always removed because these events are always handled during the stream processing. [wt: backport to 1.7 and 1.6]	2016-12-12 19:10:58 +01:00
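The three-step example in 9d810cae11 can be replayed with a small sketch in C. The flag values, the `pending_stream_sketch` type and the two helper functions are assumptions made for illustration; HAProxy's real constants and code paths differ. The sketch only shows the idea of accumulating wake-up reasons in a per-stream bitfield and clearing it when a pass completes.

```c
/* Hypothetical flag values; the real TASK_WOKEN_* constants differ. */
#define TASK_WOKEN_TIMER 0x01u
#define TASK_WOKEN_MSG   0x02u
#define TASK_WOKEN_RES   0x04u

struct pending_stream_sketch {
    unsigned int pending_events;   /* wake-up reasons not fully processed yet */
};

/* Start of a processing pass: merge the task's wake-up reasons into the
 * stream and return the set of events this pass has to consider. */
static unsigned int stream_begin_pass(struct pending_stream_sketch *s,
                                      unsigned int task_state)
{
    s->pending_events |= task_state;
    return s->pending_events;
}

/* End of a processing pass. If the pass was interrupted (for example no
 * buffer was available), keep the events except TIMER and RES, which are
 * always handled during processing. Otherwise everything was processed. */
static void stream_end_pass(struct pending_stream_sketch *s, int interrupted)
{
    if (interrupted)
        s->pending_events &= ~(TASK_WOKEN_TIMER | TASK_WOKEN_RES);
    else
        s->pending_events = 0;
}
```

With this scheme, a stream woken with TASK_WOKEN_MSG, interrupted, then woken again with TASK_WOKEN_RES still sees TASK_WOKEN_MSG on the second pass.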
Christopher Faulet	34c5cc98da	MINOR: task: Rename run_queue and run_queue_cur counters <run_queue> is used to track the number of tasks in the run queue and <run_queue_cur> is a copy used for reporting purposes. These counters have been renamed, respectively, <tasks_run_queue> and <tasks_run_queue_cur>. So the naming is consistent between tasks and applets. [wt: needed for next fixes, backport to 1.7 and 1.6]	2016-12-12 19:10:54 +01:00
Christopher Faulet	d47a1bd1d7	BUG/MINOR: filters: Invert evaluation order of HTTP_XFER_BODY and XFER_DATA analyzers These 2 analyzers are responsible for the data forwarding in, respectively, HTTP mode and TCP mode. Now, the analyzer responsible for the HTTP data forwarding is called before the one responsible for the TCP data forwarding. This will allow the filtering of tunneled data in HTTP. [wt: backport desired in 1.7 - no impact right now but may impact the ability to backport future fixes]	2016-11-29 17:03:04 +01:00
Willy Tarreau	7d56221d57	REORG: stkctr: move all the stick counters processing to stick-tables.c Historically we used to have the stick counters processing put into session.c which became stream.c. But a big part of it is now in stick-table.c (eg: converters) but despite this we still have all the sample fetch functions in stream.c These parts do not depend on the stream anymore, so let's move the remaining chunks to stick-table.c and have cleaner files. What remains in stream.c is everything needed to attach/detach trackers to the stream and to update the counters while the stream is being processed.	2016-11-25 16:10:05 +01:00
Willy Tarreau	397131093f	REORG: tcp-rules: move tcp rules processing to their own file There's no more reason to keep tcp rules processing inside proto_tcp.c given that there is nothing in common there except these 3 letters : tcp. The tcp rules are in fact connection, session and content processing rules. Let's move them to "tcp-rules" and let them live their life there.	2016-11-25 15:57:38 +01:00
Willy Tarreau	30e5e18bbb	CLEANUP: cli: remove assignments to st0 and st2 in keyword parsers Now it's not needed anymore to set STAT_ST_INIT nor CLI_ST_CALLBACK in the parsers, remove it in the various places.	2016-11-24 16:59:28 +01:00
Willy Tarreau	3b6e547be8	CLEANUP: cli: rename STAT_CLI_* to CLI_ST_* These are in CLI states, not stats states anymore. STAT_CLI_O_CUSTOM was more appropriately renamed CLI_ST_CALLBACK.	2016-11-24 16:59:28 +01:00
Willy Tarreau	61b6521cbf	REORG: cli: move "shutdown session" to stream.c It really kills streams in fact, but we can't change the name now.	2016-11-24 16:59:28 +01:00
Willy Tarreau	4e46b62ab1	REORG: cli: move "shutdown sessions server" to stream.c It could be argued that it's between server, stream and session but at least due to the fact that it operates on streams, its best place is in stream.c.	2016-11-24 16:59:28 +01:00
William Lallemand	4c5b4d531c	REORG: cli: move 'show sess' to stream.c Move 'show sess' CLI functions to stream.c and use the cli keyword API to register it on the CLI. [wt: the choice of stream vs session makes sense because since 1.6 these really are streams that we're dumping and not sessions anymore]	2016-11-24 16:59:27 +01:00
William Lallemand	9ed6203aef	REORG: cli: split dumpstats.h in stats.h and cli.h proto/dumpstats.h has been split into 4 files: * proto/cli.h contains prototypes for the CLI * proto/stats.h contains prototypes for the stats * types/cli.h contains definitions for the CLI * types/stats.h contains definitions for the stats	2016-11-24 16:59:27 +01:00
Christopher Faulet	a00d817aba	MINOR: filters: Add check_timeouts callback to handle timers expiration on streams A filter can now be notified when a stream is woken up because of an expired timer. The documentation and the TRACE filter have been updated.	2016-11-21 15:29:58 +01:00
Willy Tarreau	350135cf49	BUG/MEDIUM: connection: check the control layer before stopping polling The bug described in commit `568743a` ("BUG/MEDIUM: stream-int: completely detach connection on connect error") was not a stream-interface layer bug but a connection layer bug. There was exactly one place in the code where we could change a file descriptor's status without first checking whether it is valid or not, it was in conn_stop_polling(). This one is called when the polling status is changed after an update, and calls fd_stop_both even if we had already closed the file descriptor : 1479388298.484240 ->->->->-> conn_fd_handler > conn_cond_update_polling 1479388298.484240 ->->->->->-> conn_cond_update_polling > conn_stop_polling 1479388298.484241 ->->->->->->-> conn_stop_polling > conn_ctrl_ready 1479388298.484241 conn_stop_polling < conn_ctrl_ready 1479388298.484241 ->->->->->->-> conn_stop_polling > fd_stop_both 1479388298.484242 ->->->->->->->-> fd_stop_both > fd_update_cache 1479388298.484242 ->->->->->->->->-> fd_update_cache > fd_release_cache_entry 1479388298.484242 fd_update_cache < fd_release_cache_entry 1479388298.484243 fd_stop_both < fd_update_cache 1479388298.484243 conn_stop_polling < fd_stop_both 1479388298.484243 conn_cond_update_polling < conn_stop_polling 1479388298.484243 conn_fd_handler < conn_cond_update_polling The problem with the previous fix above is that it breaks the http_proxy mode and possibly even some Lua parts and peers to a certain extent ; all outgoing connections where the target address is initially copied into the outgoing connection which experience a retry would use a random outgoing address after the retry because closing and detaching the connection causes the target address to be lost. This was attempted to be addressed by commit `0857d7a` ("BUG/MAJOR: stream: properly mark the server address as unset on connect retry") but it used to only solve the most visible effect and not the root cause. 
Prior to this fix, it was possible to cause this config to keep CLOSE_WAIT for as long as it takes to expire a client or server timeout (note the missing client timeout) : listen test mode http bind :8002 server s1 127.0.0.1:8001 $ tcploop 8001 L0 W N20 A R P100 S:"HTTP/1.1 200 OK\r\nContent-length: 0\r\n\r\n" & $ tcploop 8002 N200 C T W S:"GET / HTTP/1.0\r\n\r\n" O P10000 K With this patch, these CLOSE_WAIT properly vanish when both processes leave. This commit reverts the two fixes above and replaces them with the proper fix in connection.h. It must be backported to 1.6 and 1.5. Thanks to Robson Roberto Souza Peixoto for providing very detailed traces showing some obvious inconsistencies leading to finding this bug.	2016-11-18 14:48:52 +01:00
Willy Tarreau	def0d22cc5	MINOR: stream: make option contstats usable again Quite a lot of people have been complaining about option contstats not working correctly anymore since about 1.4. The reason was that one reason for the significant performance boost between 1.3 and 1.4 was the ability to forward data between a server and a client without waking up the stream manager. And we couldn't afford to force sessions to constantly wake it up given that most of the people interested in contstats are also those interested in high performance transmission. An idea was experimented with in the past, consisting in limiting the amount of transmissible data before waking it up, but it was not usable on slow connections (eg: FTP over modem lines, RDP, SSH) as stats would be updated too rarely if at all, so that idea was dropped. During a discussion today another idea came up : ensure that stats are updated once in a while, since it's the only thing that matters. It happens that we have the request channel's analyse_exp timeout that is used to wake the stream up after a configured delay, and that by definition this timeout is not used when there's no more analyser (otherwise the stream would wake up and the stats would be updated). Thus here the idea is to reuse this timeout when there's no analyser and set it to now+5 seconds so that a stream wakes up at least once every 5 seconds to update its stats. It should be short enough to provide smooth traffic graphs and to allow to debug outputs of "show sess" more easily without inflicting too much load even for very large number of concurrent connections. This patch is simple enough and safe enough to be backportable to 1.6 if there is some demand.	2016-11-08 22:03:00 +01:00
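The timer reuse described in def0d22cc5 can be sketched as follows in C. The structure, the constants and the function name are hypothetical and only mirror the wording of the message: when no analyser remains on the request channel and the analysis timeout is unused, it is armed 5 seconds ahead so that the stream wakes up and refreshes its stats. Tick wrap-around is deliberately ignored in this sketch.

```c
/* Hypothetical sentinel and period; names are not taken from HAProxy. */
#define TIMER_UNSET_SKETCH 0u
#define STATS_REFRESH_MS   5000u

struct req_channel_sketch {
    unsigned int analysers;     /* bitfield of analysers still attached */
    unsigned int analyse_exp;   /* expiration date in ms, or TIMER_UNSET_SKETCH */
};

/* Arm the analysis timeout as a periodic stats refresh, but only when it is
 * otherwise unused: contstats enabled, no analyser left, timer not set. */
static void arm_stats_refresh(struct req_channel_sketch *req,
                              unsigned int now_ms, int contstats_enabled)
{
    if (!contstats_enabled)
        return;
    if (req->analysers)
        return;   /* analysers already cause the stream to wake up */
    if (req->analyse_exp != TIMER_UNSET_SKETCH)
        return;   /* a wake-up is already scheduled */
    req->analyse_exp = now_ms + STATS_REFRESH_MS;
}
```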
Andrew Rodland	e168feb4a8	MINOR: proxy: add 'served' field to proxy, equal to total of all servers' This will allow lb_chash to determine the total active sessions for a proxy without any computation. Signed-off-by: Andrew Rodland <andrewr@vimeo.com>	2016-10-25 20:21:32 +02:00
Willy Tarreau	0857d7aa8f	BUG/MAJOR: stream: properly mark the server address as unset on connect retry Bartosz Koninski reported that recent commit `568743a` ("BUG/MEDIUM: stream-int: completely detach connection on connect error") introduced a nasty side effect during the connection retry, causing a reconnect attempt to be performed on a random address, possibly another server from another backend as shown in his reproducer. The real reason is in fact that by calling si_release_endpoint() after a failed connect attempt and not clearing the SN_ADDR_SET flag on the stream, we indicate that we continue to trust the connection's current address. So upon next connect attempt, a connection is picked from the pool and the last target address is reused. It is not very easy to reproduce it 100% reliably out of production traffic, but it's easier to see when haproxy is started with -dM to force the pools to be filled with junk, and where subsequent connections are not even attempted due to their family being incorrect. The correct fix consists in clearing the SN_ADDR_SET flag after calling si_release_endpoint() during a retry. But it outlines a deeper issue which is in fact that the target address is stored in the connection and its validity is stored in the stream. Until we have a better solution to store target addresses, it would be better to rely on the connection flags (CO_FL_ADDR_TO_SET) for this purpose. But it also outlines the fact that the same issue still exists in Lua sockets and in idle sockets, which fortunately are not affected by this issue. Thanks to Bartosz for providing all the elements needed to understand the problem. This fix needs to be backported to 1.6 and 1.5.	2016-08-29 15:33:56 +02:00
Thierry FOURNIER / OZON.IO	4cac359a39	MEDIUM: log: Decompose %Tq in %Th %Ti %TR Tq is the time between the instant the connection is accepted and a complete valid request is received. This time includes the handshake (SSL / Proxy-Protocol), the idle time when the browser does preconnect and the request reception. This patch decomposes %Tq into 3 measurements named %Th, %Ti, and %TR, which return respectively the handshake time, the idle time and the duration of valid request reception. It also adds %Ta which reports the request's active time, which is the total time without %Th nor %Ti. It replaces %Tt as the total time, reporting accurate measurements for HTTP persistent connections. %Th is available for TCP and HTTP sessions, %Ti, %TR and %Ta are only available for HTTP connections. In addition to this, we have new timestamps %tr, %trg and %trl, which log the date of start of receipt of the request, respectively in the default format, in GMT time and in local time (by analogy with %t, %T and %Tl). All of them are obviously only available for HTTP. These values are more relevant as they more accurately represent the request date without being skewed by a browser's preconnect nor a keep-alive idle time. The HTTP log format and the CLF log format have been modified to use %tr, %TR, and %Ta respectively instead of %t, %Tq and %Tt. This way the default log formats now produce the expected output for users who don't want to manually fiddle with the log-format directive. 
Example with the following log-format : log-format "%ci:%cp [%tr] %ft %b/%s h=%Th/i=%Ti/R=%TR/w=%Tw/c=%Tc/r=%Tr/a=%Ta/t=%Tt %ST %B %CC %CS %tsc %ac/%fc/%bc/%sc/%rc %sq/%bq %hr %hs %{+Q}r" The request was sent by hand using "openssl s_client -connect" : Aug 23 14:43:20 haproxy[25446]: 127.0.0.1:45636 [23/Aug/2016:14:43:20.221] test~ test/test h=6/i=2375/R=261/w=0/c=1/r=0/a=262/t=2643 200 145 - - ---- 1/1/0/0/0 0/0 "GET / HTTP/1.1" => 6 ms of SSL handshake, 2375 ms waiting before sending the first char (in fact the time to type the first line), 261 ms before the end of the request, no time spent in queue, 1 ms spent connecting to the server, immediate response, total active time for this request = 262 ms. Total time from accept to close : 2643 ms. The timing now decomposes like this :

                 first request               2nd request
      |<-------------------------------->|<-------------- ...
      t         tr                       t    tr ...
   ---|----|----|----|----|----|----|----|----|--
      : Th   Ti   TR   Tw   Tc   Tr   Td : Ti   ...
      :<---- Tq ---->:                   :
      :<-------------- Tt -------------->:
                :<--------- Ta --------->:	2016-08-23 15:18:08 +02:00
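The relations between the timers in 4cac359a39 can be checked against the logged example with a short sketch in C. The structure and function names are hypothetical. The data transfer time Td does not appear in the log line, so it is taken as 0 here, which is the value implied by a=262 with R=261, w=0, c=1 and r=0. Timers reported as -1 (phase not reached) are not handled.

```c
/* Hypothetical container for the per-request timers, in milliseconds. */
struct timers_sketch {
    long th;      /* %Th: handshake time */
    long ti;      /* %Ti: idle time before the first byte of the request */
    long treq;    /* %TR: time to receive the complete request */
    long tw;      /* %Tw: time spent in queue */
    long tc;      /* %Tc: time to connect to the server */
    long tresp;   /* %Tr: server response time */
    long td;      /* Td: data transfer time (assumed, not in the log line) */
};

/* %Tq: from accept to a complete valid request. */
static long timer_tq(const struct timers_sketch *t)
{
    return t->th + t->ti + t->treq;
}

/* %Ta: active time of the request, i.e. the total without %Th and %Ti. */
static long timer_ta(const struct timers_sketch *t)
{
    return t->treq + t->tw + t->tc + t->tresp + t->td;
}

/* %Tt: total time from accept to close. */
static long timer_tt(const struct timers_sketch *t)
{
    return t->th + t->ti + timer_ta(t);
}
```

With the values from the log line (h=6, i=2375, R=261, w=0, c=1, r=0) and Td taken as 0, these functions give Ta = 262 and Tt = 2643, matching a=262 and t=2643, and Tq = 2642.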
Willy Tarreau	4d03ef7f03	BUG/MAJOR: stick-counters: possible crash when using sc_trackers with wrong table Bryan Talbot reported a very interesting bug. The sc_trackers() sample fetch seems to have escaped the sanitization that was performed during 1.5 to ensure all dereferences of stkctr_entry() were safe. Here if a tracker is set on a backend and is then checked against a different backend where the entry doesn't exist, stkctr_entry() returns NULL and this is dereferenced to retrieve the ref count. Thanks to Bryan for his detailed bug report featuring a working config and reproducer. This fix must be backported to 1.6 and 1.5.	2016-08-14 12:02:55 +02:00
Willy Tarreau	568743a21f	BUG/MEDIUM: stream-int: completely detach connection on connect error Tim Butler reported a troubling issue affecting all versions since 1.5. When a connection error occurs and a retry is performed on the same server, the server connection first goes into the turn-around state (SI_ST_TAR) for one second. During this time, the client may speak and try to push some data. The tests in place confirm that the stream interface is in a state <= SI_ST_EST and that a connection exists, so all ingredients are present to try to perform a send() to forward data. The send() cannot be performed since the connection's control layer is marked as not ready, but the polling flags are changed, and due to the remaining error flag present on the connection, the polling on the FD is disabled in both directions. But if this FD was reassigned to another connection in the mean time, it is this FD which is disabled, and it causes a timeout on another connection. A configuration allowing to reproduce the issue looks like this : listen test bind :8003 server s1 127.0.0.1:8001 # this one should be closed listen victim bind :8002 server s1 127.0.0.1:8000 # this one should respond slowly (~50ms) Two parallel injections should be run with short time-outs (100ms). After some time, some dead connections will appear in listener "victim" due to their I/Os being disabled by some of the failed transfers on "test" instance. These ones will only be flushed on time out. A dead connection looks like this : > show sess 0x7dcb70 0x7dcb70: [07/Aug/2016:08:58:40.120151] id=3771 proto=tcpv4 source=127.0.0.1:34682 flags=0xce, conn_retries=3, srv_conn=0x7da020, pend_pos=(nil) frontend=victim (id=3 mode=tcp), listener=? 
(id=1) addr=127.0.0.1:8002 backend=victim (id=3 mode=tcp) addr=127.0.0.1:37736 server=s1 (id=1) addr=127.0.0.1:8000 task=0x7dcaf8 (state=0x08 nice=0 calls=2 exp=<NEVER> age=30s) si[0]=0x7dcd68 (state=EST flags=0x08 endp0=CONN:0x7e2410 exp=<NEVER>, et=0x000) si[1]=0x7dcd88 (state=EST flags=0x18 endp1=CONN:0x7e0cd0 exp=<NEVER>, et=0x000) co0=0x7e2410 ctrl=tcpv4 xprt=RAW data=STRM target=LISTENER:0x7d9ea8 flags=0x0020b306 fd=122 fd.state=25 fd.cache=0 updt=0 co1=0x7e0cd0 ctrl=tcpv4 xprt=RAW data=STRM target=SERVER:0x7da020 flags=0x0020b306 fd=93 fd.state=20 fd.cache=0 updt=0 req=0x7dcb80 (f=0x848000 an=0x0 pipe=0 tofwd=-1 total=129) an_exp=<NEVER> rex=<NEVER> wex=<NEVER> buf=0x7893c0 data=0x7893d4 o=0 p=0 req.next=0 i=0 size=0 res=0x7dcbc0 (f=0x80008000 an=0x0 pipe=0 tofwd=-1 total=0) an_exp=<NEVER> rex=<NEVER> wex=<NEVER> buf=0x7893c0 data=0x7893d4 o=0 p=0 rsp.next=0 i=0 size=0 The solution against this issue is to completely detach the connection upon error instead of only performing a forced close. This fix should be backported to 1.6 and 1.5. Special thanks to Tim who did all the troubleshooting work and provided a lot of traces allowing to find the root cause of this problem.	2016-08-07 09:21:04 +02:00
Thierry Fournier	6fc340ff07	BUG/MEDIUM: sticktables: segfault in some configuration error cases When a stick table is tracked, and another one is used later in the configuration, a segfault occurs. The function "smp_create_src_stkctr" can return a NULL value, and its return value is not tested, so another function tries to dereference a NULL pointer. This patch just adds a check for the NULL pointer. The problem is reproduced with this configuration: listen www mode http bind :12345 tcp-request content track-sc0 src table IPv4 http-request allow if { sc0_inc_gpc0(IPv6) gt 0 } server dummy 127.0.0.1:80 backend IPv4 stick-table type ip size 10 expire 60s store gpc0 backend IPv6 stick-table type ipv6 size 10 expire 60s store gpc0 Thanks to kabefuna@gmail.com for the bug report. This patch must be backported to the 1.6 and 1.5 versions.	2016-06-07 11:05:23 +02:00
Christopher Faulet	3a394fa7cd	MEDIUM: filters: Add pre and post analyzer callbacks 'channel_analyze' callback has been removed. Now, there are 2 callbacks to surround calls to analyzers: * channel_pre_analyze: Called BEFORE all filterable analyzers. it can be called many times for the same analyzer, once at each loop until the analyzer finishes its processing. This callback is resumable, it returns a negative value if an error occurs, 0 if it needs to wait, any other value otherwise. * channel_post_analyze: Called AFTER all filterable analyzers. Here, AFTER means when an analyzer finishes its processing. This callback is NOT resumable, it returns a negative value if an error occurs, any other value otherwise. Pre and post analyzer callbacks are not automatically called. 'pre_analyzers' and 'post_analyzers' bit fields in the filter structure must be set to the right value using AN_* flags (see include/types/channel.h). The flag AN_RES_ALL has been added (AN_REQ_ALL already exists) to ease the life of filter developers. AN_REQ_ALL and AN_RES_ALL include all filterable analyzers.	2016-05-18 15:11:54 +02:00
Christopher Faulet	a9215b7206	MINOR: filters: Simplify calls to analyzers using 2 new macros Now, to call an analyzer in the 'process_stream' function, we should use the FLT_ANALYZE or ANALYZE macro, depending on whether it is a filterable analyzer or not.	2016-05-18 15:11:54 +02:00
Willy Tarreau	8bf242b764	BUG/MEDIUM: channel: fix inconsistent handling of 4GB-1 transfers In 1.4-dev3, commit `31971e5` ("[MEDIUM] add support for infinite forwarding") made it possible to configure the lower layer to forward data indefinitely by setting the forward size to CHN_INFINITE_FORWARD (4GB-1). By then larger chunk sizes were not supported so there was no confusion in the usage of the function. Since 1.5 we support 64-bit content-lengths and chunk sizes and the function has grown to support 64-bit arguments, though it still limits a single pass to 32-bit quantities (what fit in the channel's to_forward field). The issue now becomes that a 4GB-1 content-length can be confused with infinite forwarding (in fact it's 4GB-1+what was already in the buffer). It causes a visible effect when transferring this exact size because the transfer rate is lower than with other sizes due in part to the disabling of the Nagle algorithm on the sendto() call. In theory with keep-alive it should prevent a second request from being processed after such a transfer, but since the analysers are still present, the forwarding analyser properly counts down the remaining size to transfer and ultimately the transaction gets correctly reset so there is no visible effect. Since the root cause of the issue is an API problem (lack of distinction between a real valid length and a magic value), this patch modifies the API to have a new dedicated function called channel_forward_forever() to program a permanent forwarding. The existing function __channel_forward() was modified to properly take care of the requested sizes and ensure it 1) never overflows and 2) never reaches CHN_INFINITE_FORWARD by accident. It is worth noting that the function used to have a bug causing a 2GB forward to be scheduled if it was called with less data than what is present in buf->i. Fortunately this bug couldn't be triggered with existing code. This fix should be backported to 1.6 and 1.5. 
While it also theoretically affects 1.4, it's better not to backport it there, as the risk of breaking large object transfers due to significant API differences is high, compared to the fact that the largest supported objects (4GB-1) are just slower to transfer.	2016-05-04 15:26:37 +02:00
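The API split described in 8bf242b764 can be illustrated with a minimal sketch in C. Only CHN_INFINITE_FORWARD (4GB-1) and the name channel_forward_forever() come from the commit message; the `fwd_channel_sketch` type and `channel_forward_sketch()` are hypothetical and ignore data already present in the buffer. The point shown is that a finite request is clamped so that the 32-bit counter neither overflows nor lands on the magic value by accident.

```c
#include <stdint.h>

#define CHN_INFINITE_FORWARD UINT32_MAX   /* 4GB-1, the "forward forever" marker */

struct fwd_channel_sketch {
    uint32_t to_forward;   /* bytes scheduled for forwarding */
};

/* Explicitly program permanent forwarding. */
static void channel_forward_forever(struct fwd_channel_sketch *chn)
{
    chn->to_forward = CHN_INFINITE_FORWARD;
}

/* Schedule up to <bytes> more bytes in one pass and return how many were
 * actually scheduled. The counter is capped at CHN_INFINITE_FORWARD - 1, so
 * a real length can never be confused with infinite forwarding. */
static uint64_t channel_forward_sketch(struct fwd_channel_sketch *chn, uint64_t bytes)
{
    uint64_t room;

    if (chn->to_forward == CHN_INFINITE_FORWARD)
        return bytes;   /* already forwarding forever: everything is accepted */

    room = (uint64_t)(CHN_INFINITE_FORWARD - 1) - chn->to_forward;
    if (bytes > room)
        bytes = room;
    chn->to_forward += (uint32_t)bytes;
    return bytes;
}
```

A caller with a 4GB-1 content-length gets one byte less scheduled on the first pass and schedules the remainder on a later pass, once the counter has been drained.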
Willy Tarreau	5fb04711f0	BUG/MEDIUM: stream: ensure the SI_FL_DONT_WAKE flag is properly cleared The previous buffer space bug has revealed an issue causing some stalled connections to remain orphaned forever, preventing an old process from dying. The issue is that once in a while a task may be woken up because a disabled expiration timer has been reached despite no timeout being reached. In this case we exit very early but the SI_FL_DONT_WAKE flag wasn't cleared, resulting in new events not waking the task up. It may be one of the reasons why a few people have already observed some peers connections stuck in CLOSE_WAIT state. This bug was introduced in 1.5-dev13 by commit `798f432` ("OPTIM: session: don't process the whole session when only timers need a refresh"), so the fix must be backported to 1.6 and 1.5.	2016-05-04 10:18:37 +02:00
Frederik Deweerdt	6cd8d13c05	OPTIM/MINOR: session: abort if possible before connecting to the backend Depending on the path that led to sess_update_stream_int(), it's possible that we had a read error on the frontend, but that we haven't checked if we may abort the connection. This was seen in particular in the following setup: tcp mode, with abortonclose set, frontend using ssl. If the ssl connection had a first successful read, but the second read failed, we would still try to open a connection to the backend, although we had enough information to close the connection early. sess_update_stream_int() had some logic to handle that case in the SI_ST_QUE and SI_ST_TAR cases, but that was missing in the SI_ST_ASS case. This patch addresses the issue by verifying the state of the req channel (and the abortonclose option) right before opening the connection to the backend, so we have the opportunity to close the connection there, and factorizes the shared SI_ST_{QUE,TAR,ASS} code.	2016-04-07 19:12:02 +02:00
Thierry Fournier	40e1d51068	BUG/MEDIUM: stick-tables: some sample-fetch doesn't work in the connection state. The sc_* sample fetches can work without the struct strm, because the tracked counters are also stored in the session. So this patch removes the check for the existence of strm. This bug is recent and was introduced in 1.7-dev2 by commit `6204cd9` ("BUG/MAJOR: vars: always retrieve the stream and session from the sample") This bugfix must be backported to 1.6.	2016-03-30 19:51:33 +02:00
Willy Tarreau	6204cd9f27	BUG/MAJOR: vars: always retrieve the stream and session from the sample This is the continuation of previous patch called "BUG/MAJOR: samples: check smp->strm before using it". It happens that variables may have a session-wide scope, and that their session is retrieved by dereferencing the stream. But nothing prevents them from being used from a streamless context such as tcp-request connection, thus crashing the process. Example : tcp-request connection accept if { src,set-var(sess.foo) -m found } In order to fix this, we have to always ensure that variable manipulation only happens via the sample, which contains the correct owner and context, and that we never use one from a different source. This results in quite a large change since a lot of functions are indirectly involved in the call chain, but the change is easy to follow. This fix must be backported to 1.6, and requires the last two patches.	2016-03-10 17:28:04 +01:00
Willy Tarreau	be508f1580	BUG/MAJOR: samples: check smp->strm before using it Since commit `6879ad3` ("MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers") merged in 1.6-dev2, the sample contains the pointer to the stream and sample fetch functions as well as converters use it heavily. The problem is that earlier commit `87b0966` ("REORG/MAJOR: session: rename the "session" entity to "stream"") had split the session and stream resulting in the possibility for smp->strm to be NULL before the stream was initialized. This is what happens in tcp-request connection rulesets, as discovered by Baptiste. The sample fetch functions must now check that smp->strm is valid before using it. An alternative could consist in using a dummy stream with nothing in it to avoid some checks but it would only result in deferring them to the next step anyway, and making it harder to detect whether a stream is valid or the dummy one. There is still an issue with variables which requires a completely independent fix. They use strm->sess to find the session with strm possibly NULL and passed as an argument. All call places indirectly use smp->strm to build strm. So the problem is there but the API needs to be changed to remove this duplicate argument that makes it much harder to know what pointer to use. This fix must be backported to 1.6, as well as the next one fixing variables.	2016-03-10 16:42:58 +01:00
Christopher Faulet	309c6418b0	MEDIUM: filters: Replace filter_http_headers callback by an analyzer This new analyzer will be called for each HTTP request/response, before the parsing of the body. It is identified by AN_FLT_HTTP_HDRS. Special care was taken about the following condition : * the frontend is a TCP proxy * filters are defined in the frontend section * the selected backend is a HTTP proxy So, this patch explicitly adds the AN_FLT_HTTP_HDRS analyzer on the request and the response channels when the backend is a HTTP proxy and when there are filters attached to the stream. This patch simplifies the http_request_forward_body and http_response_forward_body functions.	2016-02-09 14:53:15 +01:00
Christopher Faulet	3e34429515	MEDIUM: filters: Use macros to call filters callbacks to speed-up processing When no filter is attached to the stream, the CPU footprint due to the calls to filters_* functions is huge, especially for chunk-encoded messages. Using macros to check if we have some filters or not is a great improvement. Furthermore, instead of checking the filter list emptiness, we introduce a flag to know if filters are attached or not to a stream.	2016-02-09 14:53:15 +01:00
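The optimisation in 3e34429515 amounts to testing a per-stream flag before entering any filter code. A minimal sketch in C, with hypothetical names throughout (the flag, the structure, the slow-path function and both macros are illustrative and not HAProxy's definitions):

```c
/* Hypothetical flag: set when at least one filter is attached to the stream. */
#define SK_FL_HAS_FILTERS 0x01u

struct flt_stream_sketch {
    unsigned int flags;
    int filter_calls;   /* counts how often the slow path was entered */
};

/* Slow path: a real implementation would walk the filter list here and call
 * each filter's callback. */
static int flt_walk_filters_sketch(struct flt_stream_sketch *s)
{
    s->filter_calls++;
    return 1;
}

/* Cheap test on a flag instead of checking the filter list for emptiness. */
#define HAS_FILTERS_SKETCH(s) ((s)->flags & SK_FL_HAS_FILTERS)

/* Evaluate <call> only when filters are attached; otherwise yield <dflt>
 * without any function call. */
#define FLT_CALL_SKETCH(s, call, dflt) \
    (HAS_FILTERS_SKETCH(s) ? (call) : (dflt))
```

Since the conditional operator evaluates only one of its branches, a stream without filters pays for a single bit test per call site.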
Christopher Faulet	92d3638d2d	MAJOR: filters/http: Rewrite the HTTP compression as a filter HTTP compression has been rewritten to use the filter API. This is more a PoC than anything else for now. It allocates memory to work, so, if only for that, it should be rewritten. In the mean time, the implementation has been refactored to allow its use with other filters. However, there are limitations that should be respected: - No filter placed after the compression one is allowed to change input data (in 'http_data' callback). - No filter placed before the compression one is allowed to change forwarded data (in 'http_forward_data' callback). For now, these limitations are informal, so you should be careful when you use several filters. About the configuration, 'compression' keywords are still supported and must be used to configure the HTTP compression behavior. In absence of a 'filter' line for the compression filter, it is added in the filter chain when the first 'compression' line is parsed. This is the easy way when you do not use other filters. But if another filter exists, an error is reported so that the user must explicitly declare the filter. For example: listen tst ... compression algo gzip compression offload ... filter flt_1 filter compression filter flt_2 ...	2016-02-09 14:53:15 +01:00
Christopher Faulet	d7c9196ae5	MAJOR: filters: Add filters support This patch adds the support of filters in HAProxy. The main idea is to have a way to "easily" extend HAProxy by adding some "modules", called filters, that will be able to change HAProxy behavior in a programmatic way. To do so, many entry points have been added in the code to let filters hook into different steps of the processing. A filter must define a flt_ops structure (see include/types/filters.h for details). This structure contains all available callbacks that a filter can define:

    struct flt_ops {
        /*
         * Callbacks to manage the filter lifecycle
         */
        int  (*init)  (struct proxy *p);
        void (*deinit)(struct proxy *p);
        int  (*check) (struct proxy *p);

        /*
         * Stream callbacks
         */
        void (*stream_start)     (struct stream *s);
        void (*stream_accept)    (struct stream *s);
        void (*session_establish)(struct stream *s);
        void (*stream_stop)      (struct stream *s);

        /*
         * HTTP callbacks
         */
        int  (*http_start)         (struct stream *s, struct http_msg *msg);
        int  (*http_start_body)    (struct stream *s, struct http_msg *msg);
        int  (*http_start_chunk)   (struct stream *s, struct http_msg *msg);
        int  (*http_data)          (struct stream *s, struct http_msg *msg);
        int  (*http_last_chunk)    (struct stream *s, struct http_msg *msg);
        int  (*http_end_chunk)     (struct stream *s, struct http_msg *msg);
        int  (*http_chunk_trailers)(struct stream *s, struct http_msg *msg);
        int  (*http_end_body)      (struct stream *s, struct http_msg *msg);
        void (*http_end)           (struct stream *s, struct http_msg *msg);
        void (*http_reset)         (struct stream *s, struct http_msg *msg);
        int  (*http_pre_process)   (struct stream *s, struct http_msg *msg);
        int  (*http_post_process)  (struct stream *s, struct http_msg *msg);
        void (*http_reply)         (struct stream *s, short status, const struct chunk *msg);
    };

To declare and use a filter, in the configuration, the "filter" keyword must be used in a listener/frontend section: frontend test ... filter <FILTER-NAME> [OPTIONS...] 
The filter referenced by the <FILTER-NAME> must declare a configuration parser on its own name to fill the flt_ops and filter_conf fields in the proxy's structure. An example will be provided later to make it perfectly clear. For now, filters cannot be used in backend sections. But this is only a matter of time. Documentation will also be added later. This is the first commit of a long list about filters. It is possible to have several filters on the same listener/frontend. These filters are stored in an array of at most MAX_FILTERS elements (defined in include/types/filters.h). Again, this will be replaced later by a list of filters. The filter API has been highly refactored. Main changes are: * Now, HAProxy supports an unlimited number of filters per proxy. To do so, filters are stored in a list. * Because filters are stored in a list, the filters' state has been moved from the channel structure to the filter structure. This is cleaner because there is no more info about filters in the channel structure. * It is possible to define filters on backends only. For such filters, stream_start/stream_stop callbacks are not called. Of course, it is possible to mix frontend and backend filters. * Now, TCP streams are also filtered. All callbacks without the 'http_' prefix are called for all kinds of streams. In addition, 2 new callbacks were added to filter data exchanged through a TCP stream: - tcp_data: it is called when new data are available or when old unprocessed data are still waiting. - tcp_forward_data: it is called when some data can be consumed. * New callbacks attached to channels were added: - channel_start_analyze: it is called when a filter is ready to process data exchanged through a channel. 2 new analyzers (a frontend and a backend) are attached to channels to call this callback. For a frontend filter, it is called before any other analyzer. For a backend filter, it is called when a backend is attached to a stream. So some processing cannot be filtered in that case. 
- channel_analyze: it is called before each analyzer attached to a channel, except analyzers responsible for data sending. - channel_end_analyze: it is called when all other analyzers have finished their processing. A new analyzer is attached to channels to call this callback. For a TCP stream, this is always the last one called. For an HTTP one, the callback is called when a request/response ends, so it is called one time for each request/response. * 'session_established' callback has been removed. Everything that is done in this callback can be handled by 'channel_start_analyze' on the response channel. * 'http_pre_process' and 'http_post_process' callbacks have been replaced by 'channel_analyze'. * 'http_start' callback has been replaced by 'http_headers'. This new one is called just before headers sending and parsing of the body. * 'http_end' callback has been replaced by 'channel_end_analyze'. * It is possible to set a forwarder for TCP channels. It was already possible to do it for HTTP ones. * Forwarders can partially consume forwardable data. For this reason a new HTTP message state was added before HTTP_MSG_DONE : HTTP_MSG_ENDING. Now all filters can define corresponding callbacks (http_forward_data and tcp_forward_data). Each filter owns 2 offsets relative to buf->p, next and forward, to track, respectively, input data already parsed but not forwarded yet by the filter and parsed data considered as forwarded by the filter. At any time, we have the guarantee that a filter cannot parse or forward more input than previous ones. And, of course, it cannot forward more input than it has parsed. 2 macros have been added to retrieve these offsets: FLT_NXT and FLT_FWD. In addition, 2 functions have been added to change the 'next size' and the 'forward size' of a filter. When a filter parses input data, it can alter these data, so the size of these data can vary. This action has an effect on all previous filters that must be handled. 
To do so, the function 'filter_change_next_size' must be called, passing the size variation. In the same spirit, if a filter alters forwarded data, it must call the function 'filter_change_forward_size'. 'filter_change_next_size' can be called in 'http_data' and 'tcp_data' callbacks and only these ones. And 'filter_change_forward_size' can be called in 'http_forward_data' and 'tcp_forward_data' callbacks and only these ones. The data changes are the filter's responsibility, but with some limitations. It must not change already parsed/forwarded data or data that previous filters have not parsed/forwarded yet. Because filters can be used on backends, when the backend is set for a stream, we add filters defined for this backend in the filter list of the stream. But we must only do that when the backend and the frontend of the stream are not the same. Else the same filters are added a second time, leading to undefined behavior. The HTTP compression code had to be moved. So it simplifies the http_response_forward_body function. To do so, the way the data are forwarded has changed. Now, a filter (and only one) can forward data. In a commit to come, this limitation will be removed to let all filters take part in data forwarding. There are 2 new functions that filters should use to deal with this feature: * flt_set_http_data_forwarder: This function sets the filter (using its id) that will forward data for the specified HTTP message. It is possible if it was not already set by another filter _AND_ if no data was yet forwarded (msg->msg_state <= HTTP_MSG_BODY). It returns -1 if an error occurs. * flt_http_data_forwarder: This function returns the filter id that will forward data for the specified HTTP message. If there is no forwarder set, it returns -1. When an HTTP data forwarder is set for the response, the HTTP compression is disabled. Of course, this is not definitive.	2016-02-09 14:53:15 +01:00
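The offset rules described in the filters commit above (each filter tracks a 'next' and a 'forward' offset, cannot get ahead of the filters before it, and cannot forward more than it has parsed) can be modelled in a few lines of C. This is only a sketch under stated assumptions: `struct flt_offsets`, `flt_chain_is_consistent()` and `model_change_next_size()` are hypothetical names that do not exist in HAProxy, both offsets are treated as plain byte counts from the start of the input, and the real FLT_NXT/FLT_FWD macros and 'filter_change_next_size' may behave differently.

```c
#include <stddef.h>

/* Hypothetical per-filter state for this sketch: both offsets are byte
 * counts measured from the start of the input data. */
struct flt_offsets {
    size_t nxt; /* input parsed by this filter */
    size_t fwd; /* input forwarded by this filter */
};

/* Returns 1 when a chain of n filters respects the rules quoted above:
 * a filter never forwards more than it parsed, and never parses or
 * forwards more than the filter placed just before it. */
int flt_chain_is_consistent(const struct flt_offsets *flt, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        if (flt[i].fwd > flt[i].nxt)
            return 0;
        if (i > 0 && (flt[i].nxt > flt[i - 1].nxt ||
                      flt[i].fwd > flt[i - 1].fwd))
            return 0;
    }
    return 1;
}

/* Models the effect of filter idx growing or shrinking the data it
 * parsed by delta bytes: the 'next' offset of every previous filter is
 * shifted by the same amount. Assumption: only previous filters are
 * adjusted, and the caller keeps the resulting offsets non-negative. */
void model_change_next_size(struct flt_offsets *flt, size_t idx, long delta)
{
    for (size_t i = 0; i < idx; i++)
        flt[i].nxt = (size_t)((long)flt[i].nxt + delta);
}
```

With two filters at offsets {10, 6} and {8, 4}, the chain is consistent; if the second filter grows its parsed data by 3 bytes, the first filter's 'next' offset moves from 10 to 13 while the second one is left untouched.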
Willy Tarreau	49008c157e	BUG/MINOR: stream: don't force retries if the server is DOWN Arkadiy Kulev noticed that if a server is marked down while a connection is being established, we still insist on performing retries on the same server, which is absurd. Better perform the redispatch if we already know the server is down. Because of this, it's likely that the observe-l4 and sudden-death mechanisms are not optimal and cannot help much the connection which was used to detect the problem. The fix should be backported to 1.6 and 1.5 at least.	2016-01-25 14:36:20 +01:00
Thierry FOURNIER	337eae1882	BUG/MINOR: stream: bad return code In the error case, we expect the enum ACT_RET_PRS_ERR, but the function "stream_parse_use_service()" returns "-1". This patch must be backported to 1.6.	2015-12-22 13:36:20 +01:00
Willy Tarreau	f25b3573d6	BUG/MEDIUM: stream: fix half-closed timeout handling client-fin and server-fin are bogus. They are applied on the write side after a SHUTR was seen. The immediate effect is that sometimes if a SHUTR was seen after a SHUTW on the same side, the timeout is enabled again regardless of the fact that the output is already closed. This results in the timeout event not being processed and a busy poll loop happening until another timeout on the stream gets rid of it. Note that haproxy continues its job during this, it's just that it eats all the CPU trying to handle an event that it ignores. A reproducible case consists in having a client stop reading data from a server to ensure data remain in the response buffer, then the client sends a shutdown(write). If abortonclose is enabled on haproxy, the shutdown is passed to the server side and the server responds with a SHUTR that cannot immediately be forwarded to the client since the buffer is full. During this time the event is ignored and the task is woken again in loops. It is worth noting that the timeout handling since 1.5 is a bit fragile and that it might be possible that other similar conditions still exist, so the timeout handling should be audited regarding this issue. Many thanks to BaiYang for providing detailed information showing the problem in action. This bug also affects 1.5 thus the fix must be backported.	2015-11-26 10:33:47 +01:00
Thierry FOURNIER	5a363e71b2	MINOR: stream/applet: add use-service action This new target can be called from the frontend or the backend. It is evaluated just before the backend choice and just before the server choice. So, the input stream or HTTP request can be forwarded to a server or to an internal service.	2015-09-28 01:03:48 +02:00
Thierry FOURNIER	27929fbfd7	MINOR: channel: rename function chn_sess to chn_strm The name of the function chn_sess is no longer appropriate. This patch renames it to chn_strm.	2015-09-25 23:27:33 +02:00
Willy Tarreau	2c1068cb67	BUG/MEDIUM: stream: do not dereference strm_li(stream) Some streams do not have a listener (eg: Lua's cosockets) so let's check for this. For now this problem cannot happen but it's definitely unsafe.	2015-09-23 13:42:08 +02:00
Willy Tarreau	a68f7629dd	BUG/MEDIUM: stick-tables: fix double-decrement of tracked entries Mailing list participant "mlist" reported negative conn_cur values in stick tables as the result of "tcp-request connection track-sc". The reason is that after the stick entry is copied from the session to the stream, both the session and the stream grab a reference to the entry and when the stream ends, it decrements one reference and one connection, then the same is done for the session. In fact this problem was already encountered slightly differently in the past and addressed by Thierry using the patch below as it was believed by then to be only a refcount issue since it was the observable symptom : `827752e` "BUG/MEDIUM: stick-tables: refcount error after copying SC..." In reality the problem is that the stream must touch neither the refcount nor the connection count for entries it inherits from the session. While we have no way to tell whether a track entry was inherited from the session (since they're simply memcpy'd), it is possible to prevent the stream from touching an entry that already exists in the session because that's a guarantee that it was inherited from it. Note that it may be a temporary fix. Maybe in the future when a session gives birth to multiple streams we'll face a situation where a session may be updated to add more tracked entries after instantiating some streams. The correct long-term fix is to mark some tracked entries as shared or private (or RO/RW). That will allow the session to track more entries even after the same trackers are being used by early streams. No backport is needed, this is only caused by the session/stream split in 1.6.	2015-09-21 17:48:24 +02:00
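The rule stated in the stick-tables fix above (the stream must not touch the refcount or the connection count of an entry that already exists in the session) can be illustrated with a reduced model. The types and function names below are hypothetical stand-ins written for this sketch, not HAProxy's actual structures, and the number of counters is an assumption.

```c
#include <stddef.h>

#define SKETCH_MAX_STKCTR 3 /* assumed number of tracked counters */

/* Reduced stand-in for a stick-table entry. */
struct sketch_stksess {
    int ref_cnt;  /* references held on the entry */
    int conn_cur; /* concurrent connections accounted on the entry */
};

/* Reduced stand-in for a stick counter slot (session or stream). */
struct sketch_stkctr {
    struct sketch_stksess *entry;
};

/* Called when the stream ends. A slot whose entry is also set in the
 * session was inherited from it, so the stream leaves it alone. */
void sketch_stream_release(struct sketch_stkctr *strm,
                           const struct sketch_stkctr *sess)
{
    for (int i = 0; i < SKETCH_MAX_STKCTR; i++) {
        if (!strm[i].entry)
            continue;
        if (!sess[i].entry) {
            /* Entry tracked by the stream itself: release it here. */
            strm[i].entry->conn_cur--;
            strm[i].entry->ref_cnt--;
        }
        strm[i].entry = NULL;
    }
}

/* Called when the session ends: it releases what it tracked. */
void sketch_session_release(struct sketch_stkctr *sess)
{
    for (int i = 0; i < SKETCH_MAX_STKCTR; i++) {
        if (!sess[i].entry)
            continue;
        sess[i].entry->conn_cur--;
        sess[i].entry->ref_cnt--;
        sess[i].entry = NULL;
    }
}
```

In this model an entry tracked by the session and copied into the stream is decremented exactly once, when the session is released, so conn_cur cannot go negative.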
Willy Tarreau	9c03b33329	BUG/MAJOR: http: don't call http_send_name_header() after an error A crash was reported when using the "famous" http-send-name-header directive. This time it's a bit tricky, it requires a certain number of conditions to be met including maxconn on a server, queuing, timeout in the queue and cookie-based persistence. The problem is that in stream.c, before calling http_send_name_header(), we check a number of conditions to know if we have to replace the header name. But prior to reaching this place, it's possible for sess_update_stream_int() to fail and change the stream-int's state to SI_ST_CLO, send an error 503 to the client, and flush all buffers. But http_send_name_header() can only be called with valid buffer contents matching the http_msg's description. So when it rewinds the stream to modify the header, buf->o becomes negative by the size of the incoming request and is used as the argument to memmove() which basically displaces 4GB of memory off a few bytes to write the new name, resulting in a core and a core file that's really not fun to play with. The solution obviously consists in refraining from calling this nasty function when the stream interface is already closed. This bug also affects 1.5 and possibly 1.4, so the fix must be backported there.	2015-09-07 19:41:42 +02:00
Thierry FOURNIER	236657b5e0	MINOR: stick-tables: Add GPT0 access This patch adds access to GPT0. The access can be done with http and tcp actions, and through a converter.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	7e25df3f17	MAJOR: stick-tables: remove key storage from the key struct Now, the key struct only points to the storage provided by the sample as input.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	136f9d34a9	MINOR: samples: rename union from "data" to "u" The union name "data" makes the source code heavy to read, because it yields expressions such as "data.data.sint". Renaming the union from "data" to "u" makes them easier to read, like "data.u.sint".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	8c542cac07	MEDIUM: samples: Use the "struct sample_data" in the "struct sample" This patch removes the struct information stored both in the struct sample_data and in the struct sample. Now, only the struct sample_data contains data, and the struct sample uses the struct sample_data for storing its own data.	2015-08-20 17:13:46 +02:00
Willy Tarreau	0f4eadd483	BUG/MEDIUM: counters: ensure that src_{inc,clr}_gpc0 creates a missing entry During 1.5-dev20 there was some code refactoring to make the src_* fetch function use the same code as sc_*. Unfortunately this introduced a regression where src_* doesn't create an entry anymore if it does not exist in the table. The reason is that smp_fetch_sc_stkctr() only calls stktable_lookup_key() while src_inc_*/src_clr_* used to make use of stktable_update_key() which additionally creates the entry if it does not exist. There's no point modifying the common function for these two exceptions, so instead we now have a function dedicated to the creation of this entry for src_* only. It is called when the entry didn't exist, so that requires minimal modifications to existing code. Thanks to Thierry Fournier for helping diagnose the issue. This fix must be backported to 1.5.	2015-08-18 17:15:20 +02:00
Thierry FOURNIER	827752e7ec	BUG/MEDIUM: stick-tables: refcount error after copying SC for the session to the stream Each copied SC counter will be used and released by the stream, so we need to increment the refcount.	2015-08-18 13:39:49 +02:00
Thierry FOURNIER	c8fdb983c5	BUG/MEDIUM: stream: The stream doesn't inherit SC from the session During the processing of tcp-request connection, the stream doesn't exist, so the stick counters are stored in the session. When the stream is created it must inherit from the session sc. This patch fixes this behavior. [WT: this is specific to 1.6, no backport needed]	2015-08-16 23:55:32 +02:00
Remi Gacogne	7fb9de25e2	MINOR: stream: initialize the current_rule field to NULL on stream init Currently it is possible for the current_rule field to be evaluated before being set, leading to valgrind complaining: ==16783== Conditional jump or move depends on uninitialised value(s) ==16783== at 0x44E662: http_res_get_intercept_rule (proto_http.c:3730) ==16783== by 0x44E662: http_process_res_common (proto_http.c:6528) ==16783== by 0x4797B7: process_stream (stream.c:1851) ==16783== by 0x414634: process_runnable_tasks (task.c:238) ==16783== by 0x40B02F: run_poll_loop (haproxy.c:1528) ==16783== by 0x407F25: main (haproxy.c:1887) This was introduced by commit `152b81e7b2`.	2015-07-26 21:50:50 +02:00
Thierry FOURNIER	bf65cd4d77	MAJOR: arg: converts uint and sint in sint This patch removes the 32 bits unsigned integer and the 32 bit signed integer. It replaces these types by a unique type 64 bit signed.	2015-07-22 00:48:23 +02:00
Thierry FOURNIER	07ee64ef4d	MAJOR: sample: converts uint and sint in 64 bits signed integer This patch removes the 32-bit unsigned integer and the 32-bit signed integer. It replaces these types with a single 64-bit signed type. This makes integers easier to use and clarifies signed and unsigned usage. With the previous version, signed and unsigned were used in place of one another, and sometimes a converter lost the sign. For example, divisions were processed as "unsigned", so if one operand was negative, the result was wrong. Note that the integer pattern matching and dotted version pattern matching are already working with signed 64-bit integer values. There is one user-visible change : the "uint()" and "sint()" sample fetch functions which used to return a constant integer have been replaced with a new more natural, unified "int()" function. These functions were only introduced in the latest 1.6-dev2 so there's no impact on regular deployments.	2015-07-22 00:48:23 +02:00
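The division problem mentioned in the commit above is easy to reproduce in plain C. The two helpers below are hypothetical and only illustrate the arithmetic; they are not HAProxy converters. Callers are assumed to pass a non-zero divisor.

```c
#include <stdint.h>

/* Divides two 32-bit values after converting them to unsigned, which is
 * what goes wrong when an operand is negative: -10 is first turned into
 * 4294967286 before the division takes place. */
uint32_t sketch_div_u32(int32_t a, int32_t b)
{
    return (uint32_t)a / (uint32_t)b;
}

/* Divides two 64-bit signed values, keeping the sign of the operands.
 * The caller must also avoid INT64_MIN / -1, which overflows. */
int64_t sketch_div_s64(int64_t a, int64_t b)
{
    return a / b;
}
```

Dividing -10 by 2 yields 2147483643 through the unsigned path and -5 through the signed 64-bit path, which is the kind of discrepancy a single signed 64-bit type avoids.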
Willy Tarreau	069b0c8e6f	CLEANUP: stream: remove a useless call to si_detach() It's pointless to call si_detach() after si_reset() since it does nothing due to si->end being NULL.	2015-07-19 17:57:02 +02:00
Willy Tarreau	42529c38ac	MINOR: stream: maintain consistence between channel_forward and HTTP forward When the HTTP forwarder is used, it resets msg->sov so that we know that the parsing pointer has advanced by exactly (msg->eoh + msg->eol - msg->sov) bytes which may have to be rewound in case we want to perform an HTTP fetch after forwarding has started (eg: upon connect). But when the backend is in TCP mode, there may be no HTTP forwarding analyser installed, still we may want to perform these HTTP fetches in case we have already ensured at the TCP layer that we have a properly parsed HTTP transaction. In order to solve this, we reset msg->sov before doing a channel_forward() so that we can still compute http_rewind() on the pending data. That ensures the buffer is always rewindable even in mixed TCP+HTTP mode.	2015-07-10 11:37:29 +02:00
Willy Tarreau	ebcd4844e8	MEDIUM: vars: move the session variables to the session, not the stream It's important that the session-wide variables are in the session and not in the stream.	2015-06-19 11:59:02 +02:00
Emeric Brun	57056f0347	MEDIUM: peers: re-schedule stick-table's entry for sync when data is modified. This was correctly done for data of type 'serverid' but it is now necessary for all stick-tables data-types if we want to perform their sync.	2015-06-16 16:11:36 +02:00
Thierry FOURNIER	4834bc773c	MEDIUM: vars: adds support of variables This patch adds support of variables during the processing of each stream. The variables scope can be set as 'session', 'transaction', 'request' or 'response'. The variable type is the type returned by the assignment expression. The type can change while the processing. The allocated memory can be controlled for each scope and each request, and for the global process.	2015-06-13 23:01:37 +02:00
Willy Tarreau	9e0bb1013e	CLEANUP: proxy: make the proxy lookup functions more user-friendly First, findproxy() was renamed proxy_find_by_name() so that it's explicit that a name is required for the lookup. Second, we give this function the ability to search for tables if needed. Third we now provide inline wrappers to pass the appropriate PR_CAP_* flags and to explicitly look up a frontend, backend or table.	2015-05-26 11:24:42 +02:00
Joseph Lynch	726ab7145c	MEDIUM: backend: Allow redispatch on retry intervals For backend load balancing it sometimes makes sense to redispatch rather than retrying against the same server. For example, when machines or routers fail you may not want to waste time retrying against a dead server and would instead prefer to immediately redispatch against other servers. This patch allows backend sections to specify that they want to redispatch on a particular interval. If the interval N is positive the redispatch occurs on every Nth retry, and if the interval N is negative then the redispatch occurs on the Nth retry prior to the last retry (-1 is the default and maintains backwards compatibility). In low latency environments tuning this setting can save a few hundred milliseconds when backends fail.	2015-05-22 07:07:40 +02:00
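One way to read the interval rule from the redispatch commit above is sketched below. The function name, its parameters, and the exact arithmetic for negative intervals are assumptions made for illustration; the actual condition in HAProxy's stream code may differ.

```c
/* Returns 1 if the next attempt should be redispatched to another
 * server, 0 if it should be retried on the same one.
 *
 *   max_retries  - configured number of retries (assumed > 0)
 *   retries_left - retries not consumed yet
 *   interval     - redispatch interval N from the commit message
 *
 * Assumed reading: a positive N redispatches on every Nth retry; a
 * negative N counts back from the last retry, so -1 redispatches only
 * once all retries but the last have been spent on the same server. */
int sketch_should_redispatch(int max_retries, int retries_left, int interval)
{
    int done = max_retries - retries_left; /* retries consumed so far */
    int period;

    if (done <= 0 || interval == 0)
        return 0;

    period = (interval > 0) ? interval : max_retries + 1 + interval;
    if (period <= 0)
        return 0; /* interval reaches further back than the retry budget */

    return done % period == 0;
}
```

With 3 retries and the default interval of -1, this model redispatches only once all three retries have been consumed; with 6 retries and an interval of 2, it redispatches after the 2nd, 4th and 6th retry.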
Thierry FOURNIER	0786d05a04	MEDIUM: sample: change the prototype of sample-fetches functions This patch removes the "opt" entry from the prototype of the sample-fetches functions. This permits to remove some weight in the prototype call.	2015-05-11 20:03:08 +02:00
Thierry FOURNIER	0a9a2b8cec	MEDIUM: sample change the prototype of sample-fetches and converters functions This patch removes the structs "session", "stream" and "proxy" from the sample-fetches and converters function prototypes. This permits to remove some weight in the prototype call.	2015-05-11 20:01:42 +02:00
CJ Ess	6eac32ebc1	BUILD: Compile clean when debug options defined Here you go!	2015-05-03 09:17:43 +02:00
Willy Tarreau	748179eb5a	MEDIUM: stream: move HTTP request body analyser before process_common Since 1.5, the request body analyser has become independent from any other element and does not even disturb the message forwarder anymore. And since it's disabled by default, we can place it before most analysers so that it can preempt any other one if an intermediary one enables it.	2015-05-02 00:10:44 +02:00
Willy Tarreau	fe127937a8	MEDIUM: applet: make the applets only use si_applet_{cant|want|stop}_{get|put} The applets don't fiddle with SI_FL_WAIT_ROOM anymore, instead they indicate what they want, possibly that they failed (eg: WAIT_ROOM), and it's done() / update() which finally updates the WAIT_* flags according to the channels' and stream interface's states. This solves the issue of the pauses during a "show sess" without creating busy loops.	2015-04-23 17:56:17 +02:00
Willy Tarreau	bc39a5d8c8	MAJOR: stream: do not allocate request buffers anymore when the left side is an applet We used to allocate a request buffer so that we could process applets from process_stream(), and this was causing some trouble because it was not possible for an analyzer to return an error to an applet, which we'll need for HTTP/2. Now that we don't call applets anymore from process_stream() we can simplify this and ensure that a response is always allocated to process a stream.	2015-04-23 17:56:17 +02:00
Willy Tarreau	563cc37609	MAJOR: stream: use a regular ->update for all stream interfaces Now si->update() is used to update any type of stream interface, whether it's an applet, a connection or even nothing. We don't call si_applet_call() anymore at the end of the resync and we don't have the risk that the stream's task is reinserted into the run queue, which makes the code a bit simpler. The stream_int_update_applet() function was simplified to ensure that it remained compatible with this standardized calling convention. It was almost copy-pasted from the update code dedicated to connections. Just like for si_applet_done(), it seems that it should be possible to merge the two functions except that it would require some slow operations, except maybe if the type of end point is tested inside the update function itself.	2015-04-23 17:56:16 +02:00
Willy Tarreau	8a8d83b85c	REORG: applet: move the applet definitions out of stream_interface We're tidying the definitions so that appctx lives on its own. A new set of applet.h files has been added for this purpose.	2015-04-23 17:56:16 +02:00
Willy Tarreau	73b65acd46	MINOR: stream: pass the pointer to the origin explicitly to stream_new() We don't pass sess->origin anymore but the pointer to the previous step. Now it should be much easier to chain elements together once applets are moved out of streams. Indeed, the session is only used for configuration and not for the dynamic chaining anymore.	2015-04-08 18:26:29 +02:00
Willy Tarreau	678be62981	MEDIUM: session: adjust the connection flags before stream_new() It's not the stream's job to manipulate the connection's flags, it's more related to the session that accepted the new connection. And the only case where we have to do it conditionally is based on the frontend which is known from the session, thus it makes sense to do it there.	2015-04-08 18:18:15 +02:00
Willy Tarreau	042cd75bc2	MINOR: session: maintain the session count stats in the session, not the stream This has nothing to do in the stream, as we'll face absurdities when chaining multiple streams. The session is where it must be accounted for.	2015-04-08 18:10:49 +02:00
Willy Tarreau	d1769b8b9a	MEDIUM: stream: don't rely on the session's listener anymore in stream_new() When the stream is instantiated from an applet, it doesn't necessarily have a listener. The listener was sparsely used there, just to retrieve the task function, update the listeners' stats, and set the analysers and default target, both of which are often zero from applets. Thus these elements are now initialized with default values that the caller is free to change if desired.	2015-04-06 11:37:35 +02:00
Willy Tarreau	fb9f584956	CLEANUP: frontend: remove one useless local variable "p" was a copy of sess->fe which itself is also used. It's used in a few cases, let's remove it and reduce the code as well.	2015-04-06 11:37:35 +02:00
Willy Tarreau	c8815efb06	MEDIUM: frontend: move some remaining stream settings to stream_new() The auto-forwarding mechanism in case no analyser is set is generic to the streams. Also the timeouts on the client side are better preset in the stream initialization as well.	2015-04-06 11:37:35 +02:00
Willy Tarreau	e0232f1e33	MEDIUM: frontend: don't restrict frontend_accept() to connections anymore Now it can also initialize streams initiated by applets. This will be needed for HTTP/2.	2015-04-06 11:37:35 +02:00
Willy Tarreau	99eb0f13bc	MEDIUM: frontend: move some stream initialisation to stream_new() This is mostly what is related to logging, and which is more of a stream initialization than anything frontend-specific.	2015-04-06 11:37:35 +02:00
Willy Tarreau	02d863866d	MEDIUM: stream: return the stream upon accept() The function was called stream_accept_session(), let's rename it stream_new() and make it return the newly allocated pointer. It's more convenient for some callers who need it.	2015-04-06 11:37:34 +02:00
Willy Tarreau	c5a7ff4a86	MEDIUM: frontend: make ->accept only return +/-1 This function was specified as being able to return 3 states, which had repercussions to the stream accept function. It was used at the time when the frontend would do the monitoring itself. This is not the case anymore, so let's simplify this.	2015-04-06 11:37:34 +02:00
Willy Tarreau	470280ef63	MEDIUM: stream: also accept appctx as origin in stream_accept_session() It's likely that the code could be simplified a bit though.	2015-04-06 11:37:34 +02:00
Willy Tarreau	f2b9874bcf	MEDIUM: stream: isolate connection-specific initialization code In stream_accept_session(), we perform some operations that explicitly want a connection as the origin, but we'll soon have other types of origin (eg: applet). Thus change the test to ensure we only call this code with connections. Additionally, we refrain from calling fe->accept() if the origin is not a connection, because for now the only fe->accept() may only use a connection (frontend_accept).	2015-04-06 11:37:34 +02:00
Willy Tarreau	ce7eec9186	CLEANUP: stream.c: do not re-attach the connection to the stream This was a leftover from the initial code, it's not needed at all anymore.	2015-04-06 11:37:33 +02:00
Willy Tarreau	9903f0e1a2	REORG: session: move the session parts out of stream.c This concerns everything related to accepting a new session and expiring the embryonic session. There's still a hard-coded call to stream_accept_session() which could be set somewhere in the frontend, but for now it's not a problem.	2015-04-06 11:37:32 +02:00
Willy Tarreau	32990b531b	MEDIUM: session: remove the task pointer from the session Now that the previous changes were made, we can add a struct task pointer to stream_complete() and get rid of it in struct session. The new relation between connection, session and task are like this : orig -- sess <-- context \| \| v \| conn -- owner ---> task Some session-specific parts should now move away from stream.	2015-04-06 11:37:32 +02:00
Willy Tarreau	02a0c0e407	MAJOR: stream: don't initialize the stream anymore in stream_accept The function now only initializes a session, calls the tcp req connection rules, and calls stream_complete() to finish initialization. If a handshake is needed, it is done without allocating the stream at all. Temporarily, in order to limit the amount of changes, the task allocated is put into sess->task, and it is used by the connection for the handshake or is offered to the stream. At this point we set the relation between sess/task/conn this way : orig -- sess <-- context \| ^ +- task -+ \| v \| v \| conn -- owner task The task must not remain in the session and ultimately it is planned to remove this task pointer from the session because it can be found by having conn->owner = task, and looping back from sess to conn, and to find the session from the connection via the task.	2015-04-06 11:37:32 +02:00
Willy Tarreau	5ecb069c58	MEDIUM: stream: move all the session-specific stuff of stream_accept() earlier Since the tcp-request connection rules don't need the stream anymore, we can safely move the session-specific stuff earlier and prepare for a split of session and stream initialization. Some work remains to be done.	2015-04-06 11:37:31 +02:00
Willy Tarreau	1df0cc6466	MEDIUM: stream: don't call stream_store_counters() in kill_mini_session() nor session_accept() This one is not needed anymore since we cannot track the stream counters prior to reaching these locations. Only session counters may be tracked and they're properly committed during session_free().	2015-04-06 11:37:31 +02:00
Willy Tarreau	e73ef85a63	MAJOR: tcp: make tcp_exec_req_rules() only rely on the session It passes a NULL wherever a stream was needed (acl_exec_cond() and action_ptr mainly). It can still track the connection rate correctly and block based on ACLs.	2015-04-06 11:37:31 +02:00
Willy Tarreau	8b7f8688ee	MEDIUM: streams: support looking up stkctr in the session In order to support sessions tracking counters, we first ensure that there is no overlap between streams' stkctr and sessions', and we allow an automatic lookup into the session's counters when the stream doesn't have a counter or when the stream doesn't exist during an access via a sample fetch. The functions used to update the stream counters only update them and not the session counters however.	2015-04-06 11:37:31 +02:00
Willy Tarreau	b2bf8331fb	MINOR: session: add stick counters to the struct session The stick counters in the session will be used for everything not related to contents, hence the connections / concurrent sessions / etc. They will be usable by "tcp-request connection" rules even without a stream. For now they're just allocated and initialized.	2015-04-06 11:37:30 +02:00
Willy Tarreau	11c3624c32	MINOR: session: implement session_free() and use it everywhere We want to call this one everywhere we have to kill a session so that future parts we move to the session can be released from there.	2015-04-06 11:37:30 +02:00
Willy Tarreau	8d2eca73eb	MINOR: session: don't rely on s->logs.logwait in embryonic sessions The embryonic session now only stores the task (to expire the session) and the stick counters.	2015-04-06 11:37:30 +02:00
Willy Tarreau	7ea671b914	MINOR: session: store the session's accept date Doing so ensures we don't need to use the stream anymore to prepare the log information to report a failed handshake on an embryonic session. Thus, prepare_mini_sess_log_prefix() now takes a session in argument.	2015-04-06 11:37:30 +02:00
Willy Tarreau	15b5e14faa	MINOR: stream: move session initialization before the stream's In an effort to completely separate stream from session, this patch further separates the two by initializing the session before the stream.	2015-04-06 11:37:29 +02:00
Willy Tarreau	1f52bb27ec	CLEANUP: stream: don't set ->target to the incoming connection anymore Now that we have sess->origin to carry that information along, we don't need to put that into strm->target anymore, so we remove one dependence on the stream in embryonic connections.	2015-04-06 11:37:29 +02:00
Willy Tarreau	d0d8da989b	MINOR: stream: provide a few helpers to retrieve frontend, listener and origin Expressions are quite long when using strm_sess(strm)->whatever, so let's provide a few helpers : strm_fe(), strm_li(), strm_orig().	2015-04-06 11:37:29 +02:00
Willy Tarreau	192252e2d8	MAJOR: sample: pass a pointer to the session to each sample fetch function Many such functions need a session, and till now they used to dereference the stream. Once we remove the stream from the embryonic session, this will not be possible anymore. So as of now, sample fetch functions will be called with this : - sess = NULL, strm = NULL : never - sess = valid, strm = NULL : tcp-req connection - sess = valid, strm = valid, strm->txn = NULL : tcp-req content - sess = valid, strm = valid, strm->txn = valid : http-req / http-res	2015-04-06 11:37:25 +02:00
Willy Tarreau	15e91e1b36	MAJOR: sample: don't pass l7 anymore to sample fetch functions All of them can now retrieve the HTTP transaction if it exists from the stream and be sure to get NULL there when called with an embryonic session. The patch is a bit large because many locations were touched (all fetch functions had to have their prototype adjusted). The opportunity was taken to also uniformize the call names (the stream is now always "strm" instead of "l4") and to fix indent where it was broken. This way when we later introduce the session here there will be less confusion.	2015-04-06 11:35:53 +02:00
Willy Tarreau	eee5b51248	MAJOR: http: move http_txn out of struct stream Now this one is dynamically allocated. It means that 280 bytes of memory are saved per TCP stream, but more importantly that it will become possible to remove the l7 pointer from fetches and converters since it will be deduced from the stream and will support being null. A lot of care was taken because it's easy to forget a test somewhere, and the previous code used to always trust s->txn for being valid, but all places seem to have been visited. All HTTP fetch functions check the txn first so we shouldn't have any issue there even when called from TCP. When branching from a TCP frontend to an HTTP backend, the txn is properly allocated at the same time as the hdr_idx.	2015-04-06 11:35:52 +02:00
Willy Tarreau	63986c72c8	MINOR: http: create a dedicated pool for http_txn This one will not necessarily be allocated for each stream, and we want to use the fact that it equals null to know it's not present so that we can always deduce its presence from the stream pointer. This commit only creates the new pool.	2015-04-06 11:35:52 +02:00
Willy Tarreau	cb7dd015be	MEDIUM: http: move header captures from http_txn to struct stream The header captures are now general purpose captures since tcp rules can use them to capture various contents. That removes a dependency on http_txn that appeared in some sample fetch functions and in the order by which captures and http_txn were allocated. Interestingly the reset of the header captures was done at too many places as http_init_txn() used to do it while it was done previously in every call place.	2015-04-06 11:35:52 +02:00
Willy Tarreau	9ad7bd48d2	MEDIUM: session: use the pointer to the origin instead of s->si[0].end When s->si[0].end was dereferenced as a connection or anything in order to retrieve information about the originating session, we'll now use sess->origin instead so that when we have to chain multiple streams in HTTP/2, we'll keep accessing the same origin.	2015-04-06 11:34:29 +02:00
Willy Tarreau	40606ab976	MINOR: session: add a pointer to the session's origin A session's origin is the entity that was responsible for creating the session. It can be an applet or a connection for now.	2015-04-06 11:23:58 +02:00
Willy Tarreau	e36cbcb3b0	MEDIUM: stream: move the frontend's pointer to the session Just like for the listener, the frontend is session-wide so let's move it to the session. There are a lot of places which were changed but the changes are minimal in fact.	2015-04-06 11:23:58 +02:00
Willy Tarreau	fb0afa77c9	MEDIUM: stream: move the listener's pointer to the session The listener is session-specific, move it there.	2015-04-06 11:23:57 +02:00
Willy Tarreau	feb764040d	MEDIUM: stream: allocate the session when a stream is created This is where we'll put some session-wide information.	2015-04-06 11:23:57 +02:00
Willy Tarreau	b1ec8c4a59	MINOR: session: start to reintroduce struct session There is now a pointer to the session in the stream, which is NULL for now. The session pool is created as well. Some parts will move from the stream to the session now.	2015-04-06 11:23:57 +02:00
Willy Tarreau	e7dff02dd4	REORG/MEDIUM: stream: rename stream flags from SN_* to SF_* This is in order to keep things consistent.	2015-04-06 11:23:57 +02:00
Willy Tarreau	87b09668be	REORG/MAJOR: session: rename the "session" entity to "stream" With HTTP/2, we'll have to support multiplexed streams. A stream is in fact the largest part of what we currently call a session, it has buffers, logs, etc. In order to catch any error, this commit removes any reference to the struct session and tries to rename most "session" occurrences in function names to "stream" and "sess" to "strm" when that's related to a session. The files stream.{c,h} were added and session.{c,h} removed. The session will be reintroduced later and a few parts of the stream will progressively be moved over there. It will more or less contain only what we need in an embryonic session. Sample fetch functions and converters will have to change a bit so that they'll use an L5 (session) instead of what's currently called "L4" which is in fact L6 for now. Once all changes are completed, we should see approximately this : L7 - http_txn L6 - stream L5 - session L4 - connection | applet There will be at most one http_txn per stream, and the same session will possibly be referenced by multiple streams. A connection will point to a session and to a stream. The session will hold all the information we need to keep even when we don't yet have a stream. Some more cleanup is needed because some code was already far from being clean. The server queue management still refers to sessions at many places while comments talk about connections. This will have to be cleaned up once we have a server-side connection pool manager. Stream flags "SN_*" still need to be renamed, it doesn't seem like any of them will need to move to the session.	2015-04-06 11:23:56 +02:00
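
Commit 192252e2d8 lists the argument combinations a sample fetch function can receive, and commit 87b09668be describes the L5/L6/L7 layering behind them. The following is only a rough sketch of how those combinations could be told apart. The struct layouts, the `fetch_ctx` enum and the `classify_fetch_ctx()` helper are hypothetical simplifications made up for illustration and are not HAProxy's actual definitions. The combination "sess = NULL, strm = valid" is not mentioned in the commit message, so treating it as invalid is an assumption.

```c
#include <stddef.h>

/* Hypothetical, heavily simplified stand-ins for the real structures. */
struct http_txn { int status; };          /* L7: at most one per stream */
struct session  { void *origin; };        /* L5: may be shared by several streams */
struct stream {                           /* L6 */
    struct session  *sess;
    struct http_txn *txn;                 /* NULL when no HTTP transaction exists */
};

/* Hypothetical labels for the calling contexts listed in commit 192252e2d8. */
enum fetch_ctx {
    CTX_INVALID,          /* sess == NULL: documented as "never" when strm is NULL too */
    CTX_TCP_REQ_CONN,     /* sess valid, strm NULL */
    CTX_TCP_REQ_CONTENT,  /* sess valid, strm valid, strm->txn NULL */
    CTX_HTTP              /* sess valid, strm valid, strm->txn valid */
};

/* Derive the calling context from the two pointers a fetch function receives. */
static enum fetch_ctx classify_fetch_ctx(const struct session *sess,
                                         const struct stream *strm)
{
    if (sess == NULL)
        return CTX_INVALID;           /* assumption: any NULL session is invalid */
    if (strm == NULL)
        return CTX_TCP_REQ_CONN;      /* embryonic session, no stream yet */
    if (strm->txn == NULL)
        return CTX_TCP_REQ_CONTENT;   /* stream exists, no HTTP transaction */
    return CTX_HTTP;                  /* http-req / http-res */
}
```

A fetch function that needs HTTP data would, under this sketch, only proceed when the result is `CTX_HTTP`, which mirrors the remark in commit eee5b51248 that HTTP fetch functions check the txn first.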


406 Commits