haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-09 16:47:18 +02:00

Author	SHA1	Message	Date
Willy Tarreau	6f5e4b98df	MEDIUM: session: take care of incrementing/decrementing jobs Each user of a session increments/decrements the jobs variable at its own place, resulting in a real mess and inconsistencies between them. Let's have session_new() increment jobs and session_free() decrement it.	2017-09-15 11:49:52 +02:00
Andjelko Iharos	c3680ecdf8	MINOR: add severity information to cli feedback messages	2017-09-13 13:38:32 +02:00
Emeric Brun	52a91d3d48	MEDIUM: check: server states and weight propagation re-work The server state and weight were reworked to handle "pending" values updated by checks/CLI/LUA/agent. These values are committed to be propagated to the LB stack. In further dev related to multi-thread, the commit will be handled at a sync point. Pending values are named using the prefix 'next_'. Current values used by the LB stack are named using the prefix 'cur_'.	2017-09-05 15:23:16 +02:00
Christopher Faulet	35fe699ec7	BUG/MEDIUM: http: Fix a regression bug when a HTTP response is in TUNNEL mode Unfortunately, a regression bug was introduced in the commit `1486b0ab` ("BUG/MEDIUM: http: Switch HTTP responses in TUNNEL mode when body length is undefined"). HTTP responses with undefined body length are blocked until timeout when the compression is enabled. This bug was fixed in commit `69744d92` ("BUG/MEDIUM: http: Fix blocked HTTP/1.0 responses when compression is enabled"). The bug is still the same. We do not forward response data because we are waiting for the synchronization between the HTTP request and the response. To fix the bug, the conditions to infinitely forward channel data have been slightly relaxed. Now, it is done if there is no more analyzer registered on the channel or if _FLT_END analyzer is still there but without the flag CF_FLT_ANALYZE. This last condition is only possible when a channel is waiting for the end of the other side. So, fundamentally, it means that no one is analyzing the channel anymore. This is a transitional state during a sync phase. This patch must be backported to 1.7.	2017-09-05 10:00:58 +02:00
Willy Tarreau	5790eb0a76	MINOR: stream: provide a new stream creation function for connections The purpose will be to create new streams for a given connection so that we can later abstract this from a mux.	2017-08-30 07:06:39 +02:00
Willy Tarreau	87787acf72	MEDIUM: stream: make stream_new() allocate its own task Currently a task is allocated in session_new() and serves two purposes : - either the handshake is complete and it is offered to the stream via the second arg of stream_new() - or the handshake is not complete and it's diverted to be used as a timeout handler for the embryonic session and repurposed once we land into conn_complete_session() Furthermore, the task's process() function was taken from the listener's handler in conn_complete_session() prior to being replaced by a call to stream_new(). This will become a serious mess with the mux. Since it's impossible to have a stream without a task, this patch removes the second arg from stream_new() and makes this function allocate its own task. In session_accept_fd(), we now only allocate the task if needed for the embryonic session and delete it later.	2017-08-30 07:05:04 +02:00
Willy Tarreau	585744bf2e	REORG/MEDIUM: connection: introduce the notion of connection handle Till now connections used to rely exclusively on file descriptors. It was planned in the past that alternative solutions would be implemented, leading to member "union t" presenting sock.fd only for now. With QUIC, the connection will need to continue to exist but will not rely on a file descriptor but a connection ID. So this patch introduces a "connection handle" which is either a file descriptor or a connection ID, to replace the existing "union t". We've now removed the intermediate "struct sock" which was never used. There is no functional change at all, though the struct connection was inflated by 32 bits on 64-bit platforms due to alignment.	2017-08-24 19:30:04 +02:00
Willy Tarreau	85cb0aecf5	BUG/MEDIUM: stream: properly set the required HTTP analysers on use-service Commit 4850e51 ("BUG/MAJOR: lua: Do not force the HTTP analysers in use-services") fixed a bug in how services are used in Lua, but this fix broke the ability for Lua services to support keep-alive. The cause is that we branch to a service while we have not yet set the body analysers on the request nor the response, and when we start to deal with the response we don't have any request analyser anymore. This leads the response forward engine to detect an error and abort. It's very likely that this also causes some random truncation of responses though this has not been observed during the tests. The root cause is not the Lua part in fact, the commit above was correct, the problem is the implementation of the "use-service" action. When done in an HTTP request, it bypasses the load balancing decisions and the connect() phase. These ones are normally the ones preparing the request analysers to parse the body when keep-alive is set. This should be dealt with in the main process_use_service() function in fact. That's what this patch does. If process_use_service() is called from the http-request rule set, it enables the XFER_BODY analyser on the request (since the same is always set on the response). Note that it's exactly what is being done on the stats page which properly supports keep-alive and compression. This fix must be backported to 1.7 and 1.6 as the breakage appeared in 1.6.3.	2017-08-23 16:11:38 +02:00
Willy Tarreau	2bfd35885e	MINOR: stream: link the stream to its session Now each stream is added to the session's list of streams, so that it will be possible to know all the streams belonging to a session, and to know if any stream is still attached to a session.	2017-08-18 13:26:35 +02:00
Willy Tarreau	7632548d97	BUG/MAJOR: stream: in stream_free(), close the front endpoint and not the origin stream_free() used to close the front connection by using s->sess->origin, instead of using s->si[0].end. This is very visible in HTTP/2 where the front connection is abusively closed and causes all sort of issues including crashes caused by double closes due to the same origin being referenced many times. It's also suspected that it may have caused some of the early issues met during the Lua development. It's uncertain whether stable branches are affected. It might be worth backporting it once it has been confirmed not to create new impacts.	2017-08-17 18:26:56 +02:00
Willy Tarreau	46d5b0872a	BUG/MEDIUM: stream: don't retry SSL connections which fail the SNI name check Commits `2ab8867` ("MINOR: ssl: compare server certificate names to the SNI on outgoing connections") and `96c7b8d` ("BUG/MINOR: ssl: Fix check against SNI during server certificate verification") made it possible to check that the server's certificate matches the name presented in the SNI field. While it solves a class of problems, it opens another one which is that by failing such a connection, we'll retry it and put more load on the server. It can be a real problem if a user can trigger this issue, which is what will very often happen when the SNI is forwarded from the client to the server. This patch solves this by detecting that this very specific hostname verification failed and that the hostname was provided using SNI, and then it simply disables retries and the failure is immediate. At the time of writing this patch, the previous patches were not backported (yet), so no backport is needed for this one unless the aforementioned patches are backported as well. This patch requires previous patches "BUG/MINOR: ssl: make use of the name in SNI before verifyhost" and "MINOR: ssl: add a new error code for wrong server certificates".	2017-07-28 12:06:05 +02:00
Christopher Faulet	cdaea89a0c	BUG/MINOR: stream: Don't forget to remove CF_WAKE_ONCE flag on response channel This flag can be set on a channel to pretend there is activity on it. This is a way to wake-up the corresponding stream and evaluate stream analyzers on the channel. It is correctly handled on both channels but removed only on the request channel. This patch is flagged as a bug but for now, CF_WAKE_ONCE is never set on the response channel.	2017-07-06 23:06:47 +02:00
Emeric Brun	c730606879	MAJOR: applet: applet scheduler rework. In order to allow appctx_wakeup to be called on a running task: - from within the task handler itself. - in the future, from another thread. The appctx is considered paused by default after running the handler. The handler should explicitly call appctx_wakeup to be called again. When appctx_free is called on a running handler, the real free is postponed until the end of the handler's processing.	2017-06-27 14:38:02 +02:00
Willy Tarreau	d62b98c6e8	MINOR: stream: don't set backend's nor response analysers on SF_TUNNEL In order to implement hot-pluggable applets like we'll need for HTTP/2 which will speak a different protocol than the expected one, it will be mandatory to be able to clear all analysers from the request and response channel and/or to keep only the ones the applet initializer installed. Unfortunately for now in sess_establish() we systematically place a number of analysers inherited from the frontend, backend and some hard-coded ones. This patch reuses the now unused SF_TUNNEL flag on the stream to indicate we're dealing with a tunnel and don't want to add more analysers anymore. It will be usable to install such a specific applet. Ideally over the long term it might be nice to be able to set the mode on the stream instead of the proxy so that we can decide to change a stream's mode (eg: TCP, HTTP, HTTP/2) at run time. But it would require many more changes for a gain which is not yet obvious.	2017-06-27 14:38:02 +02:00
Willy Tarreau	9b82d941c5	MEDIUM: stream: make stream_new() always set the target and analysers It doesn't make sense that stream_new() doesn't set the target nor analysers and that the caller has to do it even if it doesn't know about streams (eg: in session_accept_fd()). This causes trouble for H2 where the applet handling the protocol cannot properly change this information during its init phase. Let's ensure it's always set and that the callers don't set it anymore. Note: peers and lua don't use analysers and that's properly handled.	2017-06-27 14:38:02 +02:00
Emeric Brun	5f77fef34e	MINOR: task/stream: tasks related to a stream must be init by the caller. The task_wakeup was called on stream_new, but the task/stream wasn't fully initialized yet. The task_wakeup must be called explicitly by the caller once the task/stream is initialized.	2017-06-27 14:38:02 +02:00
Christopher Faulet	c0c672a2ab	BUG/MINOR: http: Fix conditions to clean up a txn and to handle the next request To finish an HTTP transaction and to start the new one, we check, among other things, that there is enough space in the response buffer to eventually inject a message during the parsing of the next request. Because these messages can reach the maximum buffer size, it is mandatory to have an empty response buffer. Remaining input data are trimmed during the txn cleanup (in http_reset_txn), so we just need to check that the output data were flushed. The current implementation depends on channel_congested, which does check the reserved area is available. That's of course not good enough. There are other tests on the response buffer in http_wait_for_request. But conditions to move on are almost the same. So, we can imagine some scenarios where some output data remaining in the response buffer during the request parsing prevent any message injection. To fix this bug, we just wait until output data have been flushed before cleaning up the HTTP txn (ie. s->res.buf->o == 0). In addition, in http_reset_txn we realign the response buffer (note the buffer is empty at this step). Thanks to these changes, there is no more need to set CF_EXPECT_MORE on the response channel in http_end_txn_clean_session. And more importantly, there is no more need to check the response buffer state in http_wait_for_request. This removes a workaround on response analysers to handle HTTP pipelining. This patch can be backported to 1.7, 1.6 and 1.5.	2017-03-31 14:36:20 +02:00
Hongbo Long	e39683c4d4	BUG/MEDIUM: stream: fix client-fin/server-fin handling A tcp half connection can cause 100% CPU on expiration. First reproduced with this haproxy configuration : global tune.bufsize 10485760 defaults timeout server-fin 90s timeout client-fin 90s backend node2 mode tcp timeout server 900s timeout connect 10s server def 127.0.0.1:3333 frontend fe_api mode tcp timeout client 900s bind :1990 use_backend node2 I.e. with timeout server-fin shorter than timeout server: the backend server sends data, this packet is left in haproxy's buffer, the backend server then sends a FIN packet, and haproxy receives the FIN packet. At this time the session information is as follows: 0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2 srv=def ts=08 age=1s calls=3 rq[f=848000h,i=0,an=00h,rx=14m58s,wx=,ax=] rp[f=8004c020h,i=0,an=00h,rx=,wx=14m58s,ax=] s0=[7,0h,fd=6,ex=] s1=[7,18h,fd=7,ex=] exp=14m58s rp has the CF_SHUTR flag set. Next, the client sends a FIN packet, and the session information is as follows: 0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2 srv=def ts=08 age=38s calls=4 rq[f=84a020h,i=0,an=00h,rx=,wx=,ax=] rp[f=8004c020h,i=0,an=00h,rx=1m11s,wx=14m21s,ax=] s0=[7,0h,fd=6,ex=] s1=[9,10h,fd=7,ex=] exp=1m11s After waiting 90s, the session information is as follows: 0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2 srv=def ts=04 age=4m11s calls=718074391 rq[f=84a020h,i=0,an=00h,rx=,wx=,ax=] rp[f=8004c020h,i=0,an=00h,rx=?,wx=10m49s,ax=] s0=[7,0h,fd=6,ex=] s1=[9,10h,fd=7,ex=] exp=? 
run(nice=0) cpu information: 6899 root 20 0 112224 21408 4260 R 100.0 0.7 3:04.96 haproxy Buffering is set to ensure that there is data in the haproxy buffer, and haproxy can receive the fin package, set the CF_SHUTR flag, If the CF_SHUTR flag has been set, The following code does not clear the timeout message, causing cpu 100%: stream.c:process_stream: if (unlikely((res->flags & (CF_SHUTR\|CF_READ_TIMEOUT)) == CF_READ_TIMEOUT)) { if (si_b->flags & SI_FL_NOHALF) si_b->flags \|= SI_FL_NOLINGER; si_shutr(si_b); } If you have closed the read, set the read timeout does not make sense. With or without cf_shutr, read timeout is set: if (tick_isset(s->be->timeout.serverfin)) { res->rto = s->be->timeout.serverfin; res->rex = tick_add(now_ms, res->rto); } After discussion on the mailing list, setting half-closed timeouts the hard way here doesn't make sense. They should be set only at the moment the shutdown() is performed. It will also solve a special case which was already reported of some half-closed timeouts not working when the shutw() is performed directly at the stream-interface layer (no analyser involved). Since the stream interface layer cannot know the timeout values, we'll have to store them directly in the stream interface so that they are used upon shutw(). This patch does this, fixing the problem. An easier reproducer to validate the fix is to keep the huge buffer and shorten all timeouts, then call it under tcploop server and client, and wait 3 seconds to see haproxy run at 100% CPU : global tune.bufsize 10485760 listen px bind :1990 timeout client 90s timeout server 90s timeout connect 1s timeout server-fin 3s timeout client-fin 3s server def 127.0.0.1:3333 $ tcploop 3333 L W N20 A P100 F P10000 & $ tcploop 127.0.0.1:1990 C S10000000 F	2017-03-21 15:04:43 +01:00
Christopher Faulet	0184ea71a6	BUG/MAJOR: channel: Fix the definition order of channel analyzers It is important to define analyzers (AN_REQ_* and AN_RES_*) in the same order they are evaluated in process_stream. This order is really important because during analyzers evaluation, we run them in the order of the lower bit to the higher one. This way, when an analyzer adds/removes another one during its evaluation, we know if it is located before or after it. So, when it adds an analyzer which is located before it, we can switch to it immediately, even if it has already been called once but removed since. Over time, with the introduction of new analyzers, this order was broken. The main problems come from the filter analyzers. We used values not related to their evaluation order. Furthermore, we used the same values for request and response analyzers. So, to fix the bug, filter analyzers have been split into 2 distinct lists to have different analyzers for the request channel than those for the response channel. And of course, we have moved them to the right place. Some other analyzers have been reordered to respect the evaluation order: * AN_REQ_HTTP_TARPIT has been moved just before AN_REQ_SRV_RULES * AN_REQ_PRST_RDP_COOKIE has been moved just before AN_REQ_STICKING_RULES * AN_RES_STORE_RULES has been moved just after AN_RES_WAIT_HTTP Note today we have 29 analyzers, all stored into a 32 bits bitfield. So we can still add 4 more analyzers before having a problem. A good way to fend off the problem for a while could be to have a different bitfield for request and response analyzers. [wt: all of this must be backported to 1.7, and part of it must be backported to 1.6 and 1.5]	2017-01-05 17:58:22 +01:00
Thierry FOURNIER	2c8b54e7be	MEDIUM: lua: remove Lua struct from session, and allocate it with memory pools This patch uses memory pools for allocating the Lua struct. This saves 128B of memory in the session if Lua is unused.	2016-12-21 15:24:56 +01:00
Christopher Faulet	a73e59b690	BUG/MAJOR: Fix how the list of entities waiting for a buffer is handled When an entity tries to get a buffer, if it cannot be allocated, for example because the number of buffers which may be allocated per process is limited, this entity is added to a list (called <buffer_wq>) and waits for an available buffer. Historically, the <buffer_wq> list was logically attached to streams because they were the only entities likely to be added to it. Now, applets can also be waiting for a free buffer. And with filters, we could imagine having other entities waiting for a buffer. So it makes sense to have a generic list. Anyway, with the current design there is a bug. When an applet fails to get a buffer, it will wait. But we add the stream attached to the applet to <buffer_wq>, instead of the applet itself. So when a buffer is available, we wake up the stream and not the waiting applet. So, it is possible to have waiting applets that are never awakened. So, now, <buffer_wq> is independent from streams. And we really add the waiting entity to <buffer_wq>. To be generic, the entity is responsible for defining the callback used to awaken it. In addition, applets will still request an input buffer when they become active. But they will not be put to sleep anymore if no buffers are available. So it is the responsibility of the applet I/O handler to check if this buffer is allocated or not. This way, an applet can decide if this buffer is required or not and can do additional processing if not. [wt: backport to 1.7 and 1.6]	2016-12-12 19:11:04 +01:00
Christopher Faulet	9d810cae11	BUG/MEDIUM: stream: Save unprocessed events for a stream A stream can be awakened for different reasons. During its processing, it can be stopped early if no buffer is available. In this situation, the reason why the stream was awakened is lost, because we rely on the task state, which is reset after each processing loop. In many cases, that's not a big deal. But it can be useful to accumulate the task states if the stream processing is interrupted, especially if some filters need to be called. To be clearer, here is a simple example: 1) A stream is awakened with the reason TASK_WOKEN_MSG. 2) Because no buffer is available, the processing is interrupted, the stream is back to sleep. And the task state is reset. 3) Some buffers become available, so the stream is awakened with the reason TASK_WOKEN_RES. At this step, the previous reason (TASK_WOKEN_MSG) is lost. Now, the task states are saved for a stream and reset only when the stream processing is not interrupted. The corresponding bitfield represents the pending events for a stream. And we use this one instead of the task state during the stream processing. Note that TASK_WOKEN_TIMER and TASK_WOKEN_RES are always removed because these events are always handled during the stream processing. [wt: backport to 1.7 and 1.6]	2016-12-12 19:10:58 +01:00
Christopher Faulet	34c5cc98da	MINOR: task: Rename run_queue and run_queue_cur counters <run_queue> is used to track the number of tasks in the run queue and <run_queue_cur> is a copy used for reporting purposes. These counters have been renamed, respectively, <tasks_run_queue> and <tasks_run_queue_cur>. So the naming is consistent between tasks and applets. [wt: needed for next fixes, backport to 1.7 and 1.6]	2016-12-12 19:10:54 +01:00
Christopher Faulet	d47a1bd1d7	BUG/MINOR: filters: Invert evaluation order of HTTP_XFER_BODY and XFER_DATA analyzers These 2 analyzers are responsible for the data forwarding in, respectively, HTTP mode and TCP mode. Now, the analyzer responsible for the HTTP data forwarding is called before the one responsible for the TCP data forwarding. This will allow the filtering of tunneled data in HTTP. [wt: backport desired in 1.7 - no impact right now but may impact the ability to backport future fixes]	2016-11-29 17:03:04 +01:00
Willy Tarreau	7d56221d57	REORG: stkctr: move all the stick counters processing to stick-tables.c Historically we used to have the stick counters processing put into session.c which became stream.c. But a big part of it is now in stick-table.c (eg: converters) but despite this we still have all the sample fetch functions in stream.c These parts do not depend on the stream anymore, so let's move the remaining chunks to stick-table.c and have cleaner files. What remains in stream.c is everything needed to attach/detach trackers to the stream and to update the counters while the stream is being processed.	2016-11-25 16:10:05 +01:00
Willy Tarreau	397131093f	REORG: tcp-rules: move tcp rules processing to their own file There's no more reason to keep tcp rules processing inside proto_tcp.c given that there is nothing in common there except these 3 letters : tcp. The tcp rules are in fact connection, session and content processing rules. Let's move them to "tcp-rules" and let them live their life there.	2016-11-25 15:57:38 +01:00
Willy Tarreau	30e5e18bbb	CLEANUP: cli: remove assignments to st0 and st2 in keyword parsers Now it's not needed anymore to set STAT_ST_INIT nor CLI_ST_CALLBACK in the parsers, remove it in the various places.	2016-11-24 16:59:28 +01:00
Willy Tarreau	3b6e547be8	CLEANUP: cli: rename STAT_CLI_* to CLI_ST_* These are in CLI states, not stats states anymore. STAT_CLI_O_CUSTOM was more appropriately renamed CLI_ST_CALLBACK.	2016-11-24 16:59:28 +01:00
Willy Tarreau	61b6521cbf	REORG: cli: move "shutdown session" to stream.c It really kills streams in fact, but we can't change the name now.	2016-11-24 16:59:28 +01:00
Willy Tarreau	4e46b62ab1	REORG: cli: move "shutdown sessions server" to stream.c It could be argued that it's between server, stream and session but at least due to the fact that it operates on streams, its best place is in stream.c.	2016-11-24 16:59:28 +01:00
William Lallemand	4c5b4d531c	REORG: cli: move 'show sess' to stream.c Move 'show sess' CLI functions to stream.c and use the cli keyword API to register it on the CLI. [wt: the choice of stream vs session makes sense because since 1.6 these really are streams that we're dumping and not sessions anymore]	2016-11-24 16:59:27 +01:00
William Lallemand	9ed6203aef	REORG: cli: split dumpstats.h in stats.h and cli.h proto/dumpstats.h has been split into 4 files: * proto/cli.h contains prototypes for the CLI * proto/stats.h contains prototypes for the stats * types/cli.h contains definitions for the CLI * types/stats.h contains definitions for the stats	2016-11-24 16:59:27 +01:00
Christopher Faulet	a00d817aba	MINOR: filters: Add check_timeouts callback to handle timers expiration on streams A filter can now be notified when a stream is woken up because of an expired timer. The documentation and the TRACE filter have been updated.	2016-11-21 15:29:58 +01:00
Willy Tarreau	350135cf49	BUG/MEDIUM: connection: check the control layer before stopping polling The bug described in commit `568743a` ("BUG/MEDIUM: stream-int: completely detach connection on connect error") was not a stream-interface layer bug but a connection layer bug. There was exactly one place in the code where we could change a file descriptor's status without first checking whether it is valid or not, it was in conn_stop_polling(). This one is called when the polling status is changed after an update, and calls fd_stop_both even if we had already closed the file descriptor : 1479388298.484240 ->->->->-> conn_fd_handler > conn_cond_update_polling 1479388298.484240 ->->->->->-> conn_cond_update_polling > conn_stop_polling 1479388298.484241 ->->->->->->-> conn_stop_polling > conn_ctrl_ready 1479388298.484241 conn_stop_polling < conn_ctrl_ready 1479388298.484241 ->->->->->->-> conn_stop_polling > fd_stop_both 1479388298.484242 ->->->->->->->-> fd_stop_both > fd_update_cache 1479388298.484242 ->->->->->->->->-> fd_update_cache > fd_release_cache_entry 1479388298.484242 fd_update_cache < fd_release_cache_entry 1479388298.484243 fd_stop_both < fd_update_cache 1479388298.484243 conn_stop_polling < fd_stop_both 1479388298.484243 conn_cond_update_polling < conn_stop_polling 1479388298.484243 conn_fd_handler < conn_cond_update_polling The problem with the previous fix above is that it breaks the http_proxy mode and possibly even some Lua parts and peers to a certain extent ; all outgoing connections where the target address is initially copied into the outgoing connection which experience a retry would use a random outgoing address after the retry because closing and detaching the connection causes the target address to be lost. This was attempted to be addressed by commit `0857d7a` ("BUG/MAJOR: stream: properly mark the server address as unset on connect retry") but it used to only solve the most visible effect and not the root cause. 
Prior to this fix, it was possible to cause this config to keep CLOSE_WAIT for as long as it takes to expire a client or server timeout (note the missing client timeout) : listen test mode http bind :8002 server s1 127.0.0.1:8001 $ tcploop 8001 L0 W N20 A R P100 S:"HTTP/1.1 200 OK\r\nContent-length: 0\r\n\r\n" & $ tcploop 8002 N200 C T W S:"GET / HTTP/1.0\r\n\r\n" O P10000 K With this patch, these CLOSE_WAIT properly vanish when both processes leave. This commit reverts the two fixes above and replaces them with the proper fix in connection.h. It must be backported to 1.6 and 1.5. Thanks to Robson Roberto Souza Peixoto for providing very detailed traces showing some obvious inconsistencies leading to finding this bug.	2016-11-18 14:48:52 +01:00
Willy Tarreau	def0d22cc5	MINOR: stream: make option contstats usable again Quite a lot of people have been complaining about option contstats not working correctly anymore since about 1.4. The reason was that one reason for the significant performance boost between 1.3 and 1.4 was the ability to forward data between a server and a client without waking up the stream manager. And we couldn't afford to force sessions to constantly wake it up given that most of the people interested in contstats are also those interested in high performance transmission. An idea was experimented with in the past, consisting in limiting the amount of transmissible data before waking it up, but it was not usable on slow connections (eg: FTP over modem lines, RDP, SSH) as stats would be updated too rarely if at all, so that idea was dropped. During a discussion today another idea came up : ensure that stats are updated once in a while, since it's the only thing that matters. It happens that we have the request channel's analyse_exp timeout that is used to wake the stream up after a configured delay, and that by definition this timeout is not used when there's no more analyser (otherwise the stream would wake up and the stats would be updated). Thus here the idea is to reuse this timeout when there's no analyser and set it to now+5 seconds so that a stream wakes up at least once every 5 seconds to update its stats. It should be short enough to provide smooth traffic graphs and to allow to debug outputs of "show sess" more easily without inflicting too much load even for very large number of concurrent connections. This patch is simple enough and safe enough to be backportable to 1.6 if there is some demand.	2016-11-08 22:03:00 +01:00
Andrew Rodland	e168feb4a8	MINOR: proxy: add 'served' field to proxy, equal to total of all servers' This will allow lb_chash to determine the total active sessions for a proxy without any computation. Signed-off-by: Andrew Rodland <andrewr@vimeo.com>	2016-10-25 20:21:32 +02:00
Willy Tarreau	0857d7aa8f	BUG/MAJOR: stream: properly mark the server address as unset on connect retry Bartosz Koninski reported that recent commit `568743a` ("BUG/MEDIUM: stream-int: completely detach connection on connect error") introduced a nasty side effect during the connection retry, causing a reconnect attempt to be performed on a random address, possibly another server from another backend as shown in his reproducer. The real reason is in fact that by calling si_release_endpoint() after a failed connect attempt and not clearing the SN_ADDR_SET flag on the stream, we indicate that we continue to trust the connection's current address. So upon next connect attempt, a connection is picked from the pool and the last target address is reused. It is not very easy to reproduce it 100% reliably out of production traffic, but it's easier to see when haproxy is started with -dM to force the pools to be filled with junk, and where subsequent connections are not even attempted due to their family being incorrect. The correct fix consists in clearing the SN_ADDR_SET flag after calling si_release_endpoint() during a retry. But it outlines a deeper issue which is in fact that the target address is stored in the connection and its validity is stored in the stream. Until we have a better solution to store target addresses, it would be better to rely on the connection flags (CO_FL_ADDR_TO_SET) for this purpose. But it also outlines the fact that the same issue still exists in Lua sockets and in idle sockets, which fortunately are not affected by this issue. Thanks to Bartosz for providing all the elements needed to understand the problem. This fix needs to be backported to 1.6 and 1.5.	2016-08-29 15:33:56 +02:00
Thierry FOURNIER / OZON.IO	4cac359a39	MEDIUM: log: Decompose %Tq in %Th %Ti %TR Tq is the time between the instant the connection is accepted and a complete valid request is received. This time includes the handshake (SSL / Proxy-Protocol), the idle when the browser does preconnect and the request reception. This patch decomposes %Tq in 3 measurements names %Th, %Ti, and %TR which returns respectively the handshake time, the idle time and the duration of valid request reception. It also adds %Ta which reports the request's active time, which is the total time without %Th nor %Ti. It replaces %Tt as the total time, reporting accurate measurements for HTTP persistent connections. %Th is avalaible for TCP and HTTP sessions, %Ti, %TR and %Ta are only avalaible for HTTP connections. In addition to this, we have new timestamps %tr, %trg and %trl, which log the date of start of receipt of the request, respectively in the default format, in GMT time and in local time (by analogy with %t, %T and %Tl). All of them are obviously only available for HTTP. These values are more relevant as they more accurately represent the request date without being skewed by a browser's preconnect nor a keep-alive idle time. The HTTP log format and the CLF log format have been modified to use %tr, %TR, and %Ta respectively instead of %t, %Tq and %Tt. This way the default log formats now produce the expected output for users who don't want to manually fiddle with the log-format directive. 
Example with the following log-format :

    log-format "%ci:%cp [%tr] %ft %b/%s h=%Th/i=%Ti/R=%TR/w=%Tw/c=%Tc/r=%Tr/a=%Ta/t=%Tt %ST %B %CC %CS %tsc %ac/%fc/%bc/%sc/%rc %sq/%bq %hr %hs %{+Q}r"

The request was sent by hand using "openssl s_client -connect" :

    Aug 23 14:43:20 haproxy[25446]: 127.0.0.1:45636 [23/Aug/2016:14:43:20.221] test~ test/test h=6/i=2375/R=261/w=0/c=1/r=0/a=262/t=2643 200 145 - - ---- 1/1/0/0/0 0/0 "GET / HTTP/1.1"

=> 6 ms of SSL handshake, 2375 ms waiting before sending the first char (in fact the time to type the first line), 261 ms before the end of the request, no time spent in queue, 1 ms spent connecting to the server, immediate response, total active time for this request = 262 ms. Total time from accept to close : 2643 ms.

The timing now decomposes like this :

                 first request               2nd request
      |<-------------------------------->|<-------------- ...
      t         tr                       t    tr ...
   ---|----|----|----|----|----|----|----|----|--
      : Th   Ti   TR   Tw   Tc   Tr   Td : Ti   ...
      :<---- Tq ---->:                   :
      :<-------------- Tt -------------->:
                :<--------- Ta --------->:
	2016-08-23 15:18:08 +02:00
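The relationships between the timers in this commit message can be checked with simple arithmetic: Tq covers Th, Ti and TR; Ta covers everything from TR to Td; Tt is Th plus Ti plus Ta. The following is a minimal sketch of that arithmetic only, not HAProxy code. The struct and function names are hypothetical, all timers are assumed to be non-negative milliseconds, and Td is assumed to be 0 for the sample log line since the example log-format does not print it.

```c
/* Hypothetical container for the timers of one logged request, in ms. */
struct toy_timers {
    long th;  /* %Th: handshake time */
    long ti;  /* %Ti: idle time before the first byte of the request */
    long tR;  /* %TR: time to receive the full request */
    long tw;  /* %Tw: time spent in queue */
    long tc;  /* %Tc: time to connect to the server */
    long tr;  /* %Tr: server response time */
    long td;  /* Td: data transfer time (assumed, not in the sample line) */
};

/* Tq, as drawn in the diagram: handshake + idle + request reception. */
static long toy_tq(const struct toy_timers *t)
{
    return t->th + t->ti + t->tR;
}

/* Ta, the active time: everything except handshake and idle. */
static long toy_ta(const struct toy_timers *t)
{
    return t->tR + t->tw + t->tc + t->tr + t->td;
}

/* Tt, the total time from accept to close. */
static long toy_tt(const struct toy_timers *t)
{
    return t->th + t->ti + toy_ta(t);
}
```

Filling the struct with the values from the sample line (h=6, i=2375, R=261, w=0, c=1, r=0) gives an active time of 262 ms and a total of 2643 ms, matching the a= and t= fields of that line.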
Willy Tarreau	4d03ef7f03	BUG/MAJOR: stick-counters: possible crash when using sc_trackers with wrong table Bryan Talbot reported a very interesting bug. The sc_trackers() sample fetch seems to have escaped the sanitization that was performed during 1.5 to ensure all dereferences of stkctr_entry() were safe. Here if a tracker is set on a backend and is then checked against a different backend where the entry doesn't exist, stkctr_entry() returns NULL and this is dereferenced to retrieve the ref count. Thanks to Bryan for his detailed bug report featuring a working config and reproducer. This fix must be backported to 1.6 and 1.5.	2016-08-14 12:02:55 +02:00
Willy Tarreau	568743a21f	BUG/MEDIUM: stream-int: completely detach connection on connect error Tim Butler reported a troubling issue affecting all versions since 1.5. When a connection error occurs and a retry is performed on the same server, the server connection first goes into the turn-around state (SI_ST_TAR) for one second. During this time, the client may speak and try to push some data. The tests in place confirm that the stream interface is in a state <= SI_ST_EST and that a connection exists, so all ingredients are present to try to perform a send() to forward data. The send() cannot be performed since the connection's control layer is marked as not ready, but the polling flags are changed, and due to the remaining error flag present on the connection, the polling on the FD is disabled in both directions. But if this FD was reassigned to another connection in the mean time, it is this FD which is disabled, and it causes a timeout on another connection.

A configuration allowing to reproduce the issue looks like this :

    listen test
        bind :8003
        server s1 127.0.0.1:8001   # this one should be closed

    listen victim
        bind :8002
        server s1 127.0.0.1:8000   # this one should respond slowly (~50ms)

Two parallel injections should be run with short time-outs (100ms). After some time, some dead connections will appear in listener "victim" due to their I/Os being disabled by some of the failed transfers on "test" instance. These ones will only be flushed on time out. A dead connection looks like this :

    > show sess 0x7dcb70
    0x7dcb70: [07/Aug/2016:08:58:40.120151] id=3771 proto=tcpv4 source=127.0.0.1:34682
      flags=0xce, conn_retries=3, srv_conn=0x7da020, pend_pos=(nil)
      frontend=victim (id=3 mode=tcp), listener=? (id=1) addr=127.0.0.1:8002
      backend=victim (id=3 mode=tcp) addr=127.0.0.1:37736
      server=s1 (id=1) addr=127.0.0.1:8000
      task=0x7dcaf8 (state=0x08 nice=0 calls=2 exp=<NEVER> age=30s)
      si[0]=0x7dcd68 (state=EST flags=0x08 endp0=CONN:0x7e2410 exp=<NEVER>, et=0x000)
      si[1]=0x7dcd88 (state=EST flags=0x18 endp1=CONN:0x7e0cd0 exp=<NEVER>, et=0x000)
      co0=0x7e2410 ctrl=tcpv4 xprt=RAW data=STRM target=LISTENER:0x7d9ea8
          flags=0x0020b306 fd=122 fd.state=25 fd.cache=0 updt=0
      co1=0x7e0cd0 ctrl=tcpv4 xprt=RAW data=STRM target=SERVER:0x7da020
          flags=0x0020b306 fd=93 fd.state=20 fd.cache=0 updt=0
      req=0x7dcb80 (f=0x848000 an=0x0 pipe=0 tofwd=-1 total=129)
          an_exp=<NEVER> rex=<NEVER> wex=<NEVER>
          buf=0x7893c0 data=0x7893d4 o=0 p=0 req.next=0 i=0 size=0
      res=0x7dcbc0 (f=0x80008000 an=0x0 pipe=0 tofwd=-1 total=0)
          an_exp=<NEVER> rex=<NEVER> wex=<NEVER>
          buf=0x7893c0 data=0x7893d4 o=0 p=0 rsp.next=0 i=0 size=0

The solution against this issue is to completely detach the connection upon error instead of only performing a forced close. This fix should be backported to 1.6 and 1.5. Special thanks to Tim who did all the troubleshooting work and provided a lot of traces allowing to find the root cause of this problem.	2016-08-07 09:21:04 +02:00
Thierry Fournier	6fc340ff07	BUG/MEDIUM: sticktables: segfault in some configuration error cases When a stick table is tracked, and another one is used later in the configuration, a segfault occurs. The function "smp_create_src_stkctr" can return a NULL value, and its value is not tested, so another function tries to dereference a NULL pointer. This patch just adds a check of the NULL pointer. The problem is reproduced with this configuration:

    listen www
        mode http
        bind :12345
        tcp-request content track-sc0 src table IPv4
        http-request allow if { sc0_inc_gpc0(IPv6) gt 0 }
        server dummy 127.0.0.1:80

    backend IPv4
        stick-table type ip size 10 expire 60s store gpc0

    backend IPv6
        stick-table type ipv6 size 10 expire 60s store gpc0

Thanks to kabefuna@gmail.com for the bug report. This patch must be backported to versions 1.6 and 1.5.	2016-06-07 11:05:23 +02:00
Christopher Faulet	3a394fa7cd	MEDIUM: filters: Add pre and post analyzer callbacks 'channel_analyze' callback has been removed. Now, there are 2 callbacks to surround calls to analyzers:

  * channel_pre_analyze: Called BEFORE all filterable analyzers. It can be called many times for the same analyzer, once at each loop until the analyzer finishes its processing. This callback is resumable, it returns a negative value if an error occurs, 0 if it needs to wait, any other value otherwise.
  * channel_post_analyze: Called AFTER all filterable analyzers. Here, AFTER means when an analyzer finishes its processing. This callback is NOT resumable, it returns a negative value if an error occurs, any other value otherwise.

Pre and post analyzer callbacks are not automatically called. 'pre_analyzers' and 'post_analyzers' bit fields in the filter structure must be set to the right value using AN_* flags (see include/types/channel.h). The flag AN_RES_ALL has been added (AN_REQ_ALL already exists) to ease the life of filter developers. AN_REQ_ALL and AN_RES_ALL include all filterable analyzers.	2016-05-18 15:11:54 +02:00
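The calling convention described in this commit (bit fields selecting which analyzers get a pre or post callback, a resumable pre callback, a post callback run only once the analyzer has finished) can be illustrated with a small self-contained sketch. Everything here is hypothetical: the `toy_` names, the two flag values and the callback signatures are made up for illustration and are not the HAProxy filter API.

```c
/* Hypothetical analyzer bits, standing in for the real AN_* flags. */
#define TOY_AN_REQ_A 0x1u
#define TOY_AN_REQ_B 0x2u

struct toy_filter {
    unsigned int pre_analyzers;   /* analyzers that want the pre callback */
    unsigned int post_analyzers;  /* analyzers that want the post callback */
    int (*pre)(unsigned int an_bit);   /* <0 error, 0 wait, >0 continue */
    int (*post)(unsigned int an_bit);  /* <0 error, anything else continue */
};

/* Runs one analyzer surrounded by the callbacks it was registered for.
 * Returns <0 on error, 0 if the caller must come back later, 1 when done. */
static int toy_run_analyzer(const struct toy_filter *f, unsigned int an_bit,
                            int (*analyzer)(void))
{
    int ret;

    if ((f->pre_analyzers & an_bit) && f->pre) {
        ret = f->pre(an_bit);
        if (ret <= 0)
            return ret;           /* error, or resumable wait */
    }

    ret = analyzer();
    if (ret <= 0)
        return ret;               /* analyzer failed or is not finished yet */

    /* The post callback only runs once the analyzer has finished. */
    if ((f->post_analyzers & an_bit) && f->post) {
        ret = f->post(an_bit);
        if (ret < 0)
            return ret;
    }
    return 1;
}

/* Example callbacks that only count how often they are called. */
static int toy_pre_calls, toy_post_calls;
static int toy_pre(unsigned int an_bit)  { (void)an_bit; toy_pre_calls++;  return 1; }
static int toy_post(unsigned int an_bit) { (void)an_bit; toy_post_calls++; return 1; }
static int toy_analyzer_done(void)       { return 1; }
```

With a filter registered only for `TOY_AN_REQ_A`, running the `TOY_AN_REQ_B` analyzer leaves both counters untouched, which mirrors the statement that the callbacks are not called unless the matching bit is set.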
Christopher Faulet	a9215b7206	MINOR: filters: Simplify calls to analyzers using 2 new macros Now, to call an analyzer in 'process_stream' function, we should use FLT_ANALYZE or ANALYZE macros, depending on whether this is a filterable analyzer or not.	2016-05-18 15:11:54 +02:00
Willy Tarreau	8bf242b764	BUG/MEDIUM: channel: fix inconsistent handling of 4GB-1 transfers In 1.4-dev3, commit `31971e5` ("[MEDIUM] add support for infinite forwarding") made it possible to configure the lower layer to forward data indefinitely by setting the forward size to CHN_INFINITE_FORWARD (4GB-1). Back then larger chunk sizes were not supported so there was no confusion in the usage of the function. Since 1.5 we support 64-bit content-lengths and chunk sizes and the function has grown to support 64-bit arguments, though it still limits a single pass to 32-bit quantities (what fits in the channel's to_forward field). The issue now becomes that a 4GB-1 content-length can be confused with infinite forwarding (in fact it's 4GB-1+what was already in the buffer). It causes a visible effect when transferring this exact size because the transfer rate is lower than with other sizes due in part to the disabling of the Nagle algorithm on the sendto() call. In theory with keep-alive it should prevent a second request from being processed after such a transfer, but since the analysers are still present, the forwarding analyser properly counts down the remaining size to transfer and ultimately the transaction gets correctly reset so there is no visible effect. Since the root cause of the issue is an API problem (lack of distinction between a real valid length and a magic value), this patch modifies the API to have a new dedicated function called channel_forward_forever() to program a permanent forwarding. The existing function __channel_forward() was modified to properly take care of the requested sizes and ensure it 1) never overflows and 2) never reaches CHN_INFINITE_FORWARD by accident. It is worth noting that the function used to have a bug causing a 2GB forward to be scheduled if it was called with less data than what is present in buf->i. Fortunately this bug couldn't be triggered with existing code. This fix should be backported to 1.6 and 1.5. 
While it also theoretically affects 1.4, it's better not to backport it there, as the risk of breaking large object transfers due to significant API differences is high, compared to the fact that the largest supported objects (4GB-1) are just slower to transfer.	2016-05-04 15:26:37 +02:00
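The API problem described in this commit, a valid length that collides with a magic "forward forever" value, can be shown in isolation. The sketch below is not the HAProxy implementation of __channel_forward() or channel_forward_forever(); the `toy_` names and the struct are hypothetical, and it only demonstrates the two guarantees the message lists: the counter never overflows, and a finite request never lands on the magic value by accident.

```c
#include <stdint.h>

/* Magic value meaning "forward forever" (hypothetical stand-in). */
#define TOY_INFINITE_FORWARD UINT32_MAX

struct toy_channel {
    uint32_t to_forward;  /* bytes still scheduled, or TOY_INFINITE_FORWARD */
};

/* Schedules up to <bytes> more bytes and returns how many were accepted.
 * The caller is expected to come back later with the remainder. */
static uint64_t toy_forward(struct toy_channel *c, uint64_t bytes)
{
    uint64_t room;

    if (c->to_forward == TOY_INFINITE_FORWARD)
        return bytes;     /* already forwarding forever, nothing to count */

    /* The largest finite value allowed is TOY_INFINITE_FORWARD - 1. */
    room = (uint64_t)(TOY_INFINITE_FORWARD - 1) - c->to_forward;
    if (bytes > room)
        bytes = room;

    c->to_forward += (uint32_t)bytes;
    return bytes;
}

/* Dedicated entry point: the only way to reach the magic value. */
static void toy_forward_forever(struct toy_channel *c)
{
    c->to_forward = TOY_INFINITE_FORWARD;
}
```

Asking `toy_forward()` for exactly 4GB-1 bytes on an empty channel accepts one byte less and leaves the counter finite, so such a transfer can no longer be mistaken for infinite forwarding.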
Willy Tarreau	5fb04711f0	BUG/MEDIUM: stream: ensure the SI_FL_DONT_WAKE flag is properly cleared The previous buffer space bug has revealed an issue causing some stalled connections to remain orphaned forever, preventing an old process from dying. The issue is that once in a while a task may be woken up because a disabled expiration timer has been reached despite no timeout being reached. In this case we exit very early but the SI_FL_DONT_WAKE flag wasn't cleared, resulting in new events not waking the task up. It may be one of the reasons why a few people have already observed some peers connections stuck in CLOSE_WAIT state. This bug was introduced in 1.5-dev13 by commit `798f432` ("OPTIM: session: don't process the whole session when only timers need a refresh"), so the fix must be backported to 1.6 and 1.5.	2016-05-04 10:18:37 +02:00
Frederik Deweerdt	6cd8d13c05	OPTIM/MINOR: session: abort if possible before connecting to the backend Depending on the path that led to sess_update_stream_int(), it's possible that we had a read error on the frontend, but that we haven't checked if we may abort the connection. This was seen in particular in the following setup: tcp mode, with abortonclose set, frontend using ssl. If the ssl connection had a first successful read, but the second read failed, we would still try to open a connection to the backend, although we had enough information to close the connection early. sess_update_stream_int() had some logic to handle that case in the SI_ST_QUE and SI_ST_TAR, but that was missing in the SI_ST_ASS case. This patch addresses the issue by verifying the state of the req channel (and the abortonclose option) right before opening the connection to the backend, so we have the opportunity to close the connection there, and factorizes the shared SI_ST_{QUE,TAR,ASS} code.	2016-04-07 19:12:02 +02:00
Thierry Fournier	40e1d51068	BUG/MEDIUM: stick-tables: some sample-fetch doesn't work in the connection state. The sc_* sample fetch can work without the struct strm, because the tracked counters are also stored in the session. So, this patch removes the check for the strm existence. This bug is recent and was introduced in 1.7-dev2 by commit `6204cd9` ("BUG/MAJOR: vars: always retrieve the stream and session from the sample"). This bugfix must be backported to 1.6.	2016-03-30 19:51:33 +02:00
Willy Tarreau	6204cd9f27	BUG/MAJOR: vars: always retrieve the stream and session from the sample This is the continuation of previous patch called "BUG/MAJOR: samples: check smp->strm before using it". It happens that variables may have a session-wide scope, and that their session is retrieved by dereferencing the stream. But nothing prevents them from being used from a streamless context such as tcp-request connection, thus crashing the process. Example : tcp-request connection accept if { src,set-var(sess.foo) -m found } In order to fix this, we have to always ensure that variable manipulation only happens via the sample, which contains the correct owner and context, and that we never use one from a different source. This results in quite a large change since a lot of functions are indirectly involved in the call chain, but the change is easy to follow. This fix must be backported to 1.6, and requires the last two patches.	2016-03-10 17:28:04 +01:00
Willy Tarreau	be508f1580	BUG/MAJOR: samples: check smp->strm before using it Since commit `6879ad3` ("MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers") merged in 1.6-dev2, the sample contains the pointer to the stream and sample fetch functions as well as converters use it heavily. The problem is that earlier commit `87b0966` ("REORG/MAJOR: session: rename the "session" entity to "stream"") had split the session and stream resulting in the possibility for smp->strm to be NULL before the stream was initialized. This is what happens in tcp-request connection rulesets, as discovered by Baptiste. The sample fetch functions must now check that smp->strm is valid before using it. An alternative could consist in using a dummy stream with nothing in it to avoid some checks but it would only result in deferring them to the next step anyway, and making it harder to detect that a stream is valid or the dummy one. There is still an issue with variables which requires a complete independent fix. They use strm->sess to find the session with strm possibly NULL and passed as an argument. All call places indirectly use smp->strm to build strm. So the problem is there but the API needs to be changed to remove this duplicate argument that makes it much harder to know what pointer to use. This fix must be backported to 1.6, as well as the next one fixing variables.	2016-03-10 16:42:58 +01:00
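The rule stated in this commit, that a sample fetch must check smp->strm before using it because no stream exists yet in tcp-request connection rulesets, can be sketched outside of HAProxy as follows. The `toy_` structures and functions are hypothetical and much simpler than the real struct sample; the sketch only shows a stream-level fetch refusing to run without a stream while a session-level fetch keeps working.

```c
#include <stddef.h>

struct toy_session { int conn_count; };  /* exists as soon as we accept */
struct toy_stream  { int req_count; };   /* created later, may be absent */

struct toy_sample {
    struct toy_session *sess;  /* owner session */
    struct toy_stream  *strm;  /* NULL in a streamless context */
    int value;                 /* result of the fetch */
};

/* Stream-level fetch: returns 1 and fills smp->value, or 0 when the
 * information is not available because there is no stream yet. */
static int toy_fetch_req_count(struct toy_sample *smp)
{
    if (!smp->strm)
        return 0;
    smp->value = smp->strm->req_count;
    return 1;
}

/* Session-level fetch: only needs the session, so it must not insist
 * on a stream being present. */
static int toy_fetch_conn_count(struct toy_sample *smp)
{
    if (!smp->sess)
        return 0;
    smp->value = smp->sess->conn_count;
    return 1;
}
```

The design choice is that each fetch checks only the pointers it actually dereferences, so data that lives in the session stays usable before the stream is created.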
Christopher Faulet	309c6418b0	MEDIUM: filters: Replace filter_http_headers callback by an analyzer This new analyzer will be called for each HTTP request/response, before the parsing of the body. It is identified by AN_FLT_HTTP_HDRS. Special care was taken about the following condition :

  * the frontend is a TCP proxy
  * filters are defined in the frontend section
  * the selected backend is a HTTP proxy

So, this patch explicitly adds the AN_FLT_HTTP_HDRS analyzer on the request and the response channels when the backend is a HTTP proxy and when there are filters attached to the stream. This patch simplifies http_request_forward_body and http_response_forward_body functions.	2016-02-09 14:53:15 +01:00


126 Commits