haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 16:17:09 +02:00

Author	SHA1	Message	Date
Willy Tarreau	fac4bd1492	MAJOR: session: pass applet return traffic through the response analysers Now that applets work like real connections, there is no reason for them to evade the response analysers. The stats applet emits valid HTTP responses, it can flow through the HTTP response analyser just fine. This now allows http-response/rsprep/rspadd rules to be applied on top of stats. Cookie insertion does nothing since applets are not servers and thus do not have a cookie. We can imagine compression to be applied later if the stats output is emitted in chunks and in HTTP/1.1. A minor visible effect of this change is that there is no more "-1" in the timers presented in the logs when viewing the stats, all timers are real.	2013-12-09 15:40:22 +01:00
Willy Tarreau	d84fb5e60f	MAJOR: session: check for a connection to an applet in sess_prepare_conn_req() Instead of having applets bypass the whole connection process, we now follow the common path through sess_prepare_conn_req(). It is this function which detects an applet an sets the output state so SI_ST_EST instead of initiating a connection to a server. It is made possible because we now have s->target pointing to the applet.	2013-12-09 15:40:22 +01:00
Willy Tarreau	7584b27956	MEDIUM: session: detect applets from the session by using s->target We used to rely on the stream interface's target to detect an applet from within the session while trying to process the connection request, but this is incorrect, as this target is the one currently connected and not the next one to process. This will make a difference when we later support keep-alive. The only "official" value indicating where we want to connect is the session's target, which can be : - &applet : connect to this applet - NULL : connect using the normal LB algos - anything else : direct connection to some entity Since we're interested in detecting the specific case of applets, it's OK to make use of s->target then. Also, applets are being isolated from connections, and as such there will not be any ->connect method available when an applet is running, so we can get rid of this test as well.	2013-12-09 15:40:22 +01:00
Willy Tarreau	08382955fe	CLEANUP: stream_interface: remove unused field err_loc This field was still fed with a pointer to the server that caught an error but was not used anymore. Let's remove it.	2013-12-09 15:40:21 +01:00
Willy Tarreau	9667a80676	BUG/MEDIUM: stick-tables: complete the latest fix about store-responses The commit `37e340c` (BUG/MEDIUM: stick: completely remove the unused flag from the store entries) was incomplete. We also need to ensure that only the first store-response for a table is applied and that it may coexist with a possible store-request that was already done on this table. This patch with the previous one should be backported to 1.4.	2013-12-09 15:29:25 +01:00
Willy Tarreau	37e340ce4b	BUG/MEDIUM: stick: completely remove the unused flag from the store entries The store[] array in the session holds a flag which probably aimed to differenciate store entries learned from the request from those learned from the response, and allowing responses to overwrite only the request ones (eg: have a server set a response cookie which overwrites the request one). But this flag is set when a response data is stored, and is never cleared. So in practice, haproxy always runs with this flag set, meaning that responses prevent themselves from overriding the request data. It is desirable anyway to keep the ability not to override data, because the override is performed only based on the table and not on the key, so that would mean that it would be impossible to retrieve two different keys to store into a same table. For example, if a client sets a cookie and a server another one, both need to be updated in the table in the proper order. This is especially true when multiple keys may be tracked on each side into the same table (eg: list of IP addresses in a header). So the correct fix which also maintains the current behaviour consists in simply removing this flag and never try to optimize for the overwrite case. This fix also has the benefit of significantly reducing the session size, by 64 bytes due to alignment issues caused by this flag! The bug has been there forever (since 1.4-dev7), so a backport to 1.4 would be appropriate.	2013-12-06 23:14:53 +01:00
Willy Tarreau	38d5892634	OPTIM/MINOR: mark the source address as already known on accept() Commit `986a9d2d12` moved the source address from the stream interface to the session, but it did not set the flag on the connection to report that the source address is known. Thus when logs are enabled, we had a call to getpeername() which is redundant with the result from accept(). This patch simply sets the flag.	2013-11-16 00:17:59 +01:00
Willy Tarreau	05bf5e1c36	BUG/MEDIUM: session: risk of crash on out of memory conditions In session_accept(), if we face a memory allocation error, we try to emit an HTTP 500 error message in HTTP mode. The problem is that we must not use http_error_message() for this since it dereferences the session which can be NULL in this case. We don't need the session to build the error message anyway since this function only uses it to retrieve the backend and frontend to get the most suited error message. Let's pick it ourselves, we're at the beginning of the session, only the frontend is relevant. This bug is 1.5-specific.	2013-10-30 07:59:03 +01:00
Willy Tarreau	0f791d42b6	MEDIUM: counters: support looking up a key in an alternate table sc_* sample fetches now take an optional parameter which allows to look the key in an alternate table. This is convenient to pass multiple information for the same key at once (eg: have multiple gpc0 for the same key, or support being fed complementary information from the CLI). Example : listen front bind :8000 tcp-request content track-sc0 src table local-ip http-response set-header src-id %[sc0_get_gpc0]+%[sc0_get_gpc0(global-ip)] server dummy 127.0.0.1:8001 backend local-ip stick-table size 1k type ip store gpc0 backend global-ip stick-table size 1k type ip store gpc0	2013-08-01 21:17:14 +02:00
Willy Tarreau	4d4149cf3e	MEDIUM: counters: support passing the counter number as a fetch argument One very annoying issue when trying to extend the sticky counters beyond the current 3 counters is that it requires a massive copy-paste of fetch functions (we don't have to copy-paste code anymore), just so that the fetch names exist. So let's have an alternate form like "sc_*(num)" to allow passing the counter number as an argument without having to redefine new fetch names. The MAX_SESS_STKCTR macro defines the number of usable sticky counters, which defaults to 3.	2013-08-01 21:17:14 +02:00
Willy Tarreau	b4c8493a9f	MINOR: session: make the number of stick counter entries more configurable In preparation of more flexibility in the stick counters, make their number configurable. It still defaults to 3 which is the minimum accepted value. Changing the value alone is not sufficient to get more counters, some bitfields still need to be updated and the TCP actions need to be updated as well, but this update tries to be easier, which is nice for experimentation purposes.	2013-08-01 21:17:14 +02:00
Willy Tarreau	563eef4e30	MEDIUM: counters: factor out smp_fetch_sc*_trackers smp_fetch_sc0_trackers, smp_fetch_sc1_trackers and smp_fetch_sc2_trackers were merged into a single function which relies on the fetch name to decide what to return. This is also a bug fix for this feature which has never worked till its bogus introduction by commit "2406db4 MEDIUM: counters: add sc1_trackers/sc2_trackers" (1.5-dev10). Instead of returning the value in the sample, it was returned as the fetch result! There is no need to backport this fix anyway since it's 1.5-specific and nobody uses the feature.	2013-08-01 21:17:14 +02:00
Willy Tarreau	a0b68eddef	MEDIUM: counters: factor out smp_fetch_sc*_bytes_out_rate smp_fetch_sc0_bytes_out_rate, smp_fetch_sc1_bytes_out_rate, smp_fetch_sc2_bytes_out_rate, smp_fetch_src_bytes_out_rate and smp_fetch_bytes_out_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	53aea10fe9	MEDIUM: counters: factor out smp_fetch_sc*_kbytes_out smp_fetch_sc0_kbytes_out, smp_fetch_sc1_kbytes_out, smp_fetch_sc2_kbytes_out, smp_fetch_src_kbytes_out and smp_fetch_kbytes_out were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	613fe99cda	MEDIUM: counters: factor out smp_fetch_sc*_bytes_in_rate smp_fetch_sc0_bytes_in_rate, smp_fetch_sc1_bytes_in_rate, smp_fetch_sc2_bytes_in_rate, smp_fetch_src_bytes_in_rate and smp_fetch_bytes_in_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	5077d4b261	MEDIUM: counters: factor out smp_fetch_sc*_kbytes_in smp_fetch_sc0_kbytes_in, smp_fetch_sc1_kbytes_in, smp_fetch_sc2_kbytes_in, smp_fetch_src_kbytes_in and smp_fetch_kbytes_in were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	9daf262c88	MEDIUM: counters: factor out smp_fetch_sc*_http_err_rate smp_fetch_sc0_http_err_rate, smp_fetch_sc1_http_err_rate, smp_fetch_sc2_http_err_rate, smp_fetch_src_http_err_rate and smp_fetch_http_err_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	30d07c3b8e	MEDIUM: counters: factor out smp_fetch_sc*_http_err_cnt smp_fetch_sc0_http_err_cnt, smp_fetch_sc1_http_err_cnt, smp_fetch_sc2_http_err_cnt, smp_fetch_src_http_err_cnt and smp_fetch_http_err_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	cf47763c92	MEDIUM: counters: factor out smp_fetch_sc*_http_req_rate smp_fetch_sc0_http_req_rate, smp_fetch_sc1_http_req_rate, smp_fetch_sc2_http_req_rate, smp_fetch_src_http_req_rate and smp_fetch_http_req_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	91200da197	MEDIUM: counters: factor out smp_fetch_sc*_http_req_cnt smp_fetch_sc0_http_req_cnt, smp_fetch_sc1_http_req_cnt, smp_fetch_sc2_http_req_cnt, smp_fetch_src_http_req_cnt and smp_fetch_http_req_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:14 +02:00
Willy Tarreau	3a96f3f274	MEDIUM: counters: factor out smp_fetch_sc*_sess_rate smp_fetch_sc0_sess_rate, smp_fetch_sc1_sess_rate, smp_fetch_sc2_sess_rate, smp_fetch_src_sess_rate and smp_fetch_sess_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	20843087f5	MEDIUM: counters: factor out smp_fetch_sc*_sess_cnt smp_fetch_sc0_sess_cnt, smp_fetch_sc1_sess_cnt, smp_fetch_sc2_sess_cnt, smp_fetch_src_sess_cnt and smp_fetch_sess_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	f44a553476	MEDIUM: counters: factor out smp_fetch_sc*_conn_cur smp_fetch_sc0_conn_cur, smp_fetch_sc1_conn_cur, smp_fetch_sc2_conn_cur, smp_fetch_src_conn_cur and smp_fetch_conn_cur were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	c8c65700de	MEDIUM: counters: factor out smp_fetch_sc*_conn_rate smp_fetch_sc0_conn_rate, smp_fetch_sc1_conn_rate, smp_fetch_sc2_conn_rate, smp_fetch_src_conn_rate and smp_fetch_conn_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	3b46c5c47d	MEDIUM: counters: factor out smp_fetch_sc*_conn_cnt smp_fetch_sc0_conn_cnt, smp_fetch_sc1_conn_cnt, smp_fetch_sc2_conn_cnt, smp_fetch_src_conn_cnt and smp_fetch_conn_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	b9f441d2c0	MEDIUM: counters: factor out smp_fetch_sc*_clr_gpc0 smp_fetch_sc0_clr_gpc0, smp_fetch_sc1_clr_gpc0, smp_fetch_sc2_clr_gpc0, smp_fetch_src_clr_gpc0 and smp_fetch_clr_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	710d38cea5	MEDIUM: counters: factor out smp_fetch_sc*_inc_gpc0 smp_fetch_sc0_inc_gpc0, smp_fetch_sc1_inc_gpc0, smp_fetch_sc2_inc_gpc0, smp_fetch_src_inc_gpc0 and smp_fetch_inc_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	b5e0af0b6b	MEDIUM: counters: factor out smp_fetch_sc*_gpc0_rate smp_fetch_sc0_gpc0, smp_fetch_sc1_gpc0, smp_fetch_sc2_gpc0, smp_fetch_src_gpc0 and smp_fetch_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	30b2046dfe	MEDIUM: counters: factor out smp_fetch_sc*_get_gpc0 smp_fetch_sc0_get_gpc0, smp_fetch_sc1_get_gpc0, smp_fetch_sc2_get_gpc0, smp_fetch_src_get_gpc0 and smp_fetch_get_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	a65536ca4e	MINOR: counters: provide a generic function to retrieve a stkctr for sc* and src. This function aims at simplifying the prefetching of the table and entry when using any of the session counters fetches. The principle is that the src_* variant produces a stkctr that is used instead of the one from the session. That way we can call the same function from all session counter fetch functions and always have a single function to support sc[0-9]_/src_.	2013-08-01 21:17:13 +02:00
Willy Tarreau	88821241d4	MINOR: counters: factor out smp_fetch_sc*_tracked The new function makes use of the sc# in the keyword to get the counter ID.	2013-08-01 21:17:13 +02:00
Willy Tarreau	ef38c39287	MEDIUM: sample: systematically pass the keyword pointer to the keyword We're having a lot of duplicate code just because of minor variants between fetch functions that could be dealt with if the functions had the pointer to the original keyword, so let's pass it as the last argument. An earlier version used to pass a pointer to the sample_fetch element, but this is not the best solution for two reasons : - fetch functions will solely rely on the keyword string - some other smp_fetch_* users do not have the pointer to the original keyword and were forced to pass NULL. So finally we're passing a pointer to the keyword as a const char *, which perfectly fits the original purpose.	2013-08-01 21:17:13 +02:00
Willy Tarreau	0fc36e3ae9	BUG/MAJOR: http: don't emit the send-name-header when no server is available Lukas Benes reported that http-send-name-header causes a segfault if no server is available because we're dereferencing the session's target which is NULL. The tiniest reproducer looks like this : listen foo bind :1234 mode http http-send-name-header srv This obvious fix must be backported to 1.4 which is affected as well.	2013-07-04 11:44:27 +02:00
Willy Tarreau	7af7d5957d	BUG: counters: third counter was not stored if others unset Commit `e25c917a` introduced a third tracking counter bug forgot to check it when storing values at the end of the session. The impact is that if neither the first nor the second one are changed, none of them are saved.	2013-07-01 18:08:41 +02:00
Willy Tarreau	dc13c11c1e	BUG/MEDIUM: prevent gcc from moving empty keywords lists into BSS Benoit Dolez reported a failure to start haproxy 1.5-dev19. The process would immediately report an internal error with missing fetches from some crap instead of ACL names. The cause is that some versions of gcc seem to trim static structs containing a variable array when moving them to BSS, and only keep the fixed size, which is just a list head for all ACL and sample fetch keywords. This was confirmed at least with gcc 3.4.6. And we can't move these structs to const because they contain a list element which is needed to link all of them together during the parsing. The bug indeed appeared with 1.5-dev19 because it's the first one to have some empty ACL keyword lists. One solution is to impose -fno-zero-initialized-in-bss to everyone but this is not really nice. Another solution consists in ensuring the struct is never empty so that it does not move there. The easy solution consists in having a non-null list head since it's not yet initialized. A new "ILH" list head type was thus created for this purpose : create an Initialized List Head so that gcc cannot move the struct to BSS. This fixes the issue for this version of gcc and does not create any burden for the declarations.	2013-06-21 23:29:02 +02:00
Willy Tarreau	8615c2af67	MEDIUM: session: disable lingering on the server when the client aborts When abortonclose is used and an error is detected on the client side, better force an RST to the server. That way we propagate to the server the same vision we got from the client, and we ensure that we won't keep TIME_WAITs.	2013-06-21 08:20:19 +02:00
Willy Tarreau	be4a3eff34	MEDIUM: counters: use sc0/sc1/sc2 instead of sc1/sc2/sc3 It was a bit inconsistent to have gpc start at 0 and sc start at 1, so make sc start at zero like gpc. No previous release was issued with sc3 anyway, so no existing setup should be affected.	2013-06-17 15:04:07 +02:00
Willy Tarreau	6d4e4e8dd2	MEDIUM: acl: remove a lot of useless ACLs that are equivalent to their fetches The following 116 ACLs were removed because they're redundant with their fetch function since last commit which allows the fetch function to be used instead for types BOOL, INT and IP. Most places are now left with an empty ACL keyword list that was not removed so that it's easier to add other ACLs later. always_false, always_true, avg_queue, be_conn, be_id, be_sess_rate, connslots, nbsrv, queue, srv_conn, srv_id, srv_is_up, srv_sess_rate, res.comp, fe_conn, fe_id, fe_sess_rate, dst_conn, so_id, wait_end, http_auth, http_first_req, status, dst, dst_port, src, src_port, sc1_bytes_in_rate, sc1_bytes_out_rate, sc1_clr_gpc0, sc1_conn_cnt, sc1_conn_cur, sc1_conn_rate, sc1_get_gpc0, sc1_gpc0_rate, sc1_http_err_cnt, sc1_http_err_rate, sc1_http_req_cnt, sc1_http_req_rate, sc1_inc_gpc0, sc1_kbytes_in, sc1_kbytes_out, sc1_sess_cnt, sc1_sess_rate, sc1_tracked, sc1_trackers, sc2_bytes_in_rate, sc2_bytes_out_rate, sc2_clr_gpc0, sc2_conn_cnt, sc2_conn_cur, sc2_conn_rate, sc2_get_gpc0, sc2_gpc0_rate, sc2_http_err_cnt, sc2_http_err_rate, sc2_http_req_cnt, sc2_http_req_rate, sc2_inc_gpc0, sc2_kbytes_in, sc2_kbytes_out, sc2_sess_cnt, sc2_sess_rate, sc2_tracked, sc2_trackers, sc3_bytes_in_rate, sc3_bytes_out_rate, sc3_clr_gpc0, sc3_conn_cnt, sc3_conn_cur, sc3_conn_rate, sc3_get_gpc0, sc3_gpc0_rate, sc3_http_err_cnt, sc3_http_err_rate, sc3_http_req_cnt, sc3_http_req_rate, sc3_inc_gpc0, sc3_kbytes_in, sc3_kbytes_out, sc3_sess_cnt, sc3_sess_rate, sc3_tracked, sc3_trackers, src_bytes_in_rate, src_bytes_out_rate, src_clr_gpc0, src_conn_cnt, src_conn_cur, src_conn_rate, src_get_gpc0, src_gpc0_rate, src_http_err_cnt, src_http_err_rate, src_http_req_cnt, src_http_req_rate, src_inc_gpc0, src_kbytes_in, src_kbytes_out, src_sess_cnt, src_sess_rate, src_updt_conn_cnt, table_avl, table_cnt, ssl_c_ca_err, ssl_c_ca_err_depth, ssl_c_err, ssl_c_used, ssl_c_verify, ssl_c_version, ssl_f_version, ssl_fc, ssl_fc_alg_keysize, ssl_fc_has_crt, ssl_fc_has_sni, ssl_fc_use_keysize,	2013-06-11 21:22:58 +02:00
Willy Tarreau	9a355ec257	MEDIUM: http: add support for action "set-log-level" in http-request/http-response Some users want to disable logging for certain non-important requests such as stats requests or health-checks coming from another equipment. Other users want to log with a higher importance (eg: notice) some special traffic (POST requests, authenticated requests, requests coming from suspicious IPs) or some abnormally large responses. This patch responds to all these needs at once by adding a "set-log-level" action to http-request/http-response. The 8 syslog levels are supported, as well as "silent" to disable logging.	2013-06-11 17:50:26 +02:00
Willy Tarreau	2b57cb8f30	MEDIUM: protocol: implement a "drain" function in protocol layers Since commit `cfd97c6f` was merged into 1.5-dev14 (BUG/MEDIUM: checks: prevent TIME_WAITs from appearing also on timeouts), some valid health checks sometimes used to show some TCP resets. For example, this HTTP health check sent to a local server : 19:55:15.742818 IP 127.0.0.1.16568 > 127.0.0.1.8000: S 3355859679:3355859679(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742841 IP 127.0.0.1.8000 > 127.0.0.1.16568: S 1060952566:1060952566(0) ack 3355859680 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742863 IP 127.0.0.1.16568 > 127.0.0.1.8000: . ack 1 win 257 19:55:15.745402 IP 127.0.0.1.16568 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:15.745488 IP 127.0.0.1.8000 > 127.0.0.1.16568: FP 1:146(145) ack 23 win 257 19:55:15.747109 IP 127.0.0.1.16568 > 127.0.0.1.8000: R 23:23(0) ack 147 win 257 After some discussion with Chris Huang-Leaver, it appeared clear that what we want is to only send the RST when we have no other choice, which means when the server has not closed. So we still keep SYN/SYN-ACK/RST for pure TCP checks, but don't want to see an RST emitted as above when the server has already sent the FIN. The solution against this consists in implementing a "drain" function at the protocol layer, which, when defined, causes as much as possible of the input socket buffer to be flushed to make recv() return zero so that we know that the server's FIN was received and ACKed. On Linux, we can make use of MSG_TRUNC on TCP sockets, which has the benefit of draining everything at once without even copying data. On other platforms, we read up to one buffer of data before the close. If recv() manages to get the final zero, we don't disable lingering. Same for hard errors. Otherwise we do. In practice, on HTTP health checks we generally find that the close was pending and is returned upon first recv() call. The network trace becomes cleaner : 19:55:23.650621 IP 127.0.0.1.16561 > 127.0.0.1.8000: S 3982804816:3982804816(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650644 IP 127.0.0.1.8000 > 127.0.0.1.16561: S 4082139313:4082139313(0) ack 3982804817 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650666 IP 127.0.0.1.16561 > 127.0.0.1.8000: . ack 1 win 257 19:55:23.651615 IP 127.0.0.1.16561 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:23.651696 IP 127.0.0.1.8000 > 127.0.0.1.16561: FP 1:146(145) ack 23 win 257 19:55:23.652628 IP 127.0.0.1.16561 > 127.0.0.1.8000: F 23:23(0) ack 147 win 257 19:55:23.652655 IP 127.0.0.1.8000 > 127.0.0.1.16561: . ack 24 win 257 This change should be backported to 1.4 which is where Chris encountered this issue. The code is different, so probably the tcp_drain() function will have to be put in the checks only.	2013-06-10 20:33:23 +02:00
Willy Tarreau	6f1615f596	MINOR: counters: add fetch/acl sc*_tracked to indicate whether a counter is tracked Sometimes we'd like to know if a counter is being tracked before adding a header to an outgoing request. These ones do that.	2013-06-10 10:30:09 +02:00
Willy Tarreau	ba2ffd18b5	MEDIUM: counters: add a new "gpc0_rate" counter in stick-tables This counter is special in that instead of reporting the gpc0 cumulative count, it returns its increase rate over the configured period.	2013-05-29 15:54:14 +02:00
Willy Tarreau	e25c917af8	MEDIUM: counters: add support for tracking a third counter We're often missin a third counter to track base, src and base+src at the same time. Here we introduce track_sc3 to have this third counter. It would be wise not to add much more counters because that slightly increases the session size and processing time though the real issue is more the declaration of the keywords in the code and in the doc.	2013-05-29 00:37:16 +02:00
Willy Tarreau	d5ca9abb0d	MINOR: counters: make it easier to extend the amount of tracked counters By properly affecting the flags and values, it becomes easier to add more tracked counters, for example for experimentation. It also slightly reduces the code and the number of tests. No counters were added with this patch.	2013-05-28 17:43:40 +02:00
Willy Tarreau	1e5dfdad77	MINOR: session: only call http_send_name_header() when changing the server Till now we used to call the function until the connection established, which means that the header rewriting was performed for nothing upon each even (eg: uploaded contents) until the server responded or timed out. Now we only call the function when we assign the server.	2013-04-11 18:18:01 +02:00
Willy Tarreau	d86e29d2a1	CLEANUP: acl: remove unused references to ACL_USE_* Now that acl->requires is not used anymore, we can remove all references to it as well as all ACL_USE_* flags.	2013-04-03 02:13:00 +02:00
Willy Tarreau	c48c90dfa5	MAJOR: acl: remove the arg_mask from the ACL definition and use the sample fetch's Now that ACLs solely rely on sample fetch functions, make them use the same arg mask. All inconsistencies have been fixed separately prior to this patch, so this patch almost only adds a new pointer indirection and removes all references to ARG*() in the definitions. The parsing is still performed by the ACL code though.	2013-04-03 02:12:58 +02:00
Willy Tarreau	8ed669b12a	MAJOR: acl: make all ACLs reference the fetch function via a sample. ACL fetch functions used to directly reference a fetch function. Now that all ACL fetches have their sample fetches equivalent, we can make ACLs reference a sample fetch keyword instead. In order to simplify the code, a sample keyword name may be NULL if it is the same as the ACL's, which is the most common case. A minor change appeared, http_auth always expects one argument though the ACL allowed it to be missing and reported as such afterwards, so fix the ACL to match this. This is not really a bug.	2013-04-03 02:12:58 +02:00
Willy Tarreau	281c799e25	MINOR: session: rename sample fetch functions and declare the sample keywords The following sample fetch functions were only usable by ACLs but are now usable by sample fetches too : sc1_bytes_in_rate, sc1_bytes_out_rate, sc1_clr_gpc0, sc1_conn_cnt, sc1_conn_cur, sc1_conn_rate, sc1_get_gpc0, sc1_http_err_cnt, sc1_http_err_rate, sc1_http_req_cnt, sc1_http_req_rate, sc1_inc_gpc0, sc1_kbytes_in, sc1_kbytes_out, sc1_sess_cnt, sc1_sess_rate, sc1_trackers, sc2_bytes_in_rate, sc2_bytes_out_rate, sc2_clr_gpc0, sc2_conn_cnt, sc2_conn_cur, sc2_conn_rate, sc2_get_gpc0, sc2_http_err_cnt, sc2_http_err_rate, sc2_http_req_cnt, sc2_http_req_rate, sc2_inc_gpc0, sc2_kbytes_in, sc2_kbytes_out, sc2_sess_cnt, sc2_sess_rate, sc2_trackers, src_bytes_in_rate, src_bytes_out_rate, src_clr_gpc0, src_conn_cnt, src_conn_cur, src_conn_rate, src_get_gpc0, src_http_err_cnt, src_http_err_rate, src_http_req_cnt, src_http_req_rate, src_inc_gpc0, src_kbytes_in, src_kbytes_out, src_sess_cnt, src_sess_rate, src_updt_conn_cnt, table_avl, table_cnt, The fetch functions have been renamed "smp_fetch_*".	2013-04-03 02:12:58 +02:00
Willy Tarreau	a7a7ebc382	BUG/MINOR: http: don't process abortonclose when request was sent option abortonclose may cause a valid connection to be aborted just after the request has been sent. This is because we check for it during the session establishment sequence before checking for write activity. So if the abort and the connect complete at the same time, the abort is still considered. Let's check for an explicity partial write before aborting. This fix should be backported to 1.4 too.	2012-12-30 00:50:35 +01:00
Willy Tarreau	71241abfd3	MINOR: http: move redirect rule processing to its own function We now have http_apply_redirect_rule() which does all the redirect-specific job instead of having this inside http_process_req_common(). Also one of the benefit gained from uniformizing this code is that both keep-alive and close response do emit the PR-- flags. The fix for the flags could probably be backported to 1.4 though it's very minor. The previous function http_perform_redirect() was becoming confusing so it was renamed http_perform_server_redirect() since it only applies to server-based redirection.	2012-12-28 14:47:19 +01:00
Willy Tarreau	d79a3b248e	BUG/MINOR: log: make log-format, unique-id-format and add-header more independant It happens that all of them call parse_logformat_line() which sets proxy->to_log with a number of flags affecting the line format for all three users. For example, having a unique-id specified disables the default log-format since fe->to_log is tested when the session is established. Similarly, having "option logasap" will cause "+" to be inserted in unique-id or headers referencing some of the fields depending on LW_BYTES. This patch first removes most of the dependency on fe->to_log whenever possible. The first possible cleanup is to stop checking fe->to_log for being null, considering that it always contains at least LW_INIT when any such usage is made of the log-format! Also, some checks are wrong. s->logs.logwait cannot be nulled by "logwait &= ~LW_" since LW_INIT is always there. This results in getting the wrong log at the end of a request or session when a unique-id or add-header is set, because logwait is still not null but the log-format is not checked. Further cleanups are required. Most LW_ flags should be removed or at least replaced with what they really mean (eg: depend on client-side connection, depend on server-side connection, etc...) and this should only affect logging, not other mechanisms. This patch fixes the default log-format and tries to limit interferences between the log formats, but does not pretend to do more for the moment, since it's the most visible breakage.	2012-12-28 09:51:00 +01:00
Willy Tarreau	20d46a5a95	CLEANUP: session: use an array for the stick counters The stick counters were in two distinct sets of struct members, causing some code to be duplicated. Now we use an array, which enables some processing to be performed in loops. This allowed the code to be shrunk by 700 bytes.	2012-12-09 15:57:16 +01:00
Willy Tarreau	2406db4b39	MEDIUM: counters: add sc1_trackers/sc2_trackers Returns the current amount of concurrent connections tracking the same tracked counters. This number is automatically incremented when tracking begins and decremented when tracking stops. It differs from sc1_conn_cur in that it does not rely on any stored information but on the table's reference count (the "use" value which is returned by "show table" on the CLI). This may sometimes be more suited for layer7 tracking.	2012-12-09 14:08:47 +01:00
Willy Tarreau	5d5b5d8eaf	MEDIUM: proto_tcp: add support for tracking L7 information Until now it was only possible to use track-sc1/sc2 with "src" which is the IPv4 source address. Now we can use track-sc1/sc2 with any fetch as well as any transformation type. It works just like the "stick" directive. Samples are automatically converted to the correct types for the table. Only "tcp-request content" rules may use L7 information, and such information must already be present when the tracking is set up. For example it becomes possible to track the IP address passed in the X-Forwarded-For header. HTTP request processing now also considers tracking from backend rules because we want to be able to update the counters even when the request was already parsed and tracked. Some more controls need to be performed (eg: samples do not distinguish between L4 and L6).	2012-12-09 14:08:47 +01:00
Willy Tarreau	0ede5a3318	BUG/MEDIUM: session: fix FD leak when transport layer logging is enabled Commit `2b199c9a` attempted to fix all places where the transport layer is improperly closed, but it missed one place in session_free(). If SSL ciphers are logged, the close() is delayed post-log and performed in session_free(). However, conn_xprt_close() only closes the transport layer but not the file descriptor, resulting in a slow FD leak which is hardly noticeable until the process cannot accept any new connection. A workaround consisted in disabling %sslv/%sslc in log-format. So use conn_full_close() instead of conn_xprt_close() to fix this there too. A similar pending issue existed in the close during outgoing connection failure, though on this side, the transport layer is never tracked at the moment.	2012-12-08 08:48:04 +01:00
Willy Tarreau	20879a0233	MEDIUM: connection: add error reporting for the SSL Get a bit more info in the logs when client-side SSL handshakes fail.	2012-12-03 17:21:52 +01:00
Willy Tarreau	8e3bf699db	MEDIUM: connection: add error reporting for the PROXY protocol header When the PROXY protocol header is expected and fails, leading to an abort of the incoming connection, we now emit a log message. If option dontlognull is set and it was just a port probe, then nothing is logged.	2012-12-03 17:21:51 +01:00
Willy Tarreau	0af2912fd1	MEDIUM: connection: add minimal error reporting in logs for incomplete connections Since the introduction of SSL, it became quite annoying not to get any useful info in logs about handshake failures. Let's improve reporting for embryonic sessions by checking a per-connection error code and reporting it into the logs if an error happens before the session is completely instanciated. The "dontlognull" option is supported in that if a connection does not talk before being aborted, nothing will be emitted. At the moment, only timeouts are considered for SSL and the PROXY protocol, but next patches will handle more errors.	2012-12-03 15:38:23 +01:00
Willy Tarreau	14cba4b0b1	MEDIUM: connection: add an error code in connections This will be needed to improve error reporting, especially for SSL.	2012-12-03 14:22:13 +01:00
Willy Tarreau	8139b9959f	MINOR: compression: make the stats a bit more robust To ensure that we only count when a response was compressed, we also check for the SN_COMP_READY flag which indicates that the compression was effectively initialized. Comp_algo alone is meaningless.	2012-11-27 09:34:00 +01:00
Willy Tarreau	5e16cbc3bd	MINOR: stats: report the total number of compressed responses per front/back Depending on the content-types and accept-encoding fields, some responses might or might not be compressed. Let's have a counter of the number of compressed responses and report it in the stats to help improve compression usage. Some cosmetic issues were fixed in the CSV output too (missing commas at the end).	2012-11-24 14:54:13 +01:00
Willy Tarreau	2b199c9ac3	MEDIUM: connection: provide a common conn_full_close() function Several places got the connection close sequence wrong because it was not obvious. In practice we always need the same sequence when aborting, so let's have a common function for this.	2012-11-23 17:32:21 +01:00
Willy Tarreau	543db62e1f	BUG/MEDIUM: compression: release the zlib pools between keep-alive requests There was a possible memory leak in the zlib code when the first response of a keep-alive session was compressed, because the next request would reset the compression algo, preventing a later call to session_free() from releasing it. The reason is that it is necessary to release the assigned resources in http_end_txn_clean_session().	2012-11-15 16:41:22 +01:00
William Lallemand	ec3e3890f0	BUG/MINOR: compression: deinit zlib only when required The zlib stream was deinitialized even when the init failed.	2012-11-15 15:42:17 +01:00
Willy Tarreau	3fdb366885	MAJOR: connection: replace struct target with a pointer to an enum Instead of storing a couple of (int, ptr) in the struct connection and the struct session, we use a different method : we only store a pointer to an integer which is stored inside the target object and which contains a unique type identifier. That way, the pointer allows us to retrieve the object type (by dereferencing it) and the object's address (by computing the displacement in the target structure). The NULL pointer always corresponds to OBJ_TYPE_NONE. This reduces the size of the connection and session structs. It also simplifies target assignment and compare. In order to improve the generated code, we try to put the obj_type element at the beginning of all the structs (listener, server, proxy, si_applet), so that the original and target pointers are always equal. A lot of code was touched by massive replaces, but the changes are not that important.	2012-11-12 00:42:33 +01:00
Willy Tarreau	b31c971bef	CLEANUP: channel: remove any reference of the hijackers Hijackers were functions designed to inject data into channels in the distant past. They became unused around 1.3.16, and since there has not been any user of this mechanism to date, it's uncertain whether the mechanism still works (and it's not really useful anymore). So better remove it as well as the pointer it uses in the channel struct.	2012-11-11 23:05:39 +01:00
Willy Tarreau	7f7ad91056	BUILD: stream_interface: remove si_fd() and its references si_fd() is not used a lot, and breaks builds on OpenBSD 5.2 which defines this name for its own purpose. It's easy enough to remove this one-liner function, so let's do it.	2012-11-11 20:53:29 +01:00
Willy Tarreau	815f5ecffa	BUG/MINOR: session: mark the handshake as complete earlier There is a small waste of CPU cycles when no handshake is required on an accepted connection, because we had to perform one call to conn_fd_handler() to mark the connection CONNECTED and to call process_session() again to say that nothing happened. By marking the connection CONNECTED when there is no pending handshake, we avoid this extra call to process_session().	2012-11-09 22:09:08 +01:00
Willy Tarreau	798f4325fa	OPTIM: session: don't process the whole session when only timers need a refresh Having a global expiration timer for a task means that the tasks are regularly woken up (at least after each expiration timer). It's totally useless and counter productive to process the whole session upon each such wakeup, and it's fairly easy to detect such wakeups, so let's just update the task's timer and return to sleep when this happens. For 100k concurrent connections with 10s of timeouts, this can save 10k wakeups per second, which is not bad.	2012-11-08 16:55:07 +01:00
William Lallemand	1c2d622d82	CLEANUP: use struct comp_ctx instead of union Replace union comp_ctx by struct comp_ctx. Use struct comp_ctx * in the init/add_data/flush/reset/end prototypes of compression.h functions.	2012-11-05 10:23:16 +01:00
Willy Tarreau	e3224e870f	BUG/MINOR: session: ensure that we don't retry connection if some data were sent With extra-large buffers, it is possible that a lot of data are sent upon connection establishment before the session is notified. The issue is how to handle a send() error after some data were actually sent. At the moment, only a connection error is reported, causing a new connection attempt and send() to restart after the last data. We absolutely don't want to retry the connect() if at least one byte was sent, because those data are lost. The solution consists in reporting exactly what happens, which is : - a successful connection attempt - a read/write error on the channel That way we go on with sess_establish(), the response analysers are called and report the appropriate connection state for the error (typically a server abort while waiting for a response). This mechanism also guarantees that we won't retry since it's a success. The logs also report the correct connect time. Note that 1.4 is not directly affected because it only attempts one send(), so it cannot detect a send() failure here and distinguish it form a failed connection attempt. So no backport is needed. Also, this is just a safe belt we're taking, since this issue should not happen anymore since previous commit.	2012-10-29 23:31:04 +01:00
Willy Tarreau	19d14ef104	MEDIUM: make the trash be a chunk instead of a char * The trash is used everywhere to store the results of temporary strings built out of s(n)printf, or as a storage for a chunk when chunks are needed. Using global.tune.bufsize is not the most convenient thing either. So let's replace trash with a chunk and directly use it as such. We can then use trash.size as the natural way to get its size, and get rid of many intermediary chunks that were previously used. The patch is huge because it touches many areas but it makes the code a lot more clear and even outlines places where trash was used without being that obvious.	2012-10-29 16:57:30 +01:00
Willy Tarreau	f2943dccd0	MAJOR: session: detach the connections from the stream interfaces We will need to be able to switch server connections on a session and to keep idle connections. In order to achieve this, the preliminary requirement is that the connections can survive the session and be detached from them. Right now they're still allocated at exactly the same place, so when there is a session, there are always 2 connections. We could soon improve on this by allocating the outgoing connection only during a connect(). This current patch touches a lot of code and intentionally does not change any functionnality. Performance tests show no regression (even a very minor improvement). The doc has not yet been updated.	2012-10-26 20:15:20 +02:00
Willy Tarreau	c919dc66a3	CLEANUP: remove trashlen trashlen is a copy of global.tune.bufsize, so let's stop using it as a duplicate, fall back to the original bufsize, it's less confusing this way.	2012-10-26 20:04:27 +02:00
William Lallemand	82fe75c1a7	MEDIUM: HTTP compression (zlib library support) This commit introduces HTTP compression using the zlib library. http_response_forward_body has been modified to call the compression functions. This feature includes 3 algorithms: identity, gzip and deflate: * identity: this is mostly for debugging, and it was useful for developping the compression feature. With Content-Length in input, it is making each chunk with the data available in the current buffer. With chunks in input, it is rechunking, the output chunks will be bigger or smaller depending of the size of the input chunk and the size of the buffer. Identity does not apply any change on data. * gzip: same as identity, but applying a gzip compression. The data are deflated using the Z_NO_FLUSH flag in zlib. When there is no more data in the input buffer, it flushes the data in the output buffer (Z_SYNC_FLUSH). At the end of data, when it receives the last chunk in input, or when there is no more data to read, it writes the end of data with Z_FINISH and the ending chunk. * deflate: same as gzip, but with deflate algorithm and zlib format. Note that this algorithm has ambiguous support on many browsers and no support at all from recent ones. It is strongly recommended not to use it for anything else than experimentation. You can't choose the compression ratio at the moment, it will be set to Z_BEST_SPEED (1), as tests have shown very little benefit in terms of compression ration when going above for HTML contents, at the cost of a massive CPU impact. Compression will be activated depending of the Accept-Encoding request header. With identity, it does not take care of that header. To build HAProxy with zlib support, use USE_ZLIB=1 in the make parameters. This work was initially started by David Du Colombier at Exceliance.	2012-10-26 02:30:48 +02:00
Willy Tarreau	c93f7959e5	CLEANUP: session: remove term_trace which is not used anymore This field was used to trace precisely where a session was terminated but it did not survive code rearchitecture and was not used at all anymore. Let's get rid of it.	2012-10-13 11:10:30 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	394db379eb	REORG: http: rename msg->buf to msg->chn since it's a channel It's extremely confusing to have all those msg->buf->buf everywhere after the extraction of the buffer from the channel. Let's clean this up.	2012-10-12 22:40:39 +02:00
Willy Tarreau	93dbc2bc0e	MEDIUM: log: add a new LW_XPRT flag to pin the transport layer This flag will have to be set on log tags which require transport layer information. They will prevent the conn_xprt_close() call from releasing the transport layer too early.	2012-10-12 20:30:51 +02:00
Willy Tarreau	1e954913de	MEDIUM: connection: add a flag to hold the transport layer When we start logging SSL information, we need the SSL struct to be present even past the conn_xprt_close() call. In order to achieve this, we should use refcounting on the connection and the transport layer. At the moment it's not worth using plain refcounting as only the logs require this, so instead of real refcounting we just use a flag which will be set by the log subsystem when SSL data need to be logged. What happens then is that the xprt->close() call is ignored and the transport layer is closed again during session_free(), after the log line is emitted.	2012-10-12 20:30:50 +02:00
Willy Tarreau	91083f5c8f	BUG/MEDIUM: session: enable the conn_session_update() callback This callback was introduced by commit `9683e9a0` but never enabled because the CO_FL_WAKE_DATA flag was not set. The result is that this function is never called when an SSL handshake fails, so the connection is only closed on timeout.	2012-10-12 20:30:38 +02:00
Willy Tarreau	e9909f4e50	BUG/MINOR: session: fix some leftover from debug code Commit `82569f91` moved the health and monitor-net checks to session.c but a debug test introduced 0& to disable MSG_DONTWAIT in the recv() call and this debug code remained there. Since the socket is marked non-blocking, there should be no effect but it's dangerous to keep such a thing here.	2012-10-12 17:36:40 +02:00
Willy Tarreau	1bc4aab290	MEDIUM: listener: add support for linux's accept4() syscall On Linux, accept4() does the same as accept() except that it allows the caller to specify some flags to set on the resulting socket. We use this to set the O_NONBLOCK flag and thus to save one fcntl() call in each connection. The effect is a small performance gain of around 1%. The option is automatically enabled when target linux2628 is set, or when the USE_ACCEPT4 Makefile variable is set. If the libc is too old to provide the equivalent function, this is automatically detected and our own function is used instead. In any case it is possible to force the use of our implementation with USE_MY_ACCEPT4.	2012-10-08 20:11:03 +02:00
Willy Tarreau	9683e9a05f	MEDIUM: session: register a data->wake callback to process errors The connection layer will soon call ->wake() only when errors happen, and not ->init(). So make the session layer use this callback to detect errors and abort connections.	2012-10-04 22:26:10 +02:00
Willy Tarreau	071e137ec2	MEDIUM: connection: use a generic data-layer init() callback The generic data-layer init callback is now used after the transport layer is complete and before calling the data layer recv/send callbacks. This allows the session to switch from the embryonic session data layer to the complete stream interface data layer, by making conn_session_complete() the data layer's init callback. It sill looks awkwards that the init() callback must be used opon error, but except by adding yet another one, it does not seem to be mergeable into another function (eg: it should probably not be merged with ->wake to avoid unneeded calls during the handshake, though semantically that would make sense).	2012-10-04 22:26:10 +02:00
Willy Tarreau	5e75e2755e	MEDIUM: session: use a specific data_cb for embryonic sessions We don't want to have the recv or send callbacks in embryonic sessions, and we want the stream interface to be referenced as the connection owner only once the session is instanciated. So let's first have the embryonic session be the owner, then replaced later by the stream interface once the transport layer is ready.	2012-10-04 22:26:10 +02:00
Willy Tarreau	4aa3683b2d	MINOR: connection: provide a generic data layer wakeup callback Instead of calling conn_notify_si() from the connection handler, we now call data->wake(), which will allow us to use a different callback with health checks. Note that we still rely on a flag in order to decide whether or not to call this function. The reason is that with embryonic sessions, the callback is already initialized to si_conn_cb without the flag, and we can't call the SI notify function in the leave path before the stream interface is initialized. This issue should be addressed by involving a different data_cb for embryonic sessions and for stream interfaces, that would be changed during session_complete() for the final data_cb.	2012-10-04 22:26:10 +02:00
Willy Tarreau	f7bc57ca6e	REORG: connection: rename the data layer the "transport layer" While working on the changes required to make the health checks use the new connections, it started to become obvious that some naming was not logical at all in the connections. Specifically, it is not logical to call the "data layer" the layer which is in charge for all the handshake and which does not yet provide a data layer once established until a session has allocated all the required buffers. In fact, it's more a transport layer, which makes much more sense. The transport layer offers a medium on which data can transit, and it offers the functions to move these data when the upper layer requests this. And it is the upper layer which iterates over the transport layer's functions to move data which should be called the data layer. The use case where it's obvious is with embryonic sessions : an incoming SSL connection is accepted. Only the connection is allocated, not the buffers nor stream interface, etc... The connection handles the SSL handshake by itself. Once this handshake is complete, we can't use the data functions because the buffers and stream interface are not there yet. Hence we have to first call a specific function to complete the session initialization, after which we'll be able to use the data functions. This clearly proves that SSL here is only a transport layer and that the stream interface constitutes the data layer. A similar change will be performed to rename app_cb => data, but the two could not be in the same commit for obvious reasons.	2012-10-04 22:26:09 +02:00
Willy Tarreau	e603e69d18	MEDIUM: connection: make use of the owner instead of container_of This way the connection can become independant on the stream interface.	2012-09-28 00:01:23 +02:00
Willy Tarreau	82569f9158	MEDIUM: monitor: simplify handling of monitor-net and mode health We were having several different behaviours with monitor-net and "mode health" : - monitor-net on TCP connections was evaluated just after accept(), did not count a connection on the frontend and were not subject to tcp-request connection rules, and caused an immediate close(). - monitor-net in HTTP mode was evaluated once the session was accepted (eg: on top of SSL), returned "HTTP/1.0 200 OK\r\n\r\n" over the connection's data layer and instanciated a session which was responsible for closing this connection. A connection AND a session were counted for the frontend ; - "mode health" with "option httpchk" would do exactly the same as monitor-net in HTTP mode ; - "mode health" without "option httpchk" would do the same as above except that "OK" was returned instead of "HTTP/1.0 200 OK\r\n\r\n". None of them took care of cleaning the input buffer, sometimes resulting in a TCP reset to be emitted after the last packet if a request was received over the connection. Given the inconsistencies and the complexity in keeping all these features handled at the right position, we now slightly changed the way they are handled : - all of them are handled just after the "tcp-request connection" rules, so that all of them may be blocked using such rules, offering more flexibility and consistency ; - no connection handshake is performed anymore for non-TCP modes - all of them send the response as raw data over the socket, there is no more difference between TCP and HTTP mode for example (these rules were never meant to be served over SSL connections and were never documented as able to do that). - any possible pending data on the incoming socket is drained before the response is sent, in order to avoid the risk of a reset. - none of them exactly did what was documented ! This results in more consistent, more flexible and more accurate handling of monitor rules, with smaller and more robust code.	2012-09-28 00:01:22 +02:00
Cyril Bonté	3aaba440a2	BUILD: fix compilation error with DEBUG_FULL Recent changes in structures broke the compilation when using DEBUG_FULL. Let's update apply the changes also to the variables used in DPRINTF calls.	2012-09-24 20:36:39 +02:00
Willy Tarreau	d1d5454180	REORG: split "protocols" files into protocol and listener It was becoming confusing to have protocols and listeners in the same files, split them.	2012-09-15 22:29:32 +02:00
Willy Tarreau	cbaaec475c	MINOR: session: do not send an HTTP/500 error on SSL sockets If a session fails its initialization, we don't want to send HTTP/500 over the socket if it's not a raw data layer.	2012-09-06 11:32:07 +02:00
Willy Tarreau	783f25800c	BUILD: http: rename error_message http_error_message to fix conflicts on RHEL Duncan Hall reported a build issue on CentOS where error_message conflicts with another system declaration when SSL is enabled. Rename the function.	2012-09-04 12:19:04 +02:00
Willy Tarreau	dd2f85eb3b	CLEANUP: includes: fix includes for a number of users of fd.h It appears that fd.h includes a number of unneeded files and was included from standard.h, and as such served as an intermediary to provide almost everything to everyone. By removing its useless includes, a long dependency chain broke but could easily be fixed.	2012-09-03 20:49:14 +02:00
Willy Tarreau	40ff59d820	CLEANUP: fd: remove fdtab->flags These flags were added for TCP_CORK. They were only set at various places but never checked by any user since TCP_CORK was replaced with MSG_MORE. Simply get rid of this now.	2012-09-03 20:49:14 +02:00
Willy Tarreau	74172ff9c3	CLEANUP: frontend: remove the old proxy protocol decoder This one used to rely on a stream analyser which was inappropriate. It's not used anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	22cda21ad5	MAJOR: connection: make the PROXY decoder a handshake handler The PROXY protocol is now decoded in the connection before other handshakes. This means that it may be extracted from a TCP stream before SSL is decoded from this stream.	2012-09-03 20:47:35 +02:00
Willy Tarreau	2542b53b19	MAJOR: session: introduce embryonic sessions When an incoming connection request is accepted, a connection structure is needed to store its state. However we don't want to fully initialize a session until the data layer is about to be ready. As long as the connection is physically stored into the session, it's not easy to split both allocations. As such, we only initialize the minimum requirements of a session, which results in what we call an embryonic session. Then once the data layer is ready, we can complete the function's initialization. Doing so avoids buffers allocation and ensures that a session only sees ready connections. The frontend's client timeout is used as the handshake timeout. It is likely that another timeout will be used in the future.	2012-09-03 20:47:35 +02:00
Willy Tarreau	15678efc45	MEDIUM: connection: add an ->init function to data layer SSL need to initialize the data layer before proceeding with data. At the moment, this data layer is automatically initialized from itself, which will not be possible once we extract connection from sessions since we'll only create the data layer once the handshake is finished. So let's have the application layer initialize the data layer before using it.	2012-09-03 20:47:34 +02:00
Willy Tarreau	64ee491309	MINOR: tcp: replace tcp_src_to_stktable_key with addr_to_stktable_key Make it more obvious that this function does not depend on any knowledge of the session. This is important to plan for TCP rules that can run on connection without any initialized session yet.	2012-09-03 20:47:34 +02:00
Willy Tarreau	93b0f4f6c6	MEDIUM: stream_interface: remove CAP_SPLTCP/CAP_SPLICE flags These ones are implicitly handled by the connection's data layer, no need to rely on them anymore and reaching them maintains undesired dependences on stream-interface.	2012-09-03 20:47:34 +02:00
Willy Tarreau	986a9d2d12	MAJOR: connection: move the addr field from the stream_interface We need to have the source and destination addresses in the connection. They were lying in the stream interface so let's move them. The flags SI_FL_FROM_SET and SI_FL_TO_SET have been moved as well. It's worth noting that tcp_connect_server() almost does not use the stream interface anymore except for a few flags. It has been identified that once we detach the connection from the SI, it will probably be needed to keep a copy of the server-side addresses in the SI just for logging purposes. This has not been implemented right now though.	2012-09-03 20:47:34 +02:00
Willy Tarreau	3cefd521fa	REORG: connection: move the target pointer from si to connection The target is per connection and is directly used by the connection, so we need it there. It's not needed anymore in the SI however.	2012-09-03 20:47:34 +02:00
Willy Tarreau	8263d2b259	CLEANUP: channel: use "channel" instead of "buffer" in function names This is a massive rename of most functions which should make use of the word "channel" instead of the word "buffer" in their names. In concerns the following ones (new names) : unsigned long long channel_forward(struct channel buf, unsigned long long bytes); static inline void channel_init(struct channel buf) static inline int channel_input_closed(struct channel buf) static inline int channel_output_closed(struct channel buf) static inline void channel_check_timeouts(struct channel b) static inline void channel_erase(struct channel buf) static inline void channel_shutr_now(struct channel buf) static inline void channel_shutw_now(struct channel buf) static inline void channel_abort(struct channel buf) static inline void channel_stop_hijacker(struct channel buf) static inline void channel_auto_connect(struct channel buf) static inline void channel_dont_connect(struct channel buf) static inline void channel_auto_close(struct channel buf) static inline void channel_dont_close(struct channel buf) static inline void channel_auto_read(struct channel buf) static inline void channel_dont_read(struct channel buf) unsigned long long channel_forward(struct channel *buf, unsigned long long bytes) Some functions provided by channel.[ch] have kept their "buffer" name because they are really designed to act on the buffer according to some information gathered from the channel. They have been moved together to the same place in the file for better readability but they were not changed at all. The "buffer" memory pool was also renamed "channel".	2012-09-03 20:47:33 +02:00
Willy Tarreau	03cdb7c678	CLEANUP: channel: usr CF_/CHN_ prefixes instead of BF_/BUF_ Get rid of these confusing BF_* flags. Now channel naming should clearly be used everywhere appropriate. No code was changed, only a renaming was performed. The comments about channel operations was updated.	2012-09-03 20:47:33 +02:00
Willy Tarreau	3bf1b2b816	MAJOR: channel: stop relying on BF_FULL to take action This flag is quite complex to get right and updating it everywhere is a major pain, especially since the buffer/channel split. This is the first step of getting rid of it. Instead now it's dynamically computed whenever needed.	2012-09-03 20:47:33 +02:00
Willy Tarreau	a75bcef867	REORG: buffer: move buffer_flush, b_adv and b_rew to buffer.h These one now operate over real buffers, not channels anymore.	2012-09-03 20:47:32 +02:00
Willy Tarreau	8e21bb9e52	MAJOR: channel: remove the BF_OUT_EMPTY flag This flag was very problematic because it was composite in that both changes to the pipe or to the buffer had to cause this flag to be updated, which is not always simple (eg: there may not even be a channel attached to a buffer at all). There were not that many users of this flags, mostly setters. So the flag got replaced with a macro which reports whether the channel is empty or not, by checking both the pipe and the buffer. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's output size is more useful than checking the flag's replacement.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c7e4238df0	REORG: buffers: split buffers into chunk,buffer,channel Many parts of the channel definition still make use of the "buffer" word.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c578891112	CLEANUP: connection: split sock_ops into data_ops, app_cp and si_ops Some parts of the sock_ops structure were only used by the stream interface and have been moved into si_ops. Some of them were callbacks to the stream interface from the connection and have been moved into app_cp as they're the application seen from the connection (later, health-checks will need to use them). The rest has moved to data_ops. Normally at this point the connection could live without knowing about stream interfaces at all.	2012-09-03 20:47:31 +02:00
Willy Tarreau	96199b1016	MAJOR: stream-interface: restore splicing mechanism The splicing is now provided by the data-layer rcv_pipe/snd_pipe functions which in turn are called by the stream interface's recv and send callbacks. The presence of the rcv_pipe/snd_pipe functions is used to attest support for splicing at the data layer. It looks like the stream-interface's SI_FL_CAP_SPLICE flag does not make sense anymore as it's used as a proxy for the pointers above. It also appears that we call chk_snd() from the recv callback and then try to call it again in update_conn(). It is very likely that this last function will progressively slip into the recv/send callbacks in order to avoid duplicate check code. The code works right now with and without splicing. Only raw_sock provides support for it and it is automatically selected when the various splice options are set. However it looks like splice-auto doesn't enable it, which possibly means that the streamer detection code does not work anymore, or that it's only called at a time where it's too late to enable splicing (in process_session).	2012-09-03 20:47:31 +02:00
Willy Tarreau	75bf2c925f	REORG: sock_raw: rename the files raw_sock* The "raw_sock" prefix will be more convenient for naming functions as it will be prefixed with the data layer and suffixed with the data direction. So let's rename the files now to avoid any further confusion. The #include directive was also removed from a number of files which do not need it anymore.	2012-09-02 21:54:56 +02:00
Willy Tarreau	572bf9095d	REORG/MAJOR: extract "struct buffer" from "struct channel" At the moment, the struct is still embedded into the struct channel, but all the functions have been updated to use struct buffer only when possible, otherwise struct channel. Some functions would likely need to be splitted between a buffer-layer primitive and a channel-layer function. Later the buffer should become a pointer in the struct buffer, but doing so requires a few changes to the buffer allocation calls.	2012-09-02 21:54:56 +02:00
Willy Tarreau	7421efb85f	REORG/MAJOR: use "struct channel" instead of "struct buffer" This is a massive rename. We'll then split channel and buffer. This change needs a lot of cleanups. At many locations, the parameter or variable is still called "buf" which will become ambiguous. Also, the "struct channel" is still defined in buffers.h.	2012-09-02 21:54:55 +02:00
Willy Tarreau	f9dabecd03	MEDIUM: connection: make use of the new polling functions Now the connection handler, the handshake callbacks and the I/O callbacks make use of the connection-layer polling functions to enable or disable polling on a file descriptor. Some changes still need to be done to avoid using the FD_WAIT_* constants.	2012-09-02 21:53:11 +02:00
Willy Tarreau	49b046dddf	MAJOR: fd: replace all EV_FD_* macros with new fd__ inline calls These functions have a more explicity meaning and will offer provisions for explicit polling. EV_FD_ISSET() has been left for now as it is still in use in checks.	2012-09-02 21:53:11 +02:00
Willy Tarreau	8b117082bc	REORG: connection: replace si_data_close() with conn_data_close() This close function only applies to connection-specific parts and the stream-interface entry may soon disappear. Move this to the connection instead.	2012-09-02 21:53:10 +02:00
Willy Tarreau	fd31e53139	MAJOR: remove the stream interface and task management code from sock_* The socket data layer code must only focus on moving data between a socket and a buffer. We need a special stream interface handler to update the stream interface and the file descriptor status. At the moment the code works but suffers from a race condition caused by its API : the read/write callbacks still make use of the fd instead of using the connection. And when a double shutdown is performed, a call to ->write() after ->read() processed an error results in dereferencing a NULL fdtab[]->owner. This is only a temporary issue which doesn't need to be fixed now since this will automatically go away when the functions change to use the connection instead.	2012-09-02 21:53:08 +02:00
Willy Tarreau	076be25ab8	CLEANUP: remove the now unused fdtab direct I/O callbacks They were all left to NULL since last commit so we can safely remove them all now and remove the temporary dual polling logic in pollers.	2012-09-02 21:51:29 +02:00
Willy Tarreau	8018471f44	MINOR: fd: make fdtab->owner a connection and not a stream_interface anymore It is more convenient with a connection here and will abstract stream_interface more easily.	2012-09-02 21:51:28 +02:00
Willy Tarreau	d2274c6536	MAJOR: connection: replace direct I/O callbacks with the connection callback Almost all direct I/O callbacks have been changed to use the connection callback instead. Only the TCP connection validation remains.	2012-09-02 21:51:28 +02:00
Willy Tarreau	4e6049e553	MINOR: fd: add a new I/O handler to fdtab This one will eventually replace both cb[] handlers. At the moment it is not used yet.	2012-09-02 21:51:27 +02:00
Willy Tarreau	505e34a36d	MAJOR: get rid of fdtab[].state and use connection->flags instead fdtab[].state was only used to know whether a connection was in progress or an error was encountered. Instead we now use connection->flags to store a flag for both. This way, connection management will be able to update the connection status on I/O.	2012-09-02 21:51:26 +02:00
Willy Tarreau	db3b32610f	REORG/MEDIUM: fd: remove FD_STCLOSE from struct fdtab In an attempt to get rid of fdtab[].state, and to move the relevant parts to the connection struct, we remove the FD_STCLOSE state which can easily be deduced from the <owner> pointer as there is a 1:1 match.	2012-09-02 21:51:25 +02:00
Willy Tarreau	96596aeead	MEDIUM: fd/si: move peeraddr from struct fdinfo to struct connection The destination address is purely a connection thing and not an fd thing. It's also likely that later the address will be stored into the connection and linked to by the SI. struct fdinfo only keeps the pointer to the port range and the local port for now. All of this also needs to move to the connection but before this the release of the port range must move from fd_delete() to a new function dedicated to the connection.	2012-06-08 22:59:52 +02:00
Emeric Brun	d88fd824b7	MEDIUM: protocol: add a pointer to struct sock_ops to the listener struct The listener struct is now aware of the socket layer to use upon accept(). At the moment, only sock_raw is supported so this patch should not change anything.	2012-05-21 22:22:39 +02:00
Emeric Brun	21adb02d19	MINOR: stream_interface: add a pointer to the listener for TARG_TYPE_CLIENT When the target is a client, it will be convenient to have a pointer to the original listener so that we can retrieve some configuration information at the stream interface level.	2012-05-21 22:22:39 +02:00
Willy Tarreau	4da69a91a0	MEDIUM: stream_interface: call si_data_close() before releasing the si This will ensure that the data layer releases anything previously allocated.	2012-05-21 18:07:11 +02:00
Willy Tarreau	fb7508aefb	REORG/MINOR: stream_interface: move si->fd to struct connection The socket fd is used only when in socket mode and with a connection.	2012-05-21 16:47:54 +02:00
Willy Tarreau	73b013b070	MINOR: stream_interface: introduce a new "struct connection" type We start to move everything needed to manage a connection to a special entity "struct connection". We have the data layer operations and the control operations there. We'll also have more info in the future such as file descriptors and applet contexts, so that in the end it becomes detachable from the stream interface, which will allow connections to be reused between sessions. For now on, we start with minimal changes.	2012-05-21 16:31:45 +02:00
Willy Tarreau	fe7f1ea68e	REORG/MINOR: session: detect the TCP monitor checks at the protocol accept It does not make sense anymore to wait for a session creation to process a TCP monitor check which only closes the connection and returns. Better to process this immediately after the accept() return. It also saves us from counting a connection for monitor checks, which is much more logical.	2012-05-20 19:22:25 +02:00
Willy Tarreau	be0688c64d	MEDIUM: stream_interface: remove the si->init Calling the init() function in sess_establish was a bad idea, it is too late to allow it to fail on lack of resource and does not help at all. Remove it for now before it's used.	2012-05-18 15:15:26 +02:00
Willy Tarreau	7bb68abb9f	OPTIM/MEDIUM: stream_interface: add a new SI_FL_NOHALF flag This flag indicates that we're not interested in keeping half-open connections on a stream interface. It has the benefit of allowing the socket layer to cause an immediate write close when detecting an incoming read close. This releases resources much faster and saves one syscall (either a shutdown or setsockopt). This flag is only set by HTTP on the interface going to the server since we don't want to continue pushing data there when it has closed. Another benefit is that it responds with a FIN to a server's FIN instead of responding with an RST as it used to, which is much cleaner. Performance gains of 7.5% have been measured on HTTP connection rate on empty objects.	2012-05-13 14:52:22 +02:00
Willy Tarreau	b147a8382a	CLEANUP: fd: remove unused cb->b pointers in the struct fdtab These pointers were used to hold pointers to buffers in the past, but since we introduced the stream interface, they're no longer used but they were still sometimes set. Removing them shrink the struct fdtab from 32 to 24 bytes on 32-bit machines, and from 52 to 36 bytes on 64-bit machines, which is a significant saving. A quick tests shows a steady 0.5% performance gain, probably due to the better cache efficiency.	2012-05-13 00:35:44 +02:00
Willy Tarreau	ce887fd3b2	MEDIUM: session: add support for tunnel timeouts Tunnel timeouts are used when TCP connections are forwarded, or when forwarding upgraded HTTP connections (WebSocket) as well as CONNECT requests to proxies. This timeout allows long-lived sessions to be supported without having to set large timeouts to normal requests.	2012-05-12 12:50:00 +02:00
Willy Tarreau	2f5b6fc090	MINOR: session: call the socket layer init function when a session establishes In sess_establish, once we've prepared everythin, we can call the socket layer init function. We pass an argument for targets which have one (eg: servers). At the moment, the existing socket layers don't have init functions, but SSL will need one.	2012-05-12 08:09:27 +02:00
Willy Tarreau	f873d754f8	CLEANUP: stream_interface: stop exporting socket layer functions Similarly to the previous patch, we don't need the socket-layer functions outside of stream_interface. They could even move to a file dedicated to applets, though that does not seem particularly useful at the moment.	2012-05-11 17:47:17 +02:00
Willy Tarreau	b277d6e568	CLEANUP: sock_raw: remove last references to stream_sock We also stop exporting all functions since they're not needed anymore outside of sock_raw.c.	2012-05-11 17:03:42 +02:00
Willy Tarreau	1539a01645	MINOR: stream_interface: add a client target : TARG_TYPE_CLIENT This one will be used to identify the direction the SI is being used. All incoming connections have a target of type TARG_TYPE_CLIENT.	2012-05-11 14:47:34 +02:00
Willy Tarreau	c63190d429	REORG: use the name sock_raw instead of stream_sock We'll soon have an SSL socket layer, and in order to ease the difference between the two, we use the name "sock_raw" to designate the one which directly talks to the sockets without any conversion.	2012-05-11 14:23:52 +02:00
Willy Tarreau	0a3dd74c9c	MEDIUM: cfgparse: use the new error reporting framework for remaining cfg_keywords All keywords registered using a cfg_kw_list now make use of the new error reporting framework. This allows easier and more precise error reporting without having to deal with complex buffer allocation issues.	2012-05-08 21:28:17 +02:00
Willy Tarreau	bd83314ee9	BUG/MEDIUM: log: ensure that unique_id is properly initialized Last memory poisonning patch immediately made this issue appear. The unique_id field is released but not properly initialized. The feature was introduced very recently, no backport is needed.	2012-05-08 21:28:16 +02:00
Willy Tarreau	63e7fe310e	BUG/MEDIUM: send_proxy: fix initialisation of send_proxy_ofs Commit `b22e55bc` introduced send_proxy_ofs but forgot to initialize it, which remained unnoticed since it's always at the same place in the stream interface. On a machine with dirty RAM returned by malloc(), some responses were holding a PROXY header, which normally is not possible. The problem goes away after properly initializing the field upon each new session_accept(). This fix does not need to be backported except if any code makes use of a backport of this feature.	2012-05-08 21:28:16 +02:00
Willy Tarreau	26d8c59f0b	REORG/MEDIUM: replace stream interface protocol functions by a proto pointer The stream interface now makes use of the socket protocol pointer instead of the direct functions.	2012-05-08 21:28:15 +02:00
Willy Tarreau	5c979a9c71	REORG/MEDIUM: stream_interface: initialize socket ops from descriptors	2012-05-08 21:28:14 +02:00
Willy Tarreau	1b79bdee26	REORG/MEDIUM: move protocol->{read,write} to sock_ops The protocol must not set the read and write callbacks, they're specific to the socket layer. Move them to sock_ops instead.	2012-05-08 21:28:14 +02:00
Willy Tarreau	060781fb4a	REORG: stream_interface: create a struct sock_ops to hold socket operations These operators are used regardless of the socket protocol family. Move them to a "sock_ops" struct. ->read and ->write have been moved there too as they have no reason to remain at the protocol level.	2012-05-08 21:28:14 +02:00
Willy Tarreau	cd3b094618	REORG: rename "pattern" files They're now called "sample" everywhere to match their description.	2012-05-08 20:57:21 +02:00
Willy Tarreau	1278578487	REORG: use the name "sample" instead of "pattern" to designate extracted data This is mainly a massive renaming in the code to get it in line with the calling convention. Next patch will rename a few files to complete this operation.	2012-05-08 20:57:20 +02:00
Willy Tarreau	32a6f2e572	MEDIUM: acl/pattern: use the same direction scheme Patterns were using a bitmask to indicate if request or response was desired in fetch functions and keywords. ACLs were using a bitmask in fetch keywords and a single bit in fetch functions. ACLs were also using an ACL_PARTIAL bit in fetch functions indicating that a non-final fetch was performed, which was an abuse of the existing direction flag. The change now consists in using : - a capabilities field for fetch keywords => SMP_CAP_REQ/RES to indicate if a keyword supports requests, responses, both, etc... - an option field for fetch functions to indicate what the caller expects (request/response, final/non-final) The ACL_PARTIAL bit was reversed to get SMP_OPT_FINAL as it's more explicit to know we're working on a final buffer than on a non-final one. ACL_DIR_* were removed, as well as PATTERN_FETCH_*. L4 fetches were improved to support being called on responses too since they're still available. The <dir> field of all fetch functions was changed to <opt> which is now unsigned. The patch is large but mostly made of cosmetic changes to accomodate this, as almost no logic change happened.	2012-05-08 20:57:17 +02:00
Willy Tarreau	24e32d8c6b	MEDIUM: acl: replace acl_expr with args in acl fetch_* functions Having the args everywhere will make it easier to share fetch functions between patterns and ACLs. The only place where we could have needed the expr was in the http_prefetch function which can do well without.	2012-05-08 20:57:16 +02:00
Willy Tarreau	f853c46bc3	MEDIUM: pattern/acl: get rid of temp_pattern in ACLs This one is not needed anymore as we can return the data and its type in the sample provided by the caller. ACLs now always return the proper type. BOOL is already returned when the result is expected to be processed as a boolean. temp_pattern has been unexported now.	2012-05-08 20:57:14 +02:00
Willy Tarreau	3740635b88	MAJOR: acl: make use of the new sample struct and get rid of acl_test This change is invasive in lines of code but not much in terms of functionalities as it's mainly a replacement of struct acl_test with struct sample.	2012-05-08 20:57:14 +02:00
Willy Tarreau	422aa0792d	MEDIUM: pattern: add new sample types to replace pattern types The new sample types are necessary for the acl-pattern convergence. These types are boolean and signed int. Some types were renamed for less ambiguity (ip->ipv4, integer->uint).	2012-05-08 20:57:14 +02:00
Willy Tarreau	0146c2e873	MEDIUM: acl: remove unused tests for missing args when args are mandatory A number of ACL fetch methods use mandatory arguments (eg: proxy names) so it's pointless to test for the presence of this argument now.	2012-05-08 20:57:12 +02:00
Willy Tarreau	fc2c1fd449	MAJOR: acl: ensure that implicit table and proxies are valid A large number of ACLs make use of frontend, backend or table names in their arguments, and fall back to the current proxy when no argument is passed. If the expected capability is not available, the ACL silently fails at runtime. Now we make all those names mandatory in the parser and we rely on acl_find_targets() to replace the missing names with the holding proxy, then to perform the appropriate tests, and to reject errors at parsing time. It is possible that some faulty configurations will get rejected from now on, while they used to silently fail till now. This is the reason why this change is marked as MAJOR.	2012-05-08 20:57:12 +02:00
Willy Tarreau	d28c353fc5	MAJOR: acl: make acl_find_targets also resolve proxy names at config time Proxy names are now resolved when the config is parsed and not at runtime. This means that errors will be caught for real instead of having an ACL silently never match. Another benefit is that the fetch will be much faster since the lookup will not have to be performed anymore, eg for all ACLs based on explicitly named stick-tables. However some buggy configurations which used to silently fail in the past will now refuse to load, hence the MAJOR tag.	2012-05-08 20:57:11 +02:00
Willy Tarreau	61612d49a7	MAJOR: acl: store the ACL argument types in the ACL keyword declaration The types and minimal number of ACL keyword arguments are now stored in their declaration. This will allow many more fantasies if some ACL use several arguments or types. Doing so required to rework all ACL keyword declarations to add two parameters. So this was a good opportunity for a general cleanup and to sort all entries in alphabetical order. We still have two pending issues : - parse_acl_expr() checks for errors but has no way to report them to the user ; - the types of some arguments are still not resolved and kept as strings (eg: ARGT_FE/BE/TAB) for compatibility reasons, which must be resolved in acl_find_targets()	2012-05-08 20:57:11 +02:00
Willy Tarreau	34db108423	MAJOR: acl: make use of the new argument parsing framework The ACL parser now uses the argument parser to build a typed argument list. Right now arguments are all strings and only one argument is supported since this is what ACLs currently support.	2012-05-08 20:57:11 +02:00
Willy Tarreau	45c0d98769	MEDIUM: http: http_send_name_header: remove references to msg and buffer They can be deduced from txn.	2012-05-08 12:28:12 +02:00
Willy Tarreau	62f791ea6f	MEDIUM: http: add a pointer to the buffer in http_msg ACLs and patterns only rely on a struct http_msg and don't know the pointer to the actual data. struct http_msg will soon only hold relative references so that's not possible. We need http_msg to hold a reference to the struct buffer before having relative pointers everywhere. It is likely that doing so will also result in opportunities to simplify a number of functions arguments. The following functions are already candidate : http_buffer_heavy_realign http_capture_bad_message http_change_connection_header http_forward_trailers http_header_add_tail http_header_add_tail2 http_msg_analyzer http_parse_chunk_size http_parse_connection_header http_remove_header2 http_send_name_header http_skip_chunk_crlf http_upgrade_v09_to_v10	2012-05-08 12:28:12 +02:00
Willy Tarreau	02d6cfc1d7	MAJOR: buffer: replace buf->l with buf->{o+i} We don't have buf->l anymore. We have buf->i for pending data and the total length is retrieved by adding buf->o. Some computation already become simpler. Despite extreme care, bugs are not excluded. It's worth noting that msg->err_pos as set by HTTP request/response analysers becomes relative to pending data and not to the beginning of the buffer. This has not been completed yet so differences might occur when outgoing data are left in the buffer.	2012-05-08 12:28:10 +02:00
Willy Tarreau	a36fc4d7ed	MEDIUM: move message-related flags from transaction to message Too many flags are stored in the transaction structure. Some flags are clearly message-specific and exist in two versions (request and response). Move them to a new "flags" field in the http_message struct instead.	2012-04-30 11:57:00 +02:00
Willy Tarreau	21337825c0	CLEANUP: remove a few warning about unchecked return values in debug code There were a few unchecked write() calls in the debug code that cause gcc 4.x to emit warnings on recent libc. We don't want to check them as we can't make anything from the result, let's simply surround them with an empty if statement. Note that one of the warnings was for chdir("/") which normally cannot fail since it follows a successful chroot (which means the perms are necessarily there). Anyway let's move the call uppe to protect it too.	2012-04-30 11:56:30 +02:00
Willy Tarreau	9b061e3320	MEDIUM: stream_sock: add a get_src and get_dst callback and remove SN_FRT_ADDR_SET These callbacks are used to retrieve the source and destination address of a socket. The address flags are not hold on the stream interface and not on the session anymore. The addresses are collected when needed. This still needs to be improved to store the IP and port separately so that it is not needed to perform a getsockname() when only the IP address is desired for outgoing traffic.	2012-04-07 18:03:52 +02:00
Willy Tarreau	4a5cadea40	MEDIUM: session: implement the "use-server" directive Sometimes it is desirable to forward a particular request to a specific server without having to declare a dedicated backend for this server. This can be achieved using the "use-server" rules. These rules are evaluated after the "redirect" rules and before evaluating cookies, and they have precedence on them. There may be as many "use-server" rules as desired. All of these rules are evaluated in their declaration order, and the first one which matches will assign the server.	2012-04-05 21:14:10 +02:00
Stathis Voukelatos	09a030a9a4	BUG/MINOR: fix typo in processing of http-send-name-header I downloaded version 1.4.19 this morning. While merging the code changes to a custom build that we have here for our project I noticed a typo in 'session.c', in the new code for inserting the server name in the HTTP header. The fix that I did is shown in the patch below. [WT: the bug is harmless, it is only suboptimal]	2012-01-09 14:27:13 +01:00
Mark Lamourine	c2247f0b8d	MEDIUM: http: add support for sending the server's name in the outgoing request New option "http-send-name-header" specifies the name of a header which will hold the server name in outgoing requests. This is the name of the server the connection is really sent to, which means that upon redispatches, the header's value is updated so that it always matches the server's name.	2012-01-05 15:17:31 +01:00
Willy Tarreau	a5e375646c	MEDIUM: acl: use temp_pattern to store any integer-type information All ACL fetches which return integer value now store the result into the temporary pattern struct. All ACL matches which rely on integer also get their value there. Note: the pattern data types are not set right now.	2011-12-30 17:33:26 +01:00
Willy Tarreau	34eb671f24	OPTIM/MINOR: move the hdr_idx pools out of the proxy struct It makes no sense to have one pointer to the hdr_idx pool in each proxy struct since these pools do not depend on the proxy. Let's have a common pool instead as it is already the case for other types.	2011-10-24 18:15:04 +02:00
Willy Tarreau	6471afb43d	MINOR: remove the client/server side distinction in SI addresses Stream interfaces used to distinguish between client and server addresses because they were previously of different types (sockaddr_storage for the client, sockaddr_in for the server). This is not the case anymore, and this distinction is confusing at best and has caused a number of regressions to be introduced in the process of converting everything to full-ipv6. We can now remove this and have a much cleaner code.	2011-09-23 10:54:59 +02:00
Willy Tarreau	a2a64e9689	[MEDIUM] session: make session_shutdown() an independant function We already had the ability to kill a connection, but it was only for the checks. Now we can do this for any session, and for this we add a specific flag "K" to the logs.	2011-09-07 23:01:56 +02:00
Willy Tarreau	3c63fd828a	[MEDIUM] don't limit peers nor stats socket to maxconn nor maxconnrate The peers and the stats socket are control sockets, they must not be limited by traffic rules.	2011-09-07 22:47:42 +02:00
Willy Tarreau	f73cd1198f	[MINOR] session-counters: add the ability to clear the counters Sometimes it can be useful to reset a counter : one condition increments it and another one resets it. It can be used to better detect abuses.	2011-08-13 01:45:16 +02:00
Willy Tarreau	b32907b6c7	[MINOR] sessions: only wake waiting listeners up if rate limit is OK Instead of waking a listener up then making it sleep, we only wake them up if we know their rate limit is fine. In the future we could improve on top of that by deciding to wake a proxy-specific task in XX milliseconds to take care of enabling the listeners again.	2011-07-25 08:37:44 +02:00
Willy Tarreau	07687c171e	[MEDIUM] listeners: queue proxy-bound listeners at the proxy's All listeners that are limited by a proxy-specific resource are now queued at the proxy's and not globally. This allows finer-grained wakeups when releasing resource.	2011-07-24 23:55:06 +02:00
Willy Tarreau	08ceb1012b	[MEDIUM] listeners: put listeners in queue upon resource shortage When an accept() fails because of a connection limit or a memory shortage, we now disable it and queue it so that it's dequeued only when a connection is released. This has improved the behaviour of the process near the fd limit as now a listener with a no connection (eg: stats) will not loop forever trying to get its connection accepted. The solution is still not 100% perfect, as we'd like to have this used when proxy limits are reached (use a per-proxy list) and for safety, we'd need to have dedicated tasks to periodically re-enable them (eg: to overcome temporary system-wide resource limitations when no connection is released).	2011-07-24 22:58:00 +02:00
Willy Tarreau	627937158f	[MINOR] listeners: add listen_full() to mark a listener full This is just a cleanup which removes calls to EV_FD_CLR() and state setting everywhere in the code.	2011-07-24 19:25:28 +02:00
Willy Tarreau	2b15492a75	[MINOR] session: try to emit a 500 response on memory allocation errors When we fail to create a session because of memory shortage, let's at least try to send a 500 message directly on the socket. Even if we don't have any buffers left, the kernel's orphans management will take care of delivering the message as long as there are socket buffers left.	2011-07-24 16:12:25 +02:00
Willy Tarreau	9bd0d744ef	[BUG] session: risk of crash on out of memory (1.5-dev regression) Patch af5149 introduced an issue which can be detected only on out of memory conditions : a LIST_DEL() may be performed on an uninitialized struct member instead of a LIST_INIT() during the accept() phase, causing crashes and memory corruption to occur. This issue was detected and diagnosed by the Exceliance R&D team. This is 1.5-specific and very recent, so no existing deployment should be impacted.	2011-07-20 00:22:54 +02:00
Simon Horman	fa46168c8f	[MINOR] Add non-stick server option Never add connections allocated to this sever to a stick-table. This may be used in conjunction with backup to ensure that stick-table persistence is disabled for backup servers.	2011-06-25 21:14:17 +02:00
Simon Horman	af51495397	[MINOR] Add active connection list to server The motivation for this is to allow iteration of all the connections of a server without the expense of iterating over the global list of connections. The first use of this will be to implement an option to close connections associated with a server when is is marked as being down or in maintenance mode.	2011-06-21 22:00:12 +02:00
Simon Horman	dec5be4ed4	[CLEANUP] session.c: Make functions static where possible	2011-06-18 20:27:19 +02:00
Willy Tarreau	96e312139a	[MEDIUM] http: add support for "http-no-delay" There are some very rare server-to-server applications that abuse the HTTP protocol and expect the payload phase to be highly interactive, with many interleaved data chunks in both directions within a single request. This is absolutely not supported by the HTTP specification and will not work across most proxies or servers. When such applications attempt to do this through haproxy, it works but they will experience high delays due to the network optimizations which favor performance by instructing the system to wait for enough data to be available in order to only send full packets. Typical delays are around 200 ms per round trip. Note that this only happens with abnormal uses. Normal uses such as CONNECT requests nor WebSockets are not affected. When "option http-no-delay" is present in either the frontend or the backend used by a connection, all such optimizations will be disabled in order to make the exchanges as fast as possible. Of course this offers no guarantee on the functionality, as it may break at any other place. But if it works via HAProxy, it will work as fast as possible. This option should never be used by default, and should never be used at all unless such a buggy application is discovered. The impact of using this option is an increase of bandwidth usage and CPU usage, which may significantly lower performance in high latency environments. This change should be backported to 1.4 since the first report of such a misuse was in 1.4. Next patch will also be needed.	2011-05-30 18:42:41 +02:00
David du Colombier	4f92d32004	[MEDIUM] IPv6 support for stick-tables Since IPv6 is a different type than IPv4, the pattern fetch functions src6 and dst6 were added. IPv6 stick-tables can also fetch IPv4 addresses with src and dst. In this case, the IPv4 addresses are mapped to their IPv6 counterpart, according to RFC 4291.	2011-03-29 01:09:14 +02:00
Willy Tarreau	c735a0728e	[MINOR] acl: add support for table_cnt and table_avl matches Those trivial matches respectively return the number of entries used in a stick-table and the number of entries still available in a table.	2011-03-29 00:57:02 +02:00
Willy Tarreau	0b3a411543	[BUG] session: conn_retries was not always initialized Johannes Smith reported some wrong retries count in logs associated with bad requests. The cause was that the conn_retries field in the stream interface was only initialized when attempting to connect, but is used when logging, possibly with an uninitialized value holding last connection's conn_retries. This could have been avoided by making use of a stream interface initializer. This bug is 1.5-specific.	2011-03-27 19:16:56 +02:00
Willy Tarreau	1b6e608c11	[BUG] session: src_conn_cur was returning src_conn_cnt instead Issue reported by Cory Forsyth and diagnosed by Cyril Bont�. Just a plain stupid copy-paste of the wrong fetch function call.	2011-03-16 06:56:57 +01:00
Willy Tarreau	7d0aaf39d1	[MEDIUM] stats: split frontend and backend stats It's very annoying that frontend and backend stats are merged because we don't know what we're observing. For instance, if a "listen" instance makes use of a distinct backend, it's impossible to know what the bytes_out means. Some points take care of not updating counters twice if the backend points to the frontend, indicating a "listen" instance. The thing becomes more complex when we try to add support for server side keep-alive, because we have to maintain a pointer to the backend used for last request, and to update its stats. But we can't perform such comparisons anymore because the counters will not match anymore. So in order to get rid of this situation, let's have both frontend AND backend stats in the "struct proxy". We simply update the relevant ones during activity. Some of them are only accounted for in the backend, while others are just for frontend. Maybe we can improve a bit on that later, but the essential part is that those counters now reflect what they really mean.	2011-03-13 22:00:23 +01:00
Willy Tarreau	827aee913f	[MAJOR] session: remove the ->srv pointer from struct session This one has been removed and is now totally superseded by ->target. To get the server, one must use target_srv(&s->target) instead of s->srv now. The function ensures that non-server targets still return NULL.	2011-03-10 23:32:17 +01:00
Willy Tarreau	9e000c6ec8	[CLEANUP] stream_interface: use inline functions to manipulate targets The connection target involves a type and a union of pointers, let's make the code cleaner using simple wrappers.	2011-03-10 23:32:17 +01:00
Willy Tarreau	3d80d911aa	[MEDIUM] session: remove s->prev_srv which is not needed anymore s->prev_srv is used by assign_server() only, but all code paths leading to it now take s->prev_srv from the existing s->srv. So assign_server() can do that copy into its own stack. If at one point a different srv is needed, we still have a copy of the last server on which we failed a connection attempt in s->target.	2011-03-10 23:32:16 +01:00
Willy Tarreau	664beb8610	[MINOR] session: add a pointer to the new target into the session When dealing with HTTP keep-alive, we'll have to know if we can reuse an existing connection. For that, we'll have to check if the current connection was made on the exact same target (referenced in the stream interface). Thus, we need to first assign the next target to the session, then copy it to the stream interface upon connect(). Later we'll check for equivalence between those two operations.	2011-03-10 23:32:16 +01:00
Willy Tarreau	7c0a151a2e	[CLEANUP] stream_interface: remove the applet.handler pointer Now that we have the target pointer and type in the stream interface, we don't need the applet.handler pointer anymore. That makes the code somewhat cleaner because we know we're dealing with an applet by checking its type instead of checking the pointer is not null.	2011-03-10 23:32:15 +01:00
Willy Tarreau	ac82540c35	[MEDIUM] stream_interface: store the target pointer and type When doing a connect() on a stream interface, some information is needed from the server and from the backend. In some situations, we don't have a server and only a backend (eg: peers). In other cases, we know we have an applet and we don't want to connect to anything, but we'd still like to have the info about the applet being used. For this, we now store a pointer to the "target" into the stream interface. The target describes what's on the other side before trying to connect. It can be a server, a proxy or an applet for now. Later we'll probably have descriptors for multiple-stage chains so that the final information may still be found. This will help removing many specific cases in the code. It already made it possible to remove the "srv" and "be" parameters to tcpv4_connect_server().	2011-03-10 23:32:15 +01:00
Willy Tarreau	957c0a5845	[REORG] session: move client and server address to the stream interface This will be needed very soon for the keep-alive.	2011-03-10 23:32:14 +01:00
Willy Tarreau	b24281b0ff	[MINOR] stream_interface: make use of an applet descriptor for IO handlers I/O handlers are still delicate to manipulate. They have no type, they're just raw functions which have no knowledge of themselves. Let's have them declared as applets once for all. That way we can have multiple applets share the same handler functions and we can store their names there. When we later need to add more parameters (eg: usage stats), we'll be able to do so in the applets themselves. The CLI functions has been prefixed with "cli" instead of "stats" as it's clearly what is going on there. The applet descriptor in the stream interface should get all the applet specific data (st0, ...) but this will be done in the next patch so that we don't pollute this one too much.	2011-03-10 23:32:14 +01:00
Willy Tarreau	b89cfca494	[BUG] session: release slot before processing pending connections When a connection error is encountered on a server and the server's connection pool is full, pending connections are not woken up because the current connection is still accounted for on the server, so it still appears full. This becomes visible on a server which has "maxconn 1" because the pending connections will only be able to expire in the queue. Now we take care of releasing our current connection before trying to offer it to another pending request, so that the server can accept a next connection. This patch should be backported to 1.4.	2010-12-29 14:38:29 +01:00
Willy Tarreau	0499e3575c	[BUG] http: analyser optimizations broke pipelining HTTP pipelining currently needs to monitor the response buffer to wait for some free space to be able to send a response. It was not possible for the HTTP analyser to be called based on response buffer activity. Now we introduce a new buffer flag BF_WAKE_ONCE which is set when the HTTP request analyser is set on the response buffer and some activity is detected. This is not clean at all but once of the only ways to fix the issue before we make it possible to register events for analysers. Also it appeared that one realign condition did not cover all cases.	2010-12-17 07:15:57 +01:00
Willy Tarreau	2f976e18b8	[OPTIM] session: don't recheck analysers when buffer flags have not changed Analysers were re-evaluated when some flags were still present in the buffers, even if they had not changed since previous pass, resulting in a waste of CPU cycles. Ensuring that the flags have changed has saved some useless calls : function min calls per session (before -> after) http_request_forward_body 5 -> 4 http_response_forward_body 3 -> 2 http_sync_req_state 10 -> 8 http_sync_res_state 8 -> 6 http_resync_states 8 -> 6	2010-11-11 14:28:47 +01:00
Willy Tarreau	abe8ea5c1d	[BUG] accept: don't close twice upon error The stream_sock's accept() used to close the FD upon error, but this was also sometimes performed by the frontend's accept() called via the session's accept(). Those interlaced calls were also responsible for the spaghetti-looking error unrolling code in session.c and stream_sock.c. Now the frontend must not close the FD anymore, the session is responsible for that. It also takes care of just closing the FD or also removing from the FD lists, depending on its state. The socket-level accept() does not have to care about that anymore.	2010-11-11 11:05:20 +01:00
Willy Tarreau	fffe1325df	[CLEANUP] accept: replace some inappropriate Alert() calls with send_log() Some Alert() messages were remaining in the accept() path, which they would have no chance to be detected. Remove some of them (the impossible ones) and replace the relevant ones with send_log() so that the admin has a chance to catch them.	2010-11-11 09:51:38 +01:00
Emeric Brun	85e77c7f0d	[MEDIUM] Create updates tree on stick table to manage sync.	2010-11-11 09:29:08 +01:00
Emeric Brun	485479d8e9	[MEDIUM] Create new protected pattern types CONSTSTRING and CONSTDATA to force memcpy if data from protected areas need to be manipulated. Enhance pattern convs and fetch argument parsing, now fetchs and convs callbacks used typed args. Add more details on error messages on parsing pattern expression function. Update existing pattern convs and fetchs to new proto. Create stick table key type "binary". Manage Truncation and padding if pattern's fetch-converted result don't match table key size.	2010-11-11 09:29:07 +01:00
Emeric Brun	97679e7901	[MEDIUM] Implement tcp inspect response rules	2010-11-11 09:28:18 +01:00
Willy Tarreau	da4d9fe5a4	[BUG] session: don't stop forwarding of data upon last packet If a read shutdown is encountered on the first packet of a connection right after the data and the last analyser is unplugged at the same time, then that last data chunk may never be forwarded. In practice, right now it cannot happen on requests due to the way they're scheduled, nor can it happen on responses due to the way their analysers work. But this behaviour has been observed with new response analysers being developped. The reason is that when the read shutdown is encountered and an analyser is present, data cannot be forwarded but the BF_SHUTW_NOW flag is set. After that, the analyser gets called and unplugs itself, hoping that process_session() will automatically forward the data. This does not happen due to BF_SHUTW_NOW. Simply removing the test on this flag is not enough because then aborted requests still get forwarded, due to the forwarding code undoing the abort. The solution here consists in checking BF_SHUTR_NOW instead of BF_SHUTW_NOW. BF_SHUTR_NOW is only set on aborts and remains set until ->shutr() is called. This is enough to catch recent aborts but not prevent forwarding in other cases. Maybe a new special buffer flag "BF_ABORT" might be desirable in the future. This patch does not need to be backported because older versions don't have the analyser which make the problem appear.	2010-11-11 09:26:29 +01:00
Willy Tarreau	3041b9fcc3	[MEDIUM] session: call the frontend_decode_proxy analyser on proxied connections This analyser must absolutely be the earliest one to process contents, given the nature of the protocol.	2010-10-30 19:04:38 +02:00
Willy Tarreau	af7ad00a99	[MINOR] support a global jobs counter This counter is incremented for each incoming connection and each active listener, and is used to prevent haproxy from stopping upon SIGUSR1. It will thus be possible for some tasks in increment this counter in order to prevent haproxy from dying until they have completed their job.	2010-08-31 15:39:26 +02:00
Willy Tarreau	56123282ef	[MINOR] session-counters: use "track-sc{1,2}" instead of "track-{fe,be}-counters" The assumption that there was a 1:1 relation between tracked counters and the frontend/backend role was wrong. It is perfectly possible to track the track-fe-counters from the backend and the track-be-counters from the frontend. Thus, in order to reduce confusion, let's remove this useless {fe,be} reference and simply use {1,2} instead. The keywords have also been renamed in order to limit confusion. The ACL rule action now becomes "track-sc{1,2}". The ACLs are now "sc{1,2}_" instead of "trk{fe,be}_". That means that we can reasonably document "sc1" and "sc2" (sticky counters 1 and 2) as sort of patterns that are available during the whole session's life and use them just like any other pattern.	2010-08-10 18:04:15 +02:00
Willy Tarreau	9e9879a263	[MEDIUM] session-counters: make it possible to count connections from frontend In case a "track-be-counters" rule is referenced in the frontend, count it so that the connection counts are correct.	2010-08-10 18:04:15 +02:00
Willy Tarreau	f059a0f63a	[MAJOR] session-counters: split FE and BE track counters Having a single tracking pointer for both frontend and backend counters does not work. Instead let's have one for each. The keyword has changed to "track-be-counters" and "track-fe-counters", and the ACL "trk_" changed to "trkfe_" and "trkbe_*".	2010-08-10 18:04:15 +02:00
Willy Tarreau	da7ff64aa9	[MEDIUM] session-counters: add HTTP req/err tracking This patch adds support for the following session counters : - http_req_cnt : HTTP request count - http_req_rate: HTTP request rate - http_err_cnt : HTTP request error count - http_err_rate: HTTP request error rate The equivalent ACLs have been added to check the tracked counters for the current session or the counters of the current source.	2010-08-10 18:04:14 +02:00
Willy Tarreau	c3bd972cda	[MINOR] session-counters: add a general purpose counter (gpc0) This counter may be used to track anything. Two sets of ACLs are available to manage it, one gets its value, and the other one increments its value and returns it. In the second case, the entry is created if it did not exist. Thus it is possible for example to mark a source as being an abuser and to keep it marked as long as it does not wait for the entry to expire : # The rules below use gpc0 to track abusers, and reject them if # a source has been marked as such. The track-counters statement # automatically refreshes the entry which will not expire until a # 1-minute silence is respected from the source. The second rule # evaluates the second part if the first one is true, so GPC0 will # be increased once the conn_rate is above 100/5s. stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request track-counters src tcp-request reject if { trk_get_gpc0 gt 0 } tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} Alternatively, it is possible to let the entry expire even in presence of traffic by swapping the check for gpc0 and the track-counters statement : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} It is also possible not to track counters at all, but entry lookups will then be performed more often : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request reject if { src_conn_rate gt 100 } { src_inc_gpc0 gt 0} The '0' at the end of the counter name is there because if we find that more counters may be useful, other ones will be added.	2010-08-10 18:04:14 +02:00
Willy Tarreau	1f7e925d6a	[MINOR] stktable: add a stktable_update_key() function This function looks up a key, updates its expiration date, or creates it if it was not found. acl_fetch_src_updt_conn_cnt() was updated to make use of it.	2010-08-10 18:04:14 +02:00
Willy Tarreau	6c59e0a942	[MEDIUM] session counters: add bytes_in_rate and bytes_out_rate counters These counters maintain incoming and outgoing byte rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain bandwidth limits per source address for instance, and block if the rate is passed over. Since 32-bit counters are used to compute the rates, it is important not to use too long periods so that we don't have to deal with rates above 4 GB per period. Example : # block if more than 5 Megs retrieved in 30 seconds from a source. stick-table type ip size 200k expire 1m store bytes_out_rate(30s) tcp-request track-counters src tcp-request reject if { trk_bytes_out_rate gt 5000000 } # cause a 15 seconds pause to requests from sources in excess of 2 megs/30s tcp-request inspect-delay 15s tcp-request content accept if { trk_bytes_out_rate gt 2000000 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	91c43d7fe4	[MEDIUM] session counters: add conn_rate and sess_rate counters These counters maintain incoming connection rates and session rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain accept rate per source address for instance, and block if the rate is passed over. Example : # block if more than 50 requests per 5 seconds from a source. stick-table type ip size 200k expire 1m store conn_rate(5s),sess_rate(5s) tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 50 } # cause a 3 seconds pause to requests from sources in excess of 20 requests/5s tcp-request inspect-delay 3s tcp-request content accept if { trk_sess_rate gt 20 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	f4d17d9071	[MEDIUM] session: add a counter on the cumulated number of sessions Sessions are like connections but they have been accepted by L4 rules and really became sessions.	2010-08-10 18:04:13 +02:00
Willy Tarreau	1aa006fe7a	[MINOR] session: add trk_kbytes_* ACL keywords to track data size These one apply to the entry being tracked by current session.	2010-08-10 18:04:13 +02:00
Willy Tarreau	9b0ddcfd84	[MINOR] session: add the trk_conn_cur ACL keyword to track concurrent connection This one applies to the entry being tracked by current session.	2010-08-10 18:04:13 +02:00
Willy Tarreau	9a3f849371	[MINOR] session: add the trk_conn_cnt ACL keyword to track connection counts Most of the time we'll want to check the connection count of the criterion we're currently tracking. So instead of duplicating the src* tests, let's add trk_conn_cnt to report the total number of connections from the stick table entry currently being tracked. A nice part of the code was factored, and we should do the same for the other criteria.	2010-08-10 18:04:12 +02:00
Willy Tarreau	855e4bbcc7	[MEDIUM] session: add data in and out volume counters The new "bytes_in_cnt" and "bytes_out_cnt" session counters have been added. They're automatically updated when session counters are updated. They can be matched with the "src_kbytes_in" and "src_kbytes_out" ACLs which apply to the volume per source address. This can be used to deny access to service abusers.	2010-08-10 18:04:12 +02:00
Willy Tarreau	38285c18f4	[MEDIUM] session: add concurrent connections counter The new "conn_cur" session counter has been added. It is automatically updated upon "track XXX" directives, and the entry is touched at the moment we increment the value so that we don't consider further counter updates as real updates, otherwise we would end up updating upon completion, which may not be desired. Probably that some other event counters (eg: HTTP requests) will have to be updated upon each event though. This counter can be matched against current session's source address using the "src_conn_cur" ACL.	2010-08-10 18:04:12 +02:00
Willy Tarreau	8b22a71a4d	[MEDIUM] session: move counter ACL fetches from proto_tcp It was not normal to have counter fetches in proto_tcp.c. The only reason was that the key based on the source address was fetched there, but now we have split the key extraction and data processing, we must move that to a more appropriate place. Session seems OK since the counters are all manipulated from here. Also, since we're precisely counting number of connections with these ACLs, we rename them src_conn_cnt and src_updt_conn_cnt. This is not a problem right now since no version was emitted with these keywords.	2010-08-10 18:04:12 +02:00
Willy Tarreau	9ba2dcc86c	[MAJOR] session: add track-counters to track counters related to the session This patch adds the ability to set a pointer in the session to an entry in a stick table which holds various counters related to a specific pattern. Right now the syntax matches the target syntax and only the "src" pattern can be specified, to track counters related to the session's IPv4 source address. There is a special function to extract it and convert it to a key. But the goal is to be able to later support as many patterns as for the stick rules, and get rid of the specific function. The "track-counters" directive may only be set in a "tcp-request" statement right now. Only the first one applies. Probably that later we'll support multi-criteria tracking for a single session and that we'll have to name tracking pointers. No counter is updated right now, only the refcount is. Some subsequent patches will have to bring that feature.	2010-08-10 18:04:12 +02:00
Willy Tarreau	fb35620e87	[MEDIUM] session: support "tcp-request content" rules in backends Sometimes it's necessary to be able to perform some "layer 6" analysis in the backend. TCP request rules were not available till now, although documented in the diagram. Enable them in backend now.	2010-08-10 14:10:58 +02:00
Willy Tarreau	815a9b2039	[BUG] session: analysers must be checked when SI state changes Since the BF_READ_ATTACHED bug was fixed, a new issue surfaced. When a connection closes on the return path in tunnel mode while the request input is already closed, the request analyser which is waiting for a state change never gets woken up so it never closes the request output. This causes stuck sessions to remain indefinitely. One way to reliably reproduce the issue is the following (note that the client expects a keep-alive but not the server) : server: printf "HTTP/1.0 303\r\n\r\n" \| nc -lp8080 client: printf "GET / HTTP/1.1\r\n\r\n" \| nc 127.1 2500 The reason for the issue is that we don't wake the analysers up on stream interface state changes. So the least intrusive and most reliable thing to do is to consider stream interface state changes to call the analysers. We just need to remember what state each series of analysers have seen and check for the differences. In practice, that works. A later improvement later could consist in being able to let analysers state what they're interested to monitor : - left SI's state - right SI's state - request buffer flags - response buffer flags That could help having only one set of analysers and call them once status changes.	2010-08-10 14:04:28 +02:00
Willy Tarreau	7a20aa6e6b	[MEDIUM] session: make it possible to call an I/O handler on both SI This will be used when an I/O handler running in a stream interface needs to establish a connection somewhere. We want the session processor to evaluate both I/O handlers, depending on which side has one. Doing so also requires that stream_int_update_embedded() wakes the session up only when the other side is established or has closed, for instance in order to handle connection errors without looping indefinitely during the connection setup time. The session processor still relies on BF_READ_ATTACHED being set, though we must do whatever is required to remove this dependency.	2010-07-13 16:34:26 +02:00
Willy Tarreau	0bd05eaf24	[MEDIUM] stream-interface: add a ->release callback When a connection is closed on a stream interface, some iohandlers will need to be informed in order to release some resources. This normally happens upon a shutr+shutw. It is the equivalent of the fd_delete() call which is done for real sockets, except that this time we release internal resources. It can also be used with real sockets because it does not cost anything else and might one day be useful.	2010-07-13 16:06:23 +02:00
Willy Tarreau	e8f6338c5d	[BUG] stick-table: correctly refresh expiration timers The store operation did not correctly refresh the expiration timer on the stick entry. It did so on the temporary one instead.	2010-07-13 15:20:24 +02:00
Willy Tarreau	2a164ee549	[BUG] stick_table: the fix for the memory leak caused a regression (cherry picked from commit 61ba936e6858dfcf9964d25870726621d8188fb9) [ note: the bug was finally not present in 1.5-dev but at least we have to reset store_count to be compatible with 1.4 ] Commit d6e9e3b5e320b957e6c491bd92d91afad30ba638 caused recently created entries to be removed as soon as they were created, breaking stickiness. It is not clear whether a use-after-free was possible or not in this case. This bug was reported by Ben Congleton and narrowed down by Herv� Commowick, both of whom also tested the fix. Thanks to them !	2010-06-18 09:57:45 +02:00
Willy Tarreau	5214be1b22	[MINOR] session: add a pointer to the tracked counters for the source We'll have to keep counters of various criteria specific to the session's source. When we get one, keep a pointer to it in the session.	2010-06-14 15:32:18 +02:00
Willy Tarreau	cb18364ca7	[MEDIUM] stick_table: separate storage and update of session entries When an entry already exists, we just need to update its expiration timer. Let's have a dedicated function for that instead of spreading open code everywhere. This change also ensures that an update of an existing sticky session really leads to an update of its expiration timer, which was apparently not the case till now. This point needs to be checked in 1.4.	2010-06-14 15:10:26 +02:00
Willy Tarreau	13c29dee21	[MEDIUM] stick_table: move the server ID to a generic data type The server ID is now stored just as any other data type. It is only allocated if needed and is manipulated just like the other ones.	2010-06-14 15:10:25 +02:00
Willy Tarreau	f16d2b8c1b	[MEDIUM] stick_table: don't overwrite data when storing an entry Till now sticky sessions only held server IDs. Now there are other data types so it is not acceptable anymore to overwrite the server ID when writing something. The server ID must then only be written from the caller when appropriate. Doing this has also led to separate lookup and storage.	2010-06-14 15:10:24 +02:00
Willy Tarreau	f0b38bfc33	[CLEANUP] stick_table: move pattern to key functions to stick_table.c pattern.c depended on stick_table while in fact it should be the opposite. So we move from pattern.c everything related to stick_tables and invert the dependency. That way the code becomes more logical and intuitive.	2010-06-14 15:10:24 +02:00
Willy Tarreau	393379c3e0	[MINOR] stick_table: add support for variable-sized data Right now we're only able to store a server ID in a sticky session. The goal is to be able to store anything whose size is known at startup time. For this, we store the extra data before the stksess pointer, using a negative offset. It will then be easy to cumulate multiple data provided they each have their own offset.	2010-06-14 15:10:23 +02:00
Willy Tarreau	24dcaf3450	[MEDIUM] frontend: count the incoming connection earlier The frontend's connection was accounted for once the session was instanciated. This was problematic because the early ACLs weren't able to correctly account for the number of concurrent connections. Now we count the connection once it is assigned to the frontend. It also brings the nice advantage of being more symmetrical, because the stream_sock's accept() does not have to account for that anymore, only the session's accept() does.	2010-06-14 10:53:19 +02:00
Willy Tarreau	b36b4244a2	[MINOR] session: differenciate between accepted connections and received connections Now we're able to reject connections very early, so we need to use a different counter for the connections that are received and the ones that are accepted and converted into sessions, so that the rate limits can still apply to the accepted ones. The session rate must still be used to compute the rate limit, so that we can reject undesired traffic without affecting the rate.	2010-06-14 10:53:19 +02:00
Willy Tarreau	81f9aa3bf2	[MAJOR] frontend: split accept() into frontend_accept() and session_accept() A new function session_accept() is now called from the lower layer to instanciate a new session. Once the session is instanciated, the upper layer's frontent_accept() is called. This one can be service-dependant. That way, we have a 3-phase accept() sequence : 1) protocol-specific, session-less accept(), which is pointed to by the listener. It defaults to the generic stream_sock_accept(). 2) session_accept() which relies on a frontend but not necessarily for use in a proxy (eg: stats or any future service). 3) frontend_accept() which performs the accept for the service offerred by the frontend. It defaults to frontend_accept() which is really what is used by a proxy. The TCP/HTTP proxies have been moved to this mode so that we can now rely on frontend_accept() for any type of session initialization relying on a frontend. The next step will be to convert the stats to use the same system for the stats.	2010-06-14 10:53:17 +02:00
Willy Tarreau	070ceb6cfb	[MEDIUM] session: don't assign conn_retries upon accept() anymore The conn_retries attribute is now assigned when switching from SI_ST_INI to SI_ST_REQ. This eliminates one of the last dependencies on the backend in the frontend's accept() function.	2010-06-14 10:53:16 +02:00
Willy Tarreau	ee28de0a12	[MEDIUM] session: move the conn_retries attribute to the stream interface The conn_retries still lies in the session and its initialization depends on the backend when it may not yet be known. Let's first move it to the stream interface.	2010-06-14 10:53:16 +02:00
Willy Tarreau	d04e858db0	[MEDIUM] session: initialize server-side timeouts after connect() It was particularly embarrassing that the server timeout was assigned to buffers during an accept() just to be potentially changed later in case of a use_backend rule. The frontend side has nothing to do with server timeouts. Now we initialize them right after the connect() succeeds. Later this should change for a unique stream-interface timeout setting only.	2010-06-14 10:53:14 +02:00
Willy Tarreau	85e7d00a70	[MEDIUM] session: finish session establishment sequence in with I/O handlers Calling sess_establish() upon a successful connect() was essential, but it was not clearly stated whether it was necessary for an access to an I/O handler or not. While it would be desired, having it automatically add the response analyzers is quite a problem, and it breaks HTTP stats. The solution is thus not to call it for now and to perform the few response initializations as needed. For the long term, we need to find a way to specify the analyzers to install during a stream_int_register_handler() if any.	2010-06-14 10:53:14 +02:00
Willy Tarreau	a4cda67323	[BUG] stick_table: fix possible memory leak in case of connection error If a "stick store-request" rule is present, an entry is preallocated during the request. However, if there is no response due to an error or to a redir mode server, we never release it.	2010-06-14 10:49:24 +02:00
Willy Tarreau	a6eebb372d	[BUG] session: clear BF_READ_ATTACHED before next I/O The BF_READ_ATTACHED flag was created to wake analysers once after a connection was established. It turns out that this flag is never cleared once set, so even if there is no event, some analysers are still evaluated for no reason. The bug was introduced with commit `ea38854d34`. It may cause slightly increased CPU usages during data transfers, maybe even quite noticeable once when transferring transfer-encoded data, due to the fact that the request analysers are being checked for every chunk. This fix must be backported in 1.4 after all non-reg tests have been completed.	2010-06-04 14:49:52 +02:00
Cyril Bont�	47fdd8e993	[MINOR] add the "ignore-persist" option to conditionally ignore persistence This is used to disable persistence depending on some conditions (for example using an ACL matching static files or a specific User-Agent). You can see it as a complement to "force-persist". In the configuration file, the force-persist/ignore-persist declaration order define the rules priority. Used with the "appsesion" keyword, it can also help reducing memory usage, as the session won't be hashed the persistence is ignored.	2010-04-25 22:37:14 +02:00
Willy Tarreau	e45997661b	[MEDIUM] session: better fix for connection to servers with closed input The following patch fixed an issue but brought another one : 296897 [MEDIUM] connect to servers even when the input has already been closed The new issue is that when a connection is inspected and aborted using TCP inspect rules, now it is sent to the server before being closed. So that test is not satisfying. A probably better way is not to prevent a connection from establishing if only BF_SHUTW_NOW is set but BF_SHUTW is not. That way, the BF_SHUTW flag is not set if the request has any data pending, which still fixes the stats issue, but does not let any empty connection pass through. Also, as a safety measure, we extend buffer_abort() to automatically disable the BF_AUTO_CONNECT flag. While it appears to always be OK, it is by pure luck, so better safe than sorry.	2010-03-21 23:31:42 +01:00
Willy Tarreau	296897f2c6	[MEDIUM] connect to servers even when the input has already been closed The BF_AUTO_CLOSE flag prevented a connection from establishing on a server if the other side's input channel was already closed. This is wrong because there may be pending data to be sent. This was causing an issue with stats, as noticed and reported by Cyril Bont�. Since the stats are now handled as a server, sometimes concurrent accesses were causing one of the connections to send the shutdown(write) before the connection to the stats function was established, which aborted it early. This fix causes the BF_AUTO_CLOSE flag to be checked only when the connection on the outgoing stream interface has reached an established state. That way we're still able to connect, send the request then close.	2010-03-14 19:21:34 +01:00
Willy Tarreau	15e5554467	[CLEANUP] session: remove duplicate test This duplicate test should have been removed with the loop rework but was forgotten. It was harmless, but disassembly shows that it prevents gcc from correctly optimizing the loop.	2010-03-05 10:12:01 +01:00
Willy Tarreau	ae52678444	[STATS] count transfer aborts caused by client and by server Often we need to understand why some transfers were aborted or what constitutes server response errors. With those two counters, it is now possible to detect an unexpected transfer abort during a data phase (eg: too short HTTP response), and to know what part of the server response errors may in fact be assigned to aborted transfers.	2010-03-04 20:34:23 +01:00
Willy Tarreau	033b2dbeb3	[BUG] logs: don't report "last data" when we have just closed after an error Some people have reported seeing "SL" flags in their logs quite often while this should never happen. The reason was that then a server error is detected, we close the connection to that server and when we decide what state we were in, we see the connection is closed, and deduce it was the last data transfer, which is wrong. We should report DATA if the previous state was an established state, which this patch does. Now logs correctly report "SD" and not "SL" when a server resets a connection before the end of the transfer.	2010-03-04 18:45:47 +01:00
Willy Tarreau	2465779459	[STATS] separate frontend and backend HTTP stats It is wrong to merge FE and BE stats for a proxy because when we consult a BE's stats, it reflects the FE's stats eventhough the BE has received no traffic. The most common example happens with listen instances, where the backend gets credited for all the trafic even when a use_backend rule makes use of another backend.	2010-02-26 10:30:28 +01:00
Willy Tarreau	2e2b3eb65a	[BUILD] fix build breakage with DEBUG_FULL Paul Hirose reported a build error when DEBUG_FULL is set.	2010-02-09 20:55:44 +01:00
Krzysztof Piotr Oledzki	f9423ae43a	[MINOR] acl: add http_auth and http_auth_group Add two acls to match http auth data: acl <name> http_auth(userlist) acl <name> http_auth_hroup(userlist) group1 group2 (...)	2010-01-31 19:14:09 +01:00
Willy Tarreau	4de9149f87	[MINOR] add the "force-persist" statement to force persistence on down servers This is used to force access to down servers for some requests. This is useful when validating that a change on a server correctly works before enabling the server again.	2010-01-22 19:10:05 +01:00
Emeric Brun	1d33b2965e	[MEDIUM] Add stick and store rules analysers.	2010-01-12 16:01:24 +01:00
Willy Tarreau	762a23618e	[BUG] appsession's sessid must be reset at end of transaction If we don't do that, we may corrupt the pools in keep-alive sessions.	2010-01-09 13:57:26 +01:00
Willy Tarreau	e34070e1be	[MEDIUM] session: limit the number of analyser loops The initial code's intention was to loop on the analysers as long as an analyser is added by another one. [This code was wrong due to the while(0) which breaks even on a continue statement, but the initial intention must be changed too]. In fact we should limit the number of times we loop on analysers in order to limit latency. Using maxpollevents as a limit makes sense since this tunable is used for the exact same purposes. We may add another tunable later if that ever makes sense, so it's very unlikely.	2010-01-08 00:36:57 +01:00
Willy Tarreau	4602363f6a	[BUG] http: fix for capture memory leak was incorrect That patch was incorrect because under some circumstances, the capture memory could be freed by session_free() and then again by http_end_txn(), causing a double free and an eventual segfault. The pool use count was also reported wrong due to this bug. The cleanup code was removed from session_free() to remain only in http_end_txn().	2010-01-07 22:51:47 +01:00
Willy Tarreau	90deb18916	[MEDIUM] http: make safer use of the DONT_READ and AUTO_CLOSE flags Several HTTP analysers used to set those flags to values that were useful but without considering the possibility that they were not called again to clean what they did. First, replace direct flag manipulation with more explicit macros. Second, enforce a rule stating that any buffer which changes one of these flags from the default must restore it after completion, so that other analysers see correct flags. With both this fix and the previous one about analyser bits, we should not see any more stuck sessions.	2010-01-07 00:20:41 +01:00
Willy Tarreau	576507f4c5	[MEDIUM] session: also consider request analysers added during response A request analyser may very well be added while processing a response (eg: end of an HTTP keep-alive response). It's very dangerous to only rely on flags that ought to change in order to loop back, so let's correctly detect a possible new analyser addition instead of guessing.	2010-01-07 00:09:04 +01:00
Willy Tarreau	1e0bbafcbe	[MAJOR] session: fix the order by which the analysers are run With the introduction of keep-alive, we have created situations where an analyser can add other analysers to the current list, which are behind it, which have already been processed once, and which are needed immediately because without them there will be no more I/O activity. This is typically the case for enabling reading of a new request after preparing for a new request. Instead of creating specific cases for some analysers (there was already one such before), we now use a little bit of algorithmics to create an ordered bit chain supporting priorities and fast operations. Another advantage of this new construction is that it's not a real loop anymore, so if an analyser is unknown, it will not loop but just ignore it. Note that it is easy to skip multiple analysers at once now in order to speed up the checking a bit. Some test code has shown a minor gain though. This change has been carefully re-read and has no direct reason of causing a regression. However it has been tagged "major" because the fact that it runs the analysers correctly might trigger an old sleeping bug somewhere in one of the analysers.	2010-01-07 00:01:03 +01:00
Willy Tarreau	1464140fce	[MEDIUM] session: set SI_FL_NOLINGER when aborting on write timeouts Doing this helps us flush the system buffers from all unread data. This avoids having orphans when clients suddenly get off the net without reading their entire response.	2009-12-29 14:49:56 +01:00
Willy Tarreau	82eeaf2fae	[MEDIUM] http: properly handle "option forceclose" The "forceclose" option used to close the output channel to the server once it started to respond. While this happened to work with most servers, some of them considered this as a connection abort and immediately stopped responding. Now that we're aware of the end of a request and response, we're able to trivially handle this option and properly close both sides when the server's response is complete. During this change it appeared that forwarding could be allowed when the BF_SHUTW_NOW flag was set on a buffer, which obviously is not acceptable and was causing some trouble. This has been fixed too and is the reason for the MEDIUM status on this patch.	2009-12-29 14:26:42 +01:00
Willy Tarreau	d98cf93395	[MAJOR] http: implement body parser The body parser will be used in close and keep-alive modes. It follows the stream to keep in sync with both the request and the response message. Both chunked transfer-coding and content-length are supported according to RFC2616. The multipart/byterange encoding has not yet been implemented and if not seconded by any of the two other ones, will be forwarded till the close, as requested by the specification. Both the request and the response analysers converge into an HTTP_MSG_DONE state where it will be possible to force a close (option forceclose) or to restart with a fresh new transaction and maintain keep-alive. This change is important. All tests are OK but any possible behaviour change with "option httpclose" might find its root here.	2009-12-27 22:54:55 +01:00
Willy Tarreau	0937bc43cf	[MINOR] http: move the http transaction init/cleanup code to proto_http This code really belongs to the http part since it's transaction-specific. This will also make it easier to later reinitialize a transaction in order to support keepalive.	2009-12-22 15:03:09 +01:00
Willy Tarreau	7c3c54177a	[MAJOR] buffers: automatically compute the maximum buffer length We used to apply a limit to each buffer's size in order to leave some room to rewrite headers, then we used to remove this limit once the session switched to a data state. Proceeding that way becomes a problem with keepalive because we have to know when to stop reading too much data into the buffer so that we can leave some room again to process next requests. The principle we adopt here consists in only relying on to_forward+send_max. Indeed, both of those data define how many bytes will leave the buffer. So as long as their sum is larger than maxrewrite, we can safely fill the buffers. If they are smaller, then we refrain from filling the buffer. This means that we won't risk to fill buffers when reading last data chunk followed by a POST request and its contents. The only impact identified so far is that we must ensure that the BF_FULL flag is correctly dropped when starting to forward. Right now this is OK because nobody inflates to_forward without using buffer_forward().	2009-12-22 10:06:34 +01:00
Krzysztof Piotr Oledzki	97f07b832f	[MEDIUM] Decrease server health based on http responses / events, version 3 Implement decreasing health based on observing communication between HAProxy and servers. Changes in this version 2: - documentation - close race between a started check and health analysis event - don't force fastinter if it is not set - better names for options - layer4 support Changes in this version 3: - add stats - port to the current 1.4 tree	2009-12-16 00:29:27 +01:00
Krzysztof Piotr Oledzki	de71d16ec0	[MINOR] Collect & provide http response codes for frontends, fix backends This patch extends and corrects the functionality introduced by "Collect & provide http response codes received from servers": - responses are now also accounted for frontends - backend's and frontend's counters are incremented based on responses sent to client, not received from servers	2009-10-27 21:56:47 +01:00
Willy Tarreau	b37c27e28f	[MAJOR] http: create the analyser which waits for a response The code part which waits for an HTTP response has been extracted from the old function. We now have two analysers and the second one may re-enable the first one when an 1xx response is encountered. This has been tested and works. The calls to stream_int_return() that were remaining in the wait analyser have been converted to stream_int_retnclose().	2009-10-18 23:15:41 +02:00
Cyril Bont�	bf47aeb946	[MEDIUM] appsession: add the "request-learn" option This patch has 2 goals : 1. I wanted to test the appsession feature with a small PHP code, using PHPSESSID. The problem is that when PHP gets an unknown session id, it creates a new one with this ID. So, when sending an unknown session to PHP, persistance is broken : haproxy won't see any new cookie in the response and will never attach this session to a specific server. This also happens when you restart haproxy : the internal hash becomes empty and all sessions loose their persistance (load balancing the requests on all backend servers, creating a new session on each one). For a user, it's like the service is unusable. The patch modifies the code to make haproxy also learn the persistance from the client : if no session is sent from the server, then the session id found in the client part (using the URI or the client cookie) is used to associated the server that gave the response. As it's probably not a feature usable in all cases, I added an option to enable it (by default it's disabled). The syntax of appsession becomes : appsession <cookie> len <length> timeout <holdtime> [request-learn] This helps haproxy repair the persistance (with the risk of losing its session at the next request, as the user will probably not be load balanced to the same server the first time). 2. This patch also tries to reduce the memory usage. Here is a little example to explain the current behaviour : - Take a Tomcat server where /session.jsp is valid. - Send a request using a cookie with an unknown value AND a path parameter with another unknown value : curl -b "JSESSIONID=12345678901234567890123456789012" http://<haproxy>/session.jsp;jsessionid=00000000000000000000000000000001 (I know, it's unexpected to have a request like that on a live service) Here, haproxy finds the URI session ID and stores it in its internal hash (with no server associated). But it also finds the cookie session ID and stores it again. - As a result, session.jsp sends a new session ID also stored in the internal hash, with a server associated. => For 1 request, haproxy has stored 3 entries, with only 1 which will be usable The patch modifies the behaviour to store only 1 entry (maximum).	2009-10-18 11:56:26 +02:00
Krzysztof Piotr Oledzki	aeebf9ba65	[MEDIUM] Collect & provide separate statistics for sockets, v2 This patch allows to collect & provide separate statistics for each socket. It can be very useful if you would like to distinguish between traffic generate by local and remote users or between different types of remote clients (peerings, domestic, foreign). Currently no "Session rate" is supported, but adding it should be possible if we found it useful.	2009-10-04 18:56:02 +02:00
Krzysztof Piotr Oledzki	052d4fd07d	[CLEANUP] Move counters to dedicated structures Move counters from "struct proxy" and "struct server" to "struct pxcounters" and "struct svcounters". This patch should make no functional change.	2009-10-04 18:32:39 +02:00
Willy Tarreau	9a42c0d771	[MEDIUM] stats: replace the stats socket analyser with an SI applet We can get rid of the stats analyser by moving all the stats code to a stream interface applet. Above being cleaner, it provides new advantages such as the ability to process requests and responses from the same function and work only with simple state machines. There's no need for any hijack hack anymore. The direct advantage for the user are the interactive mode and the ability to chain several commands delimited by a semi-colon. Now if the user types "prompt", he gets a prompt from which he can send as many requests as he wants. All outputs are terminated by a blank line followed by a new prompt, so this can be used from external tools too. The code is not very clean, it needs some rework, but some part of the dirty parts are due to the remnants of the hijack mode used in the old functions we call. The old AN_REQ_STATS_SOCK analyser flag is now unused and has been removed.	2009-09-23 23:52:17 +02:00
Willy Tarreau	1accfc0d3a	[MEDIUM] session: call iohandler for embedded tasks (applets) Currently, it's up to process_session() to call the internal tasks if any are associated to the task being processed. If such a task is referenced, we don't use ->update() in process_session(), but only ->iohandler(), which itself is free to use ->update() to complete its work. It it also important to understand that an I/O handler may wake the task up again, for instance because it tries to send data to the other stream interface, which itself will wake the task up. So after returning from ->iohandler(), we must check if the task has been sent back to the runqueue, and if so, immediately return.	2009-09-23 23:52:15 +02:00
Willy Tarreau	89f7ef295d	[MINOR] stream_interface: add SI_FL_DONT_WAKE flag We had to add a new stream_interface flag : SI_FL_DONT_WAKE. This flag is used to indicate that a stream interface is being updated and that no wake up should be sent to its owner. This will be required for tasks embedded into stream interfaces. Otherwise, we could have the owner task send wakeups to itself during status updates, thus preventing the state from converging. As long as a stream_interface's status is being monitored and adjusted, there is no reason to wake it up again, as we know its changes will be seen and considered.	2009-09-23 23:52:14 +02:00
Willy Tarreau	31971e536a	[MEDIUM] add support for infinite forwarding In TCP, we don't want to forward chunks of data, we want to forward indefinitely. This patch introduces a special value for the amount of data to be forwarded. When buffer_forward() is called with BUF_INFINITE_FORWARD, it configures the buffer to never stop forwarding until the end.	2009-09-20 12:07:52 +02:00
Willy Tarreau	f41ffdc1e9	[BUG] stream_interface: SI_ST_CLO must have buffers SHUT An abort during a connect would go to the SI_ST_CLO state without the buffers shut. This was causing some sessions to never end if they would abort before the connect request was initiated. This bug has been introduced after 1.4-dev2. The doc has been extended to reflect that too.	2009-09-20 08:34:41 +02:00
Willy Tarreau	ba0b63d2c7	[MAJOR] buffers: fix the BF_EMPTY flag's meaning The BF_EMPTY flag was once used to indicate an empty buffer. However, it was used half the time as meaning the buffer is empty for the reader, and half the time as meaning there is nothing left to send. "nothing to send" is only indicated by "->send_max=0 && !pipe". Once we fix this, we discover that the flag is not used anymore. So the flags has been renamed BF_OUT_EMPTY and means exactly the condition above, ie, there is nothing to send. Doing so has allowed us to remove some unused tests for emptiness, but also to uncover a certain amount of situations where the flag was not correctly set or tested.	2009-09-20 08:17:45 +02:00
Willy Tarreau	520d95e42b	[MAJOR] buffers: split BF_WRITE_ENA into BF_AUTO_CONNECT and BF_AUTO_CLOSE The BF_WRITE_ENA buffer flag became very complex to deal with, because it was used to : - enable automatic connection - enable close forwarding - enable data forwarding The last point was not very true anymore since we introduced ->send_max, but still the test remained everywhere. This was causing issues such as impossibility to connect without forwarding data, impossibility to prevent closing when data was forwarded, etc... This patch clarifies the situation by getting rid of this multi-purpose flag and replacing it with : - data forwarding based only on ->send_max \|\| ->pipe ; - a new BF_AUTO_CONNECT flag to allow automatic connection and only that ; - ability to perform an automatic connection when ->send_max or ->pipe indicate that data is waiting to leave the buffer ; - a new BF_AUTO_CLOSE flag to let the producer automatically set the BF_SHUTW_NOW flag when it gets a BF_SHUTR. During this cleanup, it was discovered that some tests were performed twice, or that the BF_HIJACK flag was still tested, which is not needed anymore since ->send_max replcaed it. These places have been fixed too. These cleanups have also revealed a few areas where the other flags such as BF_EMPTY are not cleanly used. This will be an opportunity for a second patch.	2009-09-19 21:14:54 +02:00
Willy Tarreau	418fd4722a	[MAJOR] buffers: fix misuse of the BF_SHUTW_NOW flag This flag was incorrectly used as meaning "close immediately", while it needs to say "close ASAP". ASAP here means when unsent data pending in the buffer are sent. This helps cleaning up some dirty tricks where the buffer output was checking the BF_SHUTR flag combined with EMPTY and other such things. Now we have a clearly defined semantics : - producer sets SHUTR and may set SHUTW_NOW if WRITE_ENA is set, otherwise leave it to the session processor to set it. - consumer only checks SHUTW_NOW to decide whether or not to call shutw(). This also induced very minor changes at some locations which were not protected against buffer changes while the SHUTW_NOW flag was set. Now we prevent send_max from changing when the flag is set. Several tests have been run without any unexpected behaviour detected. Some more cleanups are needed, as it clearly appears that some tests could be removed with stricter semantics.	2009-09-19 14:53:46 +02:00
Willy Tarreau	c465fd7836	[BUG] tarpit did not work anymore Tarpit was broken by recent splitting of analysers. It would still let the connection go to the server due to a missing buffer_write_dis(). Also, it was performed too late (after content switching rules).	2009-08-31 00:17:18 +02:00
Willy Tarreau	dc85b39db7	[MEDIUM] stream_interface: add and use ->update function to resync We used to call stream_sock_data_finish() directly at the end of a session update, but if we want to support non-socket interfaces, we need to have this function configurable. Now we access it via ->update().	2009-08-18 07:38:19 +02:00
Willy Tarreau	27a674efb8	[MEDIUM] make it possible to change the buffer size in the configuration The new tune.bufsize and tune.maxrewrite global directives allow one to change the buffer size and the maxrewrite size. Right now, setting bufsize too low will block stats sockets which will not be able to write at all. An error checking must be added to buffer_write_chunk() so that if it cannot write its message to an empty buffer, it causes the caller to abort.	2009-08-17 22:56:56 +02:00
Willy Tarreau	a07a34eb24	[MEDIUM] replace BUFSIZE with buf->size in computations The first step towards dynamic buffer size consists in removing all static definitions of the buffer size. Instead, we store a buffer's size in itself. Right now they're all preinitialized to BUFSIZE, but we will change that.	2009-08-16 23:27:46 +02:00
Willy Tarreau	4e5b8287a6	[MEDIUM] set rep->analysers from fe and be analysers sess_establish() used to resort to protocol-specific guesses in order to set rep->analysers. This is no longer needed as it gets set from the frontend and the backend as a copy of what was defined in the configuration.	2009-08-16 22:57:50 +02:00
Willy Tarreau	5ca791da8d	[CLEANUP] move remaining stats sockets code to dumpstats The remains of the stats socket code has nothing to do in proto_uxst anymore and must move to dumpstats. The code is much cleaner and more structured. It was also an opportunity to rename AN_REQ_UNIX_STATS as AN_REQ_STATS_SOCK as the stats socket is no longer unix-specific either. The last item refering to stats in proto_uxst is the setting of the task's nice value which should in fact come from the listener.	2009-08-16 19:35:36 +02:00
Willy Tarreau	104eb36f26	[MEDIUM] make the unix stats sockets use the generic session handler process_session() is now ready to handle unix stats sockets. This first step works and old code has not been removed. A cleanup is required. The stats handler is not unix socket-centric anymore and should move to dumpstats.c.	2009-08-16 19:33:51 +02:00
Willy Tarreau	7320122655	[MINOR] session: switch to established state if no connect function When a stream interface has no connect() function, it means it is immediately connected, so we don't need any connection request. This will be used with unix sockets.	2009-08-16 19:33:29 +02:00
Willy Tarreau	6e6fb2beb9	[MEDIUM] session: account per-listener connections In order to merge the unix session handling code, we have to maintain the number of per-listener connections in the session. This was only performed for unix sockets till now.	2009-08-16 19:32:44 +02:00
Willy Tarreau	b55932ddaf	[MEDIUM] remove old experimental tcpsplice option This Linux-specific option was never really used in production and has since been superseded by new splicing options brought by recent Linux kernels. It caused several particular cases in the code because the kernel would take care of the session without haproxy being able to do anything on it, which became hard to handle in the new architecture. Let's simply get rid of it now that there is a replacement available.	2009-08-16 13:20:32 +02:00
Emeric Brun	647caf1ebc	[MEDIUM] add support for RDP cookie persistence The new statement "persist rdp-cookie" enables RDP cookie persistence. The RDP cookie is then extracted from the RDP protocol, and compared against available servers. If a server matches the RDP cookie, then it gets the connection.	2009-07-14 12:50:40 +02:00
Willy Tarreau	d88bb6f819	[MINOR] ensure we can jump from swiching rules to http without data In case of switching from TCP to HTTP, we want the HTTP request timeout to be properly initialized. For this, we have to jump to the analyser without breaking out of the loop nor waiting for incoming data. The way it is done right now is not particularly clean but it works. A cleaner method might involve pushing function pointers into a circular list.	2009-07-12 09:55:41 +02:00
Willy Tarreau	bedb9bad67	[MINOR] prepare callers of session_set_backend to handle errors session_set_backend will soon have to allocate areas for HTTP headers. We must ensure that the callers can handle an allocation error.	2009-07-12 08:36:24 +02:00
Willy Tarreau	1d0dfb155d	[MAJOR] http: complete splitting of the remaining stages The HTTP processing has been splitted into 7 steps, one of which is not anymore HTTP-specific (content-switching). That way, it becomes possible to use "use_backend" rules in TCP mode. A new "use_server" directive should follow soon.	2009-07-07 15:10:31 +02:00
Willy Tarreau	3a816293e9	[MEDIUM] session: tell analysers what bit they were called for Some stream analysers might become generic enough to be called for several bits. So we cannot have the analyser bit hard coded into the analyser itself. Let's make the caller inform the callee.	2009-07-07 10:55:49 +02:00
Willy Tarreau	d787e6648c	[MEDIUM] http: split request waiter from request processor We want to split several steps in HTTP processing so that we can call individual analysers depending on what processing we want to perform. The first step consists in splitting the part that waits for a request from the rest.	2009-07-07 10:14:51 +02:00
Willy Tarreau	dc340a900d	[MEDIUM] splice: set the capability on each stream_interface The splice code did not consider compatibility between both ends of the connection. Now we set different capabilities on each stream interface, depending on what the protocol can splice to/from. Right now, only TCP is supported. Thanks to this, we're now able to automatically detect when splice() is not implemented and automatically disable it on one end instead of reporting errors to the upper layer.	2009-06-28 23:10:19 +02:00

... 4 5 6 7 8 ...

622 Commits