haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-09 00:27:08 +02:00

Author	SHA1	Message	Date
Willy Tarreau	3a96f3f274	MEDIUM: counters: factor out smp_fetch_sc*_sess_rate smp_fetch_sc0_sess_rate, smp_fetch_sc1_sess_rate, smp_fetch_sc2_sess_rate, smp_fetch_src_sess_rate and smp_fetch_sess_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	20843087f5	MEDIUM: counters: factor out smp_fetch_sc*_sess_cnt smp_fetch_sc0_sess_cnt, smp_fetch_sc1_sess_cnt, smp_fetch_sc2_sess_cnt, smp_fetch_src_sess_cnt and smp_fetch_sess_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	f44a553476	MEDIUM: counters: factor out smp_fetch_sc*_conn_cur smp_fetch_sc0_conn_cur, smp_fetch_sc1_conn_cur, smp_fetch_sc2_conn_cur, smp_fetch_src_conn_cur and smp_fetch_conn_cur were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	c8c65700de	MEDIUM: counters: factor out smp_fetch_sc*_conn_rate smp_fetch_sc0_conn_rate, smp_fetch_sc1_conn_rate, smp_fetch_sc2_conn_rate, smp_fetch_src_conn_rate and smp_fetch_conn_rate were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	3b46c5c47d	MEDIUM: counters: factor out smp_fetch_sc*_conn_cnt smp_fetch_sc0_conn_cnt, smp_fetch_sc1_conn_cnt, smp_fetch_sc2_conn_cnt, smp_fetch_src_conn_cnt and smp_fetch_conn_cnt were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	b9f441d2c0	MEDIUM: counters: factor out smp_fetch_sc*_clr_gpc0 smp_fetch_sc0_clr_gpc0, smp_fetch_sc1_clr_gpc0, smp_fetch_sc2_clr_gpc0, smp_fetch_src_clr_gpc0 and smp_fetch_clr_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	710d38cea5	MEDIUM: counters: factor out smp_fetch_sc*_inc_gpc0 smp_fetch_sc0_inc_gpc0, smp_fetch_sc1_inc_gpc0, smp_fetch_sc2_inc_gpc0, smp_fetch_src_inc_gpc0 and smp_fetch_inc_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	b5e0af0b6b	MEDIUM: counters: factor out smp_fetch_sc*_gpc0_rate smp_fetch_sc0_gpc0, smp_fetch_sc1_gpc0, smp_fetch_sc2_gpc0, smp_fetch_src_gpc0 and smp_fetch_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	30b2046dfe	MEDIUM: counters: factor out smp_fetch_sc*_get_gpc0 smp_fetch_sc0_get_gpc0, smp_fetch_sc1_get_gpc0, smp_fetch_sc2_get_gpc0, smp_fetch_src_get_gpc0 and smp_fetch_get_gpc0 were merged into a single function which relies on the fetch name to decide what to return.	2013-08-01 21:17:13 +02:00
Willy Tarreau	a65536ca4e	MINOR: counters: provide a generic function to retrieve a stkctr for sc* and src. This function aims at simplifying the prefetching of the table and entry when using any of the session counters fetches. The principle is that the src_* variant produces a stkctr that is used instead of the one from the session. That way we can call the same function from all session counter fetch functions and always have a single function to support sc[0-9]_/src_.	2013-08-01 21:17:13 +02:00
Willy Tarreau	88821241d4	MINOR: counters: factor out smp_fetch_sc*_tracked The new function makes use of the sc# in the keyword to get the counter ID.	2013-08-01 21:17:13 +02:00
Willy Tarreau	ef38c39287	MEDIUM: sample: systematically pass the keyword pointer to the keyword We're having a lot of duplicate code just because of minor variants between fetch functions that could be dealt with if the functions had the pointer to the original keyword, so let's pass it as the last argument. An earlier version used to pass a pointer to the sample_fetch element, but this is not the best solution for two reasons : - fetch functions will solely rely on the keyword string - some other smp_fetch_* users do not have the pointer to the original keyword and were forced to pass NULL. So finally we're passing a pointer to the keyword as a const char *, which perfectly fits the original purpose.	2013-08-01 21:17:13 +02:00
Willy Tarreau	0fc36e3ae9	BUG/MAJOR: http: don't emit the send-name-header when no server is available Lukas Benes reported that http-send-name-header causes a segfault if no server is available because we're dereferencing the session's target which is NULL. The tiniest reproducer looks like this : listen foo bind :1234 mode http http-send-name-header srv This obvious fix must be backported to 1.4 which is affected as well.	2013-07-04 11:44:27 +02:00
Willy Tarreau	7af7d5957d	BUG: counters: third counter was not stored if others unset Commit `e25c917a` introduced a third tracking counter bug forgot to check it when storing values at the end of the session. The impact is that if neither the first nor the second one are changed, none of them are saved.	2013-07-01 18:08:41 +02:00
Willy Tarreau	dc13c11c1e	BUG/MEDIUM: prevent gcc from moving empty keywords lists into BSS Benoit Dolez reported a failure to start haproxy 1.5-dev19. The process would immediately report an internal error with missing fetches from some crap instead of ACL names. The cause is that some versions of gcc seem to trim static structs containing a variable array when moving them to BSS, and only keep the fixed size, which is just a list head for all ACL and sample fetch keywords. This was confirmed at least with gcc 3.4.6. And we can't move these structs to const because they contain a list element which is needed to link all of them together during the parsing. The bug indeed appeared with 1.5-dev19 because it's the first one to have some empty ACL keyword lists. One solution is to impose -fno-zero-initialized-in-bss to everyone but this is not really nice. Another solution consists in ensuring the struct is never empty so that it does not move there. The easy solution consists in having a non-null list head since it's not yet initialized. A new "ILH" list head type was thus created for this purpose : create an Initialized List Head so that gcc cannot move the struct to BSS. This fixes the issue for this version of gcc and does not create any burden for the declarations.	2013-06-21 23:29:02 +02:00
Willy Tarreau	8615c2af67	MEDIUM: session: disable lingering on the server when the client aborts When abortonclose is used and an error is detected on the client side, better force an RST to the server. That way we propagate to the server the same vision we got from the client, and we ensure that we won't keep TIME_WAITs.	2013-06-21 08:20:19 +02:00
Willy Tarreau	be4a3eff34	MEDIUM: counters: use sc0/sc1/sc2 instead of sc1/sc2/sc3 It was a bit inconsistent to have gpc start at 0 and sc start at 1, so make sc start at zero like gpc. No previous release was issued with sc3 anyway, so no existing setup should be affected.	2013-06-17 15:04:07 +02:00
Willy Tarreau	6d4e4e8dd2	MEDIUM: acl: remove a lot of useless ACLs that are equivalent to their fetches The following 116 ACLs were removed because they're redundant with their fetch function since last commit which allows the fetch function to be used instead for types BOOL, INT and IP. Most places are now left with an empty ACL keyword list that was not removed so that it's easier to add other ACLs later. always_false, always_true, avg_queue, be_conn, be_id, be_sess_rate, connslots, nbsrv, queue, srv_conn, srv_id, srv_is_up, srv_sess_rate, res.comp, fe_conn, fe_id, fe_sess_rate, dst_conn, so_id, wait_end, http_auth, http_first_req, status, dst, dst_port, src, src_port, sc1_bytes_in_rate, sc1_bytes_out_rate, sc1_clr_gpc0, sc1_conn_cnt, sc1_conn_cur, sc1_conn_rate, sc1_get_gpc0, sc1_gpc0_rate, sc1_http_err_cnt, sc1_http_err_rate, sc1_http_req_cnt, sc1_http_req_rate, sc1_inc_gpc0, sc1_kbytes_in, sc1_kbytes_out, sc1_sess_cnt, sc1_sess_rate, sc1_tracked, sc1_trackers, sc2_bytes_in_rate, sc2_bytes_out_rate, sc2_clr_gpc0, sc2_conn_cnt, sc2_conn_cur, sc2_conn_rate, sc2_get_gpc0, sc2_gpc0_rate, sc2_http_err_cnt, sc2_http_err_rate, sc2_http_req_cnt, sc2_http_req_rate, sc2_inc_gpc0, sc2_kbytes_in, sc2_kbytes_out, sc2_sess_cnt, sc2_sess_rate, sc2_tracked, sc2_trackers, sc3_bytes_in_rate, sc3_bytes_out_rate, sc3_clr_gpc0, sc3_conn_cnt, sc3_conn_cur, sc3_conn_rate, sc3_get_gpc0, sc3_gpc0_rate, sc3_http_err_cnt, sc3_http_err_rate, sc3_http_req_cnt, sc3_http_req_rate, sc3_inc_gpc0, sc3_kbytes_in, sc3_kbytes_out, sc3_sess_cnt, sc3_sess_rate, sc3_tracked, sc3_trackers, src_bytes_in_rate, src_bytes_out_rate, src_clr_gpc0, src_conn_cnt, src_conn_cur, src_conn_rate, src_get_gpc0, src_gpc0_rate, src_http_err_cnt, src_http_err_rate, src_http_req_cnt, src_http_req_rate, src_inc_gpc0, src_kbytes_in, src_kbytes_out, src_sess_cnt, src_sess_rate, src_updt_conn_cnt, table_avl, table_cnt, ssl_c_ca_err, ssl_c_ca_err_depth, ssl_c_err, ssl_c_used, ssl_c_verify, ssl_c_version, ssl_f_version, ssl_fc, ssl_fc_alg_keysize, ssl_fc_has_crt, ssl_fc_has_sni, ssl_fc_use_keysize,	2013-06-11 21:22:58 +02:00
Willy Tarreau	9a355ec257	MEDIUM: http: add support for action "set-log-level" in http-request/http-response Some users want to disable logging for certain non-important requests such as stats requests or health-checks coming from another equipment. Other users want to log with a higher importance (eg: notice) some special traffic (POST requests, authenticated requests, requests coming from suspicious IPs) or some abnormally large responses. This patch responds to all these needs at once by adding a "set-log-level" action to http-request/http-response. The 8 syslog levels are supported, as well as "silent" to disable logging.	2013-06-11 17:50:26 +02:00
Willy Tarreau	2b57cb8f30	MEDIUM: protocol: implement a "drain" function in protocol layers Since commit `cfd97c6f` was merged into 1.5-dev14 (BUG/MEDIUM: checks: prevent TIME_WAITs from appearing also on timeouts), some valid health checks sometimes used to show some TCP resets. For example, this HTTP health check sent to a local server : 19:55:15.742818 IP 127.0.0.1.16568 > 127.0.0.1.8000: S 3355859679:3355859679(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742841 IP 127.0.0.1.8000 > 127.0.0.1.16568: S 1060952566:1060952566(0) ack 3355859680 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:15.742863 IP 127.0.0.1.16568 > 127.0.0.1.8000: . ack 1 win 257 19:55:15.745402 IP 127.0.0.1.16568 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:15.745488 IP 127.0.0.1.8000 > 127.0.0.1.16568: FP 1:146(145) ack 23 win 257 19:55:15.747109 IP 127.0.0.1.16568 > 127.0.0.1.8000: R 23:23(0) ack 147 win 257 After some discussion with Chris Huang-Leaver, it appeared clear that what we want is to only send the RST when we have no other choice, which means when the server has not closed. So we still keep SYN/SYN-ACK/RST for pure TCP checks, but don't want to see an RST emitted as above when the server has already sent the FIN. The solution against this consists in implementing a "drain" function at the protocol layer, which, when defined, causes as much as possible of the input socket buffer to be flushed to make recv() return zero so that we know that the server's FIN was received and ACKed. On Linux, we can make use of MSG_TRUNC on TCP sockets, which has the benefit of draining everything at once without even copying data. On other platforms, we read up to one buffer of data before the close. If recv() manages to get the final zero, we don't disable lingering. Same for hard errors. Otherwise we do. In practice, on HTTP health checks we generally find that the close was pending and is returned upon first recv() call. The network trace becomes cleaner : 19:55:23.650621 IP 127.0.0.1.16561 > 127.0.0.1.8000: S 3982804816:3982804816(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650644 IP 127.0.0.1.8000 > 127.0.0.1.16561: S 4082139313:4082139313(0) ack 3982804817 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7> 19:55:23.650666 IP 127.0.0.1.16561 > 127.0.0.1.8000: . ack 1 win 257 19:55:23.651615 IP 127.0.0.1.16561 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257 19:55:23.651696 IP 127.0.0.1.8000 > 127.0.0.1.16561: FP 1:146(145) ack 23 win 257 19:55:23.652628 IP 127.0.0.1.16561 > 127.0.0.1.8000: F 23:23(0) ack 147 win 257 19:55:23.652655 IP 127.0.0.1.8000 > 127.0.0.1.16561: . ack 24 win 257 This change should be backported to 1.4 which is where Chris encountered this issue. The code is different, so probably the tcp_drain() function will have to be put in the checks only.	2013-06-10 20:33:23 +02:00
Willy Tarreau	6f1615f596	MINOR: counters: add fetch/acl sc*_tracked to indicate whether a counter is tracked Sometimes we'd like to know if a counter is being tracked before adding a header to an outgoing request. These ones do that.	2013-06-10 10:30:09 +02:00
Willy Tarreau	ba2ffd18b5	MEDIUM: counters: add a new "gpc0_rate" counter in stick-tables This counter is special in that instead of reporting the gpc0 cumulative count, it returns its increase rate over the configured period.	2013-05-29 15:54:14 +02:00
Willy Tarreau	e25c917af8	MEDIUM: counters: add support for tracking a third counter We're often missin a third counter to track base, src and base+src at the same time. Here we introduce track_sc3 to have this third counter. It would be wise not to add much more counters because that slightly increases the session size and processing time though the real issue is more the declaration of the keywords in the code and in the doc.	2013-05-29 00:37:16 +02:00
Willy Tarreau	d5ca9abb0d	MINOR: counters: make it easier to extend the amount of tracked counters By properly affecting the flags and values, it becomes easier to add more tracked counters, for example for experimentation. It also slightly reduces the code and the number of tests. No counters were added with this patch.	2013-05-28 17:43:40 +02:00
Willy Tarreau	1e5dfdad77	MINOR: session: only call http_send_name_header() when changing the server Till now we used to call the function until the connection established, which means that the header rewriting was performed for nothing upon each even (eg: uploaded contents) until the server responded or timed out. Now we only call the function when we assign the server.	2013-04-11 18:18:01 +02:00
Willy Tarreau	d86e29d2a1	CLEANUP: acl: remove unused references to ACL_USE_* Now that acl->requires is not used anymore, we can remove all references to it as well as all ACL_USE_* flags.	2013-04-03 02:13:00 +02:00
Willy Tarreau	c48c90dfa5	MAJOR: acl: remove the arg_mask from the ACL definition and use the sample fetch's Now that ACLs solely rely on sample fetch functions, make them use the same arg mask. All inconsistencies have been fixed separately prior to this patch, so this patch almost only adds a new pointer indirection and removes all references to ARG*() in the definitions. The parsing is still performed by the ACL code though.	2013-04-03 02:12:58 +02:00
Willy Tarreau	8ed669b12a	MAJOR: acl: make all ACLs reference the fetch function via a sample. ACL fetch functions used to directly reference a fetch function. Now that all ACL fetches have their sample fetches equivalent, we can make ACLs reference a sample fetch keyword instead. In order to simplify the code, a sample keyword name may be NULL if it is the same as the ACL's, which is the most common case. A minor change appeared, http_auth always expects one argument though the ACL allowed it to be missing and reported as such afterwards, so fix the ACL to match this. This is not really a bug.	2013-04-03 02:12:58 +02:00
Willy Tarreau	281c799e25	MINOR: session: rename sample fetch functions and declare the sample keywords The following sample fetch functions were only usable by ACLs but are now usable by sample fetches too : sc1_bytes_in_rate, sc1_bytes_out_rate, sc1_clr_gpc0, sc1_conn_cnt, sc1_conn_cur, sc1_conn_rate, sc1_get_gpc0, sc1_http_err_cnt, sc1_http_err_rate, sc1_http_req_cnt, sc1_http_req_rate, sc1_inc_gpc0, sc1_kbytes_in, sc1_kbytes_out, sc1_sess_cnt, sc1_sess_rate, sc1_trackers, sc2_bytes_in_rate, sc2_bytes_out_rate, sc2_clr_gpc0, sc2_conn_cnt, sc2_conn_cur, sc2_conn_rate, sc2_get_gpc0, sc2_http_err_cnt, sc2_http_err_rate, sc2_http_req_cnt, sc2_http_req_rate, sc2_inc_gpc0, sc2_kbytes_in, sc2_kbytes_out, sc2_sess_cnt, sc2_sess_rate, sc2_trackers, src_bytes_in_rate, src_bytes_out_rate, src_clr_gpc0, src_conn_cnt, src_conn_cur, src_conn_rate, src_get_gpc0, src_http_err_cnt, src_http_err_rate, src_http_req_cnt, src_http_req_rate, src_inc_gpc0, src_kbytes_in, src_kbytes_out, src_sess_cnt, src_sess_rate, src_updt_conn_cnt, table_avl, table_cnt, The fetch functions have been renamed "smp_fetch_*".	2013-04-03 02:12:58 +02:00
Willy Tarreau	a7a7ebc382	BUG/MINOR: http: don't process abortonclose when request was sent option abortonclose may cause a valid connection to be aborted just after the request has been sent. This is because we check for it during the session establishment sequence before checking for write activity. So if the abort and the connect complete at the same time, the abort is still considered. Let's check for an explicity partial write before aborting. This fix should be backported to 1.4 too.	2012-12-30 00:50:35 +01:00
Willy Tarreau	71241abfd3	MINOR: http: move redirect rule processing to its own function We now have http_apply_redirect_rule() which does all the redirect-specific job instead of having this inside http_process_req_common(). Also one of the benefit gained from uniformizing this code is that both keep-alive and close response do emit the PR-- flags. The fix for the flags could probably be backported to 1.4 though it's very minor. The previous function http_perform_redirect() was becoming confusing so it was renamed http_perform_server_redirect() since it only applies to server-based redirection.	2012-12-28 14:47:19 +01:00
Willy Tarreau	d79a3b248e	BUG/MINOR: log: make log-format, unique-id-format and add-header more independant It happens that all of them call parse_logformat_line() which sets proxy->to_log with a number of flags affecting the line format for all three users. For example, having a unique-id specified disables the default log-format since fe->to_log is tested when the session is established. Similarly, having "option logasap" will cause "+" to be inserted in unique-id or headers referencing some of the fields depending on LW_BYTES. This patch first removes most of the dependency on fe->to_log whenever possible. The first possible cleanup is to stop checking fe->to_log for being null, considering that it always contains at least LW_INIT when any such usage is made of the log-format! Also, some checks are wrong. s->logs.logwait cannot be nulled by "logwait &= ~LW_" since LW_INIT is always there. This results in getting the wrong log at the end of a request or session when a unique-id or add-header is set, because logwait is still not null but the log-format is not checked. Further cleanups are required. Most LW_ flags should be removed or at least replaced with what they really mean (eg: depend on client-side connection, depend on server-side connection, etc...) and this should only affect logging, not other mechanisms. This patch fixes the default log-format and tries to limit interferences between the log formats, but does not pretend to do more for the moment, since it's the most visible breakage.	2012-12-28 09:51:00 +01:00
Willy Tarreau	20d46a5a95	CLEANUP: session: use an array for the stick counters The stick counters were in two distinct sets of struct members, causing some code to be duplicated. Now we use an array, which enables some processing to be performed in loops. This allowed the code to be shrunk by 700 bytes.	2012-12-09 15:57:16 +01:00
Willy Tarreau	2406db4b39	MEDIUM: counters: add sc1_trackers/sc2_trackers Returns the current amount of concurrent connections tracking the same tracked counters. This number is automatically incremented when tracking begins and decremented when tracking stops. It differs from sc1_conn_cur in that it does not rely on any stored information but on the table's reference count (the "use" value which is returned by "show table" on the CLI). This may sometimes be more suited for layer7 tracking.	2012-12-09 14:08:47 +01:00
Willy Tarreau	5d5b5d8eaf	MEDIUM: proto_tcp: add support for tracking L7 information Until now it was only possible to use track-sc1/sc2 with "src" which is the IPv4 source address. Now we can use track-sc1/sc2 with any fetch as well as any transformation type. It works just like the "stick" directive. Samples are automatically converted to the correct types for the table. Only "tcp-request content" rules may use L7 information, and such information must already be present when the tracking is set up. For example it becomes possible to track the IP address passed in the X-Forwarded-For header. HTTP request processing now also considers tracking from backend rules because we want to be able to update the counters even when the request was already parsed and tracked. Some more controls need to be performed (eg: samples do not distinguish between L4 and L6).	2012-12-09 14:08:47 +01:00
Willy Tarreau	0ede5a3318	BUG/MEDIUM: session: fix FD leak when transport layer logging is enabled Commit `2b199c9a` attempted to fix all places where the transport layer is improperly closed, but it missed one place in session_free(). If SSL ciphers are logged, the close() is delayed post-log and performed in session_free(). However, conn_xprt_close() only closes the transport layer but not the file descriptor, resulting in a slow FD leak which is hardly noticeable until the process cannot accept any new connection. A workaround consisted in disabling %sslv/%sslc in log-format. So use conn_full_close() instead of conn_xprt_close() to fix this there too. A similar pending issue existed in the close during outgoing connection failure, though on this side, the transport layer is never tracked at the moment.	2012-12-08 08:48:04 +01:00
Willy Tarreau	20879a0233	MEDIUM: connection: add error reporting for the SSL Get a bit more info in the logs when client-side SSL handshakes fail.	2012-12-03 17:21:52 +01:00
Willy Tarreau	8e3bf699db	MEDIUM: connection: add error reporting for the PROXY protocol header When the PROXY protocol header is expected and fails, leading to an abort of the incoming connection, we now emit a log message. If option dontlognull is set and it was just a port probe, then nothing is logged.	2012-12-03 17:21:51 +01:00
Willy Tarreau	0af2912fd1	MEDIUM: connection: add minimal error reporting in logs for incomplete connections Since the introduction of SSL, it became quite annoying not to get any useful info in logs about handshake failures. Let's improve reporting for embryonic sessions by checking a per-connection error code and reporting it into the logs if an error happens before the session is completely instanciated. The "dontlognull" option is supported in that if a connection does not talk before being aborted, nothing will be emitted. At the moment, only timeouts are considered for SSL and the PROXY protocol, but next patches will handle more errors.	2012-12-03 15:38:23 +01:00
Willy Tarreau	14cba4b0b1	MEDIUM: connection: add an error code in connections This will be needed to improve error reporting, especially for SSL.	2012-12-03 14:22:13 +01:00
Willy Tarreau	8139b9959f	MINOR: compression: make the stats a bit more robust To ensure that we only count when a response was compressed, we also check for the SN_COMP_READY flag which indicates that the compression was effectively initialized. Comp_algo alone is meaningless.	2012-11-27 09:34:00 +01:00
Willy Tarreau	5e16cbc3bd	MINOR: stats: report the total number of compressed responses per front/back Depending on the content-types and accept-encoding fields, some responses might or might not be compressed. Let's have a counter of the number of compressed responses and report it in the stats to help improve compression usage. Some cosmetic issues were fixed in the CSV output too (missing commas at the end).	2012-11-24 14:54:13 +01:00
Willy Tarreau	2b199c9ac3	MEDIUM: connection: provide a common conn_full_close() function Several places got the connection close sequence wrong because it was not obvious. In practice we always need the same sequence when aborting, so let's have a common function for this.	2012-11-23 17:32:21 +01:00
Willy Tarreau	543db62e1f	BUG/MEDIUM: compression: release the zlib pools between keep-alive requests There was a possible memory leak in the zlib code when the first response of a keep-alive session was compressed, because the next request would reset the compression algo, preventing a later call to session_free() from releasing it. The reason is that it is necessary to release the assigned resources in http_end_txn_clean_session().	2012-11-15 16:41:22 +01:00
William Lallemand	ec3e3890f0	BUG/MINOR: compression: deinit zlib only when required The zlib stream was deinitialized even when the init failed.	2012-11-15 15:42:17 +01:00
Willy Tarreau	3fdb366885	MAJOR: connection: replace struct target with a pointer to an enum Instead of storing a couple of (int, ptr) in the struct connection and the struct session, we use a different method : we only store a pointer to an integer which is stored inside the target object and which contains a unique type identifier. That way, the pointer allows us to retrieve the object type (by dereferencing it) and the object's address (by computing the displacement in the target structure). The NULL pointer always corresponds to OBJ_TYPE_NONE. This reduces the size of the connection and session structs. It also simplifies target assignment and compare. In order to improve the generated code, we try to put the obj_type element at the beginning of all the structs (listener, server, proxy, si_applet), so that the original and target pointers are always equal. A lot of code was touched by massive replaces, but the changes are not that important.	2012-11-12 00:42:33 +01:00
Willy Tarreau	b31c971bef	CLEANUP: channel: remove any reference of the hijackers Hijackers were functions designed to inject data into channels in the distant past. They became unused around 1.3.16, and since there has not been any user of this mechanism to date, it's uncertain whether the mechanism still works (and it's not really useful anymore). So better remove it as well as the pointer it uses in the channel struct.	2012-11-11 23:05:39 +01:00
Willy Tarreau	7f7ad91056	BUILD: stream_interface: remove si_fd() and its references si_fd() is not used a lot, and breaks builds on OpenBSD 5.2 which defines this name for its own purpose. It's easy enough to remove this one-liner function, so let's do it.	2012-11-11 20:53:29 +01:00
Willy Tarreau	815f5ecffa	BUG/MINOR: session: mark the handshake as complete earlier There is a small waste of CPU cycles when no handshake is required on an accepted connection, because we had to perform one call to conn_fd_handler() to mark the connection CONNECTED and to call process_session() again to say that nothing happened. By marking the connection CONNECTED when there is no pending handshake, we avoid this extra call to process_session().	2012-11-09 22:09:08 +01:00
Willy Tarreau	798f4325fa	OPTIM: session: don't process the whole session when only timers need a refresh Having a global expiration timer for a task means that the tasks are regularly woken up (at least after each expiration timer). It's totally useless and counter productive to process the whole session upon each such wakeup, and it's fairly easy to detect such wakeups, so let's just update the task's timer and return to sleep when this happens. For 100k concurrent connections with 10s of timeouts, this can save 10k wakeups per second, which is not bad.	2012-11-08 16:55:07 +01:00
William Lallemand	1c2d622d82	CLEANUP: use struct comp_ctx instead of union Replace union comp_ctx by struct comp_ctx. Use struct comp_ctx * in the init/add_data/flush/reset/end prototypes of compression.h functions.	2012-11-05 10:23:16 +01:00
Willy Tarreau	e3224e870f	BUG/MINOR: session: ensure that we don't retry connection if some data were sent With extra-large buffers, it is possible that a lot of data are sent upon connection establishment before the session is notified. The issue is how to handle a send() error after some data were actually sent. At the moment, only a connection error is reported, causing a new connection attempt and send() to restart after the last data. We absolutely don't want to retry the connect() if at least one byte was sent, because those data are lost. The solution consists in reporting exactly what happens, which is : - a successful connection attempt - a read/write error on the channel That way we go on with sess_establish(), the response analysers are called and report the appropriate connection state for the error (typically a server abort while waiting for a response). This mechanism also guarantees that we won't retry since it's a success. The logs also report the correct connect time. Note that 1.4 is not directly affected because it only attempts one send(), so it cannot detect a send() failure here and distinguish it form a failed connection attempt. So no backport is needed. Also, this is just a safe belt we're taking, since this issue should not happen anymore since previous commit.	2012-10-29 23:31:04 +01:00
Willy Tarreau	19d14ef104	MEDIUM: make the trash be a chunk instead of a char * The trash is used everywhere to store the results of temporary strings built out of s(n)printf, or as a storage for a chunk when chunks are needed. Using global.tune.bufsize is not the most convenient thing either. So let's replace trash with a chunk and directly use it as such. We can then use trash.size as the natural way to get its size, and get rid of many intermediary chunks that were previously used. The patch is huge because it touches many areas but it makes the code a lot more clear and even outlines places where trash was used without being that obvious.	2012-10-29 16:57:30 +01:00
Willy Tarreau	f2943dccd0	MAJOR: session: detach the connections from the stream interfaces We will need to be able to switch server connections on a session and to keep idle connections. In order to achieve this, the preliminary requirement is that the connections can survive the session and be detached from them. Right now they're still allocated at exactly the same place, so when there is a session, there are always 2 connections. We could soon improve on this by allocating the outgoing connection only during a connect(). This current patch touches a lot of code and intentionally does not change any functionnality. Performance tests show no regression (even a very minor improvement). The doc has not yet been updated.	2012-10-26 20:15:20 +02:00
Willy Tarreau	c919dc66a3	CLEANUP: remove trashlen trashlen is a copy of global.tune.bufsize, so let's stop using it as a duplicate, fall back to the original bufsize, it's less confusing this way.	2012-10-26 20:04:27 +02:00
William Lallemand	82fe75c1a7	MEDIUM: HTTP compression (zlib library support) This commit introduces HTTP compression using the zlib library. http_response_forward_body has been modified to call the compression functions. This feature includes 3 algorithms: identity, gzip and deflate: * identity: this is mostly for debugging, and it was useful for developping the compression feature. With Content-Length in input, it is making each chunk with the data available in the current buffer. With chunks in input, it is rechunking, the output chunks will be bigger or smaller depending of the size of the input chunk and the size of the buffer. Identity does not apply any change on data. * gzip: same as identity, but applying a gzip compression. The data are deflated using the Z_NO_FLUSH flag in zlib. When there is no more data in the input buffer, it flushes the data in the output buffer (Z_SYNC_FLUSH). At the end of data, when it receives the last chunk in input, or when there is no more data to read, it writes the end of data with Z_FINISH and the ending chunk. * deflate: same as gzip, but with deflate algorithm and zlib format. Note that this algorithm has ambiguous support on many browsers and no support at all from recent ones. It is strongly recommended not to use it for anything else than experimentation. You can't choose the compression ratio at the moment, it will be set to Z_BEST_SPEED (1), as tests have shown very little benefit in terms of compression ration when going above for HTML contents, at the cost of a massive CPU impact. Compression will be activated depending of the Accept-Encoding request header. With identity, it does not take care of that header. To build HAProxy with zlib support, use USE_ZLIB=1 in the make parameters. This work was initially started by David Du Colombier at Exceliance.	2012-10-26 02:30:48 +02:00
Willy Tarreau	c93f7959e5	CLEANUP: session: remove term_trace which is not used anymore This field was used to trace precisely where a session was terminated but it did not survive code rearchitecture and was not used at all anymore. Let's get rid of it.	2012-10-13 11:10:30 +02:00
Willy Tarreau	9b28e03b66	MAJOR: channel: replace the struct buffer with a pointer to a buffer With this commit, we now separate the channel from the buffer. This will allow us to replace buffers on the fly without touching the channel. Since nobody is supposed to keep a reference to a buffer anymore, doing so is not a problem and will also permit some copy-less data manipulation. Interestingly, these changes have shown a 2% performance increase on some workloads, probably due to a better cache placement of data.	2012-10-13 09:07:52 +02:00
Willy Tarreau	394db379eb	REORG: http: rename msg->buf to msg->chn since it's a channel It's extremely confusing to have all those msg->buf->buf everywhere after the extraction of the buffer from the channel. Let's clean this up.	2012-10-12 22:40:39 +02:00
Willy Tarreau	93dbc2bc0e	MEDIUM: log: add a new LW_XPRT flag to pin the transport layer This flag will have to be set on log tags which require transport layer information. They will prevent the conn_xprt_close() call from releasing the transport layer too early.	2012-10-12 20:30:51 +02:00
Willy Tarreau	1e954913de	MEDIUM: connection: add a flag to hold the transport layer When we start logging SSL information, we need the SSL struct to be present even past the conn_xprt_close() call. In order to achieve this, we should use refcounting on the connection and the transport layer. At the moment it's not worth using plain refcounting as only the logs require this, so instead of real refcounting we just use a flag which will be set by the log subsystem when SSL data need to be logged. What happens then is that the xprt->close() call is ignored and the transport layer is closed again during session_free(), after the log line is emitted.	2012-10-12 20:30:50 +02:00
Willy Tarreau	91083f5c8f	BUG/MEDIUM: session: enable the conn_session_update() callback This callback was introduced by commit `9683e9a0` but never enabled because the CO_FL_WAKE_DATA flag was not set. The result is that this function is never called when an SSL handshake fails, so the connection is only closed on timeout.	2012-10-12 20:30:38 +02:00
Willy Tarreau	e9909f4e50	BUG/MINOR: session: fix some leftover from debug code Commit `82569f91` moved the health and monitor-net checks to session.c but a debug test introduced 0& to disable MSG_DONTWAIT in the recv() call and this debug code remained there. Since the socket is marked non-blocking, there should be no effect but it's dangerous to keep such a thing here.	2012-10-12 17:36:40 +02:00
Willy Tarreau	1bc4aab290	MEDIUM: listener: add support for linux's accept4() syscall On Linux, accept4() does the same as accept() except that it allows the caller to specify some flags to set on the resulting socket. We use this to set the O_NONBLOCK flag and thus to save one fcntl() call in each connection. The effect is a small performance gain of around 1%. The option is automatically enabled when target linux2628 is set, or when the USE_ACCEPT4 Makefile variable is set. If the libc is too old to provide the equivalent function, this is automatically detected and our own function is used instead. In any case it is possible to force the use of our implementation with USE_MY_ACCEPT4.	2012-10-08 20:11:03 +02:00
Willy Tarreau	9683e9a05f	MEDIUM: session: register a data->wake callback to process errors The connection layer will soon call ->wake() only when errors happen, and not ->init(). So make the session layer use this callback to detect errors and abort connections.	2012-10-04 22:26:10 +02:00
Willy Tarreau	071e137ec2	MEDIUM: connection: use a generic data-layer init() callback The generic data-layer init callback is now used after the transport layer is complete and before calling the data layer recv/send callbacks. This allows the session to switch from the embryonic session data layer to the complete stream interface data layer, by making conn_session_complete() the data layer's init callback. It sill looks awkwards that the init() callback must be used opon error, but except by adding yet another one, it does not seem to be mergeable into another function (eg: it should probably not be merged with ->wake to avoid unneeded calls during the handshake, though semantically that would make sense).	2012-10-04 22:26:10 +02:00
Willy Tarreau	5e75e2755e	MEDIUM: session: use a specific data_cb for embryonic sessions We don't want to have the recv or send callbacks in embryonic sessions, and we want the stream interface to be referenced as the connection owner only once the session is instanciated. So let's first have the embryonic session be the owner, then replaced later by the stream interface once the transport layer is ready.	2012-10-04 22:26:10 +02:00
Willy Tarreau	4aa3683b2d	MINOR: connection: provide a generic data layer wakeup callback Instead of calling conn_notify_si() from the connection handler, we now call data->wake(), which will allow us to use a different callback with health checks. Note that we still rely on a flag in order to decide whether or not to call this function. The reason is that with embryonic sessions, the callback is already initialized to si_conn_cb without the flag, and we can't call the SI notify function in the leave path before the stream interface is initialized. This issue should be addressed by involving a different data_cb for embryonic sessions and for stream interfaces, that would be changed during session_complete() for the final data_cb.	2012-10-04 22:26:10 +02:00
Willy Tarreau	f7bc57ca6e	REORG: connection: rename the data layer the "transport layer" While working on the changes required to make the health checks use the new connections, it started to become obvious that some naming was not logical at all in the connections. Specifically, it is not logical to call the "data layer" the layer which is in charge for all the handshake and which does not yet provide a data layer once established until a session has allocated all the required buffers. In fact, it's more a transport layer, which makes much more sense. The transport layer offers a medium on which data can transit, and it offers the functions to move these data when the upper layer requests this. And it is the upper layer which iterates over the transport layer's functions to move data which should be called the data layer. The use case where it's obvious is with embryonic sessions : an incoming SSL connection is accepted. Only the connection is allocated, not the buffers nor stream interface, etc... The connection handles the SSL handshake by itself. Once this handshake is complete, we can't use the data functions because the buffers and stream interface are not there yet. Hence we have to first call a specific function to complete the session initialization, after which we'll be able to use the data functions. This clearly proves that SSL here is only a transport layer and that the stream interface constitutes the data layer. A similar change will be performed to rename app_cb => data, but the two could not be in the same commit for obvious reasons.	2012-10-04 22:26:09 +02:00
Willy Tarreau	e603e69d18	MEDIUM: connection: make use of the owner instead of container_of This way the connection can become independant on the stream interface.	2012-09-28 00:01:23 +02:00
Willy Tarreau	82569f9158	MEDIUM: monitor: simplify handling of monitor-net and mode health We were having several different behaviours with monitor-net and "mode health" : - monitor-net on TCP connections was evaluated just after accept(), did not count a connection on the frontend and were not subject to tcp-request connection rules, and caused an immediate close(). - monitor-net in HTTP mode was evaluated once the session was accepted (eg: on top of SSL), returned "HTTP/1.0 200 OK\r\n\r\n" over the connection's data layer and instanciated a session which was responsible for closing this connection. A connection AND a session were counted for the frontend ; - "mode health" with "option httpchk" would do exactly the same as monitor-net in HTTP mode ; - "mode health" without "option httpchk" would do the same as above except that "OK" was returned instead of "HTTP/1.0 200 OK\r\n\r\n". None of them took care of cleaning the input buffer, sometimes resulting in a TCP reset to be emitted after the last packet if a request was received over the connection. Given the inconsistencies and the complexity in keeping all these features handled at the right position, we now slightly changed the way they are handled : - all of them are handled just after the "tcp-request connection" rules, so that all of them may be blocked using such rules, offering more flexibility and consistency ; - no connection handshake is performed anymore for non-TCP modes - all of them send the response as raw data over the socket, there is no more difference between TCP and HTTP mode for example (these rules were never meant to be served over SSL connections and were never documented as able to do that). - any possible pending data on the incoming socket is drained before the response is sent, in order to avoid the risk of a reset. - none of them exactly did what was documented ! This results in more consistent, more flexible and more accurate handling of monitor rules, with smaller and more robust code.	2012-09-28 00:01:22 +02:00
Cyril Bonté	3aaba440a2	BUILD: fix compilation error with DEBUG_FULL Recent changes in structures broke the compilation when using DEBUG_FULL. Let's update apply the changes also to the variables used in DPRINTF calls.	2012-09-24 20:36:39 +02:00
Willy Tarreau	d1d5454180	REORG: split "protocols" files into protocol and listener It was becoming confusing to have protocols and listeners in the same files, split them.	2012-09-15 22:29:32 +02:00
Willy Tarreau	cbaaec475c	MINOR: session: do not send an HTTP/500 error on SSL sockets If a session fails its initialization, we don't want to send HTTP/500 over the socket if it's not a raw data layer.	2012-09-06 11:32:07 +02:00
Willy Tarreau	783f25800c	BUILD: http: rename error_message http_error_message to fix conflicts on RHEL Duncan Hall reported a build issue on CentOS where error_message conflicts with another system declaration when SSL is enabled. Rename the function.	2012-09-04 12:19:04 +02:00
Willy Tarreau	dd2f85eb3b	CLEANUP: includes: fix includes for a number of users of fd.h It appears that fd.h includes a number of unneeded files and was included from standard.h, and as such served as an intermediary to provide almost everything to everyone. By removing its useless includes, a long dependency chain broke but could easily be fixed.	2012-09-03 20:49:14 +02:00
Willy Tarreau	40ff59d820	CLEANUP: fd: remove fdtab->flags These flags were added for TCP_CORK. They were only set at various places but never checked by any user since TCP_CORK was replaced with MSG_MORE. Simply get rid of this now.	2012-09-03 20:49:14 +02:00
Willy Tarreau	74172ff9c3	CLEANUP: frontend: remove the old proxy protocol decoder This one used to rely on a stream analyser which was inappropriate. It's not used anymore.	2012-09-03 20:47:35 +02:00
Willy Tarreau	22cda21ad5	MAJOR: connection: make the PROXY decoder a handshake handler The PROXY protocol is now decoded in the connection before other handshakes. This means that it may be extracted from a TCP stream before SSL is decoded from this stream.	2012-09-03 20:47:35 +02:00
Willy Tarreau	2542b53b19	MAJOR: session: introduce embryonic sessions When an incoming connection request is accepted, a connection structure is needed to store its state. However we don't want to fully initialize a session until the data layer is about to be ready. As long as the connection is physically stored into the session, it's not easy to split both allocations. As such, we only initialize the minimum requirements of a session, which results in what we call an embryonic session. Then once the data layer is ready, we can complete the function's initialization. Doing so avoids buffers allocation and ensures that a session only sees ready connections. The frontend's client timeout is used as the handshake timeout. It is likely that another timeout will be used in the future.	2012-09-03 20:47:35 +02:00
Willy Tarreau	15678efc45	MEDIUM: connection: add an ->init function to data layer SSL need to initialize the data layer before proceeding with data. At the moment, this data layer is automatically initialized from itself, which will not be possible once we extract connection from sessions since we'll only create the data layer once the handshake is finished. So let's have the application layer initialize the data layer before using it.	2012-09-03 20:47:34 +02:00
Willy Tarreau	64ee491309	MINOR: tcp: replace tcp_src_to_stktable_key with addr_to_stktable_key Make it more obvious that this function does not depend on any knowledge of the session. This is important to plan for TCP rules that can run on connection without any initialized session yet.	2012-09-03 20:47:34 +02:00
Willy Tarreau	93b0f4f6c6	MEDIUM: stream_interface: remove CAP_SPLTCP/CAP_SPLICE flags These ones are implicitly handled by the connection's data layer, no need to rely on them anymore and reaching them maintains undesired dependences on stream-interface.	2012-09-03 20:47:34 +02:00
Willy Tarreau	986a9d2d12	MAJOR: connection: move the addr field from the stream_interface We need to have the source and destination addresses in the connection. They were lying in the stream interface so let's move them. The flags SI_FL_FROM_SET and SI_FL_TO_SET have been moved as well. It's worth noting that tcp_connect_server() almost does not use the stream interface anymore except for a few flags. It has been identified that once we detach the connection from the SI, it will probably be needed to keep a copy of the server-side addresses in the SI just for logging purposes. This has not been implemented right now though.	2012-09-03 20:47:34 +02:00
Willy Tarreau	3cefd521fa	REORG: connection: move the target pointer from si to connection The target is per connection and is directly used by the connection, so we need it there. It's not needed anymore in the SI however.	2012-09-03 20:47:34 +02:00
Willy Tarreau	8263d2b259	CLEANUP: channel: use "channel" instead of "buffer" in function names This is a massive rename of most functions which should make use of the word "channel" instead of the word "buffer" in their names. In concerns the following ones (new names) : unsigned long long channel_forward(struct channel buf, unsigned long long bytes); static inline void channel_init(struct channel buf) static inline int channel_input_closed(struct channel buf) static inline int channel_output_closed(struct channel buf) static inline void channel_check_timeouts(struct channel b) static inline void channel_erase(struct channel buf) static inline void channel_shutr_now(struct channel buf) static inline void channel_shutw_now(struct channel buf) static inline void channel_abort(struct channel buf) static inline void channel_stop_hijacker(struct channel buf) static inline void channel_auto_connect(struct channel buf) static inline void channel_dont_connect(struct channel buf) static inline void channel_auto_close(struct channel buf) static inline void channel_dont_close(struct channel buf) static inline void channel_auto_read(struct channel buf) static inline void channel_dont_read(struct channel buf) unsigned long long channel_forward(struct channel *buf, unsigned long long bytes) Some functions provided by channel.[ch] have kept their "buffer" name because they are really designed to act on the buffer according to some information gathered from the channel. They have been moved together to the same place in the file for better readability but they were not changed at all. The "buffer" memory pool was also renamed "channel".	2012-09-03 20:47:33 +02:00
Willy Tarreau	03cdb7c678	CLEANUP: channel: usr CF_/CHN_ prefixes instead of BF_/BUF_ Get rid of these confusing BF_* flags. Now channel naming should clearly be used everywhere appropriate. No code was changed, only a renaming was performed. The comments about channel operations was updated.	2012-09-03 20:47:33 +02:00
Willy Tarreau	3bf1b2b816	MAJOR: channel: stop relying on BF_FULL to take action This flag is quite complex to get right and updating it everywhere is a major pain, especially since the buffer/channel split. This is the first step of getting rid of it. Instead now it's dynamically computed whenever needed.	2012-09-03 20:47:33 +02:00
Willy Tarreau	a75bcef867	REORG: buffer: move buffer_flush, b_adv and b_rew to buffer.h These one now operate over real buffers, not channels anymore.	2012-09-03 20:47:32 +02:00
Willy Tarreau	8e21bb9e52	MAJOR: channel: remove the BF_OUT_EMPTY flag This flag was very problematic because it was composite in that both changes to the pipe or to the buffer had to cause this flag to be updated, which is not always simple (eg: there may not even be a channel attached to a buffer at all). There were not that many users of this flags, mostly setters. So the flag got replaced with a macro which reports whether the channel is empty or not, by checking both the pipe and the buffer. One part of the change is sensible : the flag was also part of BF_MASK_STATIC, which is used by process_session() to rescan all analysers in case the flag's status changes. At first glance, none of the analysers seems to change its mind base on this flag when it is subject to change, so it seems fine not to add variation checks here. Otherwise it's possible that checking the buffer's output size is more useful than checking the flag's replacement.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c7e4238df0	REORG: buffers: split buffers into chunk,buffer,channel Many parts of the channel definition still make use of the "buffer" word.	2012-09-03 20:47:32 +02:00
Willy Tarreau	c578891112	CLEANUP: connection: split sock_ops into data_ops, app_cp and si_ops Some parts of the sock_ops structure were only used by the stream interface and have been moved into si_ops. Some of them were callbacks to the stream interface from the connection and have been moved into app_cp as they're the application seen from the connection (later, health-checks will need to use them). The rest has moved to data_ops. Normally at this point the connection could live without knowing about stream interfaces at all.	2012-09-03 20:47:31 +02:00
Willy Tarreau	96199b1016	MAJOR: stream-interface: restore splicing mechanism The splicing is now provided by the data-layer rcv_pipe/snd_pipe functions which in turn are called by the stream interface's recv and send callbacks. The presence of the rcv_pipe/snd_pipe functions is used to attest support for splicing at the data layer. It looks like the stream-interface's SI_FL_CAP_SPLICE flag does not make sense anymore as it's used as a proxy for the pointers above. It also appears that we call chk_snd() from the recv callback and then try to call it again in update_conn(). It is very likely that this last function will progressively slip into the recv/send callbacks in order to avoid duplicate check code. The code works right now with and without splicing. Only raw_sock provides support for it and it is automatically selected when the various splice options are set. However it looks like splice-auto doesn't enable it, which possibly means that the streamer detection code does not work anymore, or that it's only called at a time where it's too late to enable splicing (in process_session).	2012-09-03 20:47:31 +02:00
Willy Tarreau	75bf2c925f	REORG: sock_raw: rename the files raw_sock* The "raw_sock" prefix will be more convenient for naming functions as it will be prefixed with the data layer and suffixed with the data direction. So let's rename the files now to avoid any further confusion. The #include directive was also removed from a number of files which do not need it anymore.	2012-09-02 21:54:56 +02:00
Willy Tarreau	572bf9095d	REORG/MAJOR: extract "struct buffer" from "struct channel" At the moment, the struct is still embedded into the struct channel, but all the functions have been updated to use struct buffer only when possible, otherwise struct channel. Some functions would likely need to be splitted between a buffer-layer primitive and a channel-layer function. Later the buffer should become a pointer in the struct buffer, but doing so requires a few changes to the buffer allocation calls.	2012-09-02 21:54:56 +02:00
Willy Tarreau	7421efb85f	REORG/MAJOR: use "struct channel" instead of "struct buffer" This is a massive rename. We'll then split channel and buffer. This change needs a lot of cleanups. At many locations, the parameter or variable is still called "buf" which will become ambiguous. Also, the "struct channel" is still defined in buffers.h.	2012-09-02 21:54:55 +02:00
Willy Tarreau	f9dabecd03	MEDIUM: connection: make use of the new polling functions Now the connection handler, the handshake callbacks and the I/O callbacks make use of the connection-layer polling functions to enable or disable polling on a file descriptor. Some changes still need to be done to avoid using the FD_WAIT_* constants.	2012-09-02 21:53:11 +02:00
Willy Tarreau	49b046dddf	MAJOR: fd: replace all EV_FD_* macros with new fd__ inline calls These functions have a more explicity meaning and will offer provisions for explicit polling. EV_FD_ISSET() has been left for now as it is still in use in checks.	2012-09-02 21:53:11 +02:00
Willy Tarreau	8b117082bc	REORG: connection: replace si_data_close() with conn_data_close() This close function only applies to connection-specific parts and the stream-interface entry may soon disappear. Move this to the connection instead.	2012-09-02 21:53:10 +02:00
Willy Tarreau	fd31e53139	MAJOR: remove the stream interface and task management code from sock_* The socket data layer code must only focus on moving data between a socket and a buffer. We need a special stream interface handler to update the stream interface and the file descriptor status. At the moment the code works but suffers from a race condition caused by its API : the read/write callbacks still make use of the fd instead of using the connection. And when a double shutdown is performed, a call to ->write() after ->read() processed an error results in dereferencing a NULL fdtab[]->owner. This is only a temporary issue which doesn't need to be fixed now since this will automatically go away when the functions change to use the connection instead.	2012-09-02 21:53:08 +02:00
Willy Tarreau	076be25ab8	CLEANUP: remove the now unused fdtab direct I/O callbacks They were all left to NULL since last commit so we can safely remove them all now and remove the temporary dual polling logic in pollers.	2012-09-02 21:51:29 +02:00
Willy Tarreau	8018471f44	MINOR: fd: make fdtab->owner a connection and not a stream_interface anymore It is more convenient with a connection here and will abstract stream_interface more easily.	2012-09-02 21:51:28 +02:00
Willy Tarreau	d2274c6536	MAJOR: connection: replace direct I/O callbacks with the connection callback Almost all direct I/O callbacks have been changed to use the connection callback instead. Only the TCP connection validation remains.	2012-09-02 21:51:28 +02:00
Willy Tarreau	4e6049e553	MINOR: fd: add a new I/O handler to fdtab This one will eventually replace both cb[] handlers. At the moment it is not used yet.	2012-09-02 21:51:27 +02:00
Willy Tarreau	505e34a36d	MAJOR: get rid of fdtab[].state and use connection->flags instead fdtab[].state was only used to know whether a connection was in progress or an error was encountered. Instead we now use connection->flags to store a flag for both. This way, connection management will be able to update the connection status on I/O.	2012-09-02 21:51:26 +02:00
Willy Tarreau	db3b32610f	REORG/MEDIUM: fd: remove FD_STCLOSE from struct fdtab In an attempt to get rid of fdtab[].state, and to move the relevant parts to the connection struct, we remove the FD_STCLOSE state which can easily be deduced from the <owner> pointer as there is a 1:1 match.	2012-09-02 21:51:25 +02:00
Willy Tarreau	96596aeead	MEDIUM: fd/si: move peeraddr from struct fdinfo to struct connection The destination address is purely a connection thing and not an fd thing. It's also likely that later the address will be stored into the connection and linked to by the SI. struct fdinfo only keeps the pointer to the port range and the local port for now. All of this also needs to move to the connection but before this the release of the port range must move from fd_delete() to a new function dedicated to the connection.	2012-06-08 22:59:52 +02:00
Emeric Brun	d88fd824b7	MEDIUM: protocol: add a pointer to struct sock_ops to the listener struct The listener struct is now aware of the socket layer to use upon accept(). At the moment, only sock_raw is supported so this patch should not change anything.	2012-05-21 22:22:39 +02:00
Emeric Brun	21adb02d19	MINOR: stream_interface: add a pointer to the listener for TARG_TYPE_CLIENT When the target is a client, it will be convenient to have a pointer to the original listener so that we can retrieve some configuration information at the stream interface level.	2012-05-21 22:22:39 +02:00
Willy Tarreau	4da69a91a0	MEDIUM: stream_interface: call si_data_close() before releasing the si This will ensure that the data layer releases anything previously allocated.	2012-05-21 18:07:11 +02:00
Willy Tarreau	fb7508aefb	REORG/MINOR: stream_interface: move si->fd to struct connection The socket fd is used only when in socket mode and with a connection.	2012-05-21 16:47:54 +02:00
Willy Tarreau	73b013b070	MINOR: stream_interface: introduce a new "struct connection" type We start to move everything needed to manage a connection to a special entity "struct connection". We have the data layer operations and the control operations there. We'll also have more info in the future such as file descriptors and applet contexts, so that in the end it becomes detachable from the stream interface, which will allow connections to be reused between sessions. For now on, we start with minimal changes.	2012-05-21 16:31:45 +02:00
Willy Tarreau	fe7f1ea68e	REORG/MINOR: session: detect the TCP monitor checks at the protocol accept It does not make sense anymore to wait for a session creation to process a TCP monitor check which only closes the connection and returns. Better to process this immediately after the accept() return. It also saves us from counting a connection for monitor checks, which is much more logical.	2012-05-20 19:22:25 +02:00
Willy Tarreau	be0688c64d	MEDIUM: stream_interface: remove the si->init Calling the init() function in sess_establish was a bad idea, it is too late to allow it to fail on lack of resource and does not help at all. Remove it for now before it's used.	2012-05-18 15:15:26 +02:00
Willy Tarreau	7bb68abb9f	OPTIM/MEDIUM: stream_interface: add a new SI_FL_NOHALF flag This flag indicates that we're not interested in keeping half-open connections on a stream interface. It has the benefit of allowing the socket layer to cause an immediate write close when detecting an incoming read close. This releases resources much faster and saves one syscall (either a shutdown or setsockopt). This flag is only set by HTTP on the interface going to the server since we don't want to continue pushing data there when it has closed. Another benefit is that it responds with a FIN to a server's FIN instead of responding with an RST as it used to, which is much cleaner. Performance gains of 7.5% have been measured on HTTP connection rate on empty objects.	2012-05-13 14:52:22 +02:00
Willy Tarreau	b147a8382a	CLEANUP: fd: remove unused cb->b pointers in the struct fdtab These pointers were used to hold pointers to buffers in the past, but since we introduced the stream interface, they're no longer used but they were still sometimes set. Removing them shrink the struct fdtab from 32 to 24 bytes on 32-bit machines, and from 52 to 36 bytes on 64-bit machines, which is a significant saving. A quick tests shows a steady 0.5% performance gain, probably due to the better cache efficiency.	2012-05-13 00:35:44 +02:00
Willy Tarreau	ce887fd3b2	MEDIUM: session: add support for tunnel timeouts Tunnel timeouts are used when TCP connections are forwarded, or when forwarding upgraded HTTP connections (WebSocket) as well as CONNECT requests to proxies. This timeout allows long-lived sessions to be supported without having to set large timeouts to normal requests.	2012-05-12 12:50:00 +02:00
Willy Tarreau	2f5b6fc090	MINOR: session: call the socket layer init function when a session establishes In sess_establish, once we've prepared everythin, we can call the socket layer init function. We pass an argument for targets which have one (eg: servers). At the moment, the existing socket layers don't have init functions, but SSL will need one.	2012-05-12 08:09:27 +02:00
Willy Tarreau	f873d754f8	CLEANUP: stream_interface: stop exporting socket layer functions Similarly to the previous patch, we don't need the socket-layer functions outside of stream_interface. They could even move to a file dedicated to applets, though that does not seem particularly useful at the moment.	2012-05-11 17:47:17 +02:00
Willy Tarreau	b277d6e568	CLEANUP: sock_raw: remove last references to stream_sock We also stop exporting all functions since they're not needed anymore outside of sock_raw.c.	2012-05-11 17:03:42 +02:00
Willy Tarreau	1539a01645	MINOR: stream_interface: add a client target : TARG_TYPE_CLIENT This one will be used to identify the direction the SI is being used. All incoming connections have a target of type TARG_TYPE_CLIENT.	2012-05-11 14:47:34 +02:00
Willy Tarreau	c63190d429	REORG: use the name sock_raw instead of stream_sock We'll soon have an SSL socket layer, and in order to ease the difference between the two, we use the name "sock_raw" to designate the one which directly talks to the sockets without any conversion.	2012-05-11 14:23:52 +02:00
Willy Tarreau	0a3dd74c9c	MEDIUM: cfgparse: use the new error reporting framework for remaining cfg_keywords All keywords registered using a cfg_kw_list now make use of the new error reporting framework. This allows easier and more precise error reporting without having to deal with complex buffer allocation issues.	2012-05-08 21:28:17 +02:00
Willy Tarreau	bd83314ee9	BUG/MEDIUM: log: ensure that unique_id is properly initialized Last memory poisonning patch immediately made this issue appear. The unique_id field is released but not properly initialized. The feature was introduced very recently, no backport is needed.	2012-05-08 21:28:16 +02:00
Willy Tarreau	63e7fe310e	BUG/MEDIUM: send_proxy: fix initialisation of send_proxy_ofs Commit `b22e55bc` introduced send_proxy_ofs but forgot to initialize it, which remained unnoticed since it's always at the same place in the stream interface. On a machine with dirty RAM returned by malloc(), some responses were holding a PROXY header, which normally is not possible. The problem goes away after properly initializing the field upon each new session_accept(). This fix does not need to be backported except if any code makes use of a backport of this feature.	2012-05-08 21:28:16 +02:00
Willy Tarreau	26d8c59f0b	REORG/MEDIUM: replace stream interface protocol functions by a proto pointer The stream interface now makes use of the socket protocol pointer instead of the direct functions.	2012-05-08 21:28:15 +02:00
Willy Tarreau	5c979a9c71	REORG/MEDIUM: stream_interface: initialize socket ops from descriptors	2012-05-08 21:28:14 +02:00
Willy Tarreau	1b79bdee26	REORG/MEDIUM: move protocol->{read,write} to sock_ops The protocol must not set the read and write callbacks, they're specific to the socket layer. Move them to sock_ops instead.	2012-05-08 21:28:14 +02:00
Willy Tarreau	060781fb4a	REORG: stream_interface: create a struct sock_ops to hold socket operations These operators are used regardless of the socket protocol family. Move them to a "sock_ops" struct. ->read and ->write have been moved there too as they have no reason to remain at the protocol level.	2012-05-08 21:28:14 +02:00
Willy Tarreau	cd3b094618	REORG: rename "pattern" files They're now called "sample" everywhere to match their description.	2012-05-08 20:57:21 +02:00
Willy Tarreau	1278578487	REORG: use the name "sample" instead of "pattern" to designate extracted data This is mainly a massive renaming in the code to get it in line with the calling convention. Next patch will rename a few files to complete this operation.	2012-05-08 20:57:20 +02:00
Willy Tarreau	32a6f2e572	MEDIUM: acl/pattern: use the same direction scheme Patterns were using a bitmask to indicate if request or response was desired in fetch functions and keywords. ACLs were using a bitmask in fetch keywords and a single bit in fetch functions. ACLs were also using an ACL_PARTIAL bit in fetch functions indicating that a non-final fetch was performed, which was an abuse of the existing direction flag. The change now consists in using : - a capabilities field for fetch keywords => SMP_CAP_REQ/RES to indicate if a keyword supports requests, responses, both, etc... - an option field for fetch functions to indicate what the caller expects (request/response, final/non-final) The ACL_PARTIAL bit was reversed to get SMP_OPT_FINAL as it's more explicit to know we're working on a final buffer than on a non-final one. ACL_DIR_* were removed, as well as PATTERN_FETCH_*. L4 fetches were improved to support being called on responses too since they're still available. The <dir> field of all fetch functions was changed to <opt> which is now unsigned. The patch is large but mostly made of cosmetic changes to accomodate this, as almost no logic change happened.	2012-05-08 20:57:17 +02:00
Willy Tarreau	24e32d8c6b	MEDIUM: acl: replace acl_expr with args in acl fetch_* functions Having the args everywhere will make it easier to share fetch functions between patterns and ACLs. The only place where we could have needed the expr was in the http_prefetch function which can do well without.	2012-05-08 20:57:16 +02:00
Willy Tarreau	f853c46bc3	MEDIUM: pattern/acl: get rid of temp_pattern in ACLs This one is not needed anymore as we can return the data and its type in the sample provided by the caller. ACLs now always return the proper type. BOOL is already returned when the result is expected to be processed as a boolean. temp_pattern has been unexported now.	2012-05-08 20:57:14 +02:00
Willy Tarreau	3740635b88	MAJOR: acl: make use of the new sample struct and get rid of acl_test This change is invasive in lines of code but not much in terms of functionalities as it's mainly a replacement of struct acl_test with struct sample.	2012-05-08 20:57:14 +02:00
Willy Tarreau	422aa0792d	MEDIUM: pattern: add new sample types to replace pattern types The new sample types are necessary for the acl-pattern convergence. These types are boolean and signed int. Some types were renamed for less ambiguity (ip->ipv4, integer->uint).	2012-05-08 20:57:14 +02:00
Willy Tarreau	0146c2e873	MEDIUM: acl: remove unused tests for missing args when args are mandatory A number of ACL fetch methods use mandatory arguments (eg: proxy names) so it's pointless to test for the presence of this argument now.	2012-05-08 20:57:12 +02:00
Willy Tarreau	fc2c1fd449	MAJOR: acl: ensure that implicit table and proxies are valid A large number of ACLs make use of frontend, backend or table names in their arguments, and fall back to the current proxy when no argument is passed. If the expected capability is not available, the ACL silently fails at runtime. Now we make all those names mandatory in the parser and we rely on acl_find_targets() to replace the missing names with the holding proxy, then to perform the appropriate tests, and to reject errors at parsing time. It is possible that some faulty configurations will get rejected from now on, while they used to silently fail till now. This is the reason why this change is marked as MAJOR.	2012-05-08 20:57:12 +02:00
Willy Tarreau	d28c353fc5	MAJOR: acl: make acl_find_targets also resolve proxy names at config time Proxy names are now resolved when the config is parsed and not at runtime. This means that errors will be caught for real instead of having an ACL silently never match. Another benefit is that the fetch will be much faster since the lookup will not have to be performed anymore, eg for all ACLs based on explicitly named stick-tables. However some buggy configurations which used to silently fail in the past will now refuse to load, hence the MAJOR tag.	2012-05-08 20:57:11 +02:00
Willy Tarreau	61612d49a7	MAJOR: acl: store the ACL argument types in the ACL keyword declaration The types and minimal number of ACL keyword arguments are now stored in their declaration. This will allow many more fantasies if some ACL use several arguments or types. Doing so required to rework all ACL keyword declarations to add two parameters. So this was a good opportunity for a general cleanup and to sort all entries in alphabetical order. We still have two pending issues : - parse_acl_expr() checks for errors but has no way to report them to the user ; - the types of some arguments are still not resolved and kept as strings (eg: ARGT_FE/BE/TAB) for compatibility reasons, which must be resolved in acl_find_targets()	2012-05-08 20:57:11 +02:00
Willy Tarreau	34db108423	MAJOR: acl: make use of the new argument parsing framework The ACL parser now uses the argument parser to build a typed argument list. Right now arguments are all strings and only one argument is supported since this is what ACLs currently support.	2012-05-08 20:57:11 +02:00
Willy Tarreau	45c0d98769	MEDIUM: http: http_send_name_header: remove references to msg and buffer They can be deduced from txn.	2012-05-08 12:28:12 +02:00
Willy Tarreau	62f791ea6f	MEDIUM: http: add a pointer to the buffer in http_msg ACLs and patterns only rely on a struct http_msg and don't know the pointer to the actual data. struct http_msg will soon only hold relative references so that's not possible. We need http_msg to hold a reference to the struct buffer before having relative pointers everywhere. It is likely that doing so will also result in opportunities to simplify a number of functions arguments. The following functions are already candidate : http_buffer_heavy_realign http_capture_bad_message http_change_connection_header http_forward_trailers http_header_add_tail http_header_add_tail2 http_msg_analyzer http_parse_chunk_size http_parse_connection_header http_remove_header2 http_send_name_header http_skip_chunk_crlf http_upgrade_v09_to_v10	2012-05-08 12:28:12 +02:00
Willy Tarreau	02d6cfc1d7	MAJOR: buffer: replace buf->l with buf->{o+i} We don't have buf->l anymore. We have buf->i for pending data and the total length is retrieved by adding buf->o. Some computation already become simpler. Despite extreme care, bugs are not excluded. It's worth noting that msg->err_pos as set by HTTP request/response analysers becomes relative to pending data and not to the beginning of the buffer. This has not been completed yet so differences might occur when outgoing data are left in the buffer.	2012-05-08 12:28:10 +02:00
Willy Tarreau	a36fc4d7ed	MEDIUM: move message-related flags from transaction to message Too many flags are stored in the transaction structure. Some flags are clearly message-specific and exist in two versions (request and response). Move them to a new "flags" field in the http_message struct instead.	2012-04-30 11:57:00 +02:00
Willy Tarreau	21337825c0	CLEANUP: remove a few warning about unchecked return values in debug code There were a few unchecked write() calls in the debug code that cause gcc 4.x to emit warnings on recent libc. We don't want to check them as we can't make anything from the result, let's simply surround them with an empty if statement. Note that one of the warnings was for chdir("/") which normally cannot fail since it follows a successful chroot (which means the perms are necessarily there). Anyway let's move the call uppe to protect it too.	2012-04-30 11:56:30 +02:00
Willy Tarreau	9b061e3320	MEDIUM: stream_sock: add a get_src and get_dst callback and remove SN_FRT_ADDR_SET These callbacks are used to retrieve the source and destination address of a socket. The address flags are not hold on the stream interface and not on the session anymore. The addresses are collected when needed. This still needs to be improved to store the IP and port separately so that it is not needed to perform a getsockname() when only the IP address is desired for outgoing traffic.	2012-04-07 18:03:52 +02:00
Willy Tarreau	4a5cadea40	MEDIUM: session: implement the "use-server" directive Sometimes it is desirable to forward a particular request to a specific server without having to declare a dedicated backend for this server. This can be achieved using the "use-server" rules. These rules are evaluated after the "redirect" rules and before evaluating cookies, and they have precedence on them. There may be as many "use-server" rules as desired. All of these rules are evaluated in their declaration order, and the first one which matches will assign the server.	2012-04-05 21:14:10 +02:00
Stathis Voukelatos	09a030a9a4	BUG/MINOR: fix typo in processing of http-send-name-header I downloaded version 1.4.19 this morning. While merging the code changes to a custom build that we have here for our project I noticed a typo in 'session.c', in the new code for inserting the server name in the HTTP header. The fix that I did is shown in the patch below. [WT: the bug is harmless, it is only suboptimal]	2012-01-09 14:27:13 +01:00
Mark Lamourine	c2247f0b8d	MEDIUM: http: add support for sending the server's name in the outgoing request New option "http-send-name-header" specifies the name of a header which will hold the server name in outgoing requests. This is the name of the server the connection is really sent to, which means that upon redispatches, the header's value is updated so that it always matches the server's name.	2012-01-05 15:17:31 +01:00
Willy Tarreau	a5e375646c	MEDIUM: acl: use temp_pattern to store any integer-type information All ACL fetches which return integer value now store the result into the temporary pattern struct. All ACL matches which rely on integer also get their value there. Note: the pattern data types are not set right now.	2011-12-30 17:33:26 +01:00
Willy Tarreau	34eb671f24	OPTIM/MINOR: move the hdr_idx pools out of the proxy struct It makes no sense to have one pointer to the hdr_idx pool in each proxy struct since these pools do not depend on the proxy. Let's have a common pool instead as it is already the case for other types.	2011-10-24 18:15:04 +02:00
Willy Tarreau	6471afb43d	MINOR: remove the client/server side distinction in SI addresses Stream interfaces used to distinguish between client and server addresses because they were previously of different types (sockaddr_storage for the client, sockaddr_in for the server). This is not the case anymore, and this distinction is confusing at best and has caused a number of regressions to be introduced in the process of converting everything to full-ipv6. We can now remove this and have a much cleaner code.	2011-09-23 10:54:59 +02:00
Willy Tarreau	a2a64e9689	[MEDIUM] session: make session_shutdown() an independant function We already had the ability to kill a connection, but it was only for the checks. Now we can do this for any session, and for this we add a specific flag "K" to the logs.	2011-09-07 23:01:56 +02:00
Willy Tarreau	3c63fd828a	[MEDIUM] don't limit peers nor stats socket to maxconn nor maxconnrate The peers and the stats socket are control sockets, they must not be limited by traffic rules.	2011-09-07 22:47:42 +02:00
Willy Tarreau	f73cd1198f	[MINOR] session-counters: add the ability to clear the counters Sometimes it can be useful to reset a counter : one condition increments it and another one resets it. It can be used to better detect abuses.	2011-08-13 01:45:16 +02:00
Willy Tarreau	b32907b6c7	[MINOR] sessions: only wake waiting listeners up if rate limit is OK Instead of waking a listener up then making it sleep, we only wake them up if we know their rate limit is fine. In the future we could improve on top of that by deciding to wake a proxy-specific task in XX milliseconds to take care of enabling the listeners again.	2011-07-25 08:37:44 +02:00
Willy Tarreau	07687c171e	[MEDIUM] listeners: queue proxy-bound listeners at the proxy's All listeners that are limited by a proxy-specific resource are now queued at the proxy's and not globally. This allows finer-grained wakeups when releasing resource.	2011-07-24 23:55:06 +02:00
Willy Tarreau	08ceb1012b	[MEDIUM] listeners: put listeners in queue upon resource shortage When an accept() fails because of a connection limit or a memory shortage, we now disable it and queue it so that it's dequeued only when a connection is released. This has improved the behaviour of the process near the fd limit as now a listener with a no connection (eg: stats) will not loop forever trying to get its connection accepted. The solution is still not 100% perfect, as we'd like to have this used when proxy limits are reached (use a per-proxy list) and for safety, we'd need to have dedicated tasks to periodically re-enable them (eg: to overcome temporary system-wide resource limitations when no connection is released).	2011-07-24 22:58:00 +02:00
Willy Tarreau	627937158f	[MINOR] listeners: add listen_full() to mark a listener full This is just a cleanup which removes calls to EV_FD_CLR() and state setting everywhere in the code.	2011-07-24 19:25:28 +02:00
Willy Tarreau	2b15492a75	[MINOR] session: try to emit a 500 response on memory allocation errors When we fail to create a session because of memory shortage, let's at least try to send a 500 message directly on the socket. Even if we don't have any buffers left, the kernel's orphans management will take care of delivering the message as long as there are socket buffers left.	2011-07-24 16:12:25 +02:00
Willy Tarreau	9bd0d744ef	[BUG] session: risk of crash on out of memory (1.5-dev regression) Patch af5149 introduced an issue which can be detected only on out of memory conditions : a LIST_DEL() may be performed on an uninitialized struct member instead of a LIST_INIT() during the accept() phase, causing crashes and memory corruption to occur. This issue was detected and diagnosed by the Exceliance R&D team. This is 1.5-specific and very recent, so no existing deployment should be impacted.	2011-07-20 00:22:54 +02:00
Simon Horman	fa46168c8f	[MINOR] Add non-stick server option Never add connections allocated to this sever to a stick-table. This may be used in conjunction with backup to ensure that stick-table persistence is disabled for backup servers.	2011-06-25 21:14:17 +02:00
Simon Horman	af51495397	[MINOR] Add active connection list to server The motivation for this is to allow iteration of all the connections of a server without the expense of iterating over the global list of connections. The first use of this will be to implement an option to close connections associated with a server when is is marked as being down or in maintenance mode.	2011-06-21 22:00:12 +02:00
Simon Horman	dec5be4ed4	[CLEANUP] session.c: Make functions static where possible	2011-06-18 20:27:19 +02:00
Willy Tarreau	96e312139a	[MEDIUM] http: add support for "http-no-delay" There are some very rare server-to-server applications that abuse the HTTP protocol and expect the payload phase to be highly interactive, with many interleaved data chunks in both directions within a single request. This is absolutely not supported by the HTTP specification and will not work across most proxies or servers. When such applications attempt to do this through haproxy, it works but they will experience high delays due to the network optimizations which favor performance by instructing the system to wait for enough data to be available in order to only send full packets. Typical delays are around 200 ms per round trip. Note that this only happens with abnormal uses. Normal uses such as CONNECT requests nor WebSockets are not affected. When "option http-no-delay" is present in either the frontend or the backend used by a connection, all such optimizations will be disabled in order to make the exchanges as fast as possible. Of course this offers no guarantee on the functionality, as it may break at any other place. But if it works via HAProxy, it will work as fast as possible. This option should never be used by default, and should never be used at all unless such a buggy application is discovered. The impact of using this option is an increase of bandwidth usage and CPU usage, which may significantly lower performance in high latency environments. This change should be backported to 1.4 since the first report of such a misuse was in 1.4. Next patch will also be needed.	2011-05-30 18:42:41 +02:00
David du Colombier	4f92d32004	[MEDIUM] IPv6 support for stick-tables Since IPv6 is a different type than IPv4, the pattern fetch functions src6 and dst6 were added. IPv6 stick-tables can also fetch IPv4 addresses with src and dst. In this case, the IPv4 addresses are mapped to their IPv6 counterpart, according to RFC 4291.	2011-03-29 01:09:14 +02:00
Willy Tarreau	c735a0728e	[MINOR] acl: add support for table_cnt and table_avl matches Those trivial matches respectively return the number of entries used in a stick-table and the number of entries still available in a table.	2011-03-29 00:57:02 +02:00
Willy Tarreau	0b3a411543	[BUG] session: conn_retries was not always initialized Johannes Smith reported some wrong retries count in logs associated with bad requests. The cause was that the conn_retries field in the stream interface was only initialized when attempting to connect, but is used when logging, possibly with an uninitialized value holding last connection's conn_retries. This could have been avoided by making use of a stream interface initializer. This bug is 1.5-specific.	2011-03-27 19:16:56 +02:00
Willy Tarreau	1b6e608c11	[BUG] session: src_conn_cur was returning src_conn_cnt instead Issue reported by Cory Forsyth and diagnosed by Cyril Bont�. Just a plain stupid copy-paste of the wrong fetch function call.	2011-03-16 06:56:57 +01:00
Willy Tarreau	7d0aaf39d1	[MEDIUM] stats: split frontend and backend stats It's very annoying that frontend and backend stats are merged because we don't know what we're observing. For instance, if a "listen" instance makes use of a distinct backend, it's impossible to know what the bytes_out means. Some points take care of not updating counters twice if the backend points to the frontend, indicating a "listen" instance. The thing becomes more complex when we try to add support for server side keep-alive, because we have to maintain a pointer to the backend used for last request, and to update its stats. But we can't perform such comparisons anymore because the counters will not match anymore. So in order to get rid of this situation, let's have both frontend AND backend stats in the "struct proxy". We simply update the relevant ones during activity. Some of them are only accounted for in the backend, while others are just for frontend. Maybe we can improve a bit on that later, but the essential part is that those counters now reflect what they really mean.	2011-03-13 22:00:23 +01:00
Willy Tarreau	827aee913f	[MAJOR] session: remove the ->srv pointer from struct session This one has been removed and is now totally superseded by ->target. To get the server, one must use target_srv(&s->target) instead of s->srv now. The function ensures that non-server targets still return NULL.	2011-03-10 23:32:17 +01:00
Willy Tarreau	9e000c6ec8	[CLEANUP] stream_interface: use inline functions to manipulate targets The connection target involves a type and a union of pointers, let's make the code cleaner using simple wrappers.	2011-03-10 23:32:17 +01:00
Willy Tarreau	3d80d911aa	[MEDIUM] session: remove s->prev_srv which is not needed anymore s->prev_srv is used by assign_server() only, but all code paths leading to it now take s->prev_srv from the existing s->srv. So assign_server() can do that copy into its own stack. If at one point a different srv is needed, we still have a copy of the last server on which we failed a connection attempt in s->target.	2011-03-10 23:32:16 +01:00
Willy Tarreau	664beb8610	[MINOR] session: add a pointer to the new target into the session When dealing with HTTP keep-alive, we'll have to know if we can reuse an existing connection. For that, we'll have to check if the current connection was made on the exact same target (referenced in the stream interface). Thus, we need to first assign the next target to the session, then copy it to the stream interface upon connect(). Later we'll check for equivalence between those two operations.	2011-03-10 23:32:16 +01:00
Willy Tarreau	7c0a151a2e	[CLEANUP] stream_interface: remove the applet.handler pointer Now that we have the target pointer and type in the stream interface, we don't need the applet.handler pointer anymore. That makes the code somewhat cleaner because we know we're dealing with an applet by checking its type instead of checking the pointer is not null.	2011-03-10 23:32:15 +01:00
Willy Tarreau	ac82540c35	[MEDIUM] stream_interface: store the target pointer and type When doing a connect() on a stream interface, some information is needed from the server and from the backend. In some situations, we don't have a server and only a backend (eg: peers). In other cases, we know we have an applet and we don't want to connect to anything, but we'd still like to have the info about the applet being used. For this, we now store a pointer to the "target" into the stream interface. The target describes what's on the other side before trying to connect. It can be a server, a proxy or an applet for now. Later we'll probably have descriptors for multiple-stage chains so that the final information may still be found. This will help removing many specific cases in the code. It already made it possible to remove the "srv" and "be" parameters to tcpv4_connect_server().	2011-03-10 23:32:15 +01:00
Willy Tarreau	957c0a5845	[REORG] session: move client and server address to the stream interface This will be needed very soon for the keep-alive.	2011-03-10 23:32:14 +01:00
Willy Tarreau	b24281b0ff	[MINOR] stream_interface: make use of an applet descriptor for IO handlers I/O handlers are still delicate to manipulate. They have no type, they're just raw functions which have no knowledge of themselves. Let's have them declared as applets once for all. That way we can have multiple applets share the same handler functions and we can store their names there. When we later need to add more parameters (eg: usage stats), we'll be able to do so in the applets themselves. The CLI functions has been prefixed with "cli" instead of "stats" as it's clearly what is going on there. The applet descriptor in the stream interface should get all the applet specific data (st0, ...) but this will be done in the next patch so that we don't pollute this one too much.	2011-03-10 23:32:14 +01:00
Willy Tarreau	b89cfca494	[BUG] session: release slot before processing pending connections When a connection error is encountered on a server and the server's connection pool is full, pending connections are not woken up because the current connection is still accounted for on the server, so it still appears full. This becomes visible on a server which has "maxconn 1" because the pending connections will only be able to expire in the queue. Now we take care of releasing our current connection before trying to offer it to another pending request, so that the server can accept a next connection. This patch should be backported to 1.4.	2010-12-29 14:38:29 +01:00
Willy Tarreau	0499e3575c	[BUG] http: analyser optimizations broke pipelining HTTP pipelining currently needs to monitor the response buffer to wait for some free space to be able to send a response. It was not possible for the HTTP analyser to be called based on response buffer activity. Now we introduce a new buffer flag BF_WAKE_ONCE which is set when the HTTP request analyser is set on the response buffer and some activity is detected. This is not clean at all but once of the only ways to fix the issue before we make it possible to register events for analysers. Also it appeared that one realign condition did not cover all cases.	2010-12-17 07:15:57 +01:00
Willy Tarreau	2f976e18b8	[OPTIM] session: don't recheck analysers when buffer flags have not changed Analysers were re-evaluated when some flags were still present in the buffers, even if they had not changed since previous pass, resulting in a waste of CPU cycles. Ensuring that the flags have changed has saved some useless calls : function min calls per session (before -> after) http_request_forward_body 5 -> 4 http_response_forward_body 3 -> 2 http_sync_req_state 10 -> 8 http_sync_res_state 8 -> 6 http_resync_states 8 -> 6	2010-11-11 14:28:47 +01:00
Willy Tarreau	abe8ea5c1d	[BUG] accept: don't close twice upon error The stream_sock's accept() used to close the FD upon error, but this was also sometimes performed by the frontend's accept() called via the session's accept(). Those interlaced calls were also responsible for the spaghetti-looking error unrolling code in session.c and stream_sock.c. Now the frontend must not close the FD anymore, the session is responsible for that. It also takes care of just closing the FD or also removing from the FD lists, depending on its state. The socket-level accept() does not have to care about that anymore.	2010-11-11 11:05:20 +01:00
Willy Tarreau	fffe1325df	[CLEANUP] accept: replace some inappropriate Alert() calls with send_log() Some Alert() messages were remaining in the accept() path, which they would have no chance to be detected. Remove some of them (the impossible ones) and replace the relevant ones with send_log() so that the admin has a chance to catch them.	2010-11-11 09:51:38 +01:00
Emeric Brun	85e77c7f0d	[MEDIUM] Create updates tree on stick table to manage sync.	2010-11-11 09:29:08 +01:00
Emeric Brun	485479d8e9	[MEDIUM] Create new protected pattern types CONSTSTRING and CONSTDATA to force memcpy if data from protected areas need to be manipulated. Enhance pattern convs and fetch argument parsing, now fetchs and convs callbacks used typed args. Add more details on error messages on parsing pattern expression function. Update existing pattern convs and fetchs to new proto. Create stick table key type "binary". Manage Truncation and padding if pattern's fetch-converted result don't match table key size.	2010-11-11 09:29:07 +01:00
Emeric Brun	97679e7901	[MEDIUM] Implement tcp inspect response rules	2010-11-11 09:28:18 +01:00
Willy Tarreau	da4d9fe5a4	[BUG] session: don't stop forwarding of data upon last packet If a read shutdown is encountered on the first packet of a connection right after the data and the last analyser is unplugged at the same time, then that last data chunk may never be forwarded. In practice, right now it cannot happen on requests due to the way they're scheduled, nor can it happen on responses due to the way their analysers work. But this behaviour has been observed with new response analysers being developped. The reason is that when the read shutdown is encountered and an analyser is present, data cannot be forwarded but the BF_SHUTW_NOW flag is set. After that, the analyser gets called and unplugs itself, hoping that process_session() will automatically forward the data. This does not happen due to BF_SHUTW_NOW. Simply removing the test on this flag is not enough because then aborted requests still get forwarded, due to the forwarding code undoing the abort. The solution here consists in checking BF_SHUTR_NOW instead of BF_SHUTW_NOW. BF_SHUTR_NOW is only set on aborts and remains set until ->shutr() is called. This is enough to catch recent aborts but not prevent forwarding in other cases. Maybe a new special buffer flag "BF_ABORT" might be desirable in the future. This patch does not need to be backported because older versions don't have the analyser which make the problem appear.	2010-11-11 09:26:29 +01:00
Willy Tarreau	3041b9fcc3	[MEDIUM] session: call the frontend_decode_proxy analyser on proxied connections This analyser must absolutely be the earliest one to process contents, given the nature of the protocol.	2010-10-30 19:04:38 +02:00
Willy Tarreau	af7ad00a99	[MINOR] support a global jobs counter This counter is incremented for each incoming connection and each active listener, and is used to prevent haproxy from stopping upon SIGUSR1. It will thus be possible for some tasks in increment this counter in order to prevent haproxy from dying until they have completed their job.	2010-08-31 15:39:26 +02:00
Willy Tarreau	56123282ef	[MINOR] session-counters: use "track-sc{1,2}" instead of "track-{fe,be}-counters" The assumption that there was a 1:1 relation between tracked counters and the frontend/backend role was wrong. It is perfectly possible to track the track-fe-counters from the backend and the track-be-counters from the frontend. Thus, in order to reduce confusion, let's remove this useless {fe,be} reference and simply use {1,2} instead. The keywords have also been renamed in order to limit confusion. The ACL rule action now becomes "track-sc{1,2}". The ACLs are now "sc{1,2}_" instead of "trk{fe,be}_". That means that we can reasonably document "sc1" and "sc2" (sticky counters 1 and 2) as sort of patterns that are available during the whole session's life and use them just like any other pattern.	2010-08-10 18:04:15 +02:00
Willy Tarreau	9e9879a263	[MEDIUM] session-counters: make it possible to count connections from frontend In case a "track-be-counters" rule is referenced in the frontend, count it so that the connection counts are correct.	2010-08-10 18:04:15 +02:00
Willy Tarreau	f059a0f63a	[MAJOR] session-counters: split FE and BE track counters Having a single tracking pointer for both frontend and backend counters does not work. Instead let's have one for each. The keyword has changed to "track-be-counters" and "track-fe-counters", and the ACL "trk_" changed to "trkfe_" and "trkbe_*".	2010-08-10 18:04:15 +02:00
Willy Tarreau	da7ff64aa9	[MEDIUM] session-counters: add HTTP req/err tracking This patch adds support for the following session counters : - http_req_cnt : HTTP request count - http_req_rate: HTTP request rate - http_err_cnt : HTTP request error count - http_err_rate: HTTP request error rate The equivalent ACLs have been added to check the tracked counters for the current session or the counters of the current source.	2010-08-10 18:04:14 +02:00
Willy Tarreau	c3bd972cda	[MINOR] session-counters: add a general purpose counter (gpc0) This counter may be used to track anything. Two sets of ACLs are available to manage it, one gets its value, and the other one increments its value and returns it. In the second case, the entry is created if it did not exist. Thus it is possible for example to mark a source as being an abuser and to keep it marked as long as it does not wait for the entry to expire : # The rules below use gpc0 to track abusers, and reject them if # a source has been marked as such. The track-counters statement # automatically refreshes the entry which will not expire until a # 1-minute silence is respected from the source. The second rule # evaluates the second part if the first one is true, so GPC0 will # be increased once the conn_rate is above 100/5s. stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request track-counters src tcp-request reject if { trk_get_gpc0 gt 0 } tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} Alternatively, it is possible to let the entry expire even in presence of traffic by swapping the check for gpc0 and the track-counters statement : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} It is also possible not to track counters at all, but entry lookups will then be performed more often : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request reject if { src_conn_rate gt 100 } { src_inc_gpc0 gt 0} The '0' at the end of the counter name is there because if we find that more counters may be useful, other ones will be added.	2010-08-10 18:04:14 +02:00
Willy Tarreau	1f7e925d6a	[MINOR] stktable: add a stktable_update_key() function This function looks up a key, updates its expiration date, or creates it if it was not found. acl_fetch_src_updt_conn_cnt() was updated to make use of it.	2010-08-10 18:04:14 +02:00
Willy Tarreau	6c59e0a942	[MEDIUM] session counters: add bytes_in_rate and bytes_out_rate counters These counters maintain incoming and outgoing byte rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain bandwidth limits per source address for instance, and block if the rate is passed over. Since 32-bit counters are used to compute the rates, it is important not to use too long periods so that we don't have to deal with rates above 4 GB per period. Example : # block if more than 5 Megs retrieved in 30 seconds from a source. stick-table type ip size 200k expire 1m store bytes_out_rate(30s) tcp-request track-counters src tcp-request reject if { trk_bytes_out_rate gt 5000000 } # cause a 15 seconds pause to requests from sources in excess of 2 megs/30s tcp-request inspect-delay 15s tcp-request content accept if { trk_bytes_out_rate gt 2000000 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	91c43d7fe4	[MEDIUM] session counters: add conn_rate and sess_rate counters These counters maintain incoming connection rates and session rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain accept rate per source address for instance, and block if the rate is passed over. Example : # block if more than 50 requests per 5 seconds from a source. stick-table type ip size 200k expire 1m store conn_rate(5s),sess_rate(5s) tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 50 } # cause a 3 seconds pause to requests from sources in excess of 20 requests/5s tcp-request inspect-delay 3s tcp-request content accept if { trk_sess_rate gt 20 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	f4d17d9071	[MEDIUM] session: add a counter on the cumulated number of sessions Sessions are like connections but they have been accepted by L4 rules and really became sessions.	2010-08-10 18:04:13 +02:00
Willy Tarreau	1aa006fe7a	[MINOR] session: add trk_kbytes_* ACL keywords to track data size These one apply to the entry being tracked by current session.	2010-08-10 18:04:13 +02:00

... 2 3 4 5 6 ...

502 Commits