haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-10-11 07:31:30 +02:00

Author	SHA1	Message	Date
Willy Tarreau	c3bd972cda	[MINOR] session-counters: add a general purpose counter (gpc0) This counter may be used to track anything. Two sets of ACLs are available to manage it, one gets its value, and the other one increments its value and returns it. In the second case, the entry is created if it did not exist. Thus it is possible for example to mark a source as being an abuser and to keep it marked as long as it does not wait for the entry to expire : # The rules below use gpc0 to track abusers, and reject them if # a source has been marked as such. The track-counters statement # automatically refreshes the entry which will not expire until a # 1-minute silence is respected from the source. The second rule # evaluates the second part if the first one is true, so GPC0 will # be increased once the conn_rate is above 100/5s. stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request track-counters src tcp-request reject if { trk_get_gpc0 gt 0 } tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} Alternatively, it is possible to let the entry expire even in presence of traffic by swapping the check for gpc0 and the track-counters statement : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 100 } { trk_inc_gpc0 gt 0} It is also possible not to track counters at all, but entry lookups will then be performed more often : stick-table type ip size 200k expire 1m store conn_rate(5s),gpc0 tcp-request reject if { src_get_gpc0 gt 0 } tcp-request reject if { src_conn_rate gt 100 } { src_inc_gpc0 gt 0} The '0' at the end of the counter name is there because if we find that more counters may be useful, other ones will be added.	2010-08-10 18:04:14 +02:00
Willy Tarreau	1f7e925d6a	[MINOR] stktable: add a stktable_update_key() function This function looks up a key, updates its expiration date, or creates it if it was not found. acl_fetch_src_updt_conn_cnt() was updated to make use of it.	2010-08-10 18:04:14 +02:00
Willy Tarreau	6c59e0a942	[MEDIUM] session counters: add bytes_in_rate and bytes_out_rate counters These counters maintain incoming and outgoing byte rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain bandwidth limits per source address for instance, and block if the rate is passed over. Since 32-bit counters are used to compute the rates, it is important not to use too long periods so that we don't have to deal with rates above 4 GB per period. Example : # block if more than 5 Megs retrieved in 30 seconds from a source. stick-table type ip size 200k expire 1m store bytes_out_rate(30s) tcp-request track-counters src tcp-request reject if { trk_bytes_out_rate gt 5000000 } # cause a 15 seconds pause to requests from sources in excess of 2 megs/30s tcp-request inspect-delay 15s tcp-request content accept if { trk_bytes_out_rate gt 2000000 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	91c43d7fe4	[MEDIUM] session counters: add conn_rate and sess_rate counters These counters maintain incoming connection rates and session rates in a stick-table, over a period which is defined in the configuration (2 ms to 24 days). They can be used to detect service abuse and enforce a certain accept rate per source address for instance, and block if the rate is passed over. Example : # block if more than 50 requests per 5 seconds from a source. stick-table type ip size 200k expire 1m store conn_rate(5s),sess_rate(5s) tcp-request track-counters src tcp-request reject if { trk_conn_rate gt 50 } # cause a 3 seconds pause to requests from sources in excess of 20 requests/5s tcp-request inspect-delay 3s tcp-request content accept if { trk_sess_rate gt 20 } WAIT_END	2010-08-10 18:04:13 +02:00
Willy Tarreau	f4d17d9071	[MEDIUM] session: add a counter on the cumulated number of sessions Sessions are like connections but they have been accepted by L4 rules and really became sessions.	2010-08-10 18:04:13 +02:00
Willy Tarreau	1aa006fe7a	[MINOR] session: add trk_kbytes_* ACL keywords to track data size These one apply to the entry being tracked by current session.	2010-08-10 18:04:13 +02:00
Willy Tarreau	9b0ddcfd84	[MINOR] session: add the trk_conn_cur ACL keyword to track concurrent connection This one applies to the entry being tracked by current session.	2010-08-10 18:04:13 +02:00
Willy Tarreau	9a3f849371	[MINOR] session: add the trk_conn_cnt ACL keyword to track connection counts Most of the time we'll want to check the connection count of the criterion we're currently tracking. So instead of duplicating the src* tests, let's add trk_conn_cnt to report the total number of connections from the stick table entry currently being tracked. A nice part of the code was factored, and we should do the same for the other criteria.	2010-08-10 18:04:12 +02:00
Willy Tarreau	855e4bbcc7	[MEDIUM] session: add data in and out volume counters The new "bytes_in_cnt" and "bytes_out_cnt" session counters have been added. They're automatically updated when session counters are updated. They can be matched with the "src_kbytes_in" and "src_kbytes_out" ACLs which apply to the volume per source address. This can be used to deny access to service abusers.	2010-08-10 18:04:12 +02:00
Willy Tarreau	38285c18f4	[MEDIUM] session: add concurrent connections counter The new "conn_cur" session counter has been added. It is automatically updated upon "track XXX" directives, and the entry is touched at the moment we increment the value so that we don't consider further counter updates as real updates, otherwise we would end up updating upon completion, which may not be desired. Probably that some other event counters (eg: HTTP requests) will have to be updated upon each event though. This counter can be matched against current session's source address using the "src_conn_cur" ACL.	2010-08-10 18:04:12 +02:00
Willy Tarreau	8b22a71a4d	[MEDIUM] session: move counter ACL fetches from proto_tcp It was not normal to have counter fetches in proto_tcp.c. The only reason was that the key based on the source address was fetched there, but now we have split the key extraction and data processing, we must move that to a more appropriate place. Session seems OK since the counters are all manipulated from here. Also, since we're precisely counting number of connections with these ACLs, we rename them src_conn_cnt and src_updt_conn_cnt. This is not a problem right now since no version was emitted with these keywords.	2010-08-10 18:04:12 +02:00
Willy Tarreau	9ba2dcc86c	[MAJOR] session: add track-counters to track counters related to the session This patch adds the ability to set a pointer in the session to an entry in a stick table which holds various counters related to a specific pattern. Right now the syntax matches the target syntax and only the "src" pattern can be specified, to track counters related to the session's IPv4 source address. There is a special function to extract it and convert it to a key. But the goal is to be able to later support as many patterns as for the stick rules, and get rid of the specific function. The "track-counters" directive may only be set in a "tcp-request" statement right now. Only the first one applies. Probably that later we'll support multi-criteria tracking for a single session and that we'll have to name tracking pointers. No counter is updated right now, only the refcount is. Some subsequent patches will have to bring that feature.	2010-08-10 18:04:12 +02:00
Willy Tarreau	fb35620e87	[MEDIUM] session: support "tcp-request content" rules in backends Sometimes it's necessary to be able to perform some "layer 6" analysis in the backend. TCP request rules were not available till now, although documented in the diagram. Enable them in backend now.	2010-08-10 14:10:58 +02:00
Willy Tarreau	815a9b2039	[BUG] session: analysers must be checked when SI state changes Since the BF_READ_ATTACHED bug was fixed, a new issue surfaced. When a connection closes on the return path in tunnel mode while the request input is already closed, the request analyser which is waiting for a state change never gets woken up so it never closes the request output. This causes stuck sessions to remain indefinitely. One way to reliably reproduce the issue is the following (note that the client expects a keep-alive but not the server) : server: printf "HTTP/1.0 303\r\n\r\n" \| nc -lp8080 client: printf "GET / HTTP/1.1\r\n\r\n" \| nc 127.1 2500 The reason for the issue is that we don't wake the analysers up on stream interface state changes. So the least intrusive and most reliable thing to do is to consider stream interface state changes to call the analysers. We just need to remember what state each series of analysers have seen and check for the differences. In practice, that works. A later improvement later could consist in being able to let analysers state what they're interested to monitor : - left SI's state - right SI's state - request buffer flags - response buffer flags That could help having only one set of analysers and call them once status changes.	2010-08-10 14:04:28 +02:00
Willy Tarreau	7a20aa6e6b	[MEDIUM] session: make it possible to call an I/O handler on both SI This will be used when an I/O handler running in a stream interface needs to establish a connection somewhere. We want the session processor to evaluate both I/O handlers, depending on which side has one. Doing so also requires that stream_int_update_embedded() wakes the session up only when the other side is established or has closed, for instance in order to handle connection errors without looping indefinitely during the connection setup time. The session processor still relies on BF_READ_ATTACHED being set, though we must do whatever is required to remove this dependency.	2010-07-13 16:34:26 +02:00
Willy Tarreau	0bd05eaf24	[MEDIUM] stream-interface: add a ->release callback When a connection is closed on a stream interface, some iohandlers will need to be informed in order to release some resources. This normally happens upon a shutr+shutw. It is the equivalent of the fd_delete() call which is done for real sockets, except that this time we release internal resources. It can also be used with real sockets because it does not cost anything else and might one day be useful.	2010-07-13 16:06:23 +02:00
Willy Tarreau	e8f6338c5d	[BUG] stick-table: correctly refresh expiration timers The store operation did not correctly refresh the expiration timer on the stick entry. It did so on the temporary one instead.	2010-07-13 15:20:24 +02:00
Willy Tarreau	2a164ee549	[BUG] stick_table: the fix for the memory leak caused a regression (cherry picked from commit 61ba936e6858dfcf9964d25870726621d8188fb9) [ note: the bug was finally not present in 1.5-dev but at least we have to reset store_count to be compatible with 1.4 ] Commit d6e9e3b5e320b957e6c491bd92d91afad30ba638 caused recently created entries to be removed as soon as they were created, breaking stickiness. It is not clear whether a use-after-free was possible or not in this case. This bug was reported by Ben Congleton and narrowed down by Herv� Commowick, both of whom also tested the fix. Thanks to them !	2010-06-18 09:57:45 +02:00
Willy Tarreau	5214be1b22	[MINOR] session: add a pointer to the tracked counters for the source We'll have to keep counters of various criteria specific to the session's source. When we get one, keep a pointer to it in the session.	2010-06-14 15:32:18 +02:00
Willy Tarreau	cb18364ca7	[MEDIUM] stick_table: separate storage and update of session entries When an entry already exists, we just need to update its expiration timer. Let's have a dedicated function for that instead of spreading open code everywhere. This change also ensures that an update of an existing sticky session really leads to an update of its expiration timer, which was apparently not the case till now. This point needs to be checked in 1.4.	2010-06-14 15:10:26 +02:00
Willy Tarreau	13c29dee21	[MEDIUM] stick_table: move the server ID to a generic data type The server ID is now stored just as any other data type. It is only allocated if needed and is manipulated just like the other ones.	2010-06-14 15:10:25 +02:00
Willy Tarreau	f16d2b8c1b	[MEDIUM] stick_table: don't overwrite data when storing an entry Till now sticky sessions only held server IDs. Now there are other data types so it is not acceptable anymore to overwrite the server ID when writing something. The server ID must then only be written from the caller when appropriate. Doing this has also led to separate lookup and storage.	2010-06-14 15:10:24 +02:00
Willy Tarreau	f0b38bfc33	[CLEANUP] stick_table: move pattern to key functions to stick_table.c pattern.c depended on stick_table while in fact it should be the opposite. So we move from pattern.c everything related to stick_tables and invert the dependency. That way the code becomes more logical and intuitive.	2010-06-14 15:10:24 +02:00
Willy Tarreau	393379c3e0	[MINOR] stick_table: add support for variable-sized data Right now we're only able to store a server ID in a sticky session. The goal is to be able to store anything whose size is known at startup time. For this, we store the extra data before the stksess pointer, using a negative offset. It will then be easy to cumulate multiple data provided they each have their own offset.	2010-06-14 15:10:23 +02:00
Willy Tarreau	24dcaf3450	[MEDIUM] frontend: count the incoming connection earlier The frontend's connection was accounted for once the session was instanciated. This was problematic because the early ACLs weren't able to correctly account for the number of concurrent connections. Now we count the connection once it is assigned to the frontend. It also brings the nice advantage of being more symmetrical, because the stream_sock's accept() does not have to account for that anymore, only the session's accept() does.	2010-06-14 10:53:19 +02:00
Willy Tarreau	b36b4244a2	[MINOR] session: differenciate between accepted connections and received connections Now we're able to reject connections very early, so we need to use a different counter for the connections that are received and the ones that are accepted and converted into sessions, so that the rate limits can still apply to the accepted ones. The session rate must still be used to compute the rate limit, so that we can reject undesired traffic without affecting the rate.	2010-06-14 10:53:19 +02:00
Willy Tarreau	81f9aa3bf2	[MAJOR] frontend: split accept() into frontend_accept() and session_accept() A new function session_accept() is now called from the lower layer to instanciate a new session. Once the session is instanciated, the upper layer's frontent_accept() is called. This one can be service-dependant. That way, we have a 3-phase accept() sequence : 1) protocol-specific, session-less accept(), which is pointed to by the listener. It defaults to the generic stream_sock_accept(). 2) session_accept() which relies on a frontend but not necessarily for use in a proxy (eg: stats or any future service). 3) frontend_accept() which performs the accept for the service offerred by the frontend. It defaults to frontend_accept() which is really what is used by a proxy. The TCP/HTTP proxies have been moved to this mode so that we can now rely on frontend_accept() for any type of session initialization relying on a frontend. The next step will be to convert the stats to use the same system for the stats.	2010-06-14 10:53:17 +02:00
Willy Tarreau	070ceb6cfb	[MEDIUM] session: don't assign conn_retries upon accept() anymore The conn_retries attribute is now assigned when switching from SI_ST_INI to SI_ST_REQ. This eliminates one of the last dependencies on the backend in the frontend's accept() function.	2010-06-14 10:53:16 +02:00
Willy Tarreau	ee28de0a12	[MEDIUM] session: move the conn_retries attribute to the stream interface The conn_retries still lies in the session and its initialization depends on the backend when it may not yet be known. Let's first move it to the stream interface.	2010-06-14 10:53:16 +02:00
Willy Tarreau	d04e858db0	[MEDIUM] session: initialize server-side timeouts after connect() It was particularly embarrassing that the server timeout was assigned to buffers during an accept() just to be potentially changed later in case of a use_backend rule. The frontend side has nothing to do with server timeouts. Now we initialize them right after the connect() succeeds. Later this should change for a unique stream-interface timeout setting only.	2010-06-14 10:53:14 +02:00
Willy Tarreau	85e7d00a70	[MEDIUM] session: finish session establishment sequence in with I/O handlers Calling sess_establish() upon a successful connect() was essential, but it was not clearly stated whether it was necessary for an access to an I/O handler or not. While it would be desired, having it automatically add the response analyzers is quite a problem, and it breaks HTTP stats. The solution is thus not to call it for now and to perform the few response initializations as needed. For the long term, we need to find a way to specify the analyzers to install during a stream_int_register_handler() if any.	2010-06-14 10:53:14 +02:00
Willy Tarreau	a4cda67323	[BUG] stick_table: fix possible memory leak in case of connection error If a "stick store-request" rule is present, an entry is preallocated during the request. However, if there is no response due to an error or to a redir mode server, we never release it.	2010-06-14 10:49:24 +02:00
Willy Tarreau	a6eebb372d	[BUG] session: clear BF_READ_ATTACHED before next I/O The BF_READ_ATTACHED flag was created to wake analysers once after a connection was established. It turns out that this flag is never cleared once set, so even if there is no event, some analysers are still evaluated for no reason. The bug was introduced with commit ea38854d34675d5472319c453b7027af42fe8aab. It may cause slightly increased CPU usages during data transfers, maybe even quite noticeable once when transferring transfer-encoded data, due to the fact that the request analysers are being checked for every chunk. This fix must be backported in 1.4 after all non-reg tests have been completed.	2010-06-04 14:49:52 +02:00
Cyril Bont�	47fdd8e993	[MINOR] add the "ignore-persist" option to conditionally ignore persistence This is used to disable persistence depending on some conditions (for example using an ACL matching static files or a specific User-Agent). You can see it as a complement to "force-persist". In the configuration file, the force-persist/ignore-persist declaration order define the rules priority. Used with the "appsesion" keyword, it can also help reducing memory usage, as the session won't be hashed the persistence is ignored.	2010-04-25 22:37:14 +02:00
Willy Tarreau	e45997661b	[MEDIUM] session: better fix for connection to servers with closed input The following patch fixed an issue but brought another one : 296897 [MEDIUM] connect to servers even when the input has already been closed The new issue is that when a connection is inspected and aborted using TCP inspect rules, now it is sent to the server before being closed. So that test is not satisfying. A probably better way is not to prevent a connection from establishing if only BF_SHUTW_NOW is set but BF_SHUTW is not. That way, the BF_SHUTW flag is not set if the request has any data pending, which still fixes the stats issue, but does not let any empty connection pass through. Also, as a safety measure, we extend buffer_abort() to automatically disable the BF_AUTO_CONNECT flag. While it appears to always be OK, it is by pure luck, so better safe than sorry.	2010-03-21 23:31:42 +01:00
Willy Tarreau	296897f2c6	[MEDIUM] connect to servers even when the input has already been closed The BF_AUTO_CLOSE flag prevented a connection from establishing on a server if the other side's input channel was already closed. This is wrong because there may be pending data to be sent. This was causing an issue with stats, as noticed and reported by Cyril Bont�. Since the stats are now handled as a server, sometimes concurrent accesses were causing one of the connections to send the shutdown(write) before the connection to the stats function was established, which aborted it early. This fix causes the BF_AUTO_CLOSE flag to be checked only when the connection on the outgoing stream interface has reached an established state. That way we're still able to connect, send the request then close.	2010-03-14 19:21:34 +01:00
Willy Tarreau	15e5554467	[CLEANUP] session: remove duplicate test This duplicate test should have been removed with the loop rework but was forgotten. It was harmless, but disassembly shows that it prevents gcc from correctly optimizing the loop.	2010-03-05 10:12:01 +01:00
Willy Tarreau	ae52678444	[STATS] count transfer aborts caused by client and by server Often we need to understand why some transfers were aborted or what constitutes server response errors. With those two counters, it is now possible to detect an unexpected transfer abort during a data phase (eg: too short HTTP response), and to know what part of the server response errors may in fact be assigned to aborted transfers.	2010-03-04 20:34:23 +01:00
Willy Tarreau	033b2dbeb3	[BUG] logs: don't report "last data" when we have just closed after an error Some people have reported seeing "SL" flags in their logs quite often while this should never happen. The reason was that then a server error is detected, we close the connection to that server and when we decide what state we were in, we see the connection is closed, and deduce it was the last data transfer, which is wrong. We should report DATA if the previous state was an established state, which this patch does. Now logs correctly report "SD" and not "SL" when a server resets a connection before the end of the transfer.	2010-03-04 18:45:47 +01:00
Willy Tarreau	2465779459	[STATS] separate frontend and backend HTTP stats It is wrong to merge FE and BE stats for a proxy because when we consult a BE's stats, it reflects the FE's stats eventhough the BE has received no traffic. The most common example happens with listen instances, where the backend gets credited for all the trafic even when a use_backend rule makes use of another backend.	2010-02-26 10:30:28 +01:00
Willy Tarreau	2e2b3eb65a	[BUILD] fix build breakage with DEBUG_FULL Paul Hirose reported a build error when DEBUG_FULL is set.	2010-02-09 20:55:44 +01:00
Krzysztof Piotr Oledzki	f9423ae43a	[MINOR] acl: add http_auth and http_auth_group Add two acls to match http auth data: acl <name> http_auth(userlist) acl <name> http_auth_hroup(userlist) group1 group2 (...)	2010-01-31 19:14:09 +01:00
Willy Tarreau	4de9149f87	[MINOR] add the "force-persist" statement to force persistence on down servers This is used to force access to down servers for some requests. This is useful when validating that a change on a server correctly works before enabling the server again.	2010-01-22 19:10:05 +01:00
Emeric Brun	1d33b2965e	[MEDIUM] Add stick and store rules analysers.	2010-01-12 16:01:24 +01:00
Willy Tarreau	762a23618e	[BUG] appsession's sessid must be reset at end of transaction If we don't do that, we may corrupt the pools in keep-alive sessions.	2010-01-09 13:57:26 +01:00
Willy Tarreau	e34070e1be	[MEDIUM] session: limit the number of analyser loops The initial code's intention was to loop on the analysers as long as an analyser is added by another one. [This code was wrong due to the while(0) which breaks even on a continue statement, but the initial intention must be changed too]. In fact we should limit the number of times we loop on analysers in order to limit latency. Using maxpollevents as a limit makes sense since this tunable is used for the exact same purposes. We may add another tunable later if that ever makes sense, so it's very unlikely.	2010-01-08 00:36:57 +01:00
Willy Tarreau	4602363f6a	[BUG] http: fix for capture memory leak was incorrect That patch was incorrect because under some circumstances, the capture memory could be freed by session_free() and then again by http_end_txn(), causing a double free and an eventual segfault. The pool use count was also reported wrong due to this bug. The cleanup code was removed from session_free() to remain only in http_end_txn().	2010-01-07 22:51:47 +01:00
Willy Tarreau	90deb18916	[MEDIUM] http: make safer use of the DONT_READ and AUTO_CLOSE flags Several HTTP analysers used to set those flags to values that were useful but without considering the possibility that they were not called again to clean what they did. First, replace direct flag manipulation with more explicit macros. Second, enforce a rule stating that any buffer which changes one of these flags from the default must restore it after completion, so that other analysers see correct flags. With both this fix and the previous one about analyser bits, we should not see any more stuck sessions.	2010-01-07 00:20:41 +01:00
Willy Tarreau	576507f4c5	[MEDIUM] session: also consider request analysers added during response A request analyser may very well be added while processing a response (eg: end of an HTTP keep-alive response). It's very dangerous to only rely on flags that ought to change in order to loop back, so let's correctly detect a possible new analyser addition instead of guessing.	2010-01-07 00:09:04 +01:00
Willy Tarreau	1e0bbafcbe	[MAJOR] session: fix the order by which the analysers are run With the introduction of keep-alive, we have created situations where an analyser can add other analysers to the current list, which are behind it, which have already been processed once, and which are needed immediately because without them there will be no more I/O activity. This is typically the case for enabling reading of a new request after preparing for a new request. Instead of creating specific cases for some analysers (there was already one such before), we now use a little bit of algorithmics to create an ordered bit chain supporting priorities and fast operations. Another advantage of this new construction is that it's not a real loop anymore, so if an analyser is unknown, it will not loop but just ignore it. Note that it is easy to skip multiple analysers at once now in order to speed up the checking a bit. Some test code has shown a minor gain though. This change has been carefully re-read and has no direct reason of causing a regression. However it has been tagged "major" because the fact that it runs the analysers correctly might trigger an old sleeping bug somewhere in one of the analysers.	2010-01-07 00:01:03 +01:00

... 7 8 9 10 11 ...

558 Commits