Till now, when a server was configured with address 0.0.0.0, the
connection was forwarded to this address, which the system generally
interprets as a local address, so this was completely useless.
One sometimes useful feature for outgoing transparent proxies is to
be able to forward the connection to the same address the client
requested. This patch redefines the meaning of 0.0.0.0 precisely so
that the connection is forwarded to the client's initial destination
address.
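For illustration only, a minimal sketch of such a setup (listener and
backend names are invented, and the host-level transparent interception
is assumed to be configured separately) :

    frontend out_tproxy
        bind :3128 transparent
        default_backend be_original

    backend be_original
        # 0.0.0.0 now means "connect to the client's original destination"
        server orig 0.0.0.0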
(cherry picked from commit 61ba936e6858dfcf9964d25870726621d8188fb9)
[ note: the bug was finally not present in 1.5-dev but at least we
have to reset store_count to be compatible with 1.4 ]
Commit d6e9e3b5e320b957e6c491bd92d91afad30ba638 caused recently created
entries to be removed as soon as they were created, breaking stickiness.
It is not clear whether a use-after-free was possible or not in this case.
This bug was reported by Ben Congleton and narrowed down by Hervé Commowick,
both of whom also tested the fix. Thanks to them !
The quote_arg() function can be used to quote an argument or to indicate
"end of line" if it's null or empty. It should be useful to report the
location of problems in the configuration more precisely.
When an entry already exists, we just need to update its expiration
timer. Let's have a dedicated function for that instead of open-coding
it everywhere.
This change also ensures that an update of an existing sticky session
really leads to an update of its expiration timer, which was apparently
not the case till now. This point needs to be checked in 1.4.
This change makes use of the stick-tables to keep track of any source
address activity. Two ACLs make it possible to check an entry's count
or update it and act accordingly. The typical usage will be to reject
a TCP request when the count exceeds a given value.
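As an illustration, using present-day syntax (the exact ACL and rule
keyword spellings at the time of this commit may have differed, and the
threshold is arbitrary), such a setup could look like :

    frontend fe_main
        bind :80
        stick-table type ip size 200k expire 10m store conn_cnt
        # create/update the entry for the source address
        tcp-request connection track-sc0 src
        # reject once the per-source counter exceeds the threshold
        tcp-request connection reject if { src_conn_cnt gt 100 }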
Till now sticky sessions only held server IDs. Now there are other
data types so it is not acceptable anymore to overwrite the server ID
when writing something. The server ID must then only be written from
the caller when appropriate. Doing this has also led to separating the
lookup and storage operations.
This one can be declared in the "stick-table" directive after the
"store" keyword. It will hold the number of connections matching the
entry, for use with ACLs or anything else.
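A hedged illustration of such a declaration (the table size and expire
values are arbitrary) :

    backend be_app
        # reserve storage for the per-entry connection counter
        stick-table type ip size 200k expire 30m store conn_cnt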
The stick_tables will now be able to store extra data for the same key.
A limited set of extra data types will be defined and for each of them
an offset in the sticky session will be assigned at startup time. All
of this information will be stored in the stick table.
The extra data types will have to be specified after the new "store"
keyword of the "stick-table" directive, which will reserve some space
for them.
pattern.c depended on stick_table while in fact it should be the opposite.
So we move from pattern.c everything related to stick_tables and invert the
dependency. That way the code becomes more logical and intuitive.
The names 'exps' and 'keys' in struct stksess were confusing because they
were the same names as in the table which holds all of them, while they each
hold only one node. Remove the trailing 's' to more clearly identify who's who.
Right now we're only able to store a server ID in a sticky session.
The goal is to be able to store anything whose size is known at startup
time. For this, we store the extra data before the stksess pointer,
using a negative offset. It will then be easy to accumulate multiple
data fields provided they each have their own offset.
It's very disturbing to see the "denied req" counter increase without
any other session counter moving. In fact, we can't count a rejected
TCP connection as "denied req" as we have not yet instanciated any
session at all. Let's use a new counter for that.
The frontend's connection was accounted for once the session was
instantiated. This was problematic because the early ACLs weren't
able to correctly account for the number of concurrent connections.
Now we count the connection once it is assigned to the frontend.
It also brings the nice advantage of being more symmetrical, because
the stream_sock's accept() does not have to account for that anymore,
only the session's accept() does.
Now we're able to reject connections very early, so we need to use
separate counters for the connections that are received and those
that are accepted and converted into sessions, so that the rate limits
can still apply to the accepted ones. The session rate must still be
used to compute the rate limit, so that we can reject undesired traffic
without affecting the rate.
A new function session_accept() is now called from the lower layer to
instantiate a new session. Once the session is instantiated, the upper
layer's frontend_accept() is called. This one can be service-dependent.
That way, we have a 3-phase accept() sequence :
  1) protocol-specific, session-less accept(), which is pointed to by
     the listener. It defaults to the generic stream_sock_accept().
  2) session_accept(), which relies on a frontend but not necessarily
     for use in a proxy (eg: stats or any future service).
  3) the service-level accept(), which performs the accept for the
     service offered by the frontend. It defaults to frontend_accept(),
     which is really what a proxy uses.
The TCP/HTTP proxies have been moved to this mode so that we can now rely on
frontend_accept() for any type of session initialization relying on a frontend.
The next step will be to convert the stats to use the same system.
This will be needed for the last factoring step which adds support
for application-level accept(). The tcp/http accept() code has now
been isolated and will have to move to a separate function.
Till now, the frontend relied on the backend's options for INDEPSTR,
while at the time of accept, the frontend and backend are the same.
So we now use the frontend's pointer instead of the backend and we
don't have any dependency on the backend anymore in the frontend's
accept code.
The conn_retries attribute is now assigned when switching from SI_ST_INI
to SI_ST_REQ. This eliminates one of the last dependencies on the backend
in the frontend's accept() function.
The conn_retries still lies in the session and its initialization depends
on the backend, which may not yet be known at that point. Let's first move it to the
stream interface.
The frontend has no reason to initialize the server-side stream_interface.
It's a leftover from old times which now makes no sense due to the fact
that we don't know in the frontend whether the other side will be a socket,
a task or anything else. Removing this part is possible due to previous
patches which perform the initialization at the proper place. We'll still
have to be able to register an I/O handler for situations where everything
is known only to the frontend (eg: unix stats socket), before merging the
various instantiations of this accept() function.
It's not normal to initialize the server-side stream interface from the
accept() function, because it may change later. Thus, we introduce a new
stream_sock_prepare_interface() function which is called just before the
connect() and which sets all of the stream_interface's callbacks to the
default ones used for real sockets. The ->connect function is also set
at the same instant so that we can easily add new server-side protocols
soon.
It was particularly embarrassing that the server timeout was assigned
to buffers during an accept() just to be potentially changed later in
case of a use_backend rule. The frontend side has nothing to do with
server timeouts.
Now we initialize them right after the connect() succeeds. Later this
should change to a single stream-interface timeout setting.
Calling sess_establish() upon a successful connect() was essential, but
it was not clearly stated whether it was necessary for an access to an
I/O handler or not. While it would be desirable, having it automatically
add the response analysers is quite a problem, and it breaks HTTP stats.
The solution is thus not to call it for now and to perform the few response
initializations as needed.
For the long term, we need to find a way to specify the analyzers to install
during a stream_int_register_handler() if any.
The connection timeout stored in the buffer has not been used since the
stream interfaces were introduced. Let's get rid of it as it's one of the
things that complicate factoring of the accept() functions.
We can disable the monitor-net rules on a listener if this flag is not
set in the listener's options. This will be useful when we don't want
to check that fe->addr is set or not for non-TCP frontends.
The new LI_O_TCP_RULES listener option indicates that some TCP rules
must be checked upon accept on this listener. It is now checked by
the frontend and the L4 rules are evaluated only in this case. The
flag is only set when at least one tcp-req rule is present in the
frontend.
The L4 rules check function has now been moved to proto_tcp.c where
it ought to be.
The tcp inspection rules were fast but were only processed after a
schedule had occurred and all resources were allocated. When defending
against DDoS attacks, it's important to be able to apply some protection
at the earliest possible instant.
Thus we introduce a new set of rules : tcp-request rules which act
on pure layer4 information (no content). They are evaluated even
before the buffers are allocated for the session, saving as much
time as possible. That way it becomes possible to check an incoming
connection's source IP address against a list of authorized/blocked
networks, and immediately drop the connection.
The rules are checked even before we perform any socket-specific
operation, so that we can optimize the reject case, which will be the
problematic one during a DDoS. The second stream interface and s->txn
are also now initialized after the rules are parsed for the same
reason. All these optimisations have made it possible to reach up to
212000 connections/s with a real rule rejecting connections based on
the source IP address.
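As a sketch using present-day syntax (the rule keyword may have been
spelled slightly differently at the time of this commit, and the ACL
name and file path are invented) :

    frontend fe_public
        bind :80
        acl blocked src -f /etc/haproxy/blocked-networks.lst
        # evaluated before any buffer is allocated for the session
        tcp-request connection reject if blocked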
For a long time we had two large accept() functions, one for TCP
sockets instantiating proxies, and another one for UNIX sockets
instantiating the stats interface.
A lot of code was duplicated and both did not work exactly the same way.
Now we have a stream_sock layer accept() called for either TCP or UNIX
sockets, and this function calls the frontend-specific accept() function
which does the rest of the frontend-specific initialisation.
Some code is still duplicated (session & task allocation, stream interface
initialization), and might benefit from having an intermediate session-level
accept() callback to perform such initializations. Still there are some
minor differences that need to be addressed first. For instance, the monitor
nets should only be checked for proxies and not for other connection templates.
Last, we renamed l->private as l->frontend. The "private" pointer in
the listener is only used to store a frontend, so let's rename it to
eliminate this ambiguity. When we later support detached listeners
(eg: FTP), we'll add another field to avoid the confusion.
The 'client.c' file now only contained frontend-specific functions,
so it has naturally been renamed 'frontend.c'. Same for client.h. This
has also been an opportunity to remove some cross references from
files that should not have depended on it.
In the end, this file should contain a protocol-agnostic accept()
code, which would initialize a session, task, etc... based on an
accept() from a lower layer. Right now there are still references
to TCP.
Some ACLs in client.c ought to belong to proto_tcp or protocols.
This file should only contain frontend-specific information and will
be renamed accordingly in the next commit.
Some functions which act on generic buffer contents without being
tcp-specific were historically in proto_tcp.c. This concerns ACLs
and RDP cookies. Those have been moved away to more appropriate
locations. Ideally we should create some new files for each layer6
protocol parser. Let's do that later.
Right now we count the incoming connection only once everything has
been allocated. Since we're planning on considering early ACL rules,
we need to count the connection earlier.
Just like we do on health checks, we should consider that ACLs that make
use of buffer data are layer 6 and not layer 4, because we'll soon have
to distinguish between pure layer 4 ACLs (without any buffer) and these
ones.
If a "stick store-request" rule is present, an entry is preallocated during
the request. However, if there is no response due to an error or to a redir
mode server, we never release it.
By using msg->sol as the beginning of a message, wrong messages were
displayed in debug mode when they were truncated on the last line,
because msg->sol points to the beginning of the last line. Use
data+msg->som instead.
This would only be wrong when the server has not completely responded yet.
Fix two other occurrences of wrong rsp<->sl associations which were harmless
but wrong anyway.
The rate-limit feature relied on a timer to define how long a frontend
must remain idle. It was not considering the pending connections, so it
was almost always ready to be used again and only the accept's limit was
preventing new connections from coming in. By accounting for the pending
connection, we can compute a correct delay and effectively make the
frontend go idle for that (short) time.
Latest BF_READ_ATTACHED fix has unveiled a nice issue with the way
HTTP requests and responses are forwarded. The case where the request
aborts after the response has already completed (POST with early response)
forgot to re-enable auto-close on the response. In fact it still
worked thanks to a side effect as long as BF_READ_ATTACHED was there
to force the states to be resynced (and the flags). Since last fix,
the missing auto-close causes CLOSE_WAIT connections when the client
aborts too late during a data transfer.
The right fix consists in considering the situation where the client
experiences an error and to explicitly abort the transfer. There is
no need to wake the response analysers up for that since they'd have
no added value and the analysers flags are cleared. However for a
future usage, that might help (eg: stickiness, ...).
This fix should be backported to 1.4 if the previous one is backported
too. After all the non-reg tests, the risk of seeing a problem arise
without both patches seems low, and both patches touch sensitive areas
of the code. So there's no hurry.
The BF_READ_ATTACHED flag was created to wake analysers once after
a connection was established. It turns out that this flag is never
cleared once set, so even if there is no event, some analysers are
still evaluated for no reason.
The bug was introduced with commit ea38854d34.
It may cause slightly increased CPU usage during data transfers, maybe
even quite noticeable when transferring transfer-encoded data, due to
the fact that the request analysers are being checked for every chunk.
This fix must be backported to 1.4 after all non-reg tests have been
completed.
The response analyser was not emptied upon creation of a new session. In
fact it was always zero just because the last session left it in a zero state,
but in case of shared pools this cannot be guaranteed. The net effect is
that it was possible to have some HTTP (or any other) analysers on the
response path of a stats unix socket, which would reject the response.
This fix must be backported to 1.4.
Commit 4605e3b641cebbdbb2ee5726e5bbc3c03a2d7b5e was not enough, because
connections passing from a TCP frontend to an HTTP backend without any
ACL and via a "default_backend" statement were still working on non-initialized
data. An initialization was missing in the session_set_backend() function, next
to the initialization of hdr_idx.
It was once reported at least by Dirk Taggesell that the consistent
hash had a very poor distribution, making use of only two servers.
Jeff Persch analysed the code and found the root cause. Consistent
hash makes use of the server IDs, which are completed after the chash
array initialization. This implies that each server which does not
have an explicit "id" parameter will be merged at the same place in
the chash tree and that in the end, only the first or last servers
may be used.
The now obvious fix (thanks to Jeff) is to assign the missing IDs
earlier. However, it should be clearly understood that changing a
hash algorithm on live systems will rebalance the whole system.
Anyway, the only affected users will be the ones for which the
system is quite unbalanced already. The ones who fix their IDs are
not affected at all.
Kudos to Jeff for spotting that bug which got merged 3 days after
the consistent hashing !
When running in pure TCP mode with a traffic inspection rule to detect
HTTP protocol, we have to initialize the HTTP transaction too. The
effect of not doing this was that some incoming connections could have
been matched as carrying the HTTP protocol even though this was not the case.
Both dispatch and http_proxy modes were broken since 1.4-dev5 when
the adjustment of server health based on response codes was introduced.
In fact, in these modes, s->srv == NULL. The result is a plain segfault.
It should have been flagged critical, but the fact that it remained
unnoticed for 6 months indicates that almost nobody uses these
modes anymore. Also, the crash is immediate upon the first request.
Further versions should not be affected anymore since it's planned to
have a dummy server instead of these annoying NULL pointers.
This ACL was missing in complex setups where the status of a remote site
has to be considered in switching decisions. Until now, using a server's
status in an ACL required a dedicated backend, which is a bit heavy
when multiple servers have to be monitored.
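Assuming the ACL in question is the srv_is_up() check found in current
versions (the backend and server names below are placeholders), a
switching rule could look like :

    frontend fe_www
        acl primary_up srv_is_up(be_primary/srv1)
        use_backend be_primary if primary_up
        default_backend be_backup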
It is now possible to stick on an IP address found in an HTTP header. Right
now only the last occurrence of the header can be used, which is generally
enough for most uses. Also, the header extraction rule only knows how to
convert the header to IP. Later it will be usable as a plain string with
an implicit conversion, and the syntax will not change.
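For illustration (header name, table parameters and server addresses are
made up; newer versions spell the fetch req.hdr_ip) :

    backend be_app
        stick-table type ip size 100k expire 30m
        # stick on the address carried in the last X-Forwarded-For value
        stick on hdr_ip(X-Forwarded-For)
        server app1 192.168.0.11:80
        server app2 192.168.0.12:80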
Most often, pattern files used by ACLs will be produced by tools
which emit some comments (eg: geolocation lists). It's very annoying
to have to clean the files before using them, and it does not make
much sense to be able to support patterns we already can't input in
the config file. So this patch makes the pattern file loader skip
lines beginning with a sharp and the empty ones, and strips leading
spaces and tabs.
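For instance, a file such as this made-up geolocation excerpt can now be
fed directly to "-f" :

    # networks allocated to country XX (comment lines are skipped)
    10.12.0.0/16

            172.16.4.0/24

The blank line is ignored and the leading whitespace in front of the
last entry is stripped.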
Networks patterns loaded from files for longest match ACL testing
will now be arranged into a prefix tree. This is possible thanks to
the new prefix features in ebtree v6.0. Longest match testing is
slightly slower than exact data matching. However, the measured impact
of running at 42000 requests per second and testing whether the IP
address found in a header belongs to a list of 52000 networks or
not is 3% CPU (increase from 66% to 69%). This is low enough to
permit true geolocation based on huge tables.
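A hedged sketch of the kind of setup measured above (file path and names
are invented) :

    frontend fe_geo
        # longest-match lookup of the header's IP in the networks file
        acl from_known_nets hdr_ip(X-Forwarded-For) -f /etc/haproxy/networks.lst
        use_backend be_local if from_known_nets
        default_backend be_default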
Now if some ACL patterns are loaded from a file and the operation is
an exact string match, the data will be arranged in a tree, yielding
a significant performance boost on large data sets. Note that this
only works when the match is case-sensitive.
A new dedicated function, acl_lookup_str(), has been created for this
matching. It is called for every possible input data to test and it
looks the tree up for the data. Since the keywords are loosely typed,
we would have had to add a new column to all keywords to adjust the
function depending on the type. Instead, we just compare on the match
function. We call acl_lookup_str() when we could use acl_match_str().
The tree lookup is performed first, then the remaining patterns are
attempted if the tree returned nothing.
A quick test shows that when matching a header against a list of 52000
network names, haproxy uses 68% of one core on a core2-duo 3.2 GHz at
42000 requests per second, versus 66% without any rule, which means
only a 2% CPU increase for 52000 rules. Doing the same test without
the tree leads to 100% CPU at 6900 requests/s. Also it was possible
to run the same test at full speed with about 50 sets of 52000 rules
without any measurable performance drop.
The code is now ready to support loading patterns from files into trees. For
that, it will be required that the ACL keyword has a flag ACL_MAY_LOOKUP
and that the expr is case sensitive. When that is true, the pattern will
have a flag ACL_PAT_F_TREE_OK to indicate that it is possible to feed the
tree instead of a usual pattern if the parsing function is able to do this.
The tree's root is pre-initialized in the pattern's value so that the
function can easily find it. At that point, if the parsing function decides
to use the tree, it just sets ACL_PAT_F_TREE in the return flags so that
the caller knows the tree has been used and the pattern can be recycled.
That way it will be possible to load some patterns into the tree when it
is compatible, and other ones as linear linked lists. A good example of
this might be IPv4 network entries : right now we support holes in masks,
but this very rare feature is not compatible with binary lookup in trees.
So the parser will be able to decide itself whether the pattern can go to
the tree or not.
The "acl XXX -f <file>" syntax was supported but nothing was read from
the file. This is now possible. All lines are loaded verbatim, even if
they contain spaces (useful for user-agents). There are shortcomings
though. The worst one is that error reporting is too approximate.
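As an illustration (file path, patterns and ACL name are invented), each
line of the file is one complete pattern, spaces included :

    Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
    SomeBadBot/1.0 (+http://example.com/bot)

    frontend fe_www
        acl bad_ua hdr(user-agent) -f /etc/haproxy/bad-agents.lst
        block if bad_ua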
In cttproxy.c, the socket opened by check_cttproxy_version() is not closed
before the function returns. Although it is called only once, I think it is
better to close the socket.
When trying to display an invalid request or response we received,
we must at least check that we have identified something looking
like a start of message, otherwise we can dereference a NULL pointer.
This is used to disable persistence depending on some conditions (for
example using an ACL matching static files or a specific User-Agent).
You can see it as a complement to "force-persist".
In the configuration file, the force-persist/ignore-persist declaration
order defines the rules' priority.
Used with the "appsession" keyword, it can also help reduce memory usage,
as the session won't be hashed when persistence is ignored.
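A minimal sketch (ACL name, extensions and addresses are invented) :

    backend be_app
        acl is_static path_end .png .jpg .css .js
        # do not apply cookie persistence to static objects
        ignore-persist if is_static
        cookie SRVID insert indirect nocache
        server app1 192.168.0.11:80 cookie a1
        server app2 192.168.0.12:80 cookie a2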
I met a strange behaviour with appsession.
I first thought this was a regression due to one of my previous patches,
but after testing with a 1.3.15.12 version, I could also reproduce it.
To illustrate, the configuration contains :
appsession PHPSESSID len 32 timeout 1h
Then I call a short PHP script containing :
setcookie("P", "should not match")
When calling this script through haproxy, the cookie "P" matches the appsession rule :
Dumping hashtable 0x11f05c8
table[1572]: should+not+match
Shouldn't it be ignored ?
If you confirm, I'll send a patch for 1.3 and 1.4 branches to check that the
cookie length is equal to the appsession name length.
This is due to the comparison length, where the cookie length is taken into
account instead of the appsession name length. Using the appsession name
length would allow ASPSESSIONIDXXX (plus a check that memcmp won't read past
the buffer size).
Also, while testing, I noticed that HEAD requests were not available for
URIs containing the appsession parameter. The 1.4.3 patch fixes a horrible
segfault I missed in a previous patch when appsession is not in the
configuration and HAProxy is compiled with DEBUG_HASH.
Some servers do not completely conform with RFC2616 requirements for
keep-alive when they receive a request with "Connection: close". More
specifically, they don't bother using chunked encoding, so the client
never knows whether the response is complete or not. One immediately
visible effect is that haproxy cannot maintain client connections alive.
The second issue is that truncated responses may be cached on clients
in case of network error or timeout.
Óscar Frías Barranco reported this issue on Tomcat 6.0.20, and
Patrik Nilsson with Jetty 6.1.21.
Cyril Bonté proposed this smart idea of pretending we run keep-alive
with the server and closing it at the last moment as is already done
with option forceclose. The advantage is that we only change one
emitted header but not the overall behaviour.
Since some servers such as nginx are able to close the connection
very quickly and save network packets when they're aware of the
close negotiation in advance, we don't enable this behaviour by
default.
"option http-pretend-keepalive" will have to be used for that, in
conjunction with "option http-server-close".
Using get_ip_from_hdr2() we can look for occurrence #X or #-X and
extract the IP it contains. This is typically designed for use with
the X-Forwarded-For header.
Using "usesrc hdr_ip(name,occ)", it becomes possible to use the IP address
found in <name>, and possibly specify occurrence number <occ>, as the
source to connect to a server. This is possible both in a server and in
a backend's source statement. This is typically used to use the source
IP previously set by an upstream proxy.
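A hedged configuration sketch (header name and addresses are placeholders;
the usual transparent-proxy kernel/build support is assumed) :

    backend be_app
        # connect to the servers using the address found in the last
        # X-Forwarded-For occurrence as the source address
        source 0.0.0.0 usesrc hdr_ip(X-Forwarded-For,-1)
        server app1 192.168.1.10:80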
The transparent proxy address selection was set in the TCP connect function
which is not the most appropriate place since this function has limited
access to the parameters from which a source address could be derived.
Instead, now we determine the source address in backend.c:connect_server(),
right after calling assign_server_address() and we assign this address in
the session and pass it to the TCP connect function. This cannot be performed
in assign_server_address() itself because in some cases (transparent mode,
dispatch mode or http_proxy mode), we assign the address somewhere else.
This change will open the ability to bind to addresses extracted from many
other criteria (eg: from a header).
We'll need another flag in the 'options' member close to PR_O_TPXY_*,
and all the bits there are used, so let's move this easy one to options2
(which is already used for SQL checks).
It's very common to see people getting trapped by HTTP-only options
which don't work in TCP proxies. To help them definitely get rid of
those configs, let's emit warnings for all options and statements
which are not supported in their mode. That includes all HTTP-only
options, the cookies and the stats.
In order to ensure internal config correctness, the options are also
disabled.
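For instance, a configuration like this made-up one would now be reported :

    listen smtp_relay
        mode tcp
        bind :25
        cookie SRVID insert      # HTTP-only: warned about and disabled
        stats uri /stats         # likewise in TCP mode
        server mta1 192.168.0.20:25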
It was disturbing to see a backend name associated with a bad request
when this "backend" was in fact the frontend. Instead, we now display
"backend <NONE>" if the "backend" has no backend capability :
> show errors
[25/Mar/2010:06:44:25.394] frontend fe (#1): invalid request
src 127.0.0.1, session #0, backend <NONE> (#-1), server <NONE> (#-1)
request length 45 bytes, error at position 0:
Isidore Li reported an occasional segfault when using URL hashing, and
kindly provided backtraces and core files to help debugging.
The problem was triggered by reset connections before the URL was sent,
and was due to the same bug which was fixed by commit e45997661b
(connections were attempted in case of connection abort). While that
bug was already fixed, it appeared that the same segfault could be
triggered when URL hashing is configured in an HTTP backend when the
frontend runs in TCP mode and no URL was seen. It is totally abnormal
to try to hash a null URL, as well as to process any kind of L7 hashing
when a full request was not seen.
This additional fix now ensures that layer7 hashing is not performed on
incomplete requests.
The following patch fixed an issue but brought another one :
296897 [MEDIUM] connect to servers even when the input has already been closed
The new issue is that when a connection is inspected and aborted using
TCP inspect rules, now it is sent to the server before being closed. So
that test is not satisfactory. A probably better way is not to prevent a
connection from establishing if only BF_SHUTW_NOW is set but BF_SHUTW
is not. That way, the BF_SHUTW flag is not set if the request has any
data pending, which still fixes the stats issue, but does not let any
empty connection pass through.
Also, as a safety measure, we extend buffer_abort() to automatically
disable the BF_AUTO_CONNECT flag. While it appears to always be OK,
it is by pure luck, so better safe than sorry.
This warning was first reported by Ross West on FreeBSD, then by
Holger Just on OpenSolaris. It also happens on 64bit Linux. However,
fixing the format to use long int causes complaints on 32bit Linux where
ptrdiff_t is apparently different. Better to cast the pointer difference
to an int then.
To save a little memory, the check_data buffer is only allocated
for the servers that are checked.
[WT: this patch saves 80 MB of RAM on the test config with 5000 servers]
Cyril Bonté reported a regression introduced with the very last changes
on the checks code, which causes checks to fail if the server does not
close the connection in time. This happens on HTTP/1.1 checks or
on SMTP checks for instance.
This fix consists in restoring the old behaviour of parsing as soon
as something is available in the response buffer, and waiting for
more data if some are missing. This also helps releasing connections
earlier (eg: a GET check will not have to download the whole object).
Apparently some systems define MSG_NOSIGNAL but do not necessarily
check it (or maybe binaries are built somewhere and used on older
versions). There were reports of very recent FreeBSD setups causing
SIGPIPEs, while older ones catch the signal. Recent FreeBSD manpages
indeed define MSG_NOSIGNAL.
So let's now unconditionally catch the signal. It's useless not to do
it for the rare cases where it's not needed (linux 2.4 and below).
Bernhard Krieger reported truncated HTTP responses in presence of some
specific chunk-encoded data, and kindly offered complete traces of the
issue which made it easy to reproduce it.
Those traces showed that the chunks were of exactly 8192 bytes, chunk
size and CRLF included, which was exactly half the size of the buffer.
In this situation, the function http_chunk_skip_crlf() could erroneously
try to parse a CRLF after the chunk believing there were more data
pending, because the number of bytes present in the buffer was considered
instead of the number of remaining bytes to be parsed.
This happens when a server immediately closes the connection after
the response without lingering or when we close before the end of
the data. We get an RST which translates into a late error. We must
not declare an error without checking that the contents are OK.
Since the recv() call returns every time it succeeds, we always need two
calls with one intermediate poll before detecting the end of the response :
20:20:03.958207 recv(7, "HTTP/1.1 200\r\nConnection: close\r\n"..., 8030, 0) = 145
20:20:03.958365 epoll_wait(3, {{EPOLLIN, {u32=7, u64=7}}}, 8, 1000) = 1
20:20:03.958543 gettimeofday({1268767203, 958626}, NULL) = 0
20:20:03.958694 recv(7, ""..., 7885, 0) = 0
20:20:03.958833 shutdown(7, 2 /* send and receive */) = 0
Let's read as long as we can, that way we can detect end of connections
in the same call, which is much more efficient especially for LBs with
hundreds of servers :
20:29:58.797019 recv(7, "HTTP/1.1 200\r\nConnection: close\r\n"..., 8030, 0) = 145
20:29:58.797182 recv(7, ""..., 7885, 0) = 0
20:29:58.797356 shutdown(7, 2 /* send and receive */) = 0
We are seeing both real servers repeatedly going on- and off-line with
a period of tens of seconds. Packet tracing, stracing, and adding
debug code to HAProxy itself have revealed that the real servers are
always responding correctly, but HAProxy is sometimes receiving only
part of the response.
It appears that the real servers are sending the test page as three
separate packets. HAProxy receives the contents of one, two, or three
packets, apparently randomly. Naturally, the health check only
succeeds when all three packets' data are seen by HAProxy. If HAProxy
and the real servers are modified to use a plain HTML page for the
health check, the response is in the form of a single packet and the
checks do not fail.
(...)
I've added buffer and length variables to struct server, and allocated
space with the rest of the server initialisation.
(...)
It seems to be working fine in my tests, and handles check responses
that are bigger than the buffer.
Those two codes can be triggered on demand by client requests.
We must not fail a server on them.
Ideally we should ignore a certain set of status codes which indicate
neither life nor death.