The acl and map code perform the same file parsing work. This
patch merges this duplicated code into a single function.
Note that the function map_read_entries_from_file() in the file "map.c"
is moved to the function pat_ref_read_from_file_smp() in the file
"pattern.c". The code of this function is not modified; only the
name and the argument order have changed.
The find_smp functions search for the sample using the value of the
pat_ref_elt pointer. The pat_find_smp_* functions are no longer used.
The function pattern_find_smp() knows all pattern indexation methods,
so the sample can be found regardless of how the pattern is indexed.
All the pattern delete functions can use their reference to the original
"struct pat_ref_elt" to find the element to be removed. The functions
pat_del_list_str() and pat_del_meth() were deleted because, after
applying this modification, they had the same code as pat_del_list_ptr().
Pattern references are stored with two identifiers: the unique_id and
the reference.
The reference identifies a file. Each use of the same file name points to
the same reference, so a single file can be registered several times. If
the file is modified, all its users are updated as well. The reference
can be used with maps or ACLs.
The unique_id identifies an inline ACL. The unique id is unique for each
ACL. You cannot force the same id in the configuration file, because
this is reported as an error.
The format of the acl and map listing through the "socket" has changed
for displaying these new ids.
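The storage can be pictured with this minimal C sketch (field names are
simplified assumptions, not the exact haproxy structure):

struct pat_ref_sketch {
    struct pat_ref_sketch *next;  /* chaining of all known references */
    char *reference;              /* file name, shared by all users of the file */
    int unique_id;                /* unique id for inline ACLs, -1 otherwise */
    /* ... list of pattern entries (struct pat_ref_elt) ... */
};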
This patch extracts the expect_type variable from the "struct pattern" to
"struct pattern_head". This variable is set during the declaration of
ACLs and maps. With this change, the function "pat_parse_len()" becomes
useless and can be replaced by "pat_parse_int()".
Implicit ACLs by default rely on the fetch's output type, so let's simply do
the same for all other ones. It has been verified that they all match.
This patch adds the following socket command line options:
show acl [<id>]
clear acl <id>
get acl <id> <pattern>
del acl <id> <pattern>
add acl <id> <pattern>
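For example, assuming the stats socket is bound to /tmp/haproxy.sock and
an ACL file /etc/haproxy/allow.lst was loaded, the new commands can be
driven this way (paths and values are only examples):

  $ echo "show acl" | socat stdio /tmp/haproxy.sock
  $ echo "add acl /etc/haproxy/allow.lst 10.0.0.4" | socat stdio /tmp/haproxy.sock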
The system used for maps is ported to the pattern functions.
Some functions need to change the sample associated with a pattern. This
new pointer permits returning a pointer to the sample pointer, so the
caller can use or change the value.
This commit adds a delete function for patterns. It looks up all
instances of the pattern to delete and deletes them all. The fetch
keyword declarations have been extended to point to the appropriate
delete function.
The match function knows the format of the pattern. The pattern can be
stored in a list or in a tree. The pattern matching function itself uses
the proper entry point and indexation type.
Each pattern matching function returns the struct pattern that matched.
If the flag "fill" is set, the struct pattern is filled; otherwise the
content of this struct must not be used.
With this feature, the general pattern matching function needs no special
cases for building the "struct pattern".
Before this commit, the pattern_exec_match() function returned the
associated sample, the associated struct pattern or the associated struct
pattern_tree. This was complex to use, because the caller had to check
the type of the information returned.
Now the function always returns a "struct pattern". If <fill> is not set,
only the value of the pointer can be used as a boolean (NULL or not). If
<fill> is set, the <smp> pointer and the pattern information can be used.
If information must be duplicated, it is stored in the trash buffer.
Otherwise, the pattern can point to existing strings.
Methods are currently stored using two types: an integer if the method is
known, and a string if it is not. The fetch is declared as UINT, but in
some cases it can provide STR.
This patch creates a new type called METH. This type contains an integer
for known methods and a string for the other ones. It can be used with
automatic converters.
The pattern matching can now expect the method type.
During the free or prune functions, the http_meth pattern is freed. This
patch initializes the freed pointer to NULL.
The operations applied on types SMP_T_CSTR and SMP_T_STR are the same,
but the check code and the declarations are duplicated, because actions
must be declared for both SMP_T_C* and SMP_T_*. The declared actions and
checks are the same; this complicates the code. Only the "conv" functions
can change from "C*" to "*".
Now, if a function needs to modify the input string, it can call the new
function smp_dup(), which duplicates the data into a trash buffer.
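A minimal sketch of the idea behind smp_dup(), with a simplified chunk
type instead of the real sample/trash structures:

#include <string.h>

struct chunk { char *str; int len; int size; };
extern struct chunk trash;            /* shared scratch buffer */

/* make <s> modifiable by copying its content into the trash buffer */
static void smp_dup_sketch(struct chunk *s)
{
    if (s->len > trash.size)
        s->len = trash.size;          /* truncate to the trash size */
    memcpy(trash.str, s->str, s->len);
    s->str  = trash.str;
    s->size = trash.size;
}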
The pattern parse functions put the parsed result in a "struct pattern"
without memory allocation. If the pattern can reference the input data
without changes, it points to the parsed string. If buffers are needed
to store translated data, it uses the trash buffer. The indexation
function allocates the memory later, if needed.
Before this patch, the indexation function checked the declared pattern
matching function and indexed the data according to this function. This
made it inconvenient to add new indexation modes.
This commit adds dedicated indexation functions. Each struct pattern is
associated with one indexation function. This function permits indexing
data according to the type of pattern and the type of match.
This commit separates the "struct list" used for chaining from the
"struct pattern" which contains the pattern data. Later, this change will
permit manipulating lists and trees with the same "struct pattern".
Each pattern parser takes only one string. This change is reflected in
the prototype of the function "pattern_register()". Now, it is called
with just one string and no longer needs to browse the array of args.
After the previous patches, the "pat_parse_strcat()" function disappeared,
and the "pat_parse_int()" and "pat_parse_dotted_ver()" functions no
longer use the "opaque" argument and take only one string as input.
So, after this patch, no pattern parser uses the opaque variable anymore,
and each takes only one string as input. This patch changes the
prototype of the pattern parsing functions.
Now, the "char **args" is replaced by a "char *arg", the "int *opaque"
is removed, and these functions return 1 on success and 0 on failure.
This patch removes the limit of 32 groups. It also permits using the
standard "pat_parse_str()" function in place of "pat_parse_strcat()". The
"pat_parse_strcat()" function is no longer used and is removed. Before
this patch, the groups were stored in a bitfield; now they are stored in
a list of strings. The matching is slower, but the number of groups is
low and generally the list of allowed groups is short.
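The resulting membership test can be sketched like this (structures
simplified):

#include <string.h>

struct grp_ref { struct grp_ref *next; char *name; };

/* linear scan over the user's group names; the lists are short */
static int user_in_group(const struct grp_ref *groups, const char *grp)
{
    for (; groups; groups = groups->next)
        if (strcmp(groups->name, grp) == 0)
            return 1;
    return 0;
}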
The fetch function "smp_fetch_http_auth_grp()", used with the name
"http_auth_group", returns the valid username. It can be used as a string
for displaying the username, or with the ACL "http_auth_group" for
checking the group of the user.
Maybe the names of the ACL and fetch methods are no longer suitable, but
the current names are kept to preserve compatibility with existing
configurations.
The function "userlist_postinit()" is created from verification code
stored in the big function "check_config_validity()". The code is
adapted to the new authentication storage system and it is moved in the
"src/auth.c" file. This function is used to check the validity of the
users declared in groups and to check the validity of groups declared
on the "user" entries.
This resolve function is executed before the check of all proxy because
many acl needs solved users and groups.
Large configurations can take time to parse when thousands of backends
are in use. Let's store all the proxies in trees.
findproxy_mode() has been modified to use the tree for lookups, which
has divided the parsing time by about 2.5. But many lookups are still
present at many places and need to be dealt with.
We store the time stamp of last read in the channel in order to
be able to measure some bit rate and pause lengths. We only use
16 bits which were unused for this. We don't need more, as it
allows us to measure with a millisecond precision for up to 65s.
These ones are only reset during transfers. There is a low but non-null
risk that a first full read causes the previous value to be reused and
the CF_STREAMER flag to be set immediately. The impact is only to
increase the SSL record size earlier than expected and to use splice().
This bug was already present in 1.4, so a backport is possible.
Summary:
Track and report last session time on the stats page for each server
in every backend, as well as the backend.
This attempts to address the requirement in the ROADMAP
- add a last activity date for each server (req/resp) that will be
displayed in the stats. It will be useful with soft stop.
The stats page reports this as time elapsed since the last session. This
change does not adequately address the requirement for long running
sessions (websocket, RDP, etc.).
Till now, we had one flag per stick counter to indicate if it was
tracked in a backend or in a frontend. We just had to add another
flag per stick-counter to indicate if it relies on contents or just
connection. These flags are quite painful to maintain and tend to
easily conflict with other flags if their number is changed.
The correct solution consists in moving the flags to the stkctr struct
itself, but currently this struct is made of 2 pointers, so adding a
new entry there to store only two bits will cause at least 16 more bytes
to be eaten per counter due to alignment issues, and we definitely don't
want to waste tens to hundreds of bytes per session just for things that
most users don't use.
Since we only need to store two bits per counter, an intermediate
solution consists in replacing the entry pointer with a composite
value made of the original entry pointer and the two flags in the
2 unused lower bits. If later a need for other flags arises, we'll
have to store them in the struct.
A few inline functions have been added to abstract the retrieval
and assignment of the pointers and flags, resulting in very few
changes. That way there is no more dependence on the number of
stick-counters and their position in the session flags.
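The composite value can be sketched as follows (names and layout are
simplified assumptions; allocated entries are at least 4-byte aligned,
which frees the two low bits of the pointer):

#include <stdint.h>

#define STKCTR_TRACK_BACKEND 0x1   /* tracked by a backend rule */
#define STKCTR_TRACK_CONTENT 0x2   /* tracked by a content-based rule */

struct stkctr_sketch { void *entry; };  /* entry pointer + 2 flag bits */

static inline void *stkctr_entry(const struct stkctr_sketch *c)
{
    return (void *)((uintptr_t)c->entry & ~(uintptr_t)3);
}

static inline unsigned int stkctr_flags(const struct stkctr_sketch *c)
{
    return (unsigned int)((uintptr_t)c->entry & 3);
}

static inline void stkctr_set(struct stkctr_sketch *c, void *e, unsigned int f)
{
    c->entry = (void *)((uintptr_t)e | (f & 3));
}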
One year ago, commit 5d5b5d8 ("MEDIUM: proto_tcp: add support for tracking
L7 information") brought support for tracking L7 information in tcp-request
content rules. Two years earlier, commit 0a4838c ("[MEDIUM] session-counters:
correctly unbind the counters tracked by the backend") used to flush the
backend counters after processing a request.
While that earliest patch was correct at the time, it became wrong after
the second patch was merged. The code does what it says, but the concept
is flawed. "TCP request content" rules are evaluated for each HTTP request
over a single connection. So if such a rule in the frontend decides to
track any L7 information or to track L4 information when an L7 condition
matches, then it is applied to all requests over the same connection even
if they don't match. This means that a rule such as :
tcp-request content track-sc0 src if { path /index.html }
will count one request for index.html, and another one for each of the
objects present on this page that are fetched over the same connection
which sent the initial matching request.
Worse, it is possible to make the code do stupid things by using multiple
counters:
tcp-request content track-sc0 src if { path /foo }
tcp-request content track-sc1 src if { path /bar }
Just sending two requests first, one with /foo, one with /bar, causes
all subsequent requests to be counted twice, simply because both
trackings persist after the end of the request.
So the decision to flush backend-tracked counters was not the correct
one. In practice, what is important is to flush content-based rules
since they are the ones evaluated for each request.
Doing so requires new flags in the session however, to keep track of
which stick-counter was tracked by what ruleset. A later change might
make this easier to maintain over time.
This bug is 1.5-specific, no backport to stable is needed.
In addition to previous outputs, we also emit the cumulated number of
connections, the cumulated number of requests, the maximum allowed
SSL connection concurrency, the current number of SSL connections and
the cumulated number of SSL connections. This will help troubleshoot
systems which experience memory shortage due to SSL.
This function is used to compute the new polling state based on
the previous state. All pollers have to do this in their update
loop, so better centralize the logic for it.
Currently, each poll loop handles the polled events the same way,
resulting in a lot of duplicated, complex code. Additionally, epoll
was the only one to handle newly created FDs immediately.
So instead, let's move that code to fd.c in a new function dedicated
to this task : fd_process_polled_events(). All pollers now use this
function.
This is the reimplementation of the "done" action : when we experience
a short read, we're almost certain that we've exhausted the system's
buffers and that we'll meet an EAGAIN if we attempt to read again. If
the FD is not yet polled, the stream interface already takes care of
stopping the speculative read. When the FD is already being polled, we
have two options :
- either we're running from a level-triggered poller, in which case
we'd rather report that we've reached the end so that we don't
speculate over the poller and let it report next time data are
available ;
- or we're running from an edge-triggered poller in which case we
have no choice and have to see the EAGAIN to re-enable events.
At the moment we don't have any edge-triggered poller, so it's desirable
to avoid speculative I/O that we know will fail.
Note that this must not be ported to SSL since SSL hides the real
readiness of the file descriptor.
Thanks to this change, we observe no EAGAIN anymore during keep-alive
transfers, and failed recvfrom() are reduced by half in http-server-close
mode (the client-facing side is always being polled and the second recv
can be avoided). Doing so results in about 5% performance increase in
keep-alive mode. Similarly, we used to have up to about 1.6% of EAGAIN
on accept() (1/maxaccept), and these have completely disappeared under
high loads.
It's easier and safer to rely on conn_xprt_ready() everywhere than to
check the flag itself. It will also simplify adding extra checks later
if needed. Some useless controls for !xprt have been removed, as the
XPRT_READY flag itself guarantees xprt is set.
It's easier and safer to rely on conn_ctrl_ready() everywhere than to
check the flag itself. It will also simplify adding extra checks later
if needed. Some useless controls for !ctrl have been removed, as the
CTRL_READY flag itself guarantees ctrl is set.
We simply remove these functions and replace their calls with the
appropriate ones :
- if we're in the data phase, we can simply report wait on the FD
- if we're in the socket phase, we may also have to signal the
desire to read/write on the socket because it might not be
active yet.
These flags were used to report the readiness of the file descriptor.
Now this readiness is directly checked at the file descriptor itself.
This removes the need for constantly synchronizing updates between the
file descriptor and the connection and ensures that all layers share
the same level of information.
For now, the readiness is updated in conn_{sock,data}_poll_* by directly
touching the file descriptor. This must move to the lower layers instead
so that these functions can disappear as well. In this state, the change
works but is incomplete. It's sensible enough to avoid making it more
complex.
Now the sock/data updates become much simpler because they just have to
enable/disable access to a file descriptor and not to care anymore about
its readiness.
This commit heavily changes the polling system in order to definitely
fix the frequent breakage of SSL which needs to remember the last
EAGAIN before deciding whether to poll or not. Now we have a state per
direction for each FD, as opposed to a previous and current state
previously. An FD can have up to 8 different states for each direction,
each of which being the result of a 3-bit combination. These 3 bits
indicate a wish to access the FD, the readiness of the FD and the
subscription of the FD to the polling system.
This means that it will now be possible to remember the state of a
file descriptor across disable/enable sequences that generally happen
during forwarding, where enabling reading on a previously disabled FD
would result in forgetting the EAGAIN flag it met last time.
Several new state manipulation functions have been introduced or
adapted :
- fd_want_{recv,send} : enable receiving/sending on the FD regardless
of its state (sets the ACTIVE flag) ;
- fd_stop_{recv,send} : stop receiving/sending on the FD regardless
of its state (clears the ACTIVE flag) ;
- fd_cant_{recv,send} : report a failure to receive/send on the FD
corresponding to EAGAIN (clears the READY flag) ;
- fd_may_{recv,send} : report the ability to receive/send on the FD
as reported by poll() (sets the READY flag) ;
Some functions are used to report the current FD status :
- fd_{recv,send}_active
- fd_{recv,send}_ready
- fd_{recv,send}_polled
Some functions were removed :
- fd_ev_clr(), fd_ev_set(), fd_ev_rem(), fd_ev_wai()
The POLLHUP/POLLERR flags are now reported as ready so that the I/O layers
knows it can try to access the file descriptor to get this information.
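The per-direction state can be sketched this way (flag names and layout
are simplified assumptions):

#define FD_EV_ACTIVE 0x1   /* we want to perform I/O on the FD */
#define FD_EV_READY  0x2   /* the FD is ready, no pending EAGAIN */
#define FD_EV_POLLED 0x4   /* the FD is subscribed to the poller */

struct fd_state_sketch {
    unsigned char recv;    /* combination of the three bits above */
    unsigned char send;
};

static inline void fd_cant_recv(struct fd_state_sketch *s)
{
    s->recv &= ~FD_EV_READY;        /* EAGAIN met : clear READY */
}

static inline void fd_may_recv(struct fd_state_sketch *s)
{
    s->recv |= FD_EV_READY;         /* poller reported readiness */
}

static inline int fd_recv_active(const struct fd_state_sketch *s)
{
    return s->recv & FD_EV_ACTIVE;
}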
In order to simplify the conditions to add/remove cache entries, a new
function fd_alloc_or_release_cache_entry() was created to be used from
pollers while scanning for updates.
The following pollers have been updated :
ev_select() : done, built, tested on Linux 3.10
ev_poll() : done, built, tested on Linux 3.10
ev_epoll() : done, built, tested on Linux 3.10 & 3.13
ev_kqueue() : done, built, tested on OpenBSD 5.2
We're completely changing the way FDs will be polled. There will be no
more speculative I/O since we'll know the exact FD state, so these will
only be cached events.
First, let's fix a few field names which become confusing. "spec_e" was
used to store a speculative I/O event state. Now we'll store the whole
R/W states for the FD there. "spec_p" was used to store a speculative
I/O cache position. Now let's clearly call it "cache".
We're completely changing the way FDs will be polled. First, let's fix
a few field names which become confusing. "spec_e" was used to store a
speculative I/O event state. Now we'll store the whole R/W states for
the FD there.
Quite often, a connection error only reports "socket error" with no more
information. This is especially problematic with health checks, where
many causes are possible, including resource exhaustion, which does not
lead to a valid errno code. So let's add explicit codes to cover these
cases.
Till now there was no way to know from a connection if a previous
call to drain() had done any change. This function is used to drain
incoming data and to update the connection's flags at the same time.
It also correctly sets the polling flags on the connection if the
drain function indicates inability to receive. This function will
be used preferably over ctrl->drain() when a connection is used.
This reverts commit 1208266356.
It randomly breaks SSL. What happens is that if the SSL response is
read at once by the SSL stack and is partially delivered to the buffer,
then there's no way to read the next parts because we wait for some
polling first.
So we'll fix this after the polling rework.
This function is called twice per request, and almost always does nothing.
Better use an inline version to avoid entering it when we can.
About 0.5% additional performance was gained this way.
si_connect() used to only return SI_ST_CON. But it already detects
connection reuse and is the function which avoids calling connect(), so
it already knows whether the connection is valid and reused. Thus we
make it return SI_ST_EST when a connection is reused. This means that
connect_server() can return this state and sess_update_stream_int()
as well.
Thanks to this change, we don't need to leave process_session() in
SI_ST_CON state to immediately enter it again to switch to SI_ST_EST.
Implementing this removes one call to process_session() per request
in keep-alive mode. We're now at 2 calls per request, which is the
minimum (one for the request and another one for the response). The
number of calls to http_wait_for_response() has also dropped from 2
to one.
Tests indicate a performance gain of about 2.6% in request rate in
keep-alive mode. There should be no gain in http-server-close mode since
we don't use this faster path.
This reverts commit f3221f99ac.
Igor reported some very strange breakage of his stats page which is
clearly caused by the chunking, though I don't see at first glance
what could be wrong. Better revert it for now.
In theory the principle is simple as we just need to send HTTP chunks
if the client is 1.1 compatible. In practice it's harder because we
have to append a CR LF after each block of data and we're never sure
to have the room for this. In order not to have to deal with this, we
instead send the CR LF prior to each chunk size. The only issue is for
the first chunk and for this reason we avoid to send the empty header
line when using chunked encoding.
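The resulting chunk emission can be sketched with a hypothetical helper:

#include <stdio.h>

/* Emit "<CRLF><size-in-hex><CRLF>" in front of each chunk. The very
 * first chunk omits the leading CRLF since no data precedes it.
 * Returns the number of bytes written into <buf>. */
static int make_chunk_header(char *buf, size_t size,
                             unsigned int chunk_len, int first)
{
    return snprintf(buf, size, "%s%x\r\n",
                    first ? "" : "\r\n", chunk_len);
}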
If a file descriptor is being polled, and it is stopped (eg: buffer full
or end of response), then re-enabled, currently what happens is that
the polling is disabled, then the fd is enabled in speculative mode,
an I/O attempt is made, it loses (otherwise the FD would surely not
have been polled), and the polling is enabled again.
This is too bad, especially with HTTP keep-alive on the server side
where all operations are performed at once before going back to the
poll loop.
Now we improve the behaviour by ensuring that if an fd is still being
polled, when it's enabled after having been disabled, we re-enable the
polling. Doing so saves a number of syscalls and useless wakeups, and
results in a significant performance gain on HTTP keep-alive. An 11%
increase has been observed on the HTTP request rate in keep-alive
thanks to this.
It could be considered as a bug fix, but there was no harm with the
current behaviour, except extra syscalls.
Idle connections are not monitored right now. So if a server closes after
a response without advertising it, it won't be detected until a next
request wants to use the connection. This is a bit problematic because
it unnecessarily maintains file descriptors and sockets in an idle
state.
This patch implements a very simple idle connection manager for the stream
interface. It presents itself as an I/O callback. The HTTP engine enables
it when it recycles a connection. If a close or an error is detected on the
underlying socket, it tries to drain as much data as possible from the socket,
detect the close and responds with a close as well, then detaches from the
stream interface.
The throttling of low weight servers (<16) could mistakenly be reported
as > 100% due to a rounding that was performed before a multiply by 100
instead of after. This was introduced in 1.5-dev20 when fixing a previous
reporting issue by commit d32c399 (MINOR: stats: report correct throttling
percentage for servers in slowstart).
It should be backported if the patch above is backported.
This is the best place to reuse a connection. We centralize all
connection requests and we're at the best place to know exactly
what the current state of the underlying connection is. If the
connection is reused, we just enable polling for send() in order
to be able to emit the request.
When allocating a new connection, only the caller knows whether it's
acceptable to reuse the previous one or not. Let's pass this information
to si_alloc_conn() which will do the cleanup if the connection is not
acceptable.
Right now we see many places doing their own setsockopt(SO_LINGER).
Better only do it just before the close() in fd_delete(). For this
we add a new flag on the file descriptor, indicating if it's safe or
not to linger. If not (eg: after a connect()), then the setsockopt()
call is automatically performed before a close().
The flag automatically turns to safe when receiving a read0.
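The logic can be sketched as follows (simplified; in haproxy this happens
inside fd_delete()):

#include <sys/socket.h>
#include <unistd.h>

static void fd_close_sketch(int fd, int linger_safe)
{
    if (!linger_safe) {
        /* close immediately with an RST instead of lingering */
        struct linger nolinger = { .l_onoff = 1, .l_linger = 0 };
        setsockopt(fd, SOL_SOCKET, SO_LINGER, &nolinger, sizeof(nolinger));
    }
    close(fd);
}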
conn_xprt_ready() reports if the transport layer is ready.
conn_ctrl_ready() reports if the control layer is ready.
The stream interface uses si_conn_ready() to report that the
underlying connection is ready. This will be used for connection
reuse in keep-alive mode.
Doing so ensures that we're consistent between all the functions in the whole
chain. This is important so that we can extract the argument parsing from this
function.
This patch adds map manipulation commands to the socket interface.
add map <map> <key> <value>
Add the value <value> in the map <map>, at the entry corresponding to
the key <key>. This command does not verify if the entry already
exists.
clear map <map>
Remove all entries from the map <map>.
del map <map> <key>
Delete all the map entries corresponding to the <key> value in the map
<map>.
set map <map> <key> <value>
Modify the value corresponding to each key <key> in a map <map>. The
new value is <value>.
show map [<map>]
Dump info about map converters. Without argument, the list of all
available maps is returned. If a <map> is specified, its content is
dumped.
With this patch, patterns can be compiled for two modes :
- match
- lookup
The match mode is used for example in ACLs or maps. The lookup mode
is used to lookup a key for pattern maintenance. For example, looking
up a network is different from looking up one address belonging to
this network.
A special case is made for regex. In lookup mode they return the input
regex string and do not compile the regex.
Now, the pat_parse_*() functions parse the incoming data. The input
"pattern" struct can be preallocated. If the parser needs some buffers,
it allocates the memory.
The function pattern_register() runs the call to the parser, processes
the key indexation and associates the "sample_storage" used by maps.
This patch removes the compatibility check between the input type and the
match method. Now, it checks whether a cast from the input type to the
output type exists, and the pattern_exec_match() function applies the
casts before each pattern matching.
This is used later for increasing the compatibility with incoming
sample types. When multiple compatible types are supported, one
is arbitrarily used (eg: UINT).
This reduces its size, although the saved space is not reused by anything
else. However it will significantly improve the debugger's output since
we'll now get real state values.
The default case had to be enabled in the parsers because gcc tries
to optimize the switch/case and noticed some values were missing from
the enums and emitted a warning.
From now on, a call to stream_int_register_handler() causes a call
to si_alloc_appctx() and returns an initialized appctx for the
current stream interface. If one was previously allocated, it is
released. If the stream interface was attached to a connection, it
is released as well.
The appctx are allocated from the same pools as the connections, because
they're substantially smaller in size, and we can't have both a connection
and an appctx on an interface at any moment.
In case of memory shortage, the call may return NULL, which is already
handled by all consumers of stream_int_register_handler().
The field appctx was removed from the stream interface since we only
rely on the endpoint now. On 32-bit, the stream_interface size went down
from 108 to 44 bytes. On 64-bit, it went down from 144 to 64 bytes. This
represents a memory saving of 160 bytes per session.
It seems that a later improvement could be to move the call to
stream_int_register_handler() to session.c for most cases.
The task returned by stream_int_register_handler() is never used, however we
always need to access the appctx afterwards. So make it return the appctx
instead. We already plan for it to fail, which is the reason for the addition
of a few tests and the possibility for the HTTP analyser to return a status
code 500.
We're about to remove si->appctx, so first let's replace all occurrences
of its usage with a dynamic extract from si->end. A lot of code was changed
by search-n-replace, but the behaviour was intentionally not altered.
The code surrounding calls to stream_int_register_handler() was slightly
changed since we can only use si->end *after* the registration.
We used to have two very similar functions for sending a PROXY protocol
line header. The reason is that the default one relies on the stream
interface to retrieve the other end's address, while the "local" one
performs a local address lookup and sends that instead (used by health
checks).
Now that the send_proxy_ofs is stored in the connection and not the
stream interface, we can make the local_send_proxy rely on it and
support partial sends. This also simplifies the code by removing the
local_send_proxy function, making health checks use send_proxy_ofs,
resulting in the removal of the CO_FL_LOCAL_SPROXY flag, and the
associated test in the connection handler. The other flag,
CO_FL_SI_SEND_PROXY was renamed without the "SI" part so that it
is clear that it is not dedicated anymore to a usage with a stream
interface.
Till now the send_proxy_ofs field remained in the stream interface,
but since the dynamic allocation of the connection, it makes a lot
of sense to move that into the connection instead of the stream
interface, since it will not be statically allocated for each
session.
Also, it turns out that moving it to the connection fills an alignment
hole on 64 bit architectures so it does not consume more memory, and
removing it from the stream interface was an opportunity to correctly
reorder fields and reduce the stream interface's size from 160 to 144
bytes (-10%). This is 32 bytes saved per session.
The outgoing connection is now allocated dynamically upon the first attempt
to touch the connection's source or destination address. If this allocation
fails, we fail on SN_ERR_RESOURCE.
As we didn't use si->conn anymore, it was removed. The endpoints are released
upon session_free(), on the error path, and upon a new transaction. That way
we are able to carry the existing server's address across retries.
The stream interfaces are not initialized anymore before session_complete(),
so we could even think about allocating them dynamically as well, though
that would not provide much savings.
The session initialization now makes use of conn_new()/conn_free(). This
slightly simplifies the code and makes it more logical. The connection
initialization code is now shorter by about 120 bytes because it's done
at once, allowing the compiler to remove all redundant initializations.
The si_attach_applet() function now takes care of first detaching the
existing endpoint, and it is called from stream_int_register_handler(),
so we can safely remove the calls to si_release_endpoint() in the
application code around this call.
A call to si_detach() was made upon stream_int_unregister_handler() to
ensure we always free the allocated connection if one was allocated in
parallel to setting an applet (eg: detect HTTP proxy while proceeding
with stats maybe).
si_prepare_conn() is not appropriate in our case as it both initializes and
attaches the connection to the stream interface. Due to the asymmetry between
accept() and connect(), it causes some fields such as the control and transport
layers to be reinitialized.
Now that we can separately initialize these fields using conn_prepare(), let's
break this function to only attach the connection to the stream interface.
Also, by analogy, si_prepare_none() was renamed si_detach(), and
si_prepare_applet() was renamed si_attach_applet().
We don't want to assign the control nor transport layers anymore
at the same time as the data layer, because it prevents one from
keeping existing settings when reattaching a connection to an
existing stream interface.
Let's have conn_attach() replace conn_assign() for this purpose.
Thus, conn_prepare() + conn_attach() do exactly the same as the
previous conn_assign().
Now that we can assign conn->xprt regardless of the initialization state,
we can reintroduce conn_prepare() to set only the protocol, the transport
layer and initialize the transport layer's state.
The first function is used to (re)initialize a stream interface and
the second to force it into a known state. These are intended for
cleaning up the stream interface initialization code in session.c
and peers.c and avoiding future issues with missing initializations.
Currently the control and transport layers of a connection are supposed
to be initialized when their respective pointers are not NULL. This will
not work anymore when we plan to reuse connections, because there is an
asymmetry between the accept() side and the connect() side :
- on accept() side, the fd is set first, then the ctrl layer then the
transport layer ; upon error, they must be undone in the reverse order,
then the FD must be closed. The FD must not be deleted if the control
layer was not yet initialized ;
- on the connect() side, the fd is set last and there is no reliable way
to know if it has been initialized or not. In practice it's initialized
to -1 first but this is hackish and supposes that local FDs only will
be used forever. Also, there are even fewer solutions for keeping track
of the transport layer's state.
Also it is possible to support delayed close() when something (eg: logs)
tracks some information requiring the transport and/or control layers,
making it even more difficult to clean them.
So the proposed solution is to add two flags to the connection :
- CO_FL_CTRL_READY is set when the control layer is initialized (fd_insert)
and cleared after it's released (fd_delete).
- CO_FL_XPRT_READY is set when the transport layer is initialized (xprt->init)
and cleared after it's released (xprt->close).
The functions have been adapted to rely on this and not on the pointers
anymore. conn_xprt_close() was unused and dangerous : it did not close
the control layer (eg: the socket itself) but still marks the transport
layer as closed, preventing any future call to conn_full_close() from
finishing the job.
The problem comes from conn_full_close() in fact. It needs to close the
xprt and ctrl layers independently. After that we're still having an issue :
we don't know based on ->ctrl alone whether the fd was registered or not.
For this we use the two new flags CO_FL_XPRT_READY and CO_FL_CTRL_READY. We
now rely on this and not on conn->xprt nor conn->ctrl anymore to decide what
remains to be done on the connection.
In order not to miss some flag assignments, we introduce conn_ctrl_init()
to initialize the control layer, register the fd using fd_insert() and set
the flag, and conn_ctrl_close() which unregisters the fd and removes the
flag, but only if the transport layer was closed.
Similarly, at the transport layer, conn_xprt_init() calls ->init and sets
the flag, while conn_xprt_close() checks the flag, calls ->close and clears
the flag, regardless of xprt_ctx or xprt_st. This also ensures that the ->init
and the ->close functions are called only once each and in the correct order.
Note that conn_xprt_close() does nothing if the transport layer is still
tracked.
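The pairing can be sketched like this (structures and lower-layer calls
simplified):

#define CO_FL_CTRL_READY 0x1
#define CO_FL_XPRT_READY 0x2

struct conn_sketch { unsigned int flags; int fd; };

static void conn_xprt_close_sketch(struct conn_sketch *c)
{
    if (!(c->flags & CO_FL_XPRT_READY))
        return;                  /* ->close must be called only once */
    /* ... xprt->close(c) ... */
    c->flags &= ~CO_FL_XPRT_READY;
}

static void conn_ctrl_close_sketch(struct conn_sketch *c)
{
    if (!(c->flags & CO_FL_CTRL_READY))
        return;
    if (c->flags & CO_FL_XPRT_READY)
        return;                  /* transport still open : not yet */
    /* ... fd_delete(c->fd) ... */
    c->flags &= ~CO_FL_CTRL_READY;
}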
conn_full_close() now simply calls conn_xprt_close() then conn_ctrl_close()
in turn, which do nothing if CO_FL_XPRT_TRACKED is set.
In order to handle the error path, we also provide conn_force_close() which
ignores CO_FL_XPRT_TRACKED and closes the transport and the control layers
in turns. All relevant instances of fd_delete() have been replaced with
conn_force_close(). Now we always know what state the connection is in and
we can expect to split its initialization.
conn_new() will be a more convenient way of allocating and initializing
a connection. It calls pool_alloc2() and conn_init() upon success.
conn_free() is just a pool_free2() but is provided for symmetry with
conn_new().
Everywhere conn_prepare() is used, the call to conn_init() has already
been done. We can now safely replace all instances of conn_prepare()
with conn_assign() which does not reset the transport layer, and remove
conn_prepare().
This function will ease the initialization of new connections as well
as their reuse. It initializes the obj_type and a few fields so that
the connection is fresh again. It leaves the addresses and target
untouched so it is suitable for use across connection retries.
The connection will only remain there as a pre-allocated entity whose
goal is to be placed in ->end when establishing an outgoing connection.
All connection initialization can be made on this connection, but all
information retrieved should be applied to the end point only.
This change is huge because there were many users of si->conn. Now the
only users are those who initialize the new connection. The difficulty
appears in a few places such as backend.c, proto_http.c, peers.c where
si->conn is used to hold the connection's target address before assigning
the connection to the stream interface. This is why we have to keep
si->conn for now. A future improvement might consist in dynamically
allocating the connection when it is needed.
This function makes no sense anymore and will cause trouble to convert
the remains of connection/applet to end points. Let's replace it now
with its contents.
The long-term goal is to have a context for applets as an alternative
to the connection and not as a complement. At the moment, the context
is still stored into the stream interface, and we only put a pointer
to the applet's context in si->end, initialize the context with object
type OBJ_TYPE_APPCTX, and this allows us not to allocate an entry when
deciding to switch to an applet.
A special care is taken to never dereference si->conn anymore when
dealing with an applet. That's why it's important that si->end is
always set to the proper type :
si->end == NULL => not connected to anything
*si->end == OBJ_TYPE_APPCTX => connected to an applet
*si->end == OBJ_TYPE_CONN => real connection (server, proxy, ...)
The session management code used to check the applet from the connection's
target. Now it uses the stream interface's end point and does not touch the
connection at all. Similarly, we stop checking the connection's addresses
and file descriptors when reporting the applet's status in the stats dump.
Since last commit, we now have a pointer to the applet in the
applet context. So we don't need the si->release function pointer
anymore, it can be extracted from applet->applet.release. At many
places, the ->release function was still tested for real connections
while it is only limited to applets, so most of them were simply
removed. For the remaining valid uses, a new inline function
si_applet_release() was added to simplify the check and the call.
In preparation for a later move of all the applet context outside of the
stream interface, we'll need to have access to the applet itself from the
context. Let's have a pointer to it inside the context.
si_prepare_embedded() was used both to attach an applet and to detach
anything from a stream interface. Split it into si_prepare_none() to
detach and si_prepare_applet() to attach an applet.
si->conn->target is now assigned from within these two functions instead
of their respective callers.
The object type was added to "struct appctx". The purpose will be
to identify an appctx when the applet context is detached from the
stream interface. For now, it's still attached, so this patch only
adds the new type and does not replace its use.
Most of the times, the caller of objt_<type>(ptr) will know that <ptr>
is valid and of the correct type (eg: in an "if" condition). Let's provide
an unsafe variant that does not perform the check again for these usages.
The new functions are called "__objt_<type>".
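A minimal sketch of the safe/unsafe pair (the obj_type is placed first so
a plain cast works; the real code uses container_of):

enum obj_type_sketch { OBJ_TYPE_NONE_SK, OBJ_TYPE_CONN_SK, OBJ_TYPE_APPCTX_SK };

struct conn_obj_sketch { enum obj_type_sketch obj_type; /* ... */ };

/* unsafe variant : the caller has already validated the type */
static inline struct conn_obj_sketch *__objt_conn_sketch(enum obj_type_sketch *t)
{
    return (struct conn_obj_sketch *)t;
}

/* safe variant : checks the pointer and the type */
static inline struct conn_obj_sketch *objt_conn_sketch(enum obj_type_sketch *t)
{
    if (!t || *t != OBJ_TYPE_CONN_SK)
        return (struct conn_obj_sketch *)0;
    return __objt_conn_sketch(t);
}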
This is to be more consistent with the other functions. The only
reason why these functions used to return a value was to let the
caller adjust polling by itself, but now their only callers were
the si_shutr()/si_shutw() inline functions. Now these functions
do not depend anymore on the connection.
The connection variants of these functions now call
conn_data_stop_recv()/conn_data_stop_send() before returning, in order
not to require a return code anymore. The applet version does not
need this at all.
These functions induce a lot of ifs everywhere because they consider two
different cases, one which is where the connection exists and has a file
descriptor, and the other one which is the default case where at most an
applet has to be notified.
Let's have them in si_ops and automatically decide which one to use.
The connection shutdown sequence has been slightly simplified, and we
now clear the flags at the end.
Also we remove SHUTR_NOW after a shutw with nolinger, as it's cleaner
not to keep it.
There is a big problem with the way POST is handled for the admin
stats page. The POST parameters are extracted from some http-request
rules, and if not found, they return zero, hoping to be called again
when more data passes. This results in the HTTP analyser being called
several times and all the rules prior to the stats being executed
multiple times as well. That includes rewrite rules.
So instead of doing this, we now move all the processing of the stats
into the stats applet.
That way we just set the stats applet in the HTTP analyser when a stats
request is detected, and the applet takes the time it needs to read the
arguments and respond. We could even imagine improving the applet to
support requests larger than a single buffer.
The code was almost only moved and minimally changed. Several new HTTP
states were added to the stats applet to emit headers, redirects and
to read POST. It was necessary to do this because the headers sent
depend on the parsing of the POST request. In the end it's beneficial
because we removed two stream_int_retnclose() calls.
In preparation for moving the POST processing to the applet, we first
add new states to the HTTP I/O handler. Till now st0 was only 0/1 for
start/end. We now replace it with an enum.
We handle "http-request redirect" with a log-format string now, but we
leave "redirect" unaffected.
Note that the control of the special "/" case is moved from the runtime
execution to the configuration parsing. If the format rule list is
empty, the build_logline() function does nothing.
We now have the following enums and all related functions return them and
consume them :
enum pat_match_res {
PAT_NOMATCH = 0, /* sample didn't match any pattern */
PAT_MATCH = 3, /* sample matched at least one pattern */
};
enum acl_test_res {
ACL_TEST_FAIL = 0, /* test failed */
ACL_TEST_MISS = 1, /* test may pass with more info */
ACL_TEST_PASS = 3, /* test passed */
};
enum acl_cond_pol {
ACL_COND_NONE, /* no polarity set yet */
ACL_COND_IF, /* positive condition (after 'if') */
ACL_COND_UNLESS, /* negative condition (after 'unless') */
};
It's just in order to avoid doubts when reading some code.
This patch just renames functions, types and enums. No code was changed.
A significant number of files were touched, especially the ACL arrays,
so it is likely that some external patches will not apply anymore.
One important thing is that we had to split ACL_PAT_* into two groups :
- ACL_TEST_{PASS|MISS|FAIL}
- PAT_{MATCH|UNMATCH}
A future patch will enforce enums on all these places to avoid confusion.
This patch just moves code without any change.
The ACL are just the association between sample and pattern. The pattern
contains the match method and the parse method. These two things are
different. This patch cleans the code by splitting it.
This will be used later with maps. Each map will associate an entry with
a sample_storage value.
This patch changes the "parse" prototype and all the parsing methods.
The goal is to associate "struct sample_storage" to each entry of
"struct acl_pattern". Only the "parse" function can add the sample value
into the "struct acl_pattern".
The map feature will need to match acl patterns. This patch extracts
the matching function from the global ACL function "acl_exec_cond".
The code was only moved to its own function, no functional changes were made.
With this split, the pattern indexation can apply to any source. The map
feature needs this functionality because the map cannot be loaded with the
same file format as the ones supported by acl_read_patterns_from_file().
The code was only moved to its own function, no functional changes were made.
If the acl keyword is a "fetch", the dedicated parsing function
"sample_parse_expr()" is used. Otherwise, the acl parsing function
"parse_acl_expr()" is extended to understand the syntax of a series
of converters placed after the "fetch" keyword.
Before this patch, each acl uses a "struct sample_fetch" and executes
it with the "<fetch>->process()" function. Now, the dedicated function
"sample_process()" is called.
These syntaxes are now available:
acl bad req.hdr(host),lower -m str www
http-request redirect prefix /go-away if bad
acl bad hdr_beg(host),lower www
http-request redirect prefix /go-away if bad
When parsing track-sc* actions in tcp-request rules, the track-sc
identifier number is automatically computed and displayed with %d in
error messages. But the ID had become wrong since we introduced sc0: we
kept reporting id+1 in error messages, causing some confusion.
No backport is needed.
Add a DRAIN sub-state for a server, which will be shown on the stats
page instead of UP if its effective weight is zero.
Also, log if a server enters or leaves the DRAIN state as the result
of an agent check.
Signed-off-by: Simon Horman <horms@verge.net.au>
The column used to report the throttle percentage when a server is in
slowstart is based on the time only. This is wrong, because server weights
in slowstart are updated at most once a second, so the reported value is
wrong at least for one second during each step, which means all the time
when using short delays (< 20s).
The second point is that it's disturbing to see a weight < 100% without
any throttle at the end of the period (during the last second), because
the effective weight has not yet been updated.
Instead, we now compute the exact ratio between eweight and uweight and
report it. It's always accurate and describes the value being used instead
of using only the date.
It can be backported to 1.4 though it's not particularly important.
A crash was reported by Igor at owind when changing a server's weight
on the CLI. Lukas Tribus could reproduce a related bug where setting
a server's weight would result in the new weight being multiplied by
the initial one. The two bugs are the same.
The incorrect weight calculation results in the total farm weight being
larger than what was initially allocated, causing the map index to be out
of bounds on some hashes. It's easy to reproduce using "balance url_param"
with a variable param, or with "balance static-rr".
It appears that the calculation is made at many places and is not always
right and not always wrong the same way. Thus, this patch introduces a
new function "server_recalc_eweight()" which is dedicated to this task
of computing ->eweight from many other elements including uweight and
current time (for slowstart), and all users now switch to use this
function.
The patch is a bit large but the code was not trivially fixable in a way
that could guarantee this situation would not occur anymore. The fix is
much more readable and has been verified to work with all algorithms,
with both consistent and map-based hashes, and even with static-rr.
Slowstart was tested as well, just like enable/disable server.
The same bug is very likely present in 1.4 as well, so the patch will
probably need to be backported even though it will not apply as-is.
Thanks to Lukas and Igor for the information they provided to reproduce it.
Parametrise the following functions over the check of a server
* set_server_down
* set_server_up
* srv_getinter
* server_status_printf
* set_server_check_status
* set_server_disabled
* set_server_enabled
Generally the server parameter of these functions has been removed.
Where it is still needed it is obtained using check->server.
This is in preparation for associating an agent check
with a server, which runs as well as the server's existing check.
By parametrising these functions they may act on each of the checks
without further significant modification.
Explanation of the SSP_O_HCHK portion of this change:
* Prior to this patch SSP_O_HCHK served a single purpose, which was
to tell server_status_printf() whether it should print
the details of the check of a server or not.
With the parametrisation that this patch adds there are two cases.
1) Printing the details of the check, in which case a
valid check parameter is needed.
2) Not printing the details of the check, in which case
the contents of the check parameter are unused.
In case 1) we could pass SSP_O_HCHK and a valid check and,
in case 2) we could pass !SSP_O_HCHK and any value for check
including NULL.
If NULL is used for case 2), then SSP_O_HCHK becomes superfluous,
and as NULL is used for case 2), SSP_O_HCHK has been removed.
Signed-off-by: Simon Horman <horms@verge.net.au>
When a process with large stick tables is replaced by a new one and remains
present until the last connection finishes, it keeps these data in memory
for nothing since they will never be used anymore by incoming connections,
except during syncing with the new process. This is especially problematic
when dealing with long session protocols such as WebSocket as it becomes
possible to stack many processes and eat a lot of memory.
So the idea here is to know if a table still needs to be synced or not,
and to purge all unused entries once the sync is complete. This means that
after a few hundred milliseconds when everything has been synchronized with
the new process, only a few entries will remain allocated (only the ones
held by sessions during the restart) and all the remaining memory will be
freed.
Note that we carefully do that only after the grace period is expired so as
not to impact a possible proxy that needs to accept a few more connections
before leaving.
Doing this required to add a sync counter to the stick tables, to know how
many peer sync sessions are still in progress in order not to flush the entries
until all synchronizations are completed.
In preparation of more flexibility in the stick counters, make their
number configurable. It still defaults to 3 which is the minimum
accepted value. Changing the value alone is not sufficient to get
more counters, some bitfields still need to be updated and the TCP
actions need to be updated as well, but this update tries to be
easier, which is nice for experimentation purposes.
This function is also called directly from backend.c, so let's stop
building fake args to call it as a sample fetch, and have a lower
layer more generic function instead.
We're having a lot of duplicate code just because of minor variants between
fetch functions that could be dealt with if the functions had the pointer to
the original keyword, so let's pass it as the last argument. An earlier
version used to pass a pointer to the sample_fetch element, but this is not
the best solution for two reasons :
- fetch functions will solely rely on the keyword string
- some other smp_fetch_* users do not have the pointer to the original
keyword and were forced to pass NULL.
So finally we're passing a pointer to the keyword as a const char *, which
perfectly fits the original purpose.
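Under the assumption that the other parameters keep their earlier form,
the extended prototype looks like this sketch:

struct proxy;
struct session;
struct arg;
struct sample;

/* the keyword string is passed as the last argument so that a single
 * function can serve several keyword variants */
typedef int (*sample_fetch_fn)(struct proxy *px, struct session *l4,
                               void *l7, unsigned int opt,
                               const struct arg *args,
                               struct sample *smp, const char *kw);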
Remove event_accept() in include/proto/proto_http.h and use correct function
name in other two files instead of event_accept().
Signed-off-by: Godbach <nylzhaowei@gmail.com>
It was a bit inconsistent to have gpc start at 0 and sc start at 1,
so make sc start at zero like gpc. No previous release was issued
with sc3 anyway, so no existing setup should be affected.
Some actions were clearly missing to process response headers. This
patch adds a new "http-response" ruleset which provides the following
actions :
- allow : stop evaluating http-response rules
- deny : stop and reject the response with a 502
- add-header : add a header in log-format mode
- set-header : set a header in log-format mode
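For example (header names and values chosen arbitrarily):

    http-response add-header X-Via node1
    http-response set-header Server haproxy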
Since commit cfd97c6f was merged into 1.5-dev14 (BUG/MEDIUM: checks:
prevent TIME_WAITs from appearing also on timeouts), some valid health
checks sometimes used to show some TCP resets. For example, this HTTP
health check sent to a local server :
19:55:15.742818 IP 127.0.0.1.16568 > 127.0.0.1.8000: S 3355859679:3355859679(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7>
19:55:15.742841 IP 127.0.0.1.8000 > 127.0.0.1.16568: S 1060952566:1060952566(0) ack 3355859680 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7>
19:55:15.742863 IP 127.0.0.1.16568 > 127.0.0.1.8000: . ack 1 win 257
19:55:15.745402 IP 127.0.0.1.16568 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257
19:55:15.745488 IP 127.0.0.1.8000 > 127.0.0.1.16568: FP 1:146(145) ack 23 win 257
19:55:15.747109 IP 127.0.0.1.16568 > 127.0.0.1.8000: R 23:23(0) ack 147 win 257
After some discussion with Chris Huang-Leaver, it appeared clear that
what we want is to only send the RST when we have no other choice, which
means when the server has not closed. So we still keep SYN/SYN-ACK/RST
for pure TCP checks, but don't want to see an RST emitted as above when
the server has already sent the FIN.
The solution against this consists in implementing a "drain" function at
the protocol layer, which, when defined, causes as much as possible of
the input socket buffer to be flushed to make recv() return zero so that
we know that the server's FIN was received and ACKed. On Linux, we can make
use of MSG_TRUNC on TCP sockets, which has the benefit of draining everything
at once without even copying data. On other platforms, we read up to one
buffer of data before the close. If recv() manages to get the final zero,
we don't disable lingering. Same for hard errors. Otherwise we do.
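The Linux draining step can be sketched as follows (simplified, without
the retry loop and the portable read-based fallback):

#include <sys/socket.h>
#include <limits.h>

/* Discard pending input without copying. Returns the number of bytes
 * discarded, 0 once the FIN has been received, or -1 with errno set
 * to EAGAIN when nothing is pending. */
static int tcp_drain_sketch(int fd)
{
    return recv(fd, NULL, INT_MAX, MSG_TRUNC | MSG_DONTWAIT | MSG_NOSIGNAL);
}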
In practice, on HTTP health checks we generally find that the close was
pending and is returned upon first recv() call. The network trace becomes
cleaner :
19:55:23.650621 IP 127.0.0.1.16561 > 127.0.0.1.8000: S 3982804816:3982804816(0) win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7>
19:55:23.650644 IP 127.0.0.1.8000 > 127.0.0.1.16561: S 4082139313:4082139313(0) ack 3982804817 win 32792 <mss 16396,nop,nop,sackOK,nop,wscale 7>
19:55:23.650666 IP 127.0.0.1.16561 > 127.0.0.1.8000: . ack 1 win 257
19:55:23.651615 IP 127.0.0.1.16561 > 127.0.0.1.8000: P 1:23(22) ack 1 win 257
19:55:23.651696 IP 127.0.0.1.8000 > 127.0.0.1.16561: FP 1:146(145) ack 23 win 257
19:55:23.652628 IP 127.0.0.1.16561 > 127.0.0.1.8000: F 23:23(0) ack 147 win 257
19:55:23.652655 IP 127.0.0.1.8000 > 127.0.0.1.16561: . ack 24 win 257
This change should be backported to 1.4 which is where Chris encountered
this issue. The code is different, so probably the tcp_drain() function
will have to be put in the checks only.
The req.hdr and res.hdr fetch methods do not work well on headers which
are allowed to contain commas, such as User-Agent, Date or Expires.
More specifically, full-length matching is impossible if a comma is
present.
This patch introduces 4 new fetch functions which are designed to work
with these full-length headers :
- req.fhdr, req.fhdr_cnt
- res.fhdr, res.fhdr_cnt
These ones do not stop at commas and permit returning full-length header
values.
Since 1.5-dev12 and commit 3bf1b2b8 (MAJOR: channel: stop relying on
BF_FULL to take action), the HTTP parser switched to channel_full()
instead of BF_FULL to decide whether a buffer had enough room to start
parsing a request or response. The problem is that channel_full()
intentionally ignores outgoing data, so a corner case exists where a
large response might still be left in a response buffer with just a
few bytes left (much less than the reserve), enough to accept a second
response past the last data, but not enough to permit the HTTP processor
to add some headers. Since all the processing relies on this space being
available, we can get some random crashes when clients pipeline requests.
The analysis of a core from haproxy configured with 20480 bytes buffers
shows this : with enough "luck", when sending back the response for the
first request, the client is slow, the TCP window is congested, the socket
buffers are full, and haproxy's buffer fills up. We still have 20230 bytes
of response data in a 20480 response buffer. The second request is sent to
the server which returns 214 bytes which fit in the small 250 bytes left
in this buffer. And the buffer arrangement makes it possible to escape all
the controls in http_wait_for_response() :
|<------ response buffer = 20480 bytes ------>|
[ 2/2 | 3 | 4 | 1/2 ]
^ start of circular buffer
1/2 = beginning of previous response (18240)
2/2 = end of previous response (1990)
3 = current response (214)
4 = free space (36)
- channel_full() returns false (20230 bytes are going to leave)
- the response headers do not wrap at the end of the buffer
- the remaining linear room after the headers is larger than the
reserve, because it's the previous response which wraps :
=> response is processed
Header rewriting causes it to reach 260 bytes, 10 bytes larger than what
the buffer could hold. So all computations during header addition are
wrong and lead to the corruption we've observed.
All the conditions are very hard to meet (which explains why it took
almost one year for this bug to show up) and are almost impossible to
reproduce on purpose on a test platform. But the bug is clearly there.
This issue was reported by Dinko Korunic who kindly devoted a lot of
time to provide countless traces and cores, and to experiment with
troubleshooting patches to knock the bug down. Thanks Dinko!
No backport is needed, but all 1.5-dev versions between dev12 and dev18
included must be upgraded. A workaround consists in setting option
forceclose to prevent pipelined requests from being processed.
By properly assigning the flags and values, it becomes easier to add
more tracked counters, for example for experimentation. It also slightly
reduces the code and the number of tests. No counters were added with
this patch.
This patch adds a "scope" box in the statistics page in order to
display only proxies with a name that contains the requested value.
The scope filter is preserved across all clicks on the page.
While ACL args were resolved after all the config was parsed, it was not the
case with sample fetch args because they're almost everywhere now.
The issue is that ACLs now solely rely on sample fetches, so their args
resolving doesn't work anymore. And many fetches involving a server, a
proxy or a userlist don't work at all.
The real issue is that at the bottom layers we have no information about
proxies, line numbers, even ACLs in order to report understandable errors,
and that at the top layers we have no visibility over the locations where
fetches are referenced (think log node).
After several unsatisfying solution attempts, we now have a new concept
of an args list. The principle is that every proxy has a list head
which contains a number of indications such as the config keyword, the
context where it's used, the file and line number, etc... and a list of
arguments. This list head is of the same type as the elements, so it
serves as a template for adding new elements. This way, it is filled from
top to bottom by the callers with the information they have (eg: line
numbers, ACL name, ...) and the lower layers just have to duplicate it and
add an element when they face an argument they cannot resolve yet.
Then at the end of the configuration parsing, a loop passes over each
proxy's list and resolves all the args in sequence. And this way there is
all necessary information to report verbose errors.
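To illustrate, the list head and its elements can be sketched like this
(field names are indicative, not necessarily the exact ones; struct list
is the usual dual-linked list) :

  struct arg;                                /* an argument being resolved */
  struct list { struct list *n, *p; };       /* dual-linked list element */

  /* One entry per unresolvable argument. The head embedded in the proxy
   * uses the same type and carries the context filled in by the callers,
   * so lower layers only duplicate it and chain a new element.
   */
  struct arg_list {
          struct list list;     /* chaining with the proxy's list head */
          struct arg *arg;      /* the argument that could not be resolved */
          int arg_pos;          /* position of the arg in the keyword */
          int ctx;              /* context : ACL, log-format, unique-id, ... */
          const char *kw;       /* the configuration keyword involved */
          const char *file;     /* file where the keyword was parsed */
          int line;             /* line where the keyword was parsed */
  };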
The first immediate benefit is that for the first time we get a very precise
location of issues (arg number in a keyword in its context, ...). Second,
in order to do this we had to parse log-format and unique-id-format a bit
earlier, so that was a great opportunity for doing so when the directives
are encountered (unless it's a default section). This way, the recorded
line numbers for these args are the ones of the place where the log format
is declared, not the end of the file.
Userlists report slightly more information now. They're the only remaining
ones in the ACL resolving function.
The acl_expr struct used to hold a pointer to the ACL keyword. But since
we now have all the relevant pointers, we don't need that anymore; we just
need the keyword as a string in order to return warnings
and error messages.
So let's change this in order to remove the dependency on the acl_keyword
struct from acl_expr.
During this change, acl_cond_kw_conflicts() used to return a pointer to an
ACL keyword but had to be changed to return a const char* for the same reason.
The ACLs now use the fetch's ->use and ->val to decide upon compatibility
between the place where they are used and where the information is fetched.
The code is capable of reporting warnings about very fine incompatibilities
between certain fetches and an exact usage location, so it is expected that
some new warnings will be emitted on some existing configurations.
Two degrees of detection are provided :
- detecting ACLs that never match
- detecting keywords that are ignored
All tests show that this seems to work well, though bugs are still possible.
Proxy's acl_requires was a copy of all bits taken from ACLs, but we'll
get rid of ACL flags and only rely on sample fetches soon. The proxy's
acl_requires was only used to allocate an HTTP context when needed, and
was even forced in HTTP mode. So better have a flag which exactly says
what it's supposed to be used for.
ACL fetch functions used to directly reference a fetch function. Now
that all ACL fetches have their sample fetches equivalent, we can make
ACLs reference a sample fetch keyword instead.
In order to simplify the code, a sample keyword name may be NULL if it
is the same as the ACL's, which is the most common case.
A minor change appeared: http_auth always expects one argument, though
the ACL allowed it to be missing and reported it as such afterwards, so
the ACL was fixed to match this. This is not really a bug.
The file acl.c is a real mess, it both contains functions to parse and
process ACLs, and some sample extraction functions which act on buffers.
Some other payload analysers were arbitrarily dispatched to proto_tcp.c.
So now we're moving all payload-based fetches and ACLs to payload.c
which is capable of extracting data from buffers and relies on everything
that is protocol-independent. That way we can safely inflate this file
and only use the other ones when some fetches are really specific (eg:
HTTP, SSL, ...).
As a result of this cleanup, the following new sample fetches became
available even if they're not really useful :
always_false, always_true, rep_ssl_hello_type, rdp_cookie_cnt,
req_len, req_ssl_hello_type, req_ssl_sni, req_ssl_ver, wait_end
The function 'acl_fetch_nothing' was wrong and never used anywhere so it
was removed.
The "rdp_cookie" sample fetch used to have a mandatory argument while it
was optional in ACLs, which are supposed to iterate over RDP cookies. So
we're making it optional as a fetch too, and it will return the first one.
If a log-format involves some sample fetches that may not be present at
the logging instant, we can now report a warning.
Note that this is done both for log-format and for add-header and carefully
respects the original fetch keyword's capabilities.
Samples fetches were relying on two flags SMP_CAP_REQ/SMP_CAP_RES to describe
whether they were compatible with requests rules or with response rules. This
was never reliable because we need a finer granularity (eg: an HTTP request
method needs to parse an HTTP request, and is available past this point).
Some fetches are also dependent on the context (eg: "hdr" uses request or
response depending where it's involved, causing some ambiguity).
In order to solve this, we need to precisely indicate in fetches what they
use, and their users will have to compare with what they have.
So now we have a bunch of bits indicating where the sample is fetched in the
processing chain, with a few variants indicating for some of them if it is
permanent or volatile (eg: an HTTP status is stored into the transaction so
it is permanent, despite being caught in the response contents).
The fetches also have a second mask indicating their validity domain. This one
is computed from a conversion table at registration time, so there is no need
for doing it by hand. This validity domain consists of a bitmask with one bit
set for each usage point in the processing chain. Some provisions were made
for upcoming controls such as connection-based TCP rules which apply on top of
the connection layer but before instantiating the session.
Then everywhere a fetch is used, the bit for the control point is checked in
the fetch's validity domain, and it becomes possible to finely ensure that a
fetch will work or not.
Note that we need these two separate bitfields because some fetches are usable
both in request and response (eg: "hdr", "payload"). So the keyword will have
a "use" field made of a combination of several SMP_USE_* values, which will be
converted into a wider list of SMP_VAL_* flags.
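The check performed wherever a fetch is used then boils down to a single
AND. In this sketch, <where> is the SMP_VAL_* bit of the current rule
location and report_fetch_warning() is a hypothetical reporting helper :

  unsigned int where = SMP_VAL_FE_HRQ_HDR;   /* eg: frontend http-request */

  if (!(fetch->val & where))
          report_fetch_warning(fetch->kw, where);  /* never usable here */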
The knowledge of permanent vs dynamic information has disappeared for now, as
it was never used. Later we'll probably reintroduce it differently when
dealing with variables. Its only use at the moment could have been to avoid
caching a dynamic rate measurement, but nothing is cached as of now.
Break out set weight processing code.
This is in preparation for reusing the code.
Also, remove duplicate check in nested if clauses.
(px->lbprm.algo & BE_LB_PROP_DYN) is checked by
the immediately enclosing if clause, so there is no need
to check it a second time.
Signed-off-by: Simon Horman <horms@verge.net.au>
Commit 2b0108ad accidentally got rid of the ability to emit a "-" for
empty log fields. This can happen for captured request and response
cookies, as well as for fetches. Since we don't want to have this done
for headers however, we set the default log method when parsing the
format. It is still possible to force the desired mode using +M/-M.
We now have http_apply_redirect_rule() which does all the redirect-specific
job instead of having this inside http_process_req_common().
Also, one of the benefits gained from unifying this code is that both
keep-alive and close responses do emit the PR-- flags. The fix for the
flags could probably be backported to 1.4 though it's very minor.
The previous function http_perform_redirect() was becoming confusing
so it was renamed http_perform_server_redirect() since it only applies
to server-based redirection.
At the moment, we need trash chunks almost everywhere and the only
correctly implemented one is in the sample code. Let's move this to
the chunks so that all other places can use this allocator.
Additionally, the get_trash_chunk() function now really returns two
different chunks. Previously it used to always overwrite the same
chunk and point it to a different buffer, which was a bit tricky
because it's not obvious that two consecutive results do alias each
other.
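The allocator is essentially a two-entry round robin; a simplified sketch
of the idea :

  static struct chunk trash_chunk1, trash_chunk2;

  struct chunk *get_trash_chunk(void)
  {
          static struct chunk *chunk = &trash_chunk2;

          /* alternate between the two chunks so that two consecutive
           * callers never receive aliased storage.
           */
          chunk = (chunk == &trash_chunk1) ? &trash_chunk2 : &trash_chunk1;
          chunk->len = 0;
          return chunk;
  }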
The dumpstats code looks like a spaghetti plate. Several functions are
supposed to be able to do several things but rely on complex states to
dispatch the work to independent functions. Most of the HTML output is
performed within the switch/case statements of the whole state machine.
Let's clean this up by adding new functions to emit the data and have
a few more iterators to avoid relying on so complex states.
The new stats dump sequence looks like this for CLI and for HTTP :
cli_io_handler()
-> stats_dump_sess_to_buffer() // "show sess"
-> stats_dump_errors_to_buffer() // "show errors"
-> stats_dump_raw_info_to_buffer() // "show info"
-> stats_dump_raw_info()
-> stats_dump_raw_stat_to_buffer() // "show stat"
-> stats_dump_csv_header()
-> stats_dump_proxy()
-> stats_dump_px_hdr()
-> stats_dump_fe_stats()
-> stats_dump_li_stats()
-> stats_dump_sv_stats()
-> stats_dump_be_stats()
-> stats_dump_px_end()
http_stats_io_handler()
-> stats_http_redir()
-> stats_dump_http() // also emits the HTTP headers
-> stats_dump_html_head() // emits the HTML headers
-> stats_dump_csv_header() // emits the CSV headers (same as above)
-> stats_dump_http_info() // note: ignores non-HTML output
-> stats_dump_proxy() // same as above
-> stats_dump_http_end() // emits HTML trailer
The log-format parser reached a limit making it hard to add new features.
It also suffers from a weak handling of certain incorrect corner cases,
for example "%{foo}" is emitted as a litteral while syntactically it's an
argument to no variable. Also the argument parser had to redo some of the
job with some cases causing minor memory leaks (eg: ignored args).
This work aims at improving the situation so that slightly better reporting
is possible and that it becomes possible to extend the log format. The code
has a few more states but looks significantly simpler. The parser is now
capable of reporting ignored arguments and truncated lines.
stream_int_chk_rcv_conn() did not clear connection flags before updating them. It
is unclear whether this could have caused the stalled transfers that have been
reported since dev15.
In order to avoid such further issues, we now use a simple inline function to do
all the job.
Looking at the assembly code that updt_fd() and alloc/release_spec_entry
produce in the polling loops, it's clear that gcc has to recompute pointers
several times in a row because of limited spare registers. By better
grouping adjacent structure updates, we improve the code size by around
60 bytes in the fast path on x86.
The stick counters were in two distinct sets of struct members,
causing some code to be duplicated. Now we use an array, which
enables some processing to be performed in loops. This allowed
the code to be shrunk by 700 bytes.
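Schematically, the two named sets of members become an array that loops
can walk (names are illustrative; release_tracked_entry() is hypothetical) :

  struct stkctr {
          struct stksess *entry;     /* tracked entry, NULL if unused */
          struct stktable *table;    /* table where the entry lives */
  };

  /* in the session : struct stkctr stkctr[2]; instead of stkctr1_* and
   * stkctr2_* members, so releasing the refcounts becomes a loop :
   */
  int i;
  for (i = 0; i < 2; i++)
          if (s->stkctr[i].entry)
                  release_tracked_entry(s->stkctr[i].table, s->stkctr[i].entry);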
Until now it was only possible to use track-sc1/sc2 with "src" which
is the IPv4 source address. Now we can use track-sc1/sc2 with any fetch
as well as any transformation type. It works just like the "stick"
directive.
Samples are automatically converted to the correct types for the table.
Only "tcp-request content" rules may use L7 information, and such information
must already be present when the tracking is set up. For example it becomes
possible to track the IP address passed in the X-Forwarded-For header.
HTTP request processing now also considers tracking from backend rules
because we want to be able to update the counters even when the request
was already parsed and tracked.
Some more controls need to be performed (eg: samples do not distinguish
between L4 and L6).
Sessions using client certs are huge (more than 1 kB) and do not fit
in the session cache, or require a huge cache.
In this new implementation, sslcachesize sets a number of available blocks
instead of a number of available sessions.
Each block is large enough (128 bytes) to store a simple session (without
client certs).
Huge sessions will take multiple blocks, depending on the client certificate
size: for example, a session slightly above 1 kB spans 9 blocks of 128 bytes.
Note: some unused code for session sync with remote peers was temporarily
removed.
When the PROXY protocol header is expected and fails, leading to an
abort of the incoming connection, we now emit a log message. If option
dontlognull is set and it was just a port probe, then nothing is logged.
Since the introduction of SSL, it became quite annoying not to get any useful
info in logs about handshake failures. Let's improve reporting for embryonic
sessions by checking a per-connection error code and reporting it into the logs
if an error happens before the session is completely instantiated.
The "dontlognull" option is supported in that if a connection does not talk
before being aborted, nothing will be emitted.
At the moment, only timeouts are considered for SSL and the PROXY protocol,
but next patches will handle more errors.
Commit 0ffde2cc in 1.5-dev13 tried to always disable polling on file
descriptors when errors were encountered. Unfortunately it did not
always succeed in doing so because it relied on detecting polling
changes to disable it. Let's use a dedicated conn_stop_polling()
function that is unconditionally called upon error instead.
This managed to stop a busy loop observed when a health check makes
use of the send-proxy protocol and fails before the connection can
be established.
Commit 24db47e0 tried to improve support for delayed ACK upon connect
but it was incomplete, because checks with the proxy protocol would
always enable polling for data receive and there was no way of
distinguishing data polling and delayed ack.
So we add a distinct delack flag to the connect() function so that
the caller decides whether or not to use a delayed ack regardless
of pending data (eg: when send-proxy is in use). Doing so covers all
combinations of { (check with data), (sendproxy), (smart-connect) }.
Several places got the connection close sequence wrong because it
was not obvious. In practice we always need the same sequence when
aborting, so let's have a common function for this.
New option 'maxcompcpuusage' in global section.
Sets the maximum CPU usage HAProxy can reach before stopping the
compression for new requests or decreasing the compression level of
current requests. It works like 'maxcomprate' but is driven by the idle
CPU measurement.
This patch makes changes in the http_response_forward_body state
machine. It checks whether the compression algorithm has consumed data
before swapping the temporary and the input buffers, which prevents
zero-sized zlib chunks.
Instead of storing a couple of (int, ptr) in the struct connection
and the struct session, we use a different method : we only store a
pointer to an integer which is stored inside the target object and
which contains a unique type identifier. That way, the pointer allows
us to retrieve the object type (by dereferencing it) and the object's
address (by computing the displacement in the target structure). The
NULL pointer always corresponds to OBJ_TYPE_NONE.
This reduces the size of the connection and session structs. It also
simplifies target assignment and comparison.
In order to improve the generated code, we try to put the obj_type
element at the beginning of all the structs (listener, server, proxy,
si_applet), so that the original and target pointers are always equal.
A lot of code was touched by massive replaces, but the changes are not
that important.
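A simplified sketch of the mechanism, showing only the server case :

  enum obj_type {
          OBJ_TYPE_NONE = 0,        /* a NULL pointer also means "none" */
          OBJ_TYPE_LISTENER,
          OBJ_TYPE_PROXY,
          OBJ_TYPE_SERVER,
          OBJ_TYPE_APPLET,
  };

  struct server {
          enum obj_type obj_type;   /* first member: &srv->obj_type == srv */
          /* ... rest of the server ... */
  };

  static inline enum obj_type obj_type(enum obj_type *t)
  {
          return t ? *t : OBJ_TYPE_NONE;
  }

  static inline struct server *objt_server(enum obj_type *t)
  {
          if (obj_type(t) != OBJ_TYPE_SERVER)
                  return NULL;
          /* the type marker is at offset zero, so a cast recovers the object */
          return (struct server *)t;
  }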
Before connections were introduced, it was possible to connect an
external task to a stream interface. However it was left as an
exercise for the brave implementer to find how that ought to be
done.
The feature was broken since the introduction of connections and
was never fixed since due to lack of users. Better remove this dead
code now.
Hijackers were functions designed to inject data into channels in the
distant past. They became unused around 1.3.16, and since there has
not been any user of this mechanism to date, it's uncertain whether
the mechanism still works (and it's not really useful anymore). So
better remove it as well as the pointer it uses in the channel struct.
si_fd() is not used a lot, and breaks builds on OpenBSD 5.2 which
defines this name for its own purpose. It's easy enough to remove
this one-liner function, so let's do it.
ev_sepoll already provides everything needed to manage FD events
by only manipulating the speculative I/O list. Nothing there is
sepoll-specific so move all this to fd.
At the moment sepoll is not 100% event-driven, because a call to fd_set()
on an event which is already being polled will not change its state.
This causes issues with OpenSSL because if some I/O processing is interrupted
after clearing the I/O event (eg: read all data from a socket, can't put it
all into the buffer), then there is no way to call the SSL_read() again once
the buffer releases some space.
The only real solution is to go 100% event-driven. The principle is to use
the spec list as an event cache and that each time an I/O event is reported
by epoll_wait(), this event is automatically scheduled for addition to the
spec list for future calls until the consumer explicitly asks for polling
or stopping.
Doing this is a bit tricky because sepoll used to provide a substantial
number of optimizations such as event merging. These optimizations have
been maintained : a dedicated update list is affected when events change,
but not the event list, so that updates may cancel themselves without any
side effect such as displacing events. A specific case was considered for
handling newly created FDs as soon as they are detected from within the
poll loop. This ensures that their read or write operation will always be
attempted as soon as possible, thus reducing the number of poll loops and
process_session wakeups. This is especially true for newly accepted fds
which immediately perform their first recv() call.
Two new flags were added to the fdtab[] struct to tag the fact that a file
descriptor already exists in the update list. One flag indicates that a
file descriptor is new and has just been created (fdtab[].new) and the other
one indicates that a file descriptor is already referenced by the update list
(fdtab[].updated). Even if the FD state changes during operations or if the
fd is closed and replaced, it's not an issue because the update flag remains
and is easily spotted during list walks. The flag must absolutely reflect the
presence of the fd in the update list in order to avoid overflowing the update
list with more events than there are distinct fds.
Note that this change also recovers the small performance loss introduced
by its connection counter-part and goes even beyond.
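The queueing primitive is essentially the following (simplified sketch;
fdtab[], fd_updt[] and fd_nbupdt are the poller's globals) :

  static inline void updt_fd(const int fd)
  {
          if (fdtab[fd].updated)
                  return;                  /* already in the update list */
          fdtab[fd].updated = 1;
          fd_updt[fd_nbupdt++] = fd;       /* processed before next poll */
  }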
This is the first step of a series of changes aiming at making the
polling totally event-driven. This first change consists in only
remembering at the connection level whether an FD was enabled or not,
regardless of the fact it was being polled or cached. From now on, an
EAGAIN will always be considered as a change so that the pollers are
able to manage a cache and to flush it based on such events. One
noticeable effect is that conn_fd_handler() is called once more per
session (a minimum of 6 calls instead of 5) but other update functions
are called less often.
Note that the performance loss caused by this change at the moment is
quite significant, around 2.5%, but the change is needed to have SSL
working correctly in all situations, even when data were read from the
socket and stored in the invisible cache, waiting for some room in the
channel's buffer.
Keys are copied from samples to stick_table_key. If a key is larger
than the stick_table_key, we have an overflow. In practice it does not
happen because it requires :
1) a configuration with tune.bufsize larger than BUFSIZE (common)
2) a stick-table configured with keys strictly larger than buffers
3) extraction of data larger than BUFSIZE (eg: using payload())
Points 2 and 3 don't make any sense for a real world configuration. That
said, the issue needs to be fixed. The solution consists in allocating it the
same size as the global buffer size, just like the samples. This fixes the
issue.
Sample conversions rely on two alternative buffers which were previously
allocated as static bufs of size BUFSIZE. Now they're initialized to the
global buffer size. It was the same for HTTP authentication. Note that it
seems that none of them was prone to any mistake when dealing with the
buffer size, but better stay on the safe side by maintaining the old
assumption that a trash buffer is always "large enough".
We will need to be able to switch server connections on a session and
to keep idle connections. In order to achieve this, the preliminary
requirement is that the connections can survive the session and be
detached from them.
Right now they're still allocated at exactly the same place, so when
there is a session, there are always 2 connections. We could soon
improve on this by allocating the outgoing connection only during a
connect().
This current patch touches a lot of code and intentionally does not
change any functionality. Performance tests show no regression (even
a very minor improvement). The doc has not yet been updated.
This commit introduces HTTP compression using the zlib library.
http_response_forward_body has been modified to call the compression
functions.
This feature includes 3 algorithms: identity, gzip and deflate:
* identity: this is mostly for debugging, and it was useful for
developing the compression feature. With Content-Length in input, it
makes each chunk from the data available in the current buffer. With
chunks in input, it rechunks: the output chunks will be bigger or
smaller depending on the size of the input chunk and the size of the
buffer. Identity does not apply any change to the data.
* gzip: same as identity, but applying a gzip compression. The data
are deflated using the Z_NO_FLUSH flag in zlib. When there is no more
data in the input buffer, it flushes the data in the output buffer
(Z_SYNC_FLUSH). At the end of data, when it receives the last chunk in
input, or when there is no more data to read, it writes the end of
data with Z_FINISH and the ending chunk (see the sketch after this list).
* deflate: same as gzip, but with deflate algorithm and zlib format.
Note that this algorithm has ambiguous support on many browsers and
no support at all from recent ones. It is strongly recommended not
to use it for anything else than experimentation.
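For reference, the flush sequence described in the gzip item above maps
to plain zlib calls roughly as follows (a standalone sketch, not the
actual haproxy code; <in>/<in_len> and <out>/<out_len> are assumed
buffers) :

  #include <string.h>
  #include <zlib.h>

  z_stream strm;

  memset(&strm, 0, sizeof(strm));
  /* windowBits 15+16 selects the gzip wrapper; 15 alone would produce
   * the zlib (deflate) format. Z_BEST_SPEED is the fixed level.
   */
  deflateInit2(&strm, Z_BEST_SPEED, Z_DEFLATED, 15 + 16, 8,
               Z_DEFAULT_STRATEGY);

  strm.next_in  = in;  strm.avail_in  = in_len;
  strm.next_out = out; strm.avail_out = out_len;

  deflate(&strm, Z_NO_FLUSH);    /* more input data is still expected */
  deflate(&strm, Z_SYNC_FLUSH);  /* input buffer empty: push pending data */
  deflate(&strm, Z_FINISH);      /* last chunk received: emit the trailer */
  deflateEnd(&strm);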
You can't choose the compression level at the moment, it will be set to
Z_BEST_SPEED (1), as tests have shown very little benefit in terms of
compression ratio when going above for HTML contents, at the cost of
a massive CPU impact.
Compression will be activated depending on the Accept-Encoding request
header. With identity, that header is not considered.
To build HAProxy with zlib support, use USE_ZLIB=1 in the make
parameters.
This work was initially started by David Du Colombier at Exceliance.
Most calls to channel_forward() are performed with short byte counts and
are already optimized in channel_forward() taking just a few instructions.
Thus it's a waste of CPU cycles to call a function for this, let's just
inline the short byte count case and fall back to the common one for
remaining situations.
Doing so has increased the chunked encoding parser's performance by 12% !
ACLs and sample fetches use an args list, and it is really not convenient
to check for null args everywhere. Now, for empty args, we pass a constant
list containing only the end-of-list marker. This will allow us to remove
many useless checks.
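The constant list is just a shared end-of-list marker, something like
this (sketch; the exact declaration may differ) :

  /* fetches and ACLs without arguments all point here instead of NULL,
   * so callers can always dereference args[0] and test for ARGT_STOP.
   */
  struct arg empty_arg_list[1] = { { .type = ARGT_STOP } };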
This field was used to trace precisely where a session was terminated
but it did not survive code rearchitecture and was not used at all
anymore. Let's get rid of it.
With this commit, we now separate the channel from the buffer. This will
allow us to replace buffers on the fly without touching the channel. Since
nobody is supposed to keep a reference to a buffer anymore, doing so is not
a problem and will also permit some copy-less data manipulation.
Interestingly, these changes have shown a 2% performance increase on some
workloads, probably due to a better cache placement of data.
These two new log-format tags report the SSL protocol version (%sslv) and the
SSL ciphers (%sslc) used for the connection with the client. For instance, to
append this information just after the client's IP/port address information
on an HTTP log line, use the following configuration :
log-format %Ci:%Cp\ %sslv:%sslc\ [%t]\ %ft\ %b/%s\ %Tq/%Tw/%Tc/%Tr/%Tt\ %st\ %B\ %cc\ \ %cs\ %tsc\ %ac/%fc/%bc/%sc/%rc\ %sq/%bq\ %hr\ %hs\ %{+Q}r
It will report a line such as the following one :
Oct 12 20:47:30 haproxy[9643]: 127.0.0.1:43602 TLSv1:AES-SHA [12/Oct/2012:20:47:30.303] stick2~ stick2/s1 7/0/12/0/19 200 145 - - ---- 0/0/0/0/0 0/0 "GET /?t=0 HTTP/1.0"
When we start logging SSL information, we need the SSL struct to be
present even past the conn_xprt_close() call. In order to achieve this,
we should use refcounting on the connection and the transport layer. At
the moment it's not worth using plain refcounting as only the logs require
this, so instead of real refcounting we just use a flag which will be set
by the log subsystem when SSL data need to be logged.
What happens then is that the xprt->close() call is ignored and the
transport layer is closed again during session_free(), after the log
line is emitted.
When calling conn_xprt_close(), we always clear the transport pointer
so that all transport layers leave the connection in the same state after
a close. This will also make it safer and cheaper to call conn_xprt_close()
multiple times if needed.
Just like with the "bind" lines, we'll switch the "server" line
parsing to keyword registration. The code is essentially the same
as for bind keywords, with minor changes such as support for the
default-server keywords and support for variable argument count.
This callback sends a PROXY protocol line on the outgoing connection,
with the local and remote endpoint information. This is used for local
connections (eg: health checks) where the other end needs to have a
valid address and no connection is relayed.
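For an IPv4 connection, the line sent at the start of the outgoing
connection looks like this (as defined by the PROXY protocol
specification) :
  PROXY TCP4 192.168.0.1 192.168.0.11 56324 443\r\n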
It was previously in frontend.c but there is no reason for this anymore
considering that all the information involved is in the connection itself
only. Theoretically this should be in the socket layer but we don't have
this yet.
We absolutely want to disable FD polling after an error is detected,
otherwise the data layer has to do it and it's far from being obvious
at these layers.
The way we did it was a bit tricky in conn_update_*_polling and
conn_*_polling_changes. However it has almost no impact on performance
and code size both for the fast and slow path.
We'll now be able to remove some flag updates in the stream interface.
The connection flags have progressively been added one after the other
and were not very well organized. Some of them are often used together
and a number of operations are performed on the DATA/SOCK ENA/POL flags.
Thus, they have been reorganized so that flags that work together are
close to each other (allows immediate operands on ARM) and that polling
changes can be detected with fewer operations using a simple shift and
xor. The handshakes are now the last ones so that it will be easier to
add new ones after without risking a collision. All activity-related
flags are also grouped together.
The generic data-layer init callback is now used after the transport
layer is complete and before calling the data layer recv/send callbacks.
This allows the session to switch from the embryonic session data layer
to the complete stream interface data layer, by making conn_session_complete()
the data layer's init callback.
It still looks awkward that the init() callback must be used upon error,
but except by adding yet another one, it does not seem to be mergeable
into another function (eg: it should probably not be merged with ->wake
to avoid unneeded calls during the handshake, though semantically that
would make sense).
Instead of calling conn_notify_si() from the connection handler, we
now call data->wake(), which will allow us to use a different callback
with health checks.
Note that we still rely on a flag in order to decide whether or not
to call this function. The reason is that with embryonic sessions,
the callback is already initialized to si_conn_cb without the flag,
and we can't call the SI notify function in the leave path before
the stream interface is initialized.
This issue should be addressed by involving a different data_cb for
embryonic sessions and for stream interfaces, that would be changed
during session_complete() for the final data_cb.
Now conn->data will designate the data layer which is the client for
the transport layer. In practice it's the stream interface and will
soon also be the health checks.
While working on the changes required to make the health checks use the
new connections, it started to become obvious that some naming was not
logical at all in the connections. Specifically, it is not logical to
call the "data layer" the layer which is in charge for all the handshake
and which does not yet provide a data layer once established until a
session has allocated all the required buffers.
In fact, it's more a transport layer, which makes much more sense. The
transport layer offers a medium on which data can transit, and it offers
the functions to move these data when the upper layer requests this. And
it is the upper layer which iterates over the transport layer's functions
to move data which should be called the data layer.
The use case where it's obvious is with embryonic sessions : an incoming
SSL connection is accepted. Only the connection is allocated, not the
buffers nor stream interface, etc... The connection handles the SSL
handshake by itself. Once this handshake is complete, we can't use the
data functions because the buffers and stream interface are not there
yet. Hence we have to first call a specific function to complete the
session initialization, after which we'll be able to use the data
functions. This clearly proves that SSL here is only a transport layer
and that the stream interface constitutes the data layer.
A similar change will be performed to rename app_cb => data, but the
two could not be in the same commit for obvious reasons.
It removes the dependency on futexes or mutexes, but SSL performance
decreases with nbproc > 1 because switching processes forces session
renegotiation.
This can be useful on small systems which never intend to run in multi-process
mode.
We don't need to lock the memory when there is a single process. This can
make a difference on small systems where locking is much more expensive than
just a test.
This will be needed to find the stream interface from the connection
once they're detached, but in the more immediate term, we'll need this
for health checks since they don't use a stream interface.
Unix permissions are per-bind configuration line and not per listener,
so let's concretize this in the way the config is stored. This avoids
some unneeded loops to set permissions on all listeners.
The access level is not part of the unix perms so it has been moved
away. Once we can use str2listener() to set all listener addresses,
we'll have a bind keyword parser for this one.
Navigating through listeners was very inconvenient and error-prone. Not to
mention that listeners were linked in reverse order and reverted afterwards.
In order to definitely get rid of these issues, we now do the following :
- frontends have a dual-linked list of bind_conf
- frontends have a dual-linked list of listeners
- bind_conf have a dual-linked list of listeners
- listeners have a pointer to their bind_conf
This way we can now navigate from anywhere to anywhere and always find the
proper bind_conf for a given listener, as well as find the list of listeners
for a current bind_conf.
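The resulting linkage can be sketched as follows (simplified; struct
list is the usual dual-linked list) :

  struct bind_conf {
          struct list by_fe;        /* chained into the frontend's bind_conf list */
          struct list listeners;    /* head of the listeners on this bind line */
          /* file, line, per-"bind" settings, ... */
  };

  struct listener {
          struct list by_fe;            /* chained into the frontend's listener list */
          struct list by_bind;          /* chained into the bind_conf's listener list */
          struct bind_conf *bind_conf;  /* the "bind" line this listener belongs to */
          /* ... */
  };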
When an unknown "bind" keyword is detected, dump the list of all
registered keywords. Unsupported default alternatives are also reported
as "not supported".
Registering new SSL bind keywords was not particularly handy as it required
many #ifdef in cfgparse.c. Now the code has moved to ssl_sock.c which calls
a register function for all the keywords.
Error reporting was also improved by this move, because the called functions
build an error message using memprintf(), which can span multiple lines if
needed, and each of these errors will be displayed indented in the context of
the bind line being processed. This is important when dealing with certificate
directories which can report multiple errors.
With the arrival of SSL, the "bind" keyword has received even more options,
all of which are processed in cfgparse in a cumbersome way. So it's time to
let modules register their own bind options. This is done very similarly to
the ACLs, with a small difference: we distinguish between an unknown
option and a known but unimplemented one.
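Registration from a module then follows a pattern like this one (a
sketch; the exact structure layout may differ) :

  /* each entry : keyword, parsing function, number of extra args to skip */
  static struct bind_kw_list bind_kws = { "SSL", { }, {
          { "ssl", bind_parse_ssl, 0 },
          { "crt", bind_parse_crt, 1 },
          { NULL, NULL, 0 },
  }};

  __attribute__((constructor))
  static void __ssl_sock_init(void)
  {
          bind_register_keywords(&bind_kws);
  }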
Some settings need to be merged per-bind config line and are not necessarily
SSL-specific. It becomes quite inconvenient to have this ssl_conf SSL-specific,
so let's replace it with something more generic.
A side effect of this change is that the "ssl" keyword on "bind" lines is now
just a boolean and that "crt" is needed to designate certificate files or
directories.
Note that much refcounting was needed to have the free() work correctly due to
the number of cert aliases which can make a context be shared by multiple names.
SSL config holds many parameters which are per bind line and not per
listener. Let's use a per-bind line config instead of having it
replicated for each listener.
At the moment we only do this for the SSL part but this should probably
evolve to handle more of the configuration and maybe even the state per
bind line.
This SSL session cache was developed at Exceliance and is the same that
was proposed for stunnel and stud. It makes use of a shared memory area
between the processes so that sessions can be handled by any process. It
is only useful when haproxy runs with nbproc > 1, but it does not hurt
performance at all with nbproc = 1. The aim is to totally replace OpenSSL's
internal cache.
The cache is optimized for Linux >= 2.6 and specifically for x86 platforms.
On Linux/x86, it makes use of futexes for inter-process locking, with some
x86 assembly for the locked instructions. On other architectures, GCC
builtins are used instead, which are available starting from gcc 4.1.
On other operating systems, the locks fall back to pthread mutexes so
libpthread is automatically linked. It is not recommended since pthreads
are much slower than futexes. The lib is only linked if SSL is enabled.
CVE-2009-3555 suggests that client-initiated renegotiation should be
prevented in the middle of data. The workaround here consists in having
the SSL layer notify our callback about a handshake occurring, which in
turn causes the connection to be marked in the error state if it was
already considered established (which means if a previous handshake was
completed). The result is that the connection with the client is immediately
aborted and any pending data are dropped.
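The workaround fits in a small OpenSSL info callback; a simplified
sketch of the idea :

  static void ssl_sock_infocbk(const SSL *ssl, int where, int ret)
  {
          struct connection *conn = SSL_get_app_data(ssl);

          /* a handshake starting while the connection is already
           * considered established can only be a renegotiation :
           * flag a fatal error so the connection gets aborted.
           */
          if ((where & SSL_CB_HANDSHAKE_START) && (conn->flags & CO_FL_CONNECTED))
                  conn->flags |= CO_FL_ERROR;
  }

  /* installed once on the SSL context : */
  SSL_CTX_set_info_callback(ctx, ssl_sock_infocbk);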
This data layer supports socket-to-buffer and buffer-to-socket operations.
No sock-to-pipe nor pipe-to-sock functions are provided, since splicing does
not provide any benefit with data transformation. At best it could save a
memcpy() and avoid keeping a buffer allocated but that does not seem very
useful.
An init function and a close function are provided because the SSL context
needs to be allocated/freed.
A data-layer shutw() function is also provided because upon successful
shutdown, we want to store the SSL context in the cache in order to reuse
it for future connections and avoid a new key generation.
The handshake function is directly called from the connection handler.
At this point it is not certain whether this will remain this way or
if a new ->handshake callback will be added to the data layer so that
the connection handler doesn't care about SSL.
The sock-to-buf and buf-to-sock functions are all capable of enabling
the SSL handshake at any time. This also implies polling in the opposite
direction to what was expected. The upper layers must take that into
account (it is OK right now with the stream interface).
It appears that fd.h includes a number of unneeded files and was
included from standard.h, and as such served as an intermediary
to provide almost everything to everyone.
By removing its useless includes, a long dependency chain broke
but could easily be fixed.
Polling flags were set for data and sock layer, but while this does make
sense for the ENA flag, it does not for the POL flag which translates the
detection of an EAGAIN condition. So now we remove the {DATA,SOCK}_POL*
flags and instead introduce two new layer-independent flags (WANT_RD and
WANT_WR). These flags are only set when an EAGAIN is encountered so that
polling can be enabled.
In order for these flags to have any meaning they are not persistent and
have to be cleared by the connection handler before calling the I/O and
data callbacks. For this reason, change detection has been slightly
improved. Instead of comparing the WANT_* flags with CURR_*_POL, we only
check if the ENA status changes, or if the polling appears, since we don't
want to detect the useless poll to ena transition. Tests show that this
has eliminated one useless call to __fd_clr().
Finally, the conn_set_polling() function, which was becoming complex and
required complex operations from the caller, was split in two and merged
into its only two callers (conn_update_data_polling() and
conn_update_sock_polling()).
The two functions are now much smaller due to the less complex conditions.
Note that it would be possible to re-merge them and only pass a mask but
this does not appear much interesting.
The PROXY protocol is now decoded in the connection before other
handshakes. This means that it may be extracted from a TCP stream
before SSL is decoded from this stream.
When an incoming connection request is accepted, a connection
structure is needed to store its state. However we don't want to
fully initialize a session until the data layer is about to be
ready.
As long as the connection is physically stored into the session,
it's not easy to split both allocations.
As such, we only initialize the minimum requirements of a session,
which results in what we call an embryonic session. Then once the
data layer is ready, we can complete the session's initialization.
Doing so avoids buffers allocation and ensures that a session only
sees ready connections.
The frontend's client timeout is used as the handshake timeout. It
is likely that another timeout will be used in the future.
SSL needs to initialize the data layer before proceeding with data. At
the moment, this data layer is automatically initialized from itself,
which will not be possible once we extract connection from sessions
since we'll only create the data layer once the handshake is finished.
So let's have the application layer initialize the data layer before
using it.
Make it more obvious that this function does not depend on any knowledge
of the session. This is important to plan for TCP rules that can run on
connection without any initialized session yet.
The last uses of the stream interfaces were in tcp_connect_server() and
could easily and more appropriately be moved to its callers, si_connect()
and connect_server(), making a lot more sense.
Now the function should theoretically be usable for health checks.
It also appears more obvious that the file is split into two distinct
parts :
- the protocol layer used at the connection level
- the tcp analysers executing tcp-* rules and their samples/acls.
We need to have the source and destination addresses in the connection.
They were lying in the stream interface so let's move them. The flags
SI_FL_FROM_SET and SI_FL_TO_SET have been moved as well.
It's worth noting that tcp_connect_server() almost does not use the
stream interface anymore except for a few flags.
It has been identified that once we detach the connection from the SI,
it will probably be needed to keep a copy of the server-side addresses
in the SI just for logging purposes. This has not been implemented right
now though.
This is a massive rename of most functions which should make use of the
word "channel" instead of the word "buffer" in their names.
It concerns the following ones (new names) :
unsigned long long channel_forward(struct channel *buf, unsigned long long bytes);
static inline void channel_init(struct channel *buf)
static inline int channel_input_closed(struct channel *buf)
static inline int channel_output_closed(struct channel *buf)
static inline void channel_check_timeouts(struct channel *b)
static inline void channel_erase(struct channel *buf)
static inline void channel_shutr_now(struct channel *buf)
static inline void channel_shutw_now(struct channel *buf)
static inline void channel_abort(struct channel *buf)
static inline void channel_stop_hijacker(struct channel *buf)
static inline void channel_auto_connect(struct channel *buf)
static inline void channel_dont_connect(struct channel *buf)
static inline void channel_auto_close(struct channel *buf)
static inline void channel_dont_close(struct channel *buf)
static inline void channel_auto_read(struct channel *buf)
static inline void channel_dont_read(struct channel *buf)
Some functions provided by channel.[ch] have kept their "buffer" name because
they are really designed to act on the buffer according to some information
gathered from the channel. They have been moved together to the same place in
the file for better readability but they were not changed at all.
The "buffer" memory pool was also renamed "channel".
Get rid of these confusing BF_* flags. Now channel naming should clearly
be used everywhere appropriate.
No code was changed, only a renaming was performed. The comments about
channel operations were updated.
These functions do not depend on the channel flags anymore thus they're
much better suited to be used on plain buffers. Move them from channel
to buffer.
This is similar to the recent removal of BF_OUT_EMPTY. This flag was very
problematic because it relies on permanently changing information such as the
to_forward value, so it had to be updated upon every change to the buffers.
Previous patch already got rid of its users.
One part of the change is sensible : the flag was also part of BF_MASK_STATIC,
which is used by process_session() to rescan all analysers in case the flag's
status changes. At first glance, none of the analysers seems to change its
mind based on this flag when it is subject to change, so it seems fine not to
add variation checks here. Otherwise it's possible that checking the buffer's
input and output is more reliable than checking the flag's replacement.
This flag was very problematic because it was composite in that both changes
to the pipe or to the buffer had to cause this flag to be updated, which is
not always simple (eg: there may not even be a channel attached to a buffer
at all).
There were not that many users of this flags, mostly setters. So the flag got
replaced with a macro which reports whether the channel is empty or not, by
checking both the pipe and the buffer.
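The replacement is a trivial test; a sketch with approximate field
names :

  /* a channel is empty when neither the buffer's output area nor the
   * pipe holds any data.
   */
  #define channel_is_empty(c)  (!(c)->buf.o && !(c)->pipe)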
One part of the change is sensible : the flag was also part of BF_MASK_STATIC,
which is used by process_session() to rescan all analysers in case the flag's
status changes. At first glance, none of the analysers seems to change its
mind based on this flag when it is subject to change, so it seems fine not to
add variation checks here. Otherwise it's possible that checking the buffer's
output size is more useful than checking the flag's replacement.
Some parts of the sock_ops structure were only used by the stream
interface and have been moved into si_ops. Some of them were callbacks
to the stream interface from the connection and have been moved into
app_cb as they're the application seen from the connection (later,
health-checks will need to use them). The rest has moved to data_ops.
Normally at this point the connection could live without knowing about
stream interfaces at all.
Similar to what was done on the receive path, the data layer now provides
only an snd_buf() callback that is iterated over by the stream interface's
si_conn_send_loop() function.
The data layer now has no knowledge about channels nor stream interfaces.
The splice() code still needs to be ported as it is currently disabled.
The recv function is now generic and is usable to iterate any connection-to-buf
reading function from a stream interface. So let's move it to stream-interface.
The "raw_sock" prefix will be more convenient for naming functions as
it will be prefixed with the data layer and suffixed with the data
direction. So let's rename the files now to avoid any further confusion.
The #include directive was also removed from a number of files which do
not need it anymore.
At the moment, the struct is still embedded into the struct channel, but
all the functions have been updated to use struct buffer only when possible,
otherwise struct channel. Some functions would likely need to be split
between a buffer-layer primitive and a channel-layer function.
Later the buffer should become a pointer in the struct channel, but doing so
requires a few changes to the buffer allocation calls.
This is a massive rename. We'll then split channel and buffer.
This change needs a lot of cleanups. At many locations, the parameter
or variable is still called "buf" which will become ambiguous. Also,
the "struct channel" is still defined in buffers.h.
This function is used by the data layer when a zero has been read over a
connection. At the moment it only handles sockets and nothing else. Once
the complete split is done between buffers and stream interfaces, it should
become possible to work regardless of the connection type.
The connection send() callback is supposed to be generic for a
stream-interface, and consists in calling the lower layer snd_buf
function. Move this function to the stream interface and remove
the sock-raw and sock-ssl clones.
This callback is used to send data from the buffer to the socket. It is
the old write_loop() call of the data layer which is used both by the
->write() callback and the ->chk_snd() function. The reason for having
it as a pointer is that it's the only remaining part which causes the
write and chk_snd() functions to be different between raw and ssl.
sock_raw and sock_ssl use a pretty generic chk_rcv function, so let's move
this function to the stream_interface and remove specific functions. Later
we might have a single chk_rcv function.
We need to have a generic function to be called by upper layers when buffer
flags have been updated (the si->update function). At the moment, both sock_raw
and sock_ssl had their own which basically was a copy-paste. Since these
functions are only used to update stream interface flags, it is logical to
have them handled by the stream interface code.
This allowed us to remove the stream_interface-specific update function from
sock_raw and sock_ssl which now use the generic code.
The stream_sock_update_conn callback has also been more appropriately renamed
conn_notify_si() since it's meant to be called by lower layers to notify the
SI and possibly upper layers about incoming changes.
This is a second attempt at getting rid of FD_WAIT_*. Now the situation is
much better since native I/O handlers can directly manipulate the FD using
fd_{poll|want|stop}_* and the connection handlers manipulate connection-level
flags using the conn_{data|sock}_* equivalent.
Proceeding this way ensures that the connection flags always reflect the
reality even after data<->handshake switches.
Now the connection handler, the handshake callbacks and the I/O callbacks
make use of the connection-layer polling functions to enable or disable
polling on a file descriptor.
Some changes still need to be done to avoid using the FD_WAIT_* constants.
The conflicts we're facing with polling is that handshake handlers have
precedence over data handlers and may change the polling requirements
regardless of what is expected by the data layer. This causes issues
such as missed events.
The real need is to have three polling levels :
- the "current" one, which is effective at any moment
- the data one, which reflects what the data layer asks for
- the sock one, which reflects what the socket layer asks for
Depending on whether a handshake is in progress or not, either one of the
last two will replace the current one, and the change will be propagated
to the lower layers.
At the moment, the shutdown status is not considered, and only handshakes
are used to decide which layer to choose. This will probably change.
The old EV_FD_SET() macro was confusing, as it would enable receipt but there
was no way to indicate that EAGAIN was received, hence the recently added
FD_WAIT_* flags. They're not enough as we're still facing a conflict between
EV_FD_* and FD_WAIT_*. So let's offer I/O functions what they need to explicitly
request polling.
These functions have a more explicit meaning and will offer provisions
for explicit polling.
EV_FD_ISSET() has been left for now as it is still in use in checks.
Up to now, we had to use a shutr/shutw interface per data layer, which
basically means 3 distinct functions when we include SSL :
- generic stream_interface
- sock_raw
- sock_ssl
With this change, the code located in the stream_interface manages all the
stream_interface and buffer updates, and calls the data layer hooks when
needed.
At the moment, the socket layer hook had been implicitly considered as
being a regular socket, so the si_shut*() functions call the normal
shutdown() and EV_FD_CLR() functions on the fd if a socket layer is
defined. This may change in the future. The stream_int_shut*()
functions don't call EV_FD_CLR() so that they can later be embedded
in lower layers.
Thus, the si->data->shutr() is not called anymore and si->data->shutw()
is called to close the data layer only (eg: only for SSL).
Proceeding like this is very important because it's the only way to be
able not to rely on these functions when called from the connection
handlers, and call the data layers' instead.
These primitives were initially introduced so that callers were able to
conditionally set/disable polling on a file descriptor and check in return
what the state was. It's been long since we last had an "if" on this, and
all pollers' functions were the same for cond_* and their systematic
counterparts, except that this required a check and a specific return
value that are not always necessary.
So let's simplify the FD API by removing this now unused distinction and
by making all specific functions return void.
The sock_ops I/O callbacks made use of an FD till now. This has become
inappropriate and the struct connection is much more useful. It also
fixes the race condition introduced by the previous change.
The socket data layer code must only focus on moving data between a
socket and a buffer. We need a special stream interface handler to
update the stream interface and the file descriptor status.
At the moment the code works but suffers from a race condition caused
by its API : the read/write callbacks still make use of the fd instead
of using the connection. And when a double shutdown is performed, a call
to ->write() after ->read() processed an error results in dereferencing
a NULL fdtab[]->owner. This is only a temporary issue which doesn't need
to be fixed now since this will automatically go away when the functions
change to use the connection instead.
Use a single tcp_connect_probe() instead of tcp_connect_write() and
tcp_connect_read(). We call this one only when no data layer function
has been processed, so this is a fallback to test for completion of
a connection attempt.
With this done, we don't have the need for any direct I/O callback
anymore.
The function still relies on ->write() to wake the stream interface up,
so it's not finished.
This handshake handler must be independant, so move it away from
proto_tcp. It has a dedicated connection flag. It is tested before
I/O handlers and automatically removes the CO_FL_WAIT_L4_CONN flag
upon success.
It also sets the BF_WRITE_NULL flag on the stream interface and
stops the SI timeout. However it does not perform the task_wakeup(),
and relies on the data handler to do so for now. The SI wakeup will
have to be moved elsewhere anyway.
This is used to enter values for stick tables. The most likely usage
is to set gpc0 for a specific IP address in order to block traffic
for abusers without having to reload. Since all data types are
supported, other usages are possible (eg: replacing a user's assigned
server).
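For example, blocking an abusive source address could look like this on
the stats socket (assuming a table named "http_proxy"; see the
management documentation for the exact syntax) :
  set table http_proxy key 10.0.0.5 data.gpc0 1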
Source addresses of non-TCP families were not correctly handled by
tcp_src_to_stktable_key() as it forgot to return NULL and instead left
the previous value in the stick-table buffer.
This bug is 1.5-specific and was introduced by commit 4f92d320 in 1.5-dev6
so it does not need any backport.
Commit 13e66da introduced b_rew() but passes -adv which is an unsigned
quantity on 64-bit platforms, causing the buffer to advance in the wrong
direction.
No backport is needed.
When the target is a client, it will be convenient to have a pointer to the
original listener so that we can retrieve some configuration information at
the stream interface level.
This function will be called later when splitting the shutdown in two
steps. It will be needed by SSL and for remote socket operations to
release unused contexts.
The state and the private pointer are not specific to the applets, since SSL
will require exactly both of them. Move them to the connection layer now and
rename them. We also now ensure that both are NULL on first call.
We start to move everything needed to manage a connection to a special
entity "struct connection". We have the data layer operations and the
control operations there. We'll also have more info in the future such
as file descriptors and applet contexts, so that in the end it becomes
detachable from the stream interface, which will allow connections to
be reused between sessions.
For now, we start with minimal changes.
Tunnel timeouts are used when TCP connections are forwarded, or
when forwarding upgraded HTTP connections (WebSocket) as well as
CONNECT requests to proxies.
This timeout allows long-lived sessions to be supported without
having to set large timeouts for normal requests.
Similarly to the previous patch, we don't need the socket-layer functions
outside of stream_interface. They could even move to a file dedicated to
applets, though that does not seem particularly useful at the moment.
Commit e164e7a removed get_src/get_dst setting in the stream interfaces but
forgot to set it in proto_tcp. Get the feature back because we need it for
logging, transparent mode, ACLs etc... We now rely on the stream interface
direction to know what syscall to use.
One benefit of doing it this way is that we don't use getsockopt() anymore
on outgoing stream interfaces nor on UNIX sockets.
We'll soon have an SSL socket layer, and in order to ease the difference
between the two, we use the name "sock_raw" to designate the one which
directly talks to the sockets without any conversion.
There is no more reason for the realign function being HTTP specific,
it only operates on a buffer now. Let's move it to buffers.c instead.
It's likely that buffer_bounce_realign is broken (not used), this will
have to be inspected. The function is worth rewriting as it can be
cheaper than buffer_slow_realign() to realign large wrapping buffers.
All keywords registered using a cfg_kw_list now make use of the new error reporting
framework. This allows easier and more precise error reporting without having to
deal with complex buffer allocation issues.
The previous sockstream_accept() function uses nothing from sockstream, and
is totally irrelevant to stream interfaces. Move this to the protocols.c
file which handles listeners and protocols, and call it listener_accept().
It now makes much more sense that the code dealing with listen() also handles
accept() and passes it to upper layers.
This is mainly a massive renaming in the code to get it in line with the
calling convention. Next patch will rename a few files to complete this
operation.
All parsing errors were known but impossible to return. Now by making use
of memprintf(), we're able to build meaningful error messages that the
caller can display.
HTTP header fetch is now done using smp_fetch_hdr() for both ACLs and
patterns. This one also supports an occurrence number, making it possible
to specify explicit occurrences for ACLs and patterns.
Patterns were using a bitmask to indicate if request or response was desired
in fetch functions and keywords. ACLs were using a bitmask in fetch keywords
and a single bit in fetch functions. ACLs were also using an ACL_PARTIAL bit
in fetch functions indicating that a non-final fetch was performed, which was
an abuse of the existing direction flag.
The change now consists in using :
- a capabilities field for fetch keywords => SMP_CAP_REQ/RES to indicate
if a keyword supports requests, responses, both, etc...
- an option field for fetch functions to indicate what the caller expects
(request/response, final/non-final)
The ACL_PARTIAL bit was reversed to get SMP_OPT_FINAL as it's more explicit
to know we're working on a final buffer than on a non-final one.
ACL_DIR_* were removed, as well as PATTERN_FETCH_*. L4 fetches were improved
to support being called on responses too since they're still available.
The <dir> field of all fetch functions was changed to <opt> which is now
unsigned.
The patch is large but mostly made of cosmetic changes to accommodate this, as
almost no logic change happened.
Having the args everywhere will make it easier to share fetch functions
between patterns and ACLs. The only place where we could have needed
the expr was in the http_prefetch function which can do well without.
Previously, pattern, backend and persist_rdp_cookie would all build fake
ACL expressions to fetch an RDP cookie by calling acl_fetch_rdp_cookie().
Now we switch roles. The RDP cookie fetch function is provided as a sample
fetch function that all others rely on, including ACL. The code is exactly
the same, only the args handling moved from expr->args to args. The code
was moved to proto_tcp.c, but probably that a dedicated file would be more
suited to content handling.
This change is pretty minor. Struct pattern is only used for
pattern_process() now so changing it to use the common type is
quite obvious. It's worth noting that the last argument of
pattern_process() is never used so the function is self-sufficient.
Note that pattern_process() does not initialize the pattern at all
before calling fetch->process(), and that minimal initialization
will be required when we later change the argument for the sample.
These ones were either unused or improperly used. Some integers were marked
read-only, which does not make much sense. Buffers are not read-only, they're
"constant" in that they must be kept intact after any possible change.
This one is not needed anymore as we can return the data and its type in the
sample provided by the caller. ACLs now always return the proper type. BOOL
is already returned when the result is expected to be processed as a boolean.
temp_pattern has been unexported now.
This flag was used to force a boolean match even if there was no pattern
to match. It was used only by http_auth() and designed only for this one.
It's easier and cleaner to make the fetch function perform the test and
report the boolean result as a few other functions already do. It simplifies
the acl_exec_cond() logic and will help merging ACLs and patterns.
We don't need the pattern-specific args parsers anymore, make use of the
common parser instead. We still need to improve this by adding a validation
function to report abnormal argument values or combinations. We don't report
precise parsing errors yet but this was not previously done either.
arg_i was almost unused, and since we migrated to use struct arg everywhere,
the rare cases where arg_i was needed could be replaced by switching to
arg->type = ARGT_STOP.
make_arg_list() builds an array of typed arguments with their values,
that the caller describes how to parse. This will be used to support
multiple arguments for ACLs and patterns, which is currently problematic
and prevents ACLs and patterns from being merged. Up to 7 arguments types
may be enumerated in a single 32-bit word, including their number of
mandatory parts.
At the moment, these files are not used yet, they're only built. Note that
the 4-bit encoding for the type has left only one unused type!
This is more convenient and efficient than buf->p = b_ptr(buf, n);
It simply advances the buffer's pointer by <n> and transfers that
amount of bytes from <in> to <out>. The BF_OUT_EMPTY flag is updated
accordingly.
A few occurrences of such computations in buffers.c and stream_sock.c
were updated to use b_adv(), which resulted in a small code shrink.
buffer_wrap_add was convenient for the migration but is not handy at all.
Let's have new wrappers that report input begin/end and output begin/end
instead.
It looks like we'll also need a b_adv(ofs) to advance a buffer's pointer.
buffer_ignore may only be used when the output of a buffer is empty,
but it's not granted it is always the case when sending HTTP error
responses. Better use buffer_cut_tail() instead, and use buffer_ignore
only on non-wrapping data.
The buffer's pointer <lr> was only used by HTTP parsers which also use a
struct http_msg to keep track of the parser's state. We've reached a point
where it makes no sense to keep ->lr in the buffer, as the split between
buffer and msg is only arbitrary for historical reasons.
This change ensures that touching buffers will not impact HTTP messages
anymore, making the buffers more content-agnostic. However, it becomes
very important not to forget to update msg->next when some data get
forwarded or moved (and in general each time buf->p is updated).
The new pointer in http_msg becomes relative to buffer->p so that
parsing multiple messages becomes easier. It is possible that at one
point ->som and ->next will be merged.
Note: http_parse_reqline() and http_parse_stsline() have been temporarily
modified to know the message starting point in the buffer (->p).
This change gets rid of buf->r which is always equal to buf->p + buf->i.
It removed some wrapping detection at a number of places, but required addition
of new relative offset computations at other locations. A large number of places
can be simplified now with extreme care, since most of the time, either the
pointer has to be computed once or we need a difference between the old ->w and
old ->r to compute free space. The cleanup will probably happen with the rewrite
of the buffer_input_* and buffer_output_* functions anyway.
buf->lr still has to move to the struct http_msg and be relative to buf->p
for the rework to be complete.
This change introduces the buffer's base pointer, which is the limit between
incoming and outgoing data. It's the point where the parsing should start
from. A number of computations have already been greatly simplified, but
more simplifications are expected to come from the removal of buf->r.
The changes appear good and have revealed occasional improper use of some
pointers. It is possible that this patch has introduced bugs or revealed
some, although preliminary testings tend to indicate that everything still
works as it should.
We don't have buf->l anymore. We have buf->i for pending data and
the total length is retrieved by adding buf->o. Some computation
already become simpler.
Despite extreme care, bugs are not excluded.
It's worth noting that msg->err_pos as set by HTTP request/response
analysers becomes relative to pending data and not to the beginning
of the buffer. This has not been completed yet so differences might
occur when outgoing data are left in the buffer.
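A minimal sketch of the resulting buffer model as described above (field
names follow the text, the real struct holds more members):

struct buffer_sketch {
    char *p;        /* boundary between outgoing and incoming data */
    unsigned int i; /* pending (incoming) bytes, located after p */
    unsigned int o; /* outgoing bytes, located before p */
    /* ... storage, size, flags ... */
};

/* what buf->l used to report: total bytes present in the buffer */
static inline unsigned int buffer_len(const struct buffer_sketch *b)
{
    return b->i + b->o;
}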
These callbacks are used to retrieve the source and destination address
of a socket. The address flags are no longer held on the stream interface
nor on the session. The addresses are collected when needed.
This still needs to be improved to store the IP and port separately so
that it is not needed to perform a getsockname() when only the IP address
is desired for outgoing traffic.
The unique ID is an ID generated from several pieces of information. You can use
a log-format string to customize it, with the "unique-id-format" keyword,
and insert it in the request header, with the "unique-id-header" keyword.
%Fi: Frontend IP
%Fp: Frontend Port
%Si: Server IP
%Sp: Server Port
%Ts: Timestamp
%rt: HTTP request counter
%H: hostname
%pid: PID
+X: Hexadecimal representation
The +X mode in logformat displays hexadecimal for the following flags
%Ci %Cp %Fi %Fp %Bi %Bp %Si %Sp %Ts %ct %pid
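A hypothetical configuration sketch combining the two keywords with some
of the variables above (the exact format string is left to the user):

# build an ID from the timestamp, HTTP request counter and PID,
# and insert it into outgoing requests in the "X-Unique-ID" header
unique-id-format %Ts-%rt-%pid
unique-id-header X-Unique-ID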
rename logformat_write_string() to lf_text()
Optimize size computation
* logformat functions now take a format linked list as argument
* build_logline() build a logline using a format linked list
* rename LOG_* by LOG_FMT_* in enum
* improve error management in build_logline()
memcmp()/strcmp() calls were needed in different parts of code to determine
the status code. Each new status code introduces new calls, which can become
inefficient and a source of bugs.
This patch reorganizes the code to rely on a numeric status code internally
and to be hopefully more generic.
Previously, the stats admin page required POST parameters to be provided
exactly in the same order as the HTML form.
This patch allows those parameters to be handled in any order.
Also, note that haproxy won't alter server states anymore if backend or server
names are ambiguous (duplicated names in the configuration) to prevent
unexpected results (the same should probably be applied to the stats socket).
Commit a1cc38 introduced a regression which was easy to trigger till ad4cd58
(snapshots 20120222 to 20120311 included). The bug was still present after
that but harder to trigger.
The bug is caused by the use of two distinct log buffers due to intermediary
changes. The issue happens when an HTTP request is logged just after a TCP
request during the same second and the HTTP request is too large for the buffer.
In this case, it happens that the HTTP request is logged into the TCP buffer
instead and that length controls can't detect anything.
Starting with bddd4f, the issue is still possible when logging too large an
HTTP request just after a send_log() call (typically a server status change).
We owe a big thanks to Sander Klein for testing several snapshots and more
specifically for taking significant risks in production by letting the buggy
version crash several times in order to provide an exploitable core ! The bug
could not have been found without this precious help. Thank you Sander !
This fix does not need to be backported, it did not affect any released version.
Merge http_sess_log() and tcp_sess_log() to sess_log() and move it to
log.c
A new field in logformat_type defines whether you can use a logformat
variable in TCP or HTTP mode.
doc: log-format in tcp mode
Note that due to the way log buffer allocation currently works, trying to
log an HTTP request without "option httplog" is still not possible. This
will change in the near future.
A number of offset computation functions use struct buffer* arguments
and return integers without modifying the input. Using consts helps
simplifying some operations in callers.
The principle behind this load balancing algorithm was first imagined
and modeled by Steen Larsen then iteratively refined through several
work sessions until it would totally address its original goal.
The purpose of this algorithm is to always use the smallest number of
servers so that extra servers can be powered off during non-intensive
hours. Additional tools may be used to do that work, possibly by
locally monitoring the servers' activity.
The first server with available connection slots receives the connection.
The servers are chosen from the lowest numeric identifier to the highest
(see server parameter "id"), which defaults to the server's position in
the farm. Once a server reaches its maxconn value, the next server is used.
It does not make sense to use this algorithm without setting maxconn. Note
that it can however make sense to use minconn so that servers are not used
at full load before starting new servers, and so that introduction of new
servers requires a progressively increasing load (the number of servers
would more or less follow the square root of the load until maxconn is
reached). This algorithm ignores the server weight, and is more beneficial
to long sessions such as RDP or IMAP than HTTP, though it can be useful
there too.
http_sess_log now uses the logformat linked list to build the log
string; snprintf is not used, for performance reasons.
CLF mode also uses logformat.
NOTE: as of now, empty fields in CLF are "" and not "-" anymore.
parse_logformat_string: parse the string, detect the type: text,
separator or variable
parse_logformat_var: detect the variable name
parse_logformat_var_args: parse arguments and flags
add_to_logformat_list: add to the logformat linked list
The send_log function is now split into 3 functions:
* hdr_log: generate the syslog header
* send_log: send a syslog message with a printf format string
* __send_log: send a syslog message
It was reported that a server configured with a zero weight would
sometimes still take connections from the backend queue. This issue is
real, it happens this way :
1) the disabled server accepts a request with a cookie
2) many cookie-less requests accumulate in the backend queue
3) when the disabled server completes its request, it checks its own
queue and the backend's queue
4) the server takes a pending request from the backend queue and
processes it. In response, the server's cookie is assigned to
the client, which ensures that some requests will continue to
be served by this server, leading back to point 1 above.
The fix consists in preventing a zero-weight server from dequeuing pending
requests from the backend. Making use of srv_is_usable() in such tests makes
the tests more robust against future changes.
This fix must be backported to 1.4 and 1.3.
In a config where server "s1" is marked disabled and "s2" tracks "s1",
s2 appears disabled on the stats but is still inserted into the LB farm
because the tracking is resolved too late in the configuration process.
We now resolve tracked servers before building LB maps and we also mark
the tracking server in maintenance mode, which previously was not done,
causing half of the issue.
Last point is that we also protect srv_is_usable() against electing a
server marked for maintenance. This is not absolutely needed but is a
safe choice and makes a lot of sense.
This fix must be backported to 1.4.
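A minimal sketch of the resulting usability test (flag names are
hypothetical, the real srv_is_usable() works on the server struct): a
server may be elected or dequeue requests only when it is running, not
in maintenance, and has a non-zero weight:

#define SRV_SK_RUNNING  0x01  /* hypothetical "running" flag */
#define SRV_SK_MAINT    0x02  /* hypothetical "maintenance" flag */

static inline int srv_is_usable_sketch(int state, int eweight)
{
    if (!eweight)
        return 0;  /* zero-weight servers must not take new requests */
    if (state & SRV_SK_MAINT)
        return 0;  /* never elect a server in maintenance mode */
    return (state & SRV_SK_RUNNING) != 0;
}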
New option "http-send-name-header" specifies the name of a header which
will hold the server name in outgoing requests. This is the name of the
server the connection is really sent to, which means that upon redispatches,
the header's value is updated so that it always matches the server's name.
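A hypothetical configuration sketch (the header name is chosen freely by
the user):

backend app
    http-send-name-header X-Target-Server
    server srv1 192.168.0.1:80
    server srv2 192.168.0.2:80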
The new function does not return IP addresses but header values instead,
so that the caller is free to do what it wants with them. The conversion
is not quite clean yet, as the previous test which considered that address
0.0.0.0 meant "no address" is still used. A different IP parsing function
should be used to take this into account.
In commit 4b517ca93a (MEDIUM: buffers:
add some new primitives and rework existing ones), we forgot to check
if buffer_max_len() < l.
No backport is needed.
A number of primitives were missing for buffer management, and some
of them were particularly awkward to use. Specifically, the functions
used to compute free space could not always be used depending what was
wrapping in the buffers. Some documentation has been added about how
the buffers work and their properties. Some functions are still missing
such as a buffer replacement which would support wrapping buffers.
It makes no sense to have one pointer to the hdr_idx pool in each proxy
struct since these pools do not depend on the proxy. Let's have a common
pool instead as it is already the case for other types.
Stream interfaces used to distinguish between client and server addresses
because they were previously of different types (sockaddr_storage for the
client, sockaddr_in for the server). This is not the case anymore, and this
distinction is confusing at best and has caused a number of regressions to
be introduced in the process of converting everything to full-ipv6. We can
now remove this and have a much cleaner code.
This patch introduces hdr_len, path_len and url_len for matching these
respective parts lengths against integers. This can be used to detect
abuse or empty headers.
We already had the ability to kill a connection, but it was only
for the checks. Now we can do this for any session, and for this we
add a specific flag "K" to the logs.
The stats socket now allows the admin to disable, enable or shutdown a frontend.
This can be used when a bug is discovered in a configuration and it's desirable
to fix it but the rules in place don't allow to change a running config. Thus it
becomes possible to kill the frontend to release the port and start a new one in
a separate process.
This can also be used to temporarily make haproxy return TCP resets to incoming
requests to pretend the service is not bound. For instance, this may be useful
to quickly flush a very deep SYN backlog.
The frontend check and lookup code was factored with the "set maxconn" usage.
This one enforces a per-process connection rate limit, regardless of what
may be set per frontend. It can be a way to limit the CPU usage of a process
being severely attacked.
The side effect is that the global process connection rate is now measured
for each incoming connection, so it will be possible to report it.
This was revealed with one of the very latest patches which caused
the listener_queue not to be initialized on the stats socket frontend.
And in fact a number of other ones were missing too. This is getting so
boring that now we'll always make use of the same function to initialize
any proxy. Doing so has even saved about 500 bytes on the binary due to
the avoided code redundancy.
No backport is needed.
This function is finally not needed anymore, as it has been replaced with
a per-proxy task that is scheduled when some limits are encountered on
incoming connections or when the process is stopping. The savings should
be noticeable on configs with a large number of proxies. The most important
point is that the rate limiting is now enforced in a clean and solid way.
When a listener encounters a resource shortage, it currently stops until
one re-enables it. This is far from being perfect as it does not yet handle
the case where the single connection from the listener is rejected (eg: the
stats page).
Now we'll have a special status for resource limited listeners and we'll
queue them into one or multiple lists. That way, each time we have to stop
a listener because of a resource shortage, we can enqueue it and change its
state, so that it is dequeued once more resources are available.
This patch currently does not change any existing behaviour, it only adds
the basic building blocks for doing that.
Managing listener state is difficult because listeners have their own state
and can at the same time have it dictated by their proxy. The pause
is not done properly, as the proxy code is fiddling with sockets. By
introducing new functions such as pause_listener()/resume_listener(), we
make it a bit more obvious how/when they're supposed to be used. The
listen_proxies() function was also renamed to resume_proxies() since
it's only used for pause/resume.
This patch is the first in a series aiming at getting rid of the maintain_proxies
mess. In the end, proxies should not call enable_listener()/disable_listener()
anymore.
Patch af5149 introduced an issue which can be detected only on out of
memory conditions : a LIST_DEL() may be performed on an uninitialized
struct member instead of a LIST_INIT() during the accept() phase,
causing crashes and memory corruption to occur.
This issue was detected and diagnosed by the Exceliance R&D team.
This is 1.5-specific and very recent, so no existing deployment should
be impacted.
The motivation for this is to allow iteration of all the connections
of a server without the expense of iterating over the global list
of connections.
The first use of this will be to implement an option to close connections
associated with a server when it is marked as being down or in maintenance
mode.
* The declaration of peer_session_create() does not match its
definition. As it is only used inside of peers.c, make it static.
* Make the declaration of peers_register_table() match its definition.
* Also, make all functions in peers.c that are not also in peers.h
static.
Since IPv6 is a different type than IPv4, the pattern fetch functions
src6 and dst6 were added. IPv6 stick-tables can also fetch IPv4 addresses
with src and dst. In this case, the IPv4 addresses are mapped to their
IPv6 counterpart, according to RFC 4291.
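The mapping itself is simple; a minimal sketch of the RFC 4291
::ffff:a.b.c.d form (a standalone illustration, not necessarily the code
used here):

#include <netinet/in.h>
#include <string.h>

/* map an IPv4 address to its IPv6-mapped ::ffff:a.b.c.d equivalent */
static void map_v4_to_v6(struct in6_addr *v6, const struct in_addr *v4)
{
    memset(v6, 0, sizeof(*v6));
    v6->s6_addr[10] = 0xff;
    v6->s6_addr[11] = 0xff;
    memcpy(&v6->s6_addr[12], &v4->s_addr, 4);
}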
Since the latest additions to buffer_forward(), it became too large for
inlining, so let's uninline it. The code size drops by 3kB. Should be
backported to 1.4 too.
Despite much care around handling the content-length as a 64-bit integer,
forwarding was broken on 32-bit platforms due to the 32-bit nature of
the ->to_forward member of the "buffer" struct. The issue is that this
member is declared as a long, so while it works OK on 64-bit platforms,
32-bit platforms truncate the content-length to the lower 32 bits.
One solution could consist in turning to_forward to a long long, but it
is used a lot in the critical path, so it's not acceptable to perform
all buffer size computations on 64-bit there.
The fix consists in changing the to_forward member to a strict 32-bit
integer and ensure in buffer_forward() that only the amount of bytes
that can fit into it is considered. Callers of buffer_forward() are
responsible for checking that their data were taken into account. We
arbitrarily ensure we never consider more than 2G at once.
That's the way it was intended to work on 32-bit platforms except that
it did not.
This issue was tracked down hard at Exosec with Bertrand Jacquin,
Thierry Fournier and Julien Thomas. It remained undetected for a long
time because files larger than 4G are almost always transferred in
chunked-encoded format, and most platforms dealing with huge contents
these days run on 64-bit.
The bug affects all 1.5 and 1.4 versions, and must be backported.
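The principle of the fix can be sketched as follows (hypothetical names,
the real buffer_forward() handles more cases): the 64-bit request is
clamped to what fits in the 32-bit counter, and the caller compares the
return value with what it asked for:

#define FWD_MAX 0x7fffffffU  /* never consider more than ~2GB at once */

/* schedule up to <bytes> for forwarding; returns how much was taken */
static unsigned long long forward_sketch(unsigned int *to_forward,
                                         unsigned long long bytes)
{
    unsigned long long take = bytes;

    if (take > FWD_MAX - *to_forward)
        take = FWD_MAX - *to_forward;
    *to_forward += (unsigned int)take;
    return take;  /* the caller re-issues the remainder later */
}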
And also rename "req_acl_rule" to "http_req_rule". At the beginning that
was a bit confusing to me, especially the "req_acl" list which in fact
holds what we call rules. After some digging, it appeared that some
part of the code is 100% HTTP and not just related to authentication
anymore, so let's move that part to HTTP and keep the auth-only code
in auth.c.
It's very annoying that frontend and backend stats are merged because we
don't know what we're observing. For instance, if a "listen" instance
makes use of a distinct backend, it's impossible to know what the bytes_out
means.
Some points take care of not updating counters twice if the backend points
to the frontend, indicating a "listen" instance. The thing becomes more
complex when we try to add support for server side keep-alive, because we
have to maintain a pointer to the backend used for last request, and to
update its stats. But we can't perform such comparisons anymore because
the counters will not match anymore.
So in order to get rid of this situation, let's have both frontend AND
backend stats in the "struct proxy". We simply update the relevant ones
during activity. Some of them are only accounted for in the backend,
while others are just for frontend. Maybe we can improve a bit on that
later, but the essential part is that those counters now reflect what
they really mean.
This patch turns internal server addresses to sockaddr_storage to
store IPv6 addresses, and makes the connect() function use it. This
code already works but some caveats with getaddrinfo/gethostbyname
still need to be sorted out while the changes had to be merged at
this stage of internal architecture changes. So for now the config
parser will not emit an IPv6 address yet so that user experience
remains unchanged.
This change should have absolutely zero user-visible effect, otherwise
it's a bug introduced during the merge, that should be reported ASAP.
This one has been removed and is now totally superseded by ->target.
To get the server, one must use target_srv(&s->target) instead of
s->srv now.
The function ensures that non-server targets still return NULL.
This is in fact where those parts belong. The old data_state was replaced
by applet.state and is now initialized when the applet is registered. It's
worth noting that the applet does not need to know the session nor the
buffer anymore since everything is brought by the stream interface.
It is possible that having a separate applet struct would simplify the
code but that's not a big deal.
When doing a connect() on a stream interface, some information is needed
from the server and from the backend. In some situations, we don't have
a server and only a backend (eg: peers). In other cases, we know we have
an applet and we don't want to connect to anything, but we'd still like
to have the info about the applet being used.
For this, we now store a pointer to the "target" into the stream interface.
The target describes what's on the other side before trying to connect. It
can be a server, a proxy or an applet for now. Later we'll probably have
descriptors for multiple-stage chains so that the final information may
still be found.
This will help removing many specific cases in the code. It already made
it possible to remove the "srv" and "be" parameters to tcpv4_connect_server().
I/O handlers are still delicate to manipulate. They have no type, they're
just raw functions which have no knowledge of themselves. Let's have them
declared as applets once for all. That way we can have multiple applets
share the same handler functions and we can store their names there. When
we later need to add more parameters (eg: usage stats), we'll be able to
do so in the applets themselves.
The CLI functions have been prefixed with "cli" instead of "stats" as it's
clearly what is going on there.
The applet descriptor in the stream interface should get all the applet
specific data (st0, ...) but this will be done in the next patch so that
we don't pollute this one too much.
Debugging parsing errors can be greatly improved if we know what the parser
state was and what the buffer flags were (especially for closed inputs/outputs
and full buffers). Let's add that to the error snapshots.
Enhance pattern conv and fetch argument parsing; fetch and conv callbacks now use typed args.
Add more details to error messages from the pattern expression parsing function.
Update existing pattern convs and fetches to the new prototype.
Create stick table key type "binary".
Manage truncation and padding if the pattern's fetch-converted result doesn't match the table key size.
The introduction of a new PROXY protocol for proxied connections requires
an early analyser to decode the incoming connection and set the session
flags accordingly.
Some more work is needed, among which setting a flag on the session to
indicate it's proxied, and copying the original parameters for later
comparisons with new ACLs (eg: real_src, ...).
The stats web interface must be read-only by default to prevent security
holes. As it is now allowed to enable/disable servers, a new keyword
"stats admin" is introduced to activate this admin level, conditioned by ACLs.
(cherry picked from commit 5334bab92ca7debe36df69983c19c21b6dc63f78)
Based on a patch provided by Judd Montgomery, it is now possible to
enable/disable servers from the stats web interface. This makes it possible to select
several servers in a backend and apply the action to them at the same time.
Currently, there are 2 known limitations :
- The POST data are limited to one packet
(don't alter too many servers at a time).
- Expect: 100-continue is not supported.
(cherry picked from commit 7693948766cb5647ac03b48e782cfee2b1f14491)
There was no consistency between all the functions used to exchange data
between a buffer and a stream interface. Also, the functions used to send
data to a buffer did not consider the possibility that the buffer was
shutdown for read.
Now the functions are called buffer_{put,get}_{char,block,chunk,string}.
The old buffer_feed* functions have been left available for existing code
but marked deprecated.
Signal zero is never delivered by the system. However having a signal to
which functions and tasks can subscribe to be notified of a stopping event
is useful. So this patch does two things :
1) allow signal zero to be delivered from any function or signal handler
2) make soft_stop() deliver this signal so that tasks can be notified of
a stopping condition.
The two new functions below make it possible to register any number
of functions or tasks to a system signal. They will be called in the
registration order when the signal is received.
struct sig_handler *signal_register_fct(int sig, void (*fct)(struct sig_handler *), int arg);
struct sig_handler *signal_register_task(int sig, struct task *task, int reason);
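A hypothetical usage sketch, tying this to the signal-zero mechanism
described above:

static void stop_notify(struct sig_handler *sh)
{
    /* release private resources, unbind listeners, ... */
    (void)sh;
}

static void init_sketch(void)
{
    /* run stop_notify() when soft_stop() broadcasts signal zero */
    signal_register_fct(0, stop_notify, 0);
}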
In case of HTTP keepalive processing, we want to release the counters tracked
by the backend. Till now only the second set of counters was released, while
it could have been assigned by the frontend, or the backend could also have
assigned the first set. Now we reuse two unused bits of the session flags to
mark which stick counters were assigned by the backend and to release them as
appropriate.
The assumption that there was a 1:1 relation between tracked counters and
the frontend/backend role was wrong. It is perfectly possible to track the
track-fe-counters from the backend and the track-be-counters from the
frontend. Thus, in order to reduce confusion, let's remove this useless
{fe,be} reference and simply use {1,2} instead. The keywords have also been
renamed in order to limit confusion. The ACL rule action now becomes
"track-sc{1,2}". The ACLs are now "sc{1,2}_*" instead of "trk{fe,be}_*".
That means that we can reasonably document "sc1" and "sc2" (sticky counters
1 and 2) as sort of patterns that are available during the whole session's
life and use them just like any other pattern.
Having a single tracking pointer for both frontend and backend counters
does not work. Instead let's have one for each. The keyword has changed
to "track-be-counters" and "track-fe-counters", and the ACL "trk_*"
changed to "trkfe_*" and "trkbe_*".
It's a bit cumbersome to have to know all possible storable types
from the stats interface. Instead, let's have generic types for
all data, which will facilitate their manipulation.
It is now possible to dump a table's contents with keys, expire,
use count, and various data using the command above on the stats
socket.
"show table" only shows main table stats, while "show table <name>"
dumps table contents, only if the socket level is admin.
This patch adds support for the following session counters :
- http_req_cnt : HTTP request count
- http_req_rate: HTTP request rate
- http_err_cnt : HTTP request error count
- http_err_rate: HTTP request error rate
The equivalent ACLs have been added to check the tracked counters
for the current session or the counters of the current source.
This function looks up a key, updates its expiration date, or creates
it if it was not found. acl_fetch_src_updt_conn_cnt() was updated to
make use of it.
These counters maintain incoming connection rates and session rates
in a stick-table, over a period which is defined in the configuration
(2 ms to 24 days). They can be used to detect service abuse and
enforce a certain accept rate per source address for instance, and
block if the rate is exceeded.
Example :
# block if more than 50 requests per 5 seconds from a source.
stick-table type ip size 200k expire 1m store conn_rate(5s),sess_rate(5s)
tcp-request track-counters src
tcp-request reject if { trk_conn_rate gt 50 }
# cause a 3 seconds pause to requests from sources in excess of 20 requests/5s
tcp-request inspect-delay 3s
tcp-request content accept if { trk_sess_rate gt 20 } WAIT_END
We're now able to return errors based on the validity of an argument
passed to a stick-table store data type. We also support ARG_T_DELAY
to pass delays to stored data types (eg: for rate counters).
Some data types will require arguments (eg: period for a rate counter).
This patch adds support for such arguments between parenthesis in the
"store" directive of the stick-table statement. Right now only integers
are supported.
When a session tracks a counter, automatically increase the cumulated
connection count. This makes src_updt_conn_cnt() almost useless. In
fact it might still be used to update different tables.
The new "conn_cur" session counter has been added. It is automatically
updated upon "track XXX" directives, and the entry is touched at the
moment we increment the value so that we don't consider further counter
updates as real updates, otherwise we would end up updating upon completion,
which may not be desired. Probably that some other event counters (eg: HTTP
requests) will have to be updated upon each event though.
This counter can be matched against current session's source address using
the "src_conn_cur" ACL.
This patch adds the ability to set a pointer in the session to an
entry in a stick table which holds various counters related to a
specific pattern.
Right now the syntax matches the target syntax and only the "src"
pattern can be specified, to track counters related to the session's
IPv4 source address. There is a special function to extract it and
convert it to a key. But the goal is to be able to later support as
many patterns as for the stick rules, and get rid of the specific
function.
The "track-counters" directive may only be set in a "tcp-request"
statement right now. Only the first one applies. Probably that later
we'll support multi-criteria tracking for a single session and that
we'll have to name tracking pointers.
No counter is updated right now, only the refcount is. Some subsequent
patches will have to bring that feature.
The buffer_feed* functions that are used to send data to buffers did only
support sending contiguous chunks while they're relying on memcpy(). This
patch improves on this by making them able to write in two chunks if needed.
Thus, the buffer_almost_full() function has been improved to really consider
the remaining space and not just what can be written at once.
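The underlying technique is the classic two-part copy into a wrapping
area; a standalone sketch (hypothetical names, simplified from the real
buffer code):

#include <string.h>

/* copy <len> bytes at write offset <wr> of a circular buffer of <size>
 * bytes, splitting the memcpy when the area wraps past the end
 */
static void circ_write(char *storage, size_t size, size_t wr,
                       const char *src, size_t len)
{
    size_t first = size - wr;  /* contiguous room before the end */

    if (len <= first) {
        memcpy(storage + wr, src, len);
    } else {
        memcpy(storage + wr, src, first);
        memcpy(storage, src + first, len - first);
    }
}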
Some freq counters will have to work on periods different from 1 second.
The original freq counters rely on the period being exactly one second.
The new ones (freq_ctr_period) let the user define the period in ticks,
and all computations are operated over that period. When reading a value,
it indicates the amount of events over that period too.
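A minimal sketch of such a counter (hypothetical and simplified, the
real implementation differs): the current period's count is completed by
a prorated share of the previous period's count:

struct freq_ctr_period_sketch {
    unsigned int curr_tick; /* start of the current period */
    unsigned int curr_ctr;  /* events seen in the current period */
    unsigned int prev_ctr;  /* events seen in the previous period */
};

/* account for one event at tick <now>, rotating periods when needed */
static void freq_event(struct freq_ctr_period_sketch *c,
                       unsigned int period, unsigned int now)
{
    unsigned int elapsed = now - c->curr_tick;

    if (elapsed >= period) {
        c->prev_ctr = (elapsed < 2 * period) ? c->curr_ctr : 0;
        c->curr_tick = now - (elapsed % period);
        c->curr_ctr = 0;
    }
    c->curr_ctr++;
}

/* estimate the number of events over the last <period> ticks */
static unsigned int freq_read(const struct freq_ctr_period_sketch *c,
                              unsigned int period, unsigned int now)
{
    unsigned int elapsed = now - c->curr_tick;

    if (elapsed >= period)
        return 0;  /* no event for a full period: simplified to zero */
    return c->curr_ctr +
        (unsigned int)((unsigned long long)c->prev_ctr *
                       (period - elapsed) / period);
}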
When an entry already exists, we just need to update its expiration
timer. Let's have a dedicated function for that instead of spreading
open code everywhere.
This change also ensures that an update of an existing sticky session
really leads to an update of its expiration timer, which was apparently
not the case till now. This point needs to be checked in 1.4.
Till now sticky sessions only held server IDs. Now there are other
data types so it is not acceptable anymore to overwrite the server ID
when writing something. The server ID must then only be written from
the caller when appropriate. Doing this has also led to separate
lookup and storage.
The stick_tables will now be able to store extra data for a same key.
A limited set of extra data types will be defined and for each of them
an offset in the sticky session will be assigned at startup time. All
of this information will be stored in the stick table.
The extra data types will have to be specified after the new "store"
keyword of the "stick-table" directive, which will reserve some space
for them.
pattern.c depended on stick_table while in fact it should be the opposite.
So we move from pattern.c everything related to stick_tables and invert the
dependency. That way the code becomes more logical and intuitive.
Right now we're only able to store a server ID in a sticky session.
The goal is to be able to store anything whose size is known at startup
time. For this, we store the extra data before the stksess pointer,
using a negative offset. It will then be easy to cumulate multiple
data provided they each have their own offset.
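A minimal sketch of this layout (hypothetical names): the extra data
block is allocated in front of the structure, and each data type is
later reached through its own negative offset:

#include <stdlib.h>

struct stksess_sketch {
    /* key, expiration, server id, ... */
    int server_id;
};

/* allocate <extra> bytes of data in front of the main struct */
static struct stksess_sketch *stksess_new(size_t extra)
{
    char *block = calloc(1, extra + sizeof(struct stksess_sketch));

    return block ? (struct stksess_sketch *)(block + extra) : NULL;
}

/* reach one piece of extra data from its assigned offset;
 * freeing must use the real block start: free((char *)ts - extra)
 */
static void *stksess_data(struct stksess_sketch *ts, size_t offset)
{
    return (char *)ts - offset;
}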
Now we're able to reject connections very early, so we need to use a
different counter for the connections that are received and the ones
that are accepted and converted into sessions, so that the rate limits
can still apply to the accepted ones. The session rate must still be
used to compute the rate limit, so that we can reject undesired traffic
without affecting the rate.
A new function session_accept() is now called from the lower layer to
instantiate a new session. Once the session is instantiated, the upper
layer's frontend_accept() is called. This one can be service-dependent.
That way, we have a 3-phase accept() sequence :
1) protocol-specific, session-less accept(), which is pointed to by
the listener. It defaults to the generic stream_sock_accept().
2) session_accept() which relies on a frontend but not necessarily
for use in a proxy (eg: stats or any future service).
3) frontend_accept() which performs the accept for the service
offered by the frontend. It defaults to frontend_accept() which
is really what is used by a proxy.
The TCP/HTTP proxies have been moved to this mode so that we can now rely on
frontend_accept() for any type of session initialization relying on a frontend.
The next step will be to convert the stats to use the same system.
It's not normal to initialize the server-side stream interface from the
accept() function, because it may change later. Thus, we introduce a new
stream_sock_prepare_interface() function which is called just before the
connect() and which sets all of the stream_interface's callbacks to the
default ones used for real sockets. The ->connect function is also set
at the same instant so that we can easily add new server-side protocols
soon.
The new LI_O_TCP_RULES listener option indicates that some TCP rules
must be checked upon accept on this listener. It is now checked by
the frontend and the L4 rules are evaluated only in this case. The
flag is only set when at least one tcp-req rule is present in the
frontend.
The L4 rules check function has now been moved to proto_tcp.c where
it ought to be.
For a long time we had two large accept() functions, one for TCP
sockets instantiating proxies, and another one for UNIX sockets
instantiating the stats interface.
A lot of code was duplicated and both did not work exactly the same way.
Now we have a stream_sock layer accept() called for either TCP or UNIX
sockets, and this function calls the frontend-specific accept() function
which does the rest of the frontend-specific initialisation.
Some code is still duplicated (session & task allocation, stream interface
initialization), and might benefit from having an intermediate session-level
accept() callback to perform such initializations. Still there are some
minor differences that need to be addressed first. For instance, the monitor
nets should only be checked for proxies and not for other connection templates.
Last, we renamed l->private as l->frontend. The "private" pointer in
the listener is only used to store a frontend, so let's rename it to
eliminate this ambiguity. When we later support detached listeners
(eg: FTP), we'll add another field to avoid the confusion.
The 'client.c' file now only contained frontend-specific functions,
so it has naturally been renamed 'frontend.c'. Same for client.h. This
has also been an opportunity to remove some cross references from
files that should not have depended on it.
In the end, this file should contain a protocol-agnostic accept()
code, which would initialize a session, task, etc... based on an
accept() from a lower layer. Right now there are still references
to TCP.
Some functions which act on generic buffer contents without being
tcp-specific were historically in proto_tcp.c. This concerns ACLs
and RDP cookies. Those have been moved away to more appropriate
locations. Ideally we should create some new files for each layer6
protocol parser. Let's do that later.
Using get_ip_from_hdr2() we can look for occurrence #X or #-X and
extract the IP it contains. This is typically designed for use with
the X-Forwarded-For header.
Using "usesrc hdr_ip(name,occ)", it becomes possible to use the IP address
found in <name>, and possibly specify occurrence number <occ>, as the
source to connect to a server. This is possible both in a server and in
a backend's source statement. This is typically used to use the source
IP previously set by an upstream proxy.
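A hypothetical configuration sketch (header name and occurrence chosen
for illustration, a negative occurrence counting from the last one):

backend app
    source 0.0.0.0 usesrc hdr_ip(X-Forwarded-For,-1)
    server srv1 192.168.0.1:80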
The transparent proxy address selection was set in the TCP connect function
which is not the most appropriate place since this function has limited
access to the amount of parameters which could produce a source address.
Instead, now we determine the source address in backend.c:connect_server(),
right after calling assign_server_address() and we assign this address in
the session and pass it to the TCP connect function. This cannot be performed
in assign_server_address() itself because in some cases (transparent mode,
dispatch mode or http_proxy mode), we assign the address somewhere else.
This change will open the ability to bind to addresses extracted from many
other criteria (eg: from a header).
The following patch fixed an issue but brought another one :
296897 [MEDIUM] connect to servers even when the input has already been closed
The new issue is that when a connection is inspected and aborted using
TCP inspect rules, now it is sent to the server before being closed. So
that test is not satisfying. A probably better way is not to prevent a
connection from establishing if only BF_SHUTW_NOW is set but BF_SHUTW
is not. That way, the BF_SHUTW flag is not set if the request has any
data pending, which still fixes the stats issue, but does not let any
empty connection pass through.
Also, as a safety measure, we extend buffer_abort() to automatically
disable the BF_AUTO_CONNECT flag. While it appears to always be OK,
it is by pure luck, so better safe than sorry.
The trash buffer may now be smaller than a buffer because we can tune
it at run time. This causes a risk when we're trying to use it as a
temporary buffer to realign unaligned requests, because we may have to
put up to a full buffer into it.
Instead of doing a double copy, we're now relying on an open-coded
bouncing copy algorithm. The principle is that we move one byte at
a time to its final place, and if that place also holds a byte, then
we move it too, and so on. We finish when we've moved all the buffer.
It limits the number of memory accesses, but since it proceeds one
byte at a time and with random walk, it's not cache friendly and
should be slower than a double copy. However, it's only used in
extreme situations and the difference will not be noticeable.
It has been extensively tested and works reliably.
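The idea can be illustrated with the classic cycle-following
("juggling") rotation, which moves every byte straight to its final
place (a sketch of the principle, not the actual buffer_bounce_realign
code):

static unsigned int gcd_u(unsigned int a, unsigned int b)
{
    while (b) {
        unsigned int t = a % b;
        a = b;
        b = t;
    }
    return a;
}

/* rotate buf[0..len-1] left by <ofs> bytes, in place, one byte at a time */
static void bounce_rotate(char *buf, unsigned int len, unsigned int ofs)
{
    unsigned int cycles, i, j, k;

    if (!len || !(ofs %= len))
        return;
    cycles = gcd_u(len, ofs);  /* number of independent move cycles */
    for (i = 0; i < cycles; i++) {
        char tmp = buf[i];

        for (j = i; ; j = k) {
            k = j + ofs;
            if (k >= len)
                k -= len;
            if (k == i)
                break;
            buf[j] = buf[k];
        }
        buf[j] = tmp;
    }
}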
This is a first attempt to add a maintenance mode on servers, using
the stat socket (in admin level).
It can be done with the following command :
- disable server <backend>/<server>
- enable server <backend>/<server>
In this mode, no more checks will be performed on the server and it
will be marked as a special DOWN state (MAINT).
If some servers were tracking it, they'll go DOWN until the server
leaves the maintenance mode. The stats page and the CSV export also
display this special state.
This can be used to disable the server in haproxy before doing some
operations on this server itself. This is a good complement to the
"http-check disable-on-404" keyword and works in TCP mode.
Support the new syntax (http-request allow/deny/auth) in
http stats.
Now it is possible to use the same syntax as in
the frontend/backend http-request access control:
acl src_nagios src 192.168.66.66
acl stats_auth_ok http_auth(L1)
stats http-request allow if src_nagios
stats http-request allow if stats_auth_ok
stats http-request auth realm LB
The old syntax is still supported, but now it is emulated
via private acls and an additional userlist.
Add generic authentication & authorization support.
Groups are implemented as bitmaps so the count is limited to
sizeof(int)*8 == 32.
Encrypted passwords are supported with libcrypt and crypt(3), so it is
possible to use any method supported by your system. For example modern
Linux/glibc installations support MD5/SHA-256/SHA-512 and of course classic,
DES-based encryption.
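A minimal verification sketch with crypt(3), where the stored hash
supplies the salt and method prefix (e.g. "$1$" for MD5, "$6$" for
SHA-512 on glibc); an illustration, not the actual code:

#define _XOPEN_SOURCE 700
#include <string.h>
#include <unistd.h>  /* crypt(); some systems need <crypt.h> and -lcrypt */

/* return non-zero if <plain> matches the crypt(3) entry <stored> */
static int check_password(const char *plain, const char *stored)
{
    const char *hash = crypt(plain, stored);

    return hash && strcmp(hash, stored) == 0;
}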
Just as for the req* rules, we can now condition rsp* rules with ACLs.
ACLs match on response, so volatile request information cannot be used.
A warning is emitted if a configuration contains such an anomaly.
From now on, if request filters have ACLs defined, these ACLs will be
evaluated to condition the filter. This will be used to conditionally
remove/rewrite headers based on ACLs.
This function automatically builds a rule, considering the if/unless
statements, and automatically updates the proxy's acl_requires, the
condition's file and line.
Calling this function after http_find_header2() automatically deletes
the current value of the header, and removes the header itself if the
value is the only one. The context is automatically adjusted for a
next call to http_find_header2() to return the next header. No other
change nor test should be made on the transient context though.
The stream_int_cond_close() function was added to preserve the
contents of the response buffer because stream_int_retnclose()
was buggy. It flushed the response instead of flushing the
request. This caused issues with pipelined redirects followed
by error messages which ate the previous response.
This might even have caused object truncation on pipelined
requests followed by an error or by a server redirection.
Now that this is fixed, simply get rid of the now useless
function.
Several HTTP analysers used to set those flags to values that
were useful but without considering the possibility that they
were not called again to clean what they did. First, replace
direct flag manipulation with more explicit macros. Second,
enforce a rule stating that any buffer which changes one of
these flags from the default must restore it after completion,
so that other analysers see correct flags.
With both this fix and the previous one about analyser bits,
we should not see any more stuck sessions.
Supported information, available via "tr/td title":
- cap: capabilities (proxy)
- mode: one of tcp, http or health (proxy)
- id: SNMP ID (proxy, socket, server)
- IP (socket, server)
- cookie (backend, server)
The body parser will be used in close and keep-alive modes. It follows
the stream to keep in sync with both the request and the response message.
Both chunked transfer-coding and content-length are supported according to
RFC2616.
The multipart/byterange encoding has not yet been implemented and, if not
seconded by either of the two other ones, will be forwarded till the close,
as requested by the specification.
Both the request and the response analysers converge into an HTTP_MSG_DONE
state where it will be possible to force a close (option forceclose) or to
restart with a fresh new transaction and maintain keep-alive.
This change is important. All tests are OK but any possible behaviour
change with "option httpclose" might find its root here.
This code really belongs to the http part since it's transaction-specific.
This will also make it easier to later reinitialize a transaction in order
to support keepalive.
We used to apply a limit to each buffer's size in order to leave
some room to rewrite headers, then we used to remove this limit
once the session switched to a data state.
Proceeding that way becomes a problem with keepalive because we
have to know when to stop reading too much data into the buffer
so that we can leave some room again to process next requests.
The principle we adopt here consists in only relying on to_forward+send_max.
Indeed, both of those data define how many bytes will leave the buffer.
So as long as their sum is larger than maxrewrite, we can safely
fill the buffers. If they are smaller, then we refrain from filling
the buffer. This means that we won't risk to fill buffers when
reading last data chunk followed by a POST request and its contents.
The only impact identified so far is that we must ensure that the
BF_FULL flag is correctly dropped when starting to forward. Right
now this is OK because nobody inflates to_forward without using
buffer_forward().
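The rule can be sketched as a simple computation (hypothetical names):
the amount we accept to read is the free room, minus whatever part of
the maxrewrite reserve is not already covered by bytes scheduled to
leave:

/* how many more bytes may safely enter the buffer */
static int fill_limit(int free_room, int maxrewrite,
                      int to_forward, int send_max)
{
    int leaving = to_forward + send_max;

    if (leaving >= maxrewrite)
        return free_room;  /* the reserve will free itself anyway */
    free_room -= maxrewrite - leaving;  /* keep the rest reserved */
    return free_room > 0 ? free_room : 0;
}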
Implement decreasing health based on observing communication between
HAProxy and servers.
Changes in this version 2:
- documentation
- close race between a started check and health analysis event
- don't force fastinter if it is not set
- better names for options
- layer4 support
Changes in this version 3:
- add stats
- port to the current 1.4 tree
To sum up :
- len : it's now the max number of characters for the value, preventing
garbled results.
- a new option "prefix" is added, this allows to use dynamic cookie
names (e.g. ASPSESSIONIDXXX).
Previously in the thread, I wanted to use the value found with
"capture cookie" but when i started to update the documentation, I
found this solution quite weird. I've made a small rework to not
depend on "capture cookie".
- There's the possibility to define the URL parser mode (path parameters
or query string).
We now set msg->col and msg->sov to the first byte of non-header.
They will be used later when parsing chunks. A new macro was added
to perform size additions on an http_msg in order to limit the risks
of copy-paste in the long term.
During this operation, it appeared that the http_msg struct was not
optimal on 64-bit, so it was re-ordered to fill the holes.
Right now, an HTTP server cannot track a TCP server and vice-versa.
This patch enables proxy tracking without relying on the proxy's mode
(tcp/http/health). It only requires a matching proxy name to exist. The
original function was renamed to findproxy_mode().
All files referencing the previous ebtree code were changed to point
to the new one in the ebtree directory. A makefile variable (EBTREE_DIR)
is also available to use files from another directory.
The ability to build the libebtree library temporarily remains disabled
because it can have an impact on some existing toolchains and does not
appear worth it in the medium term if we add support for multi-criteria
stickiness for instance.
The code part which waits for an HTTP response has been extracted
from the old function. We now have two analysers and the second one
may re-enable the first one when an 1xx response is encountered.
This has been tested and works.
The calls to stream_int_return() that were remaining in the wait
analyser have been converted to stream_int_retnclose().
This patch has 2 goals :
1. I wanted to test the appsession feature with a small PHP code,
using PHPSESSID. The problem is that when PHP gets an unknown session
id, it creates a new one with this ID. So, when sending an unknown
session to PHP, persistence is broken : haproxy won't see any new
cookie in the response and will never attach this session to a
specific server.
This also happens when you restart haproxy : the internal hash becomes
empty and all sessions lose their persistence (load balancing the
requests on all backend servers, creating a new session on each one).
For a user, it's like the service is unusable.
The patch modifies the code to make haproxy also learn the persistence
from the client : if no session is sent from the server, then the
session id found in the client part (using the URI or the client cookie)
is used to associate the server that gave the response.
As it's probably not a feature usable in all cases, I added an option
to enable it (by default it's disabled). The syntax of appsession becomes :
appsession <cookie> len <length> timeout <holdtime> [request-learn]
This helps haproxy repair the persistence (with the risk of losing its
session at the next request, as the user will probably not be load
balanced to the same server the first time).
2. This patch also tries to reduce the memory usage.
Here is a little example to explain the current behaviour :
- Take a Tomcat server where /session.jsp is valid.
- Send a request using a cookie with an unknown value AND a path
parameter with another unknown value :
curl -b "JSESSIONID=12345678901234567890123456789012" http://<haproxy>/session.jsp;jsessionid=00000000000000000000000000000001
(I know, it's unexpected to have a request like that on a live service)
Here, haproxy finds the URI session ID and stores it in its internal
hash (with no server associated). But it also finds the cookie session
ID and stores it again.
- As a result, session.jsp sends a new session ID also stored in the
internal hash, with a server associated.
=> For 1 request, haproxy has stored 3 entries, only 1 of which is usable
The patch modifies the behaviour to store only 1 entry (maximum).
This alone makes a typical HTML stats dump consume 10% CPU less,
because we avoid doing complex printf calls to drop them later.
Only a few common cases have been checked, those which are very
likely to run for nothing.
It is a bit expensive and complex to call buffer_feed()
directly from the request parser, and there are risks that some
output messages are lost in case of buffer full. Since most of
these messages are static, let's have a state dedicated to print
these messages and store them in a specific area shared with the
stats in the session. This both reduces code size and risks of
losing output data.
Add two functions to encode an input chunk, replacing
non-printable, non-ASCII or special characters
with:
"&#%u;" - chunk_htmlencode
"<%02X>" - chunk_asciiencode
The above functions should be used when adding strings received
from possibly unsafe sources to html stats or logs.
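A standalone sketch of the encoding principle (simplified, not the
actual chunk functions which operate on struct chunk): printable ASCII
that is not HTML-special is copied verbatim, everything else is emitted
as "&#%u;":

#include <stdio.h>

/* returns the number of bytes written to <dst>; <dstlen> must be > 0 */
static size_t html_encode(char *dst, size_t dstlen,
                          const unsigned char *src, size_t srclen)
{
    size_t out = 0;

    for (size_t i = 0; i < srclen; i++) {
        unsigned char c = src[i];

        if (c >= 0x20 && c < 0x7f &&
            c != '<' && c != '>' && c != '&' && c != '"') {
            if (out + 1 >= dstlen)
                break;
            dst[out++] = (char)c;
        } else {
            int n = snprintf(dst + out, dstlen - out, "&#%u;", c);

            if (n < 0 || (size_t)n >= dstlen - out)
                break;
            out += (size_t)n;
        }
    }
    dst[out] = '\0';
    return out;
}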
int get_backend_server(const char *bk_name, const char *sv_name,
struct proxy **bk, struct server **sv);
This function scans the list of backends and servers to retrieve the first
backend and the first server with the given names, and sets them in both
parameters. It returns zero if either is not found, or non-zero and sets
the ones it did not find to NULL. If a NULL pointer is passed for the
backend, only the pointer to the server will be updated.
Consistent hashing provides some interesting advantages over common
hashing. It avoids full redistribution in case of a server failure,
or when expanding the farm. This has a cost however: the hashing is
far from perfect, as we associate a server with a request by
searching for the server with the closest key in a tree. Since servers
appear multiple times based on their weights, it is recommended to
use weights larger than approximately 10-20 in order to smooth out
the distribution a bit.
In some cases, playing with weights will be the only solution to
make a server appear more often and increase its chances of being
picked, so stats are very important with consistent hashing.
In order to indicate the type of hashing, use:
hash-type map-based (default, old one)
hash-type consistent (new one)
Consistent hashing can make sense in a cache farm, in order not
to redistribute everyone when a cache changes state. It could also
probably be used for long sessions such as terminal sessions, though
that has not been attempted yet.
More details on this hashing method here:
http://www.spiteful.com/2008/03/17/programmers-toolbox-part-3-consistent-hashing/
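To illustrate the lookup principle only, here is a simplified,
self-contained sketch (not haproxy's actual tree-based
implementation) :

    #include <stdint.h>

    struct chash_node {
        uint32_t key;     /* position on the hash ring */
        int      server;  /* index of the server owning this point */
    };

    /* Return the server owning the first point whose key is >= <h>,
     * wrapping around to the lowest point when <h> is past the end.
     * <ring> must be sorted by key and hold <n> > 0 entries; each
     * server normally appears several times, proportionally to its
     * weight, which is why low weights give a rough distribution.
     */
    static int chash_lookup(const struct chash_node *ring, int n, uint32_t h)
    {
        int lo = 0, hi = n - 1;

        if (h > ring[hi].key)
            return ring[0].server;  /* wrap around the ring */

        while (lo < hi) {
            int mid = (lo + hi) / 2;
            if (ring[mid].key < h)
                lo = mid + 1;
            else
                hi = mid;
        }
        return ring[lo].server;
    }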
Recent "struct chunk rework" introduced a NULL pointer dereference
and now haproxy segfaults if auth is required for stats but not found.
The reason is that size_t cannot store negative values, but current
code assumes that "len < 0" == uninitialized.
This patch fixes it.
There are a few remaining max values that need to move to counters.
Also, the counters are more often used than some config information,
so get them closer to the other useful struct members for better cache
efficiency.
This patch makes it possible to collect & provide separate statistics
for each socket.
It can be very useful to distinguish between traffic
generated by local and remote users, or between different types of remote
clients (peerings, domestic, foreign).
Currently no "session rate" is supported, but adding it should be possible
if it proves useful.
Doing this, we can remove the last BF_HIJACK user and remove
produce_content(). s->data_source could also be removed, but
it is currently used to detect whether the stats or a server
was used.
The stats handler used to store internal states in s->ana_state. Now
we only rely on si->st0 in which we can store as many states as we
have possible outputs. This cleans up the stats code a lot and makes
it more maintainable. It has also reduced code size by a few hundred
bytes.
We can simplify the code in the stats functions by using buffer_feed_chunk()
instead of buffer_write_chunk(). Let's start with this function. This
patch also fixes an issue where we could dump past the end of the capture
buffer when it is shorter than the captured request.
Calling buffer_shutw() marks the buffer as closed but if it was already
closed in the other direction, the stream interface is not marked as
closed, causing infinite loops.
We took this opportunity to completely remove buffer_shutw() and buffer_shutr()
which have no reason to be used at all and which will always cause trouble
when directly called. The stats occurrence was the last one.
We need to move hash map accesses out of backend.c if we want to
later support new hash methods. This patch separates the hash computation
method from the server lookup. It leaves the lookup function to lb_map.c
and calls it with the result of the hash.
It was becoming painful to have all the LB algos in backend.c.
Let's move them to their own files. A few hashing functions still
need to be broken in two parts, one for the contents and one for the
map position.
There is no reason to inline functions which are used to grab a server
depending on an LB algo. They are large and used at several places.
Uninlining them saves 400 bytes of code.
We can get rid of the stats analyser by moving all the stats code
to a stream interface applet. Besides being cleaner, it provides new
advantages such as the ability to process requests and responses
from the same function and to work only with simple state machines.
There's no need for any hijack hack anymore.
The direct advantages for the user are the interactive mode and the
ability to chain several commands delimited by a semi-colon. Now if
the user types "prompt", he gets a prompt from which he can send
as many requests as he wants. All outputs are terminated by a
blank line followed by a new prompt, so this can be used from
external tools too.
The code is not very clean and needs some rework, but some of
the dirty parts are due to the remnants of the hijack mode used
by the old functions we call.
The old AN_REQ_STATS_SOCK analyser flag is now unused and has been
removed.
It will soon be necessary to have stream interfaces running as part of
the current task, or as independent tasks, for instance when we want to
implement compression or SSL. This will also be used for applets running
as stream interfaces.
These new functions are used to make exactly that possible. Note that it's
still not easy to write a simple echo applet and more functions will
likely be needed.
Those two functions did not correctly deal with full buffers and/or
buffers that wrapped around. buffer_skip() was even able to incorrectly
set buf->w beyond the end of the buffer if its len argument was wrong,
and buffer_si_getline() could incorrectly return a length larger
than the amount of data actually available in the buffer.
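As an illustration, here is what a correct skip can look like under a
simplified wrapping-buffer model (field names follow the text above;
the real structure differs) :

    #include <stddef.h>

    struct wbuf {
        char  *data;  /* start of the storage area */
        size_t size;  /* total storage size */
        char  *w;     /* next byte to send (may wrap) */
        size_t l;     /* amount of data present */
    };

    /* Advance the send pointer by <len> bytes, never skipping more
     * data than is present and wrapping at the end of the storage
     * area, so that <w> can never point past the end of the buffer.
     */
    static void wbuf_skip(struct wbuf *buf, size_t len)
    {
        if (len > buf->l)
            len = buf->l;
        buf->w += len;
        if (buf->w >= buf->data + buf->size)
            buf->w -= buf->size;
        buf->l -= len;
    }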
It's important that these functions set these flags themselves, otherwise
the callers will always have to do this, and there is no valid reason for
not doing it.
Collect information about the last health check result,
including the L7 code if possible (for example the HTTP or SMTP
return code) and the time it took to finish the last check.
Health check info is provided on both stats pages (HTML & CSV)
and logged when a server is marked UP or DOWN. Currently running
checks are marked with an asterisk, but only in HTML mode.
Currently there are 14 status codes:
UNK -> unknown
INI -> initializing
SOCKERR -> socket error
L4OK -> check passed on layer 4, no upper layers testing enabled
L4TOUT -> layer 1-4 timeout
L4CON -> layer 1-4 connection problem, for example "Connection refused"
(tcp rst) or "No route to host" (icmp)
L6OK -> check passed on layer 6
L6TOUT -> layer 6 (SSL) timeout
L6RSP -> layer 6 invalid response - protocol error
L7OK -> check passed on layer 7
L7OKC -> check conditionally passed on layer 7, for example
404 with disable-on-404
L7TOUT -> layer 7 (HTTP/SMTP) timeout
L7RSP -> layer 7 invalid response - protocol error
L7STS -> layer 7 response error, for example HTTP 5xx
In TCP, we don't want to forward chunks of data, we want to forward
indefinitely. This patch introduces a special value for the amount
of data to be forwarded. When buffer_forward() is called with
BUF_INFINITE_FORWARD, it configures the buffer to never stop
forwarding until the end.
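A usage sketch, based only on the call form described above :

    /* in a pure TCP path: schedule endless forwarding once, instead
     * of re-arming a byte count for every chunk that comes in.
     */
    buffer_forward(buf, BUF_INFINITE_FORWARD);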
The BF_EMPTY flag was once used to indicate an empty buffer. However,
it was used half the time to mean that the buffer is empty for the
reader, and half the time to mean that there is nothing left to send.
"Nothing to send" is only indicated by "->send_max == 0 && !pipe". Once
we fix this, we discover that the flag is not used anymore. So the
flag has been renamed BF_OUT_EMPTY and means exactly the condition
above, i.e. there is nothing to send.
Doing so has allowed us to remove some unused tests for emptiness,
but also to uncover a certain amount of situations where the flag
was not correctly set or tested.
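A sketch of how the new flag maps to that condition (names taken from
the text above; the surrounding structure is assumed) :

    /* BF_OUT_EMPTY holds exactly when nothing is left to send */
    if (b->send_max == 0 && !b->pipe)
        b->flags |= BF_OUT_EMPTY;
    else
        b->flags &= ~BF_OUT_EMPTY;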
The BF_WRITE_ENA buffer flag became very complex to deal with, because
it was used to :
- enable automatic connection
- enable close forwarding
- enable data forwarding
The last point was not really true anymore since we introduced ->send_max,
but the test still remained everywhere. This was causing issues such as
the impossibility to connect without forwarding data, or the impossibility
to prevent closing when data was being forwarded, etc.
This patch clarifies the situation by getting rid of this multi-purpose
flag and replacing it with :
- data forwarding based only on ->send_max || ->pipe ;
- a new BF_AUTO_CONNECT flag to allow automatic connection and only
that ;
- ability to perform an automatic connection when ->send_max or ->pipe
indicate that data is waiting to leave the buffer ;
- a new BF_AUTO_CLOSE flag to let the producer automatically set the
BF_SHUTW_NOW flag when it gets a BF_SHUTR.
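For the last point, a hedged sketch of the producer-side logic (flag
names are from the text above; the structure is assumed) :

    /* with BF_AUTO_CLOSE set, seeing a read shutdown automatically
     * schedules a write shutdown on the other side.
     */
    if ((b->flags & (BF_AUTO_CLOSE | BF_SHUTR)) ==
        (BF_AUTO_CLOSE | BF_SHUTR))
        b->flags |= BF_SHUTW_NOW;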
During this cleanup, it was discovered that some tests were performed
twice, or that the BF_HIJACK flag was still tested, which is not needed
anymore since ->send_max replaced it. These places have been fixed too.
These cleanups have also revealed a few areas where the other flags
such as BF_EMPTY are not cleanly used. This will be an opportunity for
a second patch.
By inlining this function and slightly reordering it, we can double
the getchar/putchar test throughput and reduce its footprint by about
40 bytes. Also, it was the only non-inlined char-based function, so
inlining it makes the whole set more consistent.
This function is used to cut the "tail" of a buffer, which means strip it
to the length of pending outgoing data only and kill any remaining unsent
data. Any scheduled forwarding is stopped. This is mainly to be used to
send error messages after existing data. It does the same as buffer_erase()
for buffers without pending outgoing data.
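A self-contained sketch of the idea under a simplified flat-buffer
model (hypothetical fields, not the real structure) :

    #include <stddef.h>

    struct sbuf {
        char   data[8192];
        size_t out;   /* bytes already scheduled for sending */
        size_t len;   /* total bytes present in the buffer */
        size_t fwd;   /* bytes scheduled for automatic forwarding */
    };

    /* Keep only the pending outgoing data, drop everything appended
     * after it, and cancel any scheduled forwarding. When out == 0,
     * this behaves like erasing the buffer.
     */
    static void sbuf_cut_tail(struct sbuf *b)
    {
        b->len = b->out;
        b->fwd = 0;
    }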