haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 08:07:10 +02:00

Author	SHA1	Message	Date
Willy Tarreau	213e99073b	REORG: include: move listener.h to haproxy/listener{,-t}.h stdlib and list were missing from listener.h, otherwise it was OK.	2020-06-11 10:18:58 +02:00
Willy Tarreau	d7d2c28104	CLEANUP: include: remove unused mux_pt.h It used to be needed to export mux_pt_ops when it was the only way to detect a mux but that's no longer the case.	2020-06-11 10:18:57 +02:00
Willy Tarreau	8efbdfb77b	REORG: include: move obj_type.h to haproxy/obj_type{,-t}.h No change was necessary. It still includes lots of types/* files.	2020-06-11 10:18:57 +02:00
Willy Tarreau	762d7a5117	REORG: include: move frontend.h to haproxy/frontend.h There was no type file for this one, it only contains frontend_accept().	2020-06-11 10:18:57 +02:00
Willy Tarreau	0f6ffd652e	REORG: include: move fd.h to haproxy/fd{,-t}.h A few includes were missing in each file. A definition of struct polled_mask was moved to fd-t.h. The MAX_POLLERS macro was moved to defaults.h Stdio used to be silently inherited from whatever path but it's needed for list_pollers() which takes a FILE* and which can thus not be forward-declared.	2020-06-11 10:18:57 +02:00
Willy Tarreau	3afc4c4bb0	REORG: include: move dict.h to hparoxy/dict{,-t}.h This was entirely free-standing. haproxy/api-t.h was added for size_t.	2020-06-11 10:18:57 +02:00
Willy Tarreau	48fbcae07c	REORG: tools: split common/standard.h into haproxy/tools{,-t}.h And also rename standard.c to tools.c. The original split between tools.h and standard.h dates from version 1.3-dev and was mostly an accident. This patch moves the files back to what they were expected to be, and takes care of not changing anything else. However this time tools.h was split between functions and types, because it contains a small number of commonly used macros and structures (e.g. name_desc) which in turn cause the massive list of includes of tools.h to conflict with the callers. They remain the ugliest files of the whole project and definitely need to be cleaned and split apart. A few types are defined there only for functions provided there, and some parts are even OS-specific and should move somewhere else, such as the symbol resolution code.	2020-06-11 10:18:57 +02:00
Willy Tarreau	6131d6a731	REORG: include: move common/net_helper.h to haproxy/net_helper.h No change was necessary.	2020-06-11 10:18:57 +02:00
Willy Tarreau	92b4f1372e	REORG: include: move time.h from common/ to haproxy/ This one is included almost everywhere and used to rely on a few other .h that are not needed (unistd, stdlib, standard.h). It could possibly make sense to split it into multiple parts to distinguish operations performed on timers and the internal time accounting, but at this point it does not appear much important.	2020-06-11 10:18:56 +02:00
Willy Tarreau	3f567e4949	REORG: include: split hathreads into haproxy/thread.h and haproxy/thread-t.h This splits the hathreads.h file into types+macros and functions. Given that most users of this file used to include it only to get the definition of THREAD_LOCAL and MAXTHREADS, the bare minimum was placed into thread-t.h (i.e. types and macros). All the thread management was left to haproxy/thread.h. It's worth noting the drop of the trailing "s" in the name, to remove the permanent confusion that arises between this one and the system implementation (no "s") and the makefile's option (no "s"). For consistency, src/hathreads.c was also renamed thread.c. A number of files were updated to only include thread-t which is the one they really needed. Some future improvements are possible like replacing empty inlined functions with macros for the thread-less case, as building at -O0 disables inlining and causes these ones to be emitted. But this really is cosmetic.	2020-06-11 10:18:56 +02:00
Willy Tarreau	4c7e4b7738	REORG: include: update all files to use haproxy/api.h or api-t.h if needed All files that were including one of the following include files have been updated to only include haproxy/api.h or haproxy/api-t.h once instead: - common/config.h - common/compat.h - common/compiler.h - common/defaults.h - common/initcall.h - common/tools.h The choice is simple: if the file only requires type definitions, it includes api-t.h, otherwise it includes the full api.h. In addition, in these files, explicit includes for inttypes.h and limits.h were dropped since these are now covered by api.h and api-t.h. No other change was performed, given that this patch is large and affects 201 files. At least one (tools.h) was already freestanding and didn't get the new one added.	2020-06-11 10:18:42 +02:00
Emeric Brun	530ba38a70	BUG/MINOR: peers: fix internal/network key type mapping. Network types were directly and mistakenly mapped on sample types: This patch fix the doc with values effectively used to keep backward compatiblitiy on existing implementations. In addition it adds an internal/network mapping for key types to avoid further mistakes adding or modifying internals types. This patch should be backported on all maintained branches, particularly until v1.8 included for documentation part.	2020-06-02 16:25:19 +02:00
Ilya Shipitsin	c02a23f981	CLEANUP: assorted typo fixes in the code and comments This is 9th iteration of typo fixes	2020-05-11 10:11:29 +02:00
Emeric Brun	70de43b77b	BUG/MEDIUM: peers: resync ended with RESYNC_PARTIAL in wrong cases. This bug was introduced with peers.c code re-work (`7d0ceeec80`): "struct peer" flags are mistakenly checked instead of "struct peers" flags to check the resync status of the local peer. The issue was reported here: https://github.com/haproxy/haproxy/issues/545 This bug affects all branches >= 2.0 and should be backported.	2020-03-16 11:32:47 +01:00
Willy Tarreau	52bf839394	BUG/MEDIUM: random: implement a thread-safe and process-safe PRNG This is the replacement of failed attempt to add thread safety and per-process sequences of random numbers initally tried with commit `1c306aa84d` ("BUG/MEDIUM: random: implement per-thread and per-process random sequences"). This new version takes a completely different approach and doesn't try to work around the horrible OS-specific and non-portable random API anymore. Instead it implements "xoroshiro128*", a reputedly high quality random number generator, which is one of the many variants of xorshift, which passes all quality tests and which is described here: http://prng.di.unimi.it/ While not cryptographically secure, it is fast and features a 2^128-1 period. It supports fast jumps allowing to cut the period into smaller non-overlapping sequences, which we use here to support up to 2^32 processes each having their own, non-overlapping sequence of 2^96 numbers (~710^28). This is enough to provide 1 billion randoms per second and per process for 2200 billion years. The implementation was made thread-safe either by using a double 64-bit CAS on platforms supporting it (x86_64, aarch64) or by using a local lock for the time needed to perform the shift operations. This ensures that all threads pick numbers from the same pool so that it is not needed to assign per-thread ranges. For processes we use the fast jump method to advance the sequence by 2^96 for each process. Before this patch, the following config: global nbproc 8 frontend f bind :4445 mode http log stdout format raw daemon log-format "%[uuid] %pid" redirect location / Would produce this output: a4d0ad64-2645-4b74-b894-48acce0669af 12987 a4d0ad64-2645-4b74-b894-48acce0669af 12992 a4d0ad64-2645-4b74-b894-48acce0669af 12986 a4d0ad64-2645-4b74-b894-48acce0669af 12988 a4d0ad64-2645-4b74-b894-48acce0669af 12991 a4d0ad64-2645-4b74-b894-48acce0669af 12989 a4d0ad64-2645-4b74-b894-48acce0669af 12990 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12987 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12992 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12986 (...) And now produces: f94b29b3-da74-4e03-a0c5-a532c635bad9 13011 47470c02-4862-4c33-80e7-a952899570e5 13014 86332123-539a-47bf-853f-8c8ea8b2a2b5 13013 8f9efa99-3143-47b2-83cf-d618c8dea711 13012 3cc0f5c7-d790-496b-8d39-bec77647af5b 13015 3ec64915-8f95-4374-9e66-e777dc8791e0 13009 0f9bf894-dcde-408c-b094-6e0bb3255452 13011 49c7bfde-3ffb-40e9-9a8d-8084d650ed8f 13014 e23f6f2e-35c5-4433-a294-b790ab902653 13012 There are multiple benefits to using this method. First, it doesn't depend anymore on a non-portable API. Second it's thread safe. Third it is fast and more proven than any hack we could attempt to try to work around the deficiencies of the various implementations around. This commit depends on previous patches "MINOR: tools: add 64-bit rotate operators" and "BUG/MEDIUM: random: initialize the random pool a bit better", all of which will need to be backported at least as far as version 2.0. It doesn't require to backport the build fixes for circular include files dependecy anymore.	2020-03-08 10:09:02 +01:00
Willy Tarreau	0fbf28a05b	Revert "BUG/MEDIUM: random: implement per-thread and per-process random sequences" This reverts commit `1c306aa84d`. It breaks the build on all non-glibc platforms. I got confused by the man page (which possibly is the most confusing man page I've ever read about a standard libc function) and mistakenly understood that random_r was portable, especially since it appears in latest freebsd source as well but not in released versions, and with a slightly different API :-/ We need to find a different solution with a fallback. Among the possibilities, we may reintroduce this one with a fallback relying on locking around the standard functions, keeping fingers crossed for no other library function to call them in parallel, or we may also provide our own PRNG, which is not necessarily more difficult than working around the totally broken up design of the portable API.	2020-03-07 11:24:39 +01:00
Willy Tarreau	1c306aa84d	BUG/MEDIUM: random: implement per-thread and per-process random sequences As mentioned in previous patch, the random number generator was never made thread-safe, which used not to be a problem for health checks spreading, until the uuid sample fetch function appeared. Currently it is possible for two threads or processes to produce exactly the same UUID. In fact it's extremely likely that this will happen for processes, as can be seen with this config: global nbproc 8 frontend f bind :4445 mode http log stdout daemon format raw log-format "%[uuid] %pid" redirect location / It typically produces this log: 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30645 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30641 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30644 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30639 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30646 07764439-c24d-4e6f-a5a6-0138be59e7a8 30645 07764439-c24d-4e6f-a5a6-0138be59e7a8 30639 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30643 07764439-c24d-4e6f-a5a6-0138be59e7a8 30646 b6773fdd-678f-4d04-96f2-4fb11ad15d6b 30646 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30642 07764439-c24d-4e6f-a5a6-0138be59e7a8 30642 What this patch does is to use a distinct per-thread and per-process seed to make sure the same sequences will not appear, and will then extend these seeds by "burning" a number of randoms that depends on the global random seed, the thread ID and the process ID. This adds roughly 20 extra bits of randomness, resulting in 52 bits total per thread and per process. It only takes a few milliseconds to burn these randoms and given that threads start with a different seed, we know they will not catch each other. So these random extra bits are essentially added to ensure randomness between boots and cluster instances. This replaces all uses of random() with ha_random() which uses the thread-local state. This must be backported as far as 2.0 or any version having the UUID sample-fetch function since it's the main victim here. It's important to note that this patch, in addition to depending on the previous one "BUG/MEDIUM: init: initialize the random pool a bit better", also depends on the preceeding build fixes to address a circular dependency issue in the include files that prevented it from building. Part or all of these patches may need to be backported or adapted as well.	2020-03-07 06:11:15 +01:00
Willy Tarreau	6cde5d883c	CLEANUP: stick-tables: use read_u32() to display a node's key This fixes another aliasing issue that pops up in stick_table.c and peers.c's debug code.	2020-02-25 09:41:22 +01:00
Tim Duesterhus	d02ffe9b6d	CLEANUP: peers: Remove unused static function `free_dcache_tx` The function was added in commit `6c39198b57`, but was also used within a single function `free_dcache` which was unused itself. see issue #301 see commit `10ce0c2f31` which removed `free_dcache`	2020-02-05 23:40:17 +01:00
Tim Duesterhus	10ce0c2f31	CLEANUP: peers: Remove unused static function `free_dcache` The function was changed to be static in commit `6c39198b57`, but even that commit no longer uses it. The purpose of the change vs. outright removal is unclear. see issue #301	2020-02-05 18:49:29 +01:00
Fr�d�ric L�caille	3585cab221	BUG/MINOR: peers: "peer alive" flag not reset when deconnecting. The peer flags (->flags member of peer struct) are reset by __peer_session_deinit() function. PEER_F_ALIVE flag which is used by the heartbeat part of the peer protocol to mark a peer as being alive was not reset by this function. This simple patch adds add the statement to this. Note that, at this time, there was no identified issue due to this missing reset. Must be backported to 2.0.	2019-11-20 13:38:13 +01:00
Fr�d�ric L�caille	af9990f035	BUG/MINOR: peers: Wrong null "server_name" data field handling. As the peers protocol expects to parse at least one encoded integer value for each stick-table data field even when not configured on the local side, about the "server_name" data field we must emit something even if it has not been set (no server was configured for instance). As this data field is made of first one encoded integer which is the length of the remaining data (the dictionary cache entry), we encode the length 0 when emitting such an absent dictionary cache entry. On the remote side, when we decode such an integer with 0 as value, we stop parsing the data field and that's it. Must be backported to 2.0.	2019-11-19 14:48:33 +01:00
Fr�d�ric L�caille	ec1c10b839	MINOR: peers: Add debugging information to "show peers". This patch adds three counters to help in debugging peers protocol issues to "peer" struct: ->no_hbt counts the number of reconnection period without receiving heartbeat ->new_conn counts the number of reconnections after ->reconnect timeout expirations. ->proto_err counts the number of protocol errors.	2019-11-19 14:48:28 +01:00
Fr�d�ric L�caille	33cab3c0eb	MINOR: peers: Add TX/RX heartbeat counters. Add RX/TX heartbeat counters to "peer" struct to have an idead about which peer is alive or not. Dump these counters values on the CLI via "show peers" command.	2019-11-19 14:48:25 +01:00
Fr�d�ric L�caille	470502b420	MINOR: peers: Alway show the table info for disconnected peers. This patch enable us to dump the stick-table information of remote or local peers without already opened peer session. This may be the case also for the local peer during synchronizations with an old processus (reload).	2019-11-19 14:48:21 +01:00
Willy Tarreau	9d00869323	CLEANUP: cli: replace all occurrences of manual handling of return messages There were 221 places where a status message or an error message were built to be returned on the CLI. All of them were replaced to use cli_err(), cli_msg(), cli_dynerr() or cli_dynmsg() depending on what was expected. This removed a lot of duplicated code because most of the times, 4 lines are replaced by a single, safer one.	2019-08-09 11:26:10 +02:00
Willy Tarreau	02efedac0c	MINOR: peers: now remove the remote connection setup code The connection is not needed anymore, the backend does the job.	2019-07-19 13:50:09 +02:00
Willy Tarreau	1c8d32bb62	MAJOR: stream: store the target address into s->target_addr When forcing the outgoing address of a connection, till now we used to allocate this outgoing connection and set the address into it, then set SF_ADDR_SET. With connection reuse this causes a whole lot of issues and difficulties in the code. Thanks to the previous changes, it is now possible to store the target address into the stream instead, and copy the address from the stream to the connection when initializing the connection. assign_server_address() does this and as a result SF_ADDR_SET now reflects the presence of the target address in the stream, not in the connection. The http_proxy mode, the peers and the master's CLI now use the same mechanism. For now the existing connection code was not removed to limit the amount of tricky changes, but the allocated connection is not used anymore. This change also revealed a latent issue that we've been having around option http_proxy : the address was set in the connection but neither the SF_ADDR_SET nor the SF_ASSIGNED flags were set. It looks like the connection could establish only due to the fact that it existed with a non-null destination address.	2019-07-19 13:50:09 +02:00
Willy Tarreau	ca79f59365	MEDIUM: connection: make sure all address producers allocate their address This commit places calls to sockaddr_alloc() at the places where an address is needed, and makes sure that the allocation is properly tested. This does not add too many error paths since connection allocations are already in the vicinity and share the same error paths. For the two cases where a clear_addr() was called, instead the address was not allocated.	2019-07-19 13:50:09 +02:00
Willy Tarreau	3ca149018d	MINOR: peers: use conn->dst for the peer's target address The target address is duplicated from the peer's configured one. For now we keep the target address as-is but we'll have to dynamically allocate it and place it into the stream instead. Maybe a sockaddr_dup() will help by the way. The "show peers" part is safe as it's already called after checking the addresses' validity.	2019-07-19 13:50:09 +02:00
Willy Tarreau	8dfffdb060	MINOR: stream/cli: use conn_get_{src,dst} in "show sess" and "show peers" output The stream outputs requires to retrieve connections sources and destinations. The previous call involving conn_get_{to,from}_addr() was missing a status check which has now been integrated with the new call since these places already handle connection errors there. The same code parts were reused for "show peers" and were modified similarly.	2019-07-19 13:50:09 +02:00
Christopher Faulet	711ed6ae4a	MAJOR: http: Remove the HTTP legacy code First of all, all legacy HTTP analyzers and all functions exclusively used by them were removed. So the most of the functions in proto_http.{c,h} were removed. Only functions to deal with the HTTP transaction have been kept. Then, http_msg and hdr_idx modules were entirely removed. And finally the structure http_msg was lightened of all its useless information about the legacy HTTP. The structure hdr_ctx was also removed because unused now, just like unused states in the enum h1_state. Note that the memory pool "hdr_idx" was removed and "http_txn" is now smaller.	2019-07-19 09:24:12 +02:00
Fr�d�ric L�caille	b65717fa55	MINOR: peers: Optimization for dictionary cache lookup. When we look up an dictionary entry in the cache used upon transmission we store the last result in ->prev_lookup of struct dcache_tx so that to compare it with the subsequent entries to look up and save performances.	2019-06-07 15:47:54 +02:00
Fr�d�ric L�caille	fd827937ed	MINOR: peers: A bit of optimization when encoding cached server names. When a server name is cached we only send its cache entry ID which has an encoded length of 1 (because smaller than PEER_ENC_2BYTES_MIN). So, in this case we only have to encode 1, the already known encoded length of this ID before encoding it. Furthermore we do not have to call strlen() to compute the lengths of server name strings thanks to this commit: "MINOR: dict: Store the length of the dictionary entries".	2019-06-07 15:47:54 +02:00
Fr�d�ric L�caille	6c39198b57	MINOR peers: data structure simplifications for server names dictionary cache. We store pointers to server names dictionary entries in a pre-allocated array of ebpt_node's (->entries member of struct dcache_tx) to cache those sent to remote peers. Consequently the ID used to identify the server name dictionary entry is also used as index for this array. There is no need to implement a lookup by key for this dictionary cache.	2019-06-07 15:47:54 +02:00
Willy Tarreau	237f8aef41	BUILD: peers: fix a build warning about an incorrect intiialization Just got this one : src/peers.c:528:13: warning: missing braces around initializer [-Wmissing-braces] src/peers.c:528:13: warning: (near initialization for 'cde.key') [-Wmissing-braces] Indeed, this struct contains two structs so scalar zero is not a valid value for the first field. Let's just leave it as an empty struct since it was the purpose.	2019-06-06 16:42:14 +02:00
Fr�d�ric L�caille	56aec0ddc6	BUG/MINOR: peers: Wrong server name parsing. This commit was not complete: BUG/MINOR: peers: Wrong "server_name" decoding. We forgot forgotten to move forward <msg_cur> pointer variable after having parse the server name string. Again this bug may happen only if we add stick-table new data type after the server name which is the current last one. Furthermore this bug is visible only the first time a peer sends a server name for a stick-table entry. Nothing to backport.	2019-06-06 16:06:00 +02:00
Fr�d�ric L�caille	344e94816c	BUG/MINOR: peers: Wrong "server_name" decoding. This patch fixes a bug which does not occur at this time because the "server_name" stick-table data type is the last one (see STKTABLE_DT_SERVER_NAME). It was introduced by this commit: "MINOR: peers: Make peers protocol support new "server_name" data type". Indeed when receiving STD_T_DICT stick-table data type we first decode the length of these data, then we decode the ID of this dictionary entry. To know if there is remaining data to parse, we check if we have reached the end of the current data, relying on <msg_end> variable. But <msg_end> is at the end of the entire message! So this patch computes the correct end of the current STD_T_DICT before doing anything else with it. Nothing to backport.	2019-06-05 13:36:34 +02:00
Fr�d�ric L�caille	36fb77e295	MINOR: peers: Replace hard-coded values for peer protocol messaging by macros. Simple patch to replace hard-coded values in relation with bytes identifiers used for stick-table messages by macros.	2019-06-05 08:42:36 +02:00
Fr�d�ric L�caille	32b5573b13	MINOR: peers: Replace hard-coded for peer protocol 64-bits value encoding by macros. With this patch we define macros for the minimum values which are encoded for 2 up to 10 bytes. This latter is big enough to encode UINT64_MAX. We replaced at several places 240 value by PEER_ENC_2BYTES_MIN which is the minimum value which is encoded with 2 bytes. The peer protocol encoding consisting in encoding with only one byte a value which is less than PEER_ENC_2BYTES_MIN and with at least 2 bytes a 64-bits value greater than PEER_ENC_2BYTES_MIN.	2019-06-05 08:42:36 +02:00
Fr�d�ric L�caille	62b0b0bc02	MINOR: peers: Add dictionary cache information to "show peers" CLI command. This patch adds dictionary entries cached and used for the server by name stickiness feature (exchanged thanks to peers protocol).	2019-06-05 08:42:36 +02:00
Fr�d�ric L�caille	8d78fa7def	MINOR: peers: Make peers protocol support new "server_name" data type. Make usage of the APIs implemented for dictionaries (dict.c) and their LRU caches (struct dcache) so that to send/receive server names used for the server by name stickiness. These names are sent over the network as follows: - in every case we send the encode length of the data (STD_T_DICT), then - if the server names is not present in the cache used upon transmission (struct dcache_tx) we cache it and we the ID of this TX cache entry followed the encode length of the server name, and finally the sever name itseft (non NULL terminated string). - if the server name is present, we repead these operations but we only send the TX cache entry ID. Upon receipt, the couple of (cache IDs, server name) are stored the LRU cache used only upon receipt (struct dcache_rx). As the peers protocol is symetrical, the fact that the server name is present in the received data (resp. or not) denotes if the entry is absent (resp. or not).	2019-06-05 08:42:33 +02:00
Fr�d�ric L�caille	74167b25f7	MINOR: peers: Add a LRU cache implementation for dictionaries. We want to send some stick-table data fields stored as strings in dictionaries without consuming too much memory and CPU. To do so we implement with this patch a cache for send/received dictionaries entries. These dictionary of strings entries are stored in others real dictionary entries with an identifier as key (unsigned int) and a pointer to the dictionary of strings entries as values.	2019-06-05 08:33:35 +02:00
Fr�d�ric L�caille	0e8db97df4	BUG/MINOR: peers: Wrong stick-table update message building. When creating this patch "CLEANUP: peers: Replace hard-coded values by macros", we realized there was a remaining place in peer_prepare_updatemsg() where the maximum of an encoded length harcoded value could be replaced by PEER_MSG_ENCODED_LENGTH_MAXLEN macro. But in this case, the 1 harcoded value for the header length is wrong. Should be 2 or PEER_MSG_HEADER_LEN. So, there is a missing byte to encode the length of remaining data after the header. Note that the bug was never encountered because even with a missing byte, we could encode a maximum length which would be (1<<25) (32MB) according to the following extract of the peers protocol documentation which were from far a never reached limit I guess: I) Encoded Integer and Bitfield. 0 <= X < 240 : 1 byte (7.875 bits) [ XXXX XXXX ] 240 <= X < 2288 : 2 bytes (11 bits) [ 1111 XXXX ] [ 0XXX XXXX ] 2288 <= X < 264432 : 3 bytes (18 bits) [ 1111 XXXX ] [ 1XXX XXXX ] [ 0XXX XXXX ] 264432 <= X < 33818864 : 4 bytes (25 bits) [ 1111 XXXX ] [ 1XXX XXXX ]2 [ 0XXX XXXX ] 33818864 <= X < 4328786160 : 5 bytes (32 bits) [ 1111 XXXX ] [ 1XXX XXXX ]3 [ 0XXX XXXX ]	2019-06-05 08:33:34 +02:00
Fr�d�ric L�caille	39143340ec	CLEANUP: peers: Replace hard-coded values by macros. All the peer stick-table messages are made of a 2-byte header (PEER_MSG_HEADER_LEN) followed by the encoded length of the remaining data wich is harcoded as 5 (in bytes) for the maximum (PEER_MSG_ENCODED_LENGTH_MAXLEN). With such a length we can encode a maximum length which equals to (1 << 32) - 1, which is from far enough. This patches replaces both these values by macros where applicable.	2019-06-05 08:33:34 +02:00
Fr�d�ric L�caille	7fcc24d4ef	MINOR: peers: Do not emit global stick-table names. This commit "MINOR: stick-table: Add prefixes to stick-table names" prepended the "peers" section name to stick-table names declared in such "peers" sections followed by a '/' character. This is not this name which must be sent over the network to avoid collisions with stick-table name declared as backends. As the '/' character is forbidden as first character of a backend name, we prefix the stick-table names declared in peers sections only with a '/' character. With such declarations: peers mypeers table t1 backend t1 stick-table ... peers mypeers at peer protocol level, "t1" declared as stick-table in "mypeers" section is different of "t1" stick-table declared as backend. In src/peers.c, only two modifications were required: use ->nid stktable struct member in place of ->id in peer_prepare_switchmsg() to prepare the stick-table definition messages. Same thing in peer_treat_definemsg() to treat a stick-table definition messages.	2019-05-07 06:54:07 +02:00
Emeric Brun	0bbec0fa34	MINOR: peers: adds counters on show peers about tasks calls. This patch adds a counter of calls on the orchestator peers task and a counter on the tasks linked to applet i/o handler for each peer. Those two counters are useful to detect if a peer sync is active or frozen. This patch is related to the commit: "MINOR: peers: Add a new command to the CLI for peers." and should be backported with it.	2019-04-18 18:24:25 +02:00
Olivier Houchard	3f795f76e8	MEDIUM: tasks: Merge task_delete() and task_free() into task_destroy(). task_delete() was never used without calling task_free() just after, and task_free() was only used on error pathes to destroy a just-created task, so merge them into task_destroy(), that will remove the task from the wait queue, and make sure the task is either destroyed immediately if it's not in the run queue, or destroyed when it's supposed to run.	2019-04-18 10:10:04 +02:00
Fr�d�ric L�caille	95679dc096	MINOR: peers: Add a new command to the CLI for peers. Implements "show peers [peers section]" new CLI command to dump information about the peers and their stick-tables to be synchronized and others internal. May be backported as far as 1.5.	2019-04-16 09:58:40 +02:00
Emeric Brun	9ef2ad7844	BUG/MEDIUM: peers: fix a case where peer session is not cleanly reset on release. The deinit took place in only peer_session_release, but in the a case of a previous call to peer_session_forceshutdown, the session cursors won't be reset, resulting in a bad state for new session of the same peer. For instance, a table definition message could be dropped and so all update messages will be dropped by the remote peer. This patch move the deinit processing directly in the force shutdown funtion. Killed session remains in "ST_END" state but ref on peer was reset to NULL and deinit will be skipped on session release function. The session release continue to assure the deinit for "active" sessions. This patch should be backported on all stable version since proto peers v2.	2019-04-03 14:42:10 +02:00
Fr�d�ric L�caille	b7405c1c50	BUG/MINOR: peers: Missing initializations after peer session shutdown. This patch fixes a bug introduced by `045e0d4` commit where it was really a bad idea to reset the peer applet context before shutting down the underlying session. This had as side effect to cancel the re-initializations done by peer_session_release(), especially prevented this function from re-initializing the current table pointer which is there to force annoucement of stick-table definitions on when reconnecting. Consequently the peers could send stick-table update messages without a first stick-table definition message. As this is forbidden, this leaded the remote peers to close the sessions.	2019-03-27 15:16:25 +01:00
Fr�d�ric L�caille	54bff83f43	CLEANUP: peers: replace timeout constants by macros. This adds two macros PEER_RESYNC_TIMEOUT and PEER_RECONNECT_TIMEOUT both set to 5 seconds in order to remove magic timeouts which appear in the code.	2019-03-26 10:54:06 +01:00
Fr�d�ric L�caille	aba44a2abc	CLEANUP: peers: remove useless annoying tabulations. There were tabs in between macro names and their values in their definition, forcing everyone to do the same, and causing some mangling in patches. Let's fix all this.	2019-03-26 10:53:09 +01:00
Fr�d�ric L�caille	045e0d4b3b	BUG/MINOR: peers: Really close the sessions with no heartbeat. `645635d` commit was not sufficient to implement the heartbeat feature. When no heartbeat was received before its timeout has expired the session was not closed due to the fact that process_peer_sync() which is the task responsible of handling the heartbeat and session expirations only checked the heartbeat timeout, and sent a heartbeat message if it has expired. This has as side effect to leave the session opened. On the remote side, a peer which receives a heartbeat message, even if not supported, does not close the session. Furthermore it not sufficient to update ->reconnect peer member field to schedule a peer session release. With this patch, a peer is flagged as alive as soon as it received peer protocol messages (and not only heartbeat messages). When no updates must be sent, we first check the reconnection timeout (->reconnect peer member field). If expired, we really shutdown the session if the peer is not alive, but if the peer seen as alive, we reset this flag and update the ->reconnect for the next period. If the reconnection timeout has not expired, then we check the heartbeat timeout which is there only to emit heartbeat messages upon expirations. If expired, as before this patch we increment the heartbeat timeout by 3s to schedule the next heartbeat message then we emit a heartbeat message waking up the peer I/O handler. In every cases we update the task expiration to the earlier time between the reconnection time and the heartbeat timeout time so that to be sure to check again these two ->reconnect and ->heartbeat timers.	2019-03-26 10:51:12 +01:00
Olivier Houchard	ed87989ab5	MEDIUM: peers: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Fr�d�ric L�caille	645635da84	MINOR: peers: Add a message for heartbeat. This patch implements peer heartbeat feature to prevent any haproxy peer from reconnecting too often, consuming sockets for nothing. To do so, we add PEER_MSG_CTRL_HEARTBEAT new message to PEER_MSG_CLASS_CONTROL peers control class of messages. A ->heartbeat field is added to peer structs to store the heatbeat timeout value which is handled by the same function as for ->reconnect to control the session timeouts. A 2-bytes heartbeat message is sent every 3s when no updates have to be sent. This way, the peer which receives such a message is sure the remote peer is still alive. So, it resets the ->reconnect peer session timeout to its initial value (5s). This prevents any reconnection to an already connected alive peer.	2019-03-01 09:33:26 +01:00
Willy Tarreau	a8cf66bcab	MINOR: listener: do not needlessly set l->maxconn It's pointless to always set and maintain l->maxconn because the accept loop already enforces the frontend's limit anyway. Thus let's stop setting this value by default and keep it to zero meaning "no limit". This way the frontend's maxconn will be used by default. Of course if a value is set, it will be enforced.	2019-02-28 17:05:32 +01:00
Willy Tarreau	9bdd7bc63d	BUILD/MINOR: peers: remove an impossible null test in intencode() intencode() tests for the nullity of the target pointer passed in argument, but the code calling intencode() never does so and happily dereferences it. gcc at -O3 detects this as a potential null deref. Let's remove this incorrect and misleading test. If this pointer was null, the code would already crash in the calling functions. This must be backported to stable versions.	2019-02-12 11:59:35 +01:00
Olivier Houchard	ef60ff38fb	BUG/MEDIUM: peers: Handle mux creation failure. If the mux fails to properly be created by conn_install_mux, fail, instead of silently ignoring it. This should be backported to 1.9.	2019-01-29 19:47:20 +01:00
Willy Tarreau	6254a9257e	BUILD/MINOR: peers: shut up a build warning introduced during last cleanup A new warning appears when building at -O0 since commit `3f0fb9df6` ("MINOR: peers: move "hello" message treatment code to reduce the size of the I/O handler."), it is related to the fact that proto_len is initialized from strlen() which is not a constant. Let's replace it with sizeof-1 instead and also mark the variable as static since it's useless outside of the file.	2019-01-29 17:45:23 +01:00
Willy Tarreau	6f731f33ac	CLEANUP: peers: factor error handling in peer_treat_definedmsg() This is a trivial code refactoring of similar parsing error code under a single label.	2019-01-29 11:11:23 +01:00
Willy Tarreau	1e82a14c34	CLEANUP: peers: factor the error handling code in peer_treet_updatemsg() The error handling code was extremely repetitive and error-prone due to the numerous copy-pastes, some involving unlocks or free. Let's factor this out. The code could be further simplified, but 12 locations were already cleaned without taking risks.	2019-01-29 11:08:06 +01:00
Fr�d�ric L�caille	4b2fd9bf71	MINOR: peers: move peer initializations code to reduce the size of the I/O handler. Implements two new functions to init peer flags and other stuff after having accepted or connected them with the peer I/O handler so that to reduce its size. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	3f0fb9df6c	MINOR: peers: move "hello" message treatment code to reduce the size of the I/O handler. This patch implements three functions to read and parse the three line of a "hello" peer protocol message so that to call them from the peer I/O handler and reduce its size. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	be825e5c05	CLEANUP: peers: Remove useless statements. When implementing peer_recv_msg() we added the statements reached with a "goto imcomplete" at the end of this function. This statements are executed only when co_getblk() returns something <0. So they are useless for now on, and may be safely removed. The following section wich was responsible of sending any peer protocol messages were reached only when co_getblk() returned 0 (no more message to read). In this case we replace the "goto impcomplete" statement by a "goto send_msgs" to reach this only when peer_recv_msg() returns 0. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	25e1d5e435	MINOR: peers: move send code to reduce the size of the I/O handler. This patch extracts the code responsible of sending peer protocol messages from the peer I/O handler to create a new function and to reduce the size of this handler. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	444243c62c	MINOR: peers: move messages treatment code to reduce the size of the I/O handler. Extract the code of the peer I/O handler responsible of treating any peer protocol message to create peer_treat_awaited_msg() function. Also rename peer_recv_updatemsg() to peer_treat_updatemsg() as this function only parse a stick-table update message already received by peer_recv_msg(). May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	7d0ceeec80	MINOR: peers: move error handling to reduce the size of the I/O handler. Implement new functions to send error and control class stick-table messages. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	d5fe14bb96	CLEANUP: peers: Be more generic. Make usage of a C union to pass parameters to all the peer_prepare_*() functions (more readable). May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	95203f2185	MINOR: peers: Move high level receive code to reduce the size of I/O handler. Implement a new function to read incoming stick-table messages. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	d27b09400c	MINOR: peers: Move ack, switch and definition receive code to reduce the size of the I/O handler. Implement three new functions to treat peer acks, switch and definition messages extracting the code from the big swich-case of the peer I/O handler to give more chances to this latter to be readable. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	168a34b45f	MINOR: peers: Move update receive code to reduce the size of the I/O handler. This patch implements a new function to treat the stick-table update messages so that to reduce the size of the peer I/O handler by ~200 lines. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	6a8303d49e	MEDIUM: peers: synchronizaiton code factorization to reduce the size of the I/O handler. Factorize the code responsible of synchronizing the peers upon startup. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	87f554c9fb	MINOR: peers: Add new functions to send code and reduce the I/O handler. This patch reduces the size of the peer I/O handler implementing a new function named peer_send_updatemsg() which uses the already implement peer_prepare_updatemsg(), then ci_putblk(). Reuse the code used to implement peer_send_(ack\|swith)msg() function especially the more generic function peer_send_msg(). May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	ec44ea8692	MINOR: peers: send code factorization. Implements peer_send_msg() functions for switch and ack messages which call the already defined peer_prepare_msg() before calling ci_putblk(). These two new functions are used at three places in the peer_io_handler(). May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	a8725ec372	CLEANUP: peers: Indentation fixes. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	ce02557aad	MINOR: peers: Extract some code to be reused. May be backported as far as 1.5.	2019-01-29 10:29:54 +01:00
Fr�d�ric L�caille	1055e687a2	MINOR: peers: Make outgoing connection to SSL/TLS peers work. This patch adds pointer to a struct server to peer structure which is initialized after having parsed a remote "peer" line. After having parsed all peers section we run ->prepare_srv to initialize all SSL/TLS stuff of remote perr (or server). Remaining thing to do to completely support peer protocol over SSL/TLS: make "bind" keyword be supported in "peers" sections to make SSL/TLS incoming connections to local peers work. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Fr�d�ric L�caille	c06b5d4f74	MINOR: cfgparse: Make "peer" lines be parsed as "server" lines. With this patch "default-server" lines are supported in "peers" sections to setup the default settings of peers which are from now setup when parsing both "peer" and "server" lines. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Olivier Houchard	f502aca5c2	MEDIUM: mux: provide the session to the init() and attach() method. Instead of trying to get the session from the connection, which is not always there, and of course there could be multiple sessions per connection, provide it with the init() and attach() methods, so that we know the session for each outgoing stream.	2018-12-15 23:50:09 +01:00
Joseph Herlant	82b2f54d4c	CLEANUP: Fix typos in the peers subsystem Fix some typos in the code comments of the peers subsystem.	2018-11-18 22:26:42 +01:00
Willy Tarreau	db398435aa	MINOR: stream-int: replace si_cant_put() with si_rx_room_{blk,rdy}() Remaining calls to si_cant_put() were all for lack of room and were turned to si_rx_room_blk(). A few places where SI_FL_RXBLK_ROOM was cleared by hand were converted to si_rx_room_rdy(). The now unused si_cant_put() function was removed.	2018-11-18 21:41:50 +01:00
Willy Tarreau	0cd3bd628a	MINOR: stream-int: rename si_applet_{want\|stop\|cant}_{get\|put} It doesn't make sense to limit this code to applets, as any stream interface can use it. Let's rename it by simply dropping the "applet_" part of the name. No other change was made except updating the comments.	2018-11-11 10:18:37 +01:00
Willy Tarreau	2d372c2aa1	MINOR: stats: report the number of currently connected peers The active peers output indicates both the number of established peers connections and the number of peers connection attempts. The new counter "ConnectedPeers" also indicates the number of currently connected peers. This helps detect that some peers cannot be reached for example. It's worth mentioning that this value changes over time because unused peers are often disconnected and reconnected. Most of the time it should be equal to ActivePeers.	2018-11-05 17:15:21 +01:00
Willy Tarreau	199ad24661	MINOR: stats: report the number of active peers in "show info" Peers are the last type of activity which can maintain a job present, so it's important to report that such an entity is still active to explain why the job count may be higher than zero. Here by "ActivePeers" we report peers sessions, which include both established connections and outgoing connection attempts.	2018-11-05 17:15:21 +01:00
Willy Tarreau	086735a688	BUG/MINOR: tasks: make sure wakeup events are properly reported to subscribers The tasks API was changed in 1.9-dev1 with commit `9f6af3322` ("MINOR: tasks: Change the task API so that the callback takes 3 arguments."), causing the task's state not to be usable anymore and to have been replaced with an explicit argument in the callee. The task's state doesn't contain any trace of the wakeup cause anymore. But there were two places where the old task's state remained in use : - sessions, used to more accurately report timeouts in logs when seeing TASK_WOKEN_TIMEOUT ; - peers, used to finish resynchronization when seeing TASK_WOKEN_SIGNAL This commit fixes both occurrences by making sure we don't access task->state directly (should we rename it by the way ?). No backport is needed.	2018-11-05 17:15:21 +01:00
Olivier Houchard	33992267aa	MINOR: peers: use defines instead of enums to appease clang. Clang (rightfully) warns that we're trying to set chars to values >= 128. Use defines with hex values instead of an enum to address this.	2018-10-16 19:31:15 +02:00
Willy Tarreau	d944344f01	BUILD: peers: check allocation error during peers_init_sync() peers_init_sync() doesn't check task_new()'s return value and doesn't return any result to indicate success or failure. Let's make it return an int and check it from the caller. This can be backported as far as 1.6.	2018-10-15 13:24:43 +02:00
Willy Tarreau	175a2bb507	MINOR: connection: pass the proxy when creating a connection Till now it was very difficult for a mux to know what proxy it was working for. Let's pass the proxy when the mux is instanciated at init() time. It's not yet used but the H1 mux will definitely need it, just like the H2 mux when dealing with backend connections.	2018-09-12 17:39:22 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	be373150c7	MINOR: connection: make the initialization more consistent Sometimes a connection is prepared before the target is set, sometimes after. There's no real rule since the few functions involved operate on different and independent fields. Soon we'll benefit from knowing the target at the connection layer, in order to figure the associated proxy and retrieve the various parameters (timeouts etc). This patch slightly reorders a few calls to conn_prepare() so that we can make sure that the target is always known to the mux.	2018-09-06 11:45:30 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Willy Tarreau	c9fa0480af	MAJOR: buffer: finalize buffer detachment Now the buffers only contain the header and a pointer to the storage area which can be anywhere. This will significantly simplify buffer swapping and will make it possible to map chunks on buffers as well. The buf_empty variable was removed, as now it's enough to have size==0 and area==NULL to designate the empty buffer (thus a non-allocated head is the empty buffer by default). buf_wanted for now is indicated by size==0 and area==(void *)1. The channels and the checks now embed the buffer's head, and the only pointer is to the storage area. This slightly increases the unallocated buffer size (3 extra ints for the empty buffer) but considerably simplifies dynamic buffer management. It will also later permit to detach unused checks. The way the struct buffer is arranged has proven quite efficient on a number of tests, which makes sense given that size is always accessed and often first, followed by the othe ones.	2018-07-19 16:23:43 +02:00
Olivier Houchard	9f6af33222	MINOR: tasks: Change the task API so that the callback takes 3 arguments. In preparation for thread-specific runqueues, change the task API so that the callback takes 3 arguments, the task itself, the context, and the state, those were retrieved from the task before. This will allow these elements to change atomically in the scheduler while the application uses the copied value, and even to have NULL tasks later.	2018-05-26 19:23:57 +02:00
Emeric Brun	5548291395	BUG/MEDIUM: peers: fix expire date wasn't updated if entry is modified remotely. The stktable_touch_remote considers the expire field stored in the stksess struct. The expire field was updated on the a newly created stksess to store. But if the stksess with a same key is still present the expire was not updated. This patch postpones the update of the expire field of the stksess just before processing the "touch". These bug was introduced in commit: MEDIUM: threads/stick-tables: handle multithreads on stick tables. And the fix should be backported on 1.8.	2018-01-22 16:03:25 +01:00
Willy Tarreau	dbd026792a	BUG/MEDIUM: peers: set NOLINGER on the outgoing stream interface Since peers were ported to an applet in 1.5, an issue appeared which is that certain attempts to close an outgoing connection are a bit "too nice". Specifically, protocol errors and stream timeouts result in a clean shutdown to be sent, waiting for the other side to confirm. This is particularly problematic in the case of timeouts since by definition the other side will not confirm as it has disappeared. As found by Fred, this issue was further emphasized in 1.8 by commit `f9ce57e` ("MEDIUM: connection: make conn_sock_shutw() aware of lingering") which causes clean shutdowns not to be sent if the fd is marked as linger_risk, because now even a clean timeout will not be sent on an idle peers session, and the other one will have nothing to respond to. The solution here is to set NOLINGER on the outgoing stream interface to ensure we always close whenever we attempt a simple shutdown. However it is important to keep in mind that this also underlines some weaknesses of the shutr/shutw processing inside process_stream() and that all this part needs to be reworked to clearly consider the abort case, and to stop the confusion between linger_risk and NOLINGER. This fix needs to be backported as far as 1.5 (all versions are affected). However, during testing of the backport it was found that 1.5 never tries to close the peers connection on timeout, so it suffers for another issue.	2017-12-06 17:48:36 +01:00
Emeric Brun	088c9b73ca	BUG/MAJOR: thread/peers: fix deadlock on peers sync. Table lock was not released on an error path (if there is no enough room to write table switch message). [wt: needs to be backported to 1.8]	2017-12-01 15:06:43 +01:00
Willy Tarreau	cea8537efd	BUG/MEDIUM: threads/peers: decrement, not increment jobs on quitting Commit `8d8aa0d` ("MEDIUM: threads/listeners: Make listeners thread-safe") mistakenly placed HA_ATOMIC_ADD(job, 1) to replace a job--, so it maintains the job count too high preventing the process from cleanly exiting on reload. This needs to be backported to 1.8.	2017-11-29 14:51:20 +01:00
Willy Tarreau	bafbe01028	CLEANUP: pools: rename all pool functions and pointers to remove this "2" During the migration to the second version of the pools, the new functions and pool pointers were all called "pool_something2()" and "pool2_something". Now there's no more pool v1 code and it's a real pain to still have to deal with this. Let's clean this up now by removing the "2" everywhere, and by renaming the pool heads "pool_head_something".	2017-11-24 17:49:53 +01:00
Christopher Faulet	767a84bcc0	CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning	2017-11-24 17:19:12 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This remove any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Emeric Brun	f2fc1fda80	BUG/MINOR: freq: fix infinite loop on freq_ctr_period. Using peers or stick table we could update an freq_ctr using a tick value with the first bit set but this bit is reserved for lock since multithreading support.	2017-11-02 18:09:58 +01:00
Olivier Houchard	9aaf778129	MAJOR: connection : Split struct connection into struct connection and struct conn_stream. All the references to connections in the data path from streams and stream_interfaces were changed to use conn_streams. Most functions named "something_conn" were renamed to "something_cs" for this. Sometimes the connection still is what matters (eg during a connection establishment) and were not always renamed. The change is significant and minimal at the same time, and was quite thoroughly tested now. As of this patch, all accesses to the connection from upper layers go through the pass-through mux.	2017-10-31 18:03:23 +01:00
Willy Tarreau	53a4766e40	MEDIUM: connection: start to introduce a mux layer between xprt and data For HTTP/2 and QUIC, we'll need to deal with multiplexed streams inside a connection. After quite a long brainstorming, it appears that the connection interface to the existing streams is appropriate just like the connection interface to the lower layers. In fact we need to have the mux layer in the middle of the connection, between the transport and the data layer. A mux can exist on two directions/sides. On the inbound direction, it instanciates new streams from incoming connections, while on the outbound direction it muxes streams into outgoing connections. The difference is visible on the mux->init() call : in one case, an upper context is already known (outgoing connection), and in the other case, the upper context is not yet known (incoming connection) and will have to be allocated by the mux. The session doesn't have to create the new streams anymore, as this is performed by the mux itself. This patch introduces this and creates a pass-through mux called "mux_pt" which is used for all new connections and which only calls the data layer's recv,send,wake() calls. One incoming stream is immediately created when init() is called on the inbound direction. There should not be any visible impact. Note that the connection's mux is purposely not set until the session is completed so that we don't accidently run with the wrong mux. This must not cause any issue as the xprt_done_cb function is always called prior to using mux's recv/send functions.	2017-10-31 18:03:23 +01:00
Emeric Brun	80527f5bb6	MAJOR: threads/peers: Make peers thread safe A lock is used to protect accesses to a peer structure. A the lock is taken in the applet handler when the peer is identified and released living the applet handler. In the scheduling task for peers section, the lock is taken for every listed peer and released at the end of the process task function. The peer 'force shutdown' function was also re-worked.	2017-10-31 13:58:31 +01:00
Emeric Brun	1138fd0c57	MAJOR: threads/applet: Handle multithreading for applets A global lock has been added to protect accesses to the list of active applets. A process mask has also been added on each applet. Like for FDs and tasks, it is used to know which threads are allowed to process an applet. Because applets are, most of time, linked to a session, it should be sticky on the same thread. But in all cases, it is the responsibility of the applet handler to lock what have to be protected in the applet context.	2017-10-31 13:58:31 +01:00
Emeric Brun	819fc6f563	MEDIUM: threads/stick-tables: handle multithreads on stick tables The stick table API was slightly reworked: A global spin lock on stick table was added to perform lookup and insert in a thread safe way. The handling of refcount on entries is now handled directly by stick tables functions under protection of this lock and was removed from the code of callers. The "stktable_store" function is no more externalized and users should now use "stktable_set_entry" in any case of insertion. This last one performs a lookup followed by a store if not found. So the code using "stktable_store" was re-worked. Lookup, and set_entry functions automatically increase the refcount of the returned/stored entry. The function "sticktable_touch" was renamed "sticktable_touch_local" and is now able to decrease the refcount if last arg is set to true. It is allowing to release the entry without taking the lock twice. A new function "sticktable_touch_remote" is now used to insert entries coming from remote peers at the right place in the update tree. The code of peer update was re-worked to use this new function. This function is also able to decrease the refcount if wanted. The function "stksess_kill" also handle a parameter to decrease the refcount on the entry. A read/write lock is added on each entry to protect the data content updates of the entry.	2017-10-31 13:58:31 +01:00
Christopher Faulet	8d8aa0d681	MEDIUM: threads/listeners: Make listeners thread-safe First, we use atomic operations to update jobs/totalconn/actconn variables, listener's nbconn variable and listener's counters. Then we add a lock on listeners to protect access to their information. And finally, listener queues (global and per proxy) are also protected by a lock. Here, because access to these queues are unusal, we use the same lock for all queues instead of a global one for the global queue and a lock per proxy for others.	2017-10-31 13:58:30 +01:00
Emeric Brun	c60def8368	MAJOR: threads/task: handle multithread on task scheduler 2 global locks have been added to protect, respectively, the run queue and the wait queue. And a process mask has been added on each task. Like for FDs, this mask is used to know which threads are allowed to process a task. For many tasks, all threads are granted. And this must be your first intension when you create a new task, else you have a good reason to make a task sticky on some threads. This is then the responsibility to the process callback to lock what have to be locked in the task context. Nevertheless, all tasks linked to a session must be sticky on the thread creating the session. It is important that I/O handlers processing session FDs and these tasks run on the same thread to avoid conflicts.	2017-10-31 13:58:30 +01:00
Willy Tarreau	06d80a9a9c	REORG: channel: finally rename the last bi_* / bo_* functions For HTTP/2 we'll need some buffer-only equivalent functions to some of the ones applying to channels and still squatting the bi_* / bo_* namespace. Since these names have kept being misleading for quite some time now and are really getting annoying, it's time to rename them. This commit will use "ci/co" as the prefix (for "channel in", "channel out") instead of "bi/bo". The following ones were renamed : bi_getblk_nc, bi_getline_nc, bi_putblk, bi_putchr, bo_getblk, bo_getblk_nc, bo_getline, bo_getline_nc, bo_inject, bi_putchk, bi_putstr, bo_getchr, bo_skip, bi_swpbuf	2017-10-19 15:01:08 +02:00
Willy Tarreau	bf08beb2a3	MINOR: session: remove the list of streams from struct session Commit `bcb86ab` ("MINOR: session: add a streams field to the session struct") added this list of streams that is not needed anymore. Let's get rid of it now.	2017-10-08 22:32:05 +02:00
Willy Tarreau	0bf6fa5e40	MEDIUM: session: count the frontend's connections at a single place There are several places where we see feconn++, feconn--, totalconn++ and an increment on the frontend's number of connections and connection rate. This is done exactly once per session in each direction, so better take care of this counter in the session and simplify the callers. At least it ensures a better symmetry. It also ensures consistency as till now the lua/spoe/peers frontend didn't have these counters properly set, which can be useful at least for troubleshooting.	2017-09-15 11:49:52 +02:00
Willy Tarreau	6f5e4b98df	MEDIUM: session: take care of incrementing/decrementing jobs Each user of a session increments/decrements the jobs variable at its own place, resulting in a real mess and inconsistencies between them. Let's have session_new() increment jobs and session_free() decrement it.	2017-09-15 11:49:52 +02:00
Willy Tarreau	04b9286933	MINOR: peers: don't reference the incoming listener on outgoing connections Since v1.7 it's pointless to reference a listener when greating a session for an outgoing connection, it only complicates the code. SPOE and Lua were cleaned up in 1.8-dev1 but the peers code was forgotten. This patch fixes this by not assigning such a listener for outgoing connections. It also has the extra benefit of not discounting the outgoing connections from the number of allowed incoming connections (the code currently adds a safety marging of 3 extra connections to take care of this).	2017-09-15 11:01:04 +02:00
Willy Tarreau	87787acf72	MEDIUM: stream: make stream_new() allocate its own task Currently a task is allocated in session_new() and serves two purposes : - either the handshake is complete and it is offered to the stream via the second arg of stream_new() - or the handshake is not complete and it's diverted to be used as a timeout handler for the embryonic session and repurposed once we land into conn_complete_session() Furthermore, the task's process() function was taken from the listener's handler in conn_complete_session() prior to being replaced by a call to stream_new(). This will become a serious mess with the mux. Since it's impossible to have a stream without a task, this patch removes the second arg from stream_new() and make this function allocate its own task. In session_accept_fd(), we now only allocate the task if needed for the embryonic session and delete it later.	2017-08-30 07:05:04 +02:00
Willy Tarreau	2bfd35885e	MINOR: stream: link the stream to its session Now each stream is added to the session's list of streams, so that it will be possible to know all the streams belonging to a session, and to know if any stream is still attached to a sessoin.	2017-08-18 13:26:35 +02:00
Willy Tarreau	f1d33db10a	CLEANUP: task: remove all initializations to TICK_ETERNITY after task_new() This is now guaranteed by design, simply remove these unneeded parts to avoid confusion.	2017-07-24 17:55:20 +02:00
Fr�d�ric L�caille	ed2b4a6b79	BUG/MINOR: peers: peer synchronization issue (with several peers sections). When several stick-tables were configured with several peers sections, only a part of them could be synchronized: the ones attached to the last parsed 'peers' section. This was due to the fact that, at least, the peer I/O handler refered to the wrong peer section list, in fact always the same: the last one parsed. The fact that the global peer section list was named "struct peers *peers" lead to this issue. This variable name is dangerous ;). So this patch renames global 'peers' variable to 'cfg_peers' to ensure that no such wrong references are still in use, then all the functions wich used old 'peers' variable have been modified to refer to the correct peer list. Must be backported to 1.6 and 1.7.	2017-07-13 09:39:29 +02:00
Fr�d�ric L�caille	37a72546f6	MINOR: peers: Add additional information to stick-table definition messages. With this patch additional information are added to stick-table definition messages so that to make external application capable of learning peer stick-table configurations. First stick-table entries duration is added followed by the frequency counters type IDs and values. May be backported to 1.7 and 1.6.	2017-07-07 10:24:44 +02:00
Emeric Brun	5f77fef34e	MINOR: task/stream: tasks related to a stream must be init by the caller. The task_wakeup was called on stream_new, but the task/stream wasn't fully initialized yet. The task_wakeup must be called explicitly by the caller once the task/stream is initialized.	2017-06-27 14:38:02 +02:00
Fr�d�ric L�caille	5d6e5f86c5	BUG/MINOR: Wrong peer task expiration handling during synchronization processing. When a peer task has sent a synchronization request to remote peers its next expiration date was updated based on a resynchronization timeout value which itself may have already expired leading the underlying poller to wait for 0ms during a fraction of second (consuming high CPU resources). With this patch we update such peer task expiration dates only if the resynchronization timeout is not already expired. Thanks to Patrick Hemmer who reported an issue with nice traces which helped in finding this one. This patch may be backported to 1.7 and 1.6.	2017-06-21 11:19:50 +02:00
Fr�d�ric L�caille	5df119008a	BUG/MEDIUM: peers: Peers CLOSE_WAIT issue. A peer session which has just been created upon reconnect timeout expirations, could be right after shutdown (at peer session level) because the remote side peer could also righ after have connected. In such a case the underlying TCP session was still running (connect()/accept()) and finally left in CLOSE_WAIT state after the remote side stopped writting (shutdown(SHUT_WR)). Now on, with this patch we never shutdown such peer sessions wich have just been created. We leave them connect to the remote peer which is already connected and must shutdown its own peer session. Thanks to Patric Hemmer and Yves Lafon at w3.org for reporting this issue, and for having tested this patch on the field. Thanks also to Willy and Yelp blogs which helped me a lot in fixing it (see https://www.haproxy.com/blog/truly-seamless-reloads-with-haproxy-no-more-hacks/ and https://engineeringblog.yelp.com/2015/04/true-zero-downtime-haproxy-reloads.htmll).	2017-06-15 10:47:40 +02:00
Emeric Brun	18928af3a3	BUG/MEDIUM: peers: fix buffer overflow control in intdecode. A buffer overflow could happen if an integer is badly encoded in the data part of a msg received from a peer. It should not happen with authenticated peers (the handshake do not use this function). This patch makes the code of the 'intdecode' function more robust. It also adds some comments about the intencode function. This bug affects versions >= 1.6.	2017-03-30 12:12:46 +02:00
Willy Tarreau	c95bad5013	MEDIUM: move listener->frontend to bind_conf->frontend Historically, all listeners have a pointer to the frontend. But since the introduction of SSL, we now have an intermediary layer called bind_conf corresponding to a "bind" line. It makes no sense to have the frontend on each listener given that it's the same for all listeners belonging to a same bind_conf. Also certain parts like SSL can only operate on bind_conf and need the frontend. This patch fixes this by moving the frontend pointer from the listener to the bind_conf. The extra indirection is quite cheap given and the places were this is used are very scarce.	2016-12-22 23:26:38 +01:00
Christopher Faulet	a73e59b690	BUG/MAJOR: Fix how the list of entities waiting for a buffer is handled When an entity tries to get a buffer, if it cannot be allocted, for example because the number of buffers which may be allocated per process is limited, this entity is added in a list (called <buffer_wq>) and wait for an available buffer. Historically, the <buffer_wq> list was logically attached to streams because it were the only entities likely to be added in it. Now, applets can also be waiting for a free buffer. And with filters, we could imagine to have more other entities waiting for a buffer. So it make sense to have a generic list. Anyway, with the current design there is a bug. When an applet failed to get a buffer, it will wait. But we add the stream attached to the applet in <buffer_wq>, instead of the applet itself. So when a buffer is available, we wake up the stream and not the waiting applet. So, it is possible to have waiting applets and never awakened. So, now, <buffer_wq> is independant from streams. And we really add the waiting entity in <buffer_wq>. To be generic, the entity is responsible to define the callback used to awaken it. In addition, applets will still request an input buffer when they become active. But they will not be sleeped anymore if no buffer are available. So this is the responsibility to the applet I/O handler to check if this buffer is allocated or not. This way, an applet can decide if this buffer is required or not and can do additional processing if not. [wt: backport to 1.7 and 1.6]	2016-12-12 19:11:04 +01:00
Willy Tarreau	9df94c2b25	MINOR: peers: remove the pointer to the stream There's no reason to use the stream anymore, only the appctx should be used by a peer. This was a leftover from the migration to appctx and it caused some confusion, so let's totally drop it now. Note that half of the patch are just comment updates.	2016-10-31 20:07:01 +01:00
Willy Tarreau	81bc3b062b	MINOR: peers: make peer_session_forceshutdown() use the appctx and not the stream It was inherited from initial code but we must only manipulate the appctx and never the stream, otherwise we always risk shooting ourselves in the foot.	2016-10-31 20:07:01 +01:00
Willy Tarreau	b21d08e249	BUG/MEDIUM: peers: fix use after free in peer_session_create() In case of resource allocation error, peer_session_create() frees everything allocated and returns a pointer to the stream/session that was put back into the free pool. This stream/session is then assigned to ps->{stream,session} with no error control. This means that it is perfectly possible to have a new stream or session being both used for a regular communication and for a peer at the same time. In fact it is the only way (for now) to explain a CLOSE_WAIT on peers connections that was caught in this dump with the stream interface in SI_ST_CON state while the error field proves the state ought to have been SI_ST_DIS, very likely indicating two concurrent accesses on the same area : 0x7dbd50: [31/Oct/2016:17:53:41.267510] id=0 proto=tcpv4 flags=0x23006, conn_retries=0, srv_conn=(nil), pend_pos=(nil) frontend=myhost2 (id=4294967295 mode=tcp), listener=? (id=0) backend=<NONE> (id=-1 mode=-) addr=127.0.0.1:41432 server=<NONE> (id=-1) addr=127.0.0.1:8521 task=0x7dbcd8 (state=0x08 nice=0 calls=2 exp=<NEVER> age=1m5s) si[0]=0x7dbf48 (state=CLO flags=0x4040 endp0=APPCTX:0x7d99c8 exp=<NEVER>, et=0x000) si[1]=0x7dbf68 (state=CON flags=0x50 endp1=CONN:0x7dc0b8 exp=<NEVER>, et=0x020) app0=0x7d99c8 st0=11 st1=0 st2=0 applet=<PEER> co1=0x7dc0b8 ctrl=tcpv4 xprt=RAW data=STRM target=PROXY:0x7fe62028a010 flags=0x0020b310 fd=7 fd.state=22 fd.cache=0 updt=0 req=0x7dbd60 (f=0x80a020 an=0x0 pipe=0 tofwd=0 total=0) an_exp=<NEVER> rex=<NEVER> wex=<NEVER> buf=0x78a3c0 data=0x78a3d4 o=0 p=0 req.next=0 i=0 size=0 res=0x7dbda0 (f=0x80402020 an=0x0 pipe=0 tofwd=0 total=0) an_exp=<NEVER> rex=<NEVER> wex=<NEVER> buf=0x78a3c0 data=0x78a3d4 o=0 p=0 rsp.next=0 i=0 size=0 Special thanks to Arnaud Gavara who provided lots of valuable input and ran some validation testing on this patch. This fix must be backported to 1.6 and 1.5. Note that in 1.5 the session is not assigned from within the function so some extra checks may be needed in the callers.	2016-10-31 20:02:05 +01:00
Willy Tarreau	78c0c50705	BUG/MEDIUM: peers: on shutdown, wake up the appctx, not the stream This part was missed when peers were ported to the new applet infrastructure in 1.6, the main stream is woken up instead of the appctx. This creates a race condition by which it is possible to wake the stream at the wrong moment and miss an event. This bug might be at least partially responsible for some of the CLOSE_WAIT that were reported on peers session upon reload in version 1.6. This fix must be backported to 1.6.	2016-10-31 20:01:42 +01:00
Fr�d�ric L�caille	523cc9e858	MEDIUM: peers: Fix a peer stick-tables synchronization issue. During the stick-table teaching process which occurs at reloading/restart time, expiration dates of stick-tables entries were not synchronized between peers. This patch adds two new stick-table messages to provide such a synchronization feature. As these new messages are not supported by older haproxy peers protocol versions, this patch increments peers protol version, from 2.0 to 2.1, to help in detecting/supporting such older peers protocol implementations so that new versions might still be able to transparently communicate with a newer one. [wt: technically speaking it would be nice to have this backported into 1.6 as some people who reload often are affected by this design limitation, but it's not a totally transparent change that may make certain users feel reluctant to upgrade older versions. Let's let it cook in 1.7 first and decide later]	2016-10-17 19:44:35 +02:00
Emeric Brun	597b26e432	BUG/MINOR: peers: empty chunks after a resync. After pushing the last update related to a resync process, the teacher resets the re-connection's origin to the saved one (pointer position when he receive the resync request). But the last acknowledgement overwrites this pointer to an inconsistent value. In peersv2, it results with empty table chunks regularly pushed. The fix consist to move the confirm code to assure that the confirm message is always sent after the last acknowledgement related to the resync. And to reset the re-connection's origin to the saved one when a confirm message is received. This bug affects versions 1.6 and superior. For older versions (peersv1), this inconsistent state should not generate side effects because chunks are not used and a next acknowlegement message resets the pointer in a valid state.	2016-08-14 11:47:05 +02:00
Emeric Brun	cc52274496	BUG/MINOR: peers: some updates are pushed twice after a resync. This bug is due to a copy/paste error and was introduced with peers-protocol v2: The last_pushed pointer was not correctly reset to the teaching_origin at the end of the teaching state but to the first update present in the tree. The result: some updates were re-pushed after leaving the teaching state. This fix needs to be backported to 1.6.	2016-08-10 17:42:13 +02:00
Fr�d�ric L�caille	22fc3203db	BUG/MINOR: peers: Fix peers data decoding issue This error led to truncated data after decoding upon receipt. It's specific to peers v2 and needs to be backported to 1.6.	2016-07-26 14:37:38 +02:00
Vincent Bernat	02779b6263	CLEANUP: uniformize last argument of malloc/calloc Instead of repeating the type of the LHS argument (sizeof(struct ...)) in calls to malloc/calloc, we directly use the pointer name (sizeof(...)). The following Coccinelle patch was used: @@ type T; T x; @@ x = malloc( - sizeof(T) + sizeof(x) ) @@ type T; T x; @@ x = calloc(1, - sizeof(T) + sizeof(*x) ) When the LHS is not just a variable name, no change is made. Moreover, the following patch was used to ensure that "1" is consistently used as a first argument of calloc, not the last one: @@ @@ calloc( + 1, ... - ,1 )	2016-04-03 14:17:42 +02:00
Vincent Bernat	3c2f2f207f	CLEANUP: remove unneeded casts In C89, "void " is automatically promoted to any pointer type. Casting the result of malloc/calloc to the type of the LHS variable is therefore unneeded. Most of this patch was built using this Coccinelle patch: @@ type T; @@ - (T ) (\(lua_touserdata\\|malloc\\|calloc\\|SSL_get_app_data\\|hlua_checkudata\\|lua_newuserdata\)(...)) @@ type T; T x; void data; @@ x = - (T ) data @@ type T; T x; T data; @@ x = - (T ) data Unfortunately, either Coccinelle or I is too limited to detect situation where a complex RHS expression is of type "void *" and therefore casting is not needed. Those cases were manually examined and corrected.	2016-04-03 14:17:42 +02:00
Willy Tarreau	3bb46177ac	BUG/MEDIUM: peers: fix incorrect age in frequency counters The frequency counters's window start is sent as "now - freq.date", which is a positive age compared to the current date. But on receipt, this age was added to the current date instead of subtracted. So since the date was always in the future, they were always expired if the activity changed side in less than the counter's measuring period (eg: 10s). This bug was reported by Christian Ruppert who also provided an easy reproducer. It needs to be backported to 1.6.	2016-03-25 18:17:47 +01:00
Emeric Brun	234fc3c31e	BUG/MEDIUM: peers: table entries learned from a remote are pushed to others after a random delay. New sticktable entries learned from a remote peer can be pushed to others after a random delay because they are not inserted at the right position in the updates tree.	2015-12-16 15:50:22 +01:00
Emeric Brun	b058f1c548	BUG/MINOR: fct peer_prepare_ackmsg should not use trash. function 'peer_prepare_ackmsg' is designed to use the argument 'msg' instead of 'trash.str'. There is currently no bug because the caller passes 'trash.str' in the 'msg' argument.	2015-09-22 16:07:34 +02:00
Emeric Brun	a6a0998529	BUG/MEDIUM: peers: same table updates re-pushed after a re-connect Some updates are pushed using an incremental update message after a re-connection whereas the origin is forgotten by the peer. These updates are never correctly acknowledged. So they are regularly re-pushed after an idle timeout and a re-connect. The fix consists to use an absolute update message in some cases.	2015-09-22 16:07:32 +02:00
Willy Tarreau	37bb7be09c	BUG/MAJOR: peers: fix a crash when stopping peers on unbound processes Pradeep Jindal reported and troubleshooted a bug causing haproxy to die during startup on all processes not making use of a peers section. It only happens with nbproc > 1 when peers are declared. Technically it's when the peers task is stopped on processes that don't use it that the crash occurred (a task_free() called on a NULL task pointer). This only affects peers v2 in the dev branch, no backport is needed.	2015-09-21 15:24:58 +02:00
Emeric Brun	b157d73beb	BUG/MAJOR: peers: fix current table pointer not re-initialized on session release. This bug causes malfunctions after re-connect. For instance the re-sync fails.	2015-08-21 14:24:32 +02:00
Emeric Brun	e1ab808ff8	BUG/MEDIUM: peers: fix wrong message id on stick table updates acknowledgement. The table definition message id was used instead of the update acknowledgement id. This bug causes a malformated message and a protocol error and breaks the connection. After that, the updates remain unacknowledged.	2015-08-21 14:24:17 +02:00
Thierry FOURNIER	5d24ebc3d7	MEDIUM: stick-tables: use the sample type names This patch removes the special stick tables types names and use the standard sample type names. This avoid the maintainance of two types and remove the switch/case for matching a sample type for each stick table type.	2015-08-20 17:13:47 +02:00
Emeric Brun	9490095abb	MEDIUM: peers: support of any stick-table data-types for sync It is possible to propagate entries of any data-types in stick-tables between several haproxy instances over TCP connections in a multi-master fashion. Each instance pushes its local updates and insertions to remote peers. The pushed values overwrite remote ones without aggregation. Interrupted exchanges are automatically detected and recovered from the last known point.	2015-06-16 16:11:59 +02:00
Emeric Brun	aaf5860fd6	MINOR: peers: avoid re-scheduling of pending stick-table's updates still not pushed.	2015-06-16 16:11:12 +02:00
Emeric Brun	b3971ab062	MAJOR: peers: peers protocol version 2.0 This patch does'nt add any new feature: the functional behavior is the same than version 1.0. Technical differences: In this version all updates on different stick tables are multiplexed on the same tcp session. There is only one established tcp session per peer whereas in first version there was one established tcp session per peer and per stick table. Messages format was reviewed to be more evolutive and to support further types of data exchange such as SSL sessions or other sticktable's data types (currently only the sticktable's server id is supported).	2015-05-29 15:50:33 +02:00
Willy Tarreau	b4e34da692	BUG/MEDIUM: peers: apply a random reconnection timeout Commit `9ff95bb` ("BUG/MEDIUM: peers: correctly configure the client timeout") uncovered an old bug in the peers : upon disconnect, we reconnect immediately. This sometimes results in both ends to do the same thing in parallel causing a loop of connect/accept/close/close that can last several seconds. The risk of occurrence of the trouble increases with latency, and is emphasized by the fact that idle connections are now frequently recycled (after 5s of idle). In order to avoid this we must apply a random delay before reconnecting. Fortunately the mechanism already supports a reconnect delay, so here we compute the random timeout when killing a session. The delay is 50ms plus a random between 0 and 2 seconds. Ideally an exponential back-off would be preferred but it's preferable to keep the fix simple. This bug was reported by Marco Corte. This fix must be backported to 1.5 since the fix above was backported into 1.5.12.	2015-05-20 10:49:07 +02:00
Willy Tarreau	0fca4835b2	MEDIUM: config: propagate the table's process list to the peers sections Now a peers section has its bind_proc set to the union of all those of its users.	2015-05-01 20:16:31 +02:00
Willy Tarreau	46dc1ca761	MEDIUM: peers: unregister peers that were never started The peers initialization sequence is a bit complex, they're attached to stick-tables and initialized very early in the boot process. When we fork, if some must not start, it's too late to find them. Instead, simply add a guard in their respective tasks to stop them once they want to start.	2015-05-01 20:16:31 +02:00

1 2 3 4 5 ...

392 Commits