haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-17 12:36:59 +02:00

Author	SHA1	Message	Date
Willy Tarreau	251c2aae06	CLEANUP: include: move sample_data out of sample-t.h The struct sample_data is used by pattern, map and vars, and currently requires to include sample-t which comes with many other dependencies. Let's move sample_data into its own file to shorten the dependency tree. This revealed a number of issues in adjacent files which were hidden by the fact that sample-t.h brought everything that was missing.	2020-06-11 10:18:59 +02:00
Willy Tarreau	4f663ec022	CLEANUP: include: don't include proxy-t.h in global-t.h We only need a forward declaration here to avoid embarking lots of files, and by just doing this we reduce the build size by 3.5%.	2020-06-11 10:18:59 +02:00
Willy Tarreau	d62af6abe4	CLEANUP: include: don't include stddef.h directly Directly including stddef.h in many files results in it being processed multiple times while it can be centralized in api-t.h and be guarded against multiple inclusions. Doing so reduces the number of preprocessed lines by 1200!	2020-06-11 10:18:59 +02:00
Willy Tarreau	bcc6733fab	REORG: check: extract the external checks from check.{c,h} The health check code is ugly enough, let's take the external checks out of it to simplify the code and shrink the file a little bit.	2020-06-11 10:18:58 +02:00
Willy Tarreau	d604ace940	REORG: check: move email_alert* from proxy-t.h to mailers-t.h These ones are specific to mailers and have nothing to do in proxy-t.h.	2020-06-11 10:18:58 +02:00
Willy Tarreau	51cd5956ee	REORG: check: move tcpchecks away from check.c Checks.c remains one of the largest files of the project and it contains too many things. The tcpchecks code represents half of this file, and both parts are relatively isolated, so let's move it away into its own file. We now have tcpcheck.c, tcpcheck{,-t}.h. Doing so required to export quite a number of functions because check.c has almost everything made static, which really doesn't help to split!	2020-06-11 10:18:58 +02:00
Willy Tarreau	cee013e4e0	REORG: check: move the e-mail alerting code to mailers.c check.c is one of the largest files and contains too many things. The e-mail alerting code is stored there while nothing is in mailers.c. Let's move this code out. That's only 4% of the code but a good start. In order to do so, a few tcp-check functions had to be exported.	2020-06-11 10:18:58 +02:00
Willy Tarreau	4f6535d734	CLEANUP: hpack: export debug functions and move inlines to .h When building contrib/hpack there is a warning about an unused static function. Actually it makes no sense to make it static, instead it must be regularly exported. Similarly there is hpack_dht_get_tail() which is inlined in the C file and which would make more sense with all other ones in the H file.	2020-06-11 10:18:58 +02:00
Willy Tarreau	6be7849f39	REORG: include: move cfgparse.h to haproxy/cfgparse.h There's no point splitting the file in two since only cfgparse uses the types defined there. A few call places were updated and cleaned up. All of them were in C files which register keywords. There is nothing left in common/ now so this directory must not be used anymore.	2020-06-11 10:18:58 +02:00
Willy Tarreau	dfd3de8826	REORG: include: move stream.h to haproxy/stream{,-t}.h This one was not easy because it was embarking many includes with it, which other files would automatically find. At least global.h, arg.h and tools.h were identified. 93 total locations were identified, 8 additional includes had to be added. In the rare files where it was possible to finalize the sorting of includes by adjusting only one or two extra lines, it was done. But all files would need to be rechecked and cleaned up now. It was the last set of files in types/ and proto/ and these directories must not be reused anymore.	2020-06-11 10:18:58 +02:00
Willy Tarreau	1e56f92693	REORG: include: move server.h to haproxy/server{,-t}.h extern struct dict server_name_dict was moved from the type file to the main file. A handful of inlined functions were moved at the bottom of the file. Call places were updated to use server-t.h when relevant, or to simply drop the entry when not needed.	2020-06-11 10:18:58 +02:00
Willy Tarreau	a55c45470f	REORG: include: move queue.h to haproxy/queue{,-t}.h Nothing outstanding here. A number of call places were not justified and removed.	2020-06-11 10:18:58 +02:00
Willy Tarreau	4980160ecc	REORG: include: move backend.h to haproxy/backend{,-t}.h The files remained mostly unchanged since they were OK. However, half of the users didn't need to include them, and about as many actually needed to have it and used to find functions like srv_currently_usable() through a long chain that broke when moving the file.	2020-06-11 10:18:58 +02:00
Willy Tarreau	6c58ab0304	REORG: include: move spoe.h to haproxy/spoe{,-t}.h Only minor change was to make sure all defines were before the structs in spoe-t.h, everything else went smoothly.	2020-06-11 10:18:58 +02:00
Willy Tarreau	a264d960f6	REORG: include: move proxy.h to haproxy/proxy{,-t}.h This one is particularly difficult to split because it provides all the functions used to manipulate a proxy state and to retrieve names or IDs for error reporting, and as such, it was included in 73 files (down to 68 after cleanup). It would deserve a small cleanup though the cut points are not obvious at the moment given the number of structs involved in the struct proxy itself.	2020-06-11 10:18:58 +02:00
Willy Tarreau	aeed4a85d6	REORG: include: move log.h to haproxy/log{,-t}.h The current state of the logging is a real mess. The main problem is that almost all files include log.h just in order to have access to the alert/warning functions like ha_alert() etc, and don't care about logs. But log.h also deals with real logging as well as log-format and depends on stream.h and various other things. As such it forces a few heavy files like stream.h to be loaded early and to hide missing dependencies depending where it's loaded. Among the missing ones is syslog.h which was often automatically included resulting in no less than 3 users missing it. Among 76 users, only 5 could be removed, and probably 70 don't need the full set of dependencies. A good approach would consist in splitting that file in 3 parts: - one for error output ("errors" ?). - one for log_format processing - and one for actual logging.	2020-06-11 10:18:58 +02:00
Willy Tarreau	c6599682d5	REORG: include: move fcgi-app.h to haproxy/fcgi-app{,-t}.h Only arg-t.h was missing from the types to get arg_list.	2020-06-11 10:18:58 +02:00
Willy Tarreau	c7babd8570	REORG: include: move filters.h to haproxy/filters{,-t}.h Just a minor change, moved the macro definitions upwards. A few caller files were updated since they didn't need to include it.	2020-06-11 10:18:58 +02:00
Willy Tarreau	c2b1ff04e5	REORG: include: move http_ana.h to haproxy/http_ana{,-t}.h It was moved without any change, however many callers didn't need it at all. This was a consequence of the split of proto_http.c into several parts that resulted in many locations to still reference it.	2020-06-11 10:18:58 +02:00
Willy Tarreau	f1d32c475c	REORG: include: move channel.h to haproxy/channel{,-t}.h The files were moved with no change. The callers were cleaned up a bit and a few of them had channel.h removed since not needed.	2020-06-11 10:18:58 +02:00
Willy Tarreau	5e539c9b8d	REORG: include: move stream_interface.h to haproxy/stream_interface{,-t}.h Almost no changes, removed stdlib and added buf-t and connection-t to the types to avoid a warning.	2020-06-11 10:18:58 +02:00
Willy Tarreau	209108dbbd	REORG: include: move ssl_sock.h to haproxy/ssl_sock{,-t}.h Almost nothing changed, just moved a static inline at the end and moved an export from the types to the main file.	2020-06-11 10:18:58 +02:00
Willy Tarreau	2867159d63	REORG: include: move lb_map.h to haproxy/lb_map{,-t}.h Nothing was changed.	2020-06-11 10:18:58 +02:00
Willy Tarreau	83487a833c	REORG: include: move cli.h to haproxy/cli{,-t}.h Almost no change except moving the cli_kw struct definition after the defines. Almost all users had both types&proto included, which is not surprising since this code is old and it used to be the norm a decade ago. These places were cleaned.	2020-06-11 10:18:58 +02:00
Willy Tarreau	2eec9b5f95	REORG: include: move stats.h to haproxy/stats{,-t}.h Just some minor reordering, and the usual cleanup of call places for those which didn't need it. We don't include the whole tools.h into stats-t anymore but just tools-t.h.	2020-06-11 10:18:58 +02:00
Willy Tarreau	3f0f82e7a9	REORG: move applet.h to haproxy/applet{,-t}.h The type file was slightly tidied. The cli-specific APPCTX_CLI_ST1_* flag definitions were moved to cli.h. The type file was adjusted to include buf-t.h and not the huge buf.h. A few call places were fixed because they did not need this include.	2020-06-11 10:18:58 +02:00
Willy Tarreau	8c42b8a147	REORG: include: split common/uri_auth.h into haproxy/uri_auth{,-t}.h Initially it looked like this could have been placed into auth.h or stats.h but it's not the case as it's what makes the link between them and the HTTP layer. However the file needed to be split in two. Quite a number of call places were dropped because these were mostly leftovers from the early days where the stats and cli were packed together.	2020-06-11 10:18:58 +02:00
Willy Tarreau	dcc048a14a	REORG: include: move acl.h to haproxy/acl{,-t}.h The files were moved almost as-is, just dropping arg-t and auth-t from acl-t but keeping arg-t in acl.h. It was useful to revisit the call places since a handful of files used to continue to include acl.h while they did not need it at all. Struct stream was only made a forward declaration since not otherwise needed.	2020-06-11 10:18:58 +02:00
Willy Tarreau	c6d61d762f	REORG: include: move trace.h to haproxy/trace{,-t}.h Only thread-t was added to satisfy THREAD_LOCAL but the rest was OK.	2020-06-11 10:18:58 +02:00
Willy Tarreau	48d25b3bc9	REORG: include: move session.h to haproxy/session{,-t}.h Almost no change was needed beyond a little bit of reordering of the types file and adjustments to use session-t instead of session at a few places.	2020-06-11 10:18:58 +02:00
Willy Tarreau	872f2ea209	REORG: include: move stick_table.h to haproxy/stick_table{,-t}.h The stktable_types[] array declaration was moved to the main file as it had nothing to do in the types. A few declarations were reordered in the types file so that defines were before the structs. Thread-t was added since there are a few __decl_thread(). The loss of peers.h revealed that cfgparse-listen needed it.	2020-06-11 10:18:58 +02:00
Willy Tarreau	3c2a7c2788	REORG: include: move peers.h to haproxy/peers{,-t}.h The cfg_peers external declaration was moved to the main file instead of the type one. A few types were still missing from the proto, causing warnings in the functions prototypes (proxy, stick_table).	2020-06-11 10:18:58 +02:00
Willy Tarreau	126ba3a1e1	REORG: include: move http_fetch.h to haproxy/http_fetch.h There's no type file for this trivial one. The unneeded dependency on htx.h was dropped.	2020-06-11 10:18:58 +02:00
Willy Tarreau	4aa573da6f	REORG: include: move checks.h to haproxy/check{,-t}.h All includes that were not absolutely necessary were removed because checks.h happens to very often be part of dependency loops. A warning was added about this in check-t.h. The fields, enums and structs were a bit tidied because it's particularly tedious to find anything there. It would make sense to split this in two or more files (at least extract tcp-checks). The file was renamed to the singular because it was one of the rare exceptions to have an "s" appended to its name compared to the struct name.	2020-06-11 10:18:58 +02:00
Willy Tarreau	7ea393d95e	REORG: include: move connection.h to haproxy/connection{,-t}.h The type file is becoming a mess, half of it is for the proxy protocol, another good part describes conn_streams and mux ops, it would deserve being split again. At least it was reordered so that elements are easier to find, with the PP-stuff left at the end. The MAX_SEND_FD macro was moved to compat.h as it's said to be the value for Linux.	2020-06-11 10:18:58 +02:00
Willy Tarreau	8b550afe1e	REORG: include: move tcp_rules.h to haproxy/tcp_rules.h There's no type file on this one which is pretty simple.	2020-06-11 10:18:58 +02:00
Willy Tarreau	3727a8a083	REORG: include: move signal.h to haproxy/signal{,-t}.h No change was necessary. Include from wdt.c was dropped since unneeded.	2020-06-11 10:18:58 +02:00
Willy Tarreau	fc77454aff	REORG: include: move proto_tcp.h to haproxy/proto_tcp.h There was no type file. This one really is trivial. A few missing includes were added to satisfy the exported functions prototypes.	2020-06-11 10:18:58 +02:00
Willy Tarreau	cea0e1bb19	REORG: include: move task.h to haproxy/task{,-t}.h The TASK_IS_TASKLET() macro was moved to the proto file instead of the type one. The proto part was a bit reordered to remove a number of ugly forward declarations of static inline functions. About ten C and H files had their dependency dropped since they were not using anything from task.h.	2020-06-11 10:18:58 +02:00
Willy Tarreau	f268ee8795	REORG: include: split global.h into haproxy/global{,-t}.h global.h was one of the messiest files, it has accumulated tons of implicit dependencies and declares many globals that make almost all other files include it. It managed to silence a dependency loop between server.h and proxy.h by being well placed to pre-define the required structs, forcing struct proxy and struct server to be forward-declared in a significant number of files. It was split in two, one which is the global struct definition and the few macros and flags, and the rest containing the functions prototypes. The UNIX_MAX_PATH definition was moved to compat.h.	2020-06-11 10:18:58 +02:00
Willy Tarreau	a171892501	REORG: include: move vars.h to haproxy/vars{,-t}.h A few includes (sessions.h, stream.h, api-t.h) were added for arguments that were first declared in function prototypes.	2020-06-11 10:18:58 +02:00
Willy Tarreau	b23e5958ed	REORG: include: move protocol_buffers.h to haproxy/protobuf{,-t}.h There is no C file for this one, the code was placed into sample.c which thus has a dependency on this file which itself includes sample.h. It would probably be wise to split that later.	2020-06-11 10:18:58 +02:00
Willy Tarreau	e6ce10be85	REORG: include: move sample.h to haproxy/sample{,-t}.h This one is particularly tricky to move because everyone uses it and it depends on a lot of other types. For example it cannot include arg-t.h and must absolutely only rely on forward declarations to avoid dependency loops between vars -> sample_data -> arg. In order to address this one, it would be nice to split the sample_data part out of sample.h.	2020-06-11 10:18:58 +02:00
Willy Tarreau	469509b39e	REORG: include: move payload.h to haproxy/payload.h There's no type file, it only contains fetch_rdp_cookie_name() and val_payload_lv() which probably ought to move somewhere else instead of staying there.	2020-06-11 10:18:58 +02:00
Willy Tarreau	2cd5809f94	REORG: include: move map to haproxy/map{,-t}.h Only small cleanups, and removal of a few includes from files that didn't need them.	2020-06-11 10:18:58 +02:00
Willy Tarreau	225a90aaec	REORG: include: move pattern.h to haproxy/pattern{,-t}.h It was moved as-is, except for extern declaration of pattern_reference. A few C files used to include it but didn't need it anymore after having been split apart so this was cleaned.	2020-06-11 10:18:58 +02:00
Willy Tarreau	213e99073b	REORG: include: move listener.h to haproxy/listener{,-t}.h stdlib and list were missing from listener.h, otherwise it was OK.	2020-06-11 10:18:58 +02:00
Willy Tarreau	546ba42c73	REORG: include: move lb_fwrr.h to haproxy/lb_fwrr{,-t}.h Nothing fancy, includes were already OK. The proto didn't reference the type, this was fixed. Still references proxy.h and server.h from types/.	2020-06-11 10:18:58 +02:00
Willy Tarreau	0254941666	REORG: include: move lb_fwlc.h to haproxy/lb_fwlc{,-t}.h Nothing fancy, includes were already OK. The proto didn't reference the type, this was fixed. Still references proxy.h and server.h from types/.	2020-06-11 10:18:58 +02:00
Willy Tarreau	b5fc3bf6dc	REORG: include: move lb_fas.h to haproxy/lb_fas{,-t}.h Nothing fancy, includes were already OK. The proto didn't reference the type, this was fixed. Still references proxy.h and server.h from types/.	2020-06-11 10:18:58 +02:00
Willy Tarreau	fbe8da3320	REORG: include: move lb_chash.h to haproxy/lb_chash{,-t}.h Nothing fancy, includes were already OK. The proto didn't reference the type, this was fixed. Still references proxy.h and server.h from types/.	2020-06-11 10:18:58 +02:00
Willy Tarreau	52d88725ab	REORG: move ssl_crtlist.h to haproxy/ssl_crtlist{,-t}.h These files were already clean as well. Just added ebptnode which is needed in crtlist_entry.	2020-06-11 10:18:58 +02:00
Willy Tarreau	47d7f9064d	REORG: include: move ssl_ckch.h to haproxy/ssl_ckch{,-t}.h buf-t and ebmbtree were included.	2020-06-11 10:18:58 +02:00
Willy Tarreau	b2bd865804	REORG: include: move ssl_utils.h to haproxy/ssl_utils.h Just added buf-t and openssl-compat for the missing types that appear in the prototypes.	2020-06-11 10:18:57 +02:00
Willy Tarreau	b5abe5bd5d	REORG: include: move mworker.h to haproxy/mworker{,-t}.h One function prototype makes reference to struct mworker_proc which was not defined there but in global.h instead. This definition, along with the PROC_O_* fields were moved to mworker-t.h instead.	2020-06-11 10:18:57 +02:00
Willy Tarreau	d7d2c28104	CLEANUP: include: remove unused mux_pt.h It used to be needed to export mux_pt_ops when it was the only way to detect a mux but that's no longer the case.	2020-06-11 10:18:57 +02:00
Willy Tarreau	c761f843da	REORG: include: move http_rules.h to haproxy/http_rules.h There was no include file. This one still includes types/proxy.h.	2020-06-11 10:18:57 +02:00
Willy Tarreau	8efbdfb77b	REORG: include: move obj_type.h to haproxy/obj_type{,-t}.h No change was necessary. It still includes lots of types/* files.	2020-06-11 10:18:57 +02:00
Willy Tarreau	762d7a5117	REORG: include: move frontend.h to haproxy/frontend.h There was no type file for this one, it only contains frontend_accept().	2020-06-11 10:18:57 +02:00
Willy Tarreau	278161c1b8	REORG: include: move capture.h to haproxy/capture{,-t}.h The file was split into two since it contains a variable declaration.	2020-06-11 10:18:57 +02:00
Willy Tarreau	cc9bbfb7b5	REORG: include: split mailers.h into haproxy/mailers{,-t}.h The file mostly contained struct definitions but there was also a variable export. Most of the stuff currently lies in checks.h and should definitely move here!	2020-06-11 10:18:57 +02:00
Willy Tarreau	167e1eb7c7	REORG: include: move counters.h to haproxy/counters-t.h Since these are only type definitions, let's move them to counters-t.h and reserve counters.h for when functions will be needed.	2020-06-11 10:18:57 +02:00
Willy Tarreau	7d865a5e3e	REORG: include: move flt_http_comp.h to haproxy/ There was no type definition for this file which was moved as-is.	2020-06-11 10:18:57 +02:00
Willy Tarreau	eb92deb500	REORG: include: move dns.h to haproxy/dns{,-t}.h The files were moved as-is.	2020-06-11 10:18:57 +02:00
Willy Tarreau	ac13aeaa89	REORG: include: move auth.h to haproxy/auth{,-t}.h The STATS_DEFAULT_REALM and STATS_DEFAULT_URI were moved to defaults.h. It was required to include types/pattern.h and types/sample.h since they are mentioned in function prototypes. It would be wise to merge this with uri_auth.h later.	2020-06-11 10:18:57 +02:00
Willy Tarreau	aa74c4e1b3	REORG: include: move arg.h to haproxy/arg{,-t}.h Almost no change was needed; chunk.h was replaced with buf-t.h. It depends on types/vars.h and types/protocol_buffers.h.	2020-06-11 10:18:57 +02:00
Willy Tarreau	122eba92b7	REORG: include: move action.h to haproxy/action{,-t}.h List.h was missing for LIST_ADDQ(). A few unneeded includes of action.h were removed from certain files. This one still relies on applet.h and stick-table.h.	2020-06-11 10:18:57 +02:00
Willy Tarreau	8c794000c4	REORG: include: move hlua_fcn.h to haproxy/hlua_fcn.h Added lua.h which was missing from the includes.	2020-06-11 10:18:57 +02:00
Willy Tarreau	8641605ff6	REORG: include: move hlua.h to haproxy/hlua{,-t}.h This one required a few more includes as it uses list and ebpt_node. It still references lots of types/ files for now.	2020-06-11 10:18:57 +02:00
Willy Tarreau	87735330d1	REORG: include: move http_htx.h to haproxy/http_htx{,-t}.h A few includes had to be added, namely list-t.h in the type file and types/proxy.h in the proto file. actions.h was including http-htx.h but didn't need it so it was dropped.	2020-06-11 10:18:57 +02:00
Willy Tarreau	c6fe884c74	REORG: include: move h1_htx.h to haproxy/h1_htx.h This one didn't have a type file. A few missing includes were added (htx, types).	2020-06-11 10:18:57 +02:00
Willy Tarreau	0a3bd3919e	REORG: include: move compression.h to haproxy/compression{,-t}.h No change was needed.	2020-06-11 10:18:57 +02:00
Willy Tarreau	f07f30c15f	REORG: include: move proto/proto_sockpair.h to haproxy/proto_sockpair.h This one didn't have any types file and was moved as-is.	2020-06-11 10:18:57 +02:00
Willy Tarreau	832ce65914	REORG: include: move proto_udp.h to haproxy/proto_udp{,-t}.h No change was needed.	2020-06-11 10:18:57 +02:00
Willy Tarreau	14e8af5932	CLEANUP: include: remove empty raw_sock.h This one only contained an include for types/stream_interface.h, which was already present in its 3 users.	2020-06-11 10:18:57 +02:00
Willy Tarreau	551271d99c	REORG: include: move pipe.h to haproxy/pipe{,-t}.h No change was needed beyond a minor cleanup.	2020-06-11 10:18:57 +02:00
Willy Tarreau	ba2f73d40e	REORG: include: move sink.h to haproxy/sink{,-t}.h The sink files could be moved with almost no change since they didn't rely on anything fancy. ssize_t required sys/types.h and thread.h was needed for the locks.	2020-06-11 10:18:57 +02:00
Willy Tarreau	d2ad57c352	REORG: include: move ring to haproxy/ring{,-t}.h Some includes were wrong in the type definition but beyond this no change was needed.	2020-06-11 10:18:57 +02:00
Willy Tarreau	0f6ffd652e	REORG: include: move fd.h to haproxy/fd{,-t}.h A few includes were missing in each file. A definition of struct polled_mask was moved to fd-t.h. The MAX_POLLERS macro was moved to defaults.h Stdio used to be silently inherited from whatever path but it's needed for list_pollers() which takes a FILE* and which can thus not be forward-declared.	2020-06-11 10:18:57 +02:00
Willy Tarreau	fc8f6a8517	REORG: include: move port_range.h to haproxy/port_range{,-t}.h The port ranges didn't depend on anything. However they were missing some includes such as stdlib and api-t.h which were added.	2020-06-11 10:18:57 +02:00
Willy Tarreau	334099c324	REORG: include: move shctx to haproxy/shctx{,-t}.h Minor cleanups were applied, some includes were missing from the types file and some were incorrect in a few C files (duplicated or not using path).	2020-06-11 10:18:57 +02:00
Willy Tarreau	3afc4c4bb0	REORG: include: move dict.h to haproxy/dict{,-t}.h This was entirely free-standing. haproxy/api-t.h was added for size_t.	2020-06-11 10:18:57 +02:00
Willy Tarreau	48fbcae07c	REORG: tools: split common/standard.h into haproxy/tools{,-t}.h And also rename standard.c to tools.c. The original split between tools.h and standard.h dates from version 1.3-dev and was mostly an accident. This patch moves the files back to what they were expected to be, and takes care of not changing anything else. However this time tools.h was split between functions and types, because it contains a small number of commonly used macros and structures (e.g. name_desc) which in turn cause the massive list of includes of tools.h to conflict with the callers. They remain the ugliest files of the whole project and definitely need to be cleaned and split apart. A few types are defined there only for functions provided there, and some parts are even OS-specific and should move somewhere else, such as the symbol resolution code.	2020-06-11 10:18:57 +02:00
Willy Tarreau	2dd7c35052	REORG: include: move protocol.h to haproxy/protocol{,-t}.h The protocol.h files are pretty low in the dependency and (sadly) used by some files from common/. Almost nothing was changed except lifting a few comments.	2020-06-11 10:18:57 +02:00
Willy Tarreau	fa2ef5b5eb	REORG: include: move common/fcgi.h to haproxy/ The file was moved almost verbatim (only stdio.h was dropped as useless). It was not split between types and functions because it's only included from direct C code (fcgi.c and mux_fcgi.c) as well as fcgi_app.h, included from the same ones, which should also be remerged as a single one.	2020-06-11 10:18:57 +02:00
Willy Tarreau	bf0731491b	REORG: include: move common/h2.h to haproxy/h2.h No change was performed, the file is only included from C files and currently doesn't need to be split into types+functions.	2020-06-11 10:18:57 +02:00
Willy Tarreau	be327fa332	REORG: include: move hpack*.h to haproxy/ and split hpack-tbl The various hpack files are self-contained, but hpack-tbl was one of those showing difficulties when pools were added because that began to add quite some dependencies. Now when built in standalone mode, it still uses the bare minimum pool definitions and doesn't require to know the prototypes anymore when only the structures are needed. Thus the files were moved verbatim except for hpack-tbl which was split between types and prototypes.	2020-06-11 10:18:57 +02:00
Willy Tarreau	16f958c0e9	REORG: include: split common/htx.h into haproxy/htx{,-t}.h Most of the file was a large set of HTX elements manipulation functions and few types, so splitting them allowed to further reduce dependencies and shrink the build time. Doing so revealed that a few files (h2.c, mux_pt.c) needed haproxy/buf.h and were previously getting it through htx.h. They were fixed.	2020-06-11 10:18:57 +02:00
Willy Tarreau	5413a87ad3	REORG: include: move common/h1.h to haproxy/h1.h The file was moved as-is. There was a wrong dependency on dynbuf.h instead of buf.h which was addressed. There was no benefit to splitting this between types and functions.	2020-06-11 10:18:57 +02:00
Willy Tarreau	0017be0143	REORG: include: split common/http-hdr.h into haproxy/http-hdr{,-t}.h There's only one struct and 2 inline functions. It could have been merged into http.h but that would have added a massive dependency on the hpack parts for nothing, so better keep it this way since hpack is already freestanding and portable.	2020-06-11 10:18:57 +02:00
Willy Tarreau	cd72d8c981	REORG: include: split common/http.h into haproxy/http{,-t}.h So the enums and structs were placed into http-t.h and the functions into http.h. This revealed that several files were depending on http.h but not including it, as it was silently inherited via other files.	2020-06-11 10:18:57 +02:00
Willy Tarreau	c2f7c5895c	REORG: include: move common/ticks.h to haproxy/ticks.h Nothing needed to be changed, there are no exported types.	2020-06-11 10:18:57 +02:00
Willy Tarreau	374b442cbc	REORG: include: split common/xref.h into haproxy/xref{,-t}.h The type is the only element needed by applet.h and hlua.h, while hlua.c needs the various functions. XREF_BUSY was placed into the types as well since it's better to have the special values there.	2020-06-11 10:18:57 +02:00
Willy Tarreau	7cd8b6e3a4	REORG: include: split common/regex.h into haproxy/regex{,-t}.h Regex are essentially included for myregex_t but it turns out that several of the C files didn't include it directly, relying on the one included by their own .h. This has been cleanly addressed so that only the type is included by H files which need it, and adding the missing includes for the other ones.	2020-06-11 10:18:57 +02:00
Willy Tarreau	7a00efbe43	REORG: include: move common/namespace.h to haproxy/namespace{,-t}.h The type was moved out as it's used by standard.h for netns_entry. Instead of just being a forward declaration when not used, it's an empty struct, which makes gdb happier (the resulting stripped executable is the same).	2020-06-11 10:18:57 +02:00
Willy Tarreau	6131d6a731	REORG: include: move common/net_helper.h to haproxy/net_helper.h No change was necessary.	2020-06-11 10:18:57 +02:00
Willy Tarreau	2741c8c4aa	REORG: include: move common/buffer.h to haproxy/dynbuf{,-t}.h The pretty confusing "buffer.h" was in fact not the place to look for the definition of "struct buffer" but the one responsible for dynamic buffer allocation. As such it defines the struct buffer_wait and the few functions to allocate a buffer or wait for one. This patch moves it renaming it to dynbuf.h. The type definition was moved to its own file since it's included in a number of other structs. Doing this cleanup revealed that a significant number of files used to rely on this one to inherit struct buffer through it but didn't need anything from this file at all.	2020-06-11 10:18:57 +02:00
Willy Tarreau	a04ded58dc	REORG: include: move activity to haproxy/ This moves types/activity.h to haproxy/activity-t.h and proto/activity.h to haproxy/activity.h. The macros defining the bit field values for the profiling variable were moved to the type file to be more future-proof.	2020-06-11 10:18:57 +02:00
Willy Tarreau	c13ed53b12	REORG: include: move common/chunk.h to haproxy/chunk.h No change was necessary, it was already properly split.	2020-06-11 10:18:57 +02:00
Willy Tarreau	d0ef439699	REORG: include: move common/memory.h to haproxy/pool.h Now the file is ready to be stored into its final destination. A few minor reorderings were performed to keep the file properly organized, making the various sections more visible (cache & lockless). In addition and to stay consistent, memory.c was renamed to pool.c.	2020-06-11 10:18:57 +02:00
Willy Tarreau	ed891fda52	MEDIUM: memory: make local pools independent of lockless pools Till now the local pool caches were implemented only when lockless pools were in use. This was mainly due to the difficulties to disentangle the code parts. However the locked pools would further benefit from the local cache, and having this would reduce the variants in the code. This patch does just this. It adds a new debug macro DEBUG_NO_LOCAL_POOLS to forcefully disable local pool caches, and makes sure that the high level functions are now strictly the same between locked and lockless (pool_alloc(), pool_alloc_dirty(), pool_free(), pool_get_first()). The pool index calculation was moved inside the CONFIG_HAP_LOCAL_POOLS guards. This allowed to move them out of the giant #ifdef and to significantly reduce the code duplication. A quick perf test shows that with locked pools the performance increases by roughly 10% on 8 threads and gets closer to the lockless one.	2020-06-11 10:18:57 +02:00
Willy Tarreau	f8c1b648c0	MINOR: memory: move pool-specific path of the locked pool_free() to __pool_free() pool_free() was not identical between locked and lockless pools. The different was the call to __pool_free() in one case versus open-coded accesses in the other, and the poisoning brought by commit `da52035a45` ("MINOR: memory: also poison the area on freeing") which unfortunately did if only for the lockless path. Let's now have __pool_free() to work on the global pool also in the locked case so that the code is architected similarly.	2020-06-11 10:18:56 +02:00
Willy Tarreau	fb117e6a8e	MEDIUM: memory: don't let pool_put_to_cache() free the objects itself Just as for the allocation path, the release path was not symmetrical. It was not logical to have pool_put_to_cache() free the objects itself, it was pool_free's job. In addition, just because of a variable export issue, it the insertion of the object to free back into the local cache couldn't be inlined while it was very cheap. This patch just slightly reorganizes this code path by making pool_free() decide whether or not to put the object back into the cache via pool_put_to_cache() otherwise place it back to the global pool using __pool_free(). Then pool_put_to_cache() adds the item to the local cache and only calls pool_evict_from_cache() if the cache is too big.	2020-06-11 10:18:56 +02:00
Willy Tarreau	a6982e5868	MINOR: memory: don't let __pool_get_first() pick from the cache When building with the local cache support, we have an asymmetry in the allocation path which is that __pool_get_first() picks from the cache while when no cache support is used, this one directly accesses the shared area. It looks like it was done this way only to centralize the call to __pool_get_from_cache() but this was not a good idea as it complicates splitting the code. Let's move the cache access to the upper layer so that __pool_get_first() remains agnostic to the cache support. The call tree now looks like this with the cache enabled : pool_get_first() __pool_get_from_cache() // if cache enabled __pool_get_first() pool_alloc() pool_alloc_dirty() __pool_get_from_cache() // if cache enabled __pool_get_first() __pool_refill_alloc() __pool_free() pool_free_area() pool_put_to_cache() __pool_free() __pool_put_to_cache() pool_free() pool_put_to_cache() With cache disabled, the pool_free() path still differs: pool_free() __pool_free_area() __pool_put_to_cache()	2020-06-11 10:18:56 +02:00
Willy Tarreau	24aa1eebaa	REORG: memory: move the OS-level allocator to haproxy/pool-os.h The memory.h file is particularly complex due to the combination of debugging options. This patch extracts the OS-level interface and places it into a new file: pool-os.h. Doing this also moves pool_alloc_area() and pool_free_area() out of the #ifndef CONFIG_HAP_LOCKLESS_POOLS, making them usable from __pool_refill_alloc(), pool_free(), pool_flush() and pool_gc() instead of having direct calls to malloc/free there that are hard to wrap for debugging purposes.	2020-06-11 10:18:56 +02:00
Willy Tarreau	3646777a77	REORG: memory: move the pool type definitions to haproxy/pool-t.h This is the beginning of the move and cleanup of memory.h. This first step only extracts type definitions and basic macros that are needed by the files which reference a pool. They're moved to pool-t.h (since "pool" is more obvious than "memory" when looking for pool-related stuff). 3 files which didn't need to include the whole memory.h were updated.	2020-06-11 10:18:56 +02:00
Willy Tarreau	606135ac88	CLEANUP: pool: include freq_ctr.h and remove locally duplicated functions In memory.h we had to reimplement the swrate* functions just because of a broken circular dependency around freq_ctr.h. Now that this one is solved, let's get rid of this copy and use the original ones instead.	2020-06-11 10:18:56 +02:00
Willy Tarreau	6634794992	REORG: include: move freq_ctr to haproxy/ types/freq_ctr.h was moved to haproxy/freq_ctr-t.h and proto/freq_ctr.h was moved to haproxy/freq_ctr.h. Files were updated accordingly, no other change was applied.	2020-06-11 10:18:56 +02:00
Willy Tarreau	889faf467b	CLEANUP: include: remove excessive includes of common/standard.h Some of them were simply removed as unused (possibly some leftovers from an older cleanup session), some were turned to haproxy/bitops.h and a few had to be added (hlua.c and stick-table.h need standard.h for parse_time_err; htx.h requires chunk.h but used to get it through standard.h).	2020-06-11 10:18:56 +02:00
Willy Tarreau	aea4635c38	REORG: include: move integer manipulation functions from standard.h to intops.h There are quite a number of integer manipulation functions defined in standard.h, which is one of the reasons why standard.h is included from many places and participates to the dependencies loop. Let's just have a new file, intops.h to place all these operations. These are a few bitops, 32/64 bit mul/div/rotate, integer parsing and encoding (including varints), the full avalanche hash function, and the my_htonll/my_ntohll functions. For now no new C file was created for these yet.	2020-06-11 10:18:56 +02:00
Willy Tarreau	92b4f1372e	REORG: include: move time.h from common/ to haproxy/ This one is included almost everywhere and used to rely on a few other .h that are not needed (unistd, stdlib, standard.h). It could possibly make sense to split it into multiple parts to distinguish operations performed on timers and the internal time accounting, but at this point it does not appear much important.	2020-06-11 10:18:56 +02:00
Willy Tarreau	af613e8359	CLEANUP: thread: rename __decl_hathreads() to __decl_thread() I can never figure whether it takes an "s" or not, and in the end it's better if it matches the file's naming, so let's call it "__decl_thread".	2020-06-11 10:18:56 +02:00
Willy Tarreau	3f567e4949	REORG: include: split hathreads into haproxy/thread.h and haproxy/thread-t.h This splits the hathreads.h file into types+macros and functions. Given that most users of this file used to include it only to get the definition of THREAD_LOCAL and MAXTHREADS, the bare minimum was placed into thread-t.h (i.e. types and macros). All the thread management was left to haproxy/thread.h. It's worth noting the drop of the trailing "s" in the name, to remove the permanent confusion that arises between this one and the system implementation (no "s") and the makefile's option (no "s"). For consistency, src/hathreads.c was also renamed thread.c. A number of files were updated to only include thread-t which is the one they really needed. Some future improvements are possible like replacing empty inlined functions with macros for the thread-less case, as building at -O0 disables inlining and causes these ones to be emitted. But this really is cosmetic.	2020-06-11 10:18:56 +02:00
Willy Tarreau	5775d0964a	CLEANUP: threads: remove a few needless includes of hathreads.h A few files were including it while not needing it (anymore). Some only required access to the atomic ops and got haproxy/atomic.h in exchange. Others didn't need it at all. A significant number of files still include it only for THREAD_LOCAL definition.	2020-06-11 10:18:56 +02:00
Willy Tarreau	9453ecd670	REORG: threads: extract atomic ops from hathreads.h The hathreads.h file has quickly become a total mess because it contains thread definitions, atomic operations and locking operations, all this for multiple combinations of threads, debugging and architectures, and all this done with random ordering! This first patch extracts all the atomic ops code from hathreads.h to move it to haproxy/atomic.h. The code there still contains several sections based on non-thread vs thread, and GCC versions in the latter case. Each section was arranged in the exact same order to ease finding. The redundant HA_BARRIER() which was the same as __ha_compiler_barrier() was dropped in favor of the latter which follows the naming convention of all other barriers. It was only used in freq_ctr.c which was updated. Additionally, __ha_compiler_barrier() was defined unconditionally but used only for thread-related operations, so it was made thread-only like HA_BARRIER() used to be. We'd still need to have two types of compiler barriers, one for the general case (e.g. signals) and another one for concurrency, but this was not addressed here. Some comments were added at the beginning of each section to inform about the use case and warn about the traps to avoid. Some files which continue to include hathreads.h solely for atomic ops should now be updated.	2020-06-11 10:18:56 +02:00
Willy Tarreau	853b297c9b	REORG: include: split mini-clist into haproxy/list and list-t.h Half of the users of this include only need the type definitions and not the manipulation macros nor the inline functions. Moving the various types into mini-clist-t.h makes the files cleaner. The other one had all its includes grouped at the top. A few files continued to reference it without using it and were cleaned. In addition it was about time that we'd rename that file, it's not "mini" anymore and contains a bit more than just circular lists.	2020-06-11 10:18:56 +02:00
Willy Tarreau	f0f1c80daf	REORG: include: move istbuf.h to haproxy/ This one now relies on two files that were already cleaned up and is only used by buffer.h.	2020-06-11 10:18:56 +02:00
Willy Tarreau	8dabda7497	REORG: include: split buf.h into haproxy/buf-t.h and haproxy/buf.h File buf.h is one common cause of pain in the dependencies. Many files in the code need it to get the struct buffer definition, and a few also need the inlined functions to manipulate a buffer, but the file used to depend on a long chain only for BUG_ON() (addressed by last commit). Now buf.h is split into buf-t.h which only contains the type definitions, and buf.h for all inlined functions. Callers who don't care can continue to use buf.h but files in types/ must only use buf-t.h. sys/types.h had to be added to buf.h to get ssize_t as used by b_move(). It's worth noting that ssize_t is only supposed to be a size_t supporting -1, so b_move() ought to be rethought regarding this. The files were moved to haproxy/ and all their users were updated accordingly. A dependency issue was addressed on fcgi whose C file didn't include buf.h.	2020-06-11 10:18:56 +02:00
Willy Tarreau	025beea507	CLEANUP: debug: drop unused function p_malloc() This one was introduced 5 years ago for debugging and never really used. It is the one which used to cause circular dependencies issues. Let's drop it instead of starting to split the debug include in two.	2020-06-11 10:18:56 +02:00
Willy Tarreau	2a83d60662	REORG: include: move debug.h from common/ to haproxy/ The debug file is cleaner now and does not depend on much anymore.	2020-06-11 10:18:56 +02:00
Willy Tarreau	58017eef3f	REORG: include: move the BUG_ON() code to haproxy/bug.h This one used to be stored into debug.h but the debug tools got larger and require a lot of other includes, which can't use BUG_ON() anymore because of this. It does not make sense and instead this macro should be placed into the lower includes and given its omnipresence, the best solution is to create a new bug.h with the few surrounding macros needed to trigger bugs and place assertions anywhere. Another benefit is that it won't be required to add include <debug.h> anymore to use BUG_ON, it will automatically be covered by api.h. No less than 32 occurrences were dropped. The FSM_PRINTF macro was dropped since not used at all anymore (probably since 1.6 or so).	2020-06-11 10:18:56 +02:00
Willy Tarreau	eb6f701b99	REORG: include: move ist.h from common/ to import/ Fortunately that file wasn't made dependent upon haproxy since it was integrated, better isolate it before it's too late. Its dependency on api.h was the result of the change from config.h, which in turn wasn't correct. It was changed back to stddef.h for size_t and sys/types.h for ssize_t. The recently added reference to MAX() was changed as it was placed only to avoid a zero length in the non-free-standing version and was causing a build warning in the hpack encoder.	2020-06-11 10:18:56 +02:00
Willy Tarreau	6019faba50	REORG: include: move openssl-compat.h from common/ to haproxy/ This file is to openssl what compat.h is to the libc, so it makes sense to move it to haproxy/. It could almost be part of api.h but given the amount of openssl stuff that gets loaded I fear it could increase the build time. Note that this file contains lots of inlined functions. But since it does not depend on anything else in haproxy, it remains safe to keep all that together.	2020-06-11 10:18:56 +02:00
Willy Tarreau	8d36697dee	REORG: include: move base64.h, errors.h and hash.h from common to haproxy/ These ones do not depend on any other file. One used to include haproxy/api.h but that was solely for stddef.h.	2020-06-11 10:18:56 +02:00
Willy Tarreau	d678805783	REORG: include: move version.h to haproxy/ Few files were affected. The release scripts was updated.	2020-06-11 10:18:56 +02:00
Willy Tarreau	fd4bffe7c0	REORG: include: move the base files from common/ to haproxy/ The files currently covered by api-t.h and api.h (compat, compiler, defaults, initcall) are now located inside haproxy/.	2020-06-11 10:18:56 +02:00
Willy Tarreau	b9082a93e5	CLEANUP: include: remove unused common/tools.h Let's definitely get rid of this old file.	2020-06-11 10:18:56 +02:00
Willy Tarreau	4d653a6285	REORG: include: move SWAP/MID_RANGE/MAX_RANGE from tools.h to standard.h Tools.h doesn't make sense for these 3 macros alone anymore, let's move them to standard.h which will ultimately become again tools.h once moved.	2020-06-11 10:18:56 +02:00
Willy Tarreau	5ae5006dde	REORG: include: move MIN/MAX from tools.h to compat.h Given that these macros are usually provided by sys/param.h, better move them to compat.h.	2020-06-11 10:18:56 +02:00
Willy Tarreau	57bb71e83a	CLEANUP: include: remove unused template.h There is one "template.h" per include subdirectory to show how to create a new file but in practice nobody knows they're here so they're useless. Let's simply remove them.	2020-06-11 10:18:56 +02:00
Willy Tarreau	86556a5377	CLEANUP: include: remove common/config.h It was already an indirection to load other files, it's not used anymore.	2020-06-11 10:18:56 +02:00
Willy Tarreau	4c7e4b7738	REORG: include: update all files to use haproxy/api.h or api-t.h if needed All files that were including one of the following include files have been updated to only include haproxy/api.h or haproxy/api-t.h once instead: - common/config.h - common/compat.h - common/compiler.h - common/defaults.h - common/initcall.h - common/tools.h The choice is simple: if the file only requires type definitions, it includes api-t.h, otherwise it includes the full api.h. In addition, in these files, explicit includes for inttypes.h and limits.h were dropped since these are now covered by api.h and api-t.h. No other change was performed, given that this patch is large and affects 201 files. At least one (tools.h) was already freestanding and didn't get the new one added.	2020-06-11 10:18:42 +02:00
Willy Tarreau	7ab7031e34	REORG: include: create new file haproxy/api.h This file includes everything that must be guaranteed to be available to any buildable file in the project (including the contrib/ subdirs). For now it includes <haproxy/api-t.h> so that standard integer types and compiler macros are known, <common/initcall.h> to ease dynamic registration of init functions, and <common/tools.h> for a few MIN/MAX macros. version.h should probably also be added, though at the moment it doesn't bring a great value. All files which currently include the ones above should now switch to haproxy/api.h or haproxy/api-t.h instead. This should also reduce build time by having a single guard for several files at once.	2020-06-11 09:31:11 +02:00
Willy Tarreau	ca1765713b	REORG: include: create new file haproxy/api-t.h This file is at the lowest level of the include tree. Its purpose is to make sure that common types are known pretty much everywhere, particularly in structure declarations. It will essentially cover integer types such as uintXX_t via inttypes.h, "size_t" and "ptrdiff_t" via stddef.h, and various type modifiers such as __maybe_unused or ALIGN() via compiler.h, compat.h and defaults.h. It could be enhanced later if required, for example if some macros used to compute array sizes are needed.	2020-06-11 09:31:11 +02:00
Willy Tarreau	8d2b777fe3	REORG: ebtree: move the include files from ebtree to include/import/ This is where other imported components are located. All files which used to directly include ebtree were touched to update their include path so that "import/" is now prefixed before the ebtree-related files. The ebtree.h file was slightly adjusted to read compiler.h from the common/ subdirectory (this is the only change). A build issue was encountered when eb32sctree.h is loaded before eb32tree.h because only the former checks for the latter before defining type u32. This was addressed by adding the reverse ifdef in eb32tree.h. No further cleanup was done yet in order to keep changes minimal.	2020-06-11 09:31:11 +02:00
Christopher Faulet	89aed32bff	MINOR: mux-h1/proxy: Add a proxy option to disable clear h2 upgrade By default, HAProxy is able to implicitly upgrade an H1 client connection to an H2 connection if the first request it receives from a given HTTP connection matches the HTTP/2 connection preface. This way, it is possible to support H1 and H2 clients on non-SSL connections. It could be a problem if for any reason, the H2 upgrade is not acceptable. "option disable-h2-upgrade" may now be used to disable it, per proxy. The main purpose of this option is to let an admin totally disable the H2 support for security reasons. Recently, a critical issue in the HPACK decoder was fixed, forcing everyone to upgrade their HAProxy version to fix the bug. It is possible to disable H2 for SSL connections, but not on clear ones. This option would have been a viable workaround.	2020-06-03 10:23:39 +02:00
Willy Tarreau	39bd740d00	CLEANUP: regex: remove outdated support for regex actions The support for reqrep and friends was removed in 2.1 but the chain_regex() function and the "action" field in the regex struct were still there. This patch removes them. One point worth mentioning though. There is a check_replace_string() function whose purpose was to validate the replacement strings passed to reqrep. It should also be used for other replacement regex, but is never called. Callers of exp_replace() should be checked and a call to this function should be added to detect the error early.	2020-06-02 17:17:13 +02:00
Emeric Brun	975564784f	MEDIUM: ring: add new srv statement to support octet counting forward log-proto <logproto> The "log-proto" specifies the protocol used to forward event messages to a server configured in a ring section. Possible values are "legacy" and "octet-count" corresponding respectively to "Non-transparent-framing" and "Octet counting" in rfc6587. "legacy" is the default. Notes: a separate io_handler was created to avoid a per-message test and to prepare the code to set different log protocols such as request-response based ones.	2020-05-31 10:49:43 +02:00
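The two framing modes named in this message can be illustrated with a small sketch. It assumes the usual reading of RFC 6587: octet counting prefixes each message with its length and a space, while non-transparent framing appends a trailer, here a line feed. The enum and the function `toy_frame()` are hypothetical names for illustration and are not taken from the HAProxy sources.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-ins for the "legacy" and "octet-count" values. */
enum toy_log_proto { TOY_LEGACY, TOY_OCTET_COUNT };

/* Frames <msg> into <out> (capacity <size>) according to <proto>.
 * Returns the frame length, or -1 if the frame does not fit.
 */
static int toy_frame(enum toy_log_proto proto, const char *msg,
                     char *out, size_t size)
{
	size_t len = strlen(msg);
	int ret;

	assert(out != NULL && size > 0);

	if (proto == TOY_OCTET_COUNT)
		ret = snprintf(out, size, "%zu %s", len, msg); /* MSG-LEN SP MSG */
	else
		ret = snprintf(out, size, "%s\n", msg);        /* MSG + LF trailer */

	/* snprintf() reports the untruncated length, so detect overflow */
	if (ret < 0 || (size_t)ret >= size)
		return -1;
	return ret;
}
```

With octet counting the receiver knows each message length up front, so a message may itself contain line feeds; with the trailer-based mode it may not.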
Emeric Brun	494c505703	MEDIUM: ring: add server statement to forward messages from a ring This patch adds new statement "server" into ring section, and the related "timeout connect" and "timeout server". server <name> <address> [param*] Used to configure a syslog tcp server to forward messages from ring buffer. This supports all "server" parameters found in 5.2 paragraph. Some of these parameters are irrelevant for "ring" sections. timeout connect <timeout> Set the maximum time to wait for a connection attempt to a server to succeed. Arguments : <timeout> is the timeout value specified in milliseconds by default, but can be in any other unit if the number is suffixed by the unit, as explained at the top of this document. timeout server <timeout> Set the maximum time for pending data staying into output buffer. Arguments : <timeout> is the timeout value specified in milliseconds by default, but can be in any other unit if the number is suffixed by the unit, as explained at the top of this document. Example: global log ring@myring local7 ring myring description "My local buffer" format rfc3164 maxlen 1200 size 32764 timeout connect 5s timeout server 10s server mysyslogsrv 127.0.0.1:6514	2020-05-31 10:46:13 +02:00
Emeric Brun	dcd58afaf1	MINOR: ring: re-work ring attach generic API. Attach is now independent on appctx, which was unused anyway.	2020-05-31 10:37:31 +02:00
Willy Tarreau	21072b9480	CLEANUP: pools: use the regular lock for the flush operation on lockless pools Commit `04f5fe87d3` introduced an rwlock in the pools to deal with the risk that pool_flush() dereferences an area being freed, and commit `899fb8abdc` turned it into a spinlock. The pools already contain a spinlock in case of locked pools, so let's use the same and simplify the code by removing ifdefs. At this point I'm really suspecting that if pool_flush() would instead rely on __pool_get_first() to pick entries from the pool, the concurrency problem could never happen since only one user would get a given entry at once, thus it could not be freed by another user. It's not certain this would be faster however because of the number of atomic ops to retrieve one entry compared to a locked batch.	2020-05-29 17:28:04 +02:00
Christopher Faulet	0bac4cdf1a	CLEANUP: http: Remove unused HTTP message templates HTTP_1XX, HTTP_3XX and HTTP_4XX message templates are no longer used. Only HTTP_302 and HTTP_303 are used during configuration parsing by "errorloc" family directives. So these templates are removed from the generic http code. And HTTP_302 and HTTP_303 templates are moved as static strings in the function parsing "errorloc" directives.	2020-05-28 15:07:20 +02:00
Christopher Faulet	b304883754	MINOR: http-rules: Use an action function to eval http-request auth rules Now http-request auth rules are evaluated in a dedicated function and no longer handled "in place" during the HTTP rules evaluation. Thus the action name ACT_HTTP_REQ_AUTH is removed. In addition, http_reply_40x_unauthorized() is also removed. This part is now handled in the new action_ptr callback function.	2020-05-28 15:07:20 +02:00
Christopher Faulet	612f2eafe9	MINOR: http-ana: Use proxy's error replies to emit 401/407 responses There is no reason to not use proxy's error replies to emit 401/407 responses. The function http_reply_40x_unauthorized(), responsible to emit those responses, is not really complex. It only adds a WWW-Authenticate/Proxy-Authenticate header to a generic message. So now, error replies can be defined for 401 and 407 status codes, using errorfile or http-error directives. When an http-request auth rule is evaluated, the corresponding error reply is used. For 401 responses, all occurrences of the WWW-Authenticate header are removed and replaced by a new one with a basic authentication challenge for the configured realm. For 407 responses, the same is done on the Proxy-Authenticate header. If the error reply must not be altered, "http-request return" rule must be used instead.	2020-05-28 15:07:20 +02:00
Christopher Faulet	ae43b6c446	MINOR: http-ana: Make the function http_reply_to_htx() public This function may be used from anywhere to convert an HTTP reply to an HTX message.	2020-05-28 15:07:20 +02:00
Willy Tarreau	63a8738724	MEDIUM: pools: directly free objects when pools are too much crowded During pool_free(), when the ->allocated value is 125% of needed_avg or more, instead of putting the object back into the pool, it's immediately freed using free(). By doing this we manage to significantly reduce the amount of memory pinned in pools after transient traffic spikes. During a test involving a constant load of 100 concurrent connections each delivering 100 requests per second, the memory usage was a steady 21 MB RSS. Adding a 1 minute parallel load of 40k connections all looping on 100kB objects made the memory usage climb to 938 MB before this patch. With the patch it was only 660 MB. But when this parasitic load stopped, before the patch the RSS would remain at 938 MB while with the patch, it went down to 480 then 180 MB after a few seconds, to stabilize around 69 MB after about 20 seconds. This can be particularly important to improve reloads where the memory has to be shared between the old and new process. Another improvement would be welcome, we ought to have a periodic task to check pools usage and continue to free up unused objects regardless of any call to pool_free(), because the needed_avg value depends on the past and will not cover recently refilled objects.	2020-05-27 08:32:42 +02:00
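The release decision described in this message can be sketched as follows. This is a simplified illustration only: `struct toy_pool` and both functions are hypothetical, the free list is a plain singly linked list, and no locking or thread-local cache is modelled. Only the 125% threshold comes from the message itself.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical, simplified pool bookkeeping (not the real structure). */
struct toy_pool {
	unsigned int allocated;  /* objects currently obtained from the OS */
	unsigned int needed_avg; /* estimate of the objects really needed */
	void *free_list;         /* idle objects, linked through their storage */
};

/* Returns non-zero when <allocated> is at least 125% of <needed_avg>. */
static int toy_pool_is_crowded(const struct toy_pool *p)
{
	/* allocated >= 1.25 * needed_avg, computed with integers only */
	return (unsigned long long)p->allocated * 4 >=
	       (unsigned long long)p->needed_avg * 5;
}

/* Releases <ptr>, which must be at least sizeof(void *) bytes large.
 * Returns 1 if the object went back to the OS, 0 if the pool kept it.
 */
static int toy_pool_free(struct toy_pool *p, void *ptr)
{
	assert(ptr != NULL);

	if (toy_pool_is_crowded(p)) {
		free(ptr);
		p->allocated--;
		return 1;
	}
	*(void **)ptr = p->free_list; /* reuse the object as a list link */
	p->free_list = ptr;
	return 0;
}
```

Because the check runs only inside the free path, memory is returned only while objects are still being released, which is the limitation the message points out when it suggests a periodic task.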
Willy Tarreau	a1e4f8c27c	MINOR: pools: compute an estimate of each pool's average needed objects This adds a sliding estimate of the pools' usage. The goal is to be able to use this to start to more aggressively free memory instead of keeping lots of unused objects in pools. The average is calculated as a sliding average over the last 1024 consecutive measures of ->used during calls to pool_free(), and is bumped up for 1/4 of its history from ->allocated when allocation from the pool fails and results in a call to malloc(). The result is a floating value between ->used and ->allocated, that tries to react fast to under-estimates that result in expensive malloc() but still maintains itself well in case of stable usage, and progressively goes down if usage shrinks over time. This new metric is reported as "needed_avg" in "show pools". Sadly due to yet another include dependency hell, we couldn't reuse the functions from freq_ctr.h so they were temporarily duplicated into memory.h.	2020-05-27 08:32:42 +02:00
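A sliding average of this kind can be kept in a single integer, as sketched below. The sketch assumes a fixed-point sum that holds roughly the window size times the average, which is a common way to build such estimators; the function names are hypothetical, and the weighted update in `avg_bump()` is one possible way to express "bumped up for 1/4 of its history", not necessarily the exact formula used in the patch.

```c
#include <assert.h>

#define AVG_SAMPLES 1024u /* window size quoted in the commit message */

/* <sum> holds about AVG_SAMPLES times the average. Each new sample
 * removes one average-sized share (rounded up) and adds the value.
 */
static unsigned int avg_add(unsigned int *sum, unsigned int v)
{
	*sum = *sum - (*sum + AVG_SAMPLES - 1) / AVG_SAMPLES + v;
	return *sum;
}

/* Applies <v> as if it had been observed <weight> times in a row,
 * e.g. weight = AVG_SAMPLES / 4 to replace a quarter of the history.
 */
static unsigned int avg_bump(unsigned int *sum, unsigned int v,
                             unsigned int weight)
{
	unsigned long long drop;

	assert(weight <= AVG_SAMPLES);
	drop = ((unsigned long long)*sum * weight + AVG_SAMPLES - 1) / AVG_SAMPLES;
	*sum = *sum - (unsigned int)drop + v * weight;
	return *sum;
}

/* Returns the current average, rounded up. */
static unsigned int avg_get(unsigned int sum)
{
	return (sum + AVG_SAMPLES - 1) / AVG_SAMPLES;
}
```

Under these assumptions, feeding the measured usage through `avg_add()` on every release makes the estimate drift slowly toward it, while a single `avg_bump()` after a failed allocation pulls the estimate quickly toward the allocated count.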
Emeric Brun	99c453df9d	MEDIUM: ring: new section ring to declare custom ring buffers. It is possible to globally declare ring-buffers, to be used as target for log servers or traces. ring <ringname> Creates a new ring-buffer with name <ringname>. description <text> The description is an optional description string of the ring. It will appear on CLI. By default, <name> is reused to fill this field. format <format> Format used to store events into the ring buffer. Arguments: <format> is the log format used when generating syslog messages. It may be one of the following : iso A message containing only the ISO date, followed by the text. The PID, process name and system name are omitted. This is designed to be used with a local log server. raw A message containing only the text. The level, PID, date, time, process name and system name are omitted. This is designed to be used in containers or during development, where the severity only depends on the file descriptor used (stdout/stderr). This is the default. rfc3164 The RFC3164 syslog message format. This is the default. (https://tools.ietf.org/html/rfc3164) rfc5424 The RFC5424 syslog message format. (https://tools.ietf.org/html/rfc5424) short A message containing only a level between angle brackets such as '<3>', followed by the text. The PID, date, time, process name and system name are omitted. This is designed to be used with a local log server. This format is compatible with what the systemd logger consumes. timed A message containing only a level between angle brackets such as '<3>', followed by ISO date and by the text. The PID, process name and system name are omitted. This is designed to be used with a local log server. maxlen <length> The maximum length of an event message stored into the ring, including formatted header. If an event message is longer than <length>, it will be truncated to this length. size <size> This is the optional size in bytes for the ring-buffer. Default value is set to BUFSIZE. 
Example: global log ring@myring local7 ring myring description "My local buffer" format rfc3164 maxlen 1200 Note: ring names are resolved during post configuration processing.	2020-05-26 08:03:15 +02:00
Tim Duesterhus	b4fac1eb3c	MINOR: vars: Make vars_(un\|)set_by_name(_ifexist\|) return a success value Change the return type from `void` to `int` and return whether setting the variable was successful.	2020-05-25 08:12:27 +02:00
Tim Duesterhus	7329327333	CLEANUP: vars: Remove void vars_unset_by_name(const char, size_t, struct sample) With "MINOR: lua: Use vars_unset_by_name_ifexist()" the last user was removed and as outlined in that commit there is no good reason for this function to exist. May be backported together with the commit mentioned above.	2020-05-25 08:12:23 +02:00
Willy Tarreau	0ff9b3d64f	BUILD: hpack: make sure the hpack table can still be built standalone Recent commit `2bdcc70fa7` ("MEDIUM: hpack: use a pool for the hpack table") made the hpack code finally use a pool with very unintrusive code that was assumed to be trivial enough to adjust if the code needed to be reused outside of haproxy. Unfortunately the code in contrib/hpack already uses it and broke the oss-fuzz tests as it doesn't build anymore. This patch adds an HPACK_STANDALONE macro to decide if we should use the pools or malloc+free. The resulting macros are called hpack_alloc() and hpack_free() respectively, and the size must be passed into the pool itself.	2020-05-22 12:13:43 +02:00
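The switch described in this message can be sketched with a pair of macros selected at build time. Everything below is a hypothetical stand-in: `TOY_STANDALONE` plays the role of HPACK_STANDALONE, and the non-standalone branch only declares placeholder pool functions instead of using the real pool API. In the standalone build the "pool" is simply the object size, which matches the remark that the size must be passed into the pool itself.

```c
#include <assert.h>
#include <stdlib.h>

#define TOY_STANDALONE 1 /* hypothetical stand-in for HPACK_STANDALONE */

#ifdef TOY_STANDALONE
/* No pool layer available: the "pool" argument is just the object size. */
#define toy_alloc(pool)     malloc(pool)
#define toy_free(pool, ptr) free(ptr)
static const size_t toy_tbl_pool = 4096;
#else
/* Inside the main program a pool allocator would be used instead. These
 * declarations are placeholders and are not defined in this sketch.
 */
struct toy_pool_head;
void *toy_pool_alloc(struct toy_pool_head *pool);
void toy_pool_release(struct toy_pool_head *pool, void *ptr);
#define toy_alloc(pool)     toy_pool_alloc(pool)
#define toy_free(pool, ptr) toy_pool_release(pool, ptr)
extern struct toy_pool_head *toy_tbl_pool;
#endif

/* The calling code is identical in both builds. */
static void *toy_table_new(void)
{
	return toy_alloc(toy_tbl_pool);
}

static void toy_table_release(void *tbl)
{
	assert(tbl != NULL);
	toy_free(toy_tbl_pool, tbl);
}
```

Keeping the choice inside two macros means the table code itself needs no conditional compilation, so a standalone tool can build it without the pool layer.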
Christopher Faulet	3b967c1210	MINOR: http-htx/proxy: Add http-error directive using http return syntax The http-error directive can now be used instead of errorfile to define an error message in a proxy section (including default sections). This directive uses the same syntax as http return rules. The only real difference is the limitation on status code that may be specified. Only status codes supported by errorfile directives are supported for this new directive. Parsing of errorfile directive remains independent from http-error parsing. But functionally, it may be expressed in terms of http-error : errorfile <status> <file> ==> http-error status <status> errorfile <file>	2020-05-20 18:27:14 +02:00
Christopher Faulet	963ce5bc06	CLEANUP: channel: Remove channel_htx_copy_msg() function This function is now unused. So it is removed.	2020-05-20 18:27:14 +02:00
Christopher Faulet	2056736453	MINOR: htx: Add a function to copy a buffer in an HTX message The htx_copy_msg() function can now be used to copy the HTX message stored in a buffer in an existing HTX message. It takes care to not overwrite existing data. If the destination message is empty, a raw copy is performed. All the message is copied or nothing. This function is used instead of channel_htx_copy_msg().	2020-05-20 18:27:14 +02:00
Christopher Faulet	f1fedc3cce	CLEANUP: http-htx: Remove unused storage of error messages in buffers Now, error messages are all stored in http replies. So the storage as a buffer can safely be removed.	2020-05-20 18:27:14 +02:00
Christopher Faulet	8dfeccf6d3	MEDIUM: http-ana: Use http replies for HTTP error messages When HAProxy returns an http error message, the corresponding http reply is now used instead of the buffer containing the corresponding HTX message. So, http_error_message() function now returns the http reply to use for a given stream. And the http_reply_and_close() function now relies on http_reply_message() to send the response to the client.	2020-05-20 18:27:14 +02:00
Christopher Faulet	507479b096	MINOR: http-ana: Use a TXN flag to prevent after-response ruleset evaluation The txn flag TX_CONST_REPLY may now be used to prevent after-response ruleset evaluation. It is used if this ruleset evaluation failed on an internal error response. Before, it was done incrementing the parameter <final>. But it is not really convenient if an intermediary function is used to produce the response. Using a txn flag could also be a good way to prevent after-response ruleset evaluation in a different context.	2020-05-20 18:27:13 +02:00
Christopher Faulet	e29a97e51a	MINOR: http-htx: Use http reply from the http-errors section When an http reply is configured to use an error message from an http-errors section, instead of referencing the error message, the http reply is used. To do so the new http reply type HTTP_REPLY_INDIRECT has been added.	2020-05-20 18:27:13 +02:00
Christopher Faulet	40e8569676	MINOR: proxy: Add references on http replies for proxy error messages Error messages defined in proxy section or inherited from a default section are now also referenced using an array of http replies. This is done during the configuration validity check.	2020-05-20 18:27:13 +02:00
Christopher Faulet	5809e10b48	MINOR: http-htx: Store errorloc/errorfile messages in http replies During configuration parsing, error messages resulting of parsing of errorloc and errorfile directives are now also stored as an http reply. So, for now, these messages are stored as a buffer and as an http reply. To be able to release all these http replies when haproxy is stopped, a global list is used. We must do that because the same http reply may be referenced several times by different proxies if it is defined in a default section.	2020-05-20 18:27:13 +02:00
Christopher Faulet	de30bb7245	MINOR: http-htx: Store messages of an http-errors section in a http reply array Error messages specified in an http-errors section are now also stored in an array of http replies. So, for now, these messages are stored as a buffer and as a http reply.	2020-05-20 18:27:13 +02:00
Christopher Faulet	1b13ecaca2	MINOR: http-htx: Store default error messages in a global http reply array Default error messages are stored as a buffer, in http_err_chunks global array. Now, they are also stored as a http reply, in http_err_replies global array.	2020-05-20 18:27:13 +02:00
Christopher Faulet	5cb513abeb	MEDIUM: http-rules: Rely on http reply for http deny/tarpit rules "http-request deny", "http-request tarpit" and "http-response deny" rules now use the same syntax as http return rules and internally rely on the http replies. The behaviour is not the same when no argument is specified (or only the status code). For http replies, a dummy response is produced, with no payload. For old deny/tarpit rules, the proxy's error messages are used. Thus, to be compatible with existing configurations, the "default-errorfiles" parameter is implied. For instance, "http-request deny deny_status 404" is now an alias of "http-request deny status 404 default-errorfiles".	2020-05-20 18:27:13 +02:00
Christopher Faulet	0e2ad61315	MINOR: http-ana: Use a dedicated function to send a response from an http reply The http_reply_message() function may be used to send an http reply to a client. This function is responsible for converting the reply to HTX, pushing it into the response buffer and forwarding it to the client. It is also responsible for terminating the transaction. This function is used during evaluation of http return rules.	2020-05-20 18:27:13 +02:00
Christopher Faulet	7eea241c39	MINOR: http-htx: Use a dedicated function to check http reply validity A dedicated function is added to check the validity of an http reply object, after parsing. It is used to check the validity of http return rules. For now, this function is only used to find the right error message in an http-errors section for http replies of type HTTP_REPLY_ERRFILES (using "errorfiles" argument). On success, such replies are updated to point to the corresponding error message and their type is set to HTTP_REPLY_ERRMSG. If an unknown http-errors section is referenced, an error is returned. If an unknown error message is referenced inside an existing http-errors section, a warning is emitted and the proxy's error messages are used instead.	2020-05-20 18:27:13 +02:00
Christopher Faulet	47e791e220	MINOR: http-htx: Use a dedicated function to parse http reply arguments A dedicated function to parse arguments and create an http_reply object is added. It is used to parse http return rule. Thus, following arguments are parsed by this function : ... [status <code>] [content-type <type>] [ { default-errorfiles \| errorfile <file> \| errorfiles <name> \| file <file> \| lf-file <file> \| string <str> \| lf-string <fmt> } ] [ hdr <name> <fmt> ]* Because the status code argument is optional, a default status code must be defined when this function is called.	2020-05-20 18:27:13 +02:00
Christopher Faulet	18630643a9	MINOR: http-htx: Use a dedicated function to release http_reply objects A function to release an http_reply object has been added. It is now called when an http return rule is released.	2020-05-20 18:27:13 +02:00
Christopher Faulet	5ff0c64921	MINOR: http-rules: Use http_reply structure for http return rules No real change here. Instead of using an internal structure to the action rule, the http return rules are now stored as an http reply. The main change is about the action type. It is now always set to ACT_CUSTOM. The http reply type is used to know how to evaluate the rule.	2020-05-20 18:27:13 +02:00
Christopher Faulet	b6ea17c6fc	CLEANUP: http-htx: Rename http_error structure into http_error_msg The structure owns an error message, most of time loaded from a file, and converted to HTX. It is created when an errorfile or errorloc directive is parsed. It is renamed to avoid ambiguities with http_reply structure.	2020-05-20 18:27:13 +02:00
Christopher Faulet	7bd3de06e7	MINOR: http-htx: Add http_reply type based on what is used for http return rules The http_reply structure is added. It represents a generic HTTP message used as internal response by HAProxy. It is based on the structure used to store http return rules. The aim is to store all error messages using this structure, as well as http return and http deny rules.	2020-05-20 18:27:13 +02:00
Christopher Faulet	a53abad42d	CLEANUP: http_ana: Remove unused TXN flags TX_CLDENY, TX_CLALLOW, TX_SVDENY and TX_SVALLOW flags are unused. Only TX_CLTARPIT is used to make the difference between an http deny rule and an http tarpit rule. So these unused flags are removed.	2020-05-20 18:27:13 +02:00
William Lallemand	8177ad9895	MINOR: ssl: split config and runtime variable for ssl-{min,max}-ver In the CLI command 'show ssl crt-list', the ssl-min-ver and the ssl-min-max arguments were always displayed because the dumped versions were the actual version computed and used by haproxy, instead of the version found in the configuration. To fix the problem, this patch separates the variables to have one with the configured version, and one with the actual version used. The dump only shows the configured version.	2020-05-20 16:49:02 +02:00
Willy Tarreau	d68a6927f7	Revert "MEDIUM: sink: add global statement to create a new ring (sink buffer)" This reverts commit `957ec59571`. As discussed with Emeric, the current syntax is not extensible enough, this will be turned to a section instead in a forthcoming patch.	2020-05-20 12:06:16 +02:00
Willy Tarreau	928068a74b	MINOR: ring: make the applet code not depend on the CLI The ring to applet communication was only made to deal with CLI functions but it's generic. Let's have generic appctx functions and have the CLI rely on these instead. This patch introduces ring_attach_appctx() and ring_detach_appctx().	2020-05-19 19:37:12 +02:00
Willy Tarreau	9597cbd17a	MINOR: applet: adopt the wait list entry from the CLI A few fields, including a generic list entry, were added to the CLI context by commit `300decc8d9` ("MINOR: cli: extend the CLI context with a list and two offsets"). It turns out that the list entry (l0) is solely used to consult rings and that the generic ring_write() code is restricted to a consumer on the CLI due to this, which was not the initial intent. Let's make it a general purpose wait_entry field that is properly initialized during appctx_init(). This will allow any applet to wait on a ring, not just the CLI.	2020-05-19 19:37:12 +02:00
Willy Tarreau	2bdcc70fa7	MEDIUM: hpack: use a pool for the hpack table Instead of using malloc/free to allocate an HPACK table, let's declare a pool. However the HPACK size is configured by the H2 mux, so it's also this one which allocates it after post_check.	2020-05-19 11:40:39 +02:00
Emeric Brun	957ec59571	MEDIUM: sink: add global statement to create a new ring (sink buffer) This patch adds the new global statement: ring <name> [desc <desc>] [format <format>] [size <size>] [maxlen <length>] Creates a named ring buffer which could be used on log line for instance. <desc> is an optional description string of the ring. It will appear on CLI. By default, <name> is reused to fill this field. <format> is the log format used when generating syslog messages. It may be one of the following : iso A message containing only the ISO date, followed by the text. The PID, process name and system name are omitted. This is designed to be used with a local log server. raw A message containing only the text. The level, PID, date, time, process name and system name are omitted. This is designed to be used in containers or during development, where the severity only depends on the file descriptor used (stdout/stderr). This is the default. rfc3164 The RFC3164 syslog message format. This is the default. (https://tools.ietf.org/html/rfc3164) rfc5424 The RFC5424 syslog message format. (https://tools.ietf.org/html/rfc5424) short A message containing only a level between angle brackets such as '<3>', followed by the text. The PID, date, time, process name and system name are omitted. This is designed to be used with a local log server. This format is compatible with what the systemd logger consumes. timed A message containing only a level between angle brackets such as '<3>', followed by ISO date and by the text. The PID, process name and system name are omitted. This is designed to be used with a local log server. <length> is the maximum length of event message stored into the ring, including formatted header. If the event message is longer than <length>, it would be truncated to this length. <name> is the ring identifier, which follows the same naming convention as proxies and servers. <size> is the optional size in bytes. Default value is set to BUFSIZE. 
Note: Historically sink's name and desc were refs on const strings. But with new configurable rings a dynamic allocation is needed.	2020-05-19 11:04:11 +02:00
Emeric Brun	e709e1e777	MEDIUM: logs: buffer targets now rely on new sink_write Before this patch, they relied directly on ring_write, bypassing a part of the sink API. Now the maxlen parameter of the log will apply only to the text message part (and not the header; for this you would prefer to use the maxlen parameter on the sink/ring). The sink_write prototype was also reviewed to return the number of bytes written, to be compliant with the other write functions.	2020-05-19 11:04:11 +02:00
Emeric Brun	bd163817ed	MEDIUM: sink: build header in sink_write for log formats This patch extends the sink_write prototype and code to handle the rfc5424 and rfc3164 header. It uses header building tools from log.c. Doing this, some functions/vars have been externalized. facility and minlevel have been removed from the struct sink and passed as args to sink_write because they depend on the log and not on the sink (they remained unused by the rest of the code until now).	2020-05-19 11:04:11 +02:00
William Dauchy	1665c43fd8	BUILD: ssl: include buffer common headers for ssl_sock_ctx since commit `c0cdaffaa3` ("REORG: ssl: move ssl_sock_ctx and fix cross-dependencies issues"), `struct ssl_sock_ctx` was moved in ssl_sock.h. As it contains a `struct buffer`, including `common/buffer.h` is now mandatory. I encountered an issue while including ssl_sock.h on another patch: include/types/ssl_sock.h:240:16: error: field ‘early_buf’ has incomplete type 240 \| struct buffer early_buf; /* buffer to store the early data received */ no backport needed. Fixes: `c0cdaffaa3` ("REORG: ssl: move ssl_sock_ctx and fix cross-dependencies issues") Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-05-18 08:29:32 +02:00
Marcin Deranek	4dc2b57d51	MINOR: stats: Prepare for more accurate moving averages Add swrate_add_dynamic function which is similar to swrate_add, but more accurate when calculating moving averages when not enough samples have been processed yet.	2020-05-16 22:40:00 +02:00
William Lallemand	6a66a5ec9b	REORG: ssl: move utility functions to src/ssl_utils.c These functions are mainly used to extract information from certificates.	2020-05-15 14:11:54 +02:00
William Lallemand	15e169447d	REORG: ssl: move sample fetches to src/ssl_sample.c Move all SSL sample fetches to src/ssl_sample.c.	2020-05-15 14:11:54 +02:00
William Lallemand	c0cdaffaa3	REORG: ssl: move ssl_sock_ctx and fix cross-dependencies issues In order to move all SSL sample fetches in another file, moving the ssl_sock_ctx definition in a .h file is required. Unfortunately it became a cross dependencies hell to solve, because of the struct wait_event field, so <types/connection.h> is needed which created other problems.	2020-05-15 14:11:54 +02:00
William Lallemand	ef76107a4b	MINOR: ssl: remove static keyword in some SSL utility functions In order to move the sample fetches to another file, remove the static keyword of some utility functions in the SSL fetches.	2020-05-15 14:11:54 +02:00
William Lallemand	dad3105157	REORG: ssl: move ssl configuration to cfgparse-ssl.c Move all the configuration parsing of the ssl keywords in cfgparse-ssl.c	2020-05-15 14:11:54 +02:00
William Lallemand	da8584c1ea	REORG: ssl: move the CLI 'cert' functions to src/ssl_ckch.c Move the 'ssl cert' CLI functions to src/ssl_ckch.c.	2020-05-15 14:11:54 +02:00
William Lallemand	c756bbd3df	REORG: ssl: move the crt-list CLI functions in src/ssl_crtlist.c Move the crtlist functions for the CLI to src/ssl_crtlist.c	2020-05-15 14:11:54 +02:00
William Lallemand	03c331c80a	REORG: ssl: move the ckch_store related functions to src/ssl_ckch.c Move the cert_key_and_chain functions: int ssl_sock_load_files_into_ckch(const char path, struct cert_key_and_chain ckch, char *err); int ssl_sock_load_pem_into_ckch(const char path, char buf, struct cert_key_and_chain ckch , char *err); void ssl_sock_free_cert_key_and_chain_contents(struct cert_key_and_chain ckch); int ssl_sock_load_key_into_ckch(const char path, char buf, struct cert_key_and_chain ckch , char err); int ssl_sock_load_ocsp_response_from_file(const char ocsp_path, char buf, struct cert_key_and_chain ckch, char *err); int ssl_sock_load_sctl_from_file(const char sctl_path, char buf, struct cert_key_and_chain ckch, char *err); int ssl_sock_load_issuer_file_into_ckch(const char path, char buf, struct cert_key_and_chain ckch, char *err); And the utility ckch_store functions: void ckch_store_free(struct ckch_store store) struct ckch_store ckch_store_new(const char filename, int nmemb) struct ckch_store ckchs_dup(const struct ckch_store src) ckch_store ckchs_lookup(char path) ckch_store ckchs_load_cert_file(char path, int multi, char **err)	2020-05-15 14:11:54 +02:00
William Lallemand	c1c50b46e9	CLEANUP: ssl: avoid circular dependencies in ssl_crtlist.h Add forward declarations in types/ssl_crtlist.h in order to avoid circular dependencies. Also remove the listener.h include which is not needed anymore.	2020-05-15 14:11:54 +02:00
William Lallemand	6e9556b635	REORG: ssl: move crtlist functions to src/ssl_crtlist.c Move the crtlist functions to src/ssl_crtlist.c and their definitions to proto/ssl_crtlist.h. The following functions were moved: /* crt-list entry functions / void ssl_sock_free_ssl_conf(struct ssl_bind_conf conf); char crtlist_dup_filters(char args, int fcount); void crtlist_free_filters(char *args); void crtlist_entry_free(struct crtlist_entry entry); struct crtlist_entry crtlist_entry_new(); / crt-list functions / void crtlist_free(struct crtlist crtlist); struct crtlist crtlist_new(const char filename, int unique); /* file loading / int crtlist_parse_line(char line, char *crt_path, struct crtlist_entry entry, const char file, int linenum, char err); int crtlist_parse_file(char file, struct bind_conf bind_conf, struct proxy curproxy, struct crtlist crtlist, char err); int crtlist_load_cert_dir(char path, struct bind_conf bind_conf, struct crtlist crtlist, char err);	2020-05-15 14:11:54 +02:00
William Lallemand	c69973f7eb	CLEANUP: ssl: add ckch prototypes in proto/ssl_ckch.h Remove the static definitions of the ckch functions and add them to ssl_ckch.h in order to use them outside ssl_sock.c.	2020-05-15 14:11:54 +02:00
William Lallemand	d4632b2b6d	REORG: ssl: move the ckch structures to types/ssl_ckch.h Move all the structures used for loading the SSL certificates in ssl_ckch.h	2020-05-15 14:11:54 +02:00
William Lallemand	be21b663cd	REORG: move the crt-list structures in their own .h Move the structure definitions specifics to the crt-list in types/ssl_crtlist.h.	2020-05-15 14:11:54 +02:00
William Lallemand	7fd8b4567e	REORG: ssl: move macros and structure definitions to ssl_sock.h The ssl_sock.c file contains a lot of macros and structure definitions that should be in a .h. Move them to the more appropriate types/ssl_sock.h file.	2020-05-15 14:11:54 +02:00
Dragan Dosen	eb607fe6a1	MINOR: ssl: add a new function ssl_sock_get_ssl_object() This one can be used later to get a SSL object from connection. It will return NULL if connection is not established over SSL.	2020-05-14 13:13:14 +02:00
Dragan Dosen	1e7ed04665	MEDIUM: ssl: allow to register callbacks for SSL/TLS protocol messages This patch adds the ability to register callbacks for SSL/TLS protocol messages by using the function ssl_sock_register_msg_callback(). All registered callback functions will be called when observing received or sent SSL/TLS protocol messages.	2020-05-14 13:13:14 +02:00
Christopher Faulet	325504cf89	BUG/MINOR: sample/ssl: Fix digest converter for openssl < 1.1.0 The EVP_MD_CTX_create() and EVP_MD_CTX_destroy() functions were renamed to EVP_MD_CTX_new() and EVP_MD_CTX_free() in OpenSSL 1.1.0, respectively. These functions are used by the digest converter, introduced by the commit `8e36651ed` ("MINOR: sample: Add digest and hmac converters"). So for prior versions of openssl, macros are used to fallback on old functions. This patch must only be backported if the commit `8e36651ed` is backported too.	2020-05-12 16:30:41 +02:00
Willy Tarreau	5778fea4da	CLEANUP: remove THREAD_LOCAL from config.h This one really ought to be defined in hathreads.h like all other thread definitions, which is what this patch does. As expected, all files but one (regex.h) were already including hathreads.h when using THREAD_LOCAL; regex.h was fixed for this. This was the last entry in config.h which is now useless.	2020-05-09 09:08:09 +02:00
Willy Tarreau	3bc4e8bfe6	CLENAUP: config: move CONFIG_HAP_LOCKLESS_POOLS out of config.h The setting of CONFIG_HAP_LOCKLESS_POOLS depending on threads and compat was done in config.h for use only in memory.h and memory.c where other settings are dealt with. Further, the default pool cache size was set there from a fixed value instead of being set from defaults.h Let's move the decision to enable lockless pools via CONFIG_HAP_LOCKLESS_POOLS to memory.h, and set the default pool cache size in defaults.h like other default settings. This was the next-to-last setting in config.h.	2020-05-09 09:02:35 +02:00
Willy Tarreau	755afc08d5	CLEANUP: config: drop unused setting CONFIG_HAP_INLINE_FD_SET CONFIG_HAP_INLINE_FD_SET was introduced in 1.3.3 and dropped in 1.3.9 when the pollers were reworked, let's remove it.	2020-05-09 08:57:48 +02:00
Willy Tarreau	571eb3d659	CLEANUP: config: drop unused setting CONFIG_HAP_MEM_OPTIM CONFIG_HAP_MEM_OPTIM was introduced with memory pools in 1.3 and dropped in 1.6 when pools became the only way to allocate memory. Still the option remained present in config.h. Let's kill it.	2020-05-09 08:53:31 +02:00
Christopher Faulet	67a234583e	CLEANUP: checks: sort and rename tcpcheck_expect_type types The same naming format is used for all expect rules. And names are sorted to be grouped by type.	2020-05-06 12:38:44 +02:00
Christopher Faulet	aaab0836d9	MEDIUM: checks: Add matching on log-format string for expect rules It is now possible to use log-format string (or hexadecimal string for the binary version) to match a content in tcp-check based expect rules. For hexadecimal log-format string, the conversion in binary is performed after the string evaluation, during health check execution. The pattern keywords to use are "string-lf" for the log-format string and "binary-lf" for the hexadecimal log-format string.	2020-05-06 08:31:29 +02:00
Willy Tarreau	a4d9ee3d1c	BUG/MINOR: threads: fix multiple use of argument inside HA_ATOMIC_UPDATE_{MIN,MAX}() Just like in previous patch, it happens that HA_ATOMIC_UPDATE_MIN() and HA_ATOMIC_UPDATE_MAX() would evaluate the (val) argument up to 3 times. However this time it affects both thread and non-thread versions. It's strange because the copy was properly performed for the (new) argument in order to avoid this. Anyway it was done for the "val" one as well. A quick code inspection showed that this currently has no effect as these macros are fairly limited in usage. It would be best to backport this for long-term stability (till 1.8) but it will not fix an existing bug.	2020-05-05 16:18:52 +02:00
Willy Tarreau	d66345d6b0	BUG/MINOR: threads: fix multiple use of argument inside HA_ATOMIC_CAS() When threads are disabled, HA_ATOMIC_CAS() becomes a simple compound expression. However this expression presents a problem, which is that its arguments are evaluated multiple times, once for the comparison and once again for the assignment. This presents a risk of performing some side-effect operations twice in the non-threaded case (e.g. in case of auto-increment or function return). The macro was rewritten using local copies for arguments like the other macros do. Fortunately a complete inspection of the code indicates that this case currently never happens. It was however responsible for the strict-aliasing warning emitted when building fd.c without threads but with 64-bit CAS. This may be backported as far as 1.8 though it will not fix any existing bug and is more of a long-term safety measure in case a future fix would depend on this behavior.	2020-05-05 16:05:45 +02:00
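The double-evaluation hazard described in d66345d6b0 can be sketched as follows. This is only an illustration under stated assumptions: EX_CAS_NAIVE, EX_CAS_SAFE and the ex_* helpers are hypothetical names, not HAProxy's HA_ATOMIC_CAS(), and the safe variant relies on GCC/Clang statement expressions and __typeof__.

```c
/* Hypothetical macros for illustration; not HAProxy's HA_ATOMIC_CAS(). */

/* Naive non-threaded CAS: (ptr) is evaluated twice on every path, once for
 * the comparison and once for the assignment or the write-back to (old).
 */
#define EX_CAS_NAIVE(ptr, old, newval) \
	((*(ptr) == *(old)) ? (*(ptr) = (newval), 1) : (*(old) = *(ptr), 0))

/* Each argument is copied into a local exactly once; only the copies are
 * used afterwards. __typeof__ does not evaluate its operand.
 */
#define EX_CAS_SAFE(ptr, old, newval)                          \
	({                                                     \
		__typeof__(ptr) p_ = (ptr);                    \
		__typeof__(old) o_ = (old);                    \
		__typeof__(*(ptr)) n_ = (newval);              \
		(*p_ == *o_) ? (*p_ = n_, 1) : (*o_ = *p_, 0); \
	})

static int ex_calls; /* number of times ex_counted() was called */

/* Argument with a visible side effect: counts its own evaluations. */
static int *ex_counted(int *p)
{
	ex_calls++;
	return p;
}

/* Returns how many times the naive macro evaluated its first argument. */
int ex_naive_evals(void)
{
	int v = 1, expected = 1;

	ex_calls = 0;
	(void)EX_CAS_NAIVE(ex_counted(&v), &expected, 5);
	return ex_calls;
}

/* Returns how many times the safe macro evaluated its first argument. */
int ex_safe_evals(void)
{
	int v = 1, expected = 1;

	ex_calls = 0;
	(void)EX_CAS_SAFE(ex_counted(&v), &expected, 5);
	return ex_calls;
}
```

With an argument such as `ptr++` or a function call, the naive form would apply the side effect twice, which is the risk the commit message describes.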
Baptiste Assmann	0e9d87bf06	MINOR: istbuf: add ist2buf() function Purpose of this function is to build a <struct buffer> from a <struct ist>.	2020-05-05 15:28:59 +02:00
Baptiste Assmann	de80201460	MINOR: ist: add istissame() function The istissame() function takes 2 ist and compares their <.ptr> and <.len> values respectively. It returns non-zero if they are the same.	2020-05-05 15:28:59 +02:00
Baptiste Assmann	9ef1967af7	MINOR: ist: add istadv() function The purpose of istadv() function is to move forward <.ptr> by <nb> characters. It is very useful when parsing a payload.	2020-05-05 15:28:59 +02:00
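A minimal sketch of the two helpers described in de80201460 and 9ef1967af7, assuming an ist is a pointer plus a length. The ex_ prefix marks these as hypothetical stand-ins rather than the real definitions, and clamping the advance to the string length is an assumption of this sketch.

```c
#include <stddef.h>

/* Hypothetical stand-in for an indirect string: pointer plus length. */
struct ex_ist {
	char *ptr;
	size_t len;
};

/* Non-zero if both ist designate the same memory area: same pointer and
 * same length. The contents are not compared.
 */
static inline int ex_istissame(struct ex_ist a, struct ex_ist b)
{
	return a.ptr == b.ptr && a.len == b.len;
}

/* Returns <s> moved forward by <nb> characters. The advance is clamped to
 * the remaining length (assumption of this sketch) so the result never
 * points past the end of the original area.
 */
static inline struct ex_ist ex_istadv(struct ex_ist s, size_t nb)
{
	if (nb > s.len)
		nb = s.len;
	s.ptr += nb;
	s.len -= nb;
	return s;
}
```

When parsing a payload, a caller would typically consume a token and then advance past it, for example skipping "GET " in a request line with `ex_istadv(line, 4)`.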
Christopher Faulet	3970819a55	MEDIUM: checks: Support matching on headers for http-check expect rules It is now possible to add http-check expect rules matching HTTP header names and values. Here is the format of these rules: http-check expect header name [ -m <meth> ] <name> [log-format] \ [ value [ -m <meth> ] <value> [log-format] [full] ] The name pattern (name ...) is mandatory but the value pattern (value ...) is optional. If not specified, only the header presence is verified. <meth> is the matching method, applied on the header name or the header value. Supported matching methods are: * "str" (exact match) * "beg" (prefix match) * "end" (suffix match) * "sub" (substring match) * "reg" (regex match) If not specified, exact matching method is used. If the "log-format" option is used, the pattern (<name> or <value>) is evaluated as a log-format string. This option cannot be used with the regex matching method. Finally, by default, the header value is considered as comma-separated list. Each part may be tested. The "full" option may be used to test the full header line. Note that matchings are case insensitive on the header names.	2020-05-05 11:19:27 +02:00
Christopher Faulet	8dd33e13a5	MINOR: http-htx: Support different methods to look for header names It is now possible to use different matching methods to look for header names in an HTTP message: * The exact match. It is the default method. http_find_header() uses this method. http_find_str_header() is an alias. * The prefix match. It evals the header names starting by a prefix. http_find_pfx_header() must be called to use this method. * The suffix match. It evals the header names ending by a suffix. http_find_sfx_header() must be called to use this method. * The substring match. It evals the header names containing a string. http_find_sub_header() must be called to use this method. * The regex match. It evals the header names matching a regular expression. http_match_header() must be called to use this method.	2020-05-05 11:07:00 +02:00
Christopher Faulet	778f5ed478	MEDIUM: checks/http-fetch: Support htx prefetch from a check for HTTP samples Some HTTP sample fetches will be accessible from the context of a http-check health check. Thus, the prefetch function responsible to return the HTX message has been updated to handle a check, in addition to a channel. Both cannot be used at the same time. So there is no ambiguity.	2020-05-05 11:06:43 +02:00
Willy Tarreau	86c6a9221a	BUG/MEDIUM: shctx: bound the number of loops that can happen around the lock Given that a "count" value of 32M was seen in _shctx_wait4lock(), it is very important to prevent this from happening again. It's absolutely essential to prevent the value from growing unbounded because with an increase of the number of threads, the number of successive failed attempts will necessarily grow. Instead now we're scanning all 2^p-1 values from 3 to 255 and are bounding the count to 255 so that in the worst case each thread tries an xchg every 255 failed read attempts. That's one every 4 on average per thread when there are 64 threads, which corresponds to the initial count of 4 for the first attempt so it seems like a reasonable value to keep a low latency. The bug was introduced with the shctx entries in 1.5 so the fix must be backported to all versions. Before 1.8 the function was called _shared_context_wait4lock() and was in shctx.c.	2020-05-01 13:32:20 +02:00
Willy Tarreau	3801bdc3fc	BUG/MEDIUM: shctx: really check the lock's value while waiting Jérôme reported an amazing crash in the spinlock version of _shctx_wait4lock() with an extremely high <count> value of 32M! The root cause is that the function cannot deal with contention on the lock at all because it forgets to check if the lock's value has changed! As such, every time it's called due to a contention, it waits twice as long before trying again and lets the caller check for the contention by itself. The correct thing to do is to compare the value again at each loop. This way it makes sure to mostly perform read accesses on the shared cache line without writing too often, and to be ready fast enough to try to grab the lock. And we must not increase the count on success either! Unfortunately I'd have expected to see a performance boost on the cache with this but there was absolutely no change, so it's very likely that these issues only happen once in a while and are sufficient to derail the process when they strike, but not to have a permanent performance impact. The bug was introduced with the shctx entries in 1.5 so the fix must be backported to all versions. Before 1.8 the function was called _shared_context_wait4lock() and was in shctx.c.	2020-05-01 13:29:14 +02:00
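The waiting strategy described in 3801bdc3fc and 86c6a9221a (re-read the lock on each loop, do not grow the delay on success, cap the delay at 255) could look roughly like the sketch below. It uses C11 atomics and a hypothetical ex_wait4lock() helper; it is not the actual _shctx_wait4lock() code, and the starting value of the counter is left to the caller.

```c
#include <stdatomic.h>

/* Hypothetical helper. Spins for at most <*count> read attempts while
 * <lock> still holds <seen>, the value observed when the caller failed to
 * take the lock. Only reads are performed, so the shared cache line is not
 * written while waiting.
 */
void ex_wait4lock(unsigned int *count, atomic_uint *lock, unsigned int seen)
{
	unsigned int i;

	for (i = 0; i < *count; i++) {
		/* The lock's value changed: let the caller retry right away
		 * and leave the delay untouched.
		 */
		if (atomic_load_explicit(lock, memory_order_relaxed) != seen)
			return;
	}

	/* Still contended: move to the next 2^p-1 value (3, 7, 15, ...) and
	 * never exceed 255.
	 */
	*count = (*count << 1) + 1;
	if (*count > 255)
		*count = 255;
}
```

The cap keeps the number of failed reads between two exchange attempts bounded regardless of how many threads contend for the lock.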
Willy Tarreau	f0e5da20e1	BUG/MINOR: debug: properly use long long instead of long for the thread ID I changed my mind twice on this one and pushed after the last test with threads disabled, without re-enabling long long, causing this rightful build warning. This needs to be backported if the previous commit `ff64d3b027` ("MINOR: threads: export the POSIX thread ID in panic dumps") is backported as well.	2020-05-01 12:26:03 +02:00
Willy Tarreau	ff64d3b027	MINOR: threads: export the POSIX thread ID in panic dumps It is very difficult to map a panic dump against a gdb thread dump because the thread numbers do not match. However gdb provides the pthread ID but this one is supposed to be opaque and not to be cast to a scalar. This patch provides a function, ha_get_pthread_id() which retrieves the pthread ID of the indicated thread and casts it to an unsigned long long so as to lose the least possible amount of information from it. This is done cleanly using a union to maintain alignment so as long as these IDs are stored on 1..8 bytes they will be properly reported. This ID is now presented in the panic dumps so it now becomes possible to map these threads. When threads are disabled, zero is returned. For example, this is a panic dump: Thread 1 is about to kill the process. >Thread 1 : id=0x7fe92b825180 act=0 glob=0 wq=1 rq=0 tl=0 tlsz=0 rqsz=0 stuck=1 prof=0 harmless=0 wantrdv=0 cpu_ns: poll=5119122 now=2009446995 diff=2004327873 curr_task=0xc99bf0 (task) calls=4 last=0 fct=0x592440(task_run_applet) ctx=0xca9c50(<CLI>) strm=0xc996a0 src=unix fe=GLOBAL be=GLOBAL dst=<CLI> rqf=848202 rqa=0 rpf=80048202 rpa=0 sif=EST,200008 sib=EST,204018 af=(nil),0 csf=0xc9ba40,8200 ab=0xca9c50,4 csb=(nil),0 cof=0xbf0e50,1300:PASS(0xc9cee0)/RAW((nil))/unix_stream(20) cob=(nil),0:NONE((nil))/NONE((nil))/NONE(0) call trace(20): \| 0x59e4cf [48 83 c4 10 5b 5d 41 5c]: wdt_handler+0xff/0x10c \| 0x7fe92c170690 [48 c7 c0 0f 00 00 00 0f]: libpthread:+0x13690 \| 0x7ffce29519d9 [48 c1 e2 20 48 09 d0 48]: linux-vdso:+0x9d9 \| 0x7ffce2951d54 [eb d9 f3 90 e9 1c ff ff]: linux-vdso:__vdso_gettimeofday+0x104/0x133 \| 0x57b484 [48 89 e6 48 8d 7c 24 10]: main+0x157114 \| 0x50ee6a [85 c0 75 76 48 8b 55 38]: main+0xeaafa \| 0x50f69c [48 63 54 24 20 85 c0 0f]: main+0xeb32c \| 0x59252c [48 c7 c6 d8 ff ff ff 44]: task_run_applet+0xec/0x88c Thread 2 : id=0x7fe92b6e6700 act=0 glob=0 wq=0 rq=0 tl=0 tlsz=0 rqsz=0 stuck=0 prof=0 harmless=1 
wantrdv=0 cpu_ns: poll=786738 now=1086955 diff=300217 curr_task=0 Thread 3 : id=0x7fe92aee5700 act=0 glob=0 wq=0 rq=0 tl=0 tlsz=0 rqsz=0 stuck=0 prof=0 harmless=1 wantrdv=0 cpu_ns: poll=828056 now=1129738 diff=301682 curr_task=0 Thread 4 : id=0x7fe92a6e4700 act=0 glob=0 wq=0 rq=0 tl=0 tlsz=0 rqsz=0 stuck=0 prof=0 harmless=1 wantrdv=0 cpu_ns: poll=818900 now=1153551 diff=334651 curr_task=0 And this is the gdb output: (gdb) info thr Id Target Id Frame 1 Thread 0x7fe92b825180 (LWP 15234) 0x00007fe92ba81d6b in raise () from /lib64/libc.so.6 2 Thread 0x7fe92b6e6700 (LWP 15235) 0x00007fe92bb56a56 in epoll_wait () from /lib64/libc.so.6 3 Thread 0x7fe92a6e4700 (LWP 15237) 0x00007fe92bb56a56 in epoll_wait () from /lib64/libc.so.6 4 Thread 0x7fe92aee5700 (LWP 15236) 0x00007fe92bb56a56 in epoll_wait () from /lib64/libc.so.6 We can clearly see that while threads 1 and 2 are the same, gdb's threads 3 and 4 respectively are haproxy's threads 4 and 3. This may be backported to 2.0 as it removes some confusion in github issues.	2020-05-01 11:45:56 +02:00
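The union-based conversion mentioned in ff64d3b027 can be sketched as below. This is an assumption-laden illustration: ex_pthread_id() is a hypothetical name that takes the pthread_t directly, whereas the commit describes ha_get_pthread_id() as taking a thread number, and the exact member selection is a guess at how "1..8 bytes" could be handled.

```c
#include <pthread.h>
#include <string.h>

/* Hypothetical helper: converts an opaque pthread_t to an integer without
 * casting it, by reading it back through the union member of matching size.
 */
unsigned long long ex_pthread_id(pthread_t t)
{
	union {
		pthread_t t;
		unsigned long long ll;
		unsigned int i;
		unsigned short s;
		unsigned char c;
	} u;

	/* Clear everything first so unused bytes read as zero. */
	memset(&u, 0, sizeof(u));
	u.t = t;

	if (sizeof(u.t) == sizeof(u.ll))
		return u.ll;
	if (sizeof(u.t) == sizeof(u.i))
		return u.i;
	if (sizeof(u.t) == sizeof(u.s))
		return u.s;
	if (sizeof(u.t) == sizeof(u.c))
		return u.c;

	/* Other sizes: best effort, may be truncated or byte-order dependent. */
	return u.ll;
}
```

Printing the result with `%#llx` yields values in the same form as the `id=0x7fe92b825180` fields in the dump above, which can then be compared with gdb's "Thread 0x..." column.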
Christopher Faulet	dc75d577b9	CLEANUP: checks: Fix checks includes	2020-04-29 13:32:29 +02:00
Christopher Faulet	1543d44607	MINOR: http-htx: Export functions to update message authority and host These functions will be used by HTTP health checks when a request is formatted before sending it.	2020-04-29 13:32:29 +02:00
Damien Claisse	57c8eb939d	MINOR: log: Add "Tu" timer It can be sometimes useful to measure total time of a request as seen from an end user, including TCP/TLS negotiation, server response time and transfer time. "Tt" currently provides something close to that, but it also takes client idle time into account, which is problematic for keep-alive requests as idle time can be very long. "Ta" is also not sufficient as it hides TCP/TLS negotiation time. To improve that, introduce a "Tu" timer, without idle time and everything else. It roughly estimates time spent from user point of view (without DNS resolution time), assuming network latency is the same in both directions.	2020-04-28 16:30:13 +02:00
Christopher Faulet	bfb0f72d52	BUG/MEDIUM: sessions: Always pass the mux context as argument to destroy a mux This bug was introduced by the commit `2444aa5b` ("MEDIUM: sessions: Don't be responsible for connections anymore."). In session_check_idle_conn(), when the mux is destroyed, its context must be passed as argument instead of the connection. It is a 2.2-dev bug. No need to backport.	2020-04-27 15:53:43 +02:00
Christopher Faulet	4a8c026117	BUG/MINOR: checks/server: use_ssl member must be signed	2020-04-27 12:13:06 +02:00
Christopher Faulet	8021a5f4a5	MINOR: checks: Support list of status codes on http-check expect rules It is now possible to match on a comma-separated list of status codes or range of codes. In addtion, instead of a string comparison to match the response's status code, a integer comparison is performed. Here is an example: http-check expect status 200,201,300-310	2020-04-27 10:46:28 +02:00
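The integer matching against a list such as "200,201,300-310" described in 8021a5f4a5 can be sketched as follows. ex_status_in_list() is a hypothetical helper written for illustration, not the parser used by HAProxy, and its error handling (returning -1 on malformed input) is an assumption.

```c
#include <ctype.h>
#include <stdlib.h>

/* Hypothetical helper. Returns 1 if <status> matches one of the codes or
 * ranges in the comma-separated <list>, 0 if it matches none, and -1 if the
 * list is malformed.
 */
int ex_status_in_list(const char *list, int status)
{
	const char *p = list;
	int found = 0;

	while (*p) {
		char *end;
		long lo, hi;

		/* Each item must start with a digit: no sign, no blank. */
		if (!isdigit((unsigned char)*p))
			return -1;
		lo = hi = strtol(p, &end, 10);

		/* Optional "-<hi>" part turns the item into a range. */
		if (*end == '-') {
			p = end + 1;
			if (!isdigit((unsigned char)*p))
				return -1;
			hi = strtol(p, &end, 10);
		}
		if (lo > hi)
			return -1;

		/* Integer comparison, as opposed to comparing strings. */
		if (status >= lo && status <= hi)
			found = 1;

		if (*end == ',') {
			p = end + 1;
			if (!*p)
				return -1; /* trailing comma */
		} else if (*end == '\0') {
			p = end;
		} else {
			return -1;
		}
	}
	return found;
}
```

With the list from the commit message, 200, 201 and every code from 300 to 310 match, while 202 or 311 do not.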
Christopher Faulet	88d939c831	Revert "MEDIUM: checks: capture groups in expect regexes" This reverts commit 1979943c30ef285ed04f07ecf829514de971d9b2. Captures in comment was only used when a tcp-check expect based on a negative regex matching failed to eventually report what was captured while it was not expected. It is a bit far-fetched to be useable IMHO. on-error and on-success log-format strings are far more usable. For now there is few check sample fetches (in fact only one...). But it could be really powerful to report info in logs.	2020-04-27 10:46:28 +02:00
Christopher Faulet	d7cee71e77	MINOR: checks: Use a tree instead of a list to store tcp-check rulesets Since all tcp-check rulesets are globally stored, it is a problem to use a list. For configuration with many backends, the lookups in the list may be costly and slow down HAProxy startup. To solve this problem, tcp-check rulesets are now stored in a tree.	2020-04-27 10:46:28 +02:00
Christopher Faulet	0417975bdc	MINOR: ist: Add a function to retrieve the ist pointer There is already the istlen() function to get the ist length. Now, it is possible to call istptr() to get the ist pointer.	2020-04-27 10:46:28 +02:00
Christopher Faulet	61cc852230	CLEANUP: checks: Reorg checks.c file to be more readable The patch is not obvious at first glance. But it is just a reorg. Functions have been grouped and ordered in a more logical way. Some structures and flags are now private to the checks module (so moved from the .h to the .c file).	2020-04-27 10:46:28 +02:00
Christopher Faulet	d7e639661a	MEDIUM: checks: Implement default TCP check using tcp-check rules Default health-checks, without any option, doing only a connection check, are now based on tcp-checks. An implicit default tcp-check connect rule is used. A shared tcp-check ruleset, named "*tcp-check", is created to support these checks.	2020-04-27 10:46:28 +02:00
Christopher Faulet	a9e1c4c7c2	MINOR: connection: Add a function to install a mux for a health-check This function is unused for now. But it will be used to install a mux for an outgoing connection opened in a health-check context. In this case, the session's origin is the check itself, and it is used to know the mode, HTTP or TCP, depending on the tcp-check type and not the proxy mode. The check is also used to get the mux protocol if configured.	2020-04-27 09:39:38 +02:00
Christopher Faulet	b356714769	MINOR: checks: Add a mux proto to health-check and tcp-check connect rule It is not set and not used for now, but it will be possible to force the mux protocol thanks to this patch. A mux proto field is added to the checks and to tcp-check connect rules.	2020-04-27 09:39:38 +02:00
Christopher Faulet	a142c1deb4	BUG/MINOR: obj_type: Handle stream object in obj_base_ptr() function The stream object (OBJ_TYPE_STREAM) was missing in the switch statement of the obj_base_ptr() function. This patch must be backported as far as 2.0.	2020-04-27 09:39:38 +02:00
Christopher Faulet	3829046893	MINOR: checks/obj_type: Add a new object type for checks An object type is now assigned to the check structure.	2020-04-27 09:39:38 +02:00
Christopher Faulet	e60abd1a06	MINOR: connection: Add macros to know if a conn or a cs uses an HTX mux IS_HTX_CONN() and IS_HTX_CS() may now be used to know if a connection or a conn-stream uses an HTX-based multiplexer.	2020-04-27 09:39:38 +02:00
Christopher Faulet	e5870d872b	MAJOR: checks: Implement HTTP check using tcp-check rules HTTP health-checks are now internally based on tcp-checks. Of course all the configuration parsing of the "http-check" keyword and the httpchk option has been rewritten. But the main change is that now, as for tcp-check rulesets, it is possible to perform several send/expect sequences in the same health-check. Thus the connect rule is now also available from HTTP checks, just like set-var, unset-var and comment rules. Because the request defined by the "option httpchk" line is used for the first request only, it is now possible to set the method, the uri and the version on a "http-check send" line.	2020-04-27 09:39:38 +02:00
Christopher Faulet	5eb96cbcbc	MINOR: standard: Add my_memspn and my_memcspn Do the same as strspn() and strcspn() but on a raw bytes buffer.	2020-04-27 09:39:38 +02:00
Christopher Faulet	12d5740a38	MINOR: checks: Introduce flags to configure in tcp-check expect rules Instead of having 2 independent integers, used as boolean values, to know if the expect rule is inverted and to know if the matching regexp has captures, we now use a 32-bit bitfield.	2020-04-27 09:39:38 +02:00
Christopher Faulet	f930e4c4df	MINOR: checks: Use an indirect string to represent the expect matching string Instead of having a string in the expect union with its length outside of the union, directly in the expect structure, an indirect string is now used.	2020-04-27 09:39:38 +02:00
Christopher Faulet	404f919995	MEDIUM: checks: Use a shared ruleset to store tcp-check rules All tcp-check rules are now stored in the global shared list. The ones created to parse a specific protocol, for instance redis, are already stored in this list. Now pure tcp-check rules are also stored in it. The ruleset name is created using the proxy name and its config file and line. tcp-check rules declared in a defaults section are also stored this way using "defaults" as proxy name. For now, all tcp-check rulesets are stored in a list. But it could be a bit slow to look for a specific ruleset with a huge number of backends. So, it could be a good idea to use a tree instead.	2020-04-27 09:39:38 +02:00
Christopher Faulet	6f5579160a	MINOR: proxy/checks: Move parsing of external-check option in checks.c Parsing of the proxy directive "option external-check" has been moved to checks.c.	2020-04-27 09:39:38 +02:00
Christopher Faulet	430e480510	MINOR: proxy/checks: Move parsing of tcp-check option in checks.c Parsing of the proxy directive "option tcp-check" has been moved to checks.c.	2020-04-27 09:39:38 +02:00
Christopher Faulet	6c2a743538	MINOR: proxy/checks: Move parsing of httpchk option in checks.c Parsing of the proxy directive "option httpchk" has been moved to checks.c.	2020-04-27 09:39:38 +02:00
Christopher Faulet	ec07e386a7	MINOR: checks: Add an option to set success status of tcp-check expect rules It is now possible to specify the healthcheck status to use on success of a tcp-check rule, if it is the last evaluated rule. The option "ok-status" supports "L4OK", "L6OK", "L7OK" and "L7OKC" status.	2020-04-27 09:39:38 +02:00
Christopher Faulet	799f3a4621	MINOR: Produce tcp-check info message for pure tcp-check rules only This way, messages reported by protocol checks are closer to the old ones.	2020-04-27 09:39:38 +02:00
Christopher Faulet	0ae3d1dbdf	MEDIUM: checks: Implement agent check using tcp-check rules A shared tcp-check ruleset is now created to support agent checks. The following sequence is used : tcp-check send "%[var(check.agent_string)]" log-format tcp-check expect custom The custom function to evaluate the expect rule does the same as what was done to handle the agent response when a custom check was used.	2020-04-27 09:39:38 +02:00
Christopher Faulet	267b01b761	MEDIUM: checks: Implement SPOP check using tcp-check rules A shared tcp-check ruleset is now created to support SPOP checks. This way no extra memory is used if several backends use a SPOP check. The following sequence is used : tcp-check send-binary SPOP_REQ tcp-check expect custom min-recv 4 The spop request is the result of the function spoe_prepare_healthcheck_request() and the expect rule relies on a custom function calling spoe_handle_healthcheck_response().	2020-04-27 09:39:38 +02:00
Christopher Faulet	1997ecaa0c	MEDIUM: checks: Implement LDAP check using tcp-check rules A shared tcp-check ruleset is now created to support LDAP check. This way no extra memory is used if several backends use a LDAP check. The following sequence is used : tcp-check send-binary "300C020101600702010304008000" tcp-check expect rbinary "^30" min-recv 14 \ on-error "Not LDAPv3 protocol" tcp-check expect custom The last expect rule relies on a custom function to check the LDAP server reply.	2020-04-27 09:39:38 +02:00
Christopher Faulet	f2b3be5c27	MEDIUM: checks: Implement MySQL check using tcp-check rules A shared tcp-check ruleset is now created to support MySQL checks. This way no extra memory is used if several backends use a MySQL check. One of the following sequences is used : ## If no extra params are set tcp-check connect default linger tcp-check expect custom ## will test the initial handshake ## If the username is defined tcp-check connect default linger tcp-check send-binary MYSQL_REQ log-format tcp-check expect custom ## will test the initial handshake tcp-check expect custom ## will test the reply to the client message The log-format hexa string MYSQL_REQ depends on 2 preset variables, the packet header containing the packet length and the sequence ID (check.header) and the username (check.username). It is also different if the "post-41" option is set or not. Expect rules rely on custom functions to check MySQL server packets.	2020-04-27 09:39:38 +02:00
Christopher Faulet	ce355074f1	MEDIUM: checks: Implement postgres check using tcp-check rules A shared tcp-check ruleset is now created to support postgres check. This way no extra memory is used if several backends use a pgsql check. The following sequence is used : tcp-check connect default linger tcp-check send-binary PGSQL_REQ log-format tcp-check expect !rstring "^E" min-recv 5 \ error-status "L7RSP" on-error "%[check.payload(6,0)]" tcp-check expect rbinary "^520000000800000000" min-recv "9" \ error-status "L7STS" \ on-success "PostgreSQL server is ok" \ on-error "PostgreSQL unknown error" The log-format hexa string PGSQL_REQ depends on 2 preset variables, the packet length (check.plen) and the username (check.username).	2020-04-27 09:39:38 +02:00
Christopher Faulet	fbcc77c6ba	MEDIUM: checks: Implement smtp check using tcp-check rules A shared tcp-check ruleset is now created to support smtp checks. This way no extra memory is used if several backends use a smtp check. The following sequence is used : tcp-check connect default linger tcp-check expect rstring "^[0-9]{3}[ \r]" min-recv 4 \ error-status "L7RSP" on-error "%[check.payload(),cut_crlf]" tcp-check expect rstring "^2[0-9]{2}[ \r]" min-recv 4 \ error-status "L7STS" \ on-error %[check.payload(4,0),ltrim(' '),cut_crlf] \ status-code "check.payload(0,3)" tcp-check send "%[var(check.smtp_cmd)]\r\n" log-format tcp-check expect rstring "^2[0-9]{2}[- \r]" min-recv 4 \ error-status "L7STS" \ on-error %[check.payload(4,0),ltrim(' '),cut_crlf] \ on-success "%[check.payload(4,0),ltrim(' '),cut_crlf]" \ status-code "check.payload(0,3)" The variable check.smtp_cmd is by default the string "HELO localhost" but may be customized by setting <helo> and <domain> parameters on the option smtpchk line. Note there is a difference with the old smtp check. The server greeting message is checked before sending the HELO/EHLO command.	2020-04-27 09:39:38 +02:00
Christopher Faulet	811f78ced1	MEDIUM: checks: Implement ssl-hello check using tcp-check rules A shared tcp-check ruleset is now created to support ssl-hello check. This way no extra memory is used if several backends use a ssl-hello check. The following sequence is used : tcp-check send-binary SSLV3_CLIENT_HELLO log-format tcp-check expect rbinary "^1[56]" min-recv 5 \ error-status "L6RSP" tout-status "L6TOUT" SSLV3_CLIENT_HELLO is a log-format hexa string representing a SSLv3 CLIENT HELLO packet. It is the same as the one used by the old ssl-hello except the sample expression "%[date(),htonl,hex]" is used to set the date field.	2020-04-27 09:39:38 +02:00
Christopher Faulet	33f05df650	MEDIUM: checks: Implement redis check using tcp-check rules A shared tcp-check ruleset is now created to support redis checks. This way no extra memory is used if several backends use a redis check. The following sequence is used : tcp-check send "*1\r\n$4\r\nPING\r\n" tcp-check expect string "+PONG\r\n" error-status "L7STS" \ on-error "%[check.payload(),cut_crlf]" on-success "Redis server is ok"	2020-04-27 09:39:38 +02:00
Christopher Faulet	9e6ed1598e	MINOR: checks: Support custom functions to eval a tcp-check expect rules It is now possible to set a custom function to evaluate a tcp-check expect rule. It is an internal and not documented option because the right pointer of function must be set and it is not possible to express it in the configuration. It will be used to convert some protocol healthchecks to tcp-checks. Custom functions must have the following signature: enum tcpcheck_eval_ret (*custom)(struct check *, struct tcpcheck_rule *, int);	2020-04-27 09:39:38 +02:00
Christopher Faulet	6f87adcf20	MINOR: checks: Export the tcpcheck_eval_ret enum This enum will be used to define custom functions for tcp-check expect rules.	2020-04-27 09:39:38 +02:00
Christopher Faulet	7a1e2e1823	MEDIUM: checks: Add a list of vars to set before executing a tcp-check ruleset A list of variables is now associated to each tcp-check ruleset. It is more or less a list of set-var expressions. This list may be filled during the configuration parsing. The listed variables will then be set during each execution of the tcp-check healthcheck, at the beginning, before execution of the first tcp-check rule. This patch is mandatory to convert all protocol checks to tcp-checks. It is a way to customize shared tcp-check rulesets.	2020-04-27 09:39:37 +02:00
Christopher Faulet	bb591a1a11	MINOR: checks: Relax the default option for tcp-check connect rules Now this option may be mixed with other options. This way, options on the server line are used but may be overridden by tcp-check connect options.	2020-04-27 09:39:37 +02:00
Christopher Faulet	98cc57cf5c	MEDIUM: checks: Add status-code sample expression on tcp-check expect rules This option defines a sample expression, evaluated as an integer, to set the status code (check->code) if a tcp-check healthcheck ends on the corresponding expect rule.	2020-04-27 09:39:37 +02:00
Christopher Faulet	be52b4de66	MEDIUM: checks: Add on-error/on-success option on tcp-check expect rules These options define log-format strings used to produce the info message if a tcp-check expect rule fails (on-error option) or succeeds (on-success option). For this last option, it must be the ending rule, otherwise the parameter is ignored.	2020-04-27 09:39:37 +02:00
Christopher Faulet	cf80f2f263	MINOR: checks: Add option to tcp-check expect rules to customize error status It is now possible to specify the healthcheck status to use on error or on timeout for tcp-check expect rules. First, to define the error status, the option "error-status" must be used followed by "L4CON", "L6RSP", "L7RSP" or "L7STS". Then, to define the timeout status, the option "tout-status" must be used followed by "L4TOUT", "L6TOUT" or "L7TOUT". These options will be used to convert specific protocol healthchecks (redis, pgsql...) to tcp-check ones.	2020-04-27 09:39:37 +02:00
Christopher Faulet	1032059bd0	MINOR: checks: Use a name for the healthcheck status enum The enum defining all healthcheck status (HCHK_STATUS_*) is now named.	2020-04-27 09:39:37 +02:00
Christopher Faulet	5d503fcf5b	MEDIUM: checks: Add a shared list of tcp-check rules A global list of tcp-check rulesets can now be used to share common rulesets with all backends without any duplication. It is mandatory to convert all specific protocol checks (redis, pgsql...) to tcp-check healthchecks. To do so, a flag is now attached to each tcp-check ruleset to know if it is a shared ruleset or not. tcp-check rules defined in a backend are still directly attached to the proxy and not shared. In addition a second flag is used to know if the ruleset is inherited from the defaults section.	2020-04-27 09:39:37 +02:00
Christopher Faulet	f50f4e956f	MEDIUM: checks: Support log-format strings for tcp-check send rules An extra parameter for tcp-check send rules can be specified to handle the string or the hexa string as a log-format one. Using "log-format" option, instead of considering the data to send as raw data, it is parsed as a log-format string. Thus it is possible to call sample fetches to customize data sent to a server. Of course, because we have no stream attached to healthchecks, not all sample fetches are available. So be careful. tcp-check set-var(check.port) int(8000) tcp-check set-var(check.uri) str(/status) tcp-check connect port var(check.port) tcp-check send "GET %[check.uri] HTTP/1.0\r\n" log-format tcp-check send "Host: %[srv_name]\r\n" log-format tcp-check send "\r\n"	2020-04-27 09:39:37 +02:00
Christopher Faulet	b7d30098f3	MEDIUM: checks: Support expression to set the port Since we have a session attached to tcp-check healthchecks, it is possible to use sample expressions and variables. In addition, it is possible to add tcp-check set-var rules to define custom variables. So, now, a sample expression can be used to define the port to use to establish a connection for a tcp-check connect rule. For instance: tcp-check set-var(check.port) int(8888) tcp-check connect port var(check.port)	2020-04-27 09:39:37 +02:00
Christopher Faulet	5c28874a69	MINOR: checks: Add the addr option for tcp-check connect rule With this option, it is now possible to use a specific address to open the connection for a tcp-check connect rule. If the port option is also specified, it is used in priority.	2020-04-27 09:39:37 +02:00
Christopher Faulet	d75f57e94c	MINOR: ssl: Export a generic function to parse an alpn string Parsing of an alpn string has been moved in a dedicated function and exposed to be used from outside the ssl_sock module.	2020-04-27 09:39:37 +02:00
Christopher Faulet	085426aea9	MINOR: checks: Add the via-socks4 option for tcp-check connect rules With this option, it is possible to establish the connection opened by a tcp-check connect rule using an upstream socks4 proxy. Info from the socks4 parameter on the server is used.	2020-04-27 09:39:37 +02:00
Christopher Faulet	79b31d4ee5	MINOR: checks: Add the sni option for tcp-check connect rules With this option, it is possible to specify the SNI to be used for SSL connection opened by a tcp-check connect rule.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	707b52f17e	MEDIUM: checks: Parse custom action rules in tcp-checks Register the custom action rules "set-var" and "unset-var", that will call the parse_store() command upon parsing. These rules are thus built and integrated to the tcp-check ruleset, but have no further effect for the moment.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	13a5043a9e	MINOR: checks/vars: Add a check scope for variables Add a dedicated vars scope for checks. This scope is considered as part of the session scope for accounting purposes. The scope can be addressed by a valid session, even embryonic. The stream is not necessary. The scope is initialized after the check session is created. All variables are then pruned before the session is destroyed.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	05d692dc09	MEDIUM: checks: Associate a session to each tcp-check healthcheck Create a session for each healthcheck relying on a tcp-check ruleset. When such check is started, a session is allocated, which will be freed when the check finishes. A dummy static frontend is used to create these sessions. This will be useful to support variables and sample expression. This will also be used, later, by HTTP healthchecks to rely on HTTP muxes.	2020-04-27 09:39:37 +02:00
Christopher Faulet	b2c2e0fcca	MAJOR: checks: Refactor and simplify the tcp-check loop The loop in tcpcheck_main() function is quite hard to understand. Depending where we are in the loop, the current_step is the currently executed rule or the one to execute on the next call to tcpcheck_main(). When the check result is reported, we rely on the rule pointed by last_started_step or the one pointed by current_step. In addition, the loop does not use the common list_for_each_entry macro and it is thus quite confusing. So the loop has been totally rewritten and split into several functions to simplify its reading and its understanding. Tcp-check rules are evaluated in dedicated functions. And a common for_each loop is used and only one rule is referenced, the current one.	2020-04-27 09:39:37 +02:00
Christopher Faulet	a202d1d4c1	MEDIUM: checks: Add implicit tcp-check connect rule After the configuration parsing, when its validity is checked, an implicit tcp-check connect rule is added in front of the tcp-check ruleset if the first non-comment rule is not a connect one. This implicit rule is flagged to use the default check parameter. This means now, all tcp-check rulesets begin with a connect and are never empty. When tcp-check healthchecks are used, all connections are thus handled by tcpcheck_main() function.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	06d963aeca	MINOR: checks: define a tcp-check connect type The check rule itself is not changed, only its representation.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	48219dc50e	MINOR: checks: define tcp-check send type The check rule itself is not changed, only its representation.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	5301b01f99	MINOR: checks: Set the tcp-check rule index during parsing Now the position of a tcp-check rule in a chain is set during the parsing. This significantly simplifies the function retrieving the current step id.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	04578dbf37	MINOR: checks: Don't use a static tcp rule list head To allow reusing these blocks without consuming more memory, their list should be static and share-able across uses. The head of the list will be shared as well. It is thus necessary to extract the head of the rule list from the proxy itself. Transform it into a pointer instead, that can be easily set to an external dynamically allocated head.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	9dcb09fc98	MEDIUM: checks: capture groups in expect regexes Parse back-references in comments of tcp-check expect rules. If references are made, capture groups in the match and replace references to it within the comment when logging the error. Both text and binary regex can capture groups and reference them in the expect rule comment. [Cf: I slightly updated the patch. exp_replace() function is used instead of a custom one. And if the trash buffer is too small to contain the comment during the substitution, the comment is ignored.]	2020-04-27 09:39:37 +02:00
Gaetan Rivet	efab6c61d9	MINOR: checks: add rbinary expect match type The rbinary match works similarly to the rstring match type, however the received data is rewritten as hex-string before the match operation is done. This allows using regexes on binary content even with the POSIX regex engine. [Cf: I slightly updated the patch. mem2hex function was removed and dump_binary is used instead.]	2020-04-27 09:39:37 +02:00
Gaetan Rivet	b616add793	MINOR: checks: define a tcp expect type Extract the expect definition from its tcpcheck ; create a standalone type.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	f8ba6773e5	MINOR: checks: add linger option to tcp connect Allow declaring tcpcheck connect commands with a new parameter, "linger". This option will configure the connection to avoid using an RST segment to close, instead following the four-way termination handshake. Some servers would otherwise log each healthcheck as an error.	2020-04-27 09:39:37 +02:00
Gaetan Rivet	1afd826ae4	MINOR: checks: add min-recv tcp-check expect option Some expect rules cannot be satisfied due to inherent ambiguity towards the received data: in the absence of match, the current behavior is to be forced to wait either the end of the connection or a buffer full, whichever comes first. Only then is the matching diagnostic considered conclusive. For instance : tcp-check connect tcp-check expect !rstring "^error" tcp-check expect string "valid" This check will only succeed if the connection is closed by the server before the check timeout. Otherwise the first expect rule will wait for more data until "^error" regex matches or the check expires. Allow the user to explicitly define an amount of data that will be considered enough to determine the value of the check. This allows succeeding on negative rstring rules, as previously in valid condition no match happened, and the matching was repeated until the end of the connection. This could timeout the check while no error was happening. [Cf: I slightly updated the patch. The parameter was renamed and the value is a signed integer to support -1 as default value to ignore the parameter.]	2020-04-27 09:39:37 +02:00
Gaetan Rivet	4038b94706	MEDIUM: checks: rewind to the first inverse expect rule of a chain on new data When receiving additional data while chaining multiple tcp-check expects, previous inverse expects might have a different result with the new data. They need to be evaluated again against the new data. Add a pointer to the first inverse expect rule of the current expect chain (possibly of length one) to each expect rule. When receiving new data, the currently evaluated tcp-check rule is set back to this pointed rule. Functionally speaking, it is a bug and it exists since the introduction of the feature. But there is no way for now to hit it because when an expect rule does not match, we wait for more data, independently on the inverse flag. The only way to move to the following rule is to be sure no more data will be received. This patch depends on the commit "MINOR: mini-clist: Add functions to iterate backward on a list". [Cf: I slightly updated the patch. First, it only concerns inverse expect rule. Normal expect rules are not concerned. Then, I removed the BUG tag because, for now, it is not possible to move to the following rule when the current one does not match while more data can be received.]	2020-04-27 09:39:37 +02:00
Gaetan Rivet	dd66732ffe	MINOR: checks: Use an enum to describe the tcp-check rule type Replace the generic integer with an enumerated list. This allows light type check and helps debugging (seeing action = 2 in the struct is not helpful).	2020-04-27 09:39:37 +02:00
Christopher Faulet	31c30fdf1e	CLEANUP: checks: Don't export anymore init_check and srv_check_healthcheck_port These functions are no longer called outside the checks.	2020-04-27 09:39:37 +02:00
Christopher Faulet	f61f33a1b2	BUG/MINOR: checks: Respect the no-check-ssl option This option is used to force a non-SSL connection to check a SSL server or to invert a check-ssl option inherited from the default section. The use_ssl field in the check structure is used to know if a SSL connection must be used (use_ssl=1) or not (use_ssl=0). The server configuration is used by default. The problem is that we cannot distinguish the default case (no specific SSL check option) and the case of an explicit non-SSL check. In both, use_ssl is set to 0. So the server configuration is always used. For a SSL server, when no-check-ssl option is set, the check is still performed using a SSL configuration. To fix the bug, instead of a boolean value (0=TCP, 1=SSL), we use a ternary value : * 0 = use server config * 1 = force SSL * -1 = force non-SSL The same is done for the server parameter. It is not really necessary for now. But it is a good way to know if the server no-ssl option is set. In addition, the PR_O_TCPCHK_SSL proxy option is no longer used to set use_ssl to 1 for a check. Instead the flag is directly tested to prepare or destroy the server SSL context. This patch should be backported as far as 1.8.	2020-04-27 09:39:37 +02:00
Christopher Faulet	8acb1284bc	MINOR: checks: Add a way to send custom headers and payload during http checks The 'http-check send' directive has been added to add headers and optionally a payload to the request sent during HTTP healthchecks. The request line may be customized by the "option httpchk" directive but there was no official way to add extra headers. An old trick consisted in hiding these headers at the end of the version string, on the "option httpchk" line. And it was impossible to add an extra payload with an "http-check expect" directive because of the "Connection: close" header appended to the request (See issue #16 for details). So to make things official and fully support payload additions, the "http-check send" directive has been added : option httpchk POST /status HTTP/1.1 http-check send hdr Content-Type "application/json;charset=UTF-8" \ hdr X-test-1 value1 hdr X-test-2 value2 \ body "{id: 1, field: \"value\"}" When a payload is defined, the Content-Length header is automatically added. So chunk-encoded requests are not supported yet. For now, there is no special validity checks on the extra headers. This patch is inspired by Kiran Gavali's work. It should fix the issue #16 and as far as possible, it may be backported, at least as far as 1.8.	2020-04-27 09:39:37 +02:00
Christopher Faulet	bc1f54b0fc	MINOR: mini-clist: Add functions to iterate backward on a list list_for_each_entry_rev() and list_for_each_entry_from_rev() and corresponding safe versions have been added to iterate on a list in the reverse order. All these functions work the same way as the forward versions, except they use the .p field to move from one element to another.	2020-04-27 09:39:37 +02:00
Christopher Faulet	aaae9a0e99	BUG/MINOR: check: Update server address and port to execute an external check Server address and port may change at runtime. So the address and port passed as arguments and as environment variables when an external check is executed must be updated. The current number of connections on the server was already updated before executing the command. So the same mechanism is used for the server address and port. But in addition, command arguments are also updated. This patch must be backported to all stable versions. It should fix the issue #577.	2020-04-27 09:39:13 +02:00
Willy Tarreau	62ba9ba6ca	BUG/MINOR: http: make url_decode() optionally convert '+' to SP The url_decode() function used by the url_dec converter and a few other call points is ambiguous on its processing of the '+' character which itself isn't stable in the spec. This one belongs to the reserved characters for the query string but not for the path nor the scheme, in which it must be left as-is. It's only in argument strings that follow the application/x-www-form-urlencoded encoding that it must be turned into a space, that is, in query strings and POST arguments. The problem is that the function is used to process full URLs and paths in various configs, and to process query strings from the stats page for example. This patch updates the function to differentiate the situation where it's parsing a path and a query string. A new argument indicates if a query string should be assumed, otherwise it's only assumed after seeing a question mark. The various locations in the code making use of this function were updated to take care of this (most call places were using it to decode POST arguments). The url_dec converter is usually called on path or url samples, so it needs to remain compatible with this and will default to parsing a path and turning the '+' to a space only after a question mark. However in situations where it would explicitly be extracted from a POST or a query string, it now becomes possible to enforce the decoding by passing a non-null value in argument. It seems to be what was reported in issue #585. This fix may be backported to older stable releases.	2020-04-23 20:03:27 +02:00
Willy Tarreau	09568fd54d	BUG/MINOR: tools: fix the i386 version of the div64_32 function As reported in issue #596, the edx register isn't marked as clobbered in div64_32(), which could technically allow gcc to try to reuse it if it needed a copy of the 32 highest bits of the o1 register after the operation. Two attempts were tried, one using a dummy 32-bit local variable to store the intermediary edx and another one switching to "=A" and making result a long long. It turns out the former makes the resulting object code significantly dirtier while the latter makes it better and was kept. This is due to gcc's difficulties at working with register pairs mixing 32- and 64- bit values on i386. It was verified that no code change happened at all on x86_64, armv7, aarch64 nor mips32. In practice it's only used by the frequency counters so this bug cannot even be triggered but better fix it. This may be backported to stable branches though it will not fix any issue.	2020-04-23 17:21:37 +02:00
Ilya Shipitsin	856aabcda5	CLEANUP: assorted typo fixes in the code and comments This is the 8th iteration of typo fixes	2020-04-17 09:37:36 +02:00
Willy Tarreau	bb86986253	MINOR: init: report the haproxy version and executable path once on errors If haproxy fails to start and emits an alert, then it can be useful to have it also emit the version and the path used to load it. Some users may be mistakenly launching the wrong binary due to a misconfigured PATH variable and this will save them some troubleshooting time when it reports that some keywords are not understood. What we do here is that we try to extract the binary name from the AUX vector on glibc, and we report this as a NOTICE tag before the very first alert is emitted.	2020-04-16 10:52:41 +02:00
Ilya Shipitsin	d425950c68	CLEANUP: assorted typo fixes in the code and comments This is the 7th iteration of typo fixes	2020-04-16 10:04:36 +02:00
Willy Tarreau	3eb10b8e98	MINOR: init: add -dW and "zero-warning" to reject configs with warnings Since some systems switched to service managers which hide all warnings by default, some users are not aware of some possibly important warnings and get caught too late with errors that could have been detected earlier. This patch adds a new global keyword, "zero-warning" and an equivalent command-line option "-dW" to refuse to start in case any warning is detected. It is recommended to use these with configurations that are managed by humans in order to catch mistakes very early.	2020-04-15 16:42:39 +02:00
Willy Tarreau	bebd212064	MINOR: init: report in "haproxy -c" whether there were warnings or not This helps quickly checking if the config produces any warning. For this we reuse the "warned" bit field to add a new WARN_ANY bit that is set by ha_warning(). The rest of the bit field was also cleaned from unused bits.	2020-04-15 16:42:00 +02:00
Frédéric Lécaille	8ba10fea69	BUG/MINOR: peers: Incomplete peers sections should be validated. Before supporting "server" line in "peers" section, such sections without any local peer were removed from the configuration to get it validated. This patch fixes the issue where a "server" line without address and port which is a remote peer without address and port makes the configuration parsing fail. When encountering such cases we now ignore such lines and remove them from the configuration. Thank you to Jérôme Magnin for having reported this bug. Must be backported to 2.1 and 2.0.	2020-04-15 10:47:39 +02:00
William Lallemand	b7296c42bd	CLEANUP: ssl: remove a commentary in struct ckch_inst The struct ckch_inst now handles the ssl_bind_conf so this commentary is obsolete	2020-04-09 16:13:42 +02:00
William Lallemand	caa161982f	CLEANUP: ssl/cli: use the list of filters in the crtlist_entry In 'commit ssl cert', instead of trying to regenerate a list of filters from the SNIs, use the list provided by the crtlist_entry used to generate the ckch_inst. This list of filters doesn't need to be free'd anymore since they are always reused from the crtlist_entry.	2020-04-08 16:52:51 +02:00
William Lallemand	02e19a5c7b	CLEANUP: ssl: use the refcount for the SSL_CTX' Use the refcount of the SSL_CTX' to free them instead of freeing them on certain conditions. That way we can free the SSL_CTX everywhere its pointer is used.	2020-04-08 16:52:51 +02:00
William Lallemand	c69f02d0f0	MINOR: ssl/cli: replace dump/show ssl crt-list by '-n' option The dump and show ssl crt-list commands do the same thing, they dump the content of a crt-list, but the 'show' displays an ID in the first column. Delete the 'dump' command so it is replaced by the 'show' one. The old 'show' command is replaced by an '-n' option to dump the ID. And the ID which was a pointer is replaced by a line number and placed after colons in the filename. Example: $ echo "show ssl crt-list -n kikyo.crt-list" | socat /tmp/sock1 - # kikyo.crt-list kikyo.pem.rsa:1 secure.domain.tld kikyo.pem.ecdsa:2 secure.domain.tld	2020-04-06 19:33:33 +02:00
Frédéric Lécaille	876ed55d9b	BUG/MINOR: protocol_buffer: Wrong maximum shifting. This patch fixes a bad stop condition when decoding a protocol buffer variable integer whose maximum length is 10, shifting a uint64_t value by more than 63. Thank you to Ilya for having reported this issue. Must be backported to 2.1 and 2.0.	2020-04-02 15:09:46 +02:00
Olivier Houchard	4a0e7fe4f7	MINOR: connections: Don't mark conn flags 0x00000001 and 0x00000002 as unused. Remove the comments saying 0x00000001 and 0x00000002 are unused, they are now used by CO_FL_SAFE_LIST and CO_FL_IDLE_LIST.	2020-03-31 23:04:20 +02:00
William Lallemand	fa8cf0c476	MINOR: ssl: store a ptr to crtlist in crtlist_entry Store a pointer to crtlist in crtlist_entry so we can re-insert a crtlist_entry in its crtlist ebpt after updating its key.	2020-03-31 12:32:17 +02:00
William Lallemand	23d61c00b9	MINOR: ssl: add a list of crtlist_entry in ckch_store When updating a ckch_store we may want to update its pointer in the crtlist_entry which use it. To do this, we need the list of the entries using the store.	2020-03-31 12:32:17 +02:00
William Lallemand	493983128b	BUG/MINOR: ssl: ckch_inst wrongly inserted in crtlist_entry The instances were wrongly inserted in the crtlist entries, all instances of a crt-list were inserted in the last crt-list entry. Which was kind of handy to free all instances upon error. Now that it's done correctly, the error path was changed, it must iterate on the entries and find the ckch_insts which were generated for this bind_conf. To avoid wasting time, it stops the iteration once it found the first unsuccessful generation.	2020-03-31 12:32:17 +02:00
William Lallemand	ad3c37b760	REORG: ssl: move SETCERT enum to ssl_sock.h Move the SETCERT enum at the right place to cleanup ssl_sock.c.	2020-03-31 12:32:17 +02:00
William Lallemand	79d31ec0d4	MINOR: ssl: add a list of bind_conf in struct crtlist In order to be able to add new certificate in a crt-list, we need the list of bind_conf that uses this crt-list so we can create a ckch_inst for each of them.	2020-03-31 12:32:17 +02:00
William Lallemand	638f6ad033	MINOR: cli: add a general purpose pointer in the CLI struct This patch adds a p2 generic pointer which is initialized to zero before calling the parser.	2020-03-31 12:32:17 +02:00
Olivier Houchard	cf612a0457	MINOR: servers: Add a counter for the number of currently used connections. Add a counter to know the current number of used connections, as well as the max, this will be used later to refine the algorithm used to kill idle connections, based on current usage.	2020-03-30 00:30:01 +02:00
Jerome Magnin	824186bb08	MEDIUM: stream: support use-server rules with dynamic names With server-template was introduced the possibility to scale the number of servers in a backend without needing a configuration change and associated reload. On the other hand it became impractical to write use-server rules for these servers as they would only accept existing server labels as argument. This patch allows the use of log-format notation to describe targets of a use-server rule, such as in the example below: listen test bind *:1234 use-server %[hdr(srv)] if { hdr(srv) -m found } use-server s1 if { path / } server s1 127.0.0.1:18080 server s2 127.0.0.1:18081 If a use-server rule is applied because it was conditioned by an ACL returning true, but the target of the use-server rule cannot be resolved, no other use-server rule is evaluated and we fall back to load balancing. This feature was requested on the ML, and bumped with issue #563.	2020-03-29 09:55:10 +02:00
Olivier Houchard	dbda31939d	BUG/MINOR: connections: Set idle_time before adding to idle list. In srv_add_to_idle_list(), make sure we set the idle_time before we add the connection to an idle list, not after, otherwise another thread may grab it, set the idle_time to 0, only to have the original thread set it back to now_ms. This may have an impact, as in conn_free() we check idle_time to decide if we should decrement the idle connection counters for the server.	2020-03-22 20:05:59 +01:00
Olivier Houchard	ad91124bcf	BUILD/MEDIUM: fd: Declare fd_mig_lock as extern. Declare fd_mig_lock as extern so that it isn't defined multiple times. This should fix build for architectures without double-width CAS.	2020-03-20 11:42:11 +01:00
Olivier Houchard	566df309c6	MEDIUM: connections: Attempt to get idle connections from other threads. In connect_server(), if we no longer have any idle connections for the current thread, attempt to use the new "takeover" mux method to steal a connection from another thread. This should have no impact right now, given no mux implements it.	2020-03-19 22:07:33 +01:00
Olivier Houchard	d2489e00b0	MINOR: connections: Add a flag to know if we're in the safe or idle list. Add flags to connections, CO_FL_SAFE_LIST and CO_FL_IDLE_LIST, to let one know we are in the safe list, or the idle list.	2020-03-19 22:07:33 +01:00
Olivier Houchard	f0d4dff25c	MINOR: connections: Make the "list" element a struct mt_list instead of list. Make the "list" element a struct mt_list, and explicitly use list_from_mt_list to get a struct list * where it is used as such, so that mt_list_for_each_entry will be usable with it.	2020-03-19 22:07:33 +01:00
Olivier Houchard	00bdce24d5	MINOR: connections: Add a new mux method, "takeover". Add a new mux method, "takeover", that will attempt to make the current thread responsible for the connection. It should return 0 on success, and non-zero on failure.	2020-03-19 22:07:33 +01:00
Olivier Houchard	8851664293	MINOR: fd: Implement fd_takeover(). Implement a new function, fd_takeover(), that lets you become the thread responsible for the fd. On architectures that do not have a double-width CAS, use a global rwlock. fd_set_running() was also changed to be able to compete with fd_takeover(), either using a double-width CAS on both running_mask and thread_mask, or by claiming a reader on the global rwlock. This extra operation should not have any measurable impact on modern architectures where threading is relevant.	2020-03-19 22:07:33 +01:00
Olivier Houchard	dc2f2753e9	MEDIUM: servers: Split the connections into idle, safe, and available. Revamp the server connection lists. We now have 3 lists : - idle_conns, which contains idling connections - safe_conns, which contains idling connections that are safe to use even for the first request - available_conns, which contains connections that are not idling, but can still accept new streams (those are HTTP/2 or fastcgi, and are always considered safe).	2020-03-19 22:07:33 +01:00
Olivier Houchard	2444aa5b66	MEDIUM: sessions: Don't be responsible for connections anymore. Make it so sessions are not responsible for connection anymore, except for connections that are private, and thus can't be shared, otherwise, as soon as a request is done, the session will just add the connection to the orphan connections pool. This will break http-reuse safe, but it is expected to be fixed later.	2020-03-19 22:07:33 +01:00
Olivier Houchard	899fb8abdc	MINOR: memory: Change the flush_lock to a spinlock, and don't get it in alloc. The flush_lock was introduced, mostly to be sure that pool_gc() will never dereference a pointer that has been free'd. __pool_get_first() was acquiring the lock too; the fear was that otherwise that pointer could get free'd later, and then pool_gc() would attempt to dereference it. However, that can not happen, because the only functions that can free a pointer, when using lockless pools, are pool_gc() and pool_flush(), and as long as those two are mutually exclusive, nobody will be able to free the pointer while pool_gc() attempts to access it. So change the flush_lock to a spinlock, and don't bother acquire/release it in __pool_get_first(), that way callers of __pool_get_first() won't have to wait while the pool is flushed. The worst that can happen is we call __pool_refill_alloc() while the pool is getting flushed, and memory can get allocated just to be free'd. This may help with github issue #552 This may be backported to 2.1, 2.0 and 1.9.	2020-03-18 15:55:35 +01:00
Olivier Houchard	de01ea9878	MINOR: wdt: Move the definitions of WDTSIG and DEBUGSIG into types/signal.h. Move the definition of WDTSIG and DEBUGSIG from wdt.c and debug.c into types/signal.h, so that we can access them in another file. We need those definitions to avoid blocking those signals when running __signal_process_queue(). This should be backported to 2.1, 2.0 and 1.9.	2020-03-18 13:07:19 +01:00
Olivier Houchard	a7bf573520	MEDIUM: fd: Introduce a running mask, and use it instead of the spinlock. In the struct fdtab, introduce a new mask, running_mask. Each thread should add its bit before using the fd. Use the running_mask instead of a lock, in fd_insert/fd_delete, we'll just spin as long as the mask is non-zero, to be sure we access the data exclusively. fd_set_running_excl() spins until the mask is 0, fd_set_running() just adds the thread bit, and fd_clr_running() removes it.	2020-03-17 15:30:07 +01:00
William Lallemand	2954c478eb	MEDIUM: ssl: allow crt-list caching The crtlist structure defines a crt-list in the HAProxy configuration. It contains crtlist_entry structures which are the lines in a crt-list file. crt-list are now loaded in memory using crtlist and crtlist_entry structures. The file is read only once. The generation algorithm changed a little bit, new ckch instances are generated from the crtlist structures, instead of being generated during the file loading. The loading function was split in two, one that loads and caches the crt-list and certificates, and one that looks for a crt-list and creates the ckch instances. Filters are also stored in crtlist_entry->filters as a char ** so we can generate the sni_ctx again if needed. It won't be needed anymore to parse the sni_ctx to do that. A crtlist_entry stores the list of all ckch_inst that were generated from this entry.	2020-03-16 16:18:49 +01:00
Willy Tarreau	e4d42551bd	BUILD: pools: silence build warnings with DEBUG_MEMORY_POOLS and DEBUG_UAF With these debug options we still get these warnings: include/common/memory.h:501:23: warning: null pointer dereference [-Wnull-dereference] *(volatile int *)0 = 0; ~~~~~~~~~~~~~~~~~~~^~~ include/common/memory.h:460:22: warning: null pointer dereference [-Wnull-dereference] *(volatile int *)0 = 0; ~~~~~~~~~~~~~~~~~~~^~~ These are purposely there to crash the process at specific locations. But the annoying warnings do not help with debugging and they are not even reliable as the compiler may decide to optimize them away. Let's pass the pointer through DISGUISE() to avoid this.	2020-03-14 11:10:21 +01:00
Willy Tarreau	2e8ab6b560	MINOR: use DISGUISE() everywhere we deliberately want to ignore a result It's more generic and versatile than the previous shut_your_big_mouth_gcc() that was used to silence annoying warnings as it's not limited to ignoring syscalls returns only. This allows us to get rid of the aforementioned function and the shut_your_big_mouth_gcc_int variable, that started to look ugly in multi-threaded environments.	2020-03-14 11:04:49 +01:00
Willy Tarreau	15ed69fd3f	MINOR: debug: consume the write() result in BUG_ON() to silence a warning Tim reported that BUG_ON() issues warnings on his distro, as the libc marks some syscalls with __attribute__((warn_unused_result)). Let's pass the write() result through DISGUISE() to hide it.	2020-03-14 10:58:35 +01:00
Willy Tarreau	f401668306	MINOR: debug: add a new DISGUISE() macro to pass a value as identity This does exactly the same as ALREADY_CHECKED() but does it inline, returning an identical copy of the scalar variable without letting the compiler know how it might have been transformed. This can forcefully disable certain null-pointer checks or result checks when known undesirable. Typically forcing a crash with *(DISGUISE(NULL))=0 will not cause a null-deref warning.	2020-03-14 10:52:46 +01:00
Ilya Shipitsin	77e3b4a2c4	CLEANUP: assorted typo fixes in the code and comments These are mostly comments in the code. A few error messages were fixed and are of low enough importance not to deserve a backport. Some regtests were also fixed.	2020-03-14 09:42:07 +01:00
Tim Duesterhus	cf6e0c8a83	MEDIUM: proxy_protocol: Support sending unique IDs using PPv2 This patch adds the `unique-id` option to `proxy-v2-options`. If this option is set a unique ID will be generated based on the `unique-id-format` while sending the proxy protocol v2 header and stored as the unique id for the first stream of the connection. This feature is meant to be used in `tcp` mode. It works on HTTP mode, but might result in inconsistent unique IDs for the first request on a keep-alive connection, because the unique ID for the first stream is generated earlier than the others. Now that we can send unique IDs in `tcp` mode the `%ID` log variable is made available in TCP mode.	2020-03-13 17:26:43 +01:00
Tim Duesterhus	d1b15b6e9b	MINOR: proxy_protocol: Ingest PP2_TYPE_UNIQUE_ID on incoming connections This patch reads a proxy protocol v2 provided unique ID and makes it available using the `fc_pp_unique_id` fetch.	2020-03-13 17:25:23 +01:00
Tim Duesterhus	b435f77620	DOC: proxy_protocol: Reserve TLV type 0x05 as PP2_TYPE_UNIQUE_ID This reserves and defines TLV type 0x05.	2020-03-13 17:25:23 +01:00
Olivier Houchard	84fd8a77b7	MINOR: lists: fix indentation. Fix indentation in the recently added list_to_mt_list().	2020-03-11 21:41:13 +01:00
Olivier Houchard	8676514d4e	MINOR: servers: Kill priv_conns. Remove the list of private connections from server, it has been largely unused, we only inserted connections in it, but we would never actually use it.	2020-03-11 19:20:01 +01:00
Olivier Houchard	751e5e21a9	MINOR: lists: Implement function to convert list => mt_list and mt_list => list Implement mt_list_to_list() and list_to_mt_list(), to be able to convert from a struct list to a struct mt_list, and vice versa. This is normally of no use, except for struct connection's list field, that can go in either a struct list or a struct mt_list.	2020-03-11 17:10:40 +01:00
Olivier Houchard	49983a9fe1	MINOR: mt_lists: Appease gcc. gcc is confused, and think p may end up being NULL in _MT_LIST_RELINK_DELETED. It should never happen, so let gcc know that.	2020-03-11 17:10:08 +01:00
Willy Tarreau	638698da37	BUILD: stream-int: fix a few includes dependencies The stream-int code doesn't need to load server.h as it doesn't use servers at all. However removing this one reveals that proxy.h was lacking types/checks.h that used to be silently inherited from types/server.h loaded before in stream_interface.h.	2020-03-11 14:15:33 +01:00
Willy Tarreau	855796bdc8	BUG/MAJOR: list: fix invalid element address calculation Ryan O'Hara reported that haproxy breaks on fedora-32 using gcc-10 (pre-release). It turns out that constructs such as: while (item != head) { item = LIST_ELEM(item.n); } loop forever, never matching <item> to <head> despite a printf there showing them equal. In practice the problem is that the LIST_ELEM() macro is wrong, it assigns the subtract of two pointers (an integer) to another pointer through a cast to its pointer type. And GCC 10 now considers that this cannot match a pointer and silently optimizes the comparison away. A tested workaround for this is to build with -fno-tree-pta. Note that older gcc versions even with -ftree-pta do not exhibit this rather surprising behavior. This patch changes the test to instead cast the null-based address to an int to get the offset and subtract it from the pointer, and this time it works. There were just a few places to adjust. Ideally offsetof() should be used but the LIST_ELEM() API doesn't make this trivial as it's commonly called with a typeof(ptr) and not typeof(ptr*) thus it would require to completely change the whole API, which is not something workable in the short term, especially for a backport. With this change, the emitted code is subtly different even on older versions. A code size reduction of ~600 bytes and a total executable size reduction of ~1kB are expected to be observed and should not be taken as an anomaly.
Typically this loop in dequeue_proxy_listeners() : while ((listener = MT_LIST_POP(...))) used to produce this code where the comparison is performed on RAX while the new offset is assigned to RDI even though both are always identical: 53ded8: 48 8d 78 c0 lea -0x40(%rax),%rdi 53dedc: 48 83 f8 40 cmp $0x40,%rax 53dee0: 74 39 je 53df1b <dequeue_proxy_listeners+0xab> and now produces this one which is slightly more efficient as the same register is used for both purposes: 53dd08: 48 83 ef 40 sub $0x40,%rdi 53dd0c: 74 2d je 53dd3b <dequeue_proxy_listeners+0x9b> Similarly, retrieving the channel from a stream_interface using si_ic() and si_oc() used to cause this (stream-int in rdi): 1cb7: c7 47 1c 00 02 00 00 movl $0x200,0x1c(%rdi) 1cbe: f6 47 04 10 testb $0x10,0x4(%rdi) 1cc2: 74 1c je 1ce0 <si_report_error+0x30> 1cc4: 48 81 ef 00 03 00 00 sub $0x300,%rdi 1ccb: 81 4f 10 00 08 00 00 orl $0x800,0x10(%rdi) and now causes this: 1cb7: c7 47 1c 00 02 00 00 movl $0x200,0x1c(%rdi) 1cbe: f6 47 04 10 testb $0x10,0x4(%rdi) 1cc2: 74 1c je 1ce0 <si_report_error+0x30> 1cc4: 81 8f 10 fd ff ff 00 orl $0x800,-0x2f0(%rdi) There is extremely little chance that this fix wakes up a dormant bug as the emitted code effectively does what the source code intends. This must be backported to all supported branches (dropping MT_LIST_ELEM and the spoa_example parts as needed), since the bug is subtle and may not always be visible even when compiling with gcc-10.	2020-03-11 14:12:51 +01:00
Olivier Houchard	1d117e3dcd	BUG/MEDIUM: mt_lists: Make sure we set the deleted element to NULL; In MT_LIST_DEL_SAFE(), when the code was changed to use a temporary variable instead of using the provided pointer directly, we shouldn't have changed the code that set the pointer to NULL, as we really want the pointer provided to be nullified, otherwise other parts of the code won't know we just deleted an element, and bad things will happen. This should be backported to 2.1.	2020-03-10 17:45:05 +01:00
Willy Tarreau	9a0dfa5298	CLEANUP: remove the now unused common/syscall.h It was added 9 years ago to implement USE_MY_SPLICE on some libcs where syscall() was bogus. It's about time to get rid of this.	2020-03-10 07:28:46 +01:00
Willy Tarreau	06c63aec95	CLEANUP: remove support for USE_MY_SPLICE The splice() syscall has been supported in glibc since version 2.5 issued in 2006 and is present on supported systems so there's no need for having our own arch-specific syscall definitions anymore.	2020-03-10 07:23:41 +01:00
Willy Tarreau	3858b122a6	CLEANUP: remove support for USE_MY_EPOLL This was made to support epoll on patched 2.4 kernels, and on early 2.6 using alternative libcs thanks to the arch-specific syscall definitions. All the features we support have been around since 2.6.2 and present in glibc since 2.3.2, neither of which are found in field anymore. Let's simply drop this and use epoll normally.	2020-03-10 07:08:10 +01:00
Willy Tarreau	618ac6ea52	CLEANUP: drop support for USE_MY_ACCEPT4 The accept4() syscall has been present for a while now, there is no more reason for maintaining our own arch-specific syscall implementation for systems lacking it in libc but having it in the kernel.	2020-03-10 07:02:46 +01:00
Willy Tarreau	c3e926bf3b	CLEANUP: remove support for Linux i686 vsyscalls This was introduced 10 years ago to squeeze a few CPU cycles per syscall on 32-bit x86 machines and was already quite old by then, requiring to explicitly enable support for this in the kernel. We don't even know if it still builds, let alone if it works at all on recent kernels! Let's completely drop this now.	2020-03-10 06:55:52 +01:00
William Lallemand	6763016866	BUG/MINOR: ssl/cli: sni_ctx' mustn't always be used as filters Since commit 244b070 ("MINOR: ssl/cli: support crt-list filters"), HAProxy generates a list of filters based on the sni_ctx in memory. However it's not always relevant, sometimes no filters were configured and the CN/SAN in the new certificate are not the same. This patch fixes the issue by using a flag filters in the ckch_inst, so we are able to know if there were filters or not. In the latter case it uses the CN/SAN of the new certificate to generate the sni_ctx. note: filters are still only used in the crt-list atm.	2020-03-09 17:32:04 +01:00
William Lallemand	0a52846603	CLEANUP: ssl: is_default is a bit in ckch_inst The field is_default becomes a bit in the ckch_inst structure.	2020-03-09 17:32:04 +01:00
Miroslav Zagorac	d7dc67ba1d	CLEANUP: remove unused code in 'my_ffsl/my_flsl' functions Shifting the variable 'a' one bit to the right has no effect on the result of the functions.	2020-03-09 14:47:27 +01:00
Willy Tarreau	ee3bcddef7	MINOR: tools: add a generic function to generate UUIDs We currently have two UUID generation functions, one for the sample fetch and the other one in the SPOE filter. Both were a bit complicated since they were made to support random() implementations returning an arbitrary number of bits, and were throwing away 33 bits every 64. Now we don't need this anymore, so let's have a generic function consuming 64 bits at once and use it as appropriate.	2020-03-08 18:04:16 +01:00
Willy Tarreau	52bf839394	BUG/MEDIUM: random: implement a thread-safe and process-safe PRNG This is the replacement of failed attempt to add thread safety and per-process sequences of random numbers initially tried with commit `1c306aa84d` ("BUG/MEDIUM: random: implement per-thread and per-process random sequences"). This new version takes a completely different approach and doesn't try to work around the horrible OS-specific and non-portable random API anymore. Instead it implements "xoroshiro128**", a reputedly high quality random number generator, which is one of the many variants of xorshift, which passes all quality tests and which is described here: http://prng.di.unimi.it/ While not cryptographically secure, it is fast and features a 2^128-1 period. It supports fast jumps allowing to cut the period into smaller non-overlapping sequences, which we use here to support up to 2^32 processes each having their own, non-overlapping sequence of 2^96 numbers (~7*10^28). This is enough to provide 1 billion randoms per second and per process for 2200 billion years. The implementation was made thread-safe either by using a double 64-bit CAS on platforms supporting it (x86_64, aarch64) or by using a local lock for the time needed to perform the shift operations. This ensures that all threads pick numbers from the same pool so that it is not needed to assign per-thread ranges. For processes we use the fast jump method to advance the sequence by 2^96 for each process.
Before this patch, the following config: global nbproc 8 frontend f bind :4445 mode http log stdout format raw daemon log-format "%[uuid] %pid" redirect location / Would produce this output: a4d0ad64-2645-4b74-b894-48acce0669af 12987 a4d0ad64-2645-4b74-b894-48acce0669af 12992 a4d0ad64-2645-4b74-b894-48acce0669af 12986 a4d0ad64-2645-4b74-b894-48acce0669af 12988 a4d0ad64-2645-4b74-b894-48acce0669af 12991 a4d0ad64-2645-4b74-b894-48acce0669af 12989 a4d0ad64-2645-4b74-b894-48acce0669af 12990 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12987 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12992 82d5f6cd-f6c1-4f85-a89c-36ae85d26fb9 12986 (...) And now produces: f94b29b3-da74-4e03-a0c5-a532c635bad9 13011 47470c02-4862-4c33-80e7-a952899570e5 13014 86332123-539a-47bf-853f-8c8ea8b2a2b5 13013 8f9efa99-3143-47b2-83cf-d618c8dea711 13012 3cc0f5c7-d790-496b-8d39-bec77647af5b 13015 3ec64915-8f95-4374-9e66-e777dc8791e0 13009 0f9bf894-dcde-408c-b094-6e0bb3255452 13011 49c7bfde-3ffb-40e9-9a8d-8084d650ed8f 13014 e23f6f2e-35c5-4433-a294-b790ab902653 13012 There are multiple benefits to using this method. First, it doesn't depend anymore on a non-portable API. Second it's thread safe. Third it is fast and more proven than any hack we could attempt to try to work around the deficiencies of the various implementations around. This commit depends on previous patches "MINOR: tools: add 64-bit rotate operators" and "BUG/MEDIUM: random: initialize the random pool a bit better", all of which will need to be backported at least as far as version 2.0. It doesn't require to backport the build fixes for circular include files dependency anymore.	2020-03-08 10:09:02 +01:00
Willy Tarreau	7a40909c00	MINOR: tools: add 64-bit rotate operators This adds rotl64/rotr64 to rotate a 64-bit word by an arbitrary number of bits. It's mainly aimed at being used with constants.	2020-03-08 00:42:18 +01:00
Willy Tarreau	0fbf28a05b	Revert "BUG/MEDIUM: random: implement per-thread and per-process random sequences" This reverts commit `1c306aa84d`. It breaks the build on all non-glibc platforms. I got confused by the man page (which possibly is the most confusing man page I've ever read about a standard libc function) and mistakenly understood that random_r was portable, especially since it appears in latest freebsd source as well but not in released versions, and with a slightly different API :-/ We need to find a different solution with a fallback. Among the possibilities, we may reintroduce this one with a fallback relying on locking around the standard functions, keeping fingers crossed for no other library function to call them in parallel, or we may also provide our own PRNG, which is not necessarily more difficult than working around the totally broken up design of the portable API.	2020-03-07 11:24:39 +01:00
Willy Tarreau	1c306aa84d	BUG/MEDIUM: random: implement per-thread and per-process random sequences As mentioned in previous patch, the random number generator was never made thread-safe, which used not to be a problem for health checks spreading, until the uuid sample fetch function appeared. Currently it is possible for two threads or processes to produce exactly the same UUID. In fact it's extremely likely that this will happen for processes, as can be seen with this config: global nbproc 8 frontend f bind :4445 mode http log stdout daemon format raw log-format "%[uuid] %pid" redirect location / It typically produces this log: 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30645 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30641 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30644 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30639 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30646 07764439-c24d-4e6f-a5a6-0138be59e7a8 30645 07764439-c24d-4e6f-a5a6-0138be59e7a8 30639 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30643 07764439-c24d-4e6f-a5a6-0138be59e7a8 30646 b6773fdd-678f-4d04-96f2-4fb11ad15d6b 30646 551ce567-0bfb-4bbd-9b58-cdc7e9365325 30642 07764439-c24d-4e6f-a5a6-0138be59e7a8 30642 What this patch does is to use a distinct per-thread and per-process seed to make sure the same sequences will not appear, and will then extend these seeds by "burning" a number of randoms that depends on the global random seed, the thread ID and the process ID. This adds roughly 20 extra bits of randomness, resulting in 52 bits total per thread and per process. It only takes a few milliseconds to burn these randoms and given that threads start with a different seed, we know they will not catch each other. So these random extra bits are essentially added to ensure randomness between boots and cluster instances. This replaces all uses of random() with ha_random() which uses the thread-local state. This must be backported as far as 2.0 or any version having the UUID sample-fetch function since it's the main victim here. 
It's important to note that this patch, in addition to depending on the previous one "BUG/MEDIUM: init: initialize the random pool a bit better", also depends on the preceding build fixes to address a circular dependency issue in the include files that prevented it from building. Part or all of these patches may need to be backported or adapted as well.	2020-03-07 06:11:15 +01:00
Willy Tarreau	6c3a681bd6	BUG/MEDIUM: random: initialize the random pool a bit better Since the UUID sample fetch was created, some people noticed that in certain virtualized environments they manage to get exact same UUIDs on different instances started exactly at the same moment. It turns out that the randoms were only initialized to spread the health checks originally, not to provide "clean" randoms. This patch changes this and collects more randomness from various sources, including existing randoms, /dev/urandom when available, RAND_bytes() when OpenSSL is available, as well as the timing for such operations, then applies a SHA1 on all this to keep a 160 bits random seed available, 32 of which are passed to srandom(). It's worth mentioning that there's no clean way to pass more than 32 bits to srandom() as even initstate() provides an opaque state that must absolutely not be tampered with since known implementations contain state information. At least this allows to have up to 4 billion different sequences from the boot, which is not that bad. Note that the thread safety was still not addressed, which is another issue for another patch. This must be backported to all versions containing the UUID sample fetch function, i.e. as far as 2.0.	2020-03-07 06:11:11 +01:00
Willy Tarreau	5a421a8f49	BUILD: listener: types/listener.h must not include standard.h It's only a type definition, this header is not needed and causes some circular dependency issues.	2020-03-07 06:07:18 +01:00
Willy Tarreau	c7f64e7a58	BUILD: freq_ctr: proto/freq_ctr needs to include common/standard.h This is needed for div_64_32() which is there and currently accidentally inherited via global.h!	2020-03-07 06:07:18 +01:00
Willy Tarreau	f23e029409	BUILD: global: must not include common/standard.h but only types/freq_ctr.h This one was accidentally inherited and used to work but causes a circular dependency.	2020-03-07 06:07:18 +01:00
Willy Tarreau	8dd0d55efe	BUILD: ssl: include mini-clist.h We use some list definitions and we don't include this header which is in fact accidentally inherited from others, causing a circular dependency issue.	2020-03-07 06:07:18 +01:00
Willy Tarreau	a8561db936	BUILD: buffer: types/{ring.h,checks.h} should include buf.h, not buffer.h buffer.h relies on proto/activity because it contains some code and not just type definitions. It must not be included from types files. It should probably also be split in two if it starts to include a proto. This causes some circular dependencies at other places.	2020-03-07 06:07:18 +01:00
Christopher Faulet	d8f0e073dd	MINOR: lua: Remove the flag HLUA_TXN_HTTP_RDY This flag was used in some internal functions to be sure the current stream is able to handle HTTP content. It was introduced when the legacy HTTP code was still there. Now, It is possible to rely on stream's flags to be sure we have an HTX stream. So the flag HLUA_TXN_HTTP_RDY can be removed. Everywhere it was tested, it is replaced by a call to the IS_HTX_STRM() macro. This patch is mandatory to allow the support of the filters written in lua.	2020-03-06 14:13:00 +01:00
Christopher Faulet	1cdceb9365	MINOR: htx: Add a function to return a block at a specific offset The htx_find_offset() function may be used to look for a block at a specific offset in an HTX message, starting from the message head. A compound result is returned, an htx_ret structure, with the found block and the position of the offset in the block. If the offset is outside of the HTX message, the returned block is NULL.	2020-03-06 14:12:59 +01:00
Christopher Faulet	251f4917c3	MINOR: buf: Add function to insert a string at an absolute offset in a buffer The b_insert_blk() function may now be used to insert a string, given a pointer and the string length, at an absolute offset in a buffer, moving data between this offset and the buffer's tail just after the end of the inserted string. The buffer's length is automatically updated. This function supports wrapping. Either the whole string is copied or nothing is. So it returns 0 if there is not enough space to perform the copy. Otherwise, the number of bytes copied is returned.	2020-03-06 14:12:59 +01:00
Carl Henrik Lunde	f91ac19299	OPTIM: startup: fast unique_id allocation for acl. pattern_finalize_config() uses an inefficient algorithm which is a problem with very large configuration files. This affects startup, and therefore reload time. When haproxy is deployed as a router in a Kubernetes cluster the generated configuration file may be large and reloads are frequently occurring, which makes this a significant issue. The old algorithm is O(n^2) * allocate missing uids - O(n^2) * sort linked list - O(n^2) The new algorithm is O(n log n): * find the user allocated uids - O(n) * store them for efficient lookup - O(n log n) * allocate missing uids - n times O(log n) * sort all uids - O(n log n) * convert back to linked list - O(n) Performance examples, startup time in seconds: pat_refs old new 1000 0.02 0.01 10000 2.1 0.04 20000 12.3 0.07 30000 27.9 0.10 40000 52.5 0.14 50000 77.5 0.17 Please backport to 1.8, 2.0 and 2.1.	2020-03-06 08:11:58 +01:00
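The O(n log n) idea described in the commit above can be sketched as follows. This is a simplified illustration, not the code from pattern_finalize_config(): it works on a plain array instead of a linked list, and it replaces the lookup structure mentioned in the commit with a sorted array swept linearly. The names assign_missing_ids() and cmp_int(), and the convention that -1 means "no id chosen by the user", are assumptions made for this example.

```c
#include <assert.h>
#include <stdlib.h>

/* qsort() comparator for ints, written to avoid overflow on subtraction. */
static int cmp_int(const void *a, const void *b)
{
	int x = *(const int *)a;
	int y = *(const int *)b;

	return (x > y) - (x < y);
}

/* Hypothetical helper: ids[i] >= 0 is an id chosen by the user, ids[i] == -1
 * is an entry that still needs one. Each missing entry receives the smallest
 * id not taken by the user and not yet handed out.
 * Returns 0 on success, -1 on allocation failure.
 */
static int assign_missing_ids(int *ids, size_t n)
{
	int *used;
	size_t nused = 0, pos = 0, i;
	int next = 0;

	assert(ids != NULL || n == 0);
	if (n == 0)
		return 0;

	used = malloc(n * sizeof(*used));
	if (!used)
		return -1;

	/* collect the user-allocated ids: O(n) */
	for (i = 0; i < n; i++)
		if (ids[i] >= 0)
			used[nused++] = ids[i];

	/* sort them once: O(n log n) */
	qsort(used, nused, sizeof(*used), cmp_int);

	/* hand out the missing ids with a single sweep over the sorted array */
	for (i = 0; i < n; i++) {
		if (ids[i] >= 0)
			continue;
		while (pos < nused && used[pos] <= next) {
			if (used[pos] == next)
				next++;
			pos++;
		}
		ids[i] = next++;
	}

	free(used);
	return 0;
}
```

Because both the candidate id and the position in the sorted array only move forward, the allocation pass is linear after the sort, so the sort dominates the total cost.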
Tim Duesterhus	a17e66289c	MEDIUM: stream: Make the `unique_id` member of `struct stream` a `struct ist` The `unique_id` member of `struct stream` now is a `struct ist`.	2020-03-05 20:21:58 +01:00
Tim Duesterhus	0643b0e7e6	MINOR: proxy: Make `header_unique_id` a `struct ist` The `header_unique_id` member of `struct proxy` now is a `struct ist`.	2020-03-05 19:58:22 +01:00
Tim Duesterhus	9576ab7640	MINOR: ist: Add `struct ist istdup(const struct ist)` istdup() performs the equivalent of strdup() on a `struct ist`.	2020-03-05 19:53:12 +01:00
Tim Duesterhus	35005d01d2	MINOR: ist: Add `struct ist istalloc(size_t)` and `void istfree(struct ist*)` `istalloc` allocates memory and returns an `ist` with the size `0` that points to this allocation. `istfree` frees the pointed memory and clears the pointer.	2020-03-05 19:52:07 +01:00
Tim Duesterhus	e296d3e5f0	MINOR: ist: Add `int isttest(const struct ist)` `isttest` returns whether the `.ptr` is non-null.	2020-03-05 19:52:07 +01:00
Tim Duesterhus	241e29ef9c	MINOR: ist: Add `IST_NULL` macro `IST_NULL` is equivalent to an `struct ist` with `.ptr = NULL` and `.len = 0`.	2020-03-05 19:52:07 +01:00
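The four ist helpers introduced in the commits above (IST_NULL, isttest(), istalloc()/istfree() and istdup()) fit together roughly as shown below. This is a stand-alone sketch written from the commit descriptions only; the struct layout and the function bodies are assumptions and may differ from the actual definitions in HAProxy's ist.h.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Simplified stand-in for HAProxy's struct ist: a pointer plus a length. */
struct ist {
	char *ptr;
	size_t len;
};

/* An ist with .ptr = NULL and .len = 0. */
#define IST_NULL ((struct ist){ .ptr = NULL, .len = 0 })

/* Returns non-zero if the ist points to something. */
static int isttest(const struct ist s)
{
	return s.ptr != NULL;
}

/* Allocates <size> bytes and returns an ist of length 0 pointing to them.
 * On allocation failure the returned ist has a NULL pointer.
 */
static struct ist istalloc(size_t size)
{
	struct ist s = { .ptr = malloc(size), .len = 0 };

	return s;
}

/* Frees the pointed memory and resets the ist to IST_NULL. */
static void istfree(struct ist *s)
{
	free(s->ptr);
	*s = IST_NULL;
}

/* Equivalent of strdup() for an ist: copies <src> into a fresh allocation.
 * The result must be checked with isttest() before use.
 */
static struct ist istdup(const struct ist src)
{
	struct ist dst = istalloc(src.len);

	if (isttest(dst) && src.len) {
		memcpy(dst.ptr, src.ptr, src.len);
		dst.len = src.len;
	}
	return dst;
}
```

A caller would typically test the result of istdup() with isttest() and release it with istfree() once done, which leaves the variable in a state that can safely be tested again.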
William Lallemand	cfca1422c7	MINOR: ssl: reach a ckch_store from a sni_ctx It was only possible to go down from the ckch_store to the sni_ctx but not to go up from the sni_ctx to the ckch_store. To allow that, 2 pointers were added: - a ckch_inst pointer in the struct sni_ctx - a ckch_store pointer in the struct ckch_inst	2020-03-05 11:28:42 +01:00
William Lallemand	38df1c8006	MINOR: ssl/cli: support crt-list filters Generate a list of the previous filters when updating a certificate which uses filters in crt-list. Then pass this list to the function generating the sni_ctx during the commit. This feature allows the update of the crt-list certificates which use the filters with "set ssl cert". This function could probably be replaced by creating a new ckch_inst_new_load_store() function which takes the previous sni_ctx list as an argument instead of the char **sni_filter, avoiding the allocation/copy during runtime for each filter. But since we are still handling the multi-cert bundles, it's better this way to avoid code duplication.	2020-03-05 11:27:53 +01:00
Tim Duesterhus	127a74dd48	MINOR: stream: Add stream_generate_unique_id function Currently unique IDs for a stream are generated using repetitive code in multiple locations, possibly allowing for inconsistent behavior.	2020-03-05 07:23:00 +01:00
Willy Tarreau	899e5f69a1	MINOR: debug: use our own backtrace function on clang+x86_64 A test on FreeBSD with clang 4 to 8 produces this on a call to a spinning loop on the CLI: call trace(5): | 0x53e2bc [eb 16 48 63 c3 48 c1 e0]: wdt_handler+0x10c | 0x800e02cfe [e8 5d 83 00 00 8b 18 8b]: libthr:pthread_sigmask+0x53e with our own function it correctly produces this: call trace(20): | 0x53e2dc [eb 16 48 63 c3 48 c1 e0]: wdt_handler+0x10c | 0x800e02cfe [e8 5d 83 00 00 8b 18 8b]: libthr:pthread_sigmask+0x53e | 0x800e022bf [48 83 c4 38 5b 41 5c 41]: libthr:pthread_getspecific+0xdef | 0x7ffffffff003 [48 8d 7c 24 10 6a 00 48]: main+0x7fffffb416f3 | 0x801373809 [85 c0 0f 84 6f ff ff ff]: libc:__sys_gettimeofday+0x199 | 0x801373709 [89 c3 85 c0 75 a6 48 8b]: libc:__sys_gettimeofday+0x99 | 0x801371c62 [83 f8 4e 75 0f 48 89 df]: libc:gettimeofday+0x12 | 0x51fa0a [48 89 df 4c 89 f6 e8 6b]: ha_thread_dump_all_to_trash+0x49a | 0x4b723b [85 c0 75 09 49 8b 04 24]: mworker_cli_sockpair_new+0xd9b | 0x4b6c68 [85 c0 75 08 4c 89 ef e8]: mworker_cli_sockpair_new+0x7c8 | 0x532f81 [4c 89 e7 48 83 ef 80 41]: task_run_applet+0xe1 So let's add clang+x86_64 to the list of platforms that will use our simplified version. As a bonus it will not require to link with -lexecinfo on FreeBSD and will work out of the box when passing USE_BACKTRACE=1.	2020-03-04 12:04:07 +01:00
Willy Tarreau	13faf16e1e	MINOR: debug: improve backtrace() on aarch64 and possibly other systems It happens that on aarch64 backtrace() only returns one entry (tested with gcc 4.7.4, 5.5.0 and 7.4.1). It probably refrains from unwinding the stack due to the risk of hitting a bad pointer. Here we can use may_access() to know when it's safe, so we can actually unwind the stack without taking risks. It happens that the faulting function (the one just after the signal handler) is not listed here, very likely because the signal handler uses a special stack and did not create a new frame. So this patch creates a new my_backtrace() function in standard.h that either calls backtrace() or does its own unrolling. The choice depends on HA_HAVE_WORKING_BACKTRACE which is set in compat.h based on the build target.	2020-03-04 12:04:07 +01:00
Emmanuel Hocdet	842e94ee06	MINOR: ssl: add "ca-verify-file" directive It's only available for bind line. "ca-verify-file" allows to separate CA certificates from "ca-file". CA names sent in the server hello message are only computed from "ca-file". Typically, "ca-file" must be defined with intermediate certificates and "ca-verify-file" with the certificates ending the chain, like root CA. Fix issue #404.	2020-03-04 11:53:11 +01:00
Willy Tarreau	eb8b1ca3eb	MINOR: tools: add resolve_sym_name() to resolve function pointers We use various hacks at a few places to try to identify known function pointers in debugging outputs (show threads & show fd). Let's centralize this into a new function dedicated to this. It already knows about the functions matched by "show threads" and "show fd", and when built with USE_DL, it can rely on dladdr1() to resolve other functions. There are some limitations, as static functions are not resolved, linking with -rdynamic is mandatory, and even then some functions will not necessarily appear. It's possible to do a better job by rebuilding the whole symbol table from the ELF headers in memory but it's less portable and the gains are still limited, so this solution remains a reasonable tradeoff.	2020-03-03 18:18:40 +01:00
Willy Tarreau	762fb3ec8e	MINOR: tools: add new function dump_addr_and_bytes() This function dumps <n> bytes from <addr> in hex form into buffer <buf> enclosed in brackets after the address itself, formatted on 14 chars including the "0x" prefix. This is meant to be used as a prefix for code areas. For example: "0x7f10b6557690 [48 c7 c0 0f 00 00 00 0f]: " It relies on may_access() to know if the bytes are dumpable, otherwise "--" is emitted. An optional prefix is supported.	2020-03-03 17:46:37 +01:00
Willy Tarreau	27d00c0167	MINOR: task: export run_tasks_from_list This will help refine debug traces.	2020-03-03 15:26:10 +01:00
Willy Tarreau	3ebd55ee51	MINOR: haproxy: export run_poll_loop This will help refine debug traces.	2020-03-03 15:26:10 +01:00
Willy Tarreau	1827845a3d	MINOR: haproxy: export main to ease access from debugger Better just export main instead of declaring it as extern, it's cleaner and may be usable elsewhere.	2020-03-03 15:26:10 +01:00
Willy Tarreau	1ed3781e21	MINOR: fd: merge the read and write error bits into RW error We always set them both, which makes sense since errors at the FD level indicate a terminal condition for the socket that cannot be recovered. Usually this is detected via a write error, but sometimes such an error may asynchronously be reported on the read side. Let's simplify this using only the write bit and calling it RW since it's used like this everywhere, and leave the R bit spare for future use.	2020-02-28 07:42:29 +01:00
Willy Tarreau	a135ea63a6	CLEANUP: fd: remove some unneeded definitions of FD_EV_* flags There's no point in trying to be too generic for these flags as the read and write sides will soon differ a bit. Better explicitly define the flags for each direction without trying to be direction-agnostic. This clarifies the code and removes some defines.	2020-02-28 07:42:29 +01:00
Willy Tarreau	f80fe832b1	CLEANUP: fd: remove the FD_EV_STATUS aggregate This was used only by fd_recv_state() and fd_send_state(), both of which are unused. This will not work anymore once recv and send flags start to differ, so let's remove this.	2020-02-28 07:42:29 +01:00
Jerome Magnin	967d3cc105	BUG/MINOR: http_ana: make sure redirect flags don't have overlapping bits commit `c87e46881` ("MINOR: http-rules: Add a flag on redirect rules to know the rule direction") introduced a new flag for redirect rules, but its value has bits in common with REDIRECT_FLAG_DROP_QS, which makes us enter this code path in http_apply_redirect_rule(), which will then drop the query string. To fix this, just give REDIRECT_FLAG_FROM_REQ its own unique value. This must be backported where `c87e468816` is backported. This should fix issue 521.	2020-02-27 23:44:41 +01:00
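The class of bug fixed above (a new flag whose value shares bits with an existing one) can be caught with a small disjointness check. The sketch below is illustrative only: the EX_FLAG_* names and values are hypothetical and are not the values used in HAProxy's sources, and flags_are_disjoint() is a helper invented for this example.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical flag values for illustration only. */
enum ex_redirect_flag {
	EX_FLAG_DROP_QS      = 0x01,
	EX_FLAG_APPEND_SLASH = 0x02,
	EX_FLAG_FROM_REQ     = 0x04,
};

/* Returns 1 if every flag has at least one bit set and no two flags share
 * a bit, 0 otherwise.
 */
static int flags_are_disjoint(const unsigned int *flags, size_t n)
{
	unsigned int seen = 0;
	size_t i;

	assert(flags != NULL || n == 0);
	for (i = 0; i < n; i++) {
		if (flags[i] == 0 || (seen & flags[i]))
			return 0;
		seen |= flags[i];
	}
	return 1;
}
```

With overlapping values such as 0x01 and 0x03, a test like `flags & 0x01` also succeeds when only the second flag was set, which is the kind of unintended code path the commit describes.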
Willy Tarreau	2104659cd5	MEDIUM: buffer: remove the buffer_wq lock This lock was only needed to protect the buffer_wq list, but now we have the mt_list for this. This patch simply turns the buffer_wq list to an mt_list and gets rid of the lock. It's worth noting that the whole buffer_wait thing still looks totally wrong especially in a threaded context: the wakeup_cb() callback is called synchronously from any thread and may end up calling some connection code that was not expected to run on a given thread. The whole thing should probably be reworked to use tasklets instead and be a bit more centralized.	2020-02-26 10:39:36 +01:00
William Lallemand	e0f3fd5b4c	CLEANUP: ssl: move issuer_chain tree and definition Move the cert_issuer_tree outside the global_ssl structure since it's not a configuration variable. And move the declaration of the issuer_chain structure in types/ssl_sock.h	2020-02-25 15:06:40 +01:00
Willy Tarreau	226ef26056	MINOR: compiler: add new alignment macros This commit adds ALWAYS_ALIGN(), MAYBE_ALIGN() and ATOMIC_ALIGN() to be placed as delimiters inside structures to force alignment to a given size. These depend on the architecture's capabilities so that it is possible to always align, align only on archs not supporting unaligned accesses at all, or only on those not supporting them for atomic accesses (e.g. before a lock).	2020-02-25 10:34:43 +01:00
Willy Tarreau	908071171b	BUILD: general: always pass unsigned chars to is* functions The isalnum(), isalpha(), isdigit() etc functions from ctype.h are supposed to take an int in argument which must either reflect an unsigned char or EOF. In practice on some platforms they're implemented as macros referencing an array, and when passed a char, they either cause a warning "array subscript has type 'char'" when lucky, or cause random segfaults when unlucky. It's quite inconvenient by the way since none of them may return true for negative values. The recent introduction of cygwin to the list of regularly tested build platforms revealed a lot of breakage there due to the same issues again. So this patch addresses the problem all over the code at once. It adds unsigned char casts to every valid use case, and also drops the unneeded double cast to int that was sometimes added on top of it. It may be backported by dropping irrelevant changes if that helps better support uncommon platforms. It's unlikely to fix bugs on platforms which would already not emit any warning though.	2020-02-25 08:16:33 +01:00
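The cast described in the commit above looks like this in practice. The function count_alnum() is a hypothetical helper written for this illustration; it is not part of HAProxy.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>

/* Counts the alphanumeric bytes in a NUL-terminated string.
 * The cast to unsigned char prevents bytes >= 0x80 from reaching isalnum()
 * as negative ints on platforms where plain char is signed, which would be
 * undefined behavior for the <ctype.h> functions.
 */
static size_t count_alnum(const char *s)
{
	size_t n = 0;

	assert(s != NULL);
	for (; *s; s++) {
		if (isalnum((unsigned char)*s))
			n++;
	}
	return n;
}
```

Without the cast, a byte such as 0xE9 stored in a signed char would be passed as a negative value and could index before the start of a lookup table on implementations that use one.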
Willy Tarreau	03e7853581	BUILD: remove obsolete support for -mregparm / USE_REGPARM This used to be a minor optimization on ix86 where registers are scarce and the calling convention not very efficient, but this platform is not relevant enough anymore to warrant all this dirt in the code for the sake of saving 1 or 2% of performance. Modern platforms don't use this at all since their calling convention already defaults to using several registers so better get rid of this once for all.	2020-02-25 07:41:47 +01:00
Tim Duesterhus	1d48ba91d7	CLEANUP: net_helper: Do not negate the result of unlikely This patch turns the double negation of 'not unlikely' into 'likely' and then turns the negation of 'not smaller' into 'greater or equal' in an attempt to improve readability of the condition. [wt: this was not a bug but purposely written like this to improve code generation on older compilers but not needed anymore as described here: https://www.mail-archive.com/haproxy@formilux.org/msg36392.html ]	2020-02-25 07:30:49 +01:00
Tim Duesterhus	927063b892	CLEANUP: conn: Do not pass a pointer to likely Move the `!` inside the likely and negate it to unlikely. The previous version should not have caused issues, because it is converted to a boolean / integral value before being passed to __builtin_expect(), but it's certainly unusual. [wt: this was not a bug but purposely written like this to improve code generation on older compilers but not needed anymore as described here: https://www.mail-archive.com/haproxy@formilux.org/msg36392.html ]	2020-02-25 07:30:49 +01:00
Willy Tarreau	89ee79845c	MINOR: compiler: drop special cases of likely/unlikely for older compilers We used to special-case the likely()/unlikely() macros for a series of early gcc 4.x compilers which used to produce very bad code when using __builtin_expect(x,1), which basically used to build an integer (0 or 1) from a condition then compare it to integer 1. This was already fixed in 5.x, but even now, looking at the code produced by various flavors of 4.x this bad behavior couldn't be witnessed anymore. So let's consider it as fixed by now, which will allow to get rid of some ugly tricks at some specific places. A test on 4.7.4 shows that the code shrinks by about 3kB now, thanks to some tests being inlined closer to the call place and the unlikely case being moved to real functions. See the link below for more background on this. Link: https://www.mail-archive.com/haproxy@formilux.org/msg36392.html	2020-02-25 07:29:55 +01:00
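For readers unfamiliar with these macros, a common way to define likely()/unlikely() on top of __builtin_expect() is shown below. This is a generic sketch, not a copy of HAProxy's compiler.h; the fallback branch for non-GCC compilers and the sum_non_negative() example function are assumptions added for illustration.

```c
#include <assert.h>
#include <stddef.h>

#if defined(__GNUC__)
/* Tell the compiler which outcome of the condition is expected. */
#define likely(x)   (__builtin_expect((x) != 0, 1))
#define unlikely(x) (__builtin_expect((x) != 0, 0))
#else
/* No branch hint available: evaluate the condition as-is. */
#define likely(x)   (x)
#define unlikely(x) (x)
#endif

/* Sums the first <n> values of <v>. A negative value is treated as a rare
 * error and makes the function return -1.
 */
static long sum_non_negative(const int *v, size_t n)
{
	long total = 0;
	size_t i;

	assert(v != NULL || n == 0);
	for (i = 0; i < n; i++) {
		if (unlikely(v[i] < 0))
			return -1;
		total += v[i];
	}
	return total;
}
```

The hint does not change the result of the condition; it only lets the compiler lay out the expected path as the fall-through case.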
Willy Tarreau	0e2686762f	MINOR: compiler: move CPU capabilities definition from config.h and complete them These ones are irrelevant to the config but rather to the platform, and as such are better placed in compiler.h. Here we take the opportunity for declaring a few extra capabilities: - HA_UNALIGNED : CPU supports unaligned accesses - HA_UNALIGNED_LE : CPU supports unaligned accesses in little endian - HA_UNALIGNED_FAST : CPU supports fast unaligned accesses - HA_UNALIGNED_ATOMIC : CPU supports unaligned accesses in atomics This will help remove a number of #ifdefs with arch-specific statements.	2020-02-21 16:32:57 +01:00
Jerome Magnin	9dde0b2d31	MINOR: ist: add an iststop() function Add a function that finds a character in an ist and returns an updated ist with the length of the portion of the original string that doesn't contain the char. Might be backported to 2.1	2020-02-21 11:47:25 +01:00
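The behavior described for iststop() can be sketched as below. The struct is a simplified stand-in for HAProxy's struct ist, and the body is written from the commit description rather than taken from ist.h; treating an absent character as "return the whole string" is an assumption of this sketch.

```c
#include <assert.h>
#include <stddef.h>

/* Simplified stand-in for HAProxy's struct ist: a pointer plus a length. */
struct ist {
	const char *ptr;
	size_t len;
};

/* Returns the prefix of <s> that stops just before the first occurrence of
 * <chr>. If <chr> is not found, the whole string is returned.
 */
static struct ist iststop(const struct ist s, char chr)
{
	size_t len = 0;

	assert(s.ptr != NULL || s.len == 0);
	while (len < s.len && s.ptr[len] != chr)
		len++;
	return (struct ist){ .ptr = s.ptr, .len = len };
}
```

For example, stopping "host:8080" at ':' yields a 4-byte ist that still points into the original string, so no copy is made.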
Willy Tarreau	716bec2dc6	MINOR: connection: introduce a new receive flag: CO_RFL_READ_ONCE This flag is currently supported by raw_sock to perform a single recv() attempt and avoid subscribing. Typically on the request and response paths with keep-alive, with short messages we know that it's very likely that the first message is enough.	2020-02-21 11:22:45 +01:00
Willy Tarreau	5d4d1806db	CLEANUP: connection: remove the definitions of conn_xprt_{stop,want}_{send,recv} This marks the end of the transition from the connection polling states introduced in 1.5-dev12 and the subscriptions that arrived in 1.9. The socket layer can now safely use its FD while all upper layers rely exclusively on subscriptions. These old functions were removed. Some may deserve some renaming to improve clarity though. The single call to conn_xprt_stop_both() was dropped in favor of conn_cond_update_polling() which already does the same.	2020-02-21 11:21:12 +01:00
Willy Tarreau	d1d14c3157	MINOR: connection: remove the last calls to conn_xprt_{want,stop}_* The last few calls to conn_xprt_{want,stop}_{recv,send} in the central connection code were replaced with their strictly exact equivalent fd_*, adding the call to conn_ctrl_ready() when it was missing.	2020-02-21 11:21:12 +01:00
Willy Tarreau	19bc201c9f	MEDIUM: connection: remove the intermediary polling state from the connection Historically we used to require that the connections held the desired polling states for the data layer and the socket layer. Then with muxes these were more or less merged into the transport layer, and now it happens that with all transport layers having their own state, the "transport layer state" as we have it in the connection (XPRT_RD_ENA, XPRT_WR_ENA) is only an exact copy of the underlying file descriptor state, but with a delay. All of this is causing some difficulties at many places in the code because there are still some locations which use the conn_want_* API to remain clean and only rely on connection, and count on a later collection call to conn_cond_update_polling(), while others need an immediate action and directly use the FD updates. Since our updates are now much cheaper, most of them being only an atomic test-and-set operation, and since our I/O callbacks are deferred, there's no benefit anymore in trying to "cache" the transient state change in the connection flags hoping to cancel them before they become an FD event. Better make such calls transparent indirections to the FD layer instead and get rid of the deferred operations which needlessly complicate the logic inside. This removes flags CO_FL_XPRT_{RD,WR}_ENA and CO_FL_WILL_UPDATE. A number of functions related to polling updates were either greatly simplified or removed. Two places were using CO_FL_XPRT_WR_ENA as a hint to know if more data were expected to be sent after a PROXY protocol or SOCKSv4 header. These ones were simply replaced with a check on the subscription which is where we ought to get the authoritative information from. Now the __conn_xprt_want_* and their conn_xprt_want_* counterparts are the same. conn_stop_polling() and conn_xprt_stop_both() are the same as well. conn_cond_update_polling() only causes errors to stop polling. 
It also becomes way more obvious that muxes should not at all employ conn_xprt_{want|stop}_{recv,send}(), and that the call to __conn_xprt_stop_recv() in case a mux failed to allocate a buffer is inappropriate, it ought to unsubscribe from reads instead. All of this definitely requires a serious cleanup.	2020-02-21 11:21:12 +01:00
Christopher Faulet	727a3f1ca3	MINOR: http-htx: Add a function to retrieve the headers size of an HTX message http_get_hdrs_size() function may now be used to get the bytes held by headers in an HTX message. It only works if the headers were not already forwarded. Metadata are not counted here.	2020-02-18 11:19:57 +01:00
Willy Tarreau	a71667c07d	BUG/MINOR: tools: also accept '+' as a valid character in an identifier The function is_idchar() was added by commit `36f586b` ("MINOR: tools: add is_idchar() to tell if a char may belong to an identifier") to ease matching of sample fetch/converter names. But it lacked support for the '+' character used in "base32+src" and "url32+src". A quick way to figure the list of supported sample fetch+converter names is to issue the following command: git grep '"[^"]",.SMP_T_.*SMP_USE_'|cut -f2 -d'"'|sort -u No more entry is reported once searching for characters not covered by is_idchar(). No backport is needed.	2020-02-17 06:37:40 +01:00
Willy Tarreau	e3b57bf92f	MINOR: sample: make sample_parse_expr() able to return an end pointer When an end pointer is passed, instead of complaining that a comma is missing after a keyword, sample_parse_expr() will silently return the pointer to the current location into this return pointer so that the caller can continue its parsing. This will be used by more complex expressions which embed sample expressions, and may even permit to embed sample expressions into arguments of other expressions.	2020-02-14 19:02:06 +01:00
Willy Tarreau	80b53ffb1c	MEDIUM: arg: make make_arg_list() stop after its own arguments The main problem we're having with argument parsing is that at the moment the caller looks for the first character looking like an end of arguments (')') and calls make_arg_list() on the sub-string inside the parenthesis. Let's first change the way it works so that make_arg_list() also consumes the parenthesis and returns the pointer to the first char not consumed. This will later permit to refine each argument parsing. For now there is no functional change.	2020-02-14 19:02:06 +01:00
Willy Tarreau	d4ad669051	MINOR: chunk: implement chunk_strncpy() to copy partial strings This does like chunk_strcpy() except that the maximum string length may be limited by the caller. A trailing zero is always appended. This is particularly handy to extract portions of strings to put into the trash for use with libc functions requiring a nul-terminated string.	2020-02-14 19:02:06 +01:00
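The bounded copy with a guaranteed trailing zero described above can be sketched like this. The struct chunk_ex and the function ex_chunk_strncpy() are hypothetical names chosen for this illustration; the real chunk_strncpy() operates on HAProxy's own buffer type and its exact size checks may differ.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical minimal chunk: a storage area, its size and the used length. */
struct chunk_ex {
	char *area;
	size_t size;
	size_t data;
};

/* Copies at most <max> bytes of <src> into <chk> and always appends a
 * trailing zero. Returns the start of the copied string, or NULL if the
 * copy plus its terminator does not fit, in which case <chk> is untouched.
 */
static char *ex_chunk_strncpy(struct chunk_ex *chk, const char *src, size_t max)
{
	size_t len = 0;

	assert(chk != NULL && src != NULL);
	while (len < max && src[len])
		len++;

	if (len + 1 > chk->size)
		return NULL;

	memcpy(chk->area, src, len);
	chk->area[len] = '\0';
	chk->data = len;
	return chk->area;
}
```

This makes it possible to extract, say, the first four bytes of "Host: example" into a scratch area and hand the result to a libc function that expects a nul-terminated string.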
Willy Tarreau	36f586b694	MINOR: tools: add is_idchar() to tell if a char may belong to an identifier This function will simply be used to find the end of config identifiers (proxies, servers, ACLs, sample fetches, converters, etc).	2020-02-14 19:02:06 +01:00
Ilya Shipitsin	88a2f0304c	CLEANUP: ssl: remove unused functions in openssl-compat.h functions SSL_SESSION_get0_id_context, SSL_CTX_get_default_passwd_cb, SSL_CTX_get_default_passwd_cb_userdata are not used anymore	2020-02-14 16:15:00 +01:00
Willy Tarreau	160ad9e38a	CLEANUP: mini-clist: simplify nested do { while(1) {} } while (0) While looking for other occurrences of do { continue; } while (0) I found these few leftovers in mini-clist where an outer loop was made around "do { } while (0)" then another loop was placed inside just to handle the continue. Let's clean this up by just removing the outer one. Most of the patch is only the inner part of the loop that is reindented. It was verified that the resulting code is the same.	2020-02-11 10:27:04 +01:00
Christopher Faulet	7716cdf450	MINOR: lua: Get the action return code on the stack when an action finishes When an action successfully finishes, the action return code (ACT_RET_*) is now retrieved from the stack, if the first element is an integer. In addition, in hlua_txn_done(), the value ACT_RET_DONE is pushed on the stack before exiting. Thus, when a script uses this function, the corresponding action still finishes with the good code. Thanks to this change, the flag HLUA_STOP is now useless. So it has been removed. It is a mandatory step to allow a lua action to return any action return code.	2020-02-06 15:13:03 +01:00
Christopher Faulet	07a718e712	CLEANUP: lua: Remove consistency check for sample fetches and actions It is not possible anymore to alter the HTTP parser state from lua sample fetches or lua actions. So there is no reason to still check for the parser state consistency.	2020-02-06 15:13:03 +01:00
Christopher Faulet	4a2c142779	MEDIUM: http-rules: Support extra headers for HTTP return actions It is now possible to append extra headers to the generated responses by HTTP return actions, while it is not based on an errorfile. For return actions based on errorfiles, these extra headers are ignored. To define an extra header, a "hdr" argument must be used with a name and a value. The value is a log-format string. For instance: http-request return status 200 hdr "x-src" "%[src]" hdr "x-dst" "%[dst]"	2020-02-06 15:13:03 +01:00
Christopher Faulet	24231ab61f	MEDIUM: http-rules: Add the return action to HTTP rules Thanks to this new action, it is now possible to return any responses from HAProxy, with any status code, based on an errorfile, a file or a string. Unlike the other internal messages generated by HAProxy, these ones are not interpreted as errors. And it is not necessary to use a file containing a full HTTP response, although it is still possible. In addition, using a log-format string or a log-format file, it is possible to have responses with a dynamic content. This action can be used on the request path or the response path. The only constraint is to have a response smaller than a buffer. And to avoid any warning the buffer space reserved to the headers rewriting should also be free. When a response is returned with a file or a string as payload, it only contains the content-length header and the content-type header, if applicable. Here are examples: http-request return content-type image/x-icon file /var/www/favicon.ico \ if { path /favicon.ico } http-request return status 403 content-type text/plain \ lf-string "Access denied. IP %[src] is blacklisted." \ if { src -f /etc/haproxy/blacklist.lst }	2020-02-06 15:12:54 +01:00
Christopher Faulet	6d0c3dfac6	MEDIUM: http: Add a ruleset evaluated on all responses just before forwarding This patch introduces the 'http-after-response' rules. These rules are evaluated at the end of the response analysis, just before the data forwarding, on ALL HTTP responses, the server ones but also all responses generated by HAProxy. Thanks to this ruleset, it is now possible for instance to add some headers to the responses generated by the stats applet. Following actions are supported : * allow * add-header * del-header * replace-header * replace-value * set-header * set-status * set-var * strict-mode * unset-var	2020-02-06 14:55:34 +01:00
Christopher Faulet	ef70e25035	MINOR: http-ana: Add a function for forward internal responses Operations performed when internal responses (redirect/deny/auth/errors) are returned are always the same. The http_forward_proxy_resp() function is added to group all of them under a unique function.	2020-02-06 14:55:34 +01:00
Christopher Faulet	72c7d8d040	MINOR: http-ana: Rely on http_reply_and_close() to handle server error The http_server_error() function now relies on http_reply_and_close(). Both do almost the same actions. In addition, http_server_error() sets the error flag and the final state flag on the stream.	2020-02-06 14:55:34 +01:00
Christopher Faulet	c87e468816	MINOR: http-rules: Add a flag on redirect rules to know the rule direction HTTP redirect rules can be evaluated on the request or the response path. So when a redirect rule is evaluated, it is important to have this information because some specific processing may be performed depending on the direction. So the REDIRECT_FLAG_FROM_REQ flag has been added. It is set when applicable on the redirect rule during the parsing. This patch is mandatory to fix a bug on redirect rule. It must be backported to all stable versions.	2020-02-06 14:55:34 +01:00
Christopher Faulet	a4168434a7	MINOR: dns: Dynamically allocate dns options to reduce the act_rule size <.arg.dns.dns_opts> field in the act_rule structure is now dynamically allocated when a do-resolve rule is parsed. This drastically reduces the structure size.	2020-02-06 14:55:34 +01:00
Christopher Faulet	7651362e52	MINOR: htx/channel: Add a function to copy an HTX message in a channel's buffer The channel_htx_copy_msg() function can now be used to copy an HTX message in a channel's buffer. This function takes care to not overwrite existing data. This patch depends on the commit "MINOR: htx: Add a function to append an HTX message to another one". Both are mandatory to fix a bug in http_reply_and_close() function. Be careful to backport both first.	2020-02-06 14:55:16 +01:00
Christopher Faulet	0ea0c86753	MINOR: htx: Add a function to append an HTX message to another one the htx_append_msg() function can now be used to append an HTX message to another one. All the message is copied or nothing. If an error occurs during the copy, all changes are rolled back. This patch is mandatory to fix a bug in http_reply_and_close() function. Be careful to backport it first.	2020-02-06 14:54:47 +01:00
Olivier Houchard	1c7c0d6b97	BUG/MAJOR: memory: Don't forget to unlock the rwlock if the pool is empty. In __pool_get_first(), don't forget to unlock the pool lock if the pool is empty, otherwise no writer will be able to take the lock, and as it is done when reloading, it leads to an infinite loop on reload. This should be backported with commit `04f5fe87d3`	2020-02-03 13:05:31 +01:00
Olivier Houchard	04f5fe87d3	BUG/MEDIUM: memory: Add a rwlock before freeing memory. When using lockless pools, add a new rwlock, flush_pool. read-lock it when getting memory from the pool, so that concurrent accesses are still authorized, but write-lock it when we're about to free memory, in pool_flush() and pool_gc(). The problem is, when removing an item from the pool, we dereference it to get the next one, however, that pointer may have been freed in the meantime, and that could provoke a crash if the pointer has been unmapped. It should be OK to use a rwlock, as normal operations will still be able to access the pool concurrently, and calls to pool_flush() and pool_gc() should be pretty rare. This should be backported to 2.1, 2.0 and 1.9.	2020-02-01 18:08:34 +01:00
Willy Tarreau	b30a153cd1	MINOR: task: detect self-wakeups on tl==sched->current instead of TASK_RUNNING This is exactly what we want to detect (a task/tasklet waking itself), so let's use the proper condition for this.	2020-01-31 17:45:10 +01:00
Willy Tarreau	bb238834da	MINOR: task: permanently flag tasklets waking themselves up Commit `a17664d829` ("MEDIUM: tasks: automatically requeue into the bulk queue an already running tasklet") tried to inflict a penalty to self-requeuing tasks/tasklets which correspond to those involved in large, high-latency data transfers, for the benefit of all other processing which requires a low latency. However, it turns out that while it ought to do this on a case-by-case basis, basing itself on the RUNNING flag isn't accurate because this flag doesn't leave for tasklets, so we'd rather need a distinct flag to tag such tasklets. This commit introduces TASK_SELF_WAKING to mark tasklets acting like this. For now it's still set when TASK_RUNNING is present but this will have to change. The flag is kept across wakeups.	2020-01-31 17:45:10 +01:00
Willy Tarreau	a17664d829	MEDIUM: tasks: automatically requeue into the bulk queue an already running tasklet When a tasklet re-runs itself such as in this chain: si_cs_io_cb -> si_cs_process -> si_notify -> si_chk_rcv then we know it can easily clobber the run queue and harm latency. Now what the scheduler does when it detects this is that such a tasklet is automatically placed into the bulk list so that it's processed with the remaining CPU bandwidth only. Thanks to this the CLI becomes instantly responsive again even under heavy stress at 50 Gbps over 40kcon and 100% CPU on 16 threads.	2020-01-30 19:03:31 +01:00
Willy Tarreau	a62917b890	MEDIUM: tasks: implement 3 different tasklet classes with their own queues We used to mix high latency tasks and low latency tasklets in the same list, and to even refill bulk tasklets there, causing some unfairness in certain situations (e.g. poll-less transfers between many connections saturating the machine with similarly-sized in and out network interfaces). This patch changes the mechanism to split the load into 3 lists depending on the task/tasklet's desired classes : - URGENT: this is mainly for tasklets used as deferred callbacks - NORMAL: this is for regular tasks - BULK: this is for bulk tasks/tasklets Arbitrary ratios of max_processed are picked from each of these lists in turn, with the ability to complete in one list from what was not picked in the previous one. After some quick tests, the following setup gave apparently good results both for raw TCP with splicing and for H2-to-H1 request rate: - 0 to 75% for urgent - 12 to 50% for normal - 12 to what remains for bulk Bulk is not used yet.	2020-01-30 18:59:33 +01:00
Willy Tarreau	911db9bd29	MEDIUM: connection: use CO_FL_WAIT_XPRT more consistently than L4/L6/HANDSHAKE As mentioned in commit `c192b0ab95` ("MEDIUM: connection: remove CO_FL_CONNECTED and only rely on CO_FL_WAIT_*"), there is a lack of consistency on which flags are checked among L4/L6/HANDSHAKE depending on the code areas. A number of sample fetch functions only check for L4L6 to report MAY_CHANGE, some places only check for HANDSHAKE and many check both L4L6 and HANDSHAKE. This patch starts to make all of this more consistent by introducing a new mask CO_FL_WAIT_XPRT which is the union of L4/L6/HANDSHAKE and reports whether the transport layer is ready or not. All inconsistent call places were updated to rely on this one each time the goal was to check for the readiness of the transport layer.	2020-01-23 16:34:26 +01:00
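
The composite mask described in commit `911db9bd29` can be pictured with the minimal sketch below. The flag names come from the commit message, but the bit values are invented for the illustration, CO_FL_HANDSHAKE is reduced to a single bit (in HAProxy it is itself a mask), and `xprt_is_ready()` is a hypothetical helper, not a function from the code base.

```c
#include <stdint.h>

/* Bit values below are assumptions made for this sketch only; the real
 * definitions live in HAProxy's connection headers. */
#define CO_FL_WAIT_L4_CONN  0x00000001u  /* L4 connection not yet established */
#define CO_FL_WAIT_L6_CONN  0x00000002u  /* L6 (e.g. SSL) not yet established */
#define CO_FL_HANDSHAKE     0x00000004u  /* simplified: one bit for all handshakes */

/* Composite masks: a union of the individual wait bits. */
#define CO_FL_WAIT_L4L6  (CO_FL_WAIT_L4_CONN | CO_FL_WAIT_L6_CONN)
#define CO_FL_WAIT_XPRT  (CO_FL_WAIT_L4L6 | CO_FL_HANDSHAKE)

/* Hypothetical helper: returns 1 when no transport-level wait bit is
 * pending, 0 otherwise. Unrelated flag bits are ignored. */
static inline int xprt_is_ready(uint32_t conn_flags)
{
    return (conn_flags & CO_FL_WAIT_XPRT) == 0;
}
```

Testing one mask instead of three separate flags is what lets every call site agree on what "transport layer ready" means.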
Willy Tarreau	4450b587dd	MINOR: connection: remove CO_FL_SSL_WAIT_HS from CO_FL_HANDSHAKE Most places continue to check CO_FL_HANDSHAKE while in fact they should check CO_FL_HANDSHAKE_NOSSL, which contains all handshakes but the one dedicated to SSL renegotiation. In fact the SSL layer should be the only one checking CO_FL_SSL_WAIT_HS, so as to avoid processing data when a renegotiation is in progress, but other ones randomly include it without knowing. And ideally it should even be an internal flag that's not exposed in the connection. This patch takes CO_FL_SSL_WAIT_HS out of CO_FL_HANDSHAKE, uses this flag consistently all over the code, and gets rid of CO_FL_HANDSHAKE_NOSSL. In order to limit the confusion that has accumulated over time, the CO_FL_SSL_WAIT_HS flag which indicates an ongoing SSL handshake, possibly used by a renegotiation was moved after the other ones.	2020-01-23 16:34:26 +01:00
Willy Tarreau	c192b0ab95	MEDIUM: connection: remove CO_FL_CONNECTED and only rely on CO_FL_WAIT_* Commit `477902bd2e` ("MEDIUM: connections: Get ride of the xprt_done callback.") broke the master CLI for a very obscure reason. It happens that short requests immediately terminated by a shutdown are properly received, CS_FL_EOS is correctly set, but in si_cs_recv(), we refrain from setting CF_SHUTR on the channel because CO_FL_CONNECTED was not yet set on the connection since we've not passed again through conn_fd_handler() and it was not done in conn_complete_session(). While commit `a8a415d31a` ("BUG/MEDIUM: connections: Set CO_FL_CONNECTED in conn_complete_session()") fixed the issue, such an accident may happen again as the root cause is deeper and actually comes down to the fact that CO_FL_CONNECTED is lazily set at various check points in the code but not every time we drop one wait bit. It is not the first time we face this situation. Originally this flag was used to detect the transition between WAIT_* and CONNECTED in order to call ->wake() from the FD handler. But since at least 1.8-dev1 with commit `7bf3fa3c23` ("BUG/MAJOR: connection: update CO_FL_CONNECTED before calling the data layer"), CO_FL_CONNECTED is always synchronized against the two others before being checked. Moreover, with the I/Os moved to tasklets, the decision to call the ->wake() function is performed after the I/Os in si_cs_process() and equivalent, which don't care about this transition either. So in essence, checking for CO_FL_CONNECTED has become a lazy way to check for (CO_FL_WAIT_L4_CONN \| CO_FL_WAIT_L6_CONN), but that always relies on someone else having synchronized it. This patch addresses it once and for all by killing this flag and only checking the two others (for which a composite mask CO_FL_WAIT_L4L6 was added). 
This revealed a number of inconsistencies that were purposely not addressed here for the sake of bisectability: - while most places do check both L4+L6 and HANDSHAKE at the same time, some places like assign_server() or back_handle_st_con() and a few sample fetches looking for proxy protocol do check for L4+L6 but don't care about HANDSHAKE ; these ones will probably fail on TCP request session rules if the handshake is not complete. - some handshake handlers do validate that a connection is established at L4 but didn't clear CO_FL_WAIT_L4_CONN - the ->ctl method of mux_fcgi, mux_pt and mux_h1 only checks for L4+L6 before declaring the mux ready while the snd_buf function also checks for the handshake's completion. Likely the former should validate the handshake as well and we should get rid of these extra tests in snd_buf. - raw_sock_from_buf() would directly set CO_FL_CONNECTED and would only later clear CO_FL_WAIT_L4_CONN. - xprt_handshake would set CO_FL_CONNECTED itself without actually clearing CO_FL_WAIT_L4_CONN, which could apparently happen only if waiting for a pure Rx handshake. - most places in ssl_sock that were checking CO_FL_CONNECTED don't need to include the L4 check as an L6 check is enough to decide whether to wait for more info or not. It also becomes obvious when reading the test in si_cs_recv() that caused the failure mentioned above that once converted it doesn't make any sense anymore: having CS_FL_EOS set while still waiting for L4 and L6 to complete cannot happen since for CS_FL_EOS to be set, the other ones must have been validated. Some of these parts will still deserve further cleanup, and some of the observations above may induce some backports of potential bug fixes once totally analyzed in their context. The risk of breaking existing stuff is too high to blindly backport everything.	2020-01-23 14:41:37 +01:00
Olivier Houchard	477902bd2e	MEDIUM: connections: Get ride of the xprt_done callback. The xprt_done_cb callback was used to defer some connection initialization until we're connected and the handshakes are done. As it mostly consists of creating the mux, instead of using the callback, introduce a conn_create_mux() function, that will just call conn_complete_session() for frontend, and create the mux for backend. In h2_wake(), make sure we call the wake method of the stream_interface, as we no longer wakeup the stream task.	2020-01-22 18:56:05 +01:00
Olivier Houchard	8af03b396a	MEDIUM: streams: Always create a conn_stream in connect_server(). In connect_server(), when creating a new connection for which we don't yet know the mux (because it'll be decided by the ALPN), instead of associating the connection to the stream_interface, always create a conn_stream. This way, we have less special-casing needed. Store the conn_stream in conn->ctx, so that we can reach the upper layers if needed.	2020-01-22 18:55:59 +01:00
Emmanuel Hocdet	6b5b44e10f	BUG/MINOR: ssl: ssl_sock_load_pem_into_ckch is not consistent The "set ssl cert <filename> <payload>" CLI command should have the same result as reloading HAProxy with the updated pem file (<filename>). This is not the case: the DHparams/cert-chain is kept from the previous context if no DHparams/cert-chain is set in the context (<payload>). This patch should be backported to 2.1.	2020-01-22 15:55:55 +01:00
Adis Nezirovic	1a693fc2fd	MEDIUM: cli: Allow multiple filter entries for "show table" For complex stick tables with many entries/columns, it can be beneficial to filter using multiple criteria. The maximum number of filter entries can be controlled by defining STKTABLE_FILTER_LEN during build time. This patch can be backported to older releases.	2020-01-22 14:33:17 +01:00
Ilya Shipitsin	056c629531	BUG/MINOR: ssl: fix build on development versions of openssl-1.1.x While working on issue #429, I encountered build failures with various non-released openssl versions. Let us improve the ssl defines and switch to features, not versions, for EVP_CTRL_AEAD_SET_IVLEN and EVP_CTRL_AEAD_SET_TAG. No backport is needed as there is no valid reason to build a stable haproxy version against a development version of openssl.	2020-01-22 07:54:52 +01:00
Willy Tarreau	2086365f51	CLEANUP: pattern: remove the pat_time definition It was inherited from acl_time, introduced in 1.3.10 by commit `a84d374367` ("[MAJOR] new framework for generic ACL support") and was never ever used. Let's simply drop it now.	2020-01-22 07:44:36 +01:00
Tim Duesterhus	6a0dd73390	CLEANUP: Consistently `unsigned int` for bitfields Signed bitfields of size `1` hold the values `0` and `-1`, but are usually assigned `1`, possibly leading to subtle bugs when the value is explicitly compared against `1`.	2020-01-22 07:28:39 +01:00
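
The pitfall fixed by commit `6a0dd73390` can be reproduced with the short sketch below. The struct and function names are hypothetical. Storing `1` into a one-bit signed bitfield is an implementation-defined conversion in C; GCC and Clang wrap the value so that it reads back as `-1`, which is the behaviour the commit message describes.

```c
/* Two hypothetical structs that differ only in bitfield signedness. */
struct flags_signed   { signed int   enabled : 1; };  /* can hold 0 and -1 */
struct flags_unsigned { unsigned int enabled : 1; };  /* can hold 0 and 1  */

/* Stores <value> in a signed 1-bit field and reports whether it then
 * compares equal to 1. With GCC and Clang, storing 1 yields -1. */
static int signed_flag_equals_one(int value)
{
    struct flags_signed f;

    f.enabled = value;
    return f.enabled == 1;
}

/* Same test with an unsigned 1-bit field, where 1 is representable. */
static int unsigned_flag_equals_one(unsigned int value)
{
    struct flags_unsigned f;

    f.enabled = value;
    return f.enabled == 1;
}
```

A plain truth test such as `if (f.enabled)` works with both variants, which is why the bug only surfaces when the field is compared explicitly against `1`.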
Baptiste Assmann	13a9232ebc	MEDIUM: dns: use Additional records from SRV responses Most DNS servers provide A/AAAA records in the Additional section of a response, which correspond to the SRV records from the Answer section: ;; QUESTION SECTION: ;_http._tcp.be1.domain.tld. IN SRV ;; ANSWER SECTION: _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A1.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A8.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A5.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A6.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A4.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A3.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A2.domain.tld. _http._tcp.be1.domain.tld. 3600 IN SRV 5 500 80 A7.domain.tld. ;; ADDITIONAL SECTION: A1.domain.tld. 3600 IN A 192.168.0.1 A8.domain.tld. 3600 IN A 192.168.0.8 A5.domain.tld. 3600 IN A 192.168.0.5 A6.domain.tld. 3600 IN A 192.168.0.6 A4.domain.tld. 3600 IN A 192.168.0.4 A3.domain.tld. 3600 IN A 192.168.0.3 A2.domain.tld. 3600 IN A 192.168.0.2 A7.domain.tld. 3600 IN A 192.168.0.7 SRV record support was introduced in HAProxy 1.8 and the first design did not take into account the records from the Additional section. Instead, a new resolution is associated to each server with its relevant FQDN. This behavior generates a lot of DNS requests (1 SRV + 1 per server associated). This patch aims at fixing this by: - when a DNS response is validated, we associate A/AAAA records to relevant SRV ones - set a flag on associated servers to prevent them from running a DNS resolution for said FQDN - update server IP address with information found in the Additional section If no relevant record can be found in the Additional section, then HAProxy will fall back to running a dedicated resolution for this server, as it used to do. This behavior is the one described in RFC 2782.	2020-01-22 07:19:54 +01:00
Christopher Faulet	2f5339079b	MINOR: proxy/http-ana: Add support of extra attributes for the cookie directive It is now possible to insert any attribute when a cookie is inserted by HAProxy. Any value may be set, no check is performed except the syntax validity (CTRL chars and ';' are forbidden). For instance, it may be used to add the SameSite attribute: cookie SRV insert attr "SameSite=Strict" The attr option may be repeated to add several attributes. This patch should fix the issue #361.	2020-01-22 07:18:31 +01:00
Christopher Faulet	554c0ebffd	MEDIUM: http-rules: Support an optional error message in http deny rules It is now possible to set the error message to use when a deny rule is executed. It may be a specific error file, adding "errorfile <file>" : http-request deny deny_status 400 errorfile /etc/haproxy/errorfiles/400badreq.http It may also be an error file from an http-errors section, adding "errorfiles <name>" : http-request deny errorfiles my-errors # use 403 error from "my-errors" section When defined, this error message is set in the HTTP transaction. The tarpit rule is also concerned by this change.	2020-01-20 15:18:46 +01:00
Christopher Faulet	473e880a25	MINOR: http-ana: Add an error message in the txn and send it when defined It is now possible to set the error message to return to client in the HTTP transaction. If it is defined, this error message is used instead of proxy's errors or default errors.	2020-01-20 15:18:46 +01:00
Christopher Faulet	76edc0f29c	MEDIUM: proxy: Add a directive to reference an http-errors section in a proxy It is now possible to import in a proxy, fully or partially, error files declared in an http-errors section. It may be done using the "errorfiles" directive, followed by a name and optionally a list of status codes. If there is no status code specified, all error files of the http-errors section are imported. Otherwise, only error files associated to the listed status codes are imported. For instance : http-errors my-errors errorfile 400 ... errorfile 403 ... errorfile 404 ... frontend frt errorfiles my-errors 403 404 # ==> error 400 not imported	2020-01-20 15:18:46 +01:00
Christopher Faulet	35cd81d363	MINOR: http-htx: Add a new section to create groups of custom HTTP errors A new section may now be declared in the configuration to create global groups of HTTP errors. These groups are not linked to a proxy and are referenced by name. The section must be declared using the keyword "http-errors" followed by the group name. This name must be unique. A list of "errorfile" directives may be declared in such section. For instance: http-errors website-1 errorfile 400 /path/to/site1/400.http errorfile 404 /path/to/site1/404.http http-errors website-2 errorfile 400 /path/to/site2/400.http errorfile 404 /path/to/site2/404.http For now, it is just possible to create "http-errors" sections. There is no documentation because these groups are not used yet.	2020-01-20 15:18:46 +01:00
Christopher Faulet	5885775de1	MEDIUM: http-htx/proxy: Use a global and centralized storage for HTTP error messages All custom HTTP errors are now stored in a global tree. Proxies use references to these messages. The key used for errorfile directives is the file name as specified in the configuration. For errorloc directives, a key is created using the redirect code and the url. This means that the same custom error message is now stored only once. It may be used in several proxies or for several status codes, it is only parsed and stored once.	2020-01-20 15:18:46 +01:00
Christopher Faulet	bdf6526e94	MINOR: http-htx: Add functions to create HTX redirect message http_parse_errorloc() may now be used to create an HTTP 302 or 303 redirect message with a specific url passed as parameter. A parameter is used to know whether it is a 302 or a 303 redirect. A status code is passed as parameter. It must be one of the supported HTTP error codes to be valid. Otherwise an error is returned. It aims to be used to parse "errorloc" directives. It relies on http_load_errormsg() to do most of the job, ie converting it in HTX.	2020-01-20 15:18:45 +01:00
Christopher Faulet	5031ef58ca	MINOR: http-htx: Add functions to read a raw error file and convert it in HTX http_parse_errorfile() may now be used to parse a raw HTTP message from a file. A status code is passed as parameter. It must be one of the supported HTTP error codes to be valid. Otherwise an error is returned. It aims to be used to parse "errorfile" directives. It relies on http_load_errorfile() to do most of the job, ie reading the file content and converting it in HTX.	2020-01-20 15:18:45 +01:00
Christopher Faulet	d73b96d48c	MINOR: tcp-rules: Make tcp-request capture a custom action Now, this action uses its own dedicated function and is no longer handled "in place" during the TCP rules evaluation. Thus the action name ACT_TCP_CAPTURE is removed. The action type is set to ACT_CUSTOM and a check function is used to know if the rule depends on request contents while there is no inspect-delay.	2020-01-20 15:18:45 +01:00
Christopher Faulet	ac98d81f46	MINOR: http-rule/tcp-rules: Make track-sc* custom actions Now, these actions use their own dedicated function and are no longer handled "in place" during the TCP/HTTP rules evaluation. Thus the action names ACT_ACTION_TRK_SC0 and ACT_ACTION_TRK_SCMAX are removed. The action type is now the tracking index. Thus the function trk_idx() is no longer needed.	2020-01-20 15:18:45 +01:00
Christopher Faulet	91b3ec13c6	MEDIUM: http-rules: Make early-hint custom actions Now, the early-hint action uses its own dedicated action and is no longer handled "in place" during the HTTP rules evaluation. Thus the action name ACT_HTTP_EARLY_HINT is removed. In addition, http_add_early_hint_header() and http_reply_103_early_hints() are also removed. This part is now handled in the new action_ptr callback function.	2020-01-20 15:18:45 +01:00
Christopher Faulet	046cf44f6c	MINOR: http-rules: Make set/del-map and add/del-acl custom actions Now, these actions use their own dedicated function and are no longer handled "in place" during the HTTP rules evaluation. Thus the action names ACT_HTTP__ACL and ACT_HTTP__MAP are removed. The action type is now mapped as follows: 0 = add-acl, 1 = set-map, 2 = del-acl and 3 = del-map.	2020-01-20 15:18:45 +01:00
Christopher Faulet	d1f27e3394	MINOR: http-rules: Make set-header and add-header custom actions Now, these actions use their own dedicated function and are no longer handled "in place" during the HTTP rules evaluation. Thus the action names ACT_HTTP_SET_HDR and ACT_HTTP_ADD_VAL are removed. The action type is now set to 0 to set a header (so remove existing ones if any and add a new one) or to 1 to add a header (add without remove).	2020-01-20 15:18:45 +01:00
Christopher Faulet	92d34fe38d	MINOR: http-rules: Make replace-header and replace-value custom actions Now, these actions use their own dedicated function and are no longer handled "in place" during the HTTP rules evaluation. Thus the action names ACT_HTTP_REPLACE_HDR and ACT_HTTP_REPLACE_VAL are removed. The action type is now set to 0 to evaluate the whole header or to 1 to evaluate every comma-delimited values. The function http_transform_header_str() is renamed to http_replace_hdrs() to be more explicit and the function http_transform_header() is removed. In fact, this last one is now more or less the new action function. The lua code has been updated accordingly to use http_replace_hdrs().	2020-01-20 15:18:45 +01:00
Christopher Faulet	006f6507d7	MINOR: actions: Use an integer to set the action type <action> field in the act_rule structure is now an integer. The act_name values are used for all actions without action function (but it is not a prerequisite though) or the action will have no effect. But for all other actions, any integer value may be used, only the action function will take care of it. The default for such actions is ACT_CUSTOM.	2020-01-20 15:18:45 +01:00
Christopher Faulet	245cf795c1	MINOR: actions: Add flags to configure the action behaviour Some flags can now be set on an action when it is registered. The flags are defined in the act_flag enum. For now, only ACT_FLAG_FINAL may be set on an action to specify if it stops the rules evaluation. It is set on ACT_ACTION_ALLOW, ACT_ACTION_DENY, ACT_HTTP_REQ_TARPIT, ACT_HTTP_REQ_AUTH, ACT_HTTP_REDIR and ACT_TCP_CLOSE actions. But, when required, it may also be set on custom actions. Consequently, this flag is checked instead of the action type during the configuration parsing to trigger a warning when a rule inhibits all the following ones.	2020-01-20 15:18:45 +01:00
Christopher Faulet	105ba6cc54	MINOR: actions: Rename the act_flag enum into act_opt The flags in the act_flag enum have been renamed act_opt. It means ACT_OPT prefix is used instead of ACT_FLAG. The purpose of this patch is to reserve the action flags for the actions configuration.	2020-01-20 15:18:45 +01:00
Christopher Faulet	cd26e8a2ec	MINOR: http-rules/tcp-rules: Call the defined action function first if defined When TCP and HTTP rules are evaluated, if an action function (action_ptr field in the act_rule structure) is defined for a given action, it is now always called in priority over the test on the action type. Concretely, for now, only custom actions define it. Thus there is no change. It just leaves us the option to extend the action type beyond the existing ones in the enum.	2020-01-20 15:18:45 +01:00
Christopher Faulet	96bff76087	MINOR: actions: Regroup some info about HTTP rules in the same struct Info used by HTTP rules manipulating the message itself are split in several structures in the arg union. But it is possible to group all of them in a unique struct. Now, <arg.http> is used by most of these rules, which contains: * <arg.http.i> : an integer used as status code, nice/tos/mark/loglevel or action id. * <arg.http.str> : an IST used as header name, reason string or auth realm. * <arg.http.fmt> : a log-format compatible expression * <arg.http.re> : a regular expression used by replace rules	2020-01-20 15:18:45 +01:00
Christopher Faulet	58b3564fde	MINOR: actions: Add a function pointer to release args used by actions Arguments used by actions are never released during HAProxy deinit. Now, it is possible to specify a function to do so. ".release_ptr" field in the act_rule structure may be set during the configuration parsing to a specific deinit function depending on the action type.	2020-01-20 15:18:45 +01:00
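
The release hook introduced by commit `58b3564fde` can be illustrated with the sketch below. Only the field name `release_ptr` comes from the commit message. The structure is a simplified stand-in for `act_rule`, and `release_string_arg()` and `deinit_rule()` are hypothetical names used for the illustration.

```c
#include <stdlib.h>

/* Simplified stand-in for the rule structure: only the fields needed by
 * the sketch are modelled, and their layout is an assumption. */
struct act_rule_sketch {
    char *arg;                                      /* argument allocated at parse time */
    void (*release_ptr)(struct act_rule_sketch *);  /* optional deinit hook */
};

/* Hypothetical release function for an action owning one heap string. */
static void release_string_arg(struct act_rule_sketch *rule)
{
    free(rule->arg);
    rule->arg = NULL;
}

/* Called once per rule at deinit: the hook runs only when the action
 * registered one during configuration parsing. */
static void deinit_rule(struct act_rule_sketch *rule)
{
    if (rule->release_ptr)
        rule->release_ptr(rule);
}
```

Letting each action supply its own hook avoids having generic deinit code guess which member of the argument union is in use.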
Christopher Faulet	e00d06c99f	MINOR: http-rules: Handle all message rewrites the same way In HTTP rules, error handling during a rewrite is now handled the same way for all rules. First, allocation errors are reported as internal errors. Then, if soft rewrites are allowed, rewrite errors are ignored and only the failed_rewrites counter is incremented. Otherwise, when strict rewrites are mandatory, internal errors are returned. For now, only soft rewrites are supported. Note also that the warning sent to notify a rewrite failure was removed. It will be useless once the strict rewrites will be possible.	2020-01-20 15:18:45 +01:00
Christopher Faulet	a00071e2e5	MINOR: http-ana: Add a txn flag to support soft/strict message rewrites The HTTP_MSGF_SOFT_RW flag must now be set on the HTTP transaction to ignore rewrite errors on a message, from HTTP rules. This mode is called soft rewrites. If this flag is not set, strict rewrites are performed. In this mode, if a rewrite error occurs, an internal error is reported. For now, HTTP_MSGF_SOFT_RW is always set and there is no way to switch a transaction in strict mode.	2020-01-20 15:18:45 +01:00
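
The soft/strict distinction from commits `e00d06c99f` and `a00071e2e5` can be sketched as follows. The flag name and the `failed_rewrites` counter come from the commit messages. The flag value, the result enum, the counter struct and `on_rewrite_failure()` are assumptions made for the illustration. The sketch also assumes the counter is left untouched in strict mode, which the commit messages do not state.

```c
/* Flag value chosen arbitrarily for this sketch. */
#define HTTP_MSGF_SOFT_RW 0x00000001u

/* Hypothetical outcome of a failed rewrite. */
enum rewrite_result {
    RW_CONTINUE,        /* soft mode: failure ignored, evaluation goes on */
    RW_INTERNAL_ERROR   /* strict mode: an internal error is reported */
};

/* Hypothetical counter holder. */
struct rw_counters {
    unsigned int failed_rewrites;
};

/* Decides what to do after a rewrite failed, based on the message flags. */
static enum rewrite_result on_rewrite_failure(unsigned int msg_flags,
                                              struct rw_counters *counters)
{
    if (msg_flags & HTTP_MSGF_SOFT_RW) {
        counters->failed_rewrites++;   /* only the counter records the failure */
        return RW_CONTINUE;
    }
    return RW_INTERNAL_ERROR;
}
```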
Christopher Faulet	a08546bb5a	MINOR: counters: Remove failed_secu counter and use denied_resp instead The failed_secu counter is only used for the servers stats. It is used to report the number of denied responses. On proxies, the same info is stored in the denied_resp counter. So, it is more consistent to use the same field for servers.	2020-01-20 15:18:45 +01:00
Christopher Faulet	0159ee4032	MINOR: stats: Report internal errors in the proxies/listeners/servers stats The stats field ST_F_EINT has been added to report internal errors encountered per proxy, per listener and per server. It appears in the CLI export and on the HTML stats page.	2020-01-20 15:18:45 +01:00
Christopher Faulet	30a2a3724b	MINOR: http-rules: Add more return codes to let custom actions act as normal ones When HTTP/TCP rules are evaluated, especially HTTP ones, some results are possible for normal actions and not for custom ones. So missing return codes (ACT_RET_) have been added to let custom actions act as normal ones. Concretely following codes have been added: * ACT_RET_DENY : deny the request/response. It must be handled by the caller * ACT_RET_ABRT : abort the request/response, handled by action itself. * ACT_RET_INV : invalid request/response	2020-01-20 15:18:45 +01:00
Christopher Faulet	4d90db5f4c	MINOR: http-rules: Add a rule result to report internal error Now, when HTTP rules are evaluated, HTTP_RULE_RES_ERROR must be returned when an internal error is caught. It is a way to make the difference between a bad request or a bad response and an error during its processing.	2020-01-20 15:18:45 +01:00
Christopher Faulet	d4ce6c2957	MINOR: counters: Add a counter to report internal processing errors This counter, named 'internal_errors', has been added in frontend and backend counters. It should be used when an internal error is encountered, instead of failed_req or failed_resp.	2020-01-20 15:18:45 +01:00
Christopher Faulet	cb5501327c	BUG/MINOR: http-rules: Remove buggy deinit functions for HTTP rules Functions to deinitialize the HTTP rules are buggy. These functions do not check the action name to release the right part in the arg union. Only few info are released. For auth rules, the realm is released and there is no problem here. But the regex <arg.hdr_add.re> is always unconditionally released. So it is easy to make these functions crash. For instance, with the following rule HAProxy crashes during the deinit : http-request set-map(/path/to/map) %[src] %[req.hdr(X-Value)] For now, these functions are simply removed and we rely on the deinit function used for TCP rules (renamed as deinit_act_rules()). This patch fixes the bug. But arguments used by actions are not released at all, this part will be addressed later. This patch must be backported to all stable versions.	2020-01-20 15:18:45 +01:00
Willy Tarreau	ee1a6fc943	MINOR: connection: make the last arg of subscribe() a struct wait_event* The subscriber used to be passed as a "void param" that was systematically cast to a struct wait_event. By now it appears clear that the subscribe() call at every layer is well defined and always takes a pointer to an event subscriber of type wait_event, so let's enforce this in the functions' prototypes, remove the intermediary variables used to cast it and clean up the comments to clarify what all these functions do in their context.	2020-01-17 18:30:37 +01:00
Willy Tarreau	7872d1fc15	MEDIUM: connection: merge the send_wait and recv_wait entries In practice all callers use the same wait_event notification for any I/O so instead of keeping specific code to handle them separately, let's merge them and it will allow us to create new events later.	2020-01-17 18:30:36 +01:00
Willy Tarreau	3a9312af8f	REORG: stream/backend: move backend-specific stuff to backend.c For more than a decade we've kept all the sess_update_st_*() functions in stream.c while they're only there to work in relation with what is currently being done in backend.c (srv_redispatch_connect, connect_server, etc). Let's move all this pollution over there and take this opportunity to try to find slightly less confusing names for these old functions whose role is only to handle transitions from one specific stream-int state: sess_update_st_rdy_tcp() -> back_handle_st_rdy() sess_update_st_con_tcp() -> back_handle_st_con() sess_update_st_cer() -> back_handle_st_cer() sess_update_stream_int() -> back_try_conn_req() sess_prepare_conn_req() -> back_handle_st_req() sess_establish() -> back_establish() The last one remained in stream.c because it's more or less a completion function which does all the initialization expected on a connection success or failure, can set analysers and emit logs. The other ones could possibly slightly benefit from being modified to take a stream-int instead since it's really what they're working with, but it's unimportant here.	2020-01-17 18:30:36 +01:00
Willy Tarreau	3381bf89e3	MEDIUM: connection: get rid of CO_FL_CURR_* flags These ones used to serve as a set of switches between CO_FL_SOCK_* and CO_FL_XPRT_, and now that the SOCK layer is gone, they're always a copy of the last known CO_FL_XPRT_ ones that is resynchronized before I/O events by calling conn_refresh_polling_flags(), and that are pushed back to FDs when detecting changes with conn_xprt_polling_changes(). While these functions are not particularly heavy, what they do is totally redundant by now because the fd_want_/fd_stop_() actions already perform test-and-set operations to decide to create an entry or not, so they do the exact same thing that is done by conn_xprt_polling_changes(). As such it is pointless to call that one, and given that the only reason to keep CO_FL_CURR_* is to detect changes there, we can now remove them. Even if this does only save very few cycles, this removes a significant complexity that has been responsible for many bugs in the past, including the last one affecting FreeBSD. All tests look good, and no performance regressions were observed.	2020-01-17 17:45:12 +01:00
Willy Tarreau	e2a0eeca77	MINOR: connection: move the CO_FL_WAIT_ROOM cleanup to the reader only CO_FL_WAIT_ROOM is set by the splicing function in raw_sock, and cleared by the stream-int when splicing is disabled, as well as in conn_refresh_polling_flags() so that a new call to ->rcv_pipe() could be attempted by the I/O callbacks called from conn_fd_handler(). This clearing in conn_refresh_polling_flags() makes no sense anymore and is in no way related to the polling at all. Since we don't call them from there anymore it's better to clear it before attempting to receive, and to set it again later. So let's move this operation where it should be, in raw_sock_to_pipe() so that it's now symmetric. It was also placed in raw_sock_to_buf() so that we're certain that it gets cleared if an attempt to splice is replaced with a subsequent attempt to recv(). And these were currently already achieved by the call to conn_refresh_polling_flags(). Now it could theoretically be removed from the stream-int.	2020-01-17 17:19:27 +01:00
Willy Tarreau	17ccd1a356	BUG/MEDIUM: connection: add a mux flag to indicate splice usability Commit `c640ef1a7d` ("BUG/MINOR: stream-int: avoid calling rcv_buf() when splicing is still possible") fixed splicing in TCP and legacy mode but broke it badly in HTX mode. What happens in HTX mode is that the channel's to_forward value remains set to CHN_INFINITE_FORWARD during the whole transfer, and as such it is not a reliable signal anymore to indicate whether more data are expected or not. Thus, when data are spliced out of the mux using rcv_pipe(), even when the end is reached (that only the mux knows about), the call to rcv_buf() to get the final HTX blocks completing the message was skipped and there was often no new event to wake this up, resulting in transfer timeouts at the end of large objects. All this goes down to the fact that the channel has no more information about whether it can splice or not despite being the one having to take the decision to call rcv_pipe() or not. And we cannot afford to call rcv_buf() unconditionally because, as the commit above showed, this reduces the forwarding performance by 2 to 3 in TCP and legacy modes due to data lying in the buffer preventing splicing from being used later. The approach taken by this patch consists in offering the muxes the ability to report a bit more information to the upper layers via the conn_stream. This information could simply be to indicate that more data are awaited but the real need being to distinguish splicing and receiving, here instead we clearly report the mux's willingness to be called for splicing or not. Hence the flag's name, CS_FL_MAY_SPLICE. The mux sets this flag when it knows that its buffer is empty and that data waiting past what is currently known may be spliced, and clears it when it knows there's no more data or that the caller must fall back to rcv_buf() instead. 
The stream-int code now uses this to determine if splicing may be used or not instead of looking at the rcv_pipe() callbacks through the whole chain. And after the rcv_pipe() call, it checks the flag again to decide whether it may safely skip rcv_buf() or not. All this bitfield dance remains a bit complex and it starts to appear obvious that splicing vs reading should be a decision of the mux based on permission granted by the data layer. This would however increase the API's complexity but definitely needs to be thought about, and should even significantly simplify the data processing layer. The way it was integrated in mux-h1 will also result in no more calls to rcv_pipe() on chunked encoded data, since these ones are currently disabled at the mux level. However once the issue with chunks+splice is fixed, it will be important to explicitly check for curr_len|CHNK to set MAY_SPLICE, so that we don't call rcv_buf() after each chunk. This fix must be backported to 2.1 and 2.0.	2020-01-17 17:00:12 +01:00
Willy Tarreau	340b07e868	BUG/MAJOR: hashes: fix the signedness of the hash inputs Wietse Venema reported in the thread below that we have a signedness issue with our hashes implementations: due to the use of const char* for the input key that's often text, the crc32, sdbm, djb2, and wt6 algorithms return a platform-dependent value for binary input keys containing bytes with bit 7 set. This means that an ARM or PPC platform will hash binary inputs differently from an x86 typically. Worse, some algorithms are well defined in the industry (like CRC32) and do not provide the expected result on x86, possibly causing interoperability issues (e.g. a user-agent would fail to compare the CRC32 of a message body against the one computed by haproxy). Fortunately, and contrary to the first impression, the CRC32c variant used in the PROXY protocol processing is not affected. Thus the impact remains very limited (the vast majority of input keys are text-based, such as user-agent headers for example). This patch addresses the issue by fixing all hash functions' prototypes (even those not affected, for API consistency). A reg test will follow in another patch. The vast majority of users do not use these hashes. And among those using them, very few will pass them on binary inputs. However, for the rare ones doing it, this fix MAY have an impact during the upgrade. For example if the package is upgraded on one LB then on another one, and the CRC32 of a binary input is used as a stick table key (why?) then these CRCs will not match between both nodes. Similarly, if "hash-type ... crc32" is used, LB inconsistency may appear during the transition. For this reason it is preferable to apply the patch on all nodes using such hashes at the same time. Systems upgraded via their distros will likely observe the least impact since they're expected to be upgraded within a short time frame.
And it is important for distros NOT to skip this fix, in order to avoid distributing an incompatible implementation of a hash. This is the reason why this patch is tagged as MAJOR, even though it's extremely unlikely that anyone will ever notice a change at all. This patch must be backported to all supported branches since the hashes were introduced in 1.5-dev20 (commit `98634f0c`). Some parts may be dropped since implemented later. Link to Wietse's report: https://marc.info/?l=postfix-users&m=157879464518535&w=2	2020-01-16 08:23:42 +01:00
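The signedness problem described in this commit can be reproduced with a small sketch. The two functions below are hypothetical illustrations of a djb2-style hash, not HAProxy's code: one reads the key as signed bytes (what plain `char` amounts to on platforms where it is signed), the other as unsigned bytes. They agree on 7-bit text and diverge as soon as a byte has bit 7 set.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical djb2-style hash reading the key as SIGNED bytes: a byte
 * such as 0x80 is sign-extended to -128 before being added.
 */
uint32_t djb2_signed(const void *key, size_t len)
{
	const signed char *p = key;
	uint32_t hash = 5381;

	while (len--)
		hash = hash * 33 + (uint32_t)*p++;
	return hash;
}

/* Same algorithm reading the key as UNSIGNED bytes: 0x80 is added as 128
 * on every platform, which makes the result portable.
 */
uint32_t djb2_unsigned(const void *key, size_t len)
{
	const unsigned char *p = key;
	uint32_t hash = 5381;

	while (len--)
		hash = hash * 33 + (uint32_t)*p++;
	return hash;
}
```

With the single-byte key 0x80 the two variants differ by 256, which is the kind of mismatch two load balancers would see if only one of them had been upgraded.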
Willy Tarreau	f31af9367e	MEDIUM: lua: don't call the GC as often when dealing with outgoing connections In order to properly close connections established from Lua in case a Lua context dies, the context currently automatically gets a flag HLUA_MUST_GC set whenever an outgoing connection is used. This causes the GC to be enforced on the context's death as well as on yield. First, it does not appear necessary to do it when yielding, since if the connections die they are already cleaned up. Second, the problem with the flag is that even if a connection gets properly closed, the flag is not removed and the GC continues to be called on the Lua context. The impact on performance looks quite significant, as noticed and diagnosed by Sadasiva Gujjarlapudi in the following thread: https://www.mail-archive.com/haproxy@formilux.org/msg35810.html This patch changes the flag for a counter so that each created connection increments it and each cleanly closed connection decrements it. That way we know we have to call the GC on the context's death only if the count is non-null. As reported in the thread above, the Lua performance gain is now over 20% by doing this. Thanks to Sada and Thierry for the design discussion and tests that led to this solution.	2020-01-14 10:12:31 +01:00
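The move from a sticky flag to a counter can be sketched as follows. The structure and function names are hypothetical and only model the idea stated in the commit message: each created connection increments the count, each cleanly closed one decrements it, and the GC is only needed on the context's death when the count is non-null.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the Lua context: only the counter matters here. */
struct lua_ctx_sketch {
	unsigned int open_conns; /* outgoing connections not yet cleanly closed */
};

/* An outgoing connection was created from this context. */
static inline void ctx_conn_created(struct lua_ctx_sketch *ctx)
{
	ctx->open_conns++;
}

/* An outgoing connection was cleanly closed. */
static inline void ctx_conn_closed(struct lua_ctx_sketch *ctx)
{
	if (ctx->open_conns)
		ctx->open_conns--;
}

/* On context death, the GC is only required if connections are still open. */
static inline bool ctx_needs_gc_on_death(const struct lua_ctx_sketch *ctx)
{
	return ctx->open_conns != 0;
}
```

Unlike a flag that is set once and never cleared, the counter returns to zero after a clean close, so a well-behaved script no longer pays for a forced GC.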
Olivier Houchard	3c4f40acbf	BUG/MEDIUM: tasks: Use the MT macros in tasklet_free(). In tasklet_free(), to attempt to remove ourself, use MT_LIST_DEL, we can't just use LIST_DEL(), as we theoretically could be in the shared tasklet list. This should be backported to 2.1.	2020-01-10 16:56:59 +01:00
Florian Tham	9205fea13a	MINOR: http: Add 404 to http-request deny This patch adds http status code 404 Not Found to http-request deny. See issue #80.	2020-01-08 16:15:23 +01:00
Florian Tham	272e29b5cc	MINOR: http: Add 410 to http-request deny This patch adds http status code 410 Gone to http-request deny. See issue #80.	2020-01-08 16:15:23 +01:00
Willy Tarreau	eaf05be0ee	OPTIM: polling: do not create update entries for FD removal In order to reduce the number of poller updates, we can benefit from the fact that modern pollers use sampling to report readiness and that under load they rarely report the same FD multiple times in a row. As such it's not always necessary to disable such FDs especially when we're almost certain they'll be re-enabled again and will require another set of syscalls. Now instead of creating an update for a (possibly temporary) removal, we only perform this removal if the FD is reported again as ready while inactive. In addition this is performed via another update so that alternating workloads like transfers have a chance to re-enable the FD without any syscall during the loop (typically after the data that filled a buffer have been sent). However we only do that for single-threaded FDs as the other ones require a more complex setup and are not on the critical path. This does cause a few spurious wakeups but almost totally eliminates the calls to epoll_ctl() on connections seeing intermittent traffic like HTTP/1 to a server or client.
A typical example with 100k requests for 4 kB objects over 200 connections shows that the number of epoll_ctl() calls doesn't depend on the number of requests anymore but most exclusively on the number of established connections: Before: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 57.09 0.499964 0 654361 321190 recvfrom 38.33 0.335741 0 369097 1 epoll_wait 4.56 0.039898 0 44643 epoll_ctl 0.02 0.000211 1 200 200 connect ------ ----------- ----------- --------- --------- ---------------- 100.00 0.875814 1068301 321391 total After: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 59.25 0.504676 0 657600 323630 recvfrom 40.68 0.346560 0 374289 1 epoll_wait 0.04 0.000370 0 620 epoll_ctl 0.03 0.000228 1 200 200 connect ------ ----------- ----------- --------- --------- ---------------- 100.00 0.851834 1032709 323831 total As expected there is also a slight increase of epoll_wait() calls since delaying de-activation of events can occasionally cause one spurious wakeup.	2019-12-27 16:38:47 +01:00
Willy Tarreau	19689882e6	MINOR: poller: do not call the IO handler if the FD is not active For now this almost never happens but with subsequent patches it will become more important not to uselessly call the I/O handlers if the FD is not active.	2019-12-27 16:38:47 +01:00
Willy Tarreau	0fbc318e24	CLEANUP: connection: merge CO_FL_NOTIFY_DATA and CO_FL_NOTIFY_DONE Both flags became equal in commit `82967bf9` ("MINOR: connection: adjust CO_FL_NOTIFY_DATA after removal of flags"), which already predicted the overlap between xprt_done_cb() and wake() after the removal of the DATA specific flags in 1.8. Let's simply remove CO_FL_NOTIFY_DATA since the "_DONE" version already covers everything and explains the intent well enough.	2019-12-27 16:38:47 +01:00
Willy Tarreau	4970e5adb7	REORG: connection: move tcp_connect_probe() to conn_fd_check() The function is not TCP-specific at all, it covers all FD-based sockets so let's move this where other similar functions are, in connection.c, and rename it conn_fd_check().	2019-12-27 16:38:43 +01:00
Willy Tarreau	11ef0837af	MINOR: pollers: add a new flag to indicate pollers reporting ERR & HUP In practice it's all pollers except select(). It turns out that we're keeping some legacy code only for select and enforcing it on all pollers, let's offer the pollers the ability to declare that they do not need that.	2019-12-27 14:04:33 +01:00
Lukas Tribus	a26d1e1324	BUILD: ssl: improve SSL_CTX_set_ecdh_auto compatibility SSL_CTX_set_ecdh_auto() is not defined when OpenSSL 1.1.1 is compiled with the no-deprecated option. Remove existing, incomplete guards and add a compatibility macro in openssl-compat.h, just as OpenSSL does: `bf4006a6f9/include/openssl/ssl.h (L1486)` This should be backported as far as 2.0 and probably even 1.9.	2019-12-21 06:46:55 +01:00
Rosen Penev	b3814c2ca8	BUG/MINOR: ssl: openssl-compat: Fix getm_ defines LIBRESSL_VERSION_NUMBER evaluates to 0 under OpenSSL, making the condition always true. Check for the define before checking it. Signed-off-by: Rosen Penev <rosenp@gmail.com> [wt: to be backported as far as 1.9]	2019-12-20 16:01:31 +01:00
Willy Tarreau	dd0e89a084	BUG/MAJOR: task: add a new TASK_SHARED_WQ flag to fix foreing requeuing Since 1.9 with commit `b20aa9eef3` ("MAJOR: tasks: create per-thread wait queues") a task bound to a single thread will not use locks when being queued or dequeued because the wait queue is assumed to be the owner thread's. But there exists a rare situation where this is not true: the health check tasks may be running on one thread waiting for a response, and may in parallel be requeued by another thread calling health_adjust() after detecting a response error in traffic when "observe l7" is set, and "fastinter" is lower than "inter", requiring to shorten the running check's timeout. In this case, the task being requeued was present in another thread's wait queue, thus opening a race during task_unlink_wq(), and gets requeued into the calling thread's wait queue instead of the running one's, opening a second race here. This patch aims at protecting against the risk of calling task_unlink_wq() from one thread while the task is queued on another thread, hence unlocked, by introducing a new TASK_SHARED_WQ flag. This new flag indicates that a task's position in the wait queue may be adjusted by other threads than the one currently executing it. This means that such WQ manipulations must be performed under a lock. There are two types of such tasks: - the global ones, using the global wait queue (technically speaking, those whose thread_mask has at least 2 bits set). - some local ones, which for now will be placed into the global wait queue as well in order to benefit from its lock. The flag is automatically set on initialization if the task's thread mask indicates more than one thread. The caller must also set it if it intends to let other threads update the task's expiration delay (e.g. delegated I/Os), or if it intends to change the task's affinity over time as this could lead to the same situation.
Right now only the situation described above seems to be affected by this issue, and it is very difficult to trigger, and even then, will often have no visible effect beyond stopping the checks for example once the race is met. On my laptop it is feasible with the following config, chained to httpterm: global maxconn 400 # provoke FD errors, calling health_adjust() defaults mode http timeout client 10s timeout server 10s timeout connect 10s listen px bind :8001 option httpchk /?t=50 server sback 127.0.0.1:8000 backup server-template s 0-999 127.0.0.1:8000 check port 8001 inter 100 fastinter 10 observe layer7 This patch will automatically address the case for the checks because check tasks are created with multiple threads bound and will get the TASK_SHARED_WQ flag set. If in the future more tasks need to rely on this (multi-threaded muxes for example) and the use of the global wait queue becomes a bottleneck again, then it should not be too difficult to place locks on the local wait queues and queue the task on its bound thread. This patch needs to be backported to 2.1, 2.0 and 1.9. It depends on previous patch "MINOR: task: only check TASK_WOKEN_ANY to decide to requeue a task". Many thanks to William Dauchy for providing detailed traces allowing to spot the problem.	2019-12-19 14:42:22 +01:00
Christopher Faulet	76014fd118	MEDIUM: h1-htx: Add HTX EOM block when the message is in H1_MSG_DONE state During H1 parsing, the HTX EOM block is added before switching the message state to H1_MSG_DONE. It is an exception in the way to convert an H1 message to HTX. Except for this block, the message is first switched to the right state before starting to add the corresponding HTX blocks. For instance, the message is switched in H1_MSG_DATA state and then the HTX DATA blocks are added. With this patch, the message is switched to the H1_MSG_DONE state when all data blocks or trailers were processed. It is the caller responsibility to call h1_parse_msg_eom() when the H1_MSG_DONE state is reached. This way, it is far easier to catch failures when the HTX buffer is full. The H1 and FCGI muxes have been updated accordingly. This patch may eventually be backported to 2.1 if it helps other backports.	2019-12-11 16:46:16 +01:00
Willy Tarreau	fec56c6a76	BUG/MINOR: listener: fix off-by-one in state name check As reported in issue #380, the state check in listener_state_str() is invalid as it allows state value 9 to report crap. We don't use such a state value so the issue should never happen unless the memory is already corrupted, but better clean this now while it's harmless. This should be backported to all maintained branches.	2019-12-11 15:51:37 +01:00
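The class of bug fixed here is easy to show in isolation. The sketch below is a hypothetical lookup function with placeholder state names, not the real listener_state_str(): with nine entries, valid indexes are 0 to 8, so the bounds check has to use `>=`. A `>` comparison would let the value 9 index one slot past the end of the table.

```c
#include <stddef.h>

/* Placeholder names: nine states, valid indexes are 0..8. */
static const char *const state_names[9] = {
	"S0", "S1", "S2", "S3", "S4", "S5", "S6", "S7", "S8",
};

/* Returns the name of state <st>, or "INVALID" when out of range. */
const char *state_str(unsigned int st)
{
	/* Using ">" here would accept st == 9 and read past the array. */
	if (st >= sizeof(state_names) / sizeof(state_names[0]))
		return "INVALID";
	return state_names[st];
}
```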
Willy Tarreau	d26c9f9465	BUG/MINOR: mworker: properly pass SIGTTOU/SIGTTIN to workers If a new process is started with -sf and it fails to bind, it may send a SIGTTOU to the master process in hope that it will temporarily unbind. Unfortunately this one doesn't catch it and stops to background instead of forwarding the signal to the workers. The same is true for SIGTTIN. This commit simply implements an extra signal handler for the master to deal with such signals that must be passed down to the workers. It must be backported as far as 1.8, though there the code differs in that it's entirely in haproxy.c and doesn't require an extra sig handler.	2019-12-11 14:26:53 +01:00
Willy Tarreau	c49ba52524	MINOR: tasks: split wake_expired_tasks() in two parts to avoid useless wakeups We used to have wake_expired_tasks() wake up tasks and return the next expiration delay. The problem this causes is that we have to call it just before poll() in order to consider latest timers, but this also means that we don't wake up all newly expired tasks upon return from poll(), which thus systematically requires a second poll() round. This is visible when running any scheduled task like a health check, as there are systematically two poll() calls, one with the interval, nothing is done after it, and another one with a zero delay, and the task is called: listen test bind *:8001 server s1 127.0.0.1:1111 check 09:37:38.200959 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8696843}) = 0 09:37:38.200967 epoll_wait(3, [], 200, 1000) = 0 09:37:39.202459 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8712467}) = 0 >> nothing run here, as the expired task was not woken up yet. 
09:37:39.202497 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8715766}) = 0 09:37:39.202505 epoll_wait(3, [], 200, 0) = 0 09:37:39.202513 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8719064}) = 0 >> now the expired task was woken up 09:37:39.202522 socket(AF_INET, SOCK_STREAM, IPPROTO_TCP) = 7 09:37:39.202537 fcntl(7, F_SETFL, O_RDONLY|O_NONBLOCK) = 0 09:37:39.202565 setsockopt(7, SOL_TCP, TCP_NODELAY, [1], 4) = 0 09:37:39.202577 setsockopt(7, SOL_TCP, TCP_QUICKACK, [0], 4) = 0 09:37:39.202585 connect(7, {sa_family=AF_INET, sin_port=htons(1111), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) 09:37:39.202659 epoll_ctl(3, EPOLL_CTL_ADD, 7, {EPOLLOUT, {u32=7, u64=7}}) = 0 09:37:39.202673 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8814713}) = 0 09:37:39.202683 epoll_wait(3, [{EPOLLOUT|EPOLLERR|EPOLLHUP, {u32=7, u64=7}}], 200, 1000) = 1 09:37:39.202693 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=8818617}) = 0 09:37:39.202701 getsockopt(7, SOL_SOCKET, SO_ERROR, [111], [4]) = 0 09:37:39.202715 close(7) = 0 Let's instead split the function in two parts: - the first part, wake_expired_tasks(), called just before process_runnable_tasks(), wakes up all expired tasks; it doesn't compute any timeout. - the second part, next_timer_expiry(), called just before poll(), only computes the next timeout for the current thread.
Thanks to this, all expired tasks are properly woken up when leaving poll, and each poll call's timeout remains up to date: 09:41:16.270449 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=10223556}) = 0 09:41:16.270457 epoll_wait(3, [], 200, 999) = 0 09:41:17.270130 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=10238572}) = 0 09:41:17.270157 socket(AF_INET, SOCK_STREAM, IPPROTO_TCP) = 7 09:41:17.270194 fcntl(7, F_SETFL, O_RDONLY|O_NONBLOCK) = 0 09:41:17.270204 setsockopt(7, SOL_TCP, TCP_NODELAY, [1], 4) = 0 09:41:17.270216 setsockopt(7, SOL_TCP, TCP_QUICKACK, [0], 4) = 0 09:41:17.270224 connect(7, {sa_family=AF_INET, sin_port=htons(1111), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) 09:41:17.270299 epoll_ctl(3, EPOLL_CTL_ADD, 7, {EPOLLOUT, {u32=7, u64=7}}) = 0 09:41:17.270314 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=10337841}) = 0 09:41:17.270323 epoll_wait(3, [{EPOLLOUT|EPOLLERR|EPOLLHUP, {u32=7, u64=7}}], 200, 1000) = 1 09:41:17.270332 clock_gettime(CLOCK_THREAD_CPUTIME_ID, {tv_sec=0, tv_nsec=10341860}) = 0 09:41:17.270340 getsockopt(7, SOL_SOCKET, SO_ERROR, [111], [4]) = 0 09:41:17.270367 close(7) = 0 This may be backported to 2.1 and 2.0 though it's unlikely to bring any user-visible improvement except to clarify debugging.	2019-12-11 09:42:58 +01:00
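The two-part split can be modelled with a toy timer table. Everything below is a hypothetical sketch (flat array, millisecond integers, invented names) rather than HAProxy's wait-queue code. It only shows the division of labour: one function wakes whatever has expired and computes no timeout, the other only computes the delay to hand to the poller.

```c
#include <stddef.h>

/* Hypothetical timer entry for this sketch. */
struct timer_sketch {
	int expire;   /* absolute expiration date in ms, -1 when not armed */
	int runnable; /* set once the timer has been woken up */
};

/* Part 1: wake up every expired timer; does not compute any timeout.
 * Returns the number of timers woken up.
 */
int wake_expired(struct timer_sketch *t, size_t n, int now)
{
	int woken = 0;

	for (size_t i = 0; i < n; i++) {
		if (t[i].expire >= 0 && t[i].expire <= now) {
			t[i].expire = -1;
			t[i].runnable = 1;
			woken++;
		}
	}
	return woken;
}

/* Part 2: only compute the delay until the earliest armed timer,
 * or -1 when no timer is armed (poll without timeout).
 */
int next_expiry_delay(const struct timer_sketch *t, size_t n, int now)
{
	int best = -1;

	for (size_t i = 0; i < n; i++) {
		int delay;

		if (t[i].expire < 0)
			continue;
		delay = t[i].expire > now ? t[i].expire - now : 0;
		if (best < 0 || delay < best)
			best = delay;
	}
	return best;
}
```

In a loop built this way, the wake-up step runs right after returning from the poller and before running tasks, so an expired timer is handled in the same iteration instead of requiring a second zero-delay poll.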
Willy Tarreau	440d09b244	BUG/MINOR: tasks: only requeue a task if it was already in the queue Commit `0742c314c3` ("BUG/MEDIUM: tasks: Make sure we switch wait queues in task_set_affinity().") had a slight side effect on expired timeouts, which is that when used before a timeout is updated, it will cause an existing task to be requeued earlier than its expected timeout, resulting in the next poll wakeup timeout being too early or even instant if the previous wakeup was done on a timeout. This is visible in strace when health checks are enabled because there are two poll calls, one of which has a short or zero delay. The correct solution is to only requeue a task if it was already in the queue. This can be backported to all branches having the fix above.	2019-12-11 09:21:36 +01:00
Willy Tarreau	a1d97f88e0	REORG: listener: move the global listener queue code to listener.c The global listener queue code and declarations were still lying in haproxy.c while not needed there anymore at all. This complicates the code for no reason. As a result, the global_listener_queue_task and the global_listener_queue were made static.	2019-12-10 14:16:03 +01:00
Willy Tarreau	241797a3fc	MINOR: listener: split dequeue_all_listener() in two We use it half times for the global_listener_queue and half times for a proxy's queue and this requires the callers to take care of these. Let's split it in two versions, the current one working only on the global queue and another one dedicated to proxies for the per-proxy queues. This cleans up quite a bit of code.	2019-12-10 14:14:09 +01:00
Willy Tarreau	a45a8b5171	MEDIUM: init: set NO_NEW_PRIVS by default when supported HAProxy doesn't need to call executables at run time (except when using external checks which are strongly recommended against), and is even expected to isolate itself into an empty chroot. As such, there basically is no valid reason to allow a setuid executable to be called without the user being fully aware of the risks. In a situation where haproxy would need to call external checks and/or disable chroot, exploiting a vulnerability in a library or in haproxy itself could lead to the execution of an external program. On Linux it is possible to lock the process so that any setuid bit present on such an executable is ignored. This significantly reduces the risk of privilege escalation in such a situation. This is what haproxy does by default. In case this causes a problem to an external check (for example one which would need the "ping" command), then it is possible to disable this protection by explicitly adding this directive in the global section. If enabled, it is possible to turn it back off by prefixing it with the "no" keyword. Before the option: $ socat - /tmp/sock1 <<< "expert-mode on; debug dev exec sudo /bin/id" uid=0(root) gid=0(root) groups=0(root After the option: $ socat - /tmp/sock1 <<< "expert-mode on; debug dev exec sudo /bin/id" sudo: effective uid is not 0, is /usr/bin/sudo on a file system with the 'nosuid' option set or an NFS file system without root privileges?	2019-12-06 17:20:26 +01:00
Olivier Houchard	0742c314c3	BUG/MEDIUM: tasks: Make sure we switch wait queues in task_set_affinity(). In task_set_affinity(), leave the wait_queue if any before changing the affinity, and re-enter a wait queue once it is done. If we don't do that, the task may stay in the wait queue of another thread, and we later may end up modifying that wait queue while holding no lock, which could lead to memory corruption. This should be backported to 2.1, 2.0 and 1.9.	2019-12-05 15:11:19 +01:00
Willy Tarreau	d96f1126fe	MEDIUM: init: prevent process and thread creation at runtime Some concerns are regularly raised about the risk to inherit some Lua files which make use of a fork (e.g. via os.execute()) as well as whether or not some of the bugs we fix might or might not be exploitable to run some code. Given that haproxy is event-driven, any foreground activity completely stops processing and is easy to detect, but background activity is a different story. A Lua script could very well discreetly fork a sub-process connecting to a remote location and taking commands, and some injected code could also try to hide its activity by creating a process or a thread without blocking the rest of the processing. While such activities should be extremely limited when run in an empty chroot without any permission, it would be better to get a higher assurance they cannot happen. This patch introduces something very simple: it limits the number of processes and threads to zero in the workers after the last thread was created. By doing so, it effectively instructs the system to fail on any fork() or clone() syscall. Thus any undesired activity has to happen in the foreground and is way easier to detect. This will obviously break external checks (whose concept is already totally insecure), and for this reason a new option "insecure-fork-wanted" was added to disable this protection, and it is suggested in the fork() error report from the checks. It is obviously recommended not to use it and to reconsider the reasons leading to it being enabled in the first place. If for any reason we fail to disable forks, we still start because it could be imaginable that some operating systems refuse to set this limit to zero, but in this case we emit a warning, that may or may not be reported since we're after the fork point. Ideally over the long term it should be conditioned by strict-limits and cause a hard fail.	2019-12-03 11:49:00 +01:00
Emmanuel Hocdet	e9a100e982	BUG/MINOR: ssl: fix X509 compatibility for openssl < 1.1.0 Commit `d4f9a60e` "MINOR: ssl: deduplicate ca-file" uses undeclared X509 functions when built with openssl < 1.1.0. Introduce these functions in openssl-compat.h. Fix issue #385.	2019-12-03 07:13:12 +01:00
Emmanuel Hocdet	d4f9a60ee2	MINOR: ssl: deduplicate ca-file Typically a server line like 'server-template srv 1-1000 *:443 ssl ca-file ca-certificates.crt' loads ca-certificates.crt 1000 times and the copies stay duplicated in memory. The same happens for a bind line: the ca-file is loaded for each certificate. With this patch, a given 'ca-file' is loaded only once and stays deduplicated in memory. As a corollary, this will prevent file access for ca-file when updating a certificate via CLI.	2019-11-28 11:11:20 +01:00
Willy Tarreau	cdb27e8295	MINOR: version: this is development again, update the status It's basically a revert of commit `9ca7f8cea`.	2019-11-25 20:38:32 +01:00
Willy Tarreau	2e077f8d53	[RELEASE] Released version 2.2-dev0 Released version 2.2-dev0 with the following main changes : - exact copy of 2.1.0	2019-11-25 20:36:16 +01:00
Willy Tarreau	9ca7f8ceac	MINOR: version: indicate that this version is stable Also indicate that it will get fixes till ~Q1 2021.	2019-11-25 19:47:23 +01:00
Willy Tarreau	c22d5dfeb8	MINOR: h2: add a function to report H2 error codes as strings Just like we have frame type to string, let's have error to string to improve debugging and traces.	2019-11-25 11:34:26 +01:00
Willy Tarreau	8f3ce06f14	MINOR: ist: add ist_find_ctl() This new function looks for the first control character in a string (a char whose value is between 0x00 and 0x1F included) and returns it, or NULL if there is none. It is optimized for quickly evicting non-matching strings and scans ~0.43 bytes per cycle. It can be used as an accelerator when it's needed to look up several of these characters (e.g. CR/LF/NUL).	2019-11-25 10:33:35 +01:00
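For reference, the behaviour described above (return the first character whose value is between 0x00 and 0x1F inclusive, or NULL) can be written as a naive byte-by-byte loop. This is a hypothetical, unoptimized equivalent with an invented name and a plain pointer/length signature. The real ist_find_ctl() works on an ist and is optimized, which this sketch does not attempt to reproduce.

```c
#include <stddef.h>

/* Naive reference: returns a pointer to the first control character
 * (value 0x00..0x1F) found in the <len> bytes at <s>, or NULL if none.
 */
const char *find_ctl_naive(const char *s, size_t len)
{
	for (size_t i = 0; i < len; i++) {
		if ((unsigned char)s[i] <= 0x1F)
			return s + i;
	}
	return NULL;
}
```

A loop like this is handy as a test oracle when checking a faster word-at-a-time implementation against random inputs.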
Willy Tarreau	47479eb0e7	MINOR: version: emit the link to the known bugs in output of "haproxy -v" The link to the known bugs page for the current version is built and reported there. When it is a development version (less than 2 dots), instead a link to github open issues is reported as there's no way to be sure about the current situation in this case and it's better that users report their trouble there.	2019-11-21 18:48:20 +01:00
Willy Tarreau	08dd202d73	MINOR: version: report the version status in "haproxy -v" As discussed on Discourse here: https://discourse.haproxy.org/t/haproxy-branch-support-lifetime/4466 it's not always easy for end users to know the lifecycle of the version they are using. This patch introduces a "Status" line in the output of "haproxy -vv" indicating whether it's a development, stable, long-term supported version, possibly with an estimated end of life for the branch when it can be anticipated (e.g. for stable versions). This field should be adjusted when creating a major release to reflect the new status. It may make sense to backport this to other branches to clarify the situation.	2019-11-21 18:47:54 +01:00
William Lallemand	8b453912ce	MINOR: ssl: ssl_sock_prepare_ctx() return an error code Rework ssl_sock_prepare_ctx() so it fills a buffer with the error messages instead of using ha_alert()/ha_warning(). Also returns an error code (ERR_*) instead of the number of errors.	2019-11-21 17:48:11 +01:00
Daniel Corbett	f8716914c7	MEDIUM: dns: Add resolve-opts "ignore-weight" It was noted in #48 that there are times when a configuration may use the server-template directive with SRV records and simultaneously want to control weights using an agent-check or through the runtime api. This patch adds a new option "ignore-weight" to the "resolve-opts" directive. When specified, any weight indicated within an SRV record will be ignored. This is for both initial resolution and ongoing resolution.	2019-11-21 17:25:31 +01:00
Frédéric Lécaille	ec1c10b839	MINOR: peers: Add debugging information to "show peers". This patch adds three counters to help in debugging peers protocol issues to "peer" struct: ->no_hbt counts the number of reconnection period without receiving heartbeat ->new_conn counts the number of reconnections after ->reconnect timeout expirations. ->proto_err counts the number of protocol errors.	2019-11-19 14:48:28 +01:00
Frédéric Lécaille	33cab3c0eb	MINOR: peers: Add TX/RX heartbeat counters. Add RX/TX heartbeat counters to "peer" struct to have an idea about which peer is alive or not. Dump these counters values on the CLI via "show peers" command.	2019-11-19 14:48:25 +01:00
Cédric Dufour	0d7712dff0	MINOR: stick-table: allow sc-set-gpt0 to set value from an expression Allow the sc-set-gpt0 action to set GPT0 to a value dynamically evaluated from its <expr> argument (in addition to the existing static <int> alternative).	2019-11-15 18:24:19 +01:00
Willy Tarreau	869efd5eeb	BUG/MINOR: log: make "show startup-log" use a ring buffer instead The copy of the startup logs used to rely on a re-allocated memory area on the fly, that would attempt to be delivered at once over the CLI. But if it's too large (too many warnings) it will take time to start up, and may not even show up on the CLI as it doesn't fit in a buffer. The ring buffer infrastructure solves all this with no more code, let's switch to this instead. It simply requires a parsing function to attach the ring via ring_attach_cli() and all the rest is automatically handled. Initially this was imagined as a code cleanup, until a test with a config involving 100k backends and just one occurrence of "load-server-state-from-file global" in the defaults section took approx 20 minutes to parse due to the O(N^2) cost of concatenating the warnings resulting in ~1 TB of data to be copied, while it took only 0.57s with the ring. Ideally this patch should be backported to 2.0 and 1.9, though it relies on the ring infrastructure which will then also need to be backported. Configs able to trigger the bug are uncommon, so another workaround for older versions without backporting the rings would consist in simply limiting the size of the error message in print_message() to something always printable, which will only return the first errors.	2019-11-15 15:50:16 +01:00
Christopher Faulet	0d1c2a65e8	MINOR: stats: Report max times in addition of the averages for sessions Now, for the sessions, the maximum times (queue, connect, response, total) are reported in addition to the averages over the last 1024 connections. These values are called qtime_max, ctime_max, rtime_max and ttime_max. This patch is related to #272.	2019-11-15 14:23:54 +01:00
Christopher Faulet	efb41f0d8d	MINOR: counters: Add fields to store the max observed for {q,c,d,t}_time For backends and servers, some average times for last 1024 connections are already calculated. For the moment, the averages for the time passed in the queue, the connect time, the response time (for HTTP session only) and the total time are calculated. Now, in addition, the maximum time observed for each of these values is also stored. These new counters are cleared like all other max values with the CLI command "clear counters". This patch is related to #272.	2019-11-15 14:23:21 +01:00
Christopher Faulet	e2e8c6779e	MINOR: freq_ctr: Make the sliding window sums thread-safe swrate_add() and swrate_add_scaled() now rely on the CAS atomic operation. So the sliding window sums are atomically updated.	2019-11-15 13:43:08 +01:00
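A compare-and-swap update of a sliding-window sum can be sketched with C11 atomics. Two things here are assumptions rather than facts from the commit: the update rule (remove roughly one n-th of the sum, rounded up, then add the new sample) and the use of `<stdatomic.h>` in place of HAProxy's own atomic macros. The function name is hypothetical as well.

```c
#include <assert.h>
#include <stdatomic.h>

/* Adds sample <v> to the sliding sum <sum> over a window of <n> samples.
 * The new value is computed from a snapshot and only published if nobody
 * changed the sum in between; otherwise the loop retries with the fresh
 * value that the failed compare-and-swap stored into <old>.
 */
unsigned int swrate_add_sketch(_Atomic unsigned int *sum, unsigned int n,
                               unsigned int v)
{
	unsigned int old, new;

	assert(n > 0);
	old = atomic_load(sum);
	do {
		/* assumed rule: drop ceil(old / n), then add the sample */
		new = old - (old + n - 1) / n + v;
	} while (!atomic_compare_exchange_weak(sum, &old, new));
	return new;
}
```

The retry loop is what makes the read-modify-write safe: two threads adding samples at the same time cannot both base their result on the same stale sum.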
Christopher Faulet	b2e58492b1	MEDIUM: filters: Adapt filters API to allow again TCP filtering on HTX streams This change makes the payload filtering uniform between TCP and HTTP filters. Now, in TCP, like in HTTP, there is only one callback responsible to forward data. Thus, old callbacks, tcp_data() and tcp_forward_data(), are replaced by a single callback function, tcp_payload(). This new callback gets the offset in the payload to (re)start the filtering and the maximum amount of data it can forward. It is the filter's responsibility to be compatible with HTX streams. If not, it must not set the flag FLT_CFG_FL_HTX. Because of this change, nxt and fwd offsets are no longer needed. Thus they are removed from the filter structure with their update functions, flt_change_next_size() and flt_change_forward_size(). Moreover, the trace filter has been updated accordingly. This patch breaks the compatibility with the old API. Thus it should probably not be backported. But, AFAIK, there is no TCP filter, thus the breakage is very limited.	2019-11-15 13:43:08 +01:00
Willy Tarreau	da52035a45	MINOR: memory: also poison the area on freeing Doing so sometimes helps detect some UAF situations without the overhead associated to the DEBUG_UAF define.	2019-11-15 07:06:46 +01:00
Olivier Houchard	7031e3dace	BUG/MEDIUM: tasks: Make tasklet_remove_from_tasklet_list() no matter the tasklet. In tasklet_remove_from_tasklet_list(), we can be called for a tasklet that is either in the private task list, or in the shared tasklet list. Take that into account and always use MT_LIST_DEL() to remove it, otherwise if we're in the shared list and another thread attempts to add a tasklet in it, bad things will happen. __tasklet_remove_from_tasklet_list() is left unchanged, it's only supposed to be used by process_runnable_task() to remove task/tasklets from the private task list. This should not be backported. This should fix github issue #357.	2019-11-09 18:27:17 +01:00
Christopher Faulet	fee726ffa7	MINOR: http-ana: Remove the unused function http_reset_txn() Since the legacy HTTP mode was removed, the stream is always released at the end of each HTTP transaction and a new one is created to handle the next request for keep-alive connections. So the HTTP transaction is no longer reset and the function http_reset_txn() can be removed.	2019-11-07 15:32:52 +01:00
Christopher Faulet	eea8fc737b	MEDIUM: stream/trace: Register a new trace source with its events Runtime traces are now supported for the streams, only if compiled with debug. process_stream() is covered as well as TCP/HTTP analyzers and filters. In traces, the first argument is always a stream. So it is easy to get the info about the channels and the stream-interfaces. The second argument, when defined, is always an HTTP transaction. And the third one is an HTTP message. The trace message is adapted to report HTTP info when possible.	2019-11-06 10:14:32 +01:00
Christopher Faulet	db703b1918	MINOR: trace: Add a set of macros to trace events if HA is compiled with debug The macros DBG_TRACE_*() can be used instead of existing trace macros to emit trace messages in debug mode only, i.e., when HAProxy is compiled with DEBUG_FULL or DEBUG_DEV. Otherwise, these macros do nothing. So it is possible to add traces for development purposes without impacting performance of production instances.	2019-11-06 10:14:32 +01:00
William Lallemand	21724f0807	MINOR: ssl/cli: replace the default_ctx during 'commit ssl cert' If the SSL_CTX of a previous instance (ckch_inst) was used as a default_ctx, replace the default_ctx of the bind_conf by the first SSL_CTX inserted in the SNI tree. Use the RWLOCK of the sni tree to handle the change of the default_ctx.	2019-11-04 18:16:53 +01:00
Damien Claisse	ae6f125c7b	MINOR: sample: add us/ms support to date/http_date It can sometimes be interesting to have a timestamp with a resolution of less than a second. It is currently painful to obtain this, because concatenation of date and date_us leads to a shorter timestamp during the first 100ms of a second, which is not parseable and needs ugly ACLs in configuration to prepend 0s when needed. To improve this, add an optional <unit> parameter to the date sample to report an integer with the desired unit. Also support this unit in the http_date converter to report a date string with sub-second precision.	2019-10-31 08:47:31 +01:00
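The padding problem described in ae6f125c7b can be reproduced with plain arithmetic. The sketch below is only an illustration in C; the helper names are hypothetical and it does not use any HAProxy API. It contrasts a naive concatenation of seconds and microseconds with a zero-padded one and with a single integer expressed in the desired unit.

```c
#include <assert.h>
#include <stdio.h>

/* Naive concatenation: the microsecond part loses its leading zeroes, so
 * the result is shorter whenever usec < 100000 (the first 100ms). */
static int concat_naive(char *dst, size_t size, unsigned long sec, unsigned int usec)
{
	assert(usec < 1000000);
	return snprintf(dst, size, "%lu%u", sec, usec);
}

/* Padded concatenation: always six digits for the microsecond part. */
static int concat_padded(char *dst, size_t size, unsigned long sec, unsigned int usec)
{
	assert(usec < 1000000);
	return snprintf(dst, size, "%lu%06u", sec, usec);
}

/* Single integer timestamps in milliseconds and in microseconds. */
static unsigned long long date_as_ms(unsigned long sec, unsigned int usec)
{
	return (unsigned long long)sec * 1000ULL + usec / 1000;
}

static unsigned long long date_as_us(unsigned long sec, unsigned int usec)
{
	return (unsigned long long)sec * 1000000ULL + usec;
}
```

With sec = 1572508051 and usec = 42000, the naive form produces 15 characters while the padded form produces the expected 16, which is why an integer computed in the target unit is easier to consume.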
William Lallemand	beea2a476e	CLEANUP: ssl/cli: remove leftovers of bundle/certs (it < 2) Remove the leftovers of the certificate + bundle updating in 'ssl set cert' and 'commit ssl cert'. * Remove the it variable in appctx.ctx.ssl. * Stop doing everything twice. * Indent	2019-10-30 17:52:34 +01:00
William Lallemand	bc6ca7ccaa	MINOR: ssl/cli: rework 'set ssl cert' as 'set/commit' This patch splits the 'set ssl cert' CLI command into 2 commands. The previous way of updating the certificate on the CLI was limited with the bundles. It was only able to apply one of the three parts of the certificate during an update, which means that we needed 3 updates to update a full 3 certs bundle. It was also not possible to atomically apply several parts of a certificate with the ability to rollback on error. (For example applying a .pem, then a .ocsp, then a .sctl) The command 'set ssl cert' will now duplicate the certificate (or bundle) and update it in a temporary transaction. The second command 'commit ssl cert' will commit all the changes made during the transaction for the certificate. This commit breaks the ability to update a certificate which was used as a unique file and as a bundle in the HAProxy configuration. This way of using the certificates wasn't making any sense. Example: // For a bundle: $ echo -e "set ssl cert localhost.pem.rsa <<\n$(cat kikyo.pem.rsa)\n" | socat /tmp/sock1 - Transaction created for certificate localhost.pem! $ echo -e "set ssl cert localhost.pem.dsa <<\n$(cat kikyo.pem.dsa)\n" | socat /tmp/sock1 - Transaction updated for certificate localhost.pem! $ echo -e "set ssl cert localhost.pem.ecdsa <<\n$(cat kikyo.pem.ecdsa)\n" | socat /tmp/sock1 - Transaction updated for certificate localhost.pem! $ echo "commit ssl cert localhost.pem" | socat /tmp/sock1 - Committing localhost.pem. Success!	2019-10-30 17:01:07 +01:00
William Dauchy	0fec3ab7bf	MINOR: init: always fail when setrlimit fails This patch introduces a strict-limits parameter which enforces the setrlimit setting instead of a warning. This option can be forcibly disabled with the "no" keyword. The general aim of this patch is to avoid bad surprises on a production environment where you change the maxconn for example, a new fd limit is calculated, but cannot be set because of sysfs setting. In that case you might want to have an explicit failure to be aware of it before seeing your traffic going down. During a global rollout it is also useful to explicitly fail as most progressive rollouts would simply check the general health check of the process. As discussed, plan to use the strict by default mode starting from v2.3. Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2019-10-29 17:42:27 +01:00
Olivier Houchard	6e8e2ec849	BUG/MEDIUM: stream_interface: Only use SI_ST_RDY when the mux is ready. In si_connect(), only switch the stream_interface status to SI_ST_RDY if we're reusing a connection and if the connection's mux is ready. Otherwise, maybe we're reusing a connection that is not fully established yet, and may fail, and setting SI_ST_RDY would mean we would not be able to retry to connect. This should be backported to 1.9 and 2.0. This commit depends on 55234e33708c5a584fb9efea81d71ac47235d518.	2019-10-29 14:15:20 +01:00
Olivier Houchard	9b8e11e691	MINOR: mux: Add a new method to get information about a mux. Add a new method, ctl(), to muxes. It uses an "enum mux_ctl_type" to let it know which information we're asking for, and can output it either directly by returning the expected value, or by using an optional "output" argument. Right now, the only known mux_ctl_type is MUX_STATUS, that will return 0 if the mux is not ready, or MUX_STATUS_READY if the mux is ready. We probably want to backport this to 1.9 and 2.0.	2019-10-29 14:15:20 +01:00
Willy Tarreau	2254b8ef4a	Revert "MINOR: istbuf: add b_fromist() to make a buffer from an ist" This reverts commit `9e46496d45`. It was wrong and is not reliable, depending on the compiler's version and optimization, as the struct is assigned inside a statement, thus on its own stack. It's not needed anymore now so let's remove this.	2019-10-29 13:09:14 +01:00
Willy Tarreau	20020ae804	MINOR: chunk: add chunk_istcat() to concatenate an ist after a chunk We previously relied on chunk_cat(dst, b_fromist(src)) for this but it is not reliable as the allocated buffer is inside the expression and may be on a temporary stack. While it's possible to allocate stack space for a struct and return a pointer to it, it's not possible to initialize it from a temporary variable to prevent arguments from being evaluated multiple times. Since this is only used to append an ist after a chunk, let's instead have a chunk_istcat() function to perform exactly this from a native ist. The only call place (URI computation in the cache) was updated.	2019-10-29 13:09:14 +01:00
Willy Tarreau	9b013701f1	MINOR: stats/debug: maintain a counter of debug commands issued Debug commands will usually mark the fate of the process. We'd rather have them counted and visible in a core or in stats output than trying to guess how a flag combination could happen. The counter is only incremented when the command is about to be issued however, so that failed attempts are ignored.	2019-10-24 18:38:00 +02:00
Willy Tarreau	abb9f9b057	MINOR: cli: add an expert mode to hide dangerous commands Some commands like the debug ones are not enabled by default but can be useful on some production environments. In order to avoid the temptation of using them incorrectly, let's introduce an "expert" mode for a CLI connection, which allows some commands to appear and be used. It is enabled by command "expert-mode on" which is not listed by default.	2019-10-24 18:38:00 +02:00
Willy Tarreau	86bfe146c9	REORG: move CLI access level definitions to cli.h These ones were still in global.h which is misplaced.	2019-10-24 18:38:00 +02:00
William Lallemand	705e088f0a	BUG/MINOR: ssl: fix build of X509_chain_up_ref() w/ libreSSL LibreSSL brought X509_chain_up_ref() in 2.7.5, so no need to build our own version starting from this version.	2019-10-23 23:20:08 +02:00
William Lallemand	89f5807315	BUG/MINOR: ssl: fix build with openssl < 1.1.0 `8c1cddef` ("MINOR: ssl: new functions duplicate and free a ckch_store") uses some OpenSSL refcount functions that were introduced in OpenSSL 1.0.2 and OpenSSL 1.1.0. Fix the problem by introducing them in openssl-compat.h. Fix #336.	2019-10-23 19:44:50 +02:00
William Lallemand	8f840d7e55	MEDIUM: cli/ssl: handle the creation of SSL_CTX in an IO handler To avoid affecting the traffic too much during a certificate update, create the SNIs in an IO handler which yields every 10 ckch instances. This way haproxy continues to respond even if we try to update a certificate which has 50 000 instances.	2019-10-23 11:54:51 +02:00
Willy Tarreau	403bfbb130	BUG/MEDIUM: pattern: make the pattern LRU cache thread-local and lockless As reported in issue #335, a lot of contention happens on the PATLRU lock when performing expensive regex lookups. This is absurd since the purpose of the LRU cache was to have a fast cache for expressions, thus the cache must not be shared between threads and must remain lockless. This commit makes the LRU cache thread-local and gets rid of the PATLRU lock. A test with 7 threads on 4 cores climbed from 67kH/s to 369kH/s, or a scalability factor of 5.5. Given the huge performance difference and the regression caused to users migrating from processes to threads, this should be backported at least to 2.0. Thanks to Brian Diekelman for his detailed report about this regression.	2019-10-23 07:27:25 +02:00
Willy Tarreau	8cdc167df8	BUG/MEDIUM: task: make tasklets either local or shared but not both at once Tasklets may be woken up to run on the calling thread or by a specific thread (the owner). But since we use a non-thread safe mechanism when the calling thread is also the owner, there may sometimes be collisions when two threads decide to wake the same tasklet up at the same time and one of them is the owner. This is more of a matter of usage than code, in that a tasklet usually is designed to be woken up and executed on the calling thread only (most cases) or on a specific thread. Thus it is a property of the tasklet itself as this solely depends on how the code is constructed around it. This patch performs a small change to address this. By default tasklet_new() creates a "local" tasklet, which will run on the calling thread, like in 2.0. This is done by setting tl->tid to a negative value. If the caller wants the tasklet to run exclusively on a specific thread, it just has to set tl->tid, which is already what shared tasklet callers do anyway. No backport is needed.	2019-10-18 09:04:55 +02:00
Willy Tarreau	891b5ef05a	BUG/MEDIUM: tasklet: properly compute the sleeping threads mask in tasklet_wakeup() The use of ~(1 << tid) to compute the sleeping_mask in tasklet_wakeup() will result in breakage above 32 threads, because (1<<31) = 0xFFFFFFFF80000000, and upper values will lead to theoretically undefined results, but practically will wrap over 0x1 to 0x80000000 again and indicate wrong sleeping masks. It seems that the main visible effect may be extra latency on some threads or short CPU loops on others. No backport is needed.	2019-10-18 09:00:26 +02:00
Olivier Houchard	2068ec4f89	BUG/MEDIUM: lists: Handle 1-element-lists in MT_LIST_BEHEAD(). In MT_LIST_BEHEAD(), explicitly set the next element of the prev to NULL, instead of setting it to the prev of the next. If we only had one element, then we'd set the next and the prev to the element itself, and thus it would make the element appear to be outside any list.	2019-10-17 17:48:20 +02:00
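The width issue described in 891b5ef05a comes from the constant 1 being a plain int, so the shift is done on 32 bits before the result is widened to the mask type. A minimal sketch of a width-safe computation follows, assuming a 64-bit thread mask; `all_but_thread()` is a hypothetical helper name and this is not presented as the actual patch.

```c
#include <assert.h>

/* Width of the mask used in this sketch (an assumption: one bit per
 * thread, up to 64 threads). */
#define DEMO_MASK_BITS 64U

/* Return a mask with every bit set except the one of thread <tid>.
 * The constant is typed 1ULL so that the shift is performed on 64 bits
 * and stays well defined for every tid below DEMO_MASK_BITS. */
static unsigned long long all_but_thread(unsigned int tid)
{
	assert(tid < DEMO_MASK_BITS);
	return ~(1ULL << tid);
}
```

For tid 31 and above, exactly one bit is cleared and all the other bits, including the upper 32, remain set, which is what a per-thread mask needs.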
Willy Tarreau	9e46496d45	MINOR: istbuf: add b_fromist() to make a buffer from an ist A lot of our chunk-based functions are able to work on a buffer pointer but not on an ist. Instead of duplicating all of them to also take an ist as a source, let's have a macro to make a temporary dummy buffer from an ist. This will only result in structure field manipulations that the compiler will quickly figure to eliminate them with inline functions, and in other cases it will just use 4 words in the stack before calling a function, instead of performing intermediary conversions.	2019-10-17 10:40:47 +02:00
David Carlier	a92c5cec2d	BUILD/MEDIUM: threads: rename thread_info struct to ha_thread_info On Darwin, the thread_info name exists as a standard function thus we need to rename our array to ha_thread_info to fix this conflict.	2019-10-17 07:15:17 +02:00
Christopher Faulet	065118166c	MINOR: htx: Add a flag on HTX to know when a response was generated by HAProxy The flag HTX_FL_PROXY_RESP is now set on responses generated by HAProxy, excluding responses returned by applets and services. It is an informative flag set by the applicative layer.	2019-10-16 10:03:12 +02:00
Willy Tarreau	abefa34c34	MINOR: version: make the version strings variables, not constants It currently is not possible to figure the exact haproxy version from a core file for the sole reason that the version is stored into a const string and as such ends up in the .text section that is not part of a core file. By turning them into variables we move them to the data section and they appear in core files. In order to help finding them, we just prepend an extra variable in front of them and we're able to immediately spot the version strings from a core file: $ strings core | fgrep -A2 'HAProxy version' HAProxy version follows 2.1-dev2-e0f48a-88 2019/10/15 (These are haproxy_version and haproxy_date respectively). This may be backported to 2.0 since this part is not supposed to impact anything but the developer's time spent debugging.	2019-10-16 09:56:57 +02:00
Christopher Faulet	53a899b946	CLEANUP: h1-htx: Move htx-to-h1 formatting functions from htx.c to h1_htx.c The functions "htx_*_to_h1()" have been renamed into "h1_format_htx_*()" and moved in the file h1_htx.c. It is the right place for such functions.	2019-10-14 22:28:50 +02:00
Christopher Faulet	48fa033f28	BUG/MINOR: chunk: Fix tests on the chunk size in functions copying data When raw data are copied or appended in a chunk, the result must not exceed the chunk size but it can reach it. Unlike functions to copy or append a string, there is no terminating null byte. This patch must be backported as far as 1.8. Note in 1.8, the functions chunk_cpy() and chunk_cat() don't exist.	2019-10-14 16:45:09 +02:00
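The distinction made in 48fa033f28 between raw data and strings is an off-by-one in the size test. The following sketch uses a hypothetical `struct demo_chunk` and hypothetical helpers, not the project's own chunk API, to show both bounds: raw data may fill the area completely, while a string must leave one byte for the terminating null.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical chunk: <area> holds <size> bytes, <data> of them are used. */
struct demo_chunk {
	char *area;
	size_t size;
	size_t data;
};

/* Append raw bytes: the result may reach <size> but not exceed it. */
static int demo_cat_raw(struct demo_chunk *chk, const char *src, size_t len)
{
	if (chk->data + len > chk->size)
		return 0;
	memcpy(chk->area + chk->data, src, len);
	chk->data += len;
	return 1;
}

/* Append a string: one extra byte is needed for the trailing '\0', so
 * the result must stay strictly below <size>. */
static int demo_cat_str(struct demo_chunk *chk, const char *str)
{
	size_t len = strlen(str);

	if (chk->data + len >= chk->size)
		return 0;
	memcpy(chk->area + chk->data, str, len + 1);
	chk->data += len;
	return 1;
}
```

With a 4-byte area, appending the 4 raw bytes "abcd" succeeds, whereas appending the string "abcd" is refused and only "abc" fits.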
William Lallemand	e0c51ae358	BUG/MINOR: ssl: fix build without SSL Commits `222a7c6` and `150bfa8` introduced some SSL initialization in bind_conf_alloc() which broke the build without SSL. Issue #322.	2019-10-14 11:24:17 +02:00
William Lallemand	246c0246d3	MINOR: ssl: load the ocsp in/from the ckch Don't try to load the files containing the issuer and the OCSP response each time we generate a SSL_CTX. The .ocsp and the .issuer are now loaded in the struct cert_key_and_chain only once and then loaded from this structure when creating a SSL_CTX.	2019-10-11 17:32:03 +02:00
William Lallemand	a17f4116d5	MINOR: ssl: load the sctl in/from the ckch Don't try to load the file containing the sctl each time we generate a SSL_CTX. The .sctl is now loaded in the struct cert_key_and_chain only once and then loaded from this structure when creating a SSL_CTX. Note that this now makes it possible to use sctl with multi-cert bundles.	2019-10-11 17:32:03 +02:00
William Lallemand	150bfa84e3	MEDIUM: ssl/cli: 'set ssl cert' updates a certificate from the CLI $ echo -e "set ssl cert certificate.pem <<\n$(cat certificate2.pem)\n" | socat stdio /var/run/haproxy.stat Certificate updated! The operation is locked at the ckch level with a HA_SPINLOCK_T which prevents the ckch architecture (ckch_store, ckch_inst..) from being modified at the same time. So you can't do a certificate update at the same time from multiple CLI connections. SNI trees are also locked with a HA_RWLOCK_T so reading operations are locked only during a certificate update. Bundles are supported but you need to update each file (.rsa|ecdsa|.dsa) independently. If a file is used in the configuration as a bundle AND as a unique certificate, both will be updated. Bundles, directories and crt-list are supported, however filters in crt-list are currently unsupported. The code tries to allocate every SNIs and certificate instances first, so it can rollback the operation if that was unsuccessful. If you have too many instances of the certificate (at least 20000 in my tests on my laptop), the function can take too much time and be killed by the watchdog. This will be fixed later. Also with too many certificates it's possible that socat exits before the end of the generation without displaying a message, consider changing the socat timeout in this case (-t2 for example). The size of the certificate is currently limited by the maximum size of a payload, that must fit in a buffer.	2019-10-11 17:32:03 +02:00
William Lallemand	1d29c7438e	MEDIUM: ssl: split ssl_sock_add_cert_sni() In order to allow the creation of sni_ctx in runtime, we need to split the function to allow rollback. We need to be able to allocate all sni_ctxs required before inserting them in case we need to rollback if we didn't succeed the allocation. The function was split in 2 parts. The first one ckch_inst_add_cert_sni() allocates a struct sni_ctx, fills it with the right data and inserts it in the ckch_inst's list of sni_ctx. The second will take every sni_ctx in the ckch_inst and insert them in the bind_conf's sni tree.	2019-10-11 17:32:03 +02:00
William Lallemand	9117de9e37	MEDIUM: ssl: introduce the ckch instance structure struct ckch_inst represents an instance of a certificate (ckch_node) used in a bind_conf. Every sni_ctx created for 1 ckch_node in a bind_conf is linked in this structure. This patch allocates the ckch_inst for each bind_conf and inserts the sni_ctx in its linked list.	2019-10-11 17:32:03 +02:00
William Lallemand	222a7c6ae0	MINOR: ssl: initialize explicitly the sni_ctx trees	2019-10-11 17:32:02 +02:00
William Lallemand	f6adbe9f28	REORG: ssl: move structures to ssl_sock.h	2019-10-11 17:32:02 +02:00
Olivier Houchard	804ef244c6	MINOR: lists: Fix alignment of \ when relevant. Make sure all the \ are properly aligned in macros, this contains no functional change.	2019-10-11 16:56:25 +02:00
Olivier Houchard	74715da030	MINOR: lists: Try to use local variables instead of macro arguments. When possible, use local variables instead of using the macro arguments explicitly, otherwise they may be evaluated over and over.	2019-10-11 16:56:25 +02:00
Olivier Houchard	06910464dd	MEDIUM: task: Split the tasklet list into two lists. As using an mt_list for the tasklet list is costly, instead use a regular list, but add an mt_list for tasklets woken up by other threads, to be run on the current thread. At the beginning of process_runnable_tasks(), we just take the new list, and merge it into the task_list. This should give us performance comparable to before we started using a mt_list, but allow us to use tasklet_wakeup() from other threads.	2019-10-11 16:37:41 +02:00
Willy Tarreau	d7f2bbcbe3	MINOR: list: add new macro MT_LIST_BEHEAD This macro atomically cuts the head of a list and returns the list of elements as a detached list, meaning that they're all linked together without any head. If the list was empty, NULL is returned.	2019-10-11 16:37:41 +02:00
Willy Tarreau	c32a0e522f	MINOR: lists: add new macro LIST_SPLICE_END_DETACHED This macro adds a detached list at the end of an existing list. The detached list is a list without head, containing only elements.	2019-10-11 16:37:41 +02:00
Willy Tarreau	eaa55370c3	MINOR: stats: prepare to add a description with each stat/info field Several times some users have expressed the non-intuitive aspect of some of our stat/info metrics and suggested to add some help. This patch replaces the char* arrays with an array of name_desc so that we now have some reserved room to store a description with each stat or info field. These descriptions are currently empty and not reported yet.	2019-10-10 11:30:07 +02:00
Willy Tarreau	2f39738750	MINOR: stats: support the "desc" output format modifier for info and stat Now "show info" and "show stat" can parse "desc" as an output format modifier that will be passed down the chain to add some descriptions to the fields depending on the format in use. For now it is not exploited.	2019-10-10 11:30:07 +02:00
Willy Tarreau	ab02b3f345	MINOR: stats: get rid of the STAT_SHOWADMIN flag This flag is used to decide to show the check box in front of a proxy on the HTML stat page. It is always equal to STAT_ADMIN except when the proxy has no backend capability (i.e. a pure frontend) or has no server, in which case it's only used to avoid leaving an empty column at the beginning of the table. Not only this is pretty useless, but it also causes the columns not to align well when mixing multiple proxies with or without servers. Let's simply always use STAT_ADMIN and get rid of this flag.	2019-10-10 11:30:07 +02:00
Willy Tarreau	708c41602b	MINOR: stats: replace the ST_* uri_auth flags with STAT_* We used to rely on some config flags defined in uri_auth.h set during parsing, and another set of STAT_* flags defined in stats.h set at run time, with a somewhat gray area between the two sets. This is confusing in the stats code as both are called "flags" in various functions and it's quite hard to know which one describes what. This patch cleans this up by replacing all ST_* by a newly assigned value from the STAT_* set so that we can now use unified flags to describe both the configuration and the current state. There is no functional change at all.	2019-10-10 11:30:07 +02:00
Willy Tarreau	ee4f5f83d3	MINOR: stats: get rid of the ST_CONVDONE flag This flag was added in 1.4-rc1 by commit `329f74d463` ("[BUG] uri_auth: do not attemp to convert uri_auth -> http-request more than once") to address the case where two proxies inherit the stats settings from the defaults instance, and the first one compiles the expression while the second one uses it. In this case since they use the exact same uri_auth pointer, only the first one should compile and the second one must not fail the check. This was addressed by adding an ST_CONVDONE flag indicating that the expression conversion was completed and didn't need to be done again. But this is a hack and it becomes cumbersome in the middle of the other flags which are all relevant to the stats applet. Let's instead fix it by checking if we're dealing with an alias of the defaults instance and refrain from compiling this twice. This allows us to remove the ST_CONVDONE flag. A typical config requiring this check is : defaults mode http stats auth foo:bar listen l1 bind :8080 listen l2 bind :8181 Without this (or previous) check it would complain when checking l2's validity since the rule was already built.	2019-10-10 11:30:07 +02:00
Christopher Faulet	16fdc55f79	MINOR: http: Add a function to get the authority into a URI The function http_get_authority() may be used to parse a URI and look for the authority, between the scheme and the path. An option may be used to skip the user info (part before the '@'). Most of the time, the user info will be ignored.	2019-10-09 11:05:31 +02:00
Christopher Faulet	9a67c293b9	MINOR: htx: Add 2 flags on the start-line to have more info about the uri The first flag, HTX_SL_F_HAS_AUTHORITY, is set when the uri contains an authority. For the H1, it happens when a CONNECT request is received or when an absolute uri is used. For the H2, it happens when the pseudo header ":authority" is provided. The second one, HTX_SL_F_NORMALIZED_URI, is set when the received uri is represented as an absolute uri because of the protocol requirements. For now, it is only used for h2 requests, when the pseudo headers :authority and :scheme are found. Internally, the uri is represented as an absolute uri. This flag allows us to make the difference between an absolute uri in h1 and h2.	2019-10-09 11:05:31 +02:00
Christopher Faulet	c5a3eb4e3a	MINOR: fcgi: Add function to get the string representation of a record type This function will be used to emit traces in the FCGI multiplexer.	2019-10-04 16:12:02 +02:00
Christopher Faulet	27aa65ecfb	MINOR: htx: Adapt htx_dump() to be used from traces This function now dumps info about the HTX message into a buffer, passed as argument. In addition, it is possible to only dump meta information, without the message content.	2019-10-04 15:48:55 +02:00
Christopher Faulet	af542635f7	MINOR: h1-htx: Update h1_copy_msg_data() to ease the traces in the mux-h1 This function now uses the address of the pointer to the htx message where the copy must be performed. This way, when a zero-copy is performed, there is no need to refresh the caller's htx message. It is a bit easier to do that way, especially to add traces in the mux-h1.	2019-10-04 15:46:59 +02:00
Willy Tarreau	2aaeee34da	BUG/MEDIUM: fd: HUP is an error only when write is active William reported that since commit `6b3089856f` ("MEDIUM: fd: do not use the FD_POLL_* flags in the pollers anymore") the master's CLI often fails to access sub-processes. There are two causes to this. One is that we did report FD_POLL_ERR on an FD as soon as FD_EV_SHUT_W was seen, which is automatically inherited from POLLHUP. And since we do not store the current shutdown state of an FD we can't know if the poller reports a sudden close resulting from an error or just a byproduct of a previous shutdown(WR) followed by a read0. The current patch addresses this by only considering this when the FD was active, since a shutdown FD is not active. The second issue is that somewhere down the chain, channel data are ignored if an error is reported on a channel. This results in content truncation, but this cause was not figured yet. No backport is needed.	2019-10-01 11:52:08 +02:00
Tim Duesterhus	07626eafa2	CLEANUP: proxy: Remove `proxy_tbl_by_name` It is no longer required as of `1b8e68e89a` and is no longer used when #306 is fixed.	2019-09-30 04:11:36 +02:00
Christopher Faulet	88a0db28ae	MINOR: stats: Add the support of float fields in stats It is now possible to format stats counters as floats. But the stats applet does not use it. This patch is required by the Prometheus exporter to send the time averages in seconds. If the promex change is backported, this patch must be backported first.	2019-09-27 08:49:09 +02:00
Christopher Faulet	d72665b425	CLEANUP: http-ana: Remove the unused function http_send_name_header() Because the HTTP multiplexers are now responsible to handle the option "http-send-name-header", the function http_send_name_header() can be removed.	2019-09-27 08:48:53 +02:00
Christopher Faulet	b1bb1afa47	MINOR: spoe: Support the async mode with several threads A different engine-id is now generated for each thread. So, it is possible to enable the async mode with several threads. This patch may be backported to older versions.	2019-09-26 16:51:02 +02:00
Willy Tarreau	93acfa2263	MINOR: time: add timeofday_as_iso_us() to return instant time as ISO We often need ISO time + microseconds in traces and ring buffers, thus this function does this by calling gettimeofday() and keeping a cached value of the part representing the tv_sec value, and only rewrites the microsecond part. The cache is per-thread so it's lockless and safe to use as-is. Some tests already show that it's easy to see 3-4 events in a single microsecond, thus it's likely that the nanosecond version will have to be implemented as well. But certain comments on the net suggest that some parsers are having trouble beyond microsecond, thus for now let's stick to the microsecond only.	2019-09-26 08:13:38 +02:00
Olivier Houchard	bba1a263c5	BUG/MEDIUM: tasklets: Make sure we're waking the target thread if it sleeps. Now that we can wake tasklet for other threads, make sure that if the thread is sleeping, we wake it up, or the tasklet won't be executed until it's done sleeping. That also means that, before going to sleep, and after we put our bit in sleeping_thread_mask, we have to check that nobody added a tasklet for us, just checking for global_tasks_mask isn't enough anymore.	2019-09-24 14:58:45 +02:00
Willy Tarreau	d022e9c98b	MINOR: task: introduce a thread-local "sched" variable for local scheduler stuff The aim is to gather all scheduler information related to the current thread. It simply points to task_per_thread[tid] without having to perform the operation each time. We save around 1.2 kB of code on performance sensitive paths and increase the request rate by almost 1%.	2019-09-24 11:23:30 +02:00
Willy Tarreau	d66d75656e	MINOR: task: split the tasklet vs task code in process_runnable_tasks() There are a number of tests there which are enforced on tasklets while they will never apply (various handlers, destroyed task or not, arguments, results, ...). Instead let's have a single TASK_IS_TASKLET() test and call the tasklet processing function directly, skipping all the rest. It now appears visible that the only unneeded code is the update to curr_task that is never used for tasklets, except for opportunistic reporting in the debug handler, which can only catch si_cs_io_cb, which in practice doesn't appear in any report so the extra cost incurred there is pointless. This change alone removes 700 bytes of code, mostly in process_runnable_tasks() and increases the performance by about 1%.	2019-09-24 11:23:30 +02:00
Willy Tarreau	2bd65a781e	OPTIM: listeners: use tasklets for the multi-queue rings Now that we can wake up a remote thread's tasklet, it's way more interesting to use a tasklet than a task in the accept queue, as it will avoid passing through all the scheduler. Just doing this increases the accept rate by about 4%, overall recovering the slight loss introduced by the tasklet change. In addition it makes sure that even a heavily loaded scheduler (e.g. many very fast checks) will not delay a connection accept.	2019-09-24 06:57:32 +02:00
Olivier Houchard	ff1e9f39b9	MEDIUM: tasklets: Make the tasklet list a struct mt_list. Change the tasklet code so that the tasklet list is now a mt_list. That means that tasklets now have an associated tid, for the thread they are expected to run on, and any thread can now call tasklet_wakeup() for that tasklet. One can change the associated tid with tasklet_set_tid().	2019-09-23 18:16:08 +02:00
Olivier Houchard	0cd6a976ff	MINOR: mt_lists: Give MT_LIST_ADD, MT_LIST_ADDQ and MT_LIST_DEL a return value. Make it so MT_LIST_ADD and MT_LIST_ADDQ return 1 if it managed to add the item, 0 (because it was already in a list) otherwise. Make it so MT_LIST_DEL returns 1 if it managed to remove the item from a list, or 0 otherwise (because it was in no list).	2019-09-23 18:16:08 +02:00
Olivier Houchard	cb22ad4f71	MINOR: mt_lists: Do nothing in MT_LIST_ADD/MT_LIST_ADDQ if already in list. Modify MT_LIST_ADD and MT_LIST_ADDQ to do nothing if the element is already in a list.	2019-09-23 18:16:08 +02:00
Olivier Houchard	9570ecf662	MEDIUM: servers: Use LIST_DEL_INIT() instead of LIST_DEL(). In srv_add_to_idle_list(), use LIST_DEL_INIT instead of just LIST_DEL. We're about to add the connection to a mt_list, and MT_LIST_ADD/MT_LIST_ADDQ will be modified to make sure we're not adding the element if it's already in a list.	2019-09-23 18:16:08 +02:00
Olivier Houchard	5e9b92cbff	MINOR: mt_lists: Add new macros. Add a few new macros to the mt_lists. MT_LIST_LOCK_ELT()/MT_LIST_UNLOCK_ELT() help locking/unlocking an element. This should only be used if you know for sure nobody else will remove the element from the list in the meanwhile. mt_list_for_each_entry_safe() is an iterator, similar to list_for_each_entry_safe(). It takes 5 arguments: item, list_head and member are similar to those of the non-mt variant, tmpelt is a temporary pointer to a struct mt_list, while tmpelt2 is a struct mt_list itself. MT_LIST_DEL_SELF() can be used to delete an item while parsing the list with mt_list_for_each_entry_safe(). It shouldn't be used outside, and you shouldn't use MT_LIST_DEL() while using mt_list_for_each_entry_safe().	2019-09-23 18:16:08 +02:00
Olivier Houchard	859dc80f94	MEDIUM: list: Separate "locked" list from regular list. Instead of using the same type for regular linked lists and "autolocked" linked lists, use a separate type, "struct mt_list", for the autolocked one, and introduce a set of macros, similar to the LIST_* macros, with the MT_ prefix. When we use the same entry for both regular list and autolocked list, as is done for the "list" field in struct connection, we now have to explicitly cast it to struct mt_list when using MT_ macros.	2019-09-23 18:16:08 +02:00
Christopher Faulet	78fbb9f991	MEDIUM: fcgi-app: Add FCGI application and filter The FCGI application handles all the configuration parameters used to format requests sent to an application. The configuration of an application is grouped in a dedicated section (fcgi-app <name>) and referenced in a backend to be used (use-fcgi-app <name>). To be valid, a FCGI application must at least define a document root. But it is also possible to set the default index, a regex to split the script name and the path-info from the request URI, parameters to set or unset... In addition, this patch also adds a FCGI filter, responsible for all processing on a stream.	2019-09-17 10:18:54 +02:00
Christopher Faulet	63bbf284a1	MINOR: fcgi: Add code related to FCGI protocol This code is independent and is only responsible for encoding and decoding part of the FCGI protocol.	2019-09-17 10:18:54 +02:00
Christopher Faulet	4f0f88a9d0	MEDIUM: mux-h1/h1-htx: move HTX conversion of H1 messages in dedicated file To avoid code duplication in the future mux FCGI, functions parsing H1 messages and converting them into HTX have been moved in the file h1_htx.c. Some specific parts remain in the mux H1. But most of the parsing is now generic.	2019-09-17 10:18:54 +02:00
Christopher Faulet	341fac1eb2	MINOR: http: Add function to parse value of the header Status It will be used by the mux FCGI to get the status of a response.	2019-09-17 10:18:54 +02:00
Christopher Faulet	5c6fefc8eb	MINOR: log: Provide a function to emit a log for an application Application is a generic term here. It is a module which handles its own log server list, with no dependency on a proxy. Such applications can now call the function app_log() to log messages, passing a log server list and a tag as parameters. Internally, the function __send_log() has been adapted accordingly.	2019-09-17 10:18:54 +02:00
Christopher Faulet	130cf21709	MINOR: istbuf: Add the function b_isteqi() This function compares a part of a buffer to an indirect string (ist), ignoring the case of the characters.	2019-09-17 10:18:54 +02:00
Christopher Faulet	c16929658f	MINOR: config: Support per-proxy and per-server post-check functions callbacks Most of the time, when a keyword is added in proxy section or on the server line, we need to have a post-parser callback to check the config validity for the proxy or the server which uses this keyword. It is possible to register a global post-parser callback. But all these callbacks need to loop on the proxies and servers to do their job. It is neither handy nor efficient. Instead, it is now possible to register per-proxy and per-server post-check callbacks.	2019-09-17 10:18:54 +02:00
Christopher Faulet	3ea5cbe6a4	MINOR: config: Support per-proxy and per-server deinit functions callbacks Most of the time, when any allocation is done during configuration parsing because of a new keyword in proxy section or on the server line, we must add a call in the deinit() function to release allocated resources. It is now possible to register a post-deinit callback because, at this stage, the proxies and the servers are already released. Now, it is possible to register deinit callbacks per-proxy or per-server. These callbacks will be called for each proxy and server before releasing them.	2019-09-17 10:18:54 +02:00
Christopher Faulet	e3d2a877fb	MINOR: http-ana: Remove err_state field from http_msg This field is not used anymore. In addition, the state HTTP_MSG_ERROR is now only used when an error occurred during the body forward.	2019-09-17 10:18:54 +02:00
Christopher Faulet	505adfca51	MINOR: htx: Add a flag on HTX message to report processing errors This new flag may be used to report unexpected errors because of not well formatted HTX messages (not related to a parsing error) or our incapacity to handle the processing because we reach a limit (resource exhaustion, too big headers...). It should result in an error 500 returned to the client when applicable.	2019-09-17 10:18:54 +02:00
Christopher Faulet	6338a08c34	MINOR: stats: Add JSON export from the stats page It is now possible to export stats using the JSON format from the HTTP stats page. Like for the CSV export, to export stats in JSON, you must add the option ";json" on the stats URL. It is also possible to dump the JSON schema with the option ";json-schema". Corresponding Links have been added on the HTML page. This patch fixes the issue #263.	2019-09-10 10:29:54 +02:00
Willy Tarreau	f21d17bbe8	MINOR: stats: report the number of idle connections for each server This adds two extra fields to the stats, one for the current number of idle connections and one for the configured limit. A tooltip link now appears on the HTML page to show these values in front of the active connection values. This should be backported to 2.0 and 1.9 as it's the only way to monitor the idle connections behaviour.	2019-09-08 09:30:50 +02:00
Willy Tarreau	4cae3bf631	BUG/MEDIUM: connection: don't keep more idle connections than ever needed When using "http-reuse safe", which is the default, a new incoming connection does not automatically reuse an existing connection for the first request, as we don't want to risk to lose the contents if we know the client will not be able to replay the request. A side effect to this is that when dealing with mostly http-close traffic, the reuse rate is extremely low and we keep accumulating server-side connections that may even never be reused. At some point we're limited to a ratio of file descriptors, but when the system is configured with very high FD limits, we can still reach the limit of outgoing source ports and make the system significantly slow down trying to find an available port for outgoing connections. A simple test on my laptop with ulimit 100000 and with the following config results in the load immediately dropping after a few seconds : listen l1 bind :4445 mode http server s1 127.0.0.1:8000 As can be seen, the load falls from 38k cps to 400 cps during the first 200ms (in fact when the source port table is full and connect() takes ages to find a spare port for a new connection): $ injectl464 -p 4 -o 1 -u 10 -G 127.0.0.1:4445/ -F -c -w 100 hits ^hits hits/s ^h/s bytes kB/s last errs tout htime sdht ptime 2439 2439 39338 39338 356094 5743 5743 0 0 0.4 0.5 0.4 7637 5198 38185 37666 1115002 5575 5499 0 0 0.7 0.5 0.7 7719 82 25730 820 1127002 3756 120 0 0 21.8 18.8 21.8 7797 78 19492 780 1138446 2846 114 0 0 61.4 2.5 61.4 7877 80 15754 800 1150182 2300 117 0 0 58.6 0.5 58.6 7920 43 13200 430 1156488 1927 63 0 0 58.9 0.3 58.9 At this point, lots of connections are indeed in use, for only 10 connections on the frontend side: $ ss -ant state established \| wc -l 39022 This patch makes sure we never keep more idle connections than we've ever had outstanding requests on a server. 
This way the total number of idle connections will never exceed the sum of maximum connections. Thus highly loaded servers will be able to get many connections and slightly loaded servers will keep fewer. Ideally we should apply similar limits per process and per backend, but in practice this already addresses the issues pretty well: $ injectl464 -p 4 -o 1 -u 10 -G 127.0.0.1:4445/ -F -c -w 100 hits ^hits hits/s ^h/s bytes kB/s last errs tout htime sdht ptime 4423 4423 40209 40209 645758 5870 5870 0 0 0.2 0.4 0.2 8020 3597 40100 39966 1170920 5854 5835 0 0 0.2 0.4 0.2 12037 4017 40123 40170 1757402 5858 5864 0 0 0.2 0.4 0.2 16069 4032 40172 40320 2346074 5865 5886 0 0 0.2 0.4 0.2 20047 3978 40013 39386 2926862 5842 5750 0 0 0.3 0.4 0.3 24005 3958 40008 39979 3504730 5841 5837 0 0 0.2 0.4 0.2 $ ss -ant state established \| wc -l 234 This patch must be backported to 2.0. It could be useful in 1.9 as well even though pools and reuse are not enabled by default there.	2019-09-08 09:30:50 +02:00
Willy Tarreau	6b3089856f	MEDIUM: fd: do not use the FD_POLL_* flags in the pollers anymore As mentioned in previous commit, these flags do not map well to modern poller capabilities. Let's use the FD_EV_*_{R,W} flags instead. This first patch only performs a 1-to-1 mapping making sure that the previously reported flags are still reported identically while using the closest possible semantics in the pollers. It's worth noting that kqueue will now support improvements such as returning distinctions between shut and errors on each direction, though this is not exploited for now.	2019-09-06 19:09:56 +02:00
Willy Tarreau	77abb43ed1	MINOR: fd: add two flags ERR and SHUT to describe FD states There's currently a big ambiguity on our use of POLLHUP because we currently map POLLHUP and POLLRDHUP to FD_POLL_HUP. The first one indicates a close in both directions while the second one indicates a unidirectional close. Since we don't know from the resulting flag we always have to read when reported. Furthermore kqueue only reports unidirectional responses which are mapped to FD_POLL_HUP as well, and their write closes are mapped to a general error. We could add a new FD_POLL_RDHUP flag to improve the mapping, or switch only to the POLL* flags, but that further complicates the portability for operating systems like FreeBSD which do not have POLLRDHUP but have its semantics. Let's instead directly use the per-direction flag values we already have, and it will be a first step in the direction of finer states. Thus we introduce an ERR and a SHUT status for each direction, that the pollers will be able to compute and pass to fd_update_events(). It's worth noting that FD_EV_STATUS already sees the two new flags, but they are harmless since used only by fd_{recv,send}_state() which are never called. Thus in its current state this patch must be totally transparent.	2019-09-06 18:33:07 +02:00
Willy Tarreau	8f2825f3ab	MINOR: fd: add two new calls fd_cond_{recv,send}() These two functions are used to enable recv/send but only if the FD is not marked as active yet. The purpose is to conditionally mark them as tentatively usable without interfering with the polling if polling was already enabled, when it's supposed to be likely true.	2019-09-06 17:50:36 +02:00
Willy Tarreau	4ac9d064d2	MEDIUM: fd: mark the FD as ready when it's inserted Given that all our I/Os are now directed from top to bottom and not the opposite way around, and the FD cache was removed, it doesn't make sense anymore to create FDs that are marked not ready since this would prevent the first accesses unless the caller explicitly does an fd_may_recv() which is not expected to be its job (which conn_ctrl_init() has to do by the way). Let's move this into fd_insert() instead, and have a single atomic operation for both directions via fd_may_both().	2019-09-06 17:50:36 +02:00
Willy Tarreau	dbe3060e81	MINOR: fd: make updt_fd_polling() a normal function It's called from many places, better use a real function than an inline.	2019-09-05 09:31:18 +02:00
Willy Tarreau	f8ecc7f667	MEDIUM: fd: simplify the fd_*_{recv,send} functions using BTS/BTR Now that we don't have to update FD_EV_POLLED_* at the same time as FD_EV_ACTIVE_*, we don't need to use a CAS anymore, a bit-test-and-set operation is enough. Doing so reduces the code size by a bit more than 1 kB. One function was special, fd_done_recv(), whose comments and doc were inaccurate for the part related to the lack of polling.	2019-09-05 09:31:18 +02:00
Willy Tarreau	5bee3e2f47	MEDIUM: fd: remove the FD_EV_POLLED status bit Since commit `7ac0e35f2` in 1.9-dev1 ("MAJOR: fd: compute the new fd polling state out of the fd lock") we've started to update the FD POLLED bit a bit more aggressively. Lately with the removal of the FD cache, this bit is always equal to the ACTIVE bit. There's no point continuing to watch it and update it anymore, all it does is create confusion and complicate the code. One interesting side effect is that it now becomes visible that all fd_*_{send,recv}() operations systematically call updt_fd_polling(), except fd_cant_recv()/fd_cant_send() which never saw it change.	2019-09-05 09:31:18 +02:00
Willy Tarreau	c046d167e4	MEDIUM: log: add support for logging to a ring buffer Now by prefixing a log server with "ring@<name>" it's possible to send the logs to a ring buffer. One nice thing is that it allows multiple sessions to consult the logs in real time in parallel over the CLI, and without requiring file system access. At the moment, ring0 is created as a default sink for tracing purposes and is available. No option is provided to create new rings though this is trivial to add to the global section.	2019-08-30 15:24:59 +02:00
Willy Tarreau	f3dc30f6de	MINOR: log: add a target type instead of hacking the address family Instead of detecting an AF_UNSPEC address family for a log server and to deduce a file descriptor, let's create a target type field and explicitly mention that the socket is of type FD.	2019-08-30 15:07:25 +02:00
Willy Tarreau	d660990cee	MINOR: fd: add a new "initialized" bit in the fdtab struct The purpose is to be able to remember that initialization was already done for a file descriptor. This will allow to get rid of some dirty hacks performed in the logs or fd sinks where the init state of the fd has to be guessed.	2019-08-30 15:07:25 +02:00
Willy Tarreau	76913d3ef4	CLEANUP: fd: remove leftovers of the fdcache The "cache" entry was still present in the fdtab struct and it was reported in "show sess". Removing it broke the cache-line alignment on 64-bit machines which is important for threads, so it was fixed by adding an attribute(aligned()) when threads are in use. Doing it only in this case allows 32-bit thread-less platforms to see the struct fit into 32 bytes.	2019-08-30 15:07:25 +02:00
Willy Tarreau	1d181e489c	MEDIUM: ring: implement a wait mode for watchers Now it is possible for a reader to subscribe and wait for new events sent to a ring buffer. When new events are written to a ring buffer, the applets that are subscribed are woken up to display new events. For now we only support this with the CLI applet called by "show events" since the I/O handler is indeed a CLI I/O handler. But it's not complicated to add other mechanisms to consume events and forward them to external log servers for example. The wait mode is enabled by adding "-w" after "show events <sink>". An extra "-n" was added to directly seek to new events only.	2019-08-30 11:58:58 +02:00
Willy Tarreau	300decc8d9	MINOR: cli: extend the CLI context with a list and two offsets Some CLI parsers are currently abusing the CLI context types such as pointers to stuff longs into them by lack of room. But the context is 80 bytes while cli is only 48, thus there's some room left. This patch adds a list element and two size_t usable as various offsets. The list element is initialized.	2019-08-30 11:58:58 +02:00
Willy Tarreau	370a694879	MINOR: trace: change the detail_level to per-source verbosity The detail level initially based on syslog levels is not used, while something related is missing, trace verbosity, to indicate whether or not we want to call the decoding callback and what level of decoding we want (raw captures etc). Let's change the field to "verbosity" for this. A verbosity of zero means that the decoding callback is not called, and all other levels are handled by this callback and are source-specific. The source is now prompted to list the levels that are proposed to the user. When the source doesn't define anything, "quiet" and "default" are available.	2019-08-29 17:11:25 +02:00
Willy Tarreau	09fb0df6fd	MINOR: trace: prepend the function name for developer level traces Working on adding traces to mux-h2 revealed that the function names are manually copied a lot in developer traces. The reason is that they are not preprocessor macros and as such cannot be concatenated. Let's slightly adjust the trace() function call to take a function name just after the file:line argument. This argument is only added for the TRACE_DEVEL and 3 new TRACE_ENTER, TRACE_LEAVE, and TRACE_POINT macros and left NULL for others. This way the function name is only reported for traces aimed at the developers. The pretty-print callback was also extended to benefit from this. This will also significantly shrink the data segment as the "entering" and "leaving" strings will now be merged. One technical point worth mentioning is that the function name is not passed as an ist to the inline function because it's not considered as a builtin constant by the compiler, and would lead to strlen() being run on it from all call places before calling the inline function. Thus instead we pass the const char * (that the compiler knows where to find) and it's the __trace() function that converts it to an ist for internal consumption and for the pretty-print callback. Doing this avoids losing 5-10% peak performance.	2019-08-29 17:09:13 +02:00
Willy Tarreau	2ea549bc43	MINOR: trace: change the "payload" level to "data" and move it The "payload" trace level was ambiguous because its initial purpose was to be able to dump received data. But it doesn't make sense to force to report data transfers just to be able to report state changes. For example, all snd_buf()/rcv_buf() operations coming from the application layer should be tagged at this level. So here we move this payload level above the state transitions and rename it to avoid the ambiguity making one think it's only about request/response payload. Now it clearly is about any data transfer and is thus just below the developer level. The help messages on the CLI and the doc were slightly reworded to help remove this ambiguity.	2019-08-29 10:46:11 +02:00
Willy Tarreau	be5a288424	MINOR: trace: replace struct trace_lockon_args with struct name_desc No need for a specific struct anymore, name_desc suits us.	2019-08-29 09:34:53 +02:00
Willy Tarreau	fb4ba91ac1	MINOR: tools: add a generic struct "name_desc" for name-description pairs In prompts on the CLI we now commonly need to propose a keyword name and a description and it doesn't make sense to define a new struct for each such pairs. Let's simply have a generic "name_desc" for this.	2019-08-29 09:34:53 +02:00
Geoff Simmons	7185b789f9	MINOR: connection: add the fc_pp_authority fetch -- authority TLV, from PROXYv2 Save the authority TLV in a PROXYv2 header from the client connection, if present, and make it available as fc_pp_authority. The fetch can be used, for example, to set the SNI for a backend TLS connection.	2019-08-28 17:16:20 +02:00
Willy Tarreau	c326ecc9b1	MINOR: trace: change the TRACE() calling convention to put the args and cb last Previously the callback was almost mandatory so it made sense to have it before the message. Now that it can default to the one declared in the trace source, most TRACE() calls contain series of empty args and callbacks, which make them suitable for being at the end and being totally omitted. This patch thus reverses the TRACE arguments so that the message appears first, then the mask, then arg1..arg4, then the callback. In practice we'll mostly see 1 arg, or 2 args and nothing else, and it will not be needed anymore to pass long series of commas in the middle of the arguments. However if a source is enforced, the empty commas will still be needed for all omitted arguments.	2019-08-28 10:39:43 +02:00
Willy Tarreau	3da0026d25	MINOR: trace: support a default callback for the source It becomes apparent that most traces will use a single trace pretty print callback, so let's allow the trace source to declare a default one so that it can be omitted from trace calls, and will be used if no other one is specified.	2019-08-28 07:06:23 +02:00
Willy Tarreau	8f24023ba0	MINOR: sink: now report the number of dropped events on output The principle is that when emitting a message, if some dropped events were logged, we first attempt to report this counter before going further. This is done under an exclusive lock while all logs are produced under a shared lock. This ensures that the dropped line is accurately reported and doesn't accidentally arrive after a later event.	2019-08-27 17:14:19 +02:00
Willy Tarreau	4ed23ca0e7	MINOR: sink: add support for ring buffers This now provides sink_new_buf() which allocates a ring buffer. One such ring ("buf0") of 1 MB is created already, and may be used by sink_write(). The sink's creation should probably be moved somewhere else later.	2019-08-27 17:14:19 +02:00
Willy Tarreau	072931cdcb	MINOR: ring: add a generic CLI io_handler to dump a ring buffer The three functions (attach, IO handler, and release) are meant to be called by any CLI command which requires to dump the contents of a ring buffer. We do not implement anything generic to dump any ring buffer on the CLI since it's meant to be used by other functionalities above. However these functions deal with locking and everything so it's trivial to embed them in other code.	2019-08-27 17:14:19 +02:00
Willy Tarreau	be97853c2f	MINOR: ring: add a ring_write() function This function tries to write to the ring buffer, possibly removing enough old messages to make room for the new one. It takes two arrays of fragments on input to ease the insertion of prefixes by the caller. It atomically writes the message, possibly truncating it if desired, and returns the operation's status.	2019-08-27 17:14:19 +02:00
Willy Tarreau	172945fbad	MINOR: ring: add a new mechanism for retrieving/storing ring data in buffers Our circular buffers are well suited for being used as ring buffers for not-so-structured data. The mechanism here consists in making room in a buffer before inserting a new record which is prefixed by its size, and looking up next record based on the previous one's offset and size. We can have up to 255 consumers watching for data (dump in progress, tail) which guarantee that entries are not recycled while they're being dumped. The complete representation is described in the header file. For now only ring_new(), ring_resize() and ring_free() are created.	2019-08-27 17:14:19 +02:00
Willy Tarreau	931d8b79a8	MINOR: fd: add fd_write_frag_line() to send a fragmented line to an fd Currently both logs and event sinks may use a file descriptor to atomically emit some output contents. The two may use the same FD though nothing is done to make sure they use the same lock. Also there is quite some redundancy between the two. Better make a specific function to send a fragmented message to a file descriptor which will take care of the locking via the fd's lock. The function is also able to truncate a message and to enforce addition of a trailing LF when building the output message.	2019-08-27 17:14:19 +02:00
Willy Tarreau	b88d231773	MINOR: buffer: add functions to read/write varints from/to buffers The new functions are : __b_put_varint() : inserts a varint when it's known that it fits b_put_varint() : tries to insert a varint at the tail b_get_varint() : tries to get a varint from the head b_peek_varint() : tries to peek a varint at a specific offset Wrapping is supported so that they are expected to be safe to use to manipulate varints with buffers anywhere.	2019-08-27 17:14:19 +02:00
Willy Tarreau	4d589e719b	MINOR: tools: add a function varint_bytes() to report the size of a varint When encoding varints, it will sometimes be useful to know the output size in advance. Two versions are provided, one inline using a switch/case construct which will be trivial for use with constants (and will be very fast albeit huge) and one function iterating on the number which is 5 times smaller, for use with variables.	2019-08-27 17:14:19 +02:00
Willy Tarreau	e40f274878	BUILD: trace: make the lockon_ptr const to silence a warning without threads I forgot to fix this one before pushing, despite my tests. lockon_ptr is only used to compare pointers, it doesn't need to point to a writable location. Without threads the atomic store is turned into an assignment and rightfully complains.	2019-08-22 20:26:28 +02:00
Willy Tarreau	c14eea49e6	MINOR: trace: add the possibility to lock on some arguments Given that we can pass typed arguments to the trace() function, let's add provisions for tracking them. They are source-specific so we need to let the source fill their name and description. Only those with a non-null name will be proposed.	2019-08-22 20:21:00 +02:00
Willy Tarreau	17a51c64b5	MINOR: trace: add a definition of typed arguments to trace() With a few macros it's possible for a trace source to commit to only using a certain type for a given argument (or set of). This will be particularly useful to let the trace subsystem retrieve some precious information such as a connection, session, listener, source address or so, and enable/disable filtering and/or locking.	2019-08-22 20:21:00 +02:00
Willy Tarreau	4ab242136d	MINOR: trace: add per-level macros to produce traces The new TRACE_<level>() macros take a mask, 4 args, a callback and a static message. From this they also inherit the TRACE_SOURCE macro from the caller, which contains the pointer to the trace source (so that it's not required to paste it everywhere), and an ist string is also made by the concatenation of the file name and the line number. This uses string concatenation by the preprocessor, and turns it into an ist by the compiler so that there is no operation at all to perform to adjust the data length as the compiler knows where to cut during the optimization phase. Last, the message is also automatically turned into an ist so that it's trivial to put it into an iovec without having to run strlen() on it. All arguments and the callback may be empty and will then automatically be replaced with a NULL pointer. This makes the TRACE calls slightly lighter especially since arguments are not always used. Several other options were considered to use variadic macros but there's no outstanding rule that justifies to place an argument before another one, and it still looks convenient to have the message be the last one to encourage copy-pasting of the trace statements. A generic TRACE() macro takes TRACE_LEVEL in from the source file as the trace level instead of taking it from its name. This may slightly simplify the production of traces that always run at the same level (internal core parts may probably only be called at developer level).	2019-08-22 20:21:00 +02:00
Willy Tarreau	bfd14fc6eb	MINOR: trace: implement a call to a decode function The trace() call will support an optional decoding callback and 4 arguments that this function is supposed to know how to use to provide extra information. The output remains unchanged when the function is NULL. Otherwise, the message is pre-filled into the thread-local trace_buf, and the function is called with all arguments so that it completes the buffer in a readable form depending on the expected level of detail.	2019-08-22 20:21:00 +02:00
Willy Tarreau	5da408818b	MINOR: trace: make trace() now also take a level in argument This new "level" argument will allow the trace sources to label the traces for different purposes, and filter out some of them if they are not relevant to the current target. Right now we have 5 different levels: - USER : the least verbose one, only a few functional information - PAYLOAD: like user but also displays some payload-related information - PROTO: focuses on the protocol's framing - STATE: also indicate state internal transitions or non-transitions - DEVELOPER: adds extra info about branches taken in the code (break points, return points)	2019-08-22 20:21:00 +02:00
Willy Tarreau	419bd49f0b	MINOR: trace: add the file name and line number in the prefix We now pass an extra argument "where" to the trace() call, which is supposed to be an ist made of the concatenation of the filename and the line number. We only keep the last 10 chars from this string since the end of file names is most often easy to recognize. This gives developers useful information at very low cost.	2019-08-22 20:21:00 +02:00
Willy Tarreau	4c2ae48375	MINOR: trace: implement a very basic trace() function For now it remains quite basic. It performs a few state checks, calls the source's sink if defined, and performs the transitions between RUNNING, STOPPED and WAITING when the configured events match.	2019-08-22 20:21:00 +02:00
Willy Tarreau	864e880f6c	MINOR: trace/cli: register the "trace" CLI keyword to list the sources For now it lists the sources if one is not provided, and checks for the source's existence. It lists the events if not provided, checks for their existence if provided, and adjusts reported events/start/stop/pause events, and performs state transitions. It lists sinks and adjusts them as well. Filters, lock, and level are not implemented yet.	2019-08-22 20:21:00 +02:00
Willy Tarreau	88ebd4050e	MINOR: trace: add allocation of buffer-sized trace buffers This will be needed so that we can implement protocol decoders which will have to emit their contents into such a buffer.	2019-08-22 20:21:00 +02:00
Willy Tarreau	4151c753fc	MINOR: trace: start to create a new trace subsystem The principle of this subsystem will be to support taking live traces at various places in the code with conditional triggers, filters, and ability to lock on some elements. The traces will support typed events and will be sent into sinks made of ring buffers, file descriptors or remote servers.	2019-08-22 20:21:00 +02:00
Willy Tarreau	973e662fe8	MINOR: sink: add a support for file descriptors This is the most basic type of sink. It pre-registers "stdout" and "stderr", and is able to use writev() on them. The writev() operation is locked to avoid mixing outputs. It's likely that the registration should move somewhere else to take into account the fact that stdout and stderr are still opened or are closed.	2019-08-22 20:21:00 +02:00
Willy Tarreau	67b5a161b4	MINOR: sink: create definitions and minimal code for event sinks The principle will be to be able to dispatch events to various destinations called "sinks". This is already done in part in logs where log servers can be either a UDP socket or a file descriptor. This will be needed with the new trace subsystem where we may also want to add ring buffers. And it turns out that all such destinations make sense at all places. Logs may need to be sent to a TCP server via a ring buffer, or consulted from the CLI. Trace events may need to be sent to stdout/stderr as well as to remote log servers. This patch creates a new structure "sink" aiming at addressing these similar needs. The goal is to merge together what is common to all of them, such as the output format, the dropped events count, etc, and also keep separately the target identification (network address, file descriptor). Provisions were made to have a "waiter" on the sink. For a TCP log server it will be the task to wake up after writing to the log buffer. For a ring buffer, it could be the list of watchers on the CLI running a "tail" operation and waiting for new events. A lock was also placed in the struct since many operations will require some locking, including the FD ones. The output formats cover those in use by logs and two extra ones prepending the ISO time in front of the message (convenient for stdio/buffer). For now only the generic infrastructure is present, no type-specific output is implemented. There's the sink_write() function which prepares and formats a message to be sent, trying hard to avoid copies and only using pointer manipulation, where the type-specific code just has to be added. Dropped messages are already counted (for now 100% drop). The message is put into an iovec array as it will be trivial to use with file descriptors and sockets.	2019-08-22 20:21:00 +02:00
Willy Tarreau	9eebd8a978	REORG: trace: rename trace.c to calltrace.c and mention it's not thread-safe The function call tracing code is quite old and was never ported to support threads. It's not even sure whether it still works well, but at least its presence creates confusion for future work so let's rename it to calltrace.c and add a comment about its lack of thread-safety.	2019-08-22 20:21:00 +02:00
Willy Tarreau	32c24552e4	MINOR: tools: add a DEFNULL() macro to use NULL for empty args It's sometimes convenient for debugging macros not to be forced to explicitly pass NULL in an unused argument. This macro does this, it replaces a missing arg with NULL.	2019-08-22 20:21:00 +02:00
Willy Tarreau	9bead8c7f5	MINOR: list: add LIST_SPLICE() to merge one list into another This will move the contents of list <old> at the beginning of list <new>.	2019-08-22 20:21:00 +02:00
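The splice operation described in 9bead8c7f5 can be sketched on a circular doubly-linked list as follows. This is a sketch under assumptions, not the macro from the commit: the `struct list` layout, the function names, and the choice to leave the source list empty afterwards are all hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical circular doubly-linked list head/element. */
struct list {
	struct list *n; /* next */
	struct list *p; /* previous */
};

static void list_init(struct list *l)
{
	l->n = l->p = l;
}

/* Append <el> at the end of the list headed by <head>. */
static void list_append(struct list *head, struct list *el)
{
	el->p = head->p;
	el->n = head;
	head->p->n = el;
	head->p = el;
}

/* Move every element of <src> to the beginning of <dst>, keeping their
 * order. Assumption of this sketch: <src> is re-initialized as empty.
 */
static void list_splice(struct list *dst, struct list *src)
{
	if (src->n == src)
		return; /* nothing to move */

	src->p->n = dst->n; /* last of src -> old first of dst */
	dst->n->p = src->p;
	src->n->p = dst;    /* first of src becomes first of dst */
	dst->n = src->n;
	list_init(src);
}
```

Only four pointers are rewritten regardless of the list lengths, which is why splicing is preferable to moving the elements one by one.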
Willy Tarreau	60409db0b1	MINOR: lua: export applet and task handlers The current functions are seen outside from the debugging code and are convenient to export so that we can improve the thread dump output : void hlua_applet_tcp_fct(struct appctx *ctx); void hlua_applet_http_fct(struct appctx *ctx); struct task *hlua_process_task(struct task *task, void *context, unsigned short state); Of course they are only available when USE_LUA is defined.	2019-08-21 14:32:09 +02:00
Willy Tarreau	a2c9911ace	MINOR: tools: add append_prefixed_str() This is somewhat related to indent_msg() except that this one places a known prefix at the beginning of each line, allows to replace the EOL character, and not to insert a prefix on the first line if not desired. It works with a normal output buffer/chunk so it doesn't need to allocate anything nor to modify the input string. It is suitable for use in multi- line backtraces.	2019-08-21 14:32:09 +02:00
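The behaviour described in a2c9911ace (a prefix on each line, a replaceable EOL character, an optional prefix on the first line, no allocation and no change to the input) can be sketched as below. The function name, signature and fixed-size output buffer are assumptions of this sketch; the real append_prefixed_str() works on a HAProxy buffer/chunk and its prototype is not given in the commit message.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical sketch: copy <str> into <out> (capacity <size>), inserting
 * <pfx> at the beginning of each line and replacing each '\n' with <eol>.
 * If <first> is zero, the first line gets no prefix. The input string is
 * not modified and nothing is allocated. Returns the number of bytes
 * written (excluding the trailing NUL), or -1 if <out> is too small.
 */
static int prefixed_copy(char *out, size_t size, const char *pfx,
                         const char *str, char eol, int first)
{
	size_t len = 0;
	size_t plen = strlen(pfx);
	int at_bol = 1; /* at beginning of a line */

	if (size == 0)
		return -1;

	for (; *str; str++) {
		if (at_bol) {
			if (first) {
				if (len + plen >= size)
					return -1;
				memcpy(out + len, pfx, plen);
				len += plen;
			}
			first = 1; /* all following lines are prefixed */
			at_bol = 0;
		}
		if (len + 1 >= size)
			return -1;
		if (*str == '\n') {
			out[len++] = eol;
			at_bol = 1;
		} else {
			out[len++] = *str;
		}
	}
	out[len] = '\0';
	return (int)len;
}
```

For a multi-line backtrace, a caller would typically pass an indentation string as the prefix and keep `'\n'` as the EOL character.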
Willy Tarreau	f5cab82025	MINOR: fd: make sure to mark the thread as not stuck in fd_update_events() When I/O events are being processed, we want to make sure to mark the thread as not stuck. The reason is that some pollers (like poll()) which do not limit the number of FDs they report could possibly report a huge amount of FD all having to perform moderately expensive operations in the I/O callback (e.g. via mux-pt which forwards to the upper layers), making the watchdog think the thread is stuck since it does not schedule. Of course this must never happen but if it ever does we must be liberal about it. This should be backported to 2.0, where the situation may happen more easily due to the FD cache which can start to collect a large amount of events. It may be related to the report in issue #201 though nothing is certain about it.	2019-08-16 16:06:14 +02:00
Willy Tarreau	edb91ad647	MINOR: cli: add cli_msg(), cli_err(), cli_dynmsg(), cli_dynerr() These functions perform all the boring filling of the appctx's cli struct needed by CLI parsers to return a message or an error, and they return 1 so that they can be used as a single-line return statement. They may be used for const messages or dynamic messages.	2019-08-09 10:11:38 +02:00
Willy Tarreau	d50c7feaa1	MINOR: cli: add two new states to print messages on the CLI Right now we used to have extremely inconsistent states to report output, one is CLI_ST_PRINT which prints constant message cli->msg with the assigned severity, and CLI_ST_PRINT_FREE which prints dynamically allocated cli->err with severity LOG_ERR, and nothing in between, even though it's useful to be able to report dynamically allocated messages as well as constant error messages. This patch adds two extra states, which are not particularly well named given the constraints imposed by existing ones. One is CLI_ST_PRINT_ERR which prints a constant error message. The other one is CLI_ST_PRINT_DYN which prints a dynamically allocated message. By doing so we maintain the compatibility with current code. It is important to keep in mind that we cannot pre-initialize pointers and automatically detect what message type it is based on the assigned fields, because the CLI's context is in a union shared with all other users, thus unused fields contain anything upon return. This is why we have no choice but using 4 states. Keeping the two fields <msg> and <err> remains useful because one is const and not the other one, and this catches many copy-paste mistakes. It's just that <err> is pretty confusing here, it should be renamed.	2019-08-09 10:11:38 +02:00
Willy Tarreau	247a8b1d81	CLEANUP: task: move the cpu_time field to the task-only part The CPU time accounting field called "cpu_time" is used only by tasks and not tasklets, yet it used to be stored into the TASK_COMMON part, which doesn't make sense and wastes tasklet memory. In addition, moving it to tasks also helps better group the various parts in cache lines.	2019-08-08 10:11:05 +02:00
Willy Tarreau	e0d0b4089d	CLEANUP: buffer: replace b_drop() with b_free() Since last commit there's no point anymore in having two variants of the same function, let's switch to b_free() only. __b_drop() was renamed to __b_free() for obvious consistency reasons.	2019-08-08 08:07:45 +02:00
Willy Tarreau	3b091f80aa	BUG/MINOR: buffers/threads: always clear a buffer's head before releasing it A small race exists in buffers with "show sess all". This one wants to show some information grabbed from the buffer (especially in HTX mode). But the thread owning this buffer might just be releasing its area, right after a free() or munmap() call, resulting in a head that is not seen as empty yet though the area was released. It may then be dereferenced by "show sess all" causing a crash. Note that in practice it only happens in debug mode with UAF enabled, but it's tricky enough to fix it right now. This should be backported to stable versions which support threads and a store barrier. It's worth noting that by performing the clearing first, b_free() and b_drop() now become two exact equivalents.	2019-08-08 08:07:45 +02:00
Willy Tarreau	229e739c21	BUG/MINOR: pools: don't mark the thread harmless if already isolated Commit `85b2cae63` ("MINOR: pools: make the thread harmless during the mmap/munmap syscalls") was used to relax the pressure experienced by other threads when running in debug mode with UAF enabled. It places a pair of thread_harmless_now()/thread_harmless_end() around the call to mmap(), assuming callers are not sensitive to parallel activity. But there are a few cases like "show sess all" where this happens in isolated threads, and marking the thread as harmless there is a very bad idea, even worse when arriving to thread_harmless_end() which loops forever. Let's only do that when the thread is not isolated. No backport is needed as the patch above was only in 2.1-dev.	2019-08-08 07:41:52 +02:00
Frédéric Lécaille	be36793d1d	BUG/MEDIUM: stick-table: Wrong stick-table backends parsing. When parsing references to stick-tables declared as backends, they are added to a list of proxies (they are proxies!) which refer to these stick-tables. Before this patch we added them to these lists without checking they were already present, making the silly hypothesis the actions/samples were checked/resolved in the same order the proxies are parsed. This patch implements a simple inline function in_proxies_list() to test the presence of a proxy in a list of proxies. We use this function when resolving/checking samples/actions. This bug was introduced by `015e4d7` commit. Must be backported to 2.0.	2019-08-07 10:32:31 +02:00
Olivier Houchard	4c18f94c11	BUG/MEDIUM: proxy: Make sure to destroy the stream on upgrade from TCP to H2 In stream_set_backend(), if we have a TCP stream, and we want to upgrade it to H2 instead of attempting to reuse the stream, just destroy the conn_stream, make sure we don't log anything about the stream, and pretend we failed setting the backend, so that the stream will get destroyed. New streams will then be created by the mux, as if the connection just happened. This fixes a crash when upgrading from TCP to H2, as the H2 mux totally ignored the conn_stream provided by the upgrade, as reported in github issue #196. This should be backported to 2.0.	2019-08-02 18:28:58 +02:00
Emmanuel Hocdet	f580d0f391	BUILD: ssl: BoringSSL add EVP_PKEY_base_id Remove EVP_PKEY_base_id compatibility, it is now included in BoringSSL.	2019-08-01 11:21:42 +02:00
Willy Tarreau	a37cb1880c	MINOR: wdt: also consider that waiting in the thread dumper is normal It happens that upon looping threads the watchdog fires, starts a dump, and other threads expire their budget while waiting for the other threads to get dumped and trigger a watchdog event again, adding some confusion to the traces. With this patch the situation becomes clearer as we export the list of threads being dumped so that the watchdog can check it before deciding to trigger. This way such threads in queue for being dumped are not attempted to be reported in turn. This should be backported to 2.0 as it helps understand stack traces.	2019-07-31 19:35:31 +02:00
Olivier Houchard	53055055c5	MEDIUM: pollers: Remember the state for read and write for each threads. In the poller code, instead of just remembering if we're currently polling a fd or not, remember if we're polling it for writing and/or for reading, that way, we can avoid to modify the polling if it's already polled as needed.	2019-07-31 14:54:41 +02:00
Olivier Houchard	305d5ab469	MAJOR: fd: Get rid of the fd cache. Now that the architecture was changed so that attempts to receive/send data always come from the upper layers, instead of them only trying to do so when the lower layer let them know they could try, we can finally get rid of the fd cache. We don't really need it anymore, and removing it gives us a small performance boost.	2019-07-31 14:12:55 +02:00
Willy Tarreau	5e83d996cf	BUG/MAJOR: queue/threads: avoid an AB/BA locking issue in process_srv_queue() A problem involving server slowstart was reported by @max2k1 in issue #197. The problem is that pendconn_grab_from_px() takes the proxy lock while already under the server's lock while process_srv_queue() first takes the proxy's lock then the server's lock. While the latter seems more natural, it is fundamentally incompatible with many other operations performed on servers, namely state change propagation, where the proxy is only known after the server and cannot be locked around the servers. However reversing the lock in process_srv_queue() is trivial and only the few functions related to dynamic cookies need to be adjusted for this so that the proxy's lock is taken for each server operation. This is possible because the proxy's server list is built once at boot time and remains stable. So this is what this patch does. The comments in the proxy and server structs were updated to mention this rule that the server's lock may not be taken under the proxy's lock but may enclose it. Another approach could consist in using a second lock for the proxy's queue which would be different from the regular proxy's lock, but given that the operations above are rare and operate on small servers list, there is no reason for overdesigning a solution. This fix was successfully tested with 10000 servers in a backend where adjusting the dyncookies in loops over the CLI didn't have a measurable impact on the traffic. The only workaround without the fix is to disable any occurrence of "slowstart" on server lines, or to disable threads using "nbthread 1". This must be backported as far as 1.8.	2019-07-30 14:02:06 +02:00
Christopher Faulet	bfab2dddad	MINOR: hlua: Add a flag on the lua txn to know in which context it can be used When a lua action or a lua sample fetch is called, a lua transaction is created. It is an entry in the stack containing the class TXN. Thanks to it, we can know the direction (request or response) of the call. But, for some functions, it is also necessary to know if the buffer is "HTTP ready" for the given direction. "HTTP ready" means there is a valid HTTP message in the channel's buffer. So, when a lua action or a lua sample fetch is called, the flag HLUA_TXN_HTTP_RDY is set if it is appropriate.	2019-07-29 11:17:52 +02:00
Willy Tarreau	d6e0c03384	BUILD: threads: add the definition of PROTO_LOCK This one was added by commit `daacf3664` ("BUG/MEDIUM: protocols: add a global lock for the init/deinit stuff") but I forgot to add it to the include file, breaking DEBUG_THREAD.	2019-07-25 07:53:56 +02:00
Christopher Faulet	98fbe9531a	MEDIUM: mux-h1: Add the support of headers adjustment for bogus HTTP/1 apps There is no standard case for HTTP header names because, as stated in the RFC7230, they are case-insensitive. So applications must handle them in a case-insensitive manner. But some bogus applications erroneously rely on the case used by most browsers. This problem becomes critical with HTTP/2 because all header names must be exchanged in lowercase. And HAProxy uses the same convention. All header names are sent in lowercase to clients and servers, regardless of the HTTP version. This design choice is linked to the HTX implementation. So, for previous versions (2.0 and 1.9), a workaround is to disable the HTX mode to fall back to the legacy HTTP mode. Since the legacy HTTP mode was removed, some users reported interoperability issues because their application was not able anymore to handle HTTP/1 messages received from HAProxy. So, we've decided to add a way to change the case of some headers before sending them. It is now possible to define a "mapping" between a lowercase header name and a version supported by the bogus application. To do so, you must use the global directives "h1-case-adjust" and "h1-case-adjust-file". Then options "h1-case-adjust-bogus-client" and "h1-case-adjust-bogus-server" may be used in proxy sections to enable the conversion. See the configuration manual for more info. Of course, our advice is to urgently upgrade these applications for interoperability concerns and because they may be vulnerable to various types of content smuggling attacks. But, if you are really forced to use an unmaintained bogus application, you may use these directives, at your own risk. If it is relevant, this feature may be backported to 2.0.	2019-07-24 18:32:47 +02:00
Willy Tarreau	daacf36645	BUG/MEDIUM: protocols: add a global lock for the init/deinit stuff Dragan Dosen found that the listeners lock is not sufficient to protect the listeners list when proxies are stopping because the listeners are also unlinked from the protocol list, and under certain situations like bombing with soft-stop signals or shutting down many frontends in parallel from multiple CLI connections, it could be possible to provoke multiple instances of delete_listener() to be called in parallel for different listeners, thus corrupting the protocol lists. Such operations are pretty rare, they are performed once per proxy upon startup and once per proxy on shut down. Thus there is no point trying to optimize anything and we can use a global lock to protect the protocol lists during these manipulations. This fix (or a variant) will have to be backported as far as 1.8.	2019-07-24 16:45:02 +02:00
Christopher Faulet	90cc4811be	BUG/MINOR: http_htx: Support empty errorfiles Empty error files may be used to disable the sending of any message for specific error codes. A common use-case is to use the file "/dev/null". This way the default error message is overridden and no message is returned to the client. It was supported in the legacy HTTP mode, but not in HTX. Because of a bug, such messages triggered an error. This patch must be backported to 2.0 and 1.9. However, the patch will have to be adapted.	2019-07-23 14:58:32 +02:00
Willy Tarreau	1c8d32bb62	MAJOR: stream: store the target address into s->target_addr When forcing the outgoing address of a connection, till now we used to allocate this outgoing connection and set the address into it, then set SF_ADDR_SET. With connection reuse this causes a whole lot of issues and difficulties in the code. Thanks to the previous changes, it is now possible to store the target address into the stream instead, and copy the address from the stream to the connection when initializing the connection. assign_server_address() does this and as a result SF_ADDR_SET now reflects the presence of the target address in the stream, not in the connection. The http_proxy mode, the peers and the master's CLI now use the same mechanism. For now the existing connection code was not removed to limit the amount of tricky changes, but the allocated connection is not used anymore. This change also revealed a latent issue that we've been having around option http_proxy : the address was set in the connection but neither the SF_ADDR_SET nor the SF_ASSIGNED flags were set. It looks like the connection could establish only due to the fact that it existed with a non-null destination address.	2019-07-19 13:50:09 +02:00
Willy Tarreau	9042060b0b	MINOR: stream: add a new target_addr entry in the stream structure The purpose will be to store the target address there and not to allocate a connection just for this anymore. For now it's only placed in the struct, a few fields were moved to plug some holes, and the entry is freed on release (never allocated yet for now). This must have no impact. Note that in order to fit, the store_count which previously was an int was turned into a short, which is way more than enough given that the hard-coded limit is 8.	2019-07-19 13:50:09 +02:00
Willy Tarreau	e71fca81dd	MAJOR: connection: remove the addr field Now addresses are dynamically allocated when needed. Each connection is created with src=dst=NULL, these entries are allocated on the fly, and released when the connection is released.	2019-07-19 13:50:09 +02:00
Willy Tarreau	ca79f59365	MEDIUM: connection: make sure all address producers allocate their address This commit places calls to sockaddr_alloc() at the places where an address is needed, and makes sure that the allocation is properly tested. This does not add too many error paths since connection allocations are already in the vicinity and share the same error paths. For the two cases where a clear_addr() was called, instead the address was not allocated.	2019-07-19 13:50:09 +02:00
Willy Tarreau	ff5d57b022	MINOR: connection: create a new pool for struct sockaddr_storage This pool will be used to allocate storage for source and destination addresses used in connections. Two functions sockaddr_{alloc,free}() were added and will have to be used everywhere an address is needed. These ones are safe for progressive replacement as they check that the existing pointer is set before replacing it. The pool is not yet used during allocation nor freeing. Also they operate on pointers to pointers so they will perform checks and replace values. The free one nulls the pointer.	2019-07-19 13:50:09 +02:00
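The pointer-to-pointer convention described in ff5d57b022 can be sketched as follows. This is a sketch under stated assumptions: the `_sketch` function names are hypothetical, and calloc()/free() stand in for the dedicated memory pool that the commit introduces.

```c
#include <assert.h>
#include <stdlib.h>
#include <sys/socket.h>

/* Hypothetical stand-in for sockaddr_alloc(): allocate storage for <*sap>
 * only if it is not already set, and return the resulting pointer (NULL
 * on allocation failure). calloc() replaces the real pool here.
 */
static struct sockaddr_storage *sockaddr_alloc_sketch(struct sockaddr_storage **sap)
{
	if (*sap)
		return *sap; /* already allocated: keep the existing address */

	*sap = calloc(1, sizeof(**sap));
	return *sap;
}

/* Hypothetical stand-in for sockaddr_free(): release <*sap> and reset the
 * pointer to NULL so that a second call is harmless.
 */
static void sockaddr_free_sketch(struct sockaddr_storage **sap)
{
	free(*sap);
	*sap = NULL;
}
```

Because both helpers take the address of the pointer, a caller can invoke them on a connection's source or destination field without first checking whether an address is already attached.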
Willy Tarreau	226572f55f	MINOR: connection: use conn->{src,dst} instead of &conn->addr.{from,to} This is in preparation for the switch to dynamic address allocation, let's migrate the code using the old fields to the pointers instead. Note that no extra check was added for now, the purpose is only to get the code to use the pointers and still work. In the proxy protocol message handling we make sure the addresses are properly allocated before declaring them unset.	2019-07-19 13:50:09 +02:00
Willy Tarreau	1ef4cbc693	MINOR: connection: add new src and dst fields At the moment we're facing difficulties with connection reuse based on the fact that connections may be allocated very early only to set a target address in transparent mode. With the imminent removal of the legacy mode, the connection reuse by a same stream will not exist anymore and all this awful complexity is not justified anymore. However we still need to be able to assign addresses somewhere. Thus instead of allocating a connection, we'll only place addresses where needed in the stream during operations. But this takes quite some room (typically 128 bytes). This is a nice opportunity for cleaning all this up and dynamically allocating the addresses fields, which will result in actually saving memory from connection structs since most of the time the client's "to" address is not used and the server's "from" is not used either, thus saving ~256 bytes per end-to-end connection. For now these new "src" and "dst" pointers point to addr.from and addr.to. This will allow us to smoothly update the whole code to use these pointers prior to going further and switching them to pools.	2019-07-19 13:50:09 +02:00
Willy Tarreau	cc4df3b3de	CLEANUP: connection: remove the now unused conn_get_{from,to}_addr() These functions are not used anymore. They didn't report failures and as such were often misused. conn_get_src() and conn_get_dst() now replaced them everywhere.	2019-07-19 13:50:09 +02:00
Willy Tarreau	3cc01d84b3	MINOR: backend: switch to conn_get_{src,dst}() for port and address mapping The backend connect code uses conn_get_{from,to}_addr to forward addresses in transparent mode and to map server ports, without really checking if the operation succeeds. In preparation of future changes, let's switch to conn_get_{src,dst}() and integrate status check for possible failures.	2019-07-19 13:50:09 +02:00
Willy Tarreau	2e34c11458	MINOR: connection: add conn_get_src() and conn_get_dst() These functions currently are the same as conn_get_from_addr() and conn_get_to_addr() respectively except that they return a status for the operation that the caller can test.	2019-07-19 13:50:09 +02:00
Christopher Faulet	f734638976	MINOR: http: Don't store raw HTTP errors in chunks anymore Default HTTP error messages are stored in an array of chunks. And since the HTX was added, these messages are also converted in HTX and stored in another array. But now, the first array is not used anymore because the legacy HTTP mode was removed. So now, only the array with the HTX messages is kept. The other one was removed.	2019-07-19 09:46:23 +02:00
Christopher Faulet	1b6adb4a51	MINOR: proxy/http_ana: Remove unused req_exp/rsp_exp and req_add/rsp_add lists The keywords req* and rsp* are now unsupported. So the corresponding lists are now unused. It is safe to remove them from the structure proxy. As a result, the code dealing with these rules in HTTP analyzers was also removed.	2019-07-19 09:24:12 +02:00
Christopher Faulet	8c3b63ae1d	MINOR: proxy: Remove the unused list of block rules The keyword "block" is now unsupported. So the list of block rules is now unused. It can be safely removed from the structure proxy.	2019-07-19 09:24:12 +02:00
Christopher Faulet	a6a56e6483	MEDIUM: config: Remove parsing of req* and rsp* directives It was announced for the 2.1. Following keywords are now unsupported: * reqadd, reqallow, reqiallow, reqdel, reqidel, reqdeny, reqideny, reqpass, reqipass, reqrep, reqirep, reqtarpit, reqitarpit * rspadd, rspdel, rspidel, rspdeny, rspideny, rsprep, rspirep A fatal error is emitted if one of these keywords is found during the configuration parsing.	2019-07-19 09:24:12 +02:00
Christopher Faulet	73e8ede156	MINOR: proxy: Remove support of the option 'http-tunnel' The option 'http-tunnel' is deprecated and it was only used in the legacy HTTP mode. So this option is now totally ignored and a warning is emitted during HAProxy startup if it is found in a configuration file.	2019-07-19 09:24:12 +02:00
Christopher Faulet	fc9cfe4006	REORG: proto_htx: Move HTX analyzers & co to http_ana.{c,h} files The old module proto_http does not exist anymore. All code dedicated to the HTTP analysis is now grouped in the file proto_htx.c. So, to finish the polishing after removing the legacy HTTP code, proto_htx.{c,h} files have been moved in http_ana.{c,h} files. In addition, all HTX analyzers and related functions prefixed with "htx_" have been renamed to start with "http_" instead.	2019-07-19 09:24:12 +02:00
Christopher Faulet	eb2754bef8	CLEANUP: proto_http: Remove unnecessary includes and comments	2019-07-19 09:24:12 +02:00
Christopher Faulet	22dc248c2a	CLEANUP: channel: Remove the unused flag CF_WAKE_CONNECT This flag is tested or cleared but never set anymore.	2019-07-19 09:24:12 +02:00
Christopher Faulet	3716ebc50f	CLEANUP: proto_http: Group remaining flags of the HTTP transaction	2019-07-19 09:24:12 +02:00
Christopher Faulet	cc76d5b9a1	MINOR: proto_http: Remove the unused flag HTTP_MSGF_WAIT_CONN This flag is set but never used. So remove it.	2019-07-19 09:24:12 +02:00
Christopher Faulet	c41547b66e	MINOR: proto_http: Remove unused http txn flags Many flags of the HTTP transaction (TX_*) are now unused and useless. So the flags TX_WAIT_CLEANUP, TX_HDR_CONN_*, TX_CON_CLO_SET and TX_CON_KAL_SET were removed. Most of TX_CON_WANT_* were also removed. Only TX_CON_WANT_TUN has been kept.	2019-07-19 09:24:12 +02:00
Christopher Faulet	711ed6ae4a	MAJOR: http: Remove the HTTP legacy code First of all, all legacy HTTP analyzers and all functions exclusively used by them were removed. So most of the functions in proto_http.{c,h} were removed. Only functions to deal with the HTTP transaction have been kept. Then, http_msg and hdr_idx modules were entirely removed. And finally the structure http_msg was lightened of all its useless information about the legacy HTTP. The structure hdr_ctx was also removed because unused now, just like unused states in the enum h1_state. Note that the memory pool "hdr_idx" was removed and "http_txn" is now smaller.	2019-07-19 09:24:12 +02:00
Christopher Faulet	3d11969a91	MAJOR: filters: Remove code relying on the legacy HTTP mode This commit breaks the compatibility with filters still relying on the legacy HTTP code. The legacy callbacks were removed (http_data, http_chunk_trailers and http_forward_data). For now, the filters must still set the flag FLT_CFG_FL_HTX to be used on HTX streams.	2019-07-19 09:18:27 +02:00
Christopher Faulet	28b18c5e21	CLEANUP: proxy: Remove the flag PR_O2_USE_HTX This flag is now unused. So we can safely remove it.	2019-07-19 09:18:27 +02:00
Christopher Faulet	6d1dd46917	MEDIUM: http_fetch: Remove code relying on HTTP legacy mode Since the legacy HTTP mode is disabled, all HTTP sample fetches work on HTX streams. So it is safe to remove all code relying on HTTP legacy mode. Among other things, the function smp_prefetch_http() was removed with the associated macros CHECK_HTTP_MESSAGE_FIRST() and CHECK_HTTP_MESSAGE_FIRST_PERM().	2019-07-19 09:18:27 +02:00
Christopher Faulet	c985f6c5d8	MINOR: connection: Remove the multiplexer protocol PROTO_MODE_HTX Since the legacy HTTP mode is disabled and no multiplexer relies on it anymore, there is no reason to have 2 multiplexer protocols for the HTTP. So the protocol PROTO_MODE_HTX was removed and all HTTP multiplexers use now PROTO_MODE_HTTP.	2019-07-19 09:18:27 +02:00
Christopher Faulet	5ed8353dcf	CLEANUP: h2: Remove functions converting h2 requests to raw HTTP/1.1 ones Because the h2 multiplexer only uses the HTX mode, following H2 functions were removed : * h2_prepare_h1_reqline * h2_make_h1_request() * h2_make_h1_trailers()	2019-07-19 09:18:27 +02:00
Christopher Faulet	24e116bfe0	MINOR: htx: Slightly update htx_dump() to report better messages Sign of <tail_addr>, <head_addr> and <end_addr> is respected to not convert -1 into its unsigned representation.	2019-07-19 09:18:27 +02:00
Christopher Faulet	2bf43f0746	MINOR: htx: Use an array of char to store HTX blocks Instead of using an array of (struct block), it is more natural and intuitive to use an array of char. Indeed, not only (struct block) are stored in this array, but also their payload.	2019-07-19 09:18:27 +02:00
Christopher Faulet	192c6a23d4	MINOR: htx: Deduce the number of used blocks from tail and head values <head> and <tail> fields are now signed 32-bits integers. For an empty HTX message, these fields are set to -1. So the field <used> is now useless and can safely be removed. To know if an HTX message is empty or not, we just compare <head> against -1 (it also works with <tail>). The function htx_nbblks() has been added to get the number of used blocks.	2019-07-19 09:18:27 +02:00
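The rule stated in 192c6a23d4 (signed <head> and <tail>, both -1 when the message is empty, block count deduced from them) can be sketched as below. The struct and function names carry a `_sketch` suffix because they are hypothetical, and the sketch assumes block indexes do not wrap, so that <head> is never greater than <tail> for a non-empty message.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical reduced view of an HTX message: only the two indexes. */
struct htx_sketch {
	int32_t head; /* index of the oldest block, -1 if empty */
	int32_t tail; /* index of the newest block, -1 if empty */
};

/* An empty message is detected by comparing <head> against -1. */
static inline int htx_is_empty_sketch(const struct htx_sketch *htx)
{
	return htx->head == -1;
}

/* Number of used blocks, assuming head <= tail when not empty. */
static inline int32_t htx_nbblks_sketch(const struct htx_sketch *htx)
{
	return (htx->head != -1) ? (htx->tail - htx->head + 1) : 0;
}
```

Deriving the count this way removes the need to keep a separate <used> field in sync with the two indexes.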
Christopher Faulet	5a916f7326	CLEANUP: htx: Remove the unused function htx_add_blk_type_size()	2019-07-19 09:18:27 +02:00
Christopher Faulet	3b21972061	DOC: htx: Update comments in HTX files This patch may be backported to 2.0 to have accurate comments.	2019-07-19 09:18:27 +02:00
Christopher Faulet	304cc40536	MINOR: proto_htx: Add the function htx_return_srv_error() Instead of using a function from the legacy HTTP, the HTX code now uses its own one.	2019-07-19 09:18:27 +02:00
Willy Tarreau	8280ea97a0	MINOR: applet: make appctx use their own pool A long time ago, applets were seen as an alternative to connections, and since their respective sizes were roughly equal it appeared wise to share the same pool. Nowadays, connections got significantly larger but applets are not that often used, except for the cache. However applets are mostly complementary and not alternatives anymore, as it's very possible not to have a back connection or to share one with other streams. The connections will soon lose their addresses and their size will shrink so much that appctx won't fit anymore. Given that the old benefits of sharing these pools have long disappeared, let's stop doing this and have a dedicated pool for appctx.	2019-07-18 10:45:08 +02:00
Willy Tarreau	7764a57d32	BUG/MEDIUM: threads: cpu-map designating a single thread/process are ignored Since commit `81492c989` ("MINOR: threads: flatten the per-thread cpu-map"), we don't keep the procthread matrix anymore to represent the full binding possibilities, but only the proc and thread ones. The problem is that the per-process binding is not the same for each thread and for the process, and the proc[] array was assumed to store the per-proc first thread value when doing this change. Worse, the logic present there tries to deal with thread ranges and process ranges in a way which automatically excluded the other possibility (since ranges cannot be used on both) but as such fails to apply changes if neither the process nor the thread is expressed as a range. The real problem comes from the fact that specifying cpu-map 1/1 doesn't yet reveal if the per-process mask or the per-thread mask needs to be updated. In practice it's the thread one but then the current storage doesn't allow to store the binding of the first thread of each other process in nbproc>1 configurations. When removing the procthread matrix, what ought to have been kept was both the thread column for process 1 and the process line for threads 1, but instead only the thread column was kept. This patch reintroduces the storage of the configuration for the first thread of each process so that it is again possible to store either the per-thread or per-process configuration. As a partial workaround for existing configurations, it is possible to systematically indicate at least two processes or two threads at once and map them by pairs or more so that at least two values are present in the range. E.g : # set processes 1-4 to cpus 0-3 : cpu-map auto:1-4/1 0 1 2 3 # or: cpu-map 1-2/1 0 1 cpu-map 2-3/1 2 3 # set threads 1-4 to cpus 0-3 : cpu-map auto:1/1-4 0 1 2 3 # or : cpu-map 1/1-2 0 1 cpu-map 3/3-4 2 3 This fix must be backported to 2.0.	2019-07-16 15:23:09 +02:00
Andrew Heberle	9723696759	MEDIUM: mworker-prog: Add user/group options to program section This patch adds "user" and "group" config options to the "program" section so the configured command can be run as a different user.	2019-07-15 16:43:16 +02:00
Olivier Houchard	4bd5867627	BUG/MEDIUM: streams: Don't redispatch with L7 retries if redispatch isn't set. Move the logic to decide if we redispatch to a new server from sess_update_st_cer() to a new inline function, stream_choose_redispatch(), and use it in do_l7_retry() instead of just setting the state to SI_ST_REQ. That way, when using L7 retries, we won't redispatch the request to another server except if "option redispatch" is used. This should be backported to 2.0.	2019-07-12 16:17:50 +02:00
Willy Tarreau	64e6012eb9	MINOR: task: introduce work lists Sometimes we need to delegate some list processing to a function running on another thread. In this case the list element will simply be queued into a dedicated self-locked list and the task responsible for this list will be woken up, calling the associated function which will run over the list. This is what work_list does. Such lists will be dedicated to a limited type of work but will significantly ease such remote handling. A function is provided to create these per-thread lists, their tasks and to properly bind each task to a distinct thread, so that the caller only has to store the resulting pointer to the start of the structure. These structures should not be abused though as each head will consume 4 pointers per thread, hence 32 bytes per thread or 2 kB for 64 threads.	2019-07-12 09:07:48 +02:00
Olivier Houchard	4be7190c10	BUG/MEDIUM: servers: Fix a race condition with idle connections. When we're purging idle connections, there's a race condition, when we're removing the connection from the idle list, to add it to the list of connections to free, if the thread owning the connection tries to free it at the same time. To fix this, simply add a per-thread lock, that has to be held before removing the connection from the idle list, and when, in conn_free(), we're about to remove the connection from every list. That way, we know for sure the connection will stay valid while we remove it from the idle list, to add it to the list of connections to free. This should happen rarely enough that it shouldn't have any impact on performance. This has not been reported yet, but could provoke random segfaults. This should be backported to 2.0.	2019-07-11 16:16:38 +02:00
Christopher Faulet	34ce7d075a	BUG/MINOR: server: Be really able to keep "pool-max-conn" idle connections The maximum number of idle connections for a server can be configured by setting the server option "pool-max-conn". But when we try to add a connection in its idle list, because of a wrong comparison, it may be rejected because there are already "pool-max-conn - 1" idle connections. This patch must be backported to 2.0 and 1.9.	2019-07-10 14:20:52 +02:00
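The off-by-one described in the entry above can be illustrated with a minimal sketch. The function names and the exact form of the faulty comparison are hypothetical; this only reproduces the symptom stated in the message (a connection being rejected while only "pool-max-conn - 1" idle connections exist), not HAProxy's actual code.

```c
/* Hypothetical illustration of the symptom, not HAProxy's real check. */

/* Faulty variant: refuses the new idle connection as soon as
 * max_idle - 1 connections are already stored. */
static int demo_can_add_idle_buggy(unsigned int cur_idle, unsigned int max_idle)
{
    return cur_idle + 1 < max_idle;
}

/* Corrected variant: accepts until max_idle connections are stored. */
static int demo_can_add_idle_fixed(unsigned int cur_idle, unsigned int max_idle)
{
    return cur_idle < max_idle;
}
```

With a limit of 3 and 2 connections already idle, the first variant rejects the third connection while the second one accepts it.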
Willy Tarreau	1dad3843dc	BUG/MEDIUM: fd/threads: fix excessive CPU usage on multi-thread accept While experimenting with potentially improved fairness and latency using ticket locks on a Ryzen 16-thread/8-core, a very strange situation happened a lot for some levels of traffic. Around 300k connections per second, no more connections would be accepted on the multi-threaded listener but all others would continue to work fine. All attempts to trace showed that the threads were all in the trylock in the fd cache, or in the spinlock of fd_update_events(), or in the one of fd_may_recv(). But as indicated this was not a deadlock since the process continues to work fine. After quite some investigation it appeared that the issue is caused by a lack of fairness between the fdcache's trylock and these functions' spin locks above. In fact, regardless of the success or failure of the fdcache's attempt at grabbing the lock, the poller was calling fd_update_events() which locks the FD once for something that can be done with a CAS, and then calls fd_may_recv() with another lock for something that most often didn't change. The high contention on these spinlocks leaves no chance to any other thread to grab the lock using trylock(), and once this happens, there is no thread left to process incoming connection events nor to stop polling on the FD, leaving all threads at 100% CPU but partially operational. This patch addresses the issue by using bit-test-and-set instead of the OR in fd_may_recv() / fd_may_send() so that nothing is done if the FD was already configured as expected. It does the same in fd_update_events() using a CAS to check if the FD's events need to be changed at all or not. With this patch applied, it became impossible to reproduce the issue, and now there's no way to saturate all 16 CPUs with the load used for testing, as no more than 1350-1400 were noticed at 300+kcps vs 1600. 
Ideally this patch should go further and try to remove the remaining incarnations of the fdlock as this seems possible, but it's difficult enough to be done in a distinct patch that will not have to be backported. It is possible that workloads involving a high connection rate may slightly benefit from this patch and observe a slightly lower CPU usage even when the service doesn't misbehave. This patch must be backported to 2.0 and 1.9.	2019-07-09 10:41:24 +02:00
Willy Tarreau	85b2cae63c	MINOR: pools: make the thread harmless during the mmap/munmap syscalls These calls can take quite some time and leave the thread harmless so it's better to mark it as such. This makes "show sess" respond way faster during high loads running on processes build with DEBUG_UAF since these calls are stressed a lot.	2019-07-09 10:40:33 +02:00
Willy Tarreau	828675421e	MINOR: pools: always pre-initialize allocated memory outside of the lock When calling mmap(), in general the system gives us a page but does not really allocate it until we first dereference it. And it turns out that this time is much longer than the time to perform the mmap() syscall. Unfortunately, when running with memory debugging enabled, we mmap/munmap() each object resulting in lots of such calls and a high contention on the allocator. And the first accesses to the page being done under the pool lock is extremely damaging to other threads. The simple fact of writing a 0 at the beginning of the page after allocating it and placing the POOL_LINK pointer outside of the lock is enough to boost the performance by 8x in debug mode and to save the watchdog from triggering on lock contention. This is what this patch does.	2019-07-09 10:40:33 +02:00
Willy Tarreau	3e853ea74d	MINOR: pools: release the pool's lock during the malloc/free calls The malloc and free calls and especially the underlying mmap/munmap() can occasionally take a huge amount of time and even cause the thread to sleep. This is visible when haproxy is compiled with DEBUG_UAF which causes every single pool allocation/free to allocate and release pages. In this case, when using the locked pools, the watchdog can occasionally fire under high contention (typically requesting 40000 1M objects in parallel over 8 threads). Then, "perf top" shows that 50% of the CPU time is spent in mmap() and munmap(). The reason the watchdog fires is because some threads spin on the pool lock which is held by other threads waiting on mmap() or munmap(). This patch modifies this so that the pool lock is released during these syscalls. Not only this allows other threads to request try to allocate their data in parallel, but it also considerably reduces the lock contention. Note that the locked pools are only used on small architectures where high thread counts would not make sense, so this will not provide any benefit in the general case. However it makes the debugging versions way more stable, which is always appreciated.	2019-07-09 10:40:33 +02:00
Christopher Faulet	037b3ebd35	BUG/MEDIUM: stream-int: Don't rely on CF_WRITE_PARTIAL to unblock opposite si In the function stream_int_notify(), when the opposite stream-interface is blocked because there is no more room in the input buffer, if the flag CF_WRITE_PARTIAL is set on this buffer, it is unblocked. It is a way to unblock the reads on the other side because some data was sent. But it is a problem during the fast-forwarding because only the stream is able to remove the flag CF_WRITE_PARTIAL. So it is possible to have this flag because of a previous send while the input buffer of the opposite stream-interface is now full. In such case, the opposite stream-interface will be woken up for nothing because its input buffer is full. If the same happens on the opposite side, we will have a loop consuming all the CPU. To fix the bug, the opposite side is now only notified if there is some available room in its input buffer in the function si_cs_send(), so only if some data was sent. This patch must be backported to 2.0 and 1.9.	2019-07-05 14:26:15 +02:00
Christopher Faulet	2e4843d1d2	MINOR: action: Add the return code ACT_RET_DONE for actions This code should now be used by actions to stop at the same time the rules processing and the possible following processings. And from its side, the return code ACT_RET_STOP should be used to only stop rules processing. So concretely, for TCP rules, there are no changes. ACT_RET_STOP and ACT_RET_DONE are handled the same way. However, for HTTP rules, ACT_RET_STOP should now be mapped on HTTP_RULE_RES_STOP and ACT_RET_DONE on HTTP_RULE_RES_DONE. So this way, an action will have the possibility to stop all processing or only rules processing. Note that changes about the TCP are done in this commit but changes about the HTTP will be done in another one because it will fix a bug at the same time. This patch must be backported to 2.0 because a bugfix depends on it.	2019-07-05 14:26:14 +02:00
Olivier Houchard	cee0389088	BUG/MEDIUM: sessions: Don't keep an extra idle connection in sessions. When deciding if we keep an idle connection in the session, check if the number of connections currently in the session is >= the max allowed, not >, or we'll keep an extra connection. This should be backported to 1.9 and 2.0.	2019-07-04 14:28:18 +02:00
Olivier Houchard	2ab3dada01	BUG/MEDIUM: connections: Make sure we're unsubscribe before upgrading the mux. Just calling conn_force_unsubscribe() from conn_upgrade_mux_fe() is not enough, as there may be multiple XPRT involved. Instead, require that any user of conn_upgrade_mux_fe() unsubscribe itself before calling it. This should fix upgrading a TCP connection to HTX when using SSL. This should be backported to 2.0.	2019-07-03 13:57:30 +02:00
Christopher Faulet	621da6bafa	BUG/MEDIUM: channel/htx: Use the total HTX size in channel_htx_recv_limit() The receive limit of an HTX channel must be calculated against the total size of the HTX message. Otherwise, the buffer may never be seen as full whereas the receive limit is 0. Indeed, the function channel_htx_full() already takes care to add a block size to the buffer's reserve (8 bytes). So if the function channel_htx_recv_limit() also keep a block size free in addition to the buffer's reserve, it means that at least 2 block size will be kept free but only one will be taken into account, freezing the stream if the option http-buffer-request is enabled. This patch fixes the Github issue #136. It should be backported to 2.0 and 1.9. Thanks jaroslawr (Jarosław Rzeszótko) for his help.	2019-07-02 21:32:45 +02:00
Olivier Houchard	6c7e96a3e1	BUG/MEDIUM: connections: Always call shutdown, with no linger. Revert commit `fe4abe62c7`. The goal was to make sure for health-checks, we would not get sockets in TIME_WAIT. To do so, we would not call shutdown() if linger_risk is set. However that is wrong, and that means shutw would never be forwarded to the server, and thus we could get connection that are never properly closed. Instead, to fix the original problem as described here : https://www.mail-archive.com/haproxy@formilux.org/msg34080.html Just make sure the checks code call cs_shutr() before calling cs_shutw(). If shutr has been called, conn_sock_shutw() will make no attempt to call shutdown(), as it knows close() will be called. We should really review and revamp the shutr/shutw code, as described in github issue #142. This should be backported to 1.9 and 2.0.	2019-07-02 16:40:55 +02:00
William Lallemand	ad03288e6b	BUG/MINOR: mworker/cli: don't output a \n before the response When using a level lower than admin on the master CLI, a \n is output before the response, this is caused by the response of the "operator" or "user" that are sent before the actual command. To fix this problem we introduce the flag APPCTX_CLI_ST1_NOLF which ask a command response to not be followed by the final \n. This patch made a special case with the command operator and user followed by a - so they are not followed by \n. This patch must be backported to 2.0 and 1.9.	2019-07-01 15:34:11 +02:00
Christopher Faulet	bb0efcdd29	MINOR: htx: Add the function htx_change_blk_value_len() As its name suggests, this function changes the value length of a block. But it also updates the HTX message accordingly. It simplifies the HTX API. The function htx_set_blk_value_len() is still available and must be used with caution because this one does not update the HTX message. It just updates the HTX block. It should be considered as an internal function. When possible, htx_change_blk_value_len() should be used instead. This function is used to fix a bug affecting the 2.0. So, this patch must be backported to 2.0.	2019-06-18 10:01:55 +02:00
Baptiste Assmann	da29fe2360	MEDIUM: server: server-state global file stored in a tree Server states can be recovered from either a "global" file (all backends) or a "local" file (per backend). The way the algorithm to parse the state file was first implemented was good enough for a low number of backends and servers per backend. Basically, for each backend the state file (global or local) is opened, parsed entirely and for each line we check if it contains data related to a server from the backend we're currently processing. We must read the file entirely, just in case some lines for the current backend are stored at the end of the file. This does not scale at all! This patch changes the behavior above for the "global" file only. Now, the global file is read and parsed once and all lines it contains are stored in a tree, for faster discovery. This result in way much less fopen, fgets, and strcmp calls, which make loading of very big state files very quick now.	2019-06-17 13:40:42 +02:00
Tim Duesterhus	86e6b6ebf8	MEDIUM: Make '(cli|con|srv)timeout' directive fatal They were deprecated with HAProxy 1.5. Time to remove them.	2019-06-17 13:35:54 +02:00
Tim Duesterhus	dac168bc15	MEDIUM: Make 'redispatch' directive fatal It was deprecated with HAProxy 1.5. Time to remove it.	2019-06-17 13:35:54 +02:00
Tim Duesterhus	7b7c47f05c	MEDIUM: Make 'block' directive fatal It was deprecated with HAProxy 1.5. Time to remove it.	2019-06-17 13:35:54 +02:00
Willy Tarreau	9dc6b97429	[RELEASE] Released version 2.1-dev0 Released version 2.1-dev0 with the following main changes : - exact copy of 2.0.0	2019-06-16 21:49:47 +02:00
Willy Tarreau	bd20a9dd4e	BUG: tasks: fix bug introduced by latest scheduler cleanup In commit `86eded6c6` ("CLEANUP: tasks: rename task_remove_from_tasklet_list() to tasklet_remove_*") which consisted in removing the casts between tasks and tasklet, I was a bit too fast to believe that we only saw tasklets in this function since process_runnable_tasks() also uses it with tasks under a cast. So removing the bookkeeping on task_list_size was not appropriate. Bah, the joy of casts which hide the real thing... This patch does two things at once to address this mess once for all: - it restores the decrement of task_list_size when it's a real task, but moves it to process_runnable_task() since it's the only place where it's allowed to call it with a task - it moves the increment there as well and renames task_insert_into_tasklet_list() to tasklet_insert_into_tasklet_list() of obvious consistency reasons. This way the increment/decrement of task_list_size is made at the only places where the cast is enforced, so it has less risks to be missed. The comments on top of these functions were updated to reflect that they are only supposed to be used with tasklets and that the caller is responsible for keeping task_list_size up to date if it decides to enforce a task there. Now we don't have to worry anymore about how these functions work outside of the scheduler, which is better longterm-wise. Thanks to Christopher for spotting this mistake. No backport is needed.	2019-06-14 18:16:19 +02:00
Olivier Houchard	fe4abe62c7	BUG/MEDIUM: connections: Don't call shutdown() if we want to disable linger. In conn_sock_shutw(), avoid calling shutdown() if linger_risk is set. Not doing so will result in getting sockets in TIME_WAIT for some time. This is particularly observable with health checks. This should be backported to 1.9.	2019-06-14 15:33:41 +02:00
Willy Tarreau	86eded6c69	CLEANUP: tasks: rename task_remove_from_tasklet_list() to tasklet_remove_* The function really only operates on tasklets, its arguments are always tasklets cast as tasks to match the function's type, to be cast back to a struct tasklet. Let's rename it to tasklet_remove_from_tasklet_list(), take a struct tasklet, and get rid of the undesired task casts.	2019-06-14 14:57:03 +02:00
Willy Tarreau	3c39a7d889	CLEANUP: connection: rename the wait_event.task field to .tasklet It's really confusing to call it a task because it's a tasklet and used in places where tasks and tasklets are used together. Let's rename it to tasklet to remove this confusion.	2019-06-14 14:42:29 +02:00
Christopher Faulet	e21c01637a	MINOR: htx: Add 3 flags on the start-line to deal with the request schemes The first one, HTX_SL_F_HAS_SCHM, will be used to know the request has an explicit scheme. So, in H2, it is always true because the pseudo-header ":scheme" is mandatory. In H1, it is only true when an absolute URI is found on the start-line. The other flags, HTX_SL_F_SCHM_HTTP and HTX_SL_F_SCHM_HTTPS, will be used to know which scheme the request have. For now, other protocols are not handled. The aim of these flags is to pass this information to the backend side in general, and to the H2 mux in particular. So the multiplexer will have a chance to use this information to send the right scheme to the server.	2019-06-14 11:13:32 +02:00
Christopher Faulet	36a7702b03	CLEANUP: channel: Remove channel_htx_fwd_payload() and channel_htx_fwd_all() These functions are unused now. No backport needed.	2019-06-14 11:13:32 +02:00
Christopher Faulet	421e769783	BUG/MEDIUM: htx: Don't change position of the first block during HTX analysis In the HTX structure, the field <first> is used to know where to (re)start the analysis. It may differ from the message's head. It is especially important to update it to handle 1xx messages, to be sure to restart the analysis on the next message (another 1xx message or the final one). It is also updated when some data are forwarded (the headers or part of the body). But this update is an error and must never be done at the analysis level. It is a bug, because some sample fetches may be used after the data forwarding (but before the first send of course). At this stage, if the first block position does not point on the start-line, most of HTTP sample fetches fail. So now, when something is forwarding by HTX analyzers, the first block position is not update anymore. This issue was reported on Github. See #119. No backport needed.	2019-06-14 11:13:32 +02:00
Christopher Faulet	87ebe944d6	BUG/MINOR: channel/htx: Call channel_htx_full() from channel_full() When channel_full() is called for an HTX stream, we fall back on the HTX version. This function is called, among other, from tcp_inspect_request(). With this patch, the inspect delay is respected again. This patch must be backported to 1.9.	2019-06-14 11:13:32 +02:00
Willy Tarreau	3cec0f94f3	BUG/MINOR: task: prevent schedulable tasks from starving under high I/O activity With both I/O and tasks in the same tasklet list, we now have a very smooth and responsive scheduler, providing a good fairness between I/O activities. With the lower layers relying on tasklet a lot (I/O wakeup, subscribe, etc), there may often be a large number of totally autonomous tasklets doing their business such as forwarding data between two muxes. But the task scheduler historically refrained from picking tasks from the priority-ordered run queue to put them into the tasklet list until this later had less than max_runqueue_depth entries. This was to make sure that low-latency, high-priority tasks would have an opportunity to be dequeued before others even if they arrive late. But the counter used for this is still the tasklet list size, which contains countless I/O events. This causes an unfairness between unbounded I/Os and bounded tasks, resulting for example in the CLI responding slower when forwarding 40 Gbps of HTTP traffic spread over a thousand of connections. A good solution consists in sticking to the initial intent of max_runqueue_depth which is to limit the number of tasks in the list (to maintain fairness between them) and not to limit the number of these tasks among tasklets. It just turns out that the task_list_size initially was this task counter and changed over time to be a tasklet list size. Let's simply refrain from updating it for pure tasklets so that it takes back its original role of counting real tasks as its name implies. With this change the CLI becomes instantly responsive under load again. This patch may possibly be backported to 1.9 though it requires some careful checks.	2019-06-14 09:16:51 +02:00
William Lallemand	1dc6963086	MINOR: mworker: add the HAProxy version in "show proc" Displays the HAProxy version so you can compare the version of old processes and new ones.	2019-06-12 19:19:57 +02:00
Olivier Houchard	a0fdce3950	MINOR: fd: Don't use atomic operations when it's not needed. In updt_fd_polling(), when updating fd_nbupdt, there's no need to use an atomic operation, as it's a TLS variable.	2019-06-12 14:36:24 +02:00
Christopher Faulet	86fcf6d6cd	MINOR: htx: Add the function htx_move_blk_before() The function htx_add_data_before() was removed because it was buggy. The function htx_move_blk_before() may be used if necessary to do something equivalent, except it just moves blocks. It doesn't handle the adding.	2019-06-11 14:05:25 +02:00
Christopher Faulet	d7884d3449	MAJOR: htx: Rework how free rooms are tracked in an HTX message In an HTX message, there may be 2 available rooms to store a new block. The first one is between the blocks and their payload. Blocks are added starting from the end of the buffer and their payloads are added starting from the beginning. So the first free room is between these 2 edges. The second one is at the beginning of the buffer, when we start to wrap to add new payloads. Once we start to use this one, the other one is ignored until the next defragmentation of the HTX message. In theory, there is no problem. But in practice, some gaps in the HTX structure force us to defragment HTX messages too often to always be in a known state. The second free room is not tracked as it should be and the first one may be easily corrupted when rewrites happen. So to fix the problem and avoid unnecessary defragmentation, the HTX structure has been refactored. The front (the block's position of the first payload before the blocks) is no longer stored. Instead we keep the relative addresses of 3 edges:

  * tail_addr : The start address of the free space in front of the blocks table
  * head_addr : The start address of the free space at the beginning
  * end_addr  : The end address of the free space at the beginning

Here is the general view of the HTX message now:

           head_addr     end_addr    tail_addr
               |            |            |
               V            V            V
  +------------+------------+------------+------------+------------------+
  |            |            |            |            |                  |
  |  PAYLOAD   | Free space |  PAYLOAD   | Free space |    Blocks area   |
  |    ==>     |     1      |    ==>     |     2      |      <==         |
  +------------+------------+------------+------------+------------------+

<head_addr> is always lower or equal to <end_addr> and <tail_addr>. <end_addr> is always lower or equal to <tail_addr>. In addition, to simplify everything, the blocks area is now contiguous. It doesn't wrap anymore. 
So the head is always the block with the lowest position, and the tail is always the one with the highest position.	2019-06-11 14:05:25 +02:00
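The ordering rule stated in the entry above (head_addr lower or equal to end_addr, itself lower or equal to tail_addr) can be expressed as a small consistency check. This is a sketch only: the struct and function names are hypothetical and do not reproduce HAProxy's struct htx; only the three field names come from the commit message.

```c
#include <stdint.h>

/* Hypothetical mirror of the three edges named in the commit message. */
struct demo_htx_edges {
    uint32_t head_addr; /* start of the free space at the beginning */
    uint32_t end_addr;  /* end of the free space at the beginning */
    uint32_t tail_addr; /* start of the free space in front of the blocks */
};

/* Returns 1 if head_addr <= end_addr <= tail_addr, 0 otherwise. */
static int demo_edges_valid(const struct demo_htx_edges *e)
{
    return e->head_addr <= e->end_addr && e->end_addr <= e->tail_addr;
}

/* Size of the free space at the beginning, assuming valid edges. */
static uint32_t demo_room_at_beginning(const struct demo_htx_edges *e)
{
    return e->end_addr - e->head_addr;
}
```

Such a check would typically be called from a debugging assertion after each operation that moves one of the edges.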
Christopher Faulet	86bc8df955	BUG/MEDIUM: compression/htx: Fix the adding of the last data block The function htx_add_data_before() is buggy and cannot work. It first adds a data block and then moves it before another one, passed in argument. The problem happens when a defragmentation is done to add the new block. In this case, the reference is no longer valid, because the blocks are rearranged. So, instead of moving the new block before the reference, it is moved at the head of the HTX message. So this function has been removed. It was only used by the compression filter to add a last data block before a TLR, EOT or EOM block. Now, the new function htx_add_last_data() is used. It adds a last data block, after all others and before any TLR, EOT or EOM block. Then, the next block is retrieved. It is the first non-data block after data in the HTX message. The compression loop continues with it. This patch must be backported to 1.9.	2019-06-11 14:05:25 +02:00
Willy Tarreau	9a1f57351d	MEDIUM: threads: add thread_sync_release() to synchronize steps This function provides an alternate way to leave a critical section run under thread_isolate(). Currently, a thread may remain in thread_release() without having the time to notice that the rdv mask was released and taken again by another thread entering thread_isolate() (often the same that just released it). This is because threads wait in harmless mode in the loop, which is compatible with the conditions to enter thread_isolate(). It's not possible to make them wait with the harmless bit off or we cannot know when the job is finished for the next thread to start in thread_isolate(), and if we don't clear the rdv bit when going there, we create another race on the start point of thread_isolate(). This new synchronous variant of thread_release() makes use of an extra mask to indicate the threads that want to be synchronously released. In this case, they will be marked harmless before releasing their sync bit, and will wait for others to release their bit as well, guaranteeing that thread_isolate() cannot be started by any of them before they all left thread_sync_release(). This allows to construct synchronized blocks like this : thread_isolate() /* optionally do something alone here */ thread_sync_release() /* do something together here */ thread_isolate() /* optionally do something alone here */ thread_sync_release() And so on. This is particularly useful during initialization where several steps have to be respected and no thread must start a step before the previous one is completed by other threads. This one must not be placed after any call to thread_release() or it would risk to block an earlier call to thread_isolate() which the current thread managed to leave without waiting for others to complete, and end up here with the thread's harmless bit cleared, blocking others. This might be improved in the future.	2019-06-10 09:42:43 +02:00
Willy Tarreau	9faebe34cd	MEDIUM: tools: improve time format error detection As reported in GH issue #109 and in discourse issue https://discourse.haproxy.org/t/haproxy-returns-408-or-504-error-when-timeout-client-value-is-every-25d the time parser doesn't error on overflows nor underflows. This is a recurring problem which additionally has the bad taste of taking a long time before hitting the user. This patch makes parse_time_err() return special error codes for overflows and underflows, and adds the control in the call places to report suitable errors depending on the requested unit. In practice, underflows are almost never returned as the parsing function takes care of rounding values up, so this might possibly happen on 64-bit overflows returning exactly zero after rounding though. It is not really possible to cut the patch into pieces as it changes the function's API, hence all callers. Tests were run on about every relevant part (cookie maxlife/maxidle, server inter, stats timeout, timeout*, cli's set timeout command, tcp-request/response inspect-delay).	2019-06-07 19:32:02 +02:00
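The kind of overflow detection described in the entry above can be sketched as follows. This is not parse_time_err() itself: the function name, the error codes, and the assumption that the result is stored as a signed 32-bit millisecond count are all hypothetical and chosen for illustration only.

```c
#include <stdint.h>

/* Hypothetical error codes, not HAProxy's actual ones. */
enum demo_time_err { DEMO_TIME_OK = 0, DEMO_TIME_OVERFLOW };

/* Converts <value> units of <unit_ms> milliseconds each into a signed
 * 32-bit millisecond count (assumed storage type). Reports an overflow
 * instead of silently wrapping. */
static enum demo_time_err demo_to_ms(uint64_t value, uint64_t unit_ms,
                                     int32_t *out_ms)
{
    /* Check before multiplying so the product is never computed
     * when it would not fit. */
    if (unit_ms != 0 && value > (uint64_t)INT32_MAX / unit_ms)
        return DEMO_TIME_OVERFLOW;

    *out_ms = (int32_t)(value * unit_ms);
    return DEMO_TIME_OK;
}
```

Under that 32-bit assumption, 24 days still fits (2073600000 ms) whereas 25 days does not, which is consistent with the 25d value mentioned in the linked discourse report.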
Frédéric Lécaille	b65717fa55	MINOR: peers: Optimization for dictionary cache lookup. When we look up a dictionary entry in the cache used upon transmission, we store the last result in ->prev_lookup of struct dcache_tx so that we can compare it with the subsequent entries to look up and improve performance.	2019-06-07 15:47:54 +02:00
Frédéric Lécaille	99de1d0479	MINOR: dict: Store the length of the dictionary entries. When allocating new dictionary entries we store the length of the strings. This may be useful to avoid calling strlen() too often at run time.	2019-06-07 15:47:54 +02:00
Frédéric Lécaille	6c39198b57	MINOR peers: data structure simplifications for server names dictionary cache. We store pointers to server names dictionary entries in a pre-allocated array of ebpt_node's (->entries member of struct dcache_tx) to cache those sent to remote peers. Consequently the ID used to identify the server name dictionary entry is also used as index for this array. There is no need to implement a lookup by key for this dictionary cache.	2019-06-07 15:47:54 +02:00
Willy Tarreau	1bfd6020ce	MINOR: logs: use the new bitmap functions instead of fd_sets for encoding maps The fd_sets we've been using in the log encoding functions are not portable and were shown to break at least under Cygwin. This patch gets rid of them in favor of the new bitmap functions. It was verified with the config below that the log output was exactly the same before and after the change : defaults mode http option httplog log stdout local0 timeout client 1s timeout server 1s timeout connect 1s frontend foo bind :8001 capture request header chars len 255 backend bar option httpchk "GET" "/" "HTTP/1.0\r\nchars: \x01\x02\x03\x04\x05\x06\x07\x08\x09\x0b\x0c\x0e\x0f\x10\x11\x12\x13\x14\x15\x16\x17\x18\x19\x1a\x1b\x1c\x1d\x1e\x1f\x20\x21\x22\x23\x24\x25\x26\x27\x28\x29\x2a\x2b\x2c\x2d\x2e\x2f\x30\x31\x32\x33\x34\x35\x36\x37\x38\x39\x3a\x3b\x3c\x3d\x3e\x3f\x40\x41\x42\x43\x44\x45\x46\x47\x48\x49\x4a\x4b\x4c\x4d\x4e\x4f\x50\x51\x52\x53\x54\x55\x56\x57\x58\x59\x5a\x5b\x5c\x5d\x5e\x5f\x60\x61\x62\x63\x64\x65\x66\x67\x68\x69\x6a\x6b\x6c\x6d\x6e\x6f\x70\x71\x72\x73\x74\x75\x76\x77\x78\x79\x7a\x7b\x7c\x7d\x7e\x7f\x80\x81\x82\x83\x84\x85\x86\x87\x88\x89\x8a\x8b\x8c\x8d\x8e\x8f\x90\x91\x92\x93\x94\x95\x96\x97\x98\x99\x9a\x9b\x9c\x9d\x9e\x9f\xa0\xa1\xa2\xa3\xa4\xa5\xa6\xa7\xa8\xa9\xaa\xab\xac\xad\xae\xaf\xb0\xb1\xb2\xb3\xb4\xb5\xb6\xb7\xb8\xb9\xba\xbb\xbc\xbd\xbe\xbf\xc0\xc1\xc2\xc3\xc4\xc5\xc6\xc7\xc8\xc9\xca\xcb\xcc\xcd\xce\xcf\xd0\xd1\xd2\xd3\xd4\xd5\xd6\xd7\xd8\xd9\xda\xdb\xdc\xdd\xde\xdf\xe0\xe1\xe2\xe3\xe4\xe5\xe6\xe7\xe8\xe9\xea\xeb\xec\xed\xee\xef\xf0\xf1\xf2\xf3\xf4\xf5\xf6\xf7\xf8\xf9\xfa\xfb\xfc\xfd\xfe\xff" server foo 127.0.0.1:8001 check	2019-06-07 11:13:24 +02:00
Willy Tarreau	7355b040d1	MINOR: tools: add new bitmap manipulation functions We now have ha_bit_{set,clr,flip,test} to manipulate bitfields made of arrays of longs. The goal is to get rid of the remaining non-portable FD_{SET,CLR,ISSET} that still exist at a few places.	2019-06-07 10:44:49 +02:00
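Bit manipulation on a bitfield made of an array of longs, as described in the entry above, can be sketched like this. The names below are hypothetical stand-ins; the actual ha_bit_{set,clr,flip,test} signatures are not shown in the commit message and are not reproduced here.

```c
#include <limits.h>
#include <stddef.h>

/* Number of bits held by one element of the array. */
#define DEMO_BITS_PER_LONG (sizeof(unsigned long) * CHAR_BIT)

/* Sets bit <bit> in <map>. */
static inline void demo_bit_set(unsigned long *map, size_t bit)
{
    map[bit / DEMO_BITS_PER_LONG] |= 1UL << (bit % DEMO_BITS_PER_LONG);
}

/* Clears bit <bit> in <map>. */
static inline void demo_bit_clr(unsigned long *map, size_t bit)
{
    map[bit / DEMO_BITS_PER_LONG] &= ~(1UL << (bit % DEMO_BITS_PER_LONG));
}

/* Inverts bit <bit> in <map>. */
static inline void demo_bit_flip(unsigned long *map, size_t bit)
{
    map[bit / DEMO_BITS_PER_LONG] ^= 1UL << (bit % DEMO_BITS_PER_LONG);
}

/* Returns 1 if bit <bit> is set in <map>, 0 otherwise. */
static inline int demo_bit_test(const unsigned long *map, size_t bit)
{
    return (int)((map[bit / DEMO_BITS_PER_LONG] >> (bit % DEMO_BITS_PER_LONG)) & 1UL);
}
```

Unlike fd_set, such an array has a size chosen by the caller, for example 256 bits to cover every byte value when building a character encoding map.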
Willy Tarreau	ad660e3f84	BUILD: stream-int: avoid a build warning in dev mode in si_state_bit() The BUG_ON() test emits a warning about an always-true comparison regarding <state> which cannot be lower than zero. Let's get rid of it.	2019-06-06 16:42:08 +02:00
Willy Tarreau	3b285d7fbd	MINOR: stream-int: make si_sync_send() from the send code of si_update_both() Just like we have a synchronous recv() function for the stream interface, let's have a synchronous send function that we'll be able to call from different places. For now this only moves the code, nothing more.	2019-06-06 16:36:19 +02:00
Willy Tarreau	236c4298b3	MINOR: stream-int: split si_update() into si_update_rx() and si_update_tx() We should not update the two directions at once, in fact we should update the Rx path after recv() and the Tx path after send(). Let's start by splitting the update function in two for this.	2019-06-06 16:36:19 +02:00
Willy Tarreau	8c603ded39	MEDIUM: stream-int: make idle-conns switch to ST_RDY The purpose of making idle-conns switch to SI_ST_CON was to make the transition detectable and the operation retryable in case of connection error. Now we have the RDY state for this which is much more suitable since it indicates a validated connection on which we didn't necessarily send anything yet. This will still lead to a transition to EST while not requiring unnatural write polling nor connect timeouts.	2019-06-06 16:36:19 +02:00
Willy Tarreau	4f283fa604	MEDIUM: stream-int: introduce a new state SI_ST_RDY The main reason for all the trouble we're facing with stream interface error or timeout reports during the connection phase is that we currently can't make the difference between a connection attempt and a validated connection attempt. It is problematic because we tend to switch early to SI_ST_EST but can't always do what we want in this state since it's supposed to be set when we don't need to visit sess_establish() again. This patch introduces a new state between SI_ST_CON and SI_ST_EST, which is SI_ST_RDY. It indicates that we've verified that the connection is ready. It's a transient state, like SI_ST_DIS, that cannot persist when leaving process_stream(). For now it is not set, only verified in various tests where SI_ST_CON was used or SI_ST_EST depending on the cases. The stream-int state diagram was minimally updated to reflect the new state, though it is largely obsolete and would need to be seriously updated.	2019-06-06 16:36:19 +02:00
Willy Tarreau	7ab22adbf7	MEDIUM: stream-int: remove dangerous interval checks for stream-int states The stream interface state checks involving ranges were replaced with checks on a set of states, already revealing some issues. No issue was fixed, all was replaced in a one-to-one mapping for easier control. Some checks involving a strict difference were also replaced with fields to be clearer. At this stage, the result must be strictly equivalent. A few tests were also turned to their bit-field equivalent for better readability or in preparation for upcoming changes. The test performed in the SPOE filter was swapped so that the closed and error states are evicted first and that the established vs conn state is tested second.	2019-06-06 16:36:19 +02:00
Willy Tarreau	bedcd698b3	MINOR: stream-int: use bit fields to match multiple stream-int states at once At some places we do check for ranges of stream-int states but those are confusing as states ordering is not well known (e.g. it's not obvious that CER is between CON and EST). Let's create a bit field from states so that we can match multiple states at once instead. The new enum si_state_bit contains SI_SB_* which are state bits instead of state values. The function si_state_in() indicates if the state in argument is one of those represented by the bit mask in second argument.	2019-06-06 16:36:19 +02:00
Olivier Houchard	03abf2d31e	MEDIUM: connections: Remove CONN_FL_SOCK* Now that the various handshakes come with their own XPRT, there's no need for the CONN_FL_SOCK* flags, and the conn_sock_want|stop functions, so garbage-collect them.	2019-06-05 18:03:38 +02:00
Olivier Houchard	fe50bfb82c	MEDIUM: connections: Introduce a handshake pseudo-XPRT. Add a new XPRT that is used when using non-SSL handshakes, such as proxy protocol or Netscaler, instead of taking care of it in conn_fd_handler(). This XPRT is installed when any of those is used, and it removes itself once the handshake is done. This should allow us to remove the distinction between CO_FL_SOCK* and CO_FL_XPRT*.	2019-06-05 18:03:38 +02:00
Olivier Houchard	2e055483ff	MINOR: connections: Add a new xprt method, add_xprt(). Add a new method to xprt_ops, add_xprt(), that changes the underlying xprt to the one provided, and optionally provide the old one.	2019-06-05 18:03:38 +02:00
Olivier Houchard	5149b59851	MINOR: connections: Add a new xprt method, remove_xprt. Add a new method to xprt_ops, remove_xprt. When called, if the provided xprt_ctx is the same as the xprt's underlying xprt_ctx, it then uses the new xprt provided, otherwise it calls the remove_xprt method of the next xprt. The goal is to be able to add a temporary xprt, that removes itself from the chain when it did what it had to do. This will be used to implement a pseudo-xprt for anything that just requires a handshake (such as the proxy protocol).	2019-06-05 18:03:38 +02:00
Olivier Houchard	000694cf96	MINOR: ssl: Make ssl_sock_handshake() static. ssl_sock_handshake is now only used by the ssl code itself, there's no need to export it anymore, so make it static.	2019-06-05 18:03:38 +02:00
Olivier Houchard	ea8dd949e4	MEDIUM: ssl: Handle subscribe by itself. As the SSL code may have different needs than the upper layer, i.e. it may want to receive when the upper layer wants to write, instead of directly forwarding the subscribe to the underlying xprt, handle it ourselves. The SSL code will now remember any subscribe call, and wake the tasklet when it is ready for more I/O.	2019-06-05 18:03:38 +02:00
Christopher Faulet	54b5e214b0	MINOR: htx: Don't use end-of-data blocks anymore This type of block is useless because the transition between data and trailers is obvious. And when there are no trailers, the end-of-message is still there to know when data end for chunked messages.	2019-06-05 10:12:11 +02:00
Christopher Faulet	2d7c5395ed	MEDIUM: htx: Add the parsing of trailers of chunked messages HTTP trailers are now parsed in the same way headers are. It means trailers are converted to K/V blocks followed by an end-of-trailer marker. For now, to make things simple, the type for trailer blocks is not the same as for header blocks. But the aim is to make no difference between headers and trailers by using the same type. Probably for the end-of marker too.	2019-06-05 10:12:11 +02:00
Christopher Faulet	8f3c256f7e	MEDIUM: cache/htx: Always store info about HTX blocks in the cache It was only done for the headers (including the EOH marker). Data were prefixed by the info field of these blocks. The payload and the trailers of the messages were stored in raw. The total size of headers and payload were kept in the cached object state to help output formatting. Now, info about each HTX block is stored in the cache. Only data are allowed to be split. Otherwise, all blocks of an HTX message are handled the same way, both when storing a message in the cache and when delivering it from the cache. This will help the cache implementation to be more robust to internal changes in the HTX. Especially for the upcoming parsing of trailers. There is also no more need to keep extra info in the cached object state.	2019-06-05 10:12:11 +02:00
Christopher Faulet	a4f9dd4a56	BUG/MINOR: channel/htx: Don't alter channel during forward for empty HTX message In channel_htx_forward() and channel_htx_forward_forever(), if the HTX message is empty, the underlying buffer may be really empty too. And we have no warranty the caller will call htx_to_buf() later. And in practice, it is almost never done. So the channel's buffer must not be altered. Otherwise, the buffer may be considered as full (data == size) for an empty HTX message and no outgoing data. This patch must be backported to 1.9.	2019-06-05 10:12:11 +02:00
Frédéric Lécaille	8d78fa7def	MINOR: peers: Make peers protocol support new "server_name" data type. Use the APIs implemented for dictionaries (dict.c) and their LRU caches (struct dcache) to send/receive the server names used for server-by-name stickiness. These names are sent over the network as follows: - in every case we send the encoded length of the data (STD_T_DICT), then - if the server name is not present in the cache used upon transmission (struct dcache_tx) we cache it and we send the ID of this TX cache entry followed by the encoded length of the server name, and finally the server name itself (non NULL-terminated string). - if the server name is present, we repeat these operations but we only send the TX cache entry ID. Upon receipt, the (cache ID, server name) pairs are stored in the LRU cache used only upon receipt (struct dcache_rx). As the peers protocol is symmetrical, the presence (resp. absence) of the server name in the received data denotes that the entry is absent from (resp. present in) the cache.	2019-06-05 08:42:33 +02:00
Frédéric Lécaille	7da71293e4	MINOR: server: Add a dictionary for server names. This patch only declares and defines a dictionary for the server names (stored as ->id member field).	2019-06-05 08:33:35 +02:00
Frédéric Lécaille	84d6046a33	MINOR: proxy: Add a "server by name" tree to proxy. Add a tree to the proxy struct to look up by name the servers attached to this proxy, and populate it at parsing time.	2019-06-05 08:33:35 +02:00
Frédéric Lécaille	5ad57ea85f	MINOR: stick-table: Add "server_name" new data type. This simple patch only adds definitions to create a new stick-table data type ID and a new standard type to store information in relation with dictionary entries (STD_T_DICT).	2019-06-05 08:33:35 +02:00
Frédéric Lécaille	74167b25f7	MINOR: peers: Add a LRU cache implementation for dictionaries. We want to send some stick-table data fields stored as strings in dictionaries without consuming too much memory and CPU. To do so we implement with this patch a cache for sent/received dictionary entries. These string dictionary entries are stored in other real dictionary entries with an identifier as key (unsigned int) and a pointer to the string dictionary entries as values.	2019-06-05 08:33:35 +02:00
Frédéric Lécaille	4a3fef834c	MINOR: dict: Add dictionary new data structure. This patch adds minimalistic definitions to implement a new dictionary data structure, which is an ebtree of ebpt_node structs with strings as keys. Note that this has nothing to do with a real dictionary data structure (maps of keys in association with values).	2019-06-05 08:33:35 +02:00
Frédéric Lécaille	1673bbdf98	CLEANUP: peers: Remove tabs characters. This patch only replaces very annoying tabulation characters with spaces, so as not to have to use tabulations again where they should not be used.	2019-06-05 08:33:34 +02:00
Willy Tarreau	7bb39d7cd6	CLEANUP: connection: remove the now unused CS_FL_REOS flag Let's remove it before it gets used again. It was mostly replaced with CS_FL_EOI and by mux-specific states or flags.	2019-06-03 14:23:33 +02:00
Willy Tarreau	7067b3a92e	BUG/MINOR: deinit/threads: make hard-stop-after perform a clean exit As reported in GH issue #99, when hard-stop-after triggers and threads are in use, the chance that any thread releases the resources in use by the other ones is non-null. Thus no thread should be allowed to deinit() nor exit by itself. Here we take a different approach. We simply use a 3rd possible value for the "killed" variable so that all threads know they must break out of the run-poll-loop and immediately stop. This patch was tested by commenting the stream_shutdown() calls in hard_stop() to increase the chances to see a stream use released resources. With this fix applied, it never crashes anymore. This fix should be backported to 1.9 and 1.8.	2019-06-02 11:30:07 +02:00
Alexander Liu	2a54bb74cd	MEDIUM: connection: Upstream SOCKS4 proxy support Add the "socks4" and "check-via-socks4" server keywords. Implement the handshake with a SOCKS4 proxy server for TCP stream connections. See issue #82. The "SOCKS: A protocol for TCP proxy across firewalls" document can be found at "https://www.openssh.com/txt/socks4.protocol". Please refer to it. [wt: for now connecting to the SOCKS4 proxy over unix sockets is not supported, and mixing IPv4/IPv6 is discouraged; indeed, the control layer is unique for a connection and will be used both for connecting and for target address manipulation. As such it may for example report incorrect destination addresses in logs if the proxy is reached over IPv6]	2019-05-31 17:24:06 +02:00
Olivier Houchard	cfbb3e6560	MEDIUM: tasks: Get rid of active_tasks_mask. Remove the active_tasks_mask variable, we can deduce if we've work to do by other means, and it is costly to maintain. Instead, introduce a new function, thread_has_tasks(), that returns non-zero if there's tasks scheduled for the thread, zero otherwise.	2019-05-29 21:53:37 +02:00
Olivier Houchard	250031e444	MEDIUM: sessions: Introduce session flags. Add session flags, and add a new flag, SESS_FL_PREFER_LAST, to be set when we use NTLM authentication, and we should reuse the last connection. This should fix using NTLM with HTX. This totally replaces TX_PREFER_LAST. This should be backported to 1.9.	2019-05-29 15:41:47 +02:00
Willy Tarreau	ef28dc11e3	MINOR: task: turn the WQ lock to an RW_LOCK For now it's exclusively used as a write lock though, thus it remains 100% equivalent to the spinlock it replaces.	2019-05-28 19:15:44 +02:00
Willy Tarreau	186e96ece0	MEDIUM: buffers: relax the buffer lock a little bit In lock profiles it's visible that there is a huge contention on the buffer lock. The reason is that when offer_buffers() is called, it systematically takes the lock before verifying if there is any waiter. However doing so doesn't protect against races since a waiter can happen just after we release the lock as well. Similarly in h2 we take the lock every time an h2c is going to be released, even without checking that the h2c belongs to a wait list. These two have now been addressed by verifying non-emptiness of the list prior to taking the lock.	2019-05-28 17:25:21 +02:00
Willy Tarreau	a8b2ce02b8	MINOR: activity: report the number of failed pool/buffer allocations Haproxy is designed to be able to continue to run even under very low memory conditions. However this can sometimes have a serious impact on performance that is hard to diagnose. Let's report counters of failed pool and buffer allocations per thread in show activity.	2019-05-28 17:25:21 +02:00
Willy Tarreau	2ae84e445d	MEDIUM: poller: separate the wait time from the wake events We have been abusing the do_poll()'s timeout for a while, making it zero whenever there is some known activity. The problem this poses is that it complicates activity diagnostic by incrementing the poll_exp field for each known activity. It also requires extra computations that could be avoided. This change passes a "wake" argument to say that the poller must not sleep. This simplifies the operations and allows one to differentiate expirations from activity.	2019-05-28 17:25:21 +02:00
Willy Tarreau	0a7ef02074	MINOR: htx: make htx_add_data() return the transmitted byte count In order to later allow htx_add_data() to transmit partial blocks and avoid defragmenting the buffer, we'll need to return the number of bytes consumed. This first modification makes the function do this and its callers take this into account. At the moment the function still works atomically so it returns either the block size or zero. However all call places have been adapted to consider any value between zero and the block size.	2019-05-28 14:48:59 +02:00
Willy Tarreau	d4908fa465	MINOR: htx: rename htx_append_blk_value() to htx_add_data_atonce() This function is now dedicated to data blocks, and we'll soon need to access it from outside in a rare few cases. Let's rename it and export it.	2019-05-28 14:48:59 +02:00
Christopher Faulet	39744f792d	MINOR: htx: Remove support of pseudo headers because it is unused The code to handle pseudo headers is unused and with no real value. So remove it.	2019-05-28 07:42:33 +02:00
Christopher Faulet	613346b60e	MINOR: htx: remove the unused function htx_find_blk()	2019-05-28 07:42:33 +02:00
Christopher Faulet	dab5ab551d	MINOR: channel/htx: Add functions to forward a part or all HTX payload The functions channel_htx_fwd_payload() and channel_htx_fwd_all() should now be used to forward, respectively, a part of the HTX payload or all of it. These functions forward data and update the first block position.	2019-05-28 07:42:33 +02:00
Christopher Faulet	29f1758285	MEDIUM: htx: Store the first block position instead of the start-line one We don't store the start-line position anymore in the HTX message. Instead we store the first block position to analyze. For now, it is almost the same. But once all changes have been made on this part, this position will have to be used by HTX analyzers, and only in the analysis context, to know where the analysis should start. When new blocks are added in an HTX message, if the first block position is not defined, it is set. When the block pointed by it is removed, it is set to the block following it. -1 remains the value to unset the position. The first block position is unset when the HTX message is empty. It may also be unset on a non-empty message, meaning all blocks were already analyzed. From HTX analyzers point of view, this position is always set during headers analysis. When they are waiting for a request or a response, if it is unset, it means the analysis should wait. But once the analysis is started, and as long as headers are not forwarded, it points to the message start-line. As mentioned, outside the HTX analysis, no code must rely on the first block position. So multiplexers and applets must always use the head position to start a loop on an HTX message.	2019-05-28 07:42:33 +02:00
Christopher Faulet	b2f4e83a28	MINOR: channel/htx: Add function to forward headers of an HTX message The function channel_htx_fwd_headers() should now be used by HTX analyzers to forward all headers of an HTX message, from the start-line to the corresponding EOH. It takes care to update the start-line position.	2019-05-28 07:42:33 +02:00
Christopher Faulet	05c083ca8d	MINOR: htx: Add a field to set the memory used by headers in the HTX start-line The field hdrs_bytes has been added in the structure htx_sl. It should be used to set how many bytes are held by all headers, from the start-line to the corresponding EOH block. It must be set to -1 if it is unknown.	2019-05-28 07:42:12 +02:00
Christopher Faulet	9b04d22945	MINOR: connection: Remove the unused flag CO_RFL_KEEP_RSV	2019-05-28 07:42:12 +02:00
Christopher Faulet	2ae35045e2	MINOR: htx: Add function htx_get_max_blksz() This function should be used to get the maximum size for a block, not exceeding the max amount of bytes passed in argument. Thus max may be set to -1 to have no limit.	2019-05-28 07:42:12 +02:00
Christopher Faulet	aad458587d	MINOR: channel/htx: Call channel_htx_recv_max() from channel_recv_max() When channel_recv_max() is called for an HTX stream, we fall back on the HTX version. This function is called from si_cs_recv(). This will let us pass the max amount of bytes to read to HTX multiplexers.	2019-05-28 07:42:12 +02:00
Christopher Faulet	dd2ad8518f	CLEANUP: htx: Remove unused function htx_get_stline()	2019-05-28 07:42:12 +02:00
Christopher Faulet	297fbb45fe	MINOR: htx: Replace the function http_find_stline() by http_get_stline() Now, we only return the start-line. If not found, NULL is returned. No lookup is performed and the HTX message is no longer updated. It is now the caller's responsibility to update the position of the start-line to the right value. So when it is not found, i.e. sl_pos is set to -1, it means the last start-line has been already processed and the next one has not been inserted yet. It is mandatory to rely on this kind of guarantee to store 1xx informational responses and the final response in the same HTX message.	2019-05-28 07:42:12 +02:00
Christopher Faulet	a3ad6b1b8f	MINOR: htx: Add functions to get the first block of an HTX message It is the first block relatively to the start-line. So it is the start-line if its position is set (sl_pos != -1), otherwise it is the head. The functions htx_get_first() and htx_get_first_blk() can be used to get it. This change is mandatory to consider 1xx informational messages as part of a response.	2019-05-28 07:42:12 +02:00
Christopher Faulet	9c66b980fa	MINOR: htx: Store start-line block's position instead of address of its payload Nothing much to say. This change is just mandatory to consider 1xx informational messages as part of a response.	2019-05-28 07:42:12 +02:00
Christopher Faulet	28f29c7eea	MINOR: htx: Store the head position instead of the wrap one The head of an HTX message is heavily used whereas the wrap position is only used when a block is added or removed. So it is more logical to store the head position in the HTX message instead of the wrap one. The wrap position can be easily deduced. To get it, the new function htx_get_wrap() may be used.	2019-05-28 07:42:12 +02:00
Christopher Faulet	c8b246f108	MINOR: htx: Move the macro IS_HTX_STRM() in proto/stream.h The macro IS_HTX_STRM() only relies on stream flags. So move it in proto/stream.h.	2019-05-28 07:42:12 +02:00
Christopher Faulet	429b91d308	MINOR: htx: Remove the macro IS_HTX_SMP() and always use IS_HTX_STRM() instead The macro IS_HTX_SMP() is only used at a place, in a context where the stream always exists. So, we can remove it to use IS_HTX_STRM() instead.	2019-05-28 07:42:12 +02:00
Willy Tarreau	c3b5958255	BUG/MEDIUM: threads: fix double-word CAS on non-optimized 32-bit platforms On armv7 haproxy doesn't work because of the fixes on the double-word CAS. There are two issues. The first one is that the last argument in case of dwcas is a pointer to the set of values and not a value; the second is that it's not enough to cast the data as (void*) since it will be a single word. Let's fix this by using the pointers as an array of long. This was tested on i386, armv7, x86_64 and aarch64 and it is now fine. An alternate approach using a struct was attempted as well but it used to produce less optimal code. This fix must be backported to 1.9. This fixes github issue #105. Cc: Olivier Houchard <ohouchard@haproxy.com>	2019-05-27 17:40:59 +02:00
Willy Tarreau	d6a7850200	MINOR: cli/activity: add 3 general purpose counters in development mode The unused fd_del and fd_skip were being abused during debugging sessions as general purpose event counters. With their removal, let's officially have dedicated counters for such use cases. These counters are called "ctr0".."ctr2" and are listed at the end when DEBUG_DEV is set.	2019-05-27 07:03:38 +02:00
Willy Tarreau	394c9b4215	MINOR: cli/activity: remove "fd_del" and "fd_skip" from show activity These variables are never set anymore and were always reported as zero.	2019-05-27 06:59:14 +02:00
Willy Tarreau	c4943d5170	MINOR: buffer: add a new buffer ring API to manipulate rings of buffers The purpose is to manipulate rings made of series of buffers so that it is possible to continue to work on a next buffer once one is full. This will be used by muxes to deal with contention between multiple streams and a single output buffer. No data is expected to span over multiple buffers, all of them will be used like a regular buffer. This will significantly limit the amount of changes and the code complexity while still supporting larger output buffering. The ring is made of a head and a tail indexes both of which point to a buffer descriptor. At least one descriptor is always valid, so it could be seen as a form of pagination always presenting one buffer. The root of the ring is itself stored into a buffer descriptor so that the user only has to declare a buffer array and to call br_init() on it in order to use it.	2019-05-26 09:26:59 +02:00
Willy Tarreau	e39b58f045	MINOR: buffer: introduce b_make() to make a buffer from its parameters This is convenient to assign a buffer from parts of another one.	2019-05-26 09:26:59 +02:00
Willy Tarreau	7562a7291d	CLEANUP: debug: remove the TRACE() macro It has not been used for many years, is unlikely to be reused and conflicts with the similarly named macro in flt_trace, causing warnings at build time when including debug.h in low-level files. Let's simply remove it.	2019-05-26 09:25:59 +02:00
Willy Tarreau	0d6c75d749	OPTIM: freq-ctr: don't take the date lock for most updates It's amazing that the value was still incremented under the date lock, let's first use an atomic increment for the counter and move it out of the date lock to reduce contention. These are just counters, we don't need to take locks if we're not rotating, atomic ops are enough. This patch does this, and leaves the lock for when the period is over. It's important to note that some values might be added just before or just after a rotation but this is not a problem since we don't care if a value is counted in the previous or next period when it's exactly on the edge. Great care was taken to ensure that the current counter is always atomically updated. Other minor cleanups were performed, such as avoiding to reload the value from memory after a CAS, or using &~1 instead of two shifts to remove the lowest bit.	2019-05-25 20:31:53 +02:00
Willy Tarreau	7cf0e4517d	MINOR: raw_sock: report global traffic statistics Many times we've been missing per-process traffic statistics. While it didn't make sense in multi-process mode, with threads it does. Thus we now have a counter of bytes emitted by raw_sock, and a freq counter for these as well. However, freq_ctr are limited to 32 bits, and given that loads of 300 Gbps have already been reached over a loopback using splicing, we need to downscale this a bit. Here we're storing 1/32 of the byte rate, which gives a theoretical limit of 128 GB/s or ~1 Tbps, which is more than enough. Let's have fun re-reading this sentence in 2029 :-) The values can be read in "show info" output on the CLI.	2019-05-23 11:45:38 +02:00
Willy Tarreau	f4c1e56b5e	BUILD: signals: FreeBSD has SI_LWP instead of SI_TKILL SI_TKILL is for Linux. We're again in the non-portable area. Both OSes use macros to define these values so we can #ifdef them. Let's make SI_TKILL defined based on SI_LWP when only the latter is defined.	2019-05-23 08:40:50 +02:00
Willy Tarreau	96d5195862	MEDIUM: config: deprecate the antique req* and rsp* commands These commands don't follow the same flow as the rest of the commands, each of them iterates over all header lines before switching to the next directive. In addition they make no distinction between start line and headers and can lead to unparsable rewrites which are very difficult to deal with internally. Most of them are still occasionally found in configurations, mainly because of the usual "we've always done this way". By marking them deprecated and emitting a warning and recommendation on first use of each of them, we will raise users' awareness regarding the cleaner, faster and more reliable alternatives. Some use cases of "reqrep" still appear from time to time for URL rewriting that is not so convenient with other rules. But at least users facing this requirement will explain their use case so that we can best serve them. Some discussion started on this subject in a thread linked to from github issue #100. The goal is to remove them in 2.1 since they require to reparse the result before indexing it and we don't want this hack to live long. The following directives were marked deprecated : -reqadd -reqallow -reqdel -reqdeny -reqiallow -reqidel -reqideny -reqipass -reqirep -reqitarpit -reqpass -reqrep -reqtarpit -rspadd -rspdel -rspdeny -rspidel -rspideny -rspirep -rsprep	2019-05-22 20:43:45 +02:00
Willy Tarreau	d1f56c9a01	BUG/MEDIUM: dns: make the port numbers unsigned Mustafa Yildirim reported in Discourse that ports >32767 advertised in SRV records are wrong. Given the high value they definitely correspond to a sign extension of a negative number. The cause was indeed that the port is declared as a signed int in the dns_answer_item structure, and Lukas confirmed in github issue #103 that turning it to unsigned addresses the issue. It is worth noting that there are other such fields in this structure that don't look right (ttl, priority, class, type) and that someone should audit this part to be certain they are properly typed. This fix must be backported to 1.9 and likely to 1.8 as well.	2019-05-22 20:07:45 +02:00
Willy Tarreau	e5733234f6	CLEANUP: build: rename some build macros to use the USE_* ones We still have quite a number of build macros which are mapped 1:1 to a USE_something setting in the makefile but which have a different name. This patch cleans this up by renaming them to use the USE_something one, allowing to clean up the makefile and make it more obvious when reading the code what build option needs to be added. The following renames were done : ENABLE_POLL -> USE_POLL ENABLE_EPOLL -> USE_EPOLL ENABLE_KQUEUE -> USE_KQUEUE ENABLE_EVPORTS -> USE_EVPORTS TPROXY -> USE_TPROXY NETFILTER -> USE_NETFILTER NEED_CRYPT_H -> USE_CRYPT_H CONFIG_HAP_CRYPT -> USE_LIBCRYPT CONFIG_HAP_NS -> USE_NS CONFIG_HAP_LINUX_SPLICE -> USE_LINUX_SPLICE CONFIG_HAP_LINUX_TPROXY -> USE_LINUX_TPROXY CONFIG_HAP_LINUX_VSYSCALL -> USE_LINUX_VSYSCALL	2019-05-22 19:47:57 +02:00
Willy Tarreau	823bda0eb7	BUILD: time: remove the test on _POSIX_C_SOURCE It seems it's not defined on FreeBSD while it's mentioned on Linux that clock_gettime() can be detected using this. Given that we also have the test for _POSIX_TIMERS>0 that should cover it well enough. If it breaks on other systems, we'll see. Report was here : https://github.com/haproxy/haproxy/runs/133866993	2019-05-22 19:14:59 +02:00
Willy Tarreau	082b62828d	BUG/MEDIUM: init/threads: provide per-thread alloc/free function callbacks We currently have the ability to register functions to be called early on thread creation and at thread deinitialization. It turns out this is not sufficient because certain such functions may use resources that are being allocated by the other ones, thus creating a race condition depending only on the linking order. For example the mworker needs to register a file descriptor while the pollers will reallocate the fd_updt[] array. Similarly logs and trashes may be used by some init functions while it's unclear whether they have been deduplicated. The same issue happens on deinit, if the fd_updt[] or trash is released before some functions finish to use them, we'll get into trouble. This patch creates a couple of early and late callbacks for per-thread allocation/freeing of resources. A few init functions were moved there, and the fd init code was split between the two (since it used to both allocate and initialize at once). This way the init/deinit sequence is expected to be safe now. This patch should be backported to 1.9 as at least the trash/log issue seems to be present. The run_thread_poll_loop() code is a bit different there as the mworker is not a callback, but it will have no effect and it's enough to drop the mworker changes. This bug was reported by Ilya Shipitsin in github issue #104.	2019-05-22 14:59:08 +02:00
Willy Tarreau	ca2a3cc8d5	MINOR: connection: report the mux names in "haproxy -vv" Since the mux names appear at a few places (dumps etc), let's list them in front of supported mux protocols in "haproxy -vv".	2019-05-22 11:50:48 +02:00
Willy Tarreau	430f590b5b	MINOR: threads: add a timer_t per thread in thread_info This will be used by the watchdog to detect that a thread locked up. It's only defined on platforms supporting it. This patch only reserves the room for the timer in the struct. A special value was reserved for the uninitialized timer. The problem is that the POSIX API was horribly designed, defining no invalid value, thus for each timer it is required to keep a second variable to indicate whether it's valid. A quick check shows that defining a 32-bit invalid value is not something uncommon across other implementations, with ~0 being common. Let's try with this and if it causes issues we can revisit this decision.	2019-05-22 11:50:48 +02:00
Willy Tarreau	e6a02fa65a	MINOR: threads: add a "stuck" flag to the thread_info struct This flag is constantly cleared by the scheduler and will be set by the watchdog timer to detect stuck threads. It is also set by the "show threads" command so that it is easy to spot if the situation has evolved between two subsequent calls : if the first "show threads" shows no stuck thread and the second one shows such a stuck thread, it indicates that this thread didn't manage to make any forward progress since the previous call, which is extremely suspicious.	2019-05-22 11:50:48 +02:00
Willy Tarreau	5484d58a17	MINOR: stream: introduce a stream_dump() function and use it in stream_dump_and_crash() This function dumps a lot of information about a stream into the provided buffer. It is now used by stream_dump_and_crash() and will be used by the debugger as well.	2019-05-22 11:50:48 +02:00
Willy Tarreau	2beaaf7d46	MINOR: threads: implement ha_tkill() and ha_tkillall() These functions are used respectively to signal one thread or all threads. When multithreading is disabled, it's always the current thread which is signaled.	2019-05-22 11:50:48 +02:00
Willy Tarreau	441259c561	MINOR: threads: make threads_{harmless\|want_rdv}_mask constant 0 without threads Some code starts to add ifdefs everywhere to work around the lack of threads_harmless_mask when threads are not compiled in. This one is often used to indicate a thread having joined the rendez-vous point or a thread sleeping in the poller. By setting it to zero we translate what usually is required in debugging code (i.e. the only thread is currently working) and for signal handlers we can use a combination of threads_harmless_mask and sleeping_threads_mask to detect the polling cases as well. Similarly do the same with threads_want_rdv_mask which is less often used though.	2019-05-22 11:50:48 +02:00
Willy Tarreau	6ea63c301d	CLEANUP: objtype: make obj_type() and obj_type_name() take consts There is no reason for them to require a writable area.	2019-05-22 11:50:48 +02:00
Tim Duesterhus	9b7a976cd6	BUG/MINOR: mworker: Fix memory leak of mworker_proc members The struct mworker_proc is not uniformly freed everywhere, sometimes leading to leaks of the `id` string (and possibly the other strings). Introduce a mworker_free_child function instead of duplicating the freeing logic everywhere to prevent this kind of issues. This leak was reported in issue #96. It looks like the leaks have been introduced in commit `9a1ee7ac31`, which is specific to 2.0-dev. Backporting `mworker_free_child` might be helpful to ease backporting other fixes, though.	2019-05-22 11:29:18 +02:00
Willy Tarreau	80daaa1e9d	CLEANUP: time: switch clockid_t to empty_t when not available This is cleaner than using an int. We also get rid of the constants that we don't need nor use.	2019-05-21 20:03:03 +02:00
Willy Tarreau	9a85a1700b	MINOR: compat: define a new empty type empty_t for non-implemented fields Some structures have optional fields which depend on availability of certain features on certain platforms, and having to stuff lots of ifdefs in these structs makes them unreadable. Using real values like ints requires some initialization and adds even more confusion. Here we take a different approach : we create an empty type called empty_t to use as a substitute for the real type that is not implemented and which doesn't contain any value (it's an empty struct). Thus it has a size of zero but an address, thus a pointer may point to it. It will not have to be initialized though. Some initialization code might even continue to work and do nothing like initializing it using memset with its sizeof which is zero.	2019-05-21 20:03:03 +02:00
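The empty-type approach described above can be illustrated with a small sketch. It assumes a compiler that accepts member-less structs as an extension (GCC and Clang do, ISO C does not); `struct thread_ctx_demo`, `thread_ctx_demo_init()` and the `HAVE_CLOCK_DEMO` macro are hypothetical names used only for illustration and are not taken from the HAProxy sources.

```c
#include <stddef.h>
#include <string.h>

/* Empty placeholder type. This relies on the GNU C extension that allows
 * structs without members; sizeof(empty_t) is 0 with GCC and Clang.
 */
typedef struct { } empty_t;

/* Hypothetical struct whose optional field depends on a feature macro. */
struct thread_ctx_demo {
	int id;
#ifdef HAVE_CLOCK_DEMO
	long clock_id;      /* real field when the feature is available */
#else
	empty_t clock_id;   /* zero-sized stand-in otherwise */
#endif
};

/* The same initialization code compiles in both cases; with empty_t the
 * memset() covers zero bytes and therefore does nothing.
 */
static void thread_ctx_demo_init(struct thread_ctx_demo *ctx, int id)
{
	ctx->id = id;
	memset(&ctx->clock_id, 0, sizeof(ctx->clock_id));
}
```

The field keeps an address, so code taking `&ctx->clock_id` still builds without any extra ifdef at the call site.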
Willy Tarreau	f61782418c	CLEANUP: time: refine the test on _POSIX_TIMERS The clock_gettime() man page says we must check that _POSIX_TIMERS is defined to a value greater than zero, not just that it's simply defined so let's fix this right now.	2019-05-21 20:03:03 +02:00
Emmanuel Hocdet	0ba4f483d2	MAJOR: polling: add event ports support (Solaris) Event ports are kqueue/epoll polling class for Solaris. Code is based on https://github.com/joyent/haproxy-1.8/tree/joyent/dev-v1.8.8. Event ports are available only on SunOS systems derived from Solaris 10 and later (including illumos systems).	2019-05-21 15:16:45 +02:00
Willy Tarreau	219b829b62	MINOR: time: add a function to retrieve another thread's cputime now_cpu_time_thread() does the same as now_cpu_time() but for another thread based on its clockid.	2019-05-20 21:14:14 +02:00
Willy Tarreau	81036f2738	MINOR: time: move the cpu, mono, and idle time to thread_info These ones are useful across all threads and would be better placed in struct thread_info than thread-local. There are very few users.	2019-05-20 21:14:14 +02:00
Willy Tarreau	8323a375bc	MINOR: threads: add a thread-local thread_info pointer "ti" Since we're likely to access this thread_info struct more frequently in the future, let's reserve the thread-local symbol to access it directly and avoid always having to combine thread_info and tid. This pointer is set when tid is set.	2019-05-20 21:14:12 +02:00
Willy Tarreau	624dcbf41e	MINOR: threads: always place the clockid in the struct thread_info It will be easier to deal with the internal API to always have it.	2019-05-20 21:13:01 +02:00
Willy Tarreau	b81939cef0	MINOR: compat: make sure to always define clockid_t In order to ease the internal time API, we'll have the threads time always present even when threads are disabled. Let's make sure clockid_t, and the minimum clock times are defined even on older or non-compatible systems.	2019-05-20 20:24:10 +02:00
Willy Tarreau	5a6e2245fa	REORG: threads: move the struct thread_info from global.h to hathreads.h It doesn't make sense to keep this struct thread_info in global.h, it causes difficulties to access its contents from hathreads.h, let's move it to the threads where it ought to have been created.	2019-05-20 20:00:25 +02:00
Willy Tarreau	e3e2b7283f	REORG: compat: move some integer limit definitions from standard.h to compat.h Historically standard.h was the location where we used to (re-)define the standard set of macros and functions, and to complement the ones missing on the target OS. Over time it has become a toolbox in itself relying on many other things, and its definition of LONGBITS is used everywhere else (e.g. for MAX_THREADS), resulting in painful circular dependencies. Let's move these few defines (integer sizes) to compat.h where other similar definitions normally are.	2019-05-20 19:59:34 +02:00
Willy Tarreau	3710105945	MINOR: tools: provide a may_access() function and make dump_hex() use it It's a bit too easy to crash by accident when using dump_hex() on any area. Let's have a function to check if the memory may safely be read first. This one abuses the stat() syscall checking if it returns EFAULT or not, in which case it means we're not allowed to read from there. In other situations it may return other codes or even a success if the area pointed to by the file exists. It's important not to abuse it though and as such it's tested only once per output line.	2019-05-20 16:59:37 +02:00
Willy Tarreau	56131ca58e	MINOR: debug: implement ha_panic() This function dumps all existing threads using the thread dump mechanism then aborts. This will be used by the lockup detection and by debugging tools.	2019-05-20 16:51:30 +02:00
Willy Tarreau	9fc5dcbd71	MINOR: tools: add dump_hex() This is used to dump a memory area into a buffer for debugging purposes.	2019-05-20 16:51:30 +02:00
Willy Tarreau	91e6df01fa	MINOR: threads: add each thread's clockid into the global thread_info This is the per-thread CPU runtime clock, it will be used to measure the CPU usage of each thread and by the lockup detection mechanism. It must only be retrieved at the beginning of run_thread_poll_loop() since the thread must already have been started for this. But it must be done before performing any per-thread initcall so that all thread init functions have access to the clock ID. Note that it could make sense to always have this clockid available even in non-threaded situations and place the process' clock there instead. But it would add portability issues which are currently easy to deal with by disabling threads so it may not be worth it for now.	2019-05-20 11:42:25 +02:00
Willy Tarreau	522cfbc1ea	MINOR: init/threads: make the global threads an array of structs This way we'll be able to store more per-thread information than just the pthread pointer. The storage became an array of struct instead of an allocated array since it's very small (typically 512 bytes) and not worth the hassle of dealing with memory allocation on this. The array was also renamed thread_info to make its intended usage more explicit.	2019-05-20 11:37:57 +02:00
Willy Tarreau	b49a58dda2	CLEANUP: threads: remove the now unused START_LOCK label The last two users are now gone.	2019-05-20 11:26:12 +02:00
Willy Tarreau	619a95f5ad	MEDIUM: init/mworker: make the pipe register function a regular initcall Now that we have the guarantee that init calls happen before any other thread starts, we don't need anymore the workaround installed by commit `1605c7ae6` ("BUG/MEDIUM: threads/mworker: fix a race on startup") and we can instead rely on a regular per-thread initcall for this function. It will only be performed on worker thread #0, the other ones and the master have nothing to do, just like in the original code that was only moved to the function.	2019-05-20 11:26:12 +02:00
Willy Tarreau	c7091d89ae	MEDIUM: debug/threads: implement an advanced thread dump system The current "show threads" command was too limited as it was not possible to dump other threads' detailed states (e.g. their tasks). This patch goes further by using thread signals so that each thread can dump its own state in turn into a shared buffer provided by the caller. Threads are synchronized using a mechanism very similar to the rendez-vous point and using this method, each thread can safely dump any of its contents and the caller can finally report the aggregated ones from the buffer. It is important to keep in mind that the list of signal-safe functions is limited, so we take care of only using chunk_printf() to write to a pre-allocated buffer. This mechanism is enabled by USE_THREAD_DUMP and is enabled by default on Linux 2.6.28+. On other platforms it falls back to the previous solution using the loop and the less precise dump.	2019-05-17 17:16:20 +02:00
Willy Tarreau	29bf96d73d	MINOR: task: always reset curr_task when freeing a task or tasklet With the thread debugger it becomes visible that we can leave some wandering pointers for a while in curr_task, which is inappropriate. This patch addresses this by resetting curr_task to NULL before really freeing the area. This way it becomes safe even regarding signals.	2019-05-17 17:16:20 +02:00
Willy Tarreau	38171daf21	MINOR: thread: implement ha_thread_relax() At some places we're using a painful ifdef to decide whether to use sched_yield() or pl_cpu_relax() to relax in loops, this is hardly exportable. Let's move this to ha_thread_relax() instead and use this one only.	2019-05-17 17:16:20 +02:00
Willy Tarreau	5cf64dd1bd	MINOR: debug: make ha_thread_dump() and ha_task_dump() take a buffer Instead of having them dump into the trash and initialize it, let's have the caller initialize a buffer and pass it. This will be convenient to dump multiple threads at once into a single buffer.	2019-05-17 17:16:20 +02:00
Willy Tarreau	4e2b646d60	MINOR: cli/debug: add a thread dump function The new function ha_thread_dump() will dump debugging info about all known threads. The current thread will contain a bit more info. The long-term goal is to make it possible to use it in signal handlers to improve the accuracy of some dumps. The function dumps its output into the trash so as it was trivial to add, a new "show threads" command appeared on the CLI.	2019-05-16 18:06:45 +02:00
Willy Tarreau	aa1e1be88f	MINOR: task: export global_task_mask It will be used in debugging functions and must be exported.	2019-05-16 18:02:03 +02:00
Tim Duesterhus	10c6c16cde	MEDIUM: Make 'option forceclose' actually warn It is deprecated since `315b39c391` (1.9-dev), but only was deprecated in the docs. Make it warn when being used and remove it from the docs.	2019-05-16 18:02:03 +02:00
Willy Tarreau	0f35c593f6	BUILD: ist: turn the lower/upper case tables to literal on obsolete linkers Gil Bahat reported build issues on Cygwin starting with 1.9 due to a difference in the way the linker handles the weak symbols there, causing multiple declarations of ist_lc[] and ist_uc[]. It's likely that this issue could also happen on any older or non-ELF linker. This patch addresses this by using literals instead on such platforms, leaving it to the compiler to merge the constants when it can. On other platforms the resulting executable is slightly larger due to strings that could not be merged but this is a minor detail compared to not being able to build at all. If this change alone is confirmed to fix these issues, it's safe to backport to 1.9.	2019-05-15 16:14:04 +02:00
Willy Tarreau	469fa2c9d9	MINOR: debug: add a new BUG_ON macro We do have some code paths testing for impossible errors that tend to be quite confusing, first for maintenance (what to do on such errors, and how far to guess the bug), second for developers as it tends to hide the main purpose and expectations of these call places. Also most of the time impossible errors are ignored by the callers so the tests are not even usable during debugging. Let's instead implement a BUG_ON macro which takes a condition, which if true, will cause a message to be emitted and optionally to crash the process. Additionally, these calls inserted at various places serve as hints and documentation for developers to know that such conditions must absolutely not happen. This is only enabled when DEBUG_STRICT or DEBUG_STRICT_NOCRASH are set. As its name implies, DEBUG_STRICT_NOCRASH only performs the test but does not crash, which can be useful to track some checkpoints. At the moment nothing uses this code.	2019-05-14 17:34:49 +02:00
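A minimal sketch of such a macro is shown below. Only the names BUG_ON, DEBUG_STRICT and DEBUG_STRICT_NOCRASH come from the commit message; the macro body, the helper `BUG_ON_CRASH_DEMO()` and the function `buf_room_demo()` are assumptions made for illustration and do not reproduce the actual HAProxy implementation.

```c
#include <stdio.h>
#include <stdlib.h>

/* Crash only in the strict mode; the NOCRASH mode just reports. */
#if defined(DEBUG_STRICT)
# define BUG_ON_CRASH_DEMO() abort()
#else
# define BUG_ON_CRASH_DEMO() do { } while (0)
#endif

#if defined(DEBUG_STRICT) || defined(DEBUG_STRICT_NOCRASH)
# define BUG_ON(cond)                                                   \
	do {                                                            \
		if (cond) {                                             \
			fprintf(stderr, "BUG: %s:%d: condition \"%s\" matched\n", \
			        __FILE__, __LINE__, #cond);             \
			BUG_ON_CRASH_DEMO();                            \
		}                                                       \
	} while (0)
#else
/* Disabled: the condition is not evaluated at all. */
# define BUG_ON(cond) do { } while (0)
#endif

/* Hypothetical caller: the check documents that "used" can never exceed
 * "size" instead of silently handling an impossible error.
 */
static int buf_room_demo(int size, int used)
{
	BUG_ON(used > size);
	return size - used;
}
```

Building with `-DDEBUG_STRICT_NOCRASH` would only print the message when the condition matches, while `-DDEBUG_STRICT` would additionally abort.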
Willy Tarreau	a5e33a9b66	BUILD: debug: make gcc not complain on the ABORT_NOW() macro On recent gcc versions with the null-deref checks, ABORT_NOW() rightfully emits such a warning. But here it's on purpose. Simply changing the memory address to 1 makes gcc happy.	2019-05-14 17:22:28 +02:00
Willy Tarreau	8bdb5c9bb4	CLEANUP: connection: remove the handle field from the wait_event struct It was only set and not consumed after the previous change. The reason is that the task's context always contains the relevant information, so there is no need for a second pointer.	2019-05-13 19:14:52 +02:00
Willy Tarreau	42ccb5ac45	MINOR: lists: add LIST_ADDED() to check if an element belongs to a list Some code parts use LIST_ISEMPTY() a lot on list elements to detect if they were reset consecutive to their removal from a list, but this test is always confusing as this was initially designed for list heads. Instead let's have a new macro, LIST_ADDED(), which returns true when the element is in a list (i.e. it's not "empty").	2019-05-13 19:14:52 +02:00
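The difference between testing a list head and testing an element can be sketched as follows. This is a simplified circular doubly-linked list written for illustration; the `_demo` helpers are hypothetical and the macro bodies are assumptions rather than copies of the HAProxy definitions.

```c
/* Circular doubly-linked list node: an initialized node points to itself. */
struct list {
	struct list *n;   /* next */
	struct list *p;   /* previous */
};

#define LIST_INIT(l)     ((l)->n = (l)->p = (l))
/* Intended for list heads: true when the head links only to itself. */
#define LIST_ISEMPTY(lh) ((lh)->n == (lh))
/* Intended for elements: true when the element is linked into a list. */
#define LIST_ADDED(el)   ((el)->n != (el))

/* Insert "el" right after "head". */
static void list_add_demo(struct list *head, struct list *el)
{
	el->n = head->n;
	el->p = head;
	head->n->p = el;
	head->n = el;
}

/* Unlink "el" and reset it so that LIST_ADDED() becomes false again. */
static void list_del_init_demo(struct list *el)
{
	el->n->p = el->p;
	el->p->n = el->n;
	LIST_INIT(el);
}
```

Both macros perform the same pointer comparison with opposite polarity; the separate name only makes the intent at the call site explicit.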
Olivier Houchard	478281f55d	BUG/MEDIUM: connections: Don't forget to set xprt_ctx to NULL on close. In conn_xprt_close(), after calling xprt->close(), don't forget to set conn->xprt_ctx to NULL, or we may attempt to reuse the now-free'd conn->xprt_ctx if the connection failed and we're retrying it.	2019-05-13 19:11:38 +02:00
Willy Tarreau	6a38b3297c	BUILD: threads: fix again the __ha_cas_dw() definition This low-level asm implementation of a double CAS was implemented only for certain architectures (x86_64, armv7, armv8). When threads are not used, they were not defined, but since they were called directly from a few locations, they were causing build issues on certain platforms with threads disabled. This was addressed in commit `f4436e1` ("BUILD: threads: Add __ha_cas_dw fallback for single threaded builds") by making it fall back to HA_ATOMIC_CAS() when threads are not defined, but this actually made the situation worse by breaking other cases. This patch fixes this by creating a high-level macro HA_ATOMIC_DWCAS() which is similar to HA_ATOMIC_CAS() except that it's intended to work on a double word, and which relies on the asm implementations when threads are in use, and uses its own open-coded implementation when threads are not used. The 3 call places relying on __ha_cas_dw() were updated to use HA_ATOMIC_DWCAS() instead. This change was tested on i586, x86_64, armv7, armv8 with and without threads with gcc 4.7, armv8 with gcc 5.4 with and without threads, as well as i586 with gcc-3.4 without threads. It will need to be backported to 1.9 along with the fix above to fix build on armv7 with threads disabled.	2019-05-11 18:13:29 +02:00
Willy Tarreau	295d614de1	CLEANUP: ssl: move all BIO_* definitions to openssl-compat The following macros are now defined for openssl < 1.1 so that we can remove the code performing direct access to the structures : BIO_get_data(), BIO_set_data(), BIO_set_init(), BIO_meth_free(), BIO_meth_new(), BIO_meth_set_gets(), BIO_meth_set_puts(), BIO_meth_set_read(), BIO_meth_set_write(), BIO_meth_set_create(), BIO_meth_set_ctrl(), BIO_meth_set_destroy()	2019-05-11 17:39:08 +02:00
Willy Tarreau	11b167167e	CLEANUP: ssl: remove ifdef around SSL_CTX_get_extra_chain_certs() Instead define this one in openssl-compat.h when SSL_CTRL_GET_EXTRA_CHAIN_CERTS is not defined (which was the current condition used in the ifdef).	2019-05-11 17:38:21 +02:00
Willy Tarreau	366a6987a7	CLEANUP: ssl: move the SSL_OP_* and SSL_MODE_* definitions to openssl-compat These ones were defined in the middle of ssl_sock.c, better move them to the include file to find them.	2019-05-11 17:37:44 +02:00
Olivier Houchard	602bf7d2ea	MEDIUM: streams: Add a new http action, disable-l7-retry. Add a new action for http-request, disable-l7-retry, that can be used to disable any attempt at retry requests (see retry-on) if it fails for any reason other than a connection failure. This is useful for example to make sure POST requests aren't retried.	2019-05-10 17:49:09 +02:00
Chris Packham	f4436e145b	BUILD: threads: Add __ha_cas_dw fallback for single threaded builds __ha_cas_dw() is used in fd_rm_from_fd_list() and when built without USE_THREADS=1 the linker fails to find __ha_cas_dw(). Add a definition of __ha_cas_dw() for the #ifndef USE_THREADS case. Signed-off-by: Chris Packham <chris.packham@alliedtelesis.co.nz>	2019-05-10 10:55:31 +02:00
Willy Tarreau	c125cef6da	CLEANUP: ssl: make inclusion of openssl headers safe It's always a pain to have to stuff lots of #ifdef USE_OPENSSL around ssl headers, it even results in some of them appearing in a random order and multiple times just to benefit from an existing ifdef block. Let's make these headers safe for inclusion when USE_OPENSSL is not defined, they now perform the test themselves and do nothing if USE_OPENSSL is not defined. This allows to remove no less than 8 such ifdef blocks and make include blocks more readable.	2019-05-10 09:58:43 +02:00
Willy Tarreau	8d164dc568	CLEANUP: ssl: never include openssl/*.h outside of openssl-compat.h anymore Since we're providing a compatibility layer for multiple OpenSSL implementations and their derivatives, it is important that no C file directly includes openssl headers but only passes via openssl-compat instead. As a bonus this also gets rid of redundant complex rules for inclusion of certain files (engines etc).	2019-05-10 09:36:42 +02:00
Willy Tarreau	9356dacd22	REORG: ssl: move some OpenSSL defines from ssl_sock to openssl-compat Some defines like OPENSSL_VERSION or X509_getm_notBefore() have nothing to do in ssl_sock and must move to openssl-compat.h so that they are consistently shared by the whole code. A warning in the code was added against wild additions of macros there.	2019-05-10 09:31:06 +02:00
Willy Tarreau	5599456ee2	REORG: ssl: move openssl-compat from proto to common This way we can include it much earlier to cover types/ as well.	2019-05-10 09:19:50 +02:00
Willy Tarreau	1d158ab12d	BUILD: ssl: make libressl use its own version numbers LibreSSL causes lots of build issues by pretending to be OpenSSL 2.0.0, and it requires lots of care for each #if added to cover any specific OpenSSL features. This commit addresses the problem by making LibreSSL only advertise the version it forked from (1.0.1g) and by starting to use tests based on its real version to enable features instead of working by exclusion.	2019-05-09 14:25:47 +02:00
Willy Tarreau	9a1ab08160	CLEANUP: ssl-sock: use HA_OPENSSL_VERSION_NUMBER instead of OPENSSL_VERSION_NUMBER Most tests on OPENSSL_VERSION_NUMBER have become complex and break all the time because this number is fake for some derivatives like LibreSSL. This patch creates a new macro, HA_OPENSSL_VERSION_NUMBER, which will carry the real openssl version defining the compatibility level, and this version will be adjusted depending on the variants.	2019-05-09 14:25:43 +02:00
Olivier Houchard	92e38e254b	BUG/MEDIUM: servers: Don't use the same srv flag for cookie-set and TFO. The tfo code was based on an old patch, and the value of the SRV_F_FASTOPEN flag it used was since reused for SRV_F_COOKIESET. So give SRV_F_FASTOPEN its own value.	2019-05-08 19:48:32 +02:00
Christopher Faulet	6f3cb1801b	MINOR: htx: Remove support for unused OOB HTX blocks This type of block was introduced in the early design of the HTX and it is not used anymore. So, just remove it. This patch may be backported to 1.9.	2019-05-07 22:16:41 +02:00
Christopher Faulet	3b1d004d41	BUG/MEDIUM: spoe: Be sure the sample is found before setting its context When a sample fetch is encoded, we use its context to set info about the fragmentation. But if the sample is not found, the function sample_process() returns NULL. So we must be sure the sample exists before setting its context. This patch must be backported to 1.9 and 1.8.	2019-05-07 22:16:41 +02:00
William Lallemand	27edc4b915	MINOR: mworker: support a configurable maximum number of reloads This patch implements a new global parameter for the master-worker mode. When setting the mworker-max-reloads value, a worker receive a SIGTERM if its number of reloads is greater than this value.	2019-05-07 19:09:01 +02:00
Willy Tarreau	f656279347	CLEANUP: task: remove unneeded tests before task_destroy() Since previous commit it's not needed anymore to test a task pointer before calling task_destroy() so let's just remove these tests from the various callers before they become confusing. The function's arguments were also documented. The same should probably be done with tasklet_free() which involves a test in roughly half of the call places.	2019-05-07 19:08:16 +02:00
Dragan Dosen	75bc6d3e59	BUG/MEDIUM: tasks: fix possible segfault on task_destroy() Commit `3f795f7` ("MEDIUM: tasks: Merge task_delete() and task_free() into task_destroy().") replaced task_delete() and task_free() with a single function named task_destroy(). This patch adds a check for struct task* argument in function task_destroy() to prevent a possible segfault on NULL and also to make the function safer for use in other cases.	2019-05-07 18:58:52 +02:00
Dragan Dosen	2674303912	MEDIUM: regex: modify regex_comp() to atomically allocate/free the my_regex struct Now we atomically allocate the my_regex struct within function regex_comp() and compile the regex or free both in case of failure. The pointer to the allocated my_regex struct is returned directly. The my_regex* argument to regex_comp() is removed. Function regex_free() was modified so that it systematically frees the my_regex entry. The function does nothing when called with a NULL as argument (like free()). It will avoid existing risk of not properly freeing the initialized area. Other structures are also updated in order to be compatible (the ones related to Lua and action rules).	2019-05-07 06:58:15 +02:00
Frédéric Lécaille	c02766a267	MINOR: stick-table: Add prefixes to stick-table names. With this patch we add a prefix to stick-table names declared in "peers" sections concatenating the "peers" section name followed by a '/' character with the stick-table name. Consequently, "peers" sections have their own namespace for their stick-tables. Obviously, these stick-table names are not the ones which should be sent over the network. So these configurations must be compatible and should make A and B peers communicate with peers protocol: # haproxy A config, old way stick-table declarations peers mypeers peer A ... peer B ... backend t1 stick-table type string size 10m store gpc0 peers mypeers # haproxy B config, new way stick-table declarations peers mypeers peer A ... peer B ... table t1 type string size store gpc0 10m This "network" name is stored in ->nid new field of stktable struct. The "local" stktable-name is still stored in ->id.	2019-05-07 06:54:07 +02:00
Frédéric Lécaille	015e4d7d93	MINOR: stick-tables: Add peers process binding computing. Add a list of proxies for all the stick-tables (->proxies_list struct stktable member) so as to be able to compute the process bindings of the peers after having parsed the configuration file. The proxies are added to the stick-tables they reference when parsing stick-tables lines in proxy sections, when checking the actions in check_trk_action() and when resolving samples args for stick-tables without checking if they are duplicates. We only check that there is no loop. Then, after having parsed everything, we add the proxy bindings to the peers frontend bindings with stick-tables they reference.	2019-05-07 06:54:07 +02:00
Frédéric Lécaille	1b8e68e89a	MEDIUM: stick-table: Stop handling stick-tables as proxies. This patch adds the support for the "table" line parsing in "peers" sections to declare stick-table in such sections. This also prevents the user from having to declare dummy backends sections with a unique stick-table inside. Even if still supported, this usage will become deprecated. To do so, the ->table member of proxy struct which is a stktable struct is replaced by a pointer to a stktable struct allocated at parsing time in src/cfgparse-listen.c for the dummy stick-table backends and in src/cfgparse.c for "peers" sections. This has an impact on the code for stick-table sample converters and on the stickiness rules parsers which first store the name of the dummy before resolving the rules. This patch replaces proxy_tbl_by_name() calls by stktable_find_by_name() calls to look up stick-tables stored in "stktable_by_name" ebtree at parsing time. There is only one remaining place where proxy_tbl_by_name() is used: src/hlua.c. At several places in the code we relied on the fact that ->size member of stick-table was equal to zero to consider the stick-table was present but not configured, this does not make sense anymore as ->table member of struct proxy is from now on a pointer. These tests are replaced by a test on ->table value itself. In "peers" section we do not have to temporarily store the name of the section the stick-tables are attached to because this name is obviously already known just after having entered this "peers" section. About the CLI stick-table I/O handler, the pointer to proxy struct is replaced by a pointer to a stktable struct.	2019-05-07 06:54:06 +02:00
Frédéric Lécaille	f92da38222	BUILD/MINOR: stick-table: Compilation fix. Missing header to dereference struct peers pointer from struct table.	2019-05-07 06:54:06 +02:00
Frédéric Lécaille	d456aa4ac2	MINOR: config: Extract the code of "stick-table" line parsing. With this patch we move the code responsible for parsing "stick-table" lines to implement parse_stick_table() function in src/stick_table.c so as to be able to parse "stick-table" elsewhere than in proxy sections. We have also added a conf struct to stktable struct to store the filename and the line in the file the stick-table has been parsed from, to help in diagnosing and displaying any configuration issue.	2019-05-07 06:54:06 +02:00
Willy Tarreau	034c88cf03	MEDIUM: tcp: add the "tfo" option to support TCP fastopen on the server This implements support for the new API which relies on a call to setsockopt(). On systems that support it (currently, only Linux >= 4.11), this enables using TCP fast open when connecting to server. Please note that you should use the retry-on "conn-failure", "empty-response" and "response-timeout" keywords, or the request won't be able to be retried on failure. Co-authored-by: Olivier Houchard <ohouchard@haproxy.com>	2019-05-06 22:29:39 +02:00
Olivier Houchard	fdcb007ad8	MEDIUM: proto: Change the prototype of the connect() method. The connect() method had 2 arguments, "data", that tells if there's pending data to be sent, and "delack" that tells if we have to use a delayed ack unconditionally, or if the backend is configured with tcp-smart-connect. Turn that into one argument, "flags". That way it'll be easier to provide more information to connect() without adding extra arguments.	2019-05-06 22:12:57 +02:00
Ilya Shipitsin	54832b97c6	BUILD: enable several LibreSSL hacks, including: SSL_SESSION_get0_id_context is introduced in LibreSSL-2.7.0; async operations are not supported by LibreSSL; early data is not supported by LibreSSL; packet_length is removed from SSL struct in LibreSSL	2019-05-06 07:26:24 +02:00
Olivier Houchard	e3249a98e2	MEDIUM: streams: Add a new keyword for retry-on, "junk-response" Add a way to retry requests if we got a junk response from the server, ie an incomplete response, or something that is not valid HTTP. To do so, one can use the new "junk-response" keyword for retry-on.	2019-05-04 10:20:24 +02:00
Olivier Houchard	865d8392bb	MEDIUM: streams: Add a way to replay failed 0rtt requests. Add a new keyword for retry-on, 0rtt-rejected. If set, we will try to replay requests for which we sent early data that got rejected by the server. If that option is set, we will attempt to use 0rtt if "allow-0rtt" is set on the server line even if the client didn't send early data.	2019-05-04 10:20:24 +02:00
Olivier Houchard	a254a37ad7	MEDIUM: streams: Add the ability to retry a request on L7 failure. When running in HTX mode, if we sent the request, but failed to get the answer, either because the server just closed its socket, we hit a server timeout, or we get a 404, 408, 425, 500, 501, 502, 503 or 504 error, attempt to retry the request, exactly as if we just failed to connect to the server. To do so, add a new backend keyword, "retry-on". It accepts a list of keywords, which can be "none" (never retry), "conn-failure" (we failed to connect, or to do the SSL handshake), "empty-response" (the server closed the connection without answering), "response-timeout" (we timed out while waiting for the server response), or "404", "408", "425", "500", "501", "502", "503" and "504". The default is "conn-failure".	2019-05-04 10:19:56 +02:00
Willy Tarreau	c40efc1919	MINOR: init/threads: make the threads array global Currently the thread array is a local variable inside a function block and there is no access to it from outside, which often complicates debugging. Let's make it global and export it. Also the allocation return is now checked.	2019-05-03 10:16:30 +02:00
Willy Tarreau	81492c989c	MINOR: threads: flatten the per-thread cpu-map When we initially experimented with threads and processes support, we needed to implement arrays of threads per process for cpu-map, but this is not needed anymore since we support either threads or processes. Let's simply make the thread-based cpu-map per thread and not per thread and per process since that's not used anymore. Doing so reduces the global struct from 33kB to 1.5kB.	2019-05-03 09:46:45 +02:00
Olivier Houchard	a48237fd07	BUG/MEDIUM: connections: Make sure we remove CO_FL_SESS_IDLE on disown. When for some reason the session is not the owner of the connection anymore, make sure we remove CO_FL_SESS_IDLE, even if we're about to call conn->mux->destroy(), as the destroy may not destroy the connection immediately if it's still in use. This should be backported to 1.9.	2019-05-02 12:08:39 +02:00
Olivier Houchard	55071d30ca	BUG/MEDIUM: channels: Don't forget to reset output in channel_erase(). In channel_erase(), don't forget to set output to 0, otherwise the channel won't seem empty, when it really is, and that could lead to stream never closing properly. This should be backported to 1.9.	2019-05-02 10:40:59 +02:00
Christopher Faulet	102854cbba	BUG/MEDIUM: listener: Fix how unlimited number of consecutive accepts is handled There is a bug when global.tune.maxaccept is set to -1 (no limit). It is pretty visible with one process (nbproc set to 1). The functions listener_accept() and accept_queue_process() don't expect to handle negative maxaccept values. So instead of accepting incoming connections without any limit, none are ever accepted and HAProxy loops infinitely in the scheduler. When there are 2 or more processes, the bug is a bit more subtle. The limit for a listener is set to 1. So only one connection is accepted at a time by a given listener. This happens because the listener's maxaccept value is an unsigned integer. In check_config_validity(), it is first set to UINT_MAX (-1 cast to an unsigned integer), and then some calculations on it lead to an integer overflow. To fix the bug, the listener's maxaccept value is now a signed integer. So, if a negative value is set for global.tune.maxaccept, we keep it untouched for the listener and no calculation is made on it. Then, in the listener code, this signed value is cast to an unsigned one. It simplifies all tests instead of dealing with negative values. So, it limits the number of connections accepted at a time to UINT_MAX at most. But, honestly, it is not an issue. This patch must be backported to 1.9 and 1.8.	2019-04-30 15:28:29 +02:00
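The wrap-around described above can be reproduced with a small sketch. The rounding-up division below is a hypothetical stand-in for the "some calculations" mentioned in the message, not the actual code of check_config_validity(), and all function names ending in `_demo` are invented for illustration.

```c
#include <limits.h>

/* Problematic variant: the limit is stored as unsigned, so -1 becomes
 * UINT_MAX and the addition wraps around before the division.
 */
static unsigned int listener_limit_unsigned_demo(int global_maxaccept,
                                                 unsigned int nbproc)
{
	unsigned int max = (unsigned int)global_maxaccept;

	return (max + nbproc - 1) / nbproc;   /* wraps when max == UINT_MAX */
}

/* Fixed variant: a negative ("unlimited") value is kept untouched and no
 * calculation is performed on it.
 */
static int listener_limit_signed_demo(int global_maxaccept,
                                      unsigned int nbproc)
{
	if (global_maxaccept < 0)
		return global_maxaccept;
	return (int)(((unsigned int)global_maxaccept + nbproc - 1) / nbproc);
}

/* At the point of use the signed value is cast to unsigned, so that
 * "unlimited" simply becomes UINT_MAX and no negative test is needed.
 */
static unsigned int effective_limit_demo(int listener_maxaccept)
{
	return (unsigned int)listener_maxaccept;
}
```

With this stand-in formula and two processes, the unsigned variant yields 0 instead of a very large limit, which shows how an "unlimited" setting can collapse into a tiny one.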
Olivier Houchard	07425de717	BUG/MEDIUM: port_range: Make the ring buffer lock-free. Port range uses a ring buffer, and unfortunately, when making haproxy multithreaded, it's been overlooked, and the ring buffer is not thread-safe. When specifying a source range, 2 or more threads could pick the same port, and of course only one of them could use the port, the others would always fail the connection. To fix this, make it a lock-free ring buffer. This is easier than usual because we know the ring buffer can never be full. This should be backported to 1.8 and 1.9.	2019-04-30 15:10:17 +02:00
Olivier Houchard	9ce62b5498	MINOR: threads: Implement HA_ATOMIC_LOAD(). The same way we have HA_ATOMIC_STORE(), implement HA_ATOMIC_LOAD(). This should be backported to 1.8 and 1.9, as we need it for a bug fix in port ranges.	2019-04-30 15:10:08 +02:00
Willy Tarreau	bc13bec548	MINOR: activity: report context switch counts instead of rates It's not logical to report context switch rates per thread in show activity because everything else is a counter and it's not even possible to compare values. Let's only report counts. Further, this simplifies the scheduler's code.	2019-04-30 14:55:18 +02:00
Willy Tarreau	9634e86dc7	CLEANUP: task: move the task_per_thread definition to task.h It's the second time I look for it and can't find it because it's not in the right file.	2019-04-30 14:36:47 +02:00
Frédéric Lécaille	d803e475e5	MINOR: log: Enable the log sampling and load-balancing feature. This patch implements the sampling and load-balancing of log servers configured with "sample" new keyword implemented by this commit: 'MINOR: log: Add "sample" new keyword to "log" lines'. As the list of ranges used to sample the log to balance is ordered, we only have to maintain ->curr_idx member of smp_info struct which is the index of the sample and check if it belongs or not to the current range to decide if we must send it to the log server or not.	2019-04-30 09:25:09 +02:00
Frédéric Lécaille	d95ea2897e	MINOR: log: Add "sample" new keyword to "log" lines. This patch implements the parsing of "sample" new optional keyword for "log" lines to be able to sample and balance the load of log messages between several log destinations declared by "log" lines. This keyword must be followed by a list of comma-separated ranges of indexes numbered from 1 to define the samples to be used to balance the load of logs to send. This "sample" keyword must be used on "log" lines obviously before the remaining optional ones without keyword. The list of ranges must be followed by a colon character to separate it from the log sampling size. With such following configuration declarations: log stderr local0 log 127.0.0.1:10001 sample 2-3,8-11:11 local0 log 127.0.0.2:10002 sample 5:5 local0 in addition to being sent to stderr, about the second "log" line, every 11 logs the logs #2 up to #3 would be sent to 127.0.0.1:10001, then #8 up to #11 four logs would be sent to the same log server and so on periodically. Logs would be sent to 127.0.0.2:10002 every 5 logs. It is also possible to define the size of the sample with a value different from the maximum of the high limits of the ranges, for instance as follows: log 127.0.0.1:10001 sample 2-3,8-11:15 local0 as before the two logs #2 and #3 would be sent to 127.0.0.1:10001, then #8 up to #11 logs, but in this case here, this would be done periodically every 15 messages. Also note that the ranges must not overlap each other. This is to ease the way the logs are periodically sent.	2019-04-30 09:25:09 +02:00
Christopher Faulet	85db3212b8	MINOR: spoe: Use the sample context to pass frag_ctx info during encoding This simplifies the API and hide the details in the sample. This way, only string and binary are aware of these info, because other types cannot be partially encoded. This patch may be backported to 1.9 and 1.8.	2019-04-29 16:02:05 +02:00
Kevin Zhu	f7f54280c8	BUG/MEDIUM: spoe: arg len encoded in previous frag frame but len changed A fragmented argument is fetched again each time it is encoded, and each fetch may return a different result if SMP_F_MAY_CHANGE is set (for example res.payload). But the length was already encoded in the first fragment of the frame, so the SPOA fails to decode it and resources are wasted. This patch must be backported to 1.9 and 1.8.	2019-04-29 16:02:05 +02:00
Willy Tarreau	71c07ac65a	MINOR: stream/debug: make a stream dump and crash function During 1.9 development (and even a bit after) we've started to face a significant number of situations where streams were abusively spinning due to an uncaught error flag or complex conditions that couldn't be correctly identified. Sometimes streams wake appctx up and conversely as well. More importantly when this happens the only fix is to restart. This patch adds a new function to report a serious error, some relevant info and to crash the process using abort() so that a core dump is available. The purpose will be for this function to be called in various situations where the process is unfixable. It will help detect these issues much earlier during development and may even help fixing test platforms which are able to automatically restart when such a condition happens, though this is not the primary purpose. This patch only provides the function and doesn't use it yet.	2019-04-26 13:15:56 +02:00
Willy Tarreau	5e6a5b3a6e	MINOR: connection: make the debugging helper functions safer We have various functions like conn_get_ctrl_name() to retrieve some information reported in "show sess" for debugging, which assume that the connection is valid. This is really not convenient in code aimed at debugging and is error-prone. Let's add a validity test first.	2019-04-25 18:35:49 +02:00
Willy Tarreau	d5ec4bfe85	CLEANUP: standard: use proper const to addr_to_str() and port_to_str() The input parameter was not marked const, making it painful for some calls.	2019-04-25 17:48:16 +02:00
Willy Tarreau	d2d3348acb	MINOR: activity: enable automatic profiling turn on/off Instead of having to manually turn task profiling on/off in the configuration, by default it will work in "auto" mode, which automatically turns on on any thread experiencing sustained loop latencies over one millisecond averaged over the last 1024 samples. This may happen with configs using lots of regex (think map_reg for example, which is the lazy way to convert Apache's rewrite rules but must not be abused), and such high latencies affect all the process and the problem is most often intermittent (e.g. hitting a map which is only used for certain host names). Thus now by default, with profiling set to "auto", it remains off all the time until something bad happens. This also helps better focus on the issues when looking at the logs as well as in "show sess" output. It automatically turns off when the average loop latency over the last 1024 calls goes below 990 microseconds (which typically takes a while when in idle). This patch could be backported to stable versions after a bit more exposure, as it definitely improves observability and the ability to quickly spot the culprit. In this case, previous patch ("MINOR: activity: make the profiling status per thread and not global") must also be taken.	2019-04-25 17:26:46 +02:00
Willy Tarreau	d9add3acc8	MINOR: activity: make the profiling status per thread and not global In order to later support automatic profiling turn on/off, we need to have it per-thread. We're keeping the global option to know whether to turn it on or off, but the profiling status is now set per thread. We're updating the status in activity_count_runtime() which is called before entering poll(). The reason is that we'll extend this with run time measurement when deciding to automatically turn it on or off.	2019-04-25 17:26:19 +02:00
Willy Tarreau	22d63a24d9	MINOR: applet: measure and report an appctx's call rate in "show sess" Very similarly to previous commit doing the same for streams, we now measure and report an appctx's call rate. This will help catch applets which do not consume all their data and/or which do not properly report that they're waiting for something else. Some of them like peers might theoretically be able to exhibit some occasional peaks when teaching a full table to a nearby peer (e.g. the new replacement process), but nothing close to what a bogus service can do so there is no risk of confusion.	2019-04-24 16:04:23 +02:00
Willy Tarreau	2e9c1d2960	MINOR: stream: measure and report a stream's call rate in "show sess" Quite a few times some bugs have made a stream task incorrectly handle a complex combination of events, which was often reported as "100% CPU", and was usually caused by the event not being properly identified and flushed, and the stream's handler called in loops. This patch adds a call rate counter to the stream struct. It's not huge, it's really inexpensive (especially compared to the rest of the processing function) and will easily help spot such tasks in "show sess" output, possibly even allowing to kill them. A future patch should probably consist in alerting when they're above a certain threshold, possibly sending a dump and killing them. Some options could also consist in aborting in order to get an analyzable core dump and let a service manager restart a fresh new process.	2019-04-24 16:04:23 +02:00
Willy Tarreau	0212fadd65	MINOR: tasks/activity: report the context switch and task wakeup rates It's particularly useful to spot runaway tasks to see this. The context switch rate covers all tasklet calls (tasks and I/O handlers) while the task wakeups only covers tasks picked from the run queue to be executed. High values there will indicate either intense traffic or a bug that makes a task go wild.	2019-04-24 16:04:23 +02:00
Christopher Faulet	c1918d1a8f	BUG/MAJOR: muxes: Use the HTX mode to find the best mux for HTTP proxies only Since the commit `1d2b586cd` ("MAJOR: htx: Enable the HTX mode by default for all proxies"), the HTX is enabled by default for all proxies, HTTP and TCP, but also CLI and HEALTH proxies. But when the best mux is retrieved, only HTTP and TCP modes are checked. If the TCP mode is not explicitly set, it is considered as an HTTP proxy. It is a hidden bug introduced when the option "http-use-htx" was added. It had no effect until the commit `1d2b586cd`. But now, when a stats socket is created for the master process, the mux h1 is installed on all incoming connections to the CLI proxy, leading to segfaults because HTX operations are performed on raw buffers. So to fix the bug, when a mux is installed, all proxies are considered as TCP proxies, except HTTP ones. This way, CLI and HEALTH proxies will be handled as TCP proxies. This patch must be backported to 1.9 although it has no effect. It is safer to not keep hidden bugs.	2019-04-24 15:40:02 +02:00
Baptiste Assmann	333939c2ee	MINOR: action: new '(http-request|tcp-request content) do-resolve' action The 'do-resolve' action is an http-request or tcp-request content action which allows running DNS resolution at run time in HAProxy. The name to be resolved can be picked up in the request sent by the client and the result of the resolution is stored in a variable. While the resolution is being performed, the request is paused. If the resolution can't provide a suitable result, then the variable will be empty. It's up to the admin to take decisions based on this statement (return 503 to prevent loops). Read carefully the documentation concerning this feature, to ensure your setup is secure and safe to be used in production. This patch creates a global counter to track various errors reported by the action 'do-resolve'.	2019-04-23 11:41:52 +02:00
Baptiste Assmann	0b9ce82dfa	MINOR: obj_type: new object type for struct stream This patch creates a new obj_type for the struct stream in HAProxy.	2019-04-23 11:35:56 +02:00
Baptiste Assmann	dfd35fd71a	MINOR: dns: dns_requester structures are now in a memory pool dns_requester structure can be allocated at run time when servers get associated to DNS resolution (this happens when SRV records are used in conjunction with service discovery). Well, this memory allocation is safer if managed in an HAProxy pool, furthermore with upcoming HTTP action which can perform DNS resolution at runtime. This patch moves the memory management of the dns_requester structure into its own pool.	2019-04-23 11:33:48 +02:00
Emeric Brun	d0e095c2aa	MINOR: ssl/cli: async fd io-handlers printable on show fd This patch exports the async fd iohandlers and make them printable doing a 'show fd' on cli.	2019-04-19 17:27:01 +02:00
Christopher Faulet	22c57bef56	BUG/MEDIUM: h1: Don't parse chunks CRLF if not enough data are available As specified in the function comment, the function h1_skip_chunk_crlf() must not change anything and return zero if not enough data are available. This must include the case where there is no data at all. On this point, it must do the same as the other h1 parsing functions. This bug is made visible since the commit `91f77d599` ("BUG/MINOR: mux-h1: Process input even if the input buffer is empty"). This patch must be backported to 1.9.	2019-04-19 15:53:23 +02:00
Olivier Houchard	88698d966d	MEDIUM: connections: Add a way to control the number of idling connections. As by default we add all keepalive connections to the idle pool, if we run into a pathological case, where all clients don't do keepalive, but the server does, and haproxy is configured to only reuse "safe" connections, we will soon find ourselves having lots of idling connections, unusable for new sessions, while we won't have any file descriptors available to create new connections. To fix this, add 2 new global settings, "pool-low-fd-ratio" and "pool-high-fd-ratio". pool-low-fd-ratio is the % of fds we're allowed to use (against the maximum number of fds available to haproxy) before we stop adding connections to the idle pool, and destroy them instead. The default is 20. pool-high-fd-ratio is the % of fds we're allowed to use (against the maximum number of fds available to haproxy) before we start killing idling connections in the event we have to create a new outgoing connection, and no reuse is possible. The default is 25.	2019-04-18 19:52:03 +02:00
Olivier Houchard	7c49d2e213	MINOR: fd: Add a counter of used fds. Add a new counter, ha_used_fds, that lets us know how many file descriptors we're currently using.	2019-04-18 19:19:59 +02:00
Olivier Houchard	e179d0e88f	MEDIUM: connections: Provide a xprt_ctx for each xprt method. For most of the xprt methods, provide a xprt_ctx. This will be useful later when we'll want to be able to stack xprts. The init() method now has to create and provide the said xprt_ctx if needed.	2019-04-18 14:56:24 +02:00
Olivier Houchard	7b5fd1ec26	MEDIUM: connections: Move some fields from struct connection to ssl_sock_ctx. Move xprt_st, tmp_early_data and sent_early_data from struct connection to struct ssl_sock_ctx, as they are only used in the SSL code.	2019-04-18 14:56:24 +02:00
Olivier Houchard	3f795f76e8	MEDIUM: tasks: Merge task_delete() and task_free() into task_destroy(). task_delete() was never used without calling task_free() just after, and task_free() was only used on error paths to destroy a just-created task, so merge them into task_destroy(), that will remove the task from the wait queue, and make sure the task is either destroyed immediately if it's not in the run queue, or destroyed when it's supposed to run.	2019-04-18 10:10:04 +02:00
Willy Tarreau	8c12e2f785	MINOR: task/thread: factor out a wake-up condition The wakeup condition in task_wakeup() is redundant as it is already validated by the CAS. Better move the __task_wakeup() call there, it also has the merit of being easier to audit this way. This also reduces the code size by around 1.8 kB : $ size haproxy-? text data bss dec hex filename 2153806 100208 1307676 3561690 3658da haproxy-1 2152094 100208 1307676 3559978 36522a haproxy-2	2019-04-17 22:15:58 +02:00
Willy Tarreau	a70bfaaf8b	BUG/MAJOR: task: make sure never to delete a queued task Commit `0c7a4b6` ("MINOR: tasks: Don't set the TASK_RUNNING flag when adding in the tasklet list.") revealed a hole in the way tasks may be freed : they could be removed while in the run queue when the TASK_QUEUED flag was present but not the TASK_RUNNING one. But it seems the issue was emphasized by commit `cde7902` ("MEDIUM: tasks: improve fairness between the local and global queues") though the code it replaces was already affected given how late the TASK_RUNNING flag was set after removal from the global queue. At the moment the task is picked from the global run queue, if it is the last one, the global run queue lock is dropped, and then the TASK_RUNNING flag was added. In the meantime another thread might have performed a task_free(), and immediately after, the TASK_RUNNING flag was re-added to the task, which was then added to the tasklet list. The unprotected window was extremely faint but does definitely exist and inconsistent task lists have been observed a few times during very intensive tests over the last few days. From this point various options are possible, the task might have been re-allocated while running, and assigned state 0 and/or state QUEUED while it was still running, resulting in the task not being put back into the tree. This commit simply makes sure that tests on TASK_RUNNING before removing the task also cover TASK_QUEUED. It must be backported to 1.9 along with the previous ones touching that area.	2019-04-17 22:15:58 +02:00
Olivier Houchard	4a1be0c6d6	MEDIUM: tasks: No longer use rq.node.leaf_p as a lock. Now that we have the warranty that a task won't be added in the runqueue while the TASK_QUEUED or the TASK_RUNNING flag is set, don't bother trying to lock the task by setting leaf_p to 0x1 while inserting it in the runqueue or having it in the tasklet_list, as nobody else will attempt to add it.	2019-04-17 19:28:01 +02:00
Olivier Houchard	5c964f7b42	MINOR: tasks: Don't consider we can wake task with tasklet_wakeup(). In tasklet_wakeup(), don't bother checking if the tasklet is really a task, calling tasklet_wakeup() with a task is invalid.	2019-04-17 19:28:01 +02:00
Willy Tarreau	b038007ae8	BUG/MEDIUM: tasks: Make sure we set TASK_QUEUED before adding a task to the rq. Make sure we set TASK_QUEUED in every case before adding the task to the run queue. task_wakeup() now checks if either TASK_QUEUED or TASK_RUNNING is set, and if neither is set, add TASK_QUEUED and effectively add the task to the runqueue. No longer use __task_wakeup() anywhere except in task_wakeup(), always use task_wakeup() instead. With the old code, process_runnable_task() may re-add a task in the runqueue without setting the TASK_QUEUED flag, and there were race conditions that could lead to a task having the TASK_QUEUED flag but not in the runqueue, thus being unschedulable. This should be backported to 1.9.	2019-04-17 19:28:01 +02:00
Christopher Faulet	5ec8bcb021	BUG/MINOR: http_fetch/htx: Allow permissive sample prefetch for the HTX As for smp_prefetch_http(), there is now a way to successfully perform a prefetch in HTX, even if the message forwarding already begun. It is used for the sample fetches "req.proto_http" and "method". This patch must be backported to 1.9.	2019-04-17 15:12:27 +02:00
Christopher Faulet	89dc499359	BUG/MAJOR: http_fetch: Get the channel depending on the keyword used All HTTP samples are buggy because the channel tested in the prefetch functions (HTX and legacy HTTP) is chosen depending on the sample direction and not the keyword really used. It means the request channel is used if the sample is called during the request analysis and the response channel is used if it is called during the response analysis, regardless of the sample really called. For instance, if you use the sample "req.ver" in an http-response rule, the response channel will be prefetched because it is called during the response analysis, while the request channel should have been used instead. So some assumptions on the validity of the sample may be made on the wrong channel. It is the first bug. Then the same error is made in some samples themselves. So fetches are performed on the wrong channel. For instance, the header extraction (req.fhdr, res.fhdr, req.hdr, res.hdr...). If the sample "req.hdr" is used in an http-response rule, then the matching is done on the response headers and not the request ones. It is the second bug. Finally, last but not least, in some samples, the right channel is used. But because the prefetch was done on the wrong one, this channel may be in an undefined state. For instance, using the sample "req.ver" in an http-response rule leads to a matching on a possibly released buffer. To fix all these bugs, the right channel is now chosen in sample fetches, before the prefetch. If the same function is used to fetch request and response elements, then the keyword is used to choose the right one. This channel is then used by the functions smp_prefetch_htx() and smp_prefetch_http(). Of course, it is also used by the samples themselves to extract information. This patch must be backported to all supported versions. For version 1.8 and prior, it must be totally refactored. First because there is no HTX in these versions. Then the buffers API has changed in HAProxy 1.9. The files http_fetch.{ch} don't exist on old versions.	2019-04-17 15:12:27 +02:00
Christopher Faulet	3a4d1bea61	BUG/MEDIUM: htx: Don't return the start-line if the HTX message is empty In the function htx_get_stline(), NULL must be returned if the HTX message doesn't contain any element. This patch must be backported to 1.9.	2019-04-17 15:12:27 +02:00
Willy Tarreau	636848aa86	MINOR: init: add a "set-dumpable" global directive to enable core dumps It's always a pain to get a core dump when enabling user/group setting (which disables the dumpable flag on Linux), when using a chroot and/or when haproxy is started by a service management tool which requires complex operations to just raise the core dump limit. This patch introduces a new "set-dumpable" global directive to work around these troubles by doing the following : - remove file size limits (equivalent of ulimit -f unlimited) - remove core size limits (equivalent of ulimit -c unlimited) - mark the process dumpable again (equivalent of suid_dumpable=1) Some of these will depend on the operating system. This way it becomes much easier to retrieve a core file. Temporarily moving the chroot to a user-writable place is generally enough.	2019-04-16 14:31:23 +02:00
William Lallemand	8f7069a389	CLEANUP: mworker: remove the type field in mworker_proc Since the introduction of the options field, we can use it to store the type of process. type = 'm' is replaced by PROC_O_TYPE_MASTER type = 'w' is replaced by PROC_O_TYPE_WORKER type = 'e' is replaced by PROC_O_TYPE_PROG The old values are still used in the HAPROXY_PROCESSES environment variable to pass the information during a reload.	2019-04-16 13:26:43 +02:00
William Lallemand	bd3de3efb7	MEDIUM: mworker-prog: implements 'option start-on-reload' This option is already the default, but its opposite 'no option start-on-reload' allows the master to keep a previous instance of a program and don't start a new one upon a reload. The old program will then appear as a current one in "show proc" and could also trigger an exit-on-failure upon a segfault.	2019-04-16 13:26:43 +02:00
William Lallemand	4528611ed6	MEDIUM: mworker: store the leaving state of a process Previously we were assuming that a process was in a leaving state when its number of reloads was greater than 0. With mworker programs it's not the case anymore so we need to store a leaving state.	2019-04-16 13:26:43 +02:00
Willy Tarreau	9df86f997e	BUG/MAJOR: lb/threads: fix insufficient locking on round-robin LB Maksim Kupriianov reported very strange crashes in fwrr_update_position() which didn't make sense because of an apparent divide overflow except that the value was not null in the core. It happens that while the locking is correct in all the functions' call graph, the uppermost one (fwrr_get_next_server()) incorrectly expected that its target server was already locked when called. This stupid assumption caused the server lock not to be held when calling the other ones, explaining how it was possible to change the server's eweight by calling srv_lb_commit_status() under the server lock yet collide with its unprotected usage. This commit makes sure that fwrr_get_server_from_group() retrieves a locked server and that fwrr_get_next_server() is responsible for unlocking the server before returning it. There is one subtlety in this function which is that it builds a list of avoided servers that were full while scanning the tree, and all of them are queued in a full state so they must be unlocked upon return. Many thanks to Maksim for providing detailed info allowing to narrow down this bug. This fix must be backported to 1.9. In 1.8 the lock seems much wider and changes to the server's state are performed under the rendez-vous point so it doesn't seem possible that it happens there.	2019-04-16 11:21:14 +02:00
Frédéric Lécaille	95679dc096	MINOR: peers: Add a new command to the CLI for peers. Implements "show peers [peers section]" new CLI command to dump information about the peers and their stick-tables to be synchronized and others internal. May be backported as far as 1.5.	2019-04-16 09:58:40 +02:00
Willy Tarreau	8de1df92a3	BUILD: do not specify "const" on functions returning structs or scalars Older compilers (like gcc-3.4) warn about the use of "const" on functions returning a struct, which makes sense since the return may only be copied : include/common/htx.h:233: warning: type qualifiers ignored on function return type Let's simply drop "const" here.	2019-04-15 21:55:48 +02:00
Willy Tarreau	0e492e2ad0	BUILD: address a few cases of "static <type> inline foo()" Older compilers don't like to see "inline" placed after the type in a function declaration, it must be "static inline <type>" only. This patch touches various areas. The warnings were seen with gcc-3.4.	2019-04-15 21:55:48 +02:00
Olivier Houchard	3212a2c438	BUG/MEDIUM: Threads: Only use the gcc >= 4.7 builtins when using gcc >= 4.7. Move the definition of the various _HA_ATOMIC_* macros that use __atomic_* in the #if GCC_VERSION >= 4.7, not just after it, so that we can build with older versions of gcc again.	2019-04-15 21:16:24 +02:00
Olivier Houchard	e5eef1f1b4	MINOR: connections: Remove the SUB_CALL_UNSUBSCRIBE flag. Garbage collect SUB_CALL_UNSUBSCRIBE, as it's now unused.	2019-04-15 19:27:57 +02:00
Nenad Merdanovic	8ef706502a	BUG/MINOR: ssl: Fix 48 byte TLS ticket key rotation Whenever HAProxy was reloaded with rotated keys, the resumption would be broken for previous encryption key. The bug was introduced with the addition of 80 byte keys in 9e7547 (MINOR: ssl: add support of aes256 bits ticket keys on file and cli.). This fix needs to be backported to 1.9.	2019-04-15 10:09:54 +02:00
Willy Tarreau	24f382f555	CLEANUP: task: do not export rq_next anymore This one hasn't been used anymore since the scheduler changes after 1.8 but it kept being exported and maintained up to date while it's always reset when scanning the trees. Let's stop exporting it and updating it.	2019-04-15 09:50:56 +02:00
Christopher Faulet	0ef372a390	MAJOR: muxes/htx: Handle implicit upgrades from h1 to h2 The upgrade is performed when an H2 preface is detected when the first request on a connection is parsed. The CS is destroyed by setting EOS flag on it. A special flag is added on the HTX message to warn the HTX analyzers the stream will be closed because of an upgrade. This way, no error and no log are emitted. When the mux h1 is released, we create a mux h2, without any CS and passing the buffer with the unparsed H2 preface.	2019-04-12 22:06:53 +02:00
Christopher Faulet	c0016d8119	MEDIUM: connection: Add conn_upgrade_mux_fe() to handle mux upgrades This function will handle mux upgrades, for frontend connections only. It will retrieve the best mux in the same way as conn_install_mux_fe except that the mode and optionally the proto are forced. The new multiplexer is initialized using a new context and a specific input buffer. Then, the old one is destroyed. If an error occurred, everything is rolled back.	2019-04-12 22:06:53 +02:00
Christopher Faulet	73c1207c71	MINOR: muxes: Pass the context of the mux to destroy() instead of the connection It is mandatory to handle mux upgrades, because during a mux upgrade, the connection will be reassigned to another multiplexer. So when the old one is destroyed, it does not own the connection anymore. Or in other words, conn->ctx does not point to the old mux's context when its destroy() callback is called. So we now rely on the multiplexer context to destroy it instead of the connection. In addition, h1_release() and h2_release() have also been updated in the same way.	2019-04-12 22:06:53 +02:00
Christopher Faulet	51f73eb11a	MEDIUM: muxes: Add an optional input buffer during mux initialization The mux's callback init() now takes a pointer to a buffer as extra argument. It must be used by the multiplexer as its input buffer. This buffer is always NULL when a multiplexer is initialized with a fresh connection. But if a mux upgrade is performed, it may be filled with existing data. Note that, for now, mux upgrades are not supported. But this commit is mandatory to do so.	2019-04-12 22:06:53 +02:00
Christopher Faulet	209829f159	MINOR: http: update the macro IS_HTX_STRM() to check the stream flag SF_HTX Instead of matching on the frontend options, we now check if the flag SF_HTX is set or not on the stream to know if it is an HTX stream or not.	2019-04-12 22:06:53 +02:00
Christopher Faulet	0e160ff5bb	MINOR: stream: Set a flag when the stream uses the HTX The flag SF_HTX has been added to know when a stream uses the HTX or not. It is set when an HTX stream is created. There are 2 conditions to set it. The first one is when the HTTP frontend enables the HTX. The second one is when the attached conn_stream uses an HTX multiplexer.	2019-04-12 22:06:53 +02:00
Christopher Faulet	9f38f5aa80	MINOR: muxes: Add a flag to specify a multiplexer uses the HTX A multiplexer must now set the flag MX_FL_HTX when it uses the HTX to structure the data exchanged with channels. The muxes h1 and h2 set this flag. Of course, for the mux h2, it is set on h2_htx_ops only.	2019-04-12 22:06:53 +02:00
Christopher Faulet	a51ebb7f56	MEDIUM: h1: Add an option to sanitize connection headers during parsing The flag H1_MF_CLEAN_CONN_HDR has been added to let the H1 parser sanitize connection headers. It means it will remove all "close" and "keep-alive" values during the parsing. One noticeable effect is that connection headers may be unfolded. In practice, this is not a problem because it is not frequent to have multiple values for the connection headers. If this flag is set, during the parsing, the function h1_parse_next_connection_header() is called in a loop instead of h1_parse_connection_header(). No need to backport this patch.	2019-04-12 22:06:53 +02:00
Christopher Faulet	03b9d8ba4a	MINOR: proto_htx: Don't adjust transaction mode anymore in HTX analyzers Because the option http-tunnel is now ignored in HTX, there is no longer any need to adjust the transaction mode in HTX analyzers. A channel can still be switched to the tunnel mode for legitimate cases (HTTP CONNECT or switching protocols). So the function htx_adjust_conn_mode() is now useless. This patch must be backported to 1.9. It is not strictly speaking required but it will ease future backports.	2019-04-12 22:06:53 +02:00
Willy Tarreau	64a9c05f37	MINOR: cli/listener: report the number of accepts on "show activity" The "show activity" command reports the number of incoming connections dispatched per thread but doesn't report the number of connections received by each thread. It is important to be able to monitor this value as it can show that for whatever reason a smaller set of threads is receiving the connections and dispatching them to all other ones.	2019-04-12 15:54:15 +02:00
Olivier Houchard	526dc95eb9	MINOR: initcall: Don't forget to define the __start/stop_init_##stg symbols. When creating a new initcall, don't forget to define the symbols, as it may not be done automatically and that would lead to undefined symbols. This should be backported to 1.9.	2019-04-10 16:33:25 +02:00
Christopher Faulet	f192d683a7	BUG/MINOR: htx: Preserve empty HTX messages with an unprocessed parsing error This gives HTX analyzers a chance to handle the error and send the appropriate response to the client. This patch must be backported to 1.9.	2019-04-01 15:43:40 +02:00
William Lallemand	9a1ee7ac31	MEDIUM: mworker-prog: implement program for master-worker This patch implements the external binary support in the master worker. To configure an external process, you need to use the program section, for example: program dataplane-api command ./dataplane_api Those processes are launched at the same time as the workers. During a reload of HAProxy, those processes are dealing with the same sequence as a worker: - the master is re-executed - the master sends a USR1 signal to the program - the master launches a new instance of the program During a stop, or restart, a SIGTERM is sent to the program.	2019-04-01 14:45:37 +02:00
William Lallemand	7175e6861e	MINOR: cli: export cli_parse_default() definition in cli.h Export the cli_parse_default() function in cli.h so it could be used in other files.	2019-04-01 14:45:37 +02:00
William Lallemand	3f12887ffa	MINOR: mworker: don't use children variable anymore The children variable is still used in haproxy, but it is not required anymore since we have the information about the current workers in the mworker_proc linked list. The oldpids array is also replaced by this linked list when we generate the arguments for the master reexec.	2019-04-01 14:45:37 +02:00
William Lallemand	9001ce8c2f	REORG: mworker: move mworker_cleanlisteners to mworker.c	2019-04-01 14:45:37 +02:00
William Lallemand	e25473c846	REORG: mworker: move signal handlers and related functions Move the following functions to mworker.c: void mworker_catch_sighup(struct sig_handler *sh); void mworker_catch_sigterm(struct sig_handler *sh); void mworker_catch_sigchld(struct sig_handler *sh); static void mworker_kill(int sig); int current_child(int pid);	2019-04-01 14:45:37 +02:00
William Lallemand	3fa724db87	REORG: mworker: move IPC functions to mworker.c Move the following functions to mworker.c: void mworker_accept_wrapper(int fd); void mworker_pipe_register();	2019-04-01 14:45:37 +02:00
William Lallemand	3cd95d2f1b	REORG: mworker: move signals functions to mworker.c Move the following functions to mworker.c: void mworker_block_signals(); void mworker_unblock_signals();	2019-04-01 14:45:37 +02:00
William Lallemand	48dfbbdea9	REORG: mworker: move serializing functions to mworker.c Move the 2 following functions to mworker.c: void mworker_proc_list_to_env() void mworker_env_to_proc_list()	2019-04-01 14:45:37 +02:00
Willy Tarreau	a1bd1faeeb	BUILD: use inttypes.h instead of stdint.h I found on an (old) AIX 5.1 machine that stdint.h didn't exist while inttypes.h which is expected to include it does exist and provides the desired functionalities. As explained here, stdint being just a subset of inttypes for use in freestanding environments, it's probably always OK to switch to inttypes instead: https://pubs.opengroup.org/onlinepubs/009696799/basedefs/stdint.h.html Also it's even clearer here in the autoconf doc : https://www.gnu.org/software/autoconf/manual/autoconf-2.61/html_node/Header-Portability.html "The C99 standard says that inttypes.h includes stdint.h, so there's no need to include stdint.h separately in a standard environment. Some implementations have inttypes.h but not stdint.h (e.g., Solaris 7), but we don't know of any implementation that has stdint.h but not inttypes.h"	2019-04-01 07:44:56 +02:00
Willy Tarreau	7b5654f54a	BUILD: re-implement an initcall variant without using executable sections The current initcall implementation relies on dedicated sections (one section per init stage) to store the initcall descriptors. Then upon startup, these sections are scanned from beginning to end and all items found there are called in sequence. On platforms like AIX or Cygwin it seems difficult to figure the beginning and end of sections as the linker doesn't seem to provide the corresponding symbols. In order to replace this, this patch simply implements an array of singly linked lists (one per init stage) which are fed using constructors for each register call. These constructors are declared static, with a name depending on their line number in the file, in order to avoid name clashes. The final effect is the same, except that the method is slightly more expensive in that it explicitly produces code to register these initcalls : $ size haproxy.sections haproxy.constructor text data bss dec hex filename 4060312 249176 1457652 5767140 57ffe4 haproxy.sections 4062862 260408 1457652 5780922 5835ba haproxy.constructor This mechanism is enabled as an alternative to the default one when build option USE_OBSOLETE_LINKER is set. This option is currently enabled by default only on AIX and Cygwin, and may be attempted for any target which fails to build complaining about missing symbols __start_init_* and/or __stop_init_*. Once confirmed as a reliable fix, this will likely have to be backported to 1.9 where AIX and Cygwin do not build anymore.	2019-04-01 07:43:07 +02:00
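The constructor-based registration described in commit 7b5654f54a can be sketched roughly as follows. This is a minimal illustration, not the code from the patch: the stage names, `INITCALL_SKETCH`, `register_initcall()` and `run_initcalls()` are hypothetical, and `__attribute__((constructor))` assumes a GCC- or Clang-compatible compiler.

```c
#include <stddef.h>

/* Illustrative stages only; the real list of stages is not given in the log. */
enum init_stage { STG_PREPARE = 0, STG_INIT, STG_SIZE };

struct initcall {
    void (*fct)(void);        /* function to call at this stage */
    struct initcall *next;    /* next descriptor of the same stage */
};

/* One singly linked list head per init stage (zeroed at load time). */
static struct initcall *init_list[STG_SIZE];

/* Push a descriptor at the head of its stage's list. */
static void register_initcall(enum init_stage stg, struct initcall *ic)
{
    ic->next = init_list[stg];
    init_list[stg] = ic;
}

#define IC_CAT2(a, b) a##b
#define IC_CAT(a, b)  IC_CAT2(a, b)

/* Declares a static descriptor and a static constructor whose names depend
 * on the line number, so several uses in one file do not clash.
 * Hypothetical macro: use it at file scope without a trailing semicolon. */
#define INITCALL_SKETCH(stg, function)                                      \
    static struct initcall IC_CAT(ic_desc_, __LINE__) = { function, NULL }; \
    __attribute__((constructor))                                            \
    static void IC_CAT(ic_reg_, __LINE__)(void)                             \
    {                                                                       \
        register_initcall(stg, &IC_CAT(ic_desc_, __LINE__));                \
    }

/* Call every function registered for one stage. */
static void run_initcalls(enum init_stage stg)
{
    struct initcall *ic;

    for (ic = init_list[stg]; ic; ic = ic->next)
        ic->fct();
}

/* Example registration: demo_init() is queued before main() starts. */
static int demo_calls;
static void demo_init(void) { demo_calls++; }
INITCALL_SKETCH(STG_INIT, demo_init)
```

Compared with linker sections, this approach emits one small registration function per initcall, which is consistent with the slightly larger text and data sizes reported in the commit message.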
Willy Tarreau	9d22e56178	MINOR: tools: add an unsetenv() implementation Older Solaris and AIX versions do not have unsetenv(). This adds a fairly simple implementation which scans the environment, for use with those systems. It will simply require to pass the define in the "DEFINE" macro at build time like this : DEFINE="-Dunsetenv=my_unsetenv"	2019-03-29 21:05:37 +01:00
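A replacement of the kind described in commit 9d22e56178 could look like the sketch below. It is an assumption about the general technique (scanning `environ` and compacting it in place), not the function added by the patch; `unsetenv_sketch()` is a hypothetical name.

```c
#include <string.h>

extern char **environ;

/* Hypothetical sketch: remove every "name=..." entry from the environment
 * by scanning environ and shifting the remaining pointers down.
 * Returns 0 on success, -1 if the name is empty or contains '='. */
static int unsetenv_sketch(const char *name)
{
    size_t len = strlen(name);
    char **src, **dst;

    if (!len || strchr(name, '='))
        return -1;

    if (!environ)
        return 0;

    for (src = dst = environ; *src; src++) {
        if (strncmp(*src, name, len) == 0 && (*src)[len] == '=')
            continue;        /* matching entry: drop it */
        *dst++ = *src;       /* keep this entry */
    }
    *dst = NULL;             /* terminate the compacted array */
    return 0;
}
```

The length check followed by the `'='` test prevents a name such as `A` from also removing `AB=...`.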
Willy Tarreau	72d9f3351d	BUILD: chunk: properly declare pool_head_trash as extern This one was also declared without the extern modifier in an include file. This needs to be backported to 1.9.	2019-03-29 21:03:20 +01:00
Willy Tarreau	e01d11a75b	BUILD: http: properly mark some struct as extern http_known_methods, HTTP_100 and HTTP_103 were not declared extern and as such were multiply defined since they were in http.h. There was apparently no more side effect but it may depend on the platform and the linker. This needs to be backported to 1.9.	2019-03-29 21:00:22 +01:00
Willy Tarreau	a33d39a1b1	CLEANUP: task: only perform a LIST_DEL() when the list is not empty In tasklet_free() we unconditionally perform a LIST_DEL() even when the list is empty, let's move the LIST_DEL() inside the matching block.	2019-03-25 18:10:53 +01:00
Willy Tarreau	e73256fd2a	BUG/MEDIUM: task/h2: add an idempotent task removal function Previous commit `3ea351368` ("BUG/MEDIUM: h2: Remove the tasklet from the task list if unsubscribing.") uncovered an issue which needs to be addressed in the scheduler's API. The function task_remove_from_task_list() was initially designed to remove a task from the running tasklet list from within the scheduler, and had to be used in h2 to abort pending I/O events. However this function was not designed to be idempotent, occasionally causing a double removal from the tasklet list, with the second doing nothing but affecting the apparent tasks count and making haproxy use 100% CPU on some tests consisting in stopping the client during some transfers. The h2_unsubscribe() function can sometimes be called upon stream exit after an error where the tasklet was possibly already removed, so it. This patch does 2 things : - it renames task_remove_from_task_list() to __task_remove_from_tasklet_list() to discourage users from calling it. Also note the fix in the naming since it's a tasklet list and not a task list. This function is still used from the scheduler. - it adds a new, idempotent, task_remove_from_tasklet_list() function which does nothing if the task is already not in the tasklet list. This patch will need to be backported where the commit above is backported.	2019-03-25 18:02:54 +01:00
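The difference between the raw removal and the idempotent wrapper in commit e73256fd2a can be illustrated with the sketch below. All names (`tasklet_sketch`, `raw_remove()`, `idempotent_remove()`, `queued_count`) are hypothetical, and using a self-linked element to mean "not queued" is an assumption made for the illustration.

```c
/* Minimal circular doubly linked list (n = next, p = prev). */
struct list {
    struct list *n;
    struct list *p;
};

struct tasklet_sketch {
    struct list list;    /* self-linked when the tasklet is not queued */
};

static int queued_count; /* stands in for the apparent tasks count */

/* Append the tasklet to the queue and account for it. */
static void enqueue(struct list *head, struct tasklet_sketch *t)
{
    t->list.p = head->p;
    t->list.n = head;
    head->p->n = &t->list;
    head->p = &t->list;
    queued_count++;
}

/* Raw removal: must only be called on a queued tasklet, otherwise the
 * counter is decremented a second time and drifts. */
static void raw_remove(struct tasklet_sketch *t)
{
    t->list.n->p = t->list.p;
    t->list.p->n = t->list.n;
    t->list.n = t->list.p = &t->list;
    queued_count--;
}

/* Idempotent removal: does nothing if the tasklet is already out. */
static void idempotent_remove(struct tasklet_sketch *t)
{
    if (t->list.n == &t->list)
        return;
    raw_remove(t);
}
```

Calling `idempotent_remove()` twice leaves the counter untouched the second time, which is the property the commit message asks for.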
Christopher Faulet	87a8f353f1	CLEANUP: muxes/stream-int: Remove flags CS_FL_READ_NULL and SI_FL_READ_NULL Since the flag CF_SHUTR is no more set to mark the end of the message, these flags become useless. This patch should be backported to 1.9.	2019-03-25 06:55:23 +01:00
Christopher Faulet	297d3e2e0f	MINOR: channel: Report EOI on the input channel if it was reached in the mux The flag CF_EOI is now set on the input channel when the flag CS_FL_EOI is set on the corresponding conn_stream. In addition, if a read activity is reported when this flag is set, the stream is woken up. This patch should be backported to 1.9.	2019-03-25 06:24:43 +01:00
Christopher Faulet	5311a9255d	MINOR: connection: add a new flag to mark end of input (EOI) Since the beginning, in the H2 multiplexer, when the end of a message is reached, the flag CS_FL_(R)EOS is set on the conn_stream to notify the upper layer that all data were received and consumed and there is no longer any expected. The stream-interface converts it into a shutdown read. But it leads to some ambiguities with the real shutr. Once it was reported at the end of the message, there is no way to report it when the read0 is received. For this reason, aborts after the message was fully received cannot be reported. And on the channel side, it is hard to make the difference between a shutr because the end of the message was reached and a shutr because of an abort. For these reasons, there is now a flag to mark the end of the message. It is called CS_FL_EOI (end-of-input) because it is only used on the receipt path. This flag is only declared and not used yet. This patch will be used by future bug fixes and will have to be backported to 1.9.	2019-03-25 06:24:25 +01:00
Willy Tarreau	0f22299435	CLEANUP: cache: don't export http_cache_applet anymore This one can become static since it's not used by http/htx anymore.	2019-03-19 09:58:35 +01:00
Christopher Faulet	3a78aa6e95	BUG/MINOR: stats: Fully consume large requests in the stats applet In the stats applet (in HTX and legacy HTTP), after a response is fully sent to a client, the request is consumed. It is done at the end, after all the response was copied into the channel's buffer. But only outgoing data at the time the applet is called are consumed. Then the applet is closed. If a request with a huge body is sent, an error is triggered because a SHUTW is caught for an unfinished request. Now, we consume request data until the end. In fact, we don't try to shutdown the request's channel for write anymore. This patch must be backported to 1.9 after some observation period. It should probably be backported in prior versions too. But honestly, with refactoring on the connection layer and the stream interface in 1.9, it is probably safer to not do so.	2019-03-19 09:49:29 +01:00
Willy Tarreau	679bba13f7	MINOR: init: report the list of optionally available services It's never easy to guess what services are built in. We currently have the prometheus exporter in contrib/ which is the only extension for now. Let's enumerate all available ones just like we do for filters and pollers.	2019-03-19 08:08:10 +01:00
Christopher Faulet	203b2b0a5a	MINOR: muxes: Report the Last read with a dedicated flag For convenience, in HTTP muxes (h1 and h2), the end of the stream and the end of the message are reported the same way to the stream, by setting the flag CS_FL_EOS. In the stream-interface, when CS_FL_EOS is detected, a shutdown for read is reported on the channel side. This is historical. With the legacy HTTP layer, because the parsing is done by the stream in HTTP analyzers, the EOS really means a shutdown for read. Most of the time, for muxes h1 and h2, it works pretty well, especially because the keep-alive is handled by the muxes. The stream is only used for one transaction. So mixing EOS and EOM is good enough. But not every time. For now, client aborts are only reported if it happens before the end of the request. It is an error and it is properly handled. But because the EOS was already reported, client aborts after the end of the request are silently ignored. Eventually an error can be reported when the response is sent to the client, if the sending fails. Otherwise, if the server does not reply fast enough, an error is reported when the server timeout is reached. It is the expected behaviour, except when the option abortonclose is set. In this case, we must report an error when the client aborts. But as said before, this event can be ignored. So to be short, for now, the abortonclose is broken. In fact, it is a design problem and we have to rethink all channel's flags and probably the conn-stream ones too. It is important to split EOS and EOM to not lose information anymore. But it is not a small job and the refactoring will be far from straightforward. So for now, temporary flags are introduced. When the last read is received, the flag CS_FL_READ_NULL is set on the conn-stream. This way, we can set the flag SI_FL_READ_NULL on the stream interface. Both flags are persistent. And to be sure to wake the stream, the event CF_READ_NULL is reported. 
So the stream will always have the chance to handle the last read. This patch must be backported to 1.9 because it will be used by another patch to fix the option abortonclose.	2019-03-18 15:50:23 +01:00
Christopher Faulet	2b9b6784b9	MINOR: stats: Move stuff about the stats status codes in stats files The status codes definition (STAT_STATUS_*) and their string representation (stat_status_codes) have been moved into stats files. There is no reason to keep them in proto_http files.	2019-03-15 14:34:59 +01:00
Christopher Faulet	3c2ecf75c8	MINOR: stats: Add the status code STAT_STATUS_IVAL to handle invalid requests This patch must be backported to 1.9 because a bug fix depends on it.	2019-03-15 14:34:52 +01:00
Olivier Houchard	1d7f37a2cb	BUG/MAJOR: tasks: Use the TASK_GLOBAL flag to know if we're in the global rq. In task_unlink_rq, to decide if we should lock the global runqueue lock, use the TASK_GLOBAL flag instead of relying on t->thread_mask being tid_bit, as it could be so while still being in the global runqueue if another thread woke that task for us. This should be backported to 1.9.	2019-03-14 16:19:11 +01:00
Olivier Houchard	237985b228	MEDIUM: connections: Use _HA_ATOMIC_* Use _HA_ATOMIC_ instead of HA_ATOMIC_ because we know we don't need barriers	2019-03-14 15:55:15 +01:00
Olivier Houchard	9f8d821a55	MEDIUM: list: Use _HA_ATOMIC_* Use _HA_ATOMIC_ instead of HA_ATOMIC_ because we know we don't need barriers.	2019-03-14 15:55:15 +01:00
Olivier Houchard	17fbb4eb3f	MEDIUM: list: Remove useless barriers. Don't bother forcing a barrier after using HA_ATOMIC_XCHG if we're about to check the returned value anyway.	2019-03-14 15:55:15 +01:00
Willy Tarreau	b0cef35b09	BUG/MEDIUM: list: fix incorrect pointer unlocking in LIST_DEL_LOCKED() Injecting on a saturated listener started to exhibit some deadlocks again between LIST_POP_LOCKED() and LIST_DEL_LOCKED(). Olivier found it was due to a leftover from a previous debugging session. This patch fixes it. This will have to be backported if the other LIST_*_LOCKED() patches are backported.	2019-03-13 14:15:54 +01:00
Willy Tarreau	df23c0ce45	MINOR: config: continue to rely on DEFAULT_MAXCONN to set the minimum maxconn Some packages used to rely on DEFAULT_MAXCONN to set the default global maxconn value to use regardless of the initial ulimit. The recent changes made the lowest bound set to 100 so that it is compatible with almost any environment. Now that DEFAULT_MAXCONN is not needed for anything else, we can use it for the lowest bound set when maxconn is not configured. This way it retains its original purpose of setting the default maxconn value even though most of the time the effective value will be higher thanks to the automatic computation based on "ulimit -n".	2019-03-13 10:10:49 +01:00
Willy Tarreau	ca783d4ee6	MINOR: config: remove obsolete use of DEFAULT_MAXCONN at various places This entry was still set to 2000 but never used anymore. The only places where it appeared was as an alias to SYSTEM_MAXCONN which forces it, so let's turn these ones to SYSTEM_MAXCONN and remove the default value for DEFAULT_MAXCONN. SYSTEM_MAXCONN still defines the upper bound however.	2019-03-13 10:10:25 +01:00
Olivier Houchard	20872763dd	MEDIUM: memory: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	4c28328572	MEDIUM: task: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	aa4d71a7fe	MEDIUM: server: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	11ecfd1c01	MEDIUM: proxy: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	d5f9b19196	MEDIUM: freq_ctr: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	d360879fb5	MEDIUM: fd: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	8beb27e9ce	MEDIUM: xref: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	a2735340fb	MEDIUM: applets: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:37 +01:00
Olivier Houchard	d2b5d16187	MEDIUM: various: Use __ha_barrier_atomic* when relevant. When protecting data modified by atomic operations, use __ha_barrier_atomic* to avoid unneeded barriers on x86.	2019-03-11 17:02:37 +01:00
Olivier Houchard	d0c3b8894a	MINOR: threads: Add macros to do atomic operation with no memory barrier. Add variants of the HA_ATOMIC* macros, prefixed with a _, that do the atomic operation with no barrier generated by the compiler. It is expected the developer adds barriers manually if needed.	2019-03-11 17:02:37 +01:00
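The distinction between the barrier-generating and barrier-free variants in commits d0c3b8894a and 113537967c can be sketched with the GCC/Clang `__atomic` builtins. The macro names below are hypothetical, and mapping the underscore-prefixed variants to `__ATOMIC_RELAXED` is an assumption made for illustration; the log only states that they generate no barrier.

```c
/* Hypothetical macros, assuming a compiler that provides the __atomic
 * builtins (GCC or Clang). */

/* Sequentially consistent add: the compiler emits whatever barriers the
 * target needs. */
#define SKETCH_ATOMIC_ADD(ptr, val) \
    __atomic_add_fetch((ptr), (val), __ATOMIC_SEQ_CST)

/* Relaxed add: the operation is atomic but implies no ordering, so the
 * caller places barriers by hand where they are needed. */
#define _SKETCH_ATOMIC_ADD(ptr, val) \
    __atomic_add_fetch((ptr), (val), __ATOMIC_RELAXED)

/* Explicit full barrier for use around relaxed operations. */
#define sketch_barrier_full() \
    __atomic_thread_fence(__ATOMIC_SEQ_CST)

/* Example: bump two counters with relaxed adds, then publish both with a
 * single barrier instead of paying for one per operation. */
static int bump_both(int *a, int *b)
{
    int sum = _SKETCH_ATOMIC_ADD(a, 1) + _SKETCH_ATOMIC_ADD(b, 1);

    sketch_barrier_full();
    return sum;
}
```

Grouping several relaxed operations under one explicit barrier is the kind of saving the surrounding commits aim for.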
Olivier Houchard	113537967c	MEDIUM: threads: Use __ATOMIC_SEQ_CST when using the newer atomic API. When using the new __atomic* API, ask the compiler to generate barriers. A variant of those functions that don't generate barriers will be added later. Before that, using HA_ATOMIC* would not generate any barrier, and some parts of the code should be reviewed and missing barriers should be added. This should probably be backported to 1.8 and 1.9.	2019-03-11 17:02:37 +01:00
Olivier Houchard	9abcf6ef9a	MINOR: threads: Implement __ha_barrier_atomic*. Implement __ha_barrier functions to be used when trying to protect data modified by atomic operations (except when using HA_ATOMIC_STORE). On intel, atomic operations either use the LOCK prefix or xchg, and both act as full barriers, so there's no need to add an extra barrier.	2019-03-11 17:02:37 +01:00
Olivier Houchard	92fce85d03	MINOR: fd: Remove debugging code. Remove a debugging test, and call to abort, it's no longer needed.	2019-03-08 16:05:25 +01:00
Willy Tarreau	1e56c70cc9	OPTIM: task: limit the impact of memory barriers in task_remove_from_task_list() In this function we end up with successive locked operations then a store barrier, and in addition the compiler has to emit less efficient code due to a longer jump. There's no need for absolutely updating the tasks_run_queue counter before clearing the task's leaf pointer, so let's swap the two operations and benefit from a single barrier as much as possible. This code is on the hot path and shows about half a percent of improvement with 8 threads.	2019-03-07 18:44:12 +01:00
Willy Tarreau	0cf33176bd	MINOR: listener: move thr_idx from the bind_conf to the listener Tests show that it's slightly faster to have this field in the listener. The cache walk patterns are under heavy stress and having only this field written to in the bind_conf was wasting a cache line that was heavily read. Let's move this close to the other entries already written to in the listener. Warning, the position does have an impact on peak performance.	2019-03-07 14:08:26 +01:00
Willy Tarreau	9f1d4e7f7f	CLEANUP: listener: remove old thread bit mapping Now that the P2C algorithm for the accept queue is removed, we don't need to map a number to a thread bit anymore, so let's remove all these fields which are taking quite some space for no reason.	2019-03-07 13:59:04 +01:00
Willy Tarreau	d87a67f9bc	MINOR: tools: implement my_flsl() We already have my_ffsl() to find the lowest bit set in a word, and this patch implements the search for the highest bit set in a word. On x86 it uses the bsr instruction and on other architectures it uses an efficient implementation.	2019-03-07 13:48:04 +01:00
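For reference, a portable fallback for finding the highest bit set can be as simple as the loop below. This is only a sketch: `flsl_sketch()` is a hypothetical name, the 1-based result (with 0 meaning "no bit set") is an assumed convention, and the patch itself is described as using `bsr` on x86 and a more efficient implementation elsewhere.

```c
/* Hypothetical sketch: return the 1-based position of the highest bit set
 * in w, or 0 if w is zero. A plain shift loop, chosen for clarity rather
 * than speed. */
static inline unsigned int flsl_sketch(unsigned long w)
{
    unsigned int pos = 0;

    while (w) {
        pos++;       /* one more bit position consumed */
        w >>= 1;
    }
    return pos;
}
```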
Willy Tarreau	fc630bd373	MINOR: listener: improve incoming traffic distribution By picking two randoms following the P2C algorithm, we seldom observe asymmetric loads on bursts of small session counts. This is typically what makes h2load take a bit of time to complete the last 100% because if a thread gets two connections while the other ones only have one, it takes twice the time to complete its work. This patch proposes a modification of the p2c algorithm which seems more suitable to this case : it mixes a rotating index with a random. This way, we're certain that all threads are consulted in turn and at the same time we're not forced to use the ones we're giving a chance. This significantly increases the traffic rate. Now h2load shows faster completion and the average request rates on H2 and the TLS resume rate increases by a bit more than 5% compared to pure p2c. The index was placed into the struct bind_conf because 1) it's faster there and it's the best place to optimally distribute traffic among a group of listeners. It's the only runtime-modified element there and it will be quite cache-hot.	2019-03-07 13:48:04 +01:00
Willy Tarreau	b238b12e98	MINOR: task: use LIST_DEL_INIT() to remove a task from the queue By using LIST_DEL_INIT() instead of LIST_DEL()+LIST_INIT() we manage to bump the peak connection rate by no less than 3% on 8 threads. The perf top profile shows much less contention in this area which suffered from the second reload.	2019-03-07 11:45:44 +01:00
Willy Tarreau	c5bd311b2a	MINOR: lists: add a LIST_DEL_INIT() macro It turns out that we call LIST_DEL+LIST_INIT very frequently and that the compiler doesn't know what pointers get modified in the e->n->p and e->p->n dance, so when LIST_INIT() is called, it reloads these pointers, which is quite a bit of a mess in terms of performance. This patch adds LIST_DEL_INIT() to perform the two operations at once using local temporary variables so that the compiler knows these pointers are left unaffected.	2019-03-07 11:45:44 +01:00
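The idea behind commit c5bd311b2a, reading both neighbours into local variables before unlinking and reinitializing the element, can be sketched as below. The structure layout and function names are illustrative and are not copied from the patch.

```c
/* Minimal circular doubly linked list (n = next, p = prev). */
struct list {
    struct list *n;
    struct list *p;
};

/* Make an element (or a head) point to itself. */
static inline void list_init(struct list *l)
{
    l->n = l->p = l;
}

/* Append el at the tail of the list headed by head. */
static inline void list_addq(struct list *head, struct list *el)
{
    el->p = head->p;
    el->n = head;
    head->p->n = el;
    head->p = el;
}

/* Unlink el and leave it self-linked in a single step. Both neighbours are
 * loaded once into locals, so the compiler does not have to reload el->n
 * and el->p after the stores made through them. */
static inline void list_del_init(struct list *el)
{
    struct list *n = el->n;
    struct list *p = el->p;

    n->p = p;
    p->n = n;
    el->n = el->p = el;
}
```

Because the element ends up self-linked, calling `list_del_init()` on it a second time is harmless.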
Frédéric Lécaille	5f33f85ce8	MINOR: sample: Extract some protocol buffers specific code. We move the code responsible for parsing protocol buffers messages inside gRPC messages from sample.c to include/proto/protocol_buffers.h so as to reuse it to cascade the "ungrpc" converter.	2019-03-06 15:36:02 +01:00
Frédéric Lécaille	756d97f205	MINOR: sample: Rework gRPC converter code. From now on, "ungrpc" may take a second optional argument to provide the protocol buffers types used to encode the field value to be extracted. When absent the field value is extracted as a binary sample which may then be followed by other converters like "hex" which takes binary as input sample. When this second argument is a type which does not match the one found by "ungrpc", this field is considered as not found even if present. With this patch we also remove the useless "varint" and "svarint" converters. Update the documentation about "ungrpc" converters.	2019-03-05 11:04:23 +01:00
Frédéric Lécaille	7c93e88d0c	MINOR: sample: Code factorization "ungrpc" converter. Parsing protocol buffer fields always consists in skipping the field if the field is not found or storing the field value if found. So, with this patch we factorize a little bit the code for the "ungrpc" converter.	2019-03-05 11:03:53 +01:00
Willy Tarreau	967de20a43	BUG/MEDIUM: list: fix again LIST_ADDQ_LOCKED Well, that's becoming embarrassing. Now this fixes commit `4ef6801c` ("BUG/MEDIUM: list: correct fix for LIST_POP_LOCKED's removal of last element") which itself tried to fix commit `285192564`. This fix only works under low contention and was tested with the listener's queue. With the idle conns it's obvious that it's still wrong since adding more than one element to the list leaves a LLIST_BUSY pointer into the list's head. This was visible when accumulating idle connections in a server's list. This new version of the fix almost goes back to the original code, except that since then we addressed issues with expectedly idempotent operations that were not. Now the code has been verified on paper again and has survived 300 million connections spread over 4 threads. This will have to be backported if the commit above is backported.	2019-03-04 14:09:22 +01:00
Willy Tarreau	bf6964007a	MINOR: global: keep a copy of the initial rlim_fd_cur and rlim_fd_max values Let's keep a copy of these initial values. They will be useful to compute automatic maxconn, as well as to restore proper limits when doing an execve() on external checks.	2019-03-01 10:40:30 +01:00
Frédéric Lécaille	645635da84	MINOR: peers: Add a message for heartbeat. This patch implements a peer heartbeat feature to prevent any haproxy peer from reconnecting too often, consuming sockets for nothing. To do so, we add PEER_MSG_CTRL_HEARTBEAT new message to PEER_MSG_CLASS_CONTROL peers control class of messages. A ->heartbeat field is added to peer structs to store the heartbeat timeout value which is handled by the same function as for ->reconnect to control the session timeouts. A 2-bytes heartbeat message is sent every 3s when no updates have to be sent. This way, the peer which receives such a message is sure the remote peer is still alive. So, it resets the ->reconnect peer session timeout to its initial value (5s). This prevents any reconnection to an already connected alive peer.	2019-03-01 09:33:26 +01:00
Willy Tarreau	c8d5b95e6d	MEDIUM: config: don't enforce a low frontend maxconn value anymore Historically the default frontend's maxconn used to be quite low (2000), which was sufficient two decades ago but often proved to be a problem when users had purposely set the global maxconn value but forgot to set the frontend's. There is no point in keeping this arbitrary limit for frontends : when the global maxconn is lower, it's already too high and when the global maxconn is much higher, it becomes a limiting factor which causes trouble in production. This commit allows the value to be set to zero, which becomes the new default value, to mean it's not directly limited, or in fact it's set to the global maxconn. Since this operation used to be performed before computing a possibly automatic global maxconn based on memory limits, the calculation of the maxconn value and its propagation to the backends' fullconn has now moved to a dedicated function, proxy_adjust_all_maxconn(), which is called once the global maxconn is stabilized. This comes with two benefits : 1) a configuration missing "maxconn" in the defaults section will not limit itself to a magically hardcoded value but will scale up to the global maxconn ; 2) when the global maxconn is not set and memory limits are used instead, the frontends' maxconn automatically adapts, and the backends' fullconn as well.	2019-02-28 17:05:32 +01:00
Willy Tarreau	e2711c7bd6	MINOR: listener: introduce listener_backlog() to report the backlog value In an attempt to try to provide automatic maxconn settings, we need to decorrelate a listener's backlog and maxconn so that these values can be independent. This introduces a listener_backlog() function which retrieves the backlog value from the listener's backlog, the frontend's, the listener's maxconn, the frontend's or falls back to 1024. This corresponds to what was done in cfgparse.c to force a value there except the last fallback which was not set since the frontend's maxconn is always known.	2019-02-28 17:05:29 +01:00
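The fallback order listed in commit e2711c7bd6 can be written as a short chain of tests. The sketch below uses hypothetical structure and function names and assumes that a value of zero means "not configured"; it is not the function from the patch.

```c
/* Hypothetical, reduced views of a frontend and a listener. A value of 0
 * is assumed to mean "not configured". */
struct frontend_sketch {
    int backlog;
    int maxconn;
};

struct listener_sketch {
    int backlog;
    int maxconn;
    const struct frontend_sketch *fe;
};

/* Return the first configured value in the order given by the commit
 * message: listener backlog, frontend backlog, listener maxconn,
 * frontend maxconn, then 1024. */
static int backlog_sketch(const struct listener_sketch *l)
{
    if (l->backlog)
        return l->backlog;
    if (l->fe->backlog)
        return l->fe->backlog;
    if (l->maxconn)
        return l->maxconn;
    if (l->fe->maxconn)
        return l->fe->maxconn;
    return 1024;
}
```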
Willy Tarreau	4ef6801cd4	BUG/MEDIUM: list: correct fix for LIST_POP_LOCKED's removal of last element As seen with Olivier, in the end the fix in commit `285192564` ("BUG/MEDIUM: list: fix LIST_POP_LOCKED's removal of the last pointer") is wrong, the code there was right but the bug was triggered by another bug in LIST_ADDQ_LOCKED() which doesn't properly update the list's head by inserting in the wrong order. This will have to be backported if the commit above is backported.	2019-02-28 16:51:28 +01:00
Willy Tarreau	01abd02508	BUG/MEDIUM: listener: use a self-locked list for the dequeue lists There is a very difficult to reproduce race in the listener's accept code, which is much easier to reproduce once connection limits are properly enforced. It's an ABBA lock issue : - the following functions take l->lock then lq_lock : disable_listener, pause_listener, listener_full, limit_listener, do_unbind_listener - the following ones take lq_lock then l->lock : resume_listener, dequeue_all_listener This is because __resume_listener() only takes the listener's lock and expects to be called with lq_lock held. The problem can easily happen when listener_full() and limit_listener() are called a lot while in parallel another thread releases sessions for the same listener using listener_release() which in turn calls resume_listener(). This scenario is more prevalent in 2.0-dev since the removal of the accept lock in listener_accept(). However in 1.9 and before, a different but extremely unlikely scenario can happen : thread1 thread2 ............................ enter listener_accept() limit_listener() ............................ long pause before taking the lock session_free() dequeue_all_listeners() lock(lq_lock) [1] ............................ try_lock(l->lock) [2] __resume_listener() spin_lock(l->lock) =>WAIT[2] ............................ accept() l->accept() nbconn==maxconn => listener_full() state==LI_LIMITED => lock(lq_lock) =>DEADLOCK[1]! In practice it is almost impossible to trigger it because it requires to limit both on the listener's maxconn and the frontend's rate limit, at the same time, and to release the listener when the connection rate goes below the limit between poll() returns the FD and the lock is taken (a few nanoseconds). But maybe with threads competing on the same core it has more chances to appear. 
This patch removes the lq_lock and replaces it with a lockless queue for the listener's wait queue (well, technically speaking a self-locked queue) brought by commit `a8434ec14` ("MINOR: lists: Implement locked variations.") and its few subsequent fixes. This relieves us from the need of the lq_lock and removes the deadlock. It also gets rid of the distinction between __resume_listener() and resume_listener() since the only difference was the lq_lock. All listener removals from the list are now unconditional to avoid races on the state. It's worth noting that the list used to never be initialized and that it used to work only thanks to the state tests, so the initialization has now been added. This patch must carefully be backported to 1.9 and very likely 1.8. It is mandatory to be careful about replacing all manipulations of l->wait_queue, global.listener_queue and p->listener_queue.	2019-02-28 16:08:54 +01:00
Willy Tarreau	c912f94b57	MINOR: server: remove a few unneeded LIST_INIT calls after LIST_DEL_LOCKED Since LIST_DEL_LOCKED() and LIST_POP_LOCKED() now automatically reinitialize the removed element, there's no need for keeping this LIST_INIT() call in the idle connection code.	2019-02-28 16:08:54 +01:00
Willy Tarreau	4c747e86cd	MINOR: list: make the delete and pop operations idempotent These operations previously used to return a "locked" element, which is a constraint when multiple threads try to delete the same element, because the second one will block indefinitely. Instead, let's make sure that both LIST_DEL_LOCKED() and LIST_POP_LOCKED() always reinitialize the element after deleting it. This ensures that the second thread will immediately unblock and succeed with the removal. It also secures the pop vs delete competition that may happen when trying to remove an element that's about to be dequeued.	2019-02-28 16:03:29 +01:00
Willy Tarreau	690d2ad4d2	BUG/MEDIUM: list: add missing store barriers when updating elements and head Commit `a8434ec14` ("MINOR: lists: Implement locked variations.") introduced locked lists which use the elements pointers as locks for concurrent operations. Under heavy stress the lists occasionally fail. The cause is a missing barrier at some points when updating the list element and the head : nothing prevents the compiler (or CPU) from updating the list head first before updating the element, making another thread jump to a wrong location. This patch simply adds the missing barriers before these two operations. This will have to be backported if the commit above is backported.	2019-02-28 15:59:31 +01:00
Willy Tarreau	285192564d	BUG/MEDIUM: list: fix LIST_POP_LOCKED's removal of the last pointer There was a typo making the last updated pointer be the pre-last element's prev instead of the last's prev element. It didn't show up during early tests because the contention is very rare on this one and it's implicitly recovered when updating the pointers to go to the next element, but it was clearly visible in the listener_accept() tests by having all threads block on LIST_POP_LOCKED() with n==p==LLIST_BUSY. This will have to be backported if commit `a8434ec14` ("MINOR: lists: Implement locked variations.") is backported.	2019-02-28 15:59:31 +01:00
Willy Tarreau	bd20ad5874	BUG/MEDIUM: list: fix the rollback on addq in the locked lists Commit `a8434ec14` ("MINOR: lists: Implement locked variations.") introduced locked lists which use the elements pointers as locks for concurrent operations. A copy-paste typo in LIST_ADDQ_LOCKED() causes corruption in the list in case the next pointer is already held, as it restores the previous pointer into the next one. It may impact the server pools. This will have to be backported if the commit above is backported.	2019-02-28 15:10:15 +01:00
Willy Tarreau	149ab779cc	MAJOR: threads: enable one thread per CPU by default Threads have long matured by now, still for most users their usage is not trivial. It's about time to enable them by default on platforms where we know the number of CPUs bound. This patch does this, it counts the number of CPUs the process is bound to upon startup, and enables as many threads by default. Of course, "nbthread" still overrides this, but if it's not set the default behaviour is to start one thread per CPU. The default number of threads is reported in "haproxy -vv". Simply using "taskset -c" is now enough to adjust this number of threads so that there is no more need for playing with cpu-map. And thanks to the previous patches on the listener, the vast majority of configurations will not need to duplicate "bind" lines with the "process x/y" statement anymore either, so a simple config will automatically adapt to the number of processors available.	2019-02-27 14:51:50 +01:00
Willy Tarreau	7ac908bf8c	MINOR: config: add global tune.listener.multi-queue setting tune.listener.multi-queue { on \| off } Enables ('on') or disables ('off') the listener's multi-queue accept which spreads the incoming traffic to all threads a "bind" line is allowed to run on instead of taking them for itself. This provides a smoother traffic distribution and scales much better, especially in environments where threads may be unevenly loaded due to external activity (network interrupts colliding with one thread for example). This option is enabled by default, but it may be forcefully disabled for troubleshooting or for situations where it is estimated that the operating system already provides a good enough distribution and connections are extremely short-lived.	2019-02-27 14:27:07 +01:00
Willy Tarreau	8a03408d81	MINOR: activity: add accept queue counters for pushed and overflows It's important to monitor the accept queues to know if some incoming connections had to be handled by their originating thread due to an overflow. It's also important to be able to confirm thread fairness. This patch adds "accq_pushed" to activity reporting, which reports the number of connections that were successfully pushed into each thread's queue, and "accq_full", which indicates the number of connections that couldn't be pushed because the thread's queue was full.	2019-02-27 14:27:07 +01:00
Willy Tarreau	1efafce61f	MINOR: listener: implement multi-queue accept for threads There is one point where we can migrate a connection to another thread without taking risk, it's when we accept it : the new FD is not yet in the fd cache and no task was created yet. It's still possible to assign it a different thread than the one which accepted the connection. The only requirement for this is to have one accept queue per thread and their respective processing tasks that have to be woken up each time an entry is added to the queue. This is a multiple-producer, single-consumer model. Entries are added at the queue's tail and the processing task is woken up. The consumer picks entries at the head and processes them in order. The accept queue contains the fd, the source address, and the listener. Each entry of the accept queue was rounded up to 64 bytes (one cache line) to avoid cache aliasing because tests have shown that otherwise performance suffers a lot (5%). A test has shown that it's important to have at least 256 entries for the rings, as at 128 it's still possible to fill them often at high loads on small thread counts. The processing task does almost nothing except calling the listener's accept() function and updating the global session and SSL rate counters just like listener_accept() does on synchronous calls. At this point the accept queue is implemented but not used.	2019-02-27 14:27:07 +01:00
Willy Tarreau	b2b50a7784	MINOR: listener: pre-compute some thread counts per bind_conf In order to quickly pick a thread ID when accepting a connection, we'll need to know certain pre-computed values derived from the thread mask, which are counts of bits per position multiples of 1, 2, 4, 8, 16 and 32. In practice it is sufficient to compute only the 4 first ones and store them in the bind_conf. We update the count every time the bind_thread value is adjusted. The fields in the bind_conf struct have been moved around a little bit to make it easier to group all thread bit values into the same cache line. The function used to return a thread number is bind_map_thread_id(), and it maps a number between 0 and 31/63 to a thread ID between 0 and 31/63, starting from the left.	2019-02-27 14:27:07 +01:00
Willy Tarreau	f3241115e7	MINOR: tools: implement functions to look up the nth bit set in a mask Function mask_find_rank_bit() returns the bit position in mask <m> of the nth bit set of rank <r>, between 0 and LONGBITS-1 included, starting from the left. For example ranks 0,1,2,3 for mask 0x55 will be 6, 4, 2 and 0 respectively. This algorithm is based on a popcount variant and is described here : https://graphics.stanford.edu/~seander/bithacks.html.	2019-02-27 14:27:07 +01:00
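The rank lookup described in the entry above can be illustrated with a deliberately naive loop. This is only a sketch: the name `naive_find_rank_bit` is hypothetical, the loop is not the popcount-based algorithm the commit refers to, and returning -1 when the mask has too few bits set is an assumption made here for illustration.

```c
#include <limits.h>

/* Hypothetical reference version, not HAProxy's optimized code.
 * Returns the position of the set bit of rank <r> in mask <m>, where
 * rank 0 is the leftmost (most significant) set bit. Returns -1 when
 * the mask holds fewer than r+1 set bits (an assumption of this sketch).
 */
static int naive_find_rank_bit(unsigned int r, unsigned long m)
{
    int bit;

    /* walk from the most significant bit down to bit 0 */
    for (bit = (int)(sizeof(m) * CHAR_BIT) - 1; bit >= 0; bit--) {
        if (m & (1UL << bit)) {
            if (r == 0)
                return bit;
            r--;
        }
    }
    return -1;
}
```

With mask 0x55, ranks 0, 1, 2 and 3 map to bits 6, 4, 2 and 0, which matches the example given in the commit message.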
Willy Tarreau	9e85318417	MINOR: listener: maintain a per-thread count of the number of connections on a listener Having this information will help us improve thread-level distribution of incoming traffic.	2019-02-27 14:27:07 +01:00
Willy Tarreau	a36b324777	MEDIUM: listener: keep a single thread-mask and warn on "process" misuse Now that nbproc and nbthread are exclusive, we can still provide more detailed explanations about what we've found in the config when a bind line appears on multiple threads and processes at the same time, then ignore the setting. This patch reduces the listener's thread mask to a single mask instead of an array of masks per process. Now we have only one thread mask and one process mask per bind-conf. This removes ~504 bytes of RAM per bind-conf and will simplify handling of thread masks. If a "bind" line only refers to process numbers not found by its parent frontend or not covered by the global nbproc directive, or to a thread not covered by the global nbthread directive, a warning is emitted saying what will be used instead.	2019-02-27 14:27:07 +01:00
Olivier Houchard	db64489aac	BUG/MEDIUM: lists: Properly handle the case we're removing the first elt. In LIST_DEL_LOCKED(), initialize p2 to NULL, and only attempt to set it back to its previous value if we had a previous element, and thus p2 is non-NULL.	2019-02-26 18:47:59 +01:00
Olivier Houchard	9ea5d361ae	MEDIUM: servers: Reorganize the way idle connections are cleaned. Instead of having one task per thread and per server that cleans the idling connections, have only one global task for all servers. That task parses all the servers that currently have idling connections, and removes half of them, to put them in a per-thread list of connections to kill. For each thread that does have connections to kill, wake a task to do so, so that the cleaning will be done in the context of said thread.	2019-02-26 18:17:32 +01:00
Olivier Houchard	7f1bc31fee	MEDIUM: servers: Used a locked list for idle_orphan_conns. Use the locked macros when manipulating idle_orphan_conns, so that other threads can remove elements from it. It will be useful later to avoid having a task per server and per thread to cleanup the orphan list.	2019-02-26 18:17:32 +01:00
Olivier Houchard	a8434ec146	MINOR: lists: Implement locked variations. Implement LIST_ADD_LOCKED(), LIST_ADDQ_LOCKED(), LIST_DEL_LOCKED() and LIST_POP_LOCKED(). LIST_ADD_LOCKED, LIST_ADDQ_LOCKED and LIST_DEL_LOCKED work the same as LIST_ADD, LIST_ADDQ and LIST_DEL, except before any manipulation it locks the relevant elements of the list, so it's safe to manipulate the list with multiple threads. LIST_POP_LOCKED() removes the first element from the list, and returns its data.	2019-02-26 18:17:32 +01:00
Frédéric Lécaille	1fceee8316	MINOR: http_fetch: add "req.ungrpc" sample fetch for gRPC. This patch implements "req.ungrpc" sample fetch method to decode and parse a gRPC request. It takes only one argument: a protocol buffers field number to identify the protocol buffers message number to be looked up. This argument is a sort of path in dotted notation to the terminal field number to be retrieved. ex: req.ungrpc(1.2.3.4) This sample fetch catches the data in raw mode, without interpreting them. Some protocol buffers specific converters may be used to convert the data to the correct type.	2019-02-26 16:27:05 +01:00
Frédéric Lécaille	3a463c92cf	MINOR: arg: Add support for ARGT_PBUF_FNUM arg type. This new argument type is used to parse Protocol Buffers field number with dotted notation (e.g: 1.2.3.4).	2019-02-26 16:27:05 +01:00
Frédéric Lécaille	3b71716685	MINOR: standard: Add a function to parse uints (dotted notation). This function is useful to parse strings made of unsigned integers and to allocate a C array of unsigned integers from there. For instance this function allocates this array { 1, 2, 3, 4, } from this string: "1.2.3.4".	2019-02-26 16:27:05 +01:00
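The dotted-notation parsing described in the entry above could look roughly like the following. This is a sketch under assumptions: the function name, its signature and its error convention are hypothetical and do not reproduce the function added by the commit.

```c
#include <ctype.h>
#include <errno.h>
#include <limits.h>
#include <stdlib.h>

/* Hypothetical sketch: parse "1.2.3.4" into a freshly allocated array
 * { 1, 2, 3, 4 }. Returns 0 on success, with *out pointing to the array
 * (to be released by the caller with free()) and *count set to its
 * length. Returns -1 on syntax error, overflow or allocation failure.
 */
static int parse_dotted_uints_sketch(const char *s, unsigned int **out, size_t *count)
{
    unsigned int *arr = NULL;
    size_t n = 0;

    for (;;) {
        char *end;
        unsigned long v;
        unsigned int *tmp;

        /* each component must start with a digit (rejects "", "1..2", "1.") */
        if (!isdigit((unsigned char)*s))
            goto fail;

        errno = 0;
        v = strtoul(s, &end, 10);
        if (errno || v > UINT_MAX)
            goto fail;

        /* grow the array by one slot */
        tmp = realloc(arr, (n + 1) * sizeof(*arr));
        if (!tmp)
            goto fail;
        arr = tmp;
        arr[n++] = (unsigned int)v;

        if (*end == '\0')
            break;          /* end of string: done */
        if (*end != '.')
            goto fail;      /* only dots may separate components */
        s = end + 1;
    }

    *out = arr;
    *count = n;
    return 0;

fail:
    free(arr);
    return -1;
}
```

Growing the array one element at a time keeps the sketch short; a real implementation would more likely count the components first and allocate once.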
Christopher Faulet	c6827d52c1	MINOR: channel/htx: Add function to skips output bytes from an HTX channel It is the HTX version of co_skip(). Internally, It uses the function htx_drain(). It will be used by other commits to fix bugs, so it must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Christopher Faulet	549822f0a1	MINOR: htx: Add function to drain data from an HTX message The function htx_drain() can now be used to drain data from an HTX message. It will be used by other commits to fix bugs, so it must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Christopher Faulet	729b5b308c	BUG/MINOR: channel: Set CF_WROTE_DATA when outgoing data are skipped in co_skip(), the flag CF_WRITE_PARTIAL is set on the channel. The flag CF_WROTE_DATA must also be set to notify the channel some data were sent. This patch must be backported to 1.9.	2019-02-26 14:04:23 +01:00
Richard Russo	bc9d9844d5	BUG/MAJOR: fd/threads, task/threads: ensure all spin locks are unlocked Calculate if the fd or task should be locked once, before locking, and reuse the calculation when determining when to unlock. Fixes a race condition added in `87d54a9a` for fds, and `b20aa9ee` for tasks, released in 1.9-dev4. When one thread modifies thread_mask to be a single thread for a task or fd while a second thread has locked or is waiting on a lock for that task or fd, the second thread will not unlock it. For FDs, this is observable when a listener is polled by multiple threads, and is closed while those threads have events pending. For tasks, this seems possible, where task_set_affinity is called, but I did not observe it. This must be backported to 1.9.	2019-02-25 16:16:36 +01:00
Willy Tarreau	2d7f81b809	MINOR: fd: add a new my_closefrom() function to close all FDs This is a naive implementation of closefrom() which closes all FDs starting from the one passed in argument. closefrom() is not provided on all operating systems, and other versions will follow.	2019-02-21 22:19:17 +01:00
Olivier Houchard	f131481a0a	BUG/MEDIUM: servers: Add a per-thread counter of idle connections. Add a per-thread counter of idling connections, and use it to determine how many connections we should kill after the timeout, instead of using the global counter, or we're likely to just kill most of the connections. This should be backported to 1.9.	2019-02-21 19:07:45 +01:00
Olivier Houchard	e737103173	BUG/MEDIUM: servers: Use atomic operations when handling curr_idle_conns. Use atomic operations when dealing with srv->curr_idle_conns, as it's shared between threads, otherwise we could get inconsistencies. This should be backported to 1.9.	2019-02-21 19:07:19 +01:00
Christopher Faulet	0b46548a68	BUG/MEDIUM: h2/htx: Correctly handle interim responses when HTX is enabled 1xx responses do not work in HTTP2 when the HTX is enabled. First of all, when a response is parsed, only one HEADERS frame is expected. So when an interim response is received, the flag H2_SF_HEADERS_RCVD is set and the next HEADERS frame (for another interim response or the final one) is parsed as a trailers one. Then when the response is sent, because an EOM block is found at the end of the interim HTX response, the ES flag is added on the frame, closing the stream too early. Here, it is a design problem of the HTX. Interim responses are considered as full messages, leading to some ambiguities when HTX messages are processed. This will not be fixed now, but we need to keep it in mind for future improvements. To fix the parsing bug, the flag H2_MSGF_RSP_1XX is added when the response headers are decoded. When this flag is set, an EOM block is added into the HTX message, despite the fact that there is no ES flag on the frame. And we don't set the flag H2_SF_HEADERS_RCVD on the corresponding H2S. So the next HEADERS frame will not be parsed as a trailers one. To fix the sending bug, the ES flag is not set on the frame when an interim response is processed and the flag H2_SF_HEADERS_SENT is not set on the corresponding H2S. This patch must be backported to 1.9.	2019-02-19 16:26:14 +01:00
Olivier Houchard	9efa7b8ba8	BUILD/MEDIUM: initcall: Fix build on MacOS. MacOS syntax for sections is a bit different, so implement it. (see issue #42). This should be backported to 1.9.	2019-02-15 14:32:35 +01:00
Frédéric Lécaille	76d2cef0c2	BUG/MEDIUM: peers: Missing peer initializations. Initialize ->srv peer field for all the peers, the local peer included. Indeed, a haproxy process needs to connect to the local peer of a remote process. Furthermore, when a "peer" or "server" line is parsed by parse_server() the address must be copied to ->addr field of the peer object only if this address has been also parsed by parse_server(). This is not the case if this address belongs to the local peer and is provided on a "server" line. After having parsed the "peer" or "server" lines of a peers section, the ->srv part of all the peers must be initialized for SSL, if enabled. Same thing for the binding part. Revert `1417f0b` commit which is no longer required. No backport is needed, this is purely 2.0.	2019-02-12 19:49:22 +01:00
Ben51Degrees	4ddf59d070	MEDIUM: 51d: Enabled multi threaded operation in the 51Degrees module. The existing threading flag in the 51Degrees API (FIFTYONEDEGREES_NO_THREADING) has now been mapped to the HAProxy threading flag (USE_THREAD), and the 51Degrees module code has been made thread safe. In Pattern, the cache is now locked with a spin lock from hathreads.h using a new label 'OTHER_LOCK'. The workset pool is now created with the same size as the number of threads to avoid any time waiting on a workset. In Hash Trie, the global device offsets structure is only used in single threaded operation. Multi threaded operation creates a new offsets structure in each thread.	2019-02-08 21:29:23 +01:00
Willy Tarreau	1417f0b5dc	BUG/MEDIUM: peers: check that p->srv actually exists before using p->srv->use_ssl Commit `1055e687a` ("MINOR: peers: Make outgoing connection to SSL/TLS peers work.") introduced an "srv" field in the peers, which points to the equivalent server to hold SSL settings. This one is not set when the peer is local so we must always test it before testing p->srv->use_ssl otherwise haproxy dies during reloads. No backport is needed, this is purely 2.0.	2019-02-08 10:22:31 +01:00
Willy Tarreau	ff9c9140f4	MINOR: config: make MAX_PROCS configurable at build time For some embedded systems, it's pointless to have 32- or even 64- large arrays of processes when it's known that much fewer processes will be used in the worst case. Let's introduce this MAX_PROCS define which contains the highest number of processes allowed to run at once. It still defaults to LONGBITS but may be lowered.	2019-02-07 15:10:19 +01:00
Willy Tarreau	980855bd95	BUG/MEDIUM: server: initialize the orphaned conns lists and tasks at the end This also depends on the nbthread count, so it must only be performed after parsing the whole config file. As a side effect, this removes some code duplication between servers and server-templates. This must be backported to 1.9.	2019-02-07 15:08:13 +01:00
Willy Tarreau	2415727a00	MINOR: global: add proc_mask() and thread_mask() These two functions return either all_{proc,threads}_mask, or the argument. This is used to default to all_proc_mask or all_threads_mask when not set on bind_conf or proxies.	2019-02-04 05:09:15 +01:00
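The defaulting behaviour described in the entry above amounts to a one-line helper. The sketch below assumes that an unset mask is represented by zero; the `_sketch` names and the 4-thread value are hypothetical stand-ins for the real globals.

```c
/* Hypothetical stand-in for the global mask of all running threads;
 * 0xF means threads 0..3 in this sketch.
 */
static unsigned long all_threads_mask_sketch = 0xFUL;

/* Return <mask> if it is set, otherwise default to all threads.
 * Assumes "not set" is encoded as zero.
 */
static inline unsigned long thread_mask_sketch(unsigned long mask)
{
    return mask ? mask : all_threads_mask_sketch;
}
```

A bind_conf or proxy with no explicit thread binding then resolves to every thread, while an explicit mask such as 0x2 is returned unchanged.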
Willy Tarreau	a38a7175b1	MINOR: config: keep an all_proc_mask like we have all_threads_mask This simplifies some mask comparisons at various places where nbits(global.nbproc) was used.	2019-02-04 05:09:15 +01:00
Willy Tarreau	cafa56ecd6	MINOR: tools: improve the popcount() operation We'll call popcount() more often so better use a parallel method than an iterative one. One optimal design is proposed at the site below. It requires a fast multiplication though, but even without it will still be faster than the iterative one, and all relevant 64 bit platforms do have a multiply unit. https://graphics.stanford.edu/~seander/bithacks.html	2019-02-04 05:09:15 +01:00
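For reference, the parallel bit-counting method published on the page cited above can be written as follows for 64-bit values. This is a sketch of the well-known technique, not a copy of the HAProxy function; the name `popcount64_parallel` is hypothetical.

```c
#include <stdint.h>

/* Parallel population count for a 64-bit word: sums bits in groups of
 * 2, then 4, then 8, and uses one multiplication to add the eight byte
 * counts into the top byte.
 */
static unsigned int popcount64_parallel(uint64_t v)
{
    v = v - ((v >> 1) & 0x5555555555555555ULL);                           /* 2-bit sums */
    v = (v & 0x3333333333333333ULL) + ((v >> 2) & 0x3333333333333333ULL); /* 4-bit sums */
    v = (v + (v >> 4)) & 0x0F0F0F0F0F0F0F0FULL;                           /* 8-bit sums */
    return (unsigned int)((v * 0x0101010101010101ULL) >> 56);             /* total in top byte */
}
```

The final multiplication is the step that benefits from a fast multiply unit, as the commit message notes; it can be replaced by a few shift-and-add steps on platforms without one.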
Willy Tarreau	4ed84c96cf	OPTIM: listener: optimize cache-line packing for struct listener Some unused fields were placed early and some important ones were on the second cache line. Let's move the proto_list and name closer to the end of the structure to bring accept() and default_target() into the first cache line.	2019-02-04 05:09:14 +01:00
Willy Tarreau	da9e939f3c	CLEANUP: threads: fix misleading comment about all_threads_mask This variable changed a bit after 1.8, it's never zero anymore.	2019-02-02 17:48:39 +01:00
Olivier Houchard	dc21ff778b	MINOR: debug: Add an option that causes random allocation failures. When compiling with DEBUG_FAIL_ALLOC, add a new option, tune.fail-alloc, that gives the percentage of chances an allocation fails. This is useful to check that allocation failures are always handled gracefully.	2019-01-31 19:38:25 +01:00
Olivier Houchard	ff5dd74e25	MINOR: xref: Add missing barriers. Add a few missing barriers in the xref code, it's unlikely to be a problem for x86, but may be on architectures with weak memory ordering.	2019-01-31 19:38:25 +01:00
Willy Tarreau	00f18a36b6	BUG/MINOR: server: fix logic flaw in idle connection list management With variable connection limits, it's not possible to accurately determine whether the mux is still in use by comparing usage and max to be equal due to the fact that one determines the capacity and the other one takes care of the context. This can cause some connections to be dropped before they reach their stream ID limit. It seems it could also cause some connections to be terminated with streams still alive if the limit was reduced to match the newly computed avail_streams() value, though this cannot yet happen with existing muxes. Instead let's switch to usage reports and simply check whether connections are both unused and available before adding them to the idle list. This should be backported to 1.9.	2019-01-31 19:38:25 +01:00
Willy Tarreau	51d0a7e54c	MINOR: connstream: have a new flag CS_FL_KILL_CONN to kill a connection This is the equivalent of SI_FL_KILL_CONN but for the connstreams. It will be set by the stream-interface during the various shutdown operations.	2019-01-31 19:38:25 +01:00
Willy Tarreau	0f9cd7b196	MINOR: stream-int: add a new flag to mention that we want the connection to be killed The new flag SI_FL_KILL_CONN is now set by the rare actions which deliberately want the whole connection (and not just the stream) to be killed. This is only used for "tcp-request content reject", "tcp-response content reject", "tcp-response content close" and "http-request reject". The purpose is to disambiguate the close from a regular shutdown. This will be used by the next patches.	2019-01-31 19:38:25 +01:00
Olivier Houchard	8788b4111c	BUG/MEDIUM: connections: Don't forget to remove CO_FL_SESS_IDLE. If we're adding a connection to the server orphan idle list, don't forget to remove the CO_FL_SESS_IDLE flag, or we will assume later it's still attached to a session. This should be backported to 1.9.	2019-01-31 19:38:25 +01:00
Willy Tarreau	e5fcfbed5c	MINOR: htx: never check for null htx pointer in htx_is_{,not_}empty() The previous patch clarifies the fact that the htx pointer is never null along all the code. This test for a null will never match, didn't catch the pointer 1 before the fix for b_is_null(), but it confuses the compiler letting it think that any dereferences made to this pointer after this test could actually mean we're dereferencing a null. Let's now drop this test. This saves us from having to add impossible tests everywhere to avoid the warning. This should be backported to 1.9 if the b_is_null() patch is backported.	2019-01-31 08:07:17 +01:00
Willy Tarreau	245d189cce	DOC: htx: make it clear that htxbuf() and htx_from_buf() always return valid pointers Update the comments above htxbuf() and htx_from_buf() to make it clear that they always return valid htx pointers so that callers know they do not have to test them. This is only true after the fix on b_is_null() which was the only known corner case. This should be backported to 1.9 if the b_is_null() patch is backported.	2019-01-31 08:07:17 +01:00
Olivier Houchard	203d735cac	BUG/MEDIUM: buffer: Make sure b_is_null handles buffers waiting for allocation. In b_is_null(), make sure we return 1 if the buffer is waiting for its allocation, as users assume there's memory allocated if b_is_null() returns 0. The indirect impact of not having this was that htxbuf() would not match b_is_null() for a buffer waiting for an allocation, and would thus return the value 1 for the htx pointer, causing various crashes under low memory condition. Note that this patch makes gcc versions 6 and above report two null-deref warnings in proto_htx.c since htx_is_empty() continues to check for a null pointer without knowing that this is protected by the test on b_is_null(). This is addressed by the following patches. This should be backported to 1.9.	2019-01-31 08:07:17 +01:00
Willy Tarreau	9c84d8299a	MINOR: h2: add a generic frame checker The new function h2_frame_check() checks the protocol limits for the received frame (length, ID, direction) and returns a verdict made of a connection error code. The purpose is to be able to validate any frame regardless of the state and the ability to call the frame handler, and to emit a GOAWAY early in this case.	2019-01-30 19:37:20 +01:00
Willy Tarreau	13afcb7ab3	BUG/MINOR: task: fix possibly missed event in inter-thread wakeups There's a very small but existing uncertainty window when waking another thread up where it is possible for task_wakeup() not to wake the other task up because it's still running while this one is in the process of finishing and loses its TASK_RUNNING flag. In this case the wakeup will be missed. The problem is that we have a single flag to store 3 states, since the transition from running to sleeping isn't atomic. Thus we need to have another flag to cover this part. This patch introduces TASK_QUEUED to mention that the task is already in the run queue, running or not. This bit will be removed while TASK_RUNNING is kept once dequeued, and will be used when removing TASK_RUNNING to check if the task has been requeued. It might be possible to slightly improve this but the occurrence rate is quite low and we don't really need to complexify the scheduler to optimize for a rare case. The impact with the current code is very low since we have few inter-thread wakeups. Most of them are caused by checks killing sessions. This must be backported to 1.9.	2019-01-28 15:03:04 +01:00
Willy Tarreau	f5809cde7a	MINOR: threads: make MAX_THREADS configurable at build time There's some value in being able to limit MAX_THREADS, either to save precious resources in embedded environments, or to protect certain deployments against accidentally incorrect settings. With this patch, if MAX_THREADS is defined at build time, it will be used. However, given that LONGBITS is not a macro but is defined according to sizeof(long), we can't check the value range at build time and instead we need to perform the check at early boot time. However, the compiler is able to optimize away the constant comparisons and doesn't even emit the check code when values are correct. The output message regarding threading support was improved to report the number of threads.	2019-01-26 13:37:48 +01:00
Willy Tarreau	c9a82e48bf	MINOR: cfgparse: make the process/thread parser support a maximum value It was hard-wired to LONGBITS, let's make it configurable depending on the context (threads, processes).	2019-01-26 13:25:14 +01:00
Willy Tarreau	4790f7c907	MEDIUM: h2: always parse and deduplicate the content-length header The header used to be parsed only in HTX but not in legacy. And even in HTX mode, the value was dropped. Let's always parse it and report the parsed value back so that we'll be able to store it in the streams.	2019-01-24 19:07:26 +01:00
Willy Tarreau	bf66bd1b8b	MEDIUM: stream-int: always mark pending outgoing SI_ST_CON Before the first send() attempt, we should be in SI_ST_CON, not SI_ST_EST, since we have not yet attempted to send and we are allowed to retry. This is particularly important with complex outgoing muxes which can fail during the first send attempt (e.g. failed stream ID allocation). It only requires that sess_update_st_con_tcp() knows about this possibility, as we must not forcefully close a reused connection when facing an error in this case, this will be handled later. This may be backported to 1.9 with care after some observation period.	2019-01-24 19:06:43 +01:00
Willy Tarreau	9c538e01c2	MINOR: server: add a max-reuse parameter Some servers may wish to limit the total number of requests they execute over a connection because some of their components might leak resources. In HTTP/1 it was easy, they just had to emit a "connection: close" header field with the last response. In HTTP/2, it's less easy because the info is not always shared with the component dealing with the H2 protocol and it could be harder to advertise a GOAWAY with a stream limit. This patch provides a solution to this by adding a new "max-reuse" parameter to the server keyword. This parameter indicates how many times an idle connection may be reused for new requests. The information is made available and the underlying muxes will be able to use it at will. This patch should be backported to 1.9.	2019-01-24 19:06:43 +01:00
Willy Tarreau	1e7d444eec	BUG/MINOR: hpack: return a compression error on invalid table size updates RFC7541#6.3 mandates that an error is reported when a dynamic table size update announces a size larger than the one configured with settings. This is tested by h2spec using test "hpack/6.3/1". This must be backported to 1.9 and possibly 1.8 as well.	2019-01-24 15:27:06 +01:00
Willy Tarreau	71c3811589	MINOR: h2: declare new sets of frame types This patch adds H2_FT_HDR_MASK to group all frame types carrying headers information, and H2_FT_LATE_MASK to group frame types allowed to arrive after a stream was closed.	2019-01-24 15:27:06 +01:00
Frédéric Lécaille	355b2033ec	MINOR: cfgparse: SSL/TLS binding in "peers" sections. Make the "bind" keyword supported in "peers" sections. All "bind" settings are supported on this line. Add "default-bind" option to parse the binding options except the bind address. Do not parse anymore the bind address for local peers on "server" lines. Do not use anymore list_for_each_entry() to set the "peers" section listener parameters because there is only one listener per "peers" section. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Frédéric Lécaille	1055e687a2	MINOR: peers: Make outgoing connection to SSL/TLS peers work. This patch adds a pointer to a struct server to the peer structure, which is initialized after having parsed a remote "peer" line. After having parsed all peers sections we run ->prepare_srv to initialize all SSL/TLS stuff of the remote peer (or server). Remaining thing to do to completely support peer protocol over SSL/TLS: make "bind" keyword be supported in "peers" sections to make SSL/TLS incoming connections to local peers work. May be backported to 1.5 and newer.	2019-01-18 14:26:21 +01:00
Tim Duesterhus	8b87c01c4d	BUG/MINOR: stick_table: Prevent conn_cur from underflowing When using the peers feature a race condition could prevent a connection from being properly counted. When this connection exits it is being "uncounted" nonetheless, leading to a possible underflow (-1) of the conn_curr stick table entry in the following scenario : - Connect to peer A (A=1, B=0) - Peer A sends 1 to B (A=1, B=1) - Kill connection to A (A=0, B=1) - Connect to peer B (A=0, B=2) - Peer A sends 0 to B (A=0, B=0) - Peer B sends 0/2 to A (A=?, B=0) - Kill connection to B (A=?, B=-1) - Peer B sends -1 to A (A=-1, B=-1) This fix may be backported to all supported branches.	2019-01-15 15:34:49 +01:00
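One way to picture the kind of guard that prevents the underflow described above is a decrement that refuses to go below zero. The sketch below is an illustration only: the function name and the use of a plain `unsigned int` counter are assumptions, and it does not reproduce the actual stick-table code or its locking.

```c
/* Hypothetical sketch: decrement a "current connections" counter only
 * when it is still positive, so that an entry overwritten with 0 by a
 * peer update cannot wrap around when the local connection closes.
 */
static void conn_cur_dec_sketch(unsigned int *conn_cur)
{
    if (*conn_cur > 0)
        (*conn_cur)--;
}
```

In the scenario listed in the commit message, the final "Kill connection to B" step would then leave B at 0 instead of -1.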
Willy Tarreau	0cac26cd88	MEDIUM: backend: move all LB algo parameters into an union Since all of them are exclusive, let's move them to an union instead of eating memory with the sum of all of them. We're using a transparent union to limit the code changes. Doing so reduces the struct lbprm from 392 bytes to 372, and thanks to these changes, the struct proxy is now down to 6480 bytes vs 6624 before the changes (144 bytes saved per proxy).	2019-01-14 19:33:17 +01:00
Willy Tarreau	76e84f5091	MINOR: backend: move hash_balance_factor out of chash This one is a proxy option which can be inherited from defaults even if the LB algo changes. Move it out of the lb_chash struct so that we don't need to keep anything separate between these structs. This will allow us to merge them into an union later. It even takes less room now as it fills a hole and removes another one.	2019-01-14 19:33:17 +01:00
Willy Tarreau	a9a7249966	MINOR: backend: remap the balance uri settings to lbprm.arg_opt{1,2,3} The algo-specific settings move from the proxy to the LB algo this way : - uri_whole => arg_opt1 - uri_len_limit => arg_opt2 - uri_dirs_depth1 => arg_opt3	2019-01-14 19:33:17 +01:00
Willy Tarreau	9fed8586b5	MINOR: backend: make the header hash use arg_opt1 for use_domain_only This is only a boolean extra arg. Let's map it to arg_opt1 and remove hh_match_domain from struct proxy.	2019-01-14 19:33:17 +01:00
Willy Tarreau	20e68378f1	MINOR: backend: add new fields in lbprm to store more LB options Some algorithms require a few extra options (up to 3). Let's provide some room in lbprm to store them, and make sure they're passed from defaults to backends.	2019-01-14 19:33:17 +01:00
Willy Tarreau	484ff07691	MINOR: backend: make headers and RDP cookie also use arg_str/len These ones used to rely on separate variables called hh_name/hh_len but they are exclusive with the former. Let's use the same variable which becomes a generic argument name and length for the LB algorithm.	2019-01-14 19:33:17 +01:00
Willy Tarreau	4c03d1c9b6	MINOR: backend: move url_param_name/len to lbprm.arg_str/len This one is exclusively used by LB parameters, when using URL param hashing. Let's move it to the lbprm struct under a more generic name.	2019-01-14 19:33:17 +01:00
Emeric Brun	9e7547740c	MINOR: ssl: add support of aes256 bits ticket keys on file and cli. Openssl switched from aes128 to aes256 since may 2016 to compute tls ticket secrets used by default. But Haproxy still handled only 128 bits keys for both tls key file and CLI. This patch permits the user to set aes256 keys through CLI or the key file (80 bytes encoded in base64) in the same way that aes128 keys were handled (48 bytes encoded in base64): - first 16 bytes for the key name - next 16/32 bytes for aes 128/256 key bits key - last 16/32 bytes for hmac 128/256 bits Both sizes are now supported (but keys from same file must be of the same size and can be updated via CLI only using a key of the same size). Note: This feature needs the fix "dec func ignores padding for output size checking."	2019-01-14 19:32:58 +01:00
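The 48-byte and 80-byte figures in 9e7547740c follow directly from the layout listed there (key name, AES key, HMAC key). A small sketch of that arithmetic, where `ticket_key_entry_len` and `TICKET_KEY_NAME_LEN` are hypothetical names chosen for illustration:

```c
#include <stddef.h>

#define TICKET_KEY_NAME_LEN 16  /* key name: 16 bytes for both sizes */

/* Hypothetical helper: raw (pre-base64) size of one ticket key entry
 * for an AES key size given in bits. Returns 0 for unsupported sizes. */
size_t ticket_key_entry_len(int aes_bits)
{
    size_t part;

    if (aes_bits != 128 && aes_bits != 256)
        return 0;

    part = (size_t)aes_bits / 8;  /* 16 or 32 bytes */

    /* name + AES key + HMAC key */
    return TICKET_KEY_NAME_LEN + part + part;
}
```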
Olivier Houchard	c98aa1f182	MINOR: checks: Store the proxy in checks. Instead of assuming we have a server, store the proxy directly in struct check, and use it instead of s->server. This should be a no-op for now, but will be useful later when we change mail checks to avoid having a server. This should be backported to 1.9.	2019-01-14 11:15:11 +01:00
Willy Tarreau	762475e1f9	BUG/MEDIUM: connection: properly unregister the mux on failed initialization When mux->init() fails, session_free() will call it again to unregister it while it was already done, resulting in null derefs or use-after-free. This typically happens on out-of-memory conditions during H1 or H2 connection or stream allocation. This fix must be backported to 1.9.	2019-01-10 19:47:43 +01:00
Christopher Faulet	f7ed195ac8	MINOR: channel/htx: Add the HTX version of channel_truncate/erase The function channel_htx_truncate() can now be used on HTX buffer to truncate all incoming data, keeping outgoing one intact. This function relies on the function channel_htx_erase() and htx_truncate(). This patch may be backported to 1.9. If so, the patch "MINOR: channel/htx: Add the HTX version of channel_truncate()" must also be backported.	2019-01-08 12:06:55 +01:00
Christopher Faulet	00cf697215	MINOR: htx: Add a function to truncate all blocks after a specific offset This function will be used to truncate all incoming data in a channel, keeping outgoing ones. This may be backported to 1.9.	2019-01-08 12:06:55 +01:00
Christopher Faulet	5811db0043	MINOR: channel/htx: Add HTX version for some helper functions HTX versions for functions to test the free space in input against the reserve have been added. Now, on HTX streams, following functions can be used: * channel_htx_may_recv * channel_htx_recv_limit * channel_htx_recv_max * channel_htx_full This patch must be backported in 1.9 because it will be used by a further patch to fix a bug.	2019-01-07 16:32:05 +01:00
Christopher Faulet	8564c1f04b	MINOR: htx: Add a helper function to get the max space usable for a block This patch must be backported in 1.9 because it will be used by a further patch to fix a bug.	2019-01-07 16:32:02 +01:00
Willy Tarreau	909b9d852b	BUILD: add a new file "version.c" to carry version updates While testing fixes, it's sometimes confusing to rebuild only one C file (e.g. a mux) and not to have the correct commit ID reported in "haproxy -v" nor on the stats page. This patch adds a new "version.c" file which is always rebuilt. It's very small and contains only 3 variables derived from the various version strings. These variables are used instead of the macros at the few places showing the version. This way the output version of the running code is always correct for the parts that were rebuilt.	2019-01-04 18:20:32 +01:00
Olivier Houchard	f1b11e2d16	MINOR: connections: Remove a stale comment. Remove the comment that pretends 0x40000000 is unused, it's not true anymore.	2019-01-04 17:26:47 +01:00
Willy Tarreau	0f8fb6b7f9	MINOR: h1: make the H1 headers block parser able to parse headers only Currently the H1 headers parser works for either a request or a response because it starts from the start line. It is also able to resume its processing when it was interrupted, but in this case it doesn't update the list. Make it support a new flag, H1_MF_HDRS_ONLY so that the caller can indicate it's only interested in the headers list and not the start line. This will be convenient to parse H1 trailers.	2019-01-04 10:48:03 +01:00
Willy Tarreau	1e1f27c5c1	MINOR: h2: add h2_make_htx_trailers to turn H2 headers to HTX trailers This function is usable to transform a list of H2 header fields to a HTX trailers block. It takes care of rejecting forbidden headers and pseudo-headers when performing the conversion. It also emits the trailing CRLF that is currently needed in the HTX trailers block.	2019-01-03 18:45:38 +01:00
Willy Tarreau	52610e905d	MINOR: htx: add a new function to add a block without filling it htx_add_blk_type_size() creates a block of a specified type and size and returns it. The caller can then fill it.	2019-01-03 18:45:38 +01:00
Willy Tarreau	9d953e7572	MINOR: h2: add h2_make_h1_trailers to turn H2 headers to H1 trailers This function is usable to transform a list of H2 header fields to a H1 trailers block. It takes care of rejecting forbidden headers and pseudo-headers when performing the conversion.	2019-01-03 18:45:38 +01:00
Willy Tarreau	59884a646c	MINOR: lb: allow redispatch when using consistent hash Redispatch traditionally only worked for cookie based persistence. Adding redispatch support for consistent hash based persistence - also update docs. Reported by Oskar Stenman on discourse: https://discourse.haproxy.org/t/balance-uri-consistent-hashing-redispatch-3-not-redispatching/3344 Should be backported to 1.8. Cc: Lukas Tribus <lukas@ltri.eu>	2019-01-02 20:22:17 +01:00
Christopher Faulet	e64582929f	MINOR: channel: Add the function channel_add_input This function must be called when new incoming data are pushed in the channel's buffer. It updates the channel state and takes care of the fast forwarding by consuming the right amount of data and decrementing "->to_forward" accordingly when necessary. In fact, this patch just moves a part of ci_putblk in a dedicated function. This patch must be backported to 1.9.	2019-01-02 20:12:44 +01:00
Olivier Houchard	a2dbeb22fc	MEDIUM: sessions: Keep track of which connections are idle. Instead of keeping track of the number of connections we're responsible for, keep track of the number of connections we're responsible for that we are currently considering idling (ie that we are not using, they may be in use by other sessions), that way we can actually reuse connections when we have more connections than the max configured.	2018-12-28 19:16:03 +01:00
Olivier Houchard	351411facd	BUG/MAJOR: sessions: Use an unlimited number of servers for the conn list. When a session adds a connection to its connection list, we used to remove connections for another server if there was not enough room for our server. This can't work, because those lists are now the list of connections we're responsible for, not just the idle connections. To fix this, allow for an unlimited number of servers: instead of using an array, we're now using a linked list.	2018-12-28 16:33:13 +01:00
Olivier Houchard	09e498f1a1	BUG/MEDIUM: tasks: Decrement tasks_run_queue in tasklet_free(). If the tasklet is in the list, don't forget to decrement tasks_run_queue in tasklet_free(). This should be backported to 1.9.	2018-12-24 14:04:55 +01:00
Willy Tarreau	f48919aafb	MINOR: buffers: add a new b_move() function This function will be used to move parts of a buffer to another place in the same buffer, even if the parts overlap. In order to keep things under reasonable control, it only uses a length and absolute offsets for the source and destination, and doesn't consider head nor data.	2018-12-24 11:45:00 +01:00
Willy Tarreau	deab244dc1	MINOR: h2: add a bit-based frame type representation This will ease checks among sets of frames.	2018-12-24 11:45:00 +01:00
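A bit-based representation as in deab244dc1 lets a single mask test replace a chain of comparisons. The sketch below uses the frame type numbers from RFC 7540; the `FT_*` names, `FT_BIT` and `ft_in_set` are hypothetical and are not HAProxy's identifiers:

```c
/* HTTP/2 frame type numbers as assigned by RFC 7540 (names are illustrative). */
enum ft {
    FT_DATA = 0, FT_HEADERS = 1, FT_PRIORITY = 2, FT_RST_STREAM = 3,
    FT_SETTINGS = 4, FT_PUSH_PROMISE = 5, FT_PING = 6, FT_GOAWAY = 7,
    FT_WINDOW_UPDATE = 8, FT_CONTINUATION = 9
};

/* One bit per frame type, so that sets of types can be OR-ed together. */
#define FT_BIT(t) (1u << (t))

/* Returns 1 if frame type `type` belongs to the bit set `set`, else 0. */
int ft_in_set(unsigned int type, unsigned int set)
{
    return type < 32 && (FT_BIT(type) & set) != 0;
}
```

For example, "is this a HEADERS or CONTINUATION frame" becomes `ft_in_set(type, FT_BIT(FT_HEADERS) | FT_BIT(FT_CONTINUATION))`.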
Willy Tarreau	fba74ea7b0	[RELEASE] Released version 2.0-dev0 Released version 2.0-dev0 with the following main changes : - BUG/MAJOR: connections: Close the connection before freeing it. - REGTEST: Require the option LUA to run lua tests - REGTEST: script: Process script arguments before everything else - REGTEST: script: Evaluate the varnishtest command to allow quoted parameters - REGTEST: script: Add the option --clean to remove previous log directories - REGTEST: script: Add the option --debug to show logs on standard output - REGTEST: script: Add the option --keep-logs to keep all log directories - REGTEST: script: Add the option --use-htx to enable the HTX in regtests - REGTEST: script: Print only errors in the results report - REGTEST: Add option to use HTX prefixed by the macro 'no-htx' - REGTEST: Make reg-tests target support argument. - REGTEST: Fix a typo about barrier type. - REGTEST: Be less Linux specific with a syslog regex. - REGTEST: Missing enclosing quotes for ${tmpdir} macro. - REGTEST: Exclude freebsd target for some reg tests. - BUG/MEDIUM: h2: Don't forget to quit the sending_list if SUB_CALL_UNSUBSCRIBE. - BUG/MEDIUM: mux-h2: Don't forget to quit the send list on error reports - BUG/MEDIUM: dns: Don't prevent reading the last byte of the payload in dns_validate_response() - BUG/MEDIUM: dns: overflowed dns name start position causing invalid dns error - BUG/MINOR: compression/htx: Don't compress responses with unknown body length - BUG/MINOR: compression/htx: Don't add the last block of data if it is empty - MEDIUM: mux_h1: Implement h1_show_fd. - REGTEST: script: Add support of alternatives in required options list - REGTEST: Add a basic test for the compression - BUG/MEDIUM: mux-h2: don't needlessly wake up the demux on short frames - REGTEST: A basic test for "http-buffer-request" - BUG/MEDIUM: server: Also copy "check-sni" for server templates. - MINOR: ssl: Add ssl_sock_set_alpn(). - MEDIUM: checks: Add check-alpn.	2018-12-22 11:20:35 +01:00
Olivier Houchard	921501443b	MEDIUM: checks: Add check-alpn. Add a way to configure the ALPN used by check, with a new "check-alpn" keyword. By default, the checks will use the server ALPN, but it may not be convenient, for instance because the server may use HTTP/2, while checks are unable to do HTTP/2 yet.	2018-12-21 19:54:16 +01:00
Olivier Houchard	ab28a320aa	MINOR: ssl: Add ssl_sock_set_alpn(). Add a new function, ssl_sock_set_alpn(), to be able to change the ALPN for a connection, instead of relying on the one defined in the SSL_CTX.	2018-12-21 19:53:30 +01:00
Olivier Houchard	8ab8a6eee5	BUG/MAJOR: connections: Close the connection before freeing it. In si_release_endpoint(), if the end point is a connection, because we don't know which mux to use it, make sure we close the connection before freeing it, or else, we'd have a fd left for polling, which would point to a now free'd connection. This should be backported to 1.9.	2018-12-20 06:03:14 +01:00
Willy Tarreau	e9f4301f0f	MINOR: connection: add cs_set_error() to set the error bits Depending on the CS_FL_EOS status, we either set CS_FL_ERR_PENDING or CS_FL_ERROR at various places. Let's have a generic function to do this.	2018-12-19 18:13:52 +01:00
Willy Tarreau	14bfe9af12	CLEANUP: stream-int: consistently call the si/stream_int functions As long-time changes have accumulated over time, the exported functions of the stream-interface were almost all prefixed "si_<something>" while most private ones (mostly callbacks) were called "stream_int_<something>". There were still a few confusing exceptions, which were addressed to follow this scheme : - stream_sock_read0(), only used internally, was renamed stream_int_read0() and made static - stream_int_notify() is only private and was made static - stream_int_{check_timeouts,report_error,retnclose,register_handler,update} were renamed si_<something>. Now it is clearer when checking one of these if it risks to be used outside or not.	2018-12-19 15:25:43 +01:00
Willy Tarreau	94031d30d7	MINOR: connection: remove an unwelcome dependency on struct stream There was a reference to struct stream in conn_free() for the case where we're freeing a connection that doesn't have a mux attached. For now we know it's always a stream, and we only need to do it to put a NULL in s->si[1].end. Let's do it better by storing the pointer to si[1].end in the context and specifying that this pointer is always nulled if the mux is null. This way it allows a connection to detach itself from wherever it's being used. Maybe we could even get rid of the condition on the mux.	2018-12-19 14:36:29 +01:00
Willy Tarreau	3d2ee55ebd	CLEANUP: connection: rename conn->mux_ctx to conn->ctx We most often store the mux context there but it can also be something else while setting up the connection. Better call it "ctx" and know that it's the owner's context than misleadingly call it mux_ctx and get caught doing suspicious tricks.	2018-12-19 14:13:07 +01:00
Willy Tarreau	4f6516d677	CLEANUP: connection: rename subscription events values and event field The SUB_CAN_SEND/SUB_CAN_RECV enum values have been confusing a few times, especially when checking them on reading. After some discussion, it appears that calling them SUB_RETRY_SEND/SUB_RETRY_RECV more accurately reflects their purpose since these events may only appear after a first attempt to perform the I/O operation has failed or was not completed. In addition the wait_reason field in struct wait_event which carries them makes one think that a single reason may happen at once while it is in fact a set of events. Since the struct is called wait_event it makes sense that this field is called "events" to indicate it's the list of events we're subscribed to. Last, the values for SUB_RETRY_RECV/SEND were swapped so that value 1 corresponds to recv and 2 to send, as is done almost everywhere else in the code and in the shutdown() call.	2018-12-19 14:09:21 +01:00
Willy Tarreau	beefaee4f5	MEDIUM: h2: properly check and deduplicate the content-length header in HTX When producing an HTX message, we can't rely on the next-level H1 parser to check and deduplicate the content-length header, so we have to do it while parsing a message. The algorithm is the exact same as used for H1 messages.	2018-12-19 13:08:08 +01:00
Willy Tarreau	d5e3c71208	MINOR: objtype: report a few missing types in names and base pointers Types DNS_SRVRQ and CS were not referenced in the type to string conversions, causing possibly misleading outputs in session dumps. Now instead of showing "NONE" for unknown invalid types names, we display "!INVAL!" to clear the confusion that may exist in case of memory corruption for example.	2018-12-18 16:31:10 +01:00
Olivier Houchard	71748cb91b	BUG/MEDIUM: connection: Add a new CS_FL_ERR_PENDING flag to conn_streams. Add a new flag to conn_streams, CS_FL_ERR_PENDING. This is to be set instead of CS_FL_ERR in case there's still more data to be read, so that we read all the data before closing.	2018-12-17 21:54:14 +01:00
Willy Tarreau	bce4d8a37d	MINOR: debug: make the ABORT_NOW macro use a volatile int Similar to previous commit, let's make the macro use a volatile when dereferencing NULL so that clang doesn't optimize it away.	2018-12-16 08:17:23 +01:00
Olivier Houchard	51e474136b	MINOR: pools: Cast to volatile int * instead of int *. When using DEBUG_MEMORY_POOLS, when we want to crash, instead of using *(int *)0 = 0, use *(volatile int *)0 = 0, or clang will just translate it to a nop, instead of dereferencing 0.	2018-12-16 08:15:16 +01:00
Olivier Houchard	a4d4fdfaa3	MEDIUM: sessions: Don't keep an infinite number of idling connections. In session, don't keep an infinite number of connections that can idle. Add a new frontend parameter, "max-session-srv-conns" to set a max number, with a default value of 5.	2018-12-15 23:50:10 +01:00
Olivier Houchard	f502aca5c2	MEDIUM: mux: provide the session to the init() and attach() method. Instead of trying to get the session from the connection, which is not always there, and of course there could be multiple sessions per connection, provide it with the init() and attach() methods, so that we know the session for each outgoing stream.	2018-12-15 23:50:09 +01:00
Olivier Houchard	b7b3faa79c	MEDIUM: servers: Replace idle-timeout with pool-purge-delay. Instead of the old "idle-timeout" mechanism, add a new option, "pool-purge-delay", that sets the delay before purging idle connections. Each time the delay happens, we destroy half of the idle connections.	2018-12-15 23:50:09 +01:00
Olivier Houchard	006e3101f9	MEDIUM: servers: Add a command to limit the number of idling connections. Add a new command, "pool-max-conn" that sets the maximum number of connections waiting in the orphan idling connections list (as activated with idle-timeout). Using "-1" means unlimited. Using pools is now dependent on this.	2018-12-15 23:50:08 +01:00
William Lallemand	a57b7e33ef	MINOR: cli: implements 'reload' on master CLI The reload command reloads the haproxy master like it is done with a kill -USR2 on the master process.	2018-12-15 13:33:49 +01:00
Christopher Faulet	f0216dae0c	MINOR: payload/htx: Adapt smp_fetch_len to be HTX aware	2018-12-14 16:03:34 +01:00
Willy Tarreau	a1214a501f	MINOR: cache: report the number of cache lookups and cache hits The cache lookups and hits is now accounted per frontend and per backend, and reported on the stats page.	2018-12-14 14:00:25 +01:00
Willy Tarreau	59caa3b872	MINOR: tools: increase the number of ITOA strings to 16 It's currently 10 and is too little to extend some tooltips on the stats page.	2018-12-14 13:59:42 +01:00
Willy Tarreau	f157384803	MINOR: backend: count the number of connect and reuse per server and per backend Sadly we didn't have the cumulated number of connections established to servers till now, so let's now update it per backend and per-server and report it in the stats. On the stats page it appears in the tooltip when hovering over the total sessions count field.	2018-12-14 11:35:36 +01:00
Olivier Houchard	9a86fcbd47	MEDIUM: mux: Add an optional "reset" method. Add a new method to mux, "reset", that is used to let the mux know the connection attempt failed, and we're about to retry, so it just has to reinit itself. Currently only the H1 mux needs it.	2018-12-13 17:32:15 +01:00
William Lallemand	b7ea141cbb	MEDIUM: cli: handle CLI level from the master CLI Handle the CLI level in the master CLI. In order to do this, the master CLI stores the level in the stream. Each command is prefixed by a "user" or "operator" command before it is forwarded to the target CLI. The level can be configured in the haproxy program arguments with the level keyword: -S /tmp/sock,level,admin -S /tmp/sock2,level,user.	2018-12-13 09:45:16 +01:00
William Lallemand	dc12c2e56c	CLEANUP: cli: use dedicated define instead of appctx ones Replace APPCTX_CLI_ST1_PAYLOAD and APPCTX_CLI_ST1_PROMPT by PCLI_F_PAYLOAD and PCLI_F_PROMPT in the master CLI code.	2018-12-13 09:45:16 +01:00
William Lallemand	f630d01c9f	MEDIUM: cli: store CLI level in the appctx Store and check the level in the appctx in order to allow dynamic permission changes over the CLI.	2018-12-13 09:45:16 +01:00
Remi Gacogne	00488ddef5	BUG: dns: Fix off-by-one write in dns_validate_dns_response() The maximum number of bytes in a DNS name is indeed 255, but we need to allocate one more byte for the NULL-terminating byte. Otherwise dns_read_name() might return 255 for a very long name, causing dns_validate_dns_response() to write a NULL value one byte after the end of the buffer: dns_answer_record->name[len] = 0; The next fields in the struct being filled from the content of the query, it might have been possible to fill them with non-0 values, causing for example a strlen() of the name to read past the end of the struct and access unintended parts of the memory, possibly leading to a crash. To be backported to 1.8, probably also 1.7.	2018-12-12 14:44:52 +01:00
Remi Gacogne	bc552102ad	BUG: dns: Fix out-of-bounds read via signedness error in dns_validate_dns_response() Since the data_len field of the dns_answer_item struct was an int16_t, record length values larger than 2^15-1 were causing an integer overflow and thus may have been interpreted as negative, making us read well before the beginning of the buffer. This might have led to information disclosure or a crash. To be backported to 1.8, probably also 1.7.	2018-12-12 14:44:38 +01:00
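The signedness problem in bc552102ad can be illustrated by reading the 16-bit record length as an unsigned value and checking it against the remaining buffer space before use. This is only a sketch under assumed names: `read_u16_be` and `len_fits` are hypothetical helpers, not functions from HAProxy's DNS code:

```c
#include <stddef.h>
#include <stdint.h>

/* Read a big-endian 16-bit field, as used on the DNS wire, into an
 * unsigned type so that values above 2^15-1 stay positive. */
uint16_t read_u16_be(const unsigned char *p)
{
    return (uint16_t)(((unsigned int)p[0] << 8) | p[1]);
}

/* Returns 1 if `len` bytes starting at offset `pos` fit inside a buffer
 * of `bufsize` bytes, 0 otherwise. Written to avoid any overflow. */
int len_fits(size_t pos, uint16_t len, size_t bufsize)
{
    return pos <= bufsize && len <= bufsize - pos;
}
```

A length of 0x8000 stays 32768 here, and the bounds check rejects it for a small buffer instead of letting it be treated as a negative offset.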
Willy Tarreau	0007d0afbc	CLEANUP: stream: remove SF_TUNNEL, SF_INITIALIZED, SF_CONN_TAR These flags haven't been used for a while. SF_TUNNEL was reintroduced by commit `d62b98c6e` ("MINOR: stream: don't set backend's nor response analysers on SF_TUNNEL") to handle the two-level streams needed to deal with the first model for H2, and was not removed after this model was abandoned. SF_INITIALIZED was only set. SF_CONN_TAR was never referenced at all.	2018-12-11 18:01:38 +01:00
Willy Tarreau	afba57ae80	REORG: h1: merge types+proto into common/h1.h These two files are self-contained and do not depend on other layers, so let's remerge them together for easier manipulation.	2018-12-11 17:15:13 +01:00
Willy Tarreau	30925659ef	CLEANUP: h1: remove some occurrences of unneeded h1.h inclusions Several places where h1.h was included didn't need it at all since they in fact relied on the legacy HTTP definitions.	2018-12-11 17:15:13 +01:00
Willy Tarreau	326e27ed08	REORG: h1: move the h1_state definition to proto_http This is the legacy HTTP/1 state, it's never used from within h1 users, let's move it to proto_http with the rest of the legacy code.	2018-12-11 17:15:13 +01:00
Willy Tarreau	538746ad38	REORG: h1: move legacy http functions to http_msg.c Now that h1 and legacy HTTP are two distinct things, there's no need to keep the legacy HTTP parsers in h1.c since they're only used by the legacy code in proto_http.c, and h1.h doesn't need to include hdr_idx anymore. This concerns the following functions : - http_parse_reqline(); - http_parse_stsline(); - http_msg_analyzer(); - http_forward_trailers(); All of these were moved to http_msg.c.	2018-12-11 17:15:13 +01:00
Willy Tarreau	c5a4fd5c30	REORG: http: create http_msg.c to place there some legacy HTTP parts Lots of HTTP code still uses struct http_msg. Not only this code is still huge, but it's part of the legacy interface. Let's move most of these functions to a separate file http_msg.c to make it more visible which file relies on what. It's mostly symmetrical with what is present in http_htx.c. The function http_transform_header_str() which used to rely on two function pointers to look up a header was simplified to rely on two variants http_legacy_replace_{,full_}header(), making both sides of the function much simpler. No code was changed beyond these moves.	2018-12-11 17:15:13 +01:00
Willy Tarreau	b96b77ed6e	REORG: htx: merge types+proto into common/htx.h All the HTX definition is self-contained and doesn't really depend on anything external since it's mostly a protocol. In addition, some external similar files (like h2) also placed in common used to rely on it, making it a bit awkward. This patch moves the two htx.h files into a single self-contained one. The historical dependency on sample.h could be also removed since it used to be there only for http_meth_t which is now in http.h.	2018-12-11 17:15:04 +01:00
Christopher Faulet	f4a4ef7d7c	MINOR: filters: Export the name of known filters It could be useful to know if some filter is declared on a proxy or if it is enabled on a stream.	2018-12-11 17:09:31 +01:00
Christopher Faulet	54a8d5a4a0	MEDIUM: cache/htx: Add the HTX support into the cache The cache is now able to store and resend HTX messages. When an HTX message is stored in the cache, the headers are prefixed with their block's info (an uint32_t), containing its type and its length. Data, on their side, are stored without any prefix. Only the value is copied in the cache. 2 fields have been added in the structure cache_entry, hdrs_len and data_len, to know the size, in the cache, of the headers part and the data part. If the message is chunked, the trailers are also copied, the same way as data. When the HTX message is recreated in the cache applet, the trailers size is known by removing the headers length and the data length from the total object length.	2018-12-11 17:09:31 +01:00
Christopher Faulet	c9df7f728f	MINOR: compression: Rename the function check_legacy_http_comp_flt() To not mix it up with the legacy HTTP representation, this function has been renamed check_implicit_http_comp_flt().	2018-12-11 17:09:31 +01:00
William Lallemand	459e18e9e7	MINOR: cli: use pcli_flags for prompt activation Instead of using a variable to activate the prompt, we just use a flag.	2018-12-11 17:05:40 +01:00
William Lallemand	ebf61804ef	MEDIUM: cli: handle payload in CLI proxy The CLI proxy was not handling payload. To do that, we needed to keep a connection active on a server and to transfer each new line over that connection until we receive an empty line. The CLI proxy handles the payload in the same way that the CLI does. Examples: $ echo -e "@1;add map #-1 <<\n$(cat data)\n" | socat /tmp/master-socket - $ socat /tmp/master-socket readline prompt master> @1 25130> add map #-1 << + test test + test2 test2 + test3 test3 + 25130>	2018-12-11 17:05:36 +01:00
William Lallemand	5b80fa2864	MINOR: cli: parse prompt command in the CLI proxy Handle the prompt command. Works the same way as the CLI.	2018-12-11 16:54:18 +01:00
Willy Tarreau	1a18b54142	REORG: connection: centralize the conn_set_{tos,mark,quickack} functions There were a number of ugly setsockopt() calls spread all over proto_http.c, proto_htx.c and hlua.c just to manipulate the front connection's TOS, mark or TCP quick-ack. These ones entirely relied on the connection, its existence, its control layer's presence, and its addresses. Worse, inet_set_tos() was placed in proto_http.c, exported and used from the two other ones, surrounded in #ifdefs. This patch moves this code to connection.h and makes the other ones rely on it without ifdefs.	2018-12-11 16:41:51 +01:00
Willy Tarreau	eaeeb68f23	MINOR: hpack: provide a function to encode an HTTP path The new function hpack_encode_path() supports encoding a path into the ":path" header. It knows about "/" and "/index.html" which use a single byte, and falls back to literal encoding for other ones, with a fast path for short paths < 127 bytes.	2018-12-11 09:07:02 +01:00
Willy Tarreau	820b391260	MINOR: hpack: provide a function to encode an HTTP scheme The new function hpack_encode_scheme() supports encoding a scheme into the ":scheme" header. It knows about "https" and "http" which use a single byte, and falls back to literal encoding for other ones.	2018-12-11 09:07:02 +01:00
Willy Tarreau	39c80ebff0	MINOR: hpack: provide a function to encode an HTTP method The new function hpack_encode_method() supports encoding a method. It knows about GET and POST which use a single byte, and falls back to literal encoding for other ones.	2018-12-11 09:07:02 +01:00
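The "single byte" mentioned in 39c80ebff0 is an HPACK indexed header field: the high bit set, followed by the static table index (RFC 7541 assigns index 2 to ":method: GET" and 3 to ":method: POST"). A minimal sketch follows; `hpack_method_byte` is a hypothetical helper and does not reproduce the signature of HAProxy's hpack_encode_method():

```c
#include <string.h>

/* Hypothetical helper: returns the one-byte indexed encoding of a method
 * found in the HPACK static table, or 0 when the method is not in the
 * table and the caller must fall back to literal encoding. */
unsigned char hpack_method_byte(const char *method)
{
    if (strcmp(method, "GET") == 0)
        return 0x80 | 2;   /* static table index 2: ":method: GET" */
    if (strcmp(method, "POST") == 0)
        return 0x80 | 3;   /* static table index 3: ":method: POST" */
    return 0;              /* not indexed: literal encoding needed */
}
```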
Willy Tarreau	8895367fb1	MINOR: hpack: provide new functions to encode the ":status" header This header exists with 7 different values, it's worth taking them into account for the encoding, hence these functions. One of them makes use of an integer only and computes the 3 output bytes in case of literal. The other one benefits from the knowledge of an existing string, which for example exists in the case of H1 to H2 encoding.	2018-12-11 09:07:02 +01:00
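The 7 ":status" values referred to in 8895367fb1 are the ones present in the HPACK static table of RFC 7541 (indexes 8 to 14). A sketch of the lookup, where `hpack_status_byte` is a hypothetical name and the literal fallback is left to the caller:

```c
/* Hypothetical helper: returns the one-byte indexed encoding for a status
 * code present in the HPACK static table, or 0 if a literal is needed. */
unsigned char hpack_status_byte(int status)
{
    switch (status) {
    case 200: return 0x80 | 8;
    case 204: return 0x80 | 9;
    case 206: return 0x80 | 10;
    case 304: return 0x80 | 11;
    case 400: return 0x80 | 12;
    case 404: return 0x80 | 13;
    case 500: return 0x80 | 14;
    default:  return 0;    /* e.g. 201: must be sent as a literal */
    }
}
```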
Willy Tarreau	bd5659bbe1	MINOR: hpack: provide a function to encode a long indexed header For long header values whose index is known, hpack_encode_long_idx() may now be used. This function emits the short index and follows with the header's value.	2018-12-11 09:07:01 +01:00
Willy Tarreau	30eb809fdb	MINOR: hpack: provide a function to encode a short indexed header Most direct calls to HPACK functions are made to encode short header fields like methods, schemes or statuses, whose lengths and indexes are known. Let's have a small function to do this.	2018-12-11 09:06:46 +01:00
Willy Tarreau	bad0a381d3	MINOR: hpack: move the length computation and encoding functions to .h We'll need these functions from other inline functions, let's make them accessible. len_to_bytes() was renamed to hpack_len_to_bytes() since it's now exposed.	2018-12-11 09:06:46 +01:00
Willy Tarreau	2df026fbce	CLEANUP: hpack: no need to include chunk.h, only include buf.h Chunk.h used to be needed to declare the struct chunk which we don't use anymore, let's fall back to the lighter buf.h	2018-12-11 09:06:06 +01:00
Willy Tarreau	071d4b31ff	MINOR: compiler: add a new macro ALREADY_CHECKED() This macro may be used to block constant propagation that lets the compiler detect a possible NULL dereference on a variable resulting from an explicit assignment in an impossible check. Sometimes a function is called which does safety checks and returns NULL if safe conditions are not met. The place where it's called cannot hit this condition and dereferencing the pointer without first checking it will make the compiler emit a warning about a "potential null pointer dereference" which is hard to work around. This macro "washes" the pointer and prevents the compiler from emitting tests branching to undefined instructions. It may only be used when the developer is absolutely certain that the conditions are guaranteed and that the pointer passed in argument cannot be NULL by design. A typical use case is a top-level function doing this : if (frame->type == HEADERS) parse_frame(frame); Then parse_frame() does this : void parse_frame(struct frame *frame) { const char *frame_hdr; frame_hdr = frame_hdr_start(frame); if (*frame_hdr == FRAME_HDR_BEGIN) process_frame(frame); } and : const char *frame_hdr_start(const struct frame *frame) { if (frame->type == HEADERS) return frame->data; else return NULL; } Above parse_frame() is only called for frame->type == HEADERS so it will never get a NULL in return from frame_hdr_start(). Thus it's always safe to dereference frame_hdr since the check was already performed above. It's then safe to address it this way instead of inventing dummy error code paths that may create real bugs : void parse_frame(struct frame *frame) { const char *frame_hdr; frame_hdr = frame_hdr_start(frame); ALREADY_CHECKED(frame_hdr); if (*frame_hdr == FRAME_HDR_BEGIN) process_frame(frame); }	2018-12-08 15:27:03 +01:00
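The example from 071d4b31ff can be laid out as compilable C. The macro body below is an assumption: it uses a GCC/Clang empty inline asm statement that claims to modify the pointer, which is one way to "wash" it, and may differ from the definition in the HAProxy tree. `struct frame`, `HEADERS` and `FRAME_HDR_BEGIN` are stand-in definitions, and parse_frame() returns an int here only so that the result can be observed:

```c
#include <stddef.h>

enum { HEADERS = 1 };          /* stand-in frame type */
#define FRAME_HDR_BEGIN 'H'    /* stand-in marker byte */

struct frame {
    int type;
    const char *data;
};

/* Assumed definition (GCC/Clang only): the empty asm pretends to rewrite
 * p, so the compiler can no longer prove that p may be NULL. */
#define ALREADY_CHECKED(p) do { __asm__ __volatile__("" : "+r"(p)); } while (0)

const char *frame_hdr_start(const struct frame *frame)
{
    if (frame->type == HEADERS)
        return frame->data;
    return NULL;
}

/* Must only be called for HEADERS frames, as in the commit message.
 * Returns 1 when the frame would be passed on for processing. */
int parse_frame(struct frame *frame)
{
    const char *frame_hdr = frame_hdr_start(frame);

    ALREADY_CHECKED(frame_hdr);
    return *frame_hdr == FRAME_HDR_BEGIN;
}
```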
Willy Tarreau	d6735d611e	MEDIUM: ist: use local conversion arrays for case conversion Calling tolower/toupper for each character is slow, a lookup into a 256-byte table is cheaper, especially for common characters used in header field names which all fit into a cache line. Let's create these two variables marked weak so that they're included only once.	2018-12-07 13:25:59 +01:00
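The table lookup described in d6735d611e can be sketched as follows. This version fills the table lazily at run time for brevity, whereas the commit describes tables shared as weak variables; `lc_table`, `to_lower_lut` and `lower_copy` are hypothetical names:

```c
#include <stddef.h>

static unsigned char lc_table[256];  /* byte -> lower-case byte */
static int lc_ready;                 /* set once the table is filled */

/* Fill the table once: identity everywhere except ASCII 'A'..'Z'. */
static void lc_init(void)
{
    for (int c = 0; c < 256; c++)
        lc_table[c] = (unsigned char)((c >= 'A' && c <= 'Z') ? c + ('a' - 'A') : c);
    lc_ready = 1;
}

/* Lower-case a single byte with one table lookup. */
unsigned char to_lower_lut(unsigned char c)
{
    if (!lc_ready)
        lc_init();
    return lc_table[c];
}

/* Copy len bytes from src to dst, lower-casing on the fly. */
void lower_copy(char *dst, const char *src, size_t len)
{
    for (size_t i = 0; i < len; i++)
        dst[i] = (char)to_lower_lut((unsigned char)src[i]);
}
```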
Willy Tarreau	3f2d696d72	MINOR: ist: add functions to copy/uppercase/lowercase into a buffer or string The ist functions were missing functions to copy an IST into a target buffer, making some code have to resort to memcpy(), which tends to be overkill for small strings, that the compiler cannot guess. In addition sometimes there is a need to turn a string to lower or upper case so it had to be overwritten after the operation. This patch adds 6 functions to copy an ist to a buffer, as binary or as a string (i.e. a zero is or is not appended), and optionally to apply a lower case or upper case transformation on the fly. A number of tests were performed to optimize the processing for small strings. The loops are marked unlikely to dissuade the compilers from over-optimizing them and switching to SIMD instructions. The lower case or upper case transformations used to rely on external functions for each character and to crappify the code due to clobbered registers, which is not acceptable when we know that only a certain class of chars has to be transformed, so the test was open-coded.	2018-12-07 13:25:59 +01:00
Olivier Houchard	d247be0620	BUG/MEDIUM: connections: Split CS_FL_RCV_MORE into 2 flags. CS_FL_RCV_MORE is used in two cases, to let the conn_stream know there may be more data available, and to let it know that it needs more room. We can't easily differentiate between the two, and that may lead to hangs, so split it into two flags, CS_FL_RCV_MORE, that means there may be more data, and CS_FL_WANT_ROOM, that means we need more room. This should not be backported.	2018-12-06 16:36:05 +01:00
Willy Tarreau	adc7f3edd2	BUG/MEDIUM: stream-int: don't attempt to receive if the connection is not established If we try to receive before the connection is established, we lose the send event and are not woken up anymore once the connection is established. This was diagnosed by Olivier. No backport is needed.	2018-12-06 15:25:58 +01:00
Willy Tarreau	a3b62d374a	MINOR: stream-int: add a new blocking condition on the remote connection There are some situations where we need to wait for the other side to be connected. None of the current blocking flags support this. It used to work more or less by accident using the old flags. Let's add a new flag to mention we're blocking on this, it's removed by si_chk_rcv() when a connection is established. It should be enough for now.	2018-12-06 15:24:01 +01:00
William Lallemand	27f3fa56f5	BUG/MEDIUM: mworker: stop every task in the master The master is not supposed to run (at the moment) any task before the polling loop, the created tasks should be run only in the workers but in the master they should be disabled or removed. No backport needed.	2018-12-06 14:12:58 +01:00
Christopher Faulet	aa75b3d2d5	CLEANUP: htx: Fix indentation here and there in HTX files	2018-12-05 17:33:14 +01:00
Christopher Faulet	b2aedea142	MEDIUM: channel/htx: Add functions to forward HTX data To ease the fast forwarding and the infinite forwarding on HTX proxies, 2 functions have been added to let the channel be almost aware of the way data are stored in its buffer. By calling these functions instead of legacy ones, we are sure to forward the right amount of data.	2018-12-05 17:29:30 +01:00
Christopher Faulet	27ba2dc6d6	MEDIUM: htx: Rework conversion from a buffer to an htx structure Now, the function htx_from_buf() will set the buffer's length to its size automatically. In return, the caller should call htx_to_buf() at the end to be sure to leave the buffer hosting the HTX message in the right state. The caller can also use the function htxbuf() to get the HTX message without any update on the underlying buffer.	2018-12-05 17:10:16 +01:00
Willy Tarreau	3906e22f6f	MINOR: htx: add buf_room_for_htx_data() to help optimize buffer transfers The small HTX overhead is enough to make the system perform multiple reads and unaligned memory copies. Here we provide a function whose purpose is to reduce the apparent room in a buffer by the size of the overhead for DATA blocks, which is the struct htx plus 2 blocks (one for DATA, one for the end of message so that small blocks can fit at once). The muxes using HTX will be encouraged to use this one instead of b_room() to compute the available buffer room and avoid filling their demux buf with more data than can fit at once into the HTX buffer.	2018-12-05 10:57:42 +01:00
Willy Tarreau	8ae4235f94	MINOR: htx: make htx_from_buf() adjust the size only on new buffers This one is used a lot during transfers, let's avoid resetting its size when there are already data in the buffer since it implies the size is correct.	2018-12-05 10:57:42 +01:00
Christopher Faulet	c59ff23804	MINOR: htx: Rename functions htx_*_to_str() to be H1 specific "_to_h1" suffix is now used because these functions produce H1 strings. It avoids any ambiguity on the output format.	2018-12-04 05:51:37 +01:00
Joseph Herlant	75a323f04e	CLEANUP: Fix a typo in the listener subsystem Fixes a typo in the code comment of the listener subsystem.	2018-12-02 18:43:28 +01:00
Joseph Herlant	f69b807fa4	CLEANUP: Fix typos in the file descriptor subsystem Fixes 2 typos in the code comment of the file descriptor subsystem.	2018-12-02 18:43:25 +01:00
Joseph Herlant	0b75e63dc5	CLEANUP: Fix a typo in the checks header file Fixes a typo in the code comments of the checks header file.	2018-12-02 18:43:21 +01:00
Joseph Herlant	eeac3c722f	CLEANUP: Fix a typo in the protocol header file Fixes a typo in the code comments of the header file holding the general protocol primitives.	2018-12-02 18:42:49 +01:00
Joseph Herlant	8a95a6e5ed	CLEANUP: Fix a typo in the connection subsystem Fixes a typo in the code comments of the connection subsystem.	2018-12-02 18:42:12 +01:00
Joseph Herlant	41abef77cb	CLEANUP: Fix a typo in the mini-clist header Fixes a typo in the code comments of the mini-clist header.	2018-12-02 18:38:15 +01:00
Joseph Herlant	30bc509c40	CLEANUP: Fix typos in the h1 subsystem Fixes typos in the code comments of the h1 subsystem.	2018-12-02 18:38:02 +01:00
Joseph Herlant	be7619aaca	CLEANUP: Fix typo in the chunk headers file Fix a typo detected in the chunk.h header file's code comments.	2018-12-02 18:37:56 +01:00
Joseph Herlant	c42c0e9969	CLEANUP: fix typos in the htx subsystem Fix typos detected in the code comments of the htx subsystem.	2018-12-02 18:37:50 +01:00
Olivier Houchard	0c18a6fe34	MEDIUM: servers: Add a way to keep idle connections alive. Add a new keyword for servers, "idle-timeout". If set, unused connections are kept alive until the timeout happens, and will be picked for reuse if no other connection is available.	2018-12-02 18:16:53 +01:00
Olivier Houchard	8defe4b51a	MINOR: mux: add a "max_streams" method. Add a new method to muxes, "max_streams", that returns the max number of streams the mux can handle. This will be used to know if a mux is in use or not.	2018-12-02 17:48:32 +01:00
Olivier Houchard	f3e65b086d	MINOR: connection: Fix a comment. Connections can now have an owner for outgoing connections, so update the comment to reflect that.	2018-12-02 17:48:28 +01:00
Willy Tarreau	1329b5be71	MINOR: h2: add new functions to produce an HTX message from an H2 response The new function h2_prepare_htx_stsline() produces an HTX response message from an H2 response presented as a list of header fields.	2018-12-02 13:30:17 +01:00
Willy Tarreau	3fbea1d8d0	MINOR: server: the mux_proto entry in the server is const Same as previous commit. We'll have to update this one soon, let's avoid any cast and mark it const as it really is.	2018-12-02 13:12:16 +01:00
Willy Tarreau	5fc311c001	MINOR: connection: create conn_get_best_mux_entry() We currently have conn_get_best_mux() to return the best mux for a given protocol name, side and proxy mode. But we need the mux entry as well in order to fix the bind_conf and servers at the end of the config parsing. Let's split the function in two parts. It's worth noting that the <conn> argument is never used anymore so this part is eligible to some cleanup.	2018-12-02 13:12:16 +01:00
Willy Tarreau	a004ae3e66	MINOR: listener: the mux_proto entry in the bind_conf is const We'll have to update this one soon, let's avoid any cast and mark it const as it really is.	2018-12-02 13:12:15 +01:00
Willy Tarreau	6deb4129de	MINOR: h2: implement H2->HTX request header frame transcoding Till now we could only produce an HTTP/1 request from a list of H2 request headers. Now the new function h2_make_htx_request() does the same but using the HTX encoding instead, while respecting the H2 semantics. The code is not much different from the first version, only the encoding differs. For now it's not used.	2018-12-01 17:38:32 +01:00
Christopher Faulet	75bc913d23	MAJOR: filters: Adapt filters API to be compatible with the HTX representation First, to be called on HTX streams, a filter must explicitly be declared as compatible by setting the flag STRM_FLT_FL_HAS_FILTERS on the filter's config at HAProxy startup. This flag is checked when a filter implementation is attached to a stream. Then, some changes have been made on HTTP callbacks. The callback http_payload has been added to filter HTX data. It will be called on HTX streams only. It replaces the callbacks http_data, http_chunk_trailers and http_forward_data, called on legacy HTTP streams only and marked as deprecated. The documentation (once updated) will give all the information needed to implement this new callback. Other HTTP callbacks will be called for HTX and HTTP legacy streams. So it is the filter's responsibility to know which kind of data it handles. The macro IS_HTX_STRM should be used in such cases. There is at least one noticeable change in the way data are forwarded. In HTX, after the call to the callback http_headers, all the headers are considered as forwarded. So, in http_payload, only the body and possibly the trailers will be filtered.	2018-12-01 17:37:27 +01:00
Christopher Faulet	e44769b4fa	MINOR: mux-h1: Capture bad H1 messages First of all, a dedicated error snapshot, h1_snapshot, has been added. It contains more or less the same info as http_snapshot but adapted for H1 messages. Then, the function h1_capture_bad_message() has been added to capture bad H1 messages. And finally, the function h1_show_error_snapshot() is used to dump these errors. Only Headers or data parsing are captured.	2018-12-01 17:37:27 +01:00
Christopher Faulet	a7b677cd0d	MEDIUM: proto_htx: Convert all HTTP error messages into HTX During startup, after the configuration parsing, all HTTP error messages (errorloc, errorfile or default messages) are converted into HTX messages and stored in dedicated buffers. We use it to return errors in the HTX analyzers instead of using ugly OOB blocks.	2018-12-01 17:37:27 +01:00
Christopher Faulet	b2db4fa016	MINOR: htx: Add BODYLESS flags on the HTX start-line and the HTTP message The flags HTX_SL_F_BODYLESS and HTTP_MSGF_BODYLESS have been added. These flags are set when the corresponding HTTP message has no body at all.	2018-12-01 17:37:27 +01:00
Christopher Faulet	f1ba18d7b3	MEDIUM: htx: Don't rely on h1_sl anymore except during H1 header parsing Instead, we now use the htx_sl coming from the HTX message. It avoids having too much H1-specific code in version-agnostic parts. Of course, the concept of the start-line is highly influenced by the H1, but the structure htx_sl can be adapted, if necessary. And many things depend on a start-line during HTTP analysis. Using the structure htx_sl also avoids boring conversions between HTX version and H1 version.	2018-12-01 17:37:27 +01:00
Christopher Faulet	54483df5ba	MINOR: htx: Add the start-line offset for the HTX message in the HTX structure If there is no start-line, this offset is set to -1. Otherwise, it is the relative address where the start-line is stored in the data block. When the start-line is added, replaced or removed, this offset is updated accordingly. On remove, if the start-line is not set and if the next block is a start-line, the offset is updated. Finally, when an HTX structure is defragmented, the offset is also updated accordingly.	2018-12-01 17:37:27 +01:00
Christopher Faulet	570d1614fa	MEDIUM: htx: Change htx_sl to be a struct instead of a union The HTX start-line is now a struct. It will be easier to extend, if needed. Same info can be found, of course. In addition it is now possible to set flags on it. It will be used to set some info about the message. Some macros and functions have been added in proto/htx.h to help accessing different parts of the start-line.	2018-12-01 17:37:27 +01:00
Christopher Faulet	14e88252f2	MINOR: htx: Add a function to find the HTX block corresponding to a data offset The function htx_find_blk() returns the HTX block containing data with a given offset, relatively to the beginning of the HTX message. It is a good way to skip outgoing data and find the first HTX block not already processed.	2018-12-01 17:37:27 +01:00
Christopher Faulet	d16b0a7b2d	MINOR: htx: Add function to iterate on an HTX message using HTX blocks The functions htx_get_next() and htx_get_prev() are used to iterate on an HTX message using block positions. With htx_get_next_blk() and htx_get_prev_blk(), it is possible to do the same, but with HTX blocks. Of course, internally, we rely on the position-based versions to do so. But it is handy for callers not to have to take care of the block positions.	2018-12-01 17:37:27 +01:00
Christopher Faulet	24ed835129	MINOR: htx: Add function to add an HTX block just before another one The function htx_add_data_before() can be used to add an HTX block before another one. For instance, it could be used to add some data before the end-of-message marker.	2018-12-01 17:37:27 +01:00
Christopher Faulet	3bc1b11dae	MEDIUM: conn_stream: Add a way to get mux's info on a CS from the upper layer From time to time, the need arises to get some info owned by the multiplexer about a connection stream from the upper layer. Today we really need to get some dates and durations specific to the conn_stream. It is only true for the mux H1 and H2. Otherwise it will be impossible to have correct times reported in the logs. To do so, the structure cs_info has been defined to provide all info we ever need on a conn_stream from the upper layer. Of course, it is the first step. So this structure will certainly evolve. But for now, only the bare minimum is referenced. On the mux side, the callback get_cs_info() has been added in the structure mux_ops. Multiplexers can now implement it, if necessary, to return a pointer to a structure cs_info. And finally, the function si_get_cs_info() should be used from the upper layer. If the stream interface is not attached to a connection stream, this function returns NULL, likewise if the callback get_cs_info() is not defined for the corresponding mux.	2018-12-01 17:37:27 +01:00
Willy Tarreau	c01ed9ff20	MINOR: htx: add a function to cut the beginning of a DATA block htx_cut_data_blk() is used to cut the beginning of a DATA block after a part of it was transferred. It simply advances the address, reduces the advertised length and updates the htx's total data count.	2018-12-01 17:36:59 +01:00
Willy Tarreau	d3c49d17dc	BUG/MINOR: connection: report mux modes when HTX is supported It looks like we forgot to report HTX when listing the muxes and their respective protocols, leading to "NONE" being displayed. Let's report "HTX" and "HTTP\|HTX" since both will exist. Also fix a minor typo in the output message.	2018-12-01 17:33:35 +01:00
Olivier Houchard	00cf70f28b	MAJOR: sessions: Store multiple outgoing connections in the session. Instead of just storing the last connection in the session, store all of the connections, for at most MAX_SRV_LIST (currently 5) targets. That way we can do keepalive on more than 1 outgoing connection when the client uses HTTP/2.	2018-12-01 10:47:18 +01:00
William Lallemand	4b58c80ee2	REORG: mworker: declare master variable in global.h This variable is used at several places, better declare it in global.h.	2018-11-27 19:34:00 +01:00
Willy Tarreau	7f0165e399	MEDIUM: memory: make the pool cache an array and not a thread_local Having a thread_local for the pool cache is messy as we need to initialize all elements upon startup, but we can't until the threads are created, and once created it's too late. For this reason, the allocation code used to check for the pool's initialization, and it was the release code which used to detect the first call and to initialize the cache on the fly, which is not exactly optimal. Now that we have initcalls, let's turn this into a per-thread array. This array is initialized very early in the boot process (STG_PREPARE) so that pools are always safe to use. This allows to remove the tests from the alloc/free calls. Doing just this has removed 2.5 kB of code on all cumulated pool_alloc() and pool_free() paths.	2018-11-26 19:50:32 +01:00
Willy Tarreau	b6b3df3ed3	MEDIUM: initcall: use initcalls for a few initialization functions signal_init(), init_log(), init_stream(), and init_task() all used to only preset some values and lists. This needs to be done very early to provide a reliable interface to all other users. The calls used to be explicit in haproxy.c:init(). Now they're placed in initcalls at the STG_PREPARE stage. The functions are not exported anymore.	2018-11-26 19:50:32 +01:00
Willy Tarreau	2455cebe00	MEDIUM: memory: use pool_destroy_all() to destroy all pools on deinit() Instead of exporting a number of pools and having to manually delete them in deinit() or to have dedicated destructors to remove them, let's simply kill all pools on deinit(). For this a new function pool_destroy_all() was introduced. As its name implies, it destroys and frees all pools (provided they don't have any user anymore of course). This allowed to remove 4 implicit destructors, 2 explicit ones, and 11 individual calls to pool_destroy(). In addition it properly removes the mux_pt_ctx pool which was not cleared on exit (no backport needed here since it's 1.9 only). The sig_handler pool doesn't need to be exported anymore and became static now.	2018-11-26 19:50:32 +01:00
Willy Tarreau	8ceae72d44	MEDIUM: init: use initcall for all fixed size pool creations This commit replaces the explicit pool creation that are made in constructors with a pool registration. Not only this simplifies the pools declaration (it can be done on a single line after the head is declared), but it also removes references to pools from within constructors. The only remaining create_pool() calls are those performed in init functions after the config is parsed, so there is no more user of potentially uninitialized pool now. It has been the opportunity to remove no less than 12 constructors and 6 init functions.	2018-11-26 19:50:32 +01:00
Willy Tarreau	7107c8b494	MINOR: memory: add a callback function to create a pool The new function create_pool_callback() takes 3 args including the return pointer, and creates a pool with the specified name and size. In case of allocation error, it emits an error message and returns. The new macro REGISTER_POOL() registers a callback using this function and will be usable to request some pools creation and guarantee that the allocation will be checked. An even simpler approach is to use DECLARE_POOL() and DECLARE_STATIC_POOL() which declare and register the pool.	2018-11-26 19:50:32 +01:00
Willy Tarreau	e655251e80	MINOR: initcall: use initcalls for section parsers The two calls to cfg_register_section() and cfg_register_postparser() are now supported by initcalls. This allowed to remove two other constructors.	2018-11-26 19:50:32 +01:00
Willy Tarreau	172f5ce948	MINOR: initcall: use initcalls for most post_{check,deinit} and per_thread* Most calls to hap_register_post_check(), hap_register_post_deinit(), hap_register_per_thread_init(), hap_register_per_thread_deinit() can be done using initcalls and will not require a constructor anymore. Let's create a set of simplified macros for this, called respectively REGISTER_POST_CHECK, REGISTER_POST_DEINIT, REGISTER_PER_THREAD_INIT, and REGISTER_PER_THREAD_DEINIT. Some files were not modified because they wouldn't benefit from this or because they conditionally register (e.g. the pollers).	2018-11-26 19:50:32 +01:00
Willy Tarreau	8071338c78	MINOR: initcall: apply initcall to all register_build_opts() calls Most register_build_opts() calls use static strings. These ones were replaced with a trivial REGISTER_BUILD_OPTS() statement adding the string and its call to the STG_REGISTER section. A dedicated section could be made for this if needed, but there are very few such calls for this to be worth it. The calls made with computed strings however, like those which retrieve OpenSSL's version or zlib's version, were moved to a dedicated function to guarantee they are called late in the process. For example, the SSL call probably requires that SSL_library_init() has been called first.	2018-11-26 19:50:32 +01:00
Willy Tarreau	90fa97b65e	MINOR: threads: add new macros to declare self-initializing locks Using __decl_spinlock(), __decl_rwlock(), __decl_aligned_spinlock() and __decl_aligned_rwlock(), one can now simply declare a spinlock or an rwlock which will automatically be initialized at boot time by calling the ha_spin_init() or ha_rwlock_init() callback. The "aligned" variants enforce a 64-byte alignment on the lock.	2018-11-26 19:50:32 +01:00
Willy Tarreau	a8ae77da61	MINOR: thread: provide a set of lock initialisers This patch adds ha_spin_init() and ha_rwlock_init() which are used as a callback to initialise locks at boot time. They perform exactly the same as HA_SPIN_INIT() or HA_RWLOCK_INIT() but from within a real function.	2018-11-26 19:50:32 +01:00
Willy Tarreau	d13a9281bd	MINOR: initcall: introduce a way to register init functions to call at boot We currently have to deal with multiple initialization stages in a way that can be confusing, because certain parts rely on others having been properly initialized. Most calls consist in adding lists to existing lists, whose heads are initialized in the declaration so this is easy. But some calls create new pools and require pools to be properly initialized. Pools currently are thread-local and as such cannot be pre-initialized, requiring run-time checks. All this could be simplified by using multiple boot stages and allowing functions to be registered at various stages. One approach might be to use gcc's constructor priorities, but this requires gcc >= 4.3 which eliminates a wide spectrum of working compilers, and some versions of certain compilers (like clang 3.0) are known to silently ignore these priorities. Instead we can use our own init function registration mechanism. A first attempt was made using register_function() calls in all constructors but this made the code more painful. This patch's approach is different. It creates sections containing arrays of pointers to "initcall" descriptors. An initcall contains a pointer to a function and an argument. Each section corresponds to a specific initialization stage. Each module creates such descriptors for various calls it requires. The main() function starts by scanning each of these sections in turn to process these initcalls. This will make it possible to remove many constructors from various modules, by simply placing initcalls for the requested functions next to the keyword lists that need to be called. A first attempt was made by placing the initcalls directly into the sections instead of creating an array of pointers, but it becomes sensitive to the array's alignment which depends on the compiler and the linker, so it seems too fragile. 
For now we support 6 init stages : - STG_PREPARE : preset variables, tables and list heads - STG_LOCK : initialize spinlocks and rwlocks - STG_ALLOC : allocate the required structures - STG_POOL : create pools - STG_REGISTER : register static lists (keywords etc) - STG_INIT : subsystems normal initialization These ones are declared directly in the files where they are needed using one of the INITCALL* macros, passing 0 to 3 pointers as arguments. The API should possibly be extended to support a return value to give a status to the caller, and to support a unified API, possibly a bit more flexibility in the arguments. In this case it might make sense to support a set of macros to register functions having a different API and to pass the function type in the initcall itself. Special thanks to Olivier for showing how to scan sections as this is not something particularly well documented and exactly what I've been missing to achieve this.	2018-11-26 19:50:32 +01:00
Willy Tarreau	a7280a1ec2	BUILD: buffers: buf.h requires unistd to get ssize_t on libmusl Building with musl and gcc-5.3 for MIPS returns this : include/common/buf.h: In function 'b_dist': include/common/buf.h:252:2: error: unknown type name 'ssize_t' ssize_t dist = to - from; ^ Including stdint or stddef is not sufficient there to get ssize_t, unistd is needed as well. It's likely that other platforms will have the same issue. This patch also addresses it in ist.h and memory.h.	2018-11-26 19:49:21 +01:00
Willy Tarreau	6689609090	BUILD: htx: fix fprintf format inconsistency on 32-bit platforms Building on 32 bits gives this : include/proto/htx.h: In function 'htx_dump': include/proto/htx.h:443:25: warning: format '%lu' expects argument of type 'long unsigned int', but argument 8 has type 'uint64_t {aka long long unsigned int}' [-Wformat=] fprintf(stderr, "htx:%p [ size=%u - data=%u - used=%u - wrap=%s - extra=%lu]\n", ^ In htx_dump(), fprintf() uses %lu but the value is an uint64_t so it doesn't match on 32-bit. Let's cast this to unsigned long long and use %llu instead.	2018-11-26 19:37:32 +01:00
Olivier Houchard	ee23b2a1e3	MEDIUM: servers: Store the connection in the SI until we have a mux. When we create a connection, if we have to defer the conn_stream and the mux creation until we can decide it (ie until the SSL handshake is done, and the ALPN is decided), store the connection in the stream_interface, so that we're sure we can destroy it if needed.	2018-11-23 19:11:14 +01:00
Olivier Houchard	201b9f4eb5	MAJOR: connections: Defer mux creation for outgoing connection if alpn is set. If an ALPN (or a NPN) was chosen for a server, defer choosing the mux until after the SSL handshake is done, and the ALPN/NPN has been negotiated, so that we know which mux to pick.	2018-11-22 19:52:23 +01:00
Olivier Houchard	c756600103	MINOR: server: Add "alpn" and "npn" keywords. Add new keywords to "server" lines, alpn and npn. If set, when connecting through SSL, those alpn/npn will be negotiated during the SSL handshake.	2018-11-22 19:50:08 +01:00
Willy Tarreau	beb859abce	MINOR: polling: add an option to support busy polling In some situations, especially when dealing with low latency on processors supporting a variable frequency or when running inside virtual machines, each time the process waits for an I/O using the poller, the processor goes back to sleep or is offered to another VM for a long time, and it causes excessively high latencies. A solution to this provided by this patch is to enable busy polling using a global option. When busy polling is enabled, the pollers never sleep and loop over themselves waiting for an I/O event to happen or for a timeout to occur. On multi-processor machines it can significantly overheat the processor but it usually results in much lower latencies. A typical test consisting in injecting traffic over a single connection at a time over the loopback shows a bump from 4640 to 8540 connections per second on forwarded connections, indicating a latency reduction of 98 microseconds for each connection, and a bump from 12500 to 21250 for locally terminated connections (redirects), indicating a reduction of 33 microseconds. It is only usable with epoll and kqueue because select() and poll()'s API is not convenient for such usages, and the level of performance they are used in doesn't benefit from this anyway. The option, which obviously remains disabled by default, can be turned on using "busy-polling" in the global section, and turned off later using "no busy-polling". Its status is reported in "show info" to help troubleshooting suspicious CPU spikes.	2018-11-22 19:47:30 +01:00
Willy Tarreau	48f8bc1368	MINOR: poller: move the call of tv_update_date() back to the pollers The reason behind this will be to be able to compute a timeout when busy polling.	2018-11-22 18:57:37 +01:00
Willy Tarreau	9efd7456e0	MEDIUM: tasks: collect per-task CPU time and latency Right now we measure for each task the cumulated time spent waiting for the CPU and using it. The timestamp uses a 64-bit integer to report a nanosecond-level date. This is only enabled when "profiling.tasks" is enabled, and consumes less than 1% extra CPU on x86_64 when enabled. The cumulated processing time and wait time are reported in "show sess". The task's counters are also reset when an HTTP transaction is reset since the HTTP part pretends to restart on a fresh new stream. This will make sure we always report correct numbers for each request in the logs.	2018-11-22 15:44:21 +01:00
Willy Tarreau	75c62c2793	MINOR: activity: add configuration and CLI support for "profiling.tasks" This is a new global setting which enables or disables CPU profiling per task. For now it only sets/resets the variable based on the global option "profiling.tasks" and supports showing it as well as setting it from the CLI using "show profiling" and "set profiling". The option will be used by a future commit. It was done in a way which should ease future addition of profiling options.	2018-11-22 11:48:51 +01:00
Willy Tarreau	baba82fe70	MINOR: activity: report the average loop time in "show activity" Since we know the time it takes to process everything between two poll() calls, we can use this as the max latency measurement any task will experience and average it. This code does this, and reports in "show activity" the average of this loop time over the last 1024 poll() loops, for each thread. It will vary quickly at high loads and slowly under low to moderate loads, depending on the rate at which poll() is called. The latency a task experiences is expected to be half of this on average.	2018-11-22 11:48:41 +01:00
Willy Tarreau	609aad9e73	REORG: time/activity: move activity measurements to activity.{c,h} At the moment the situation with activity measurement is quite tricky because the struct activity is defined in global.h and declared in haproxy.c, with operations made in time.h and relying on freq_ctr which are defined in freq_ctr.h which itself includes time.h. It's barely possible to touch any of these files without breaking all the circular dependency. Let's move all this stuff to activity.{c,h} and be done with it. The measurement of active and stolen time is now done in a dedicated function called just after tv_before_poll() instead of mixing the two, which used to be a lazy (but convenient) decision. No code was changed, stuff was just moved around.	2018-11-22 11:48:41 +01:00
Willy Tarreau	17306b905e	MINOR: cli: add a few missing includes in proto/cli.h Just found that proto/cli.h doesn't build if types/cli.h is not also included by the caller, as cli_kw_list is used in arguments. But it's also true for a few other ones like mworker_proc, stream, and channel, so let's fix this.	2018-11-22 11:47:53 +01:00
William Lallemand	31a1c1d5e7	MEDIUM: signal: signal_unregister() removes every handler The new function signal_unregister() removes every handler assigned to a signal. Once the handler list of the signal is empty, the signal is ignored with SIG_IGN.	2018-11-22 11:42:51 +01:00
William Lallemand	db6bdfbf68	MINOR: cli: add mworker_accept_wrapper to 'show fd' In the output of 'show fd', the worker CLI's socketpair was still handled by an "unknown" function. That can be really confusing during debug. Fixed it by showing "mworker_accept_wrapper" instead.	2018-11-22 11:42:51 +01:00
William Lallemand	9c56a22b20	MINOR: log: introduce ha_notice() It's like ha_warning() or ha_alert() but with a NOTICE prefix.	2018-11-21 19:02:23 +01:00
William Lallemand	944e619b64	MEDIUM: mworker: wait mode use standard init code path The mworker waitpid mode (which is used when a reload failed to apply the new configuration) was still using a specific initialisation path. That's a problem since we use a polling loop in the master now, the master proxy is not initialized and the master CLI is not activated. This patch removes the initialisation code of the wait mode and introduces the MODE_MWORKER_WAIT in order to use the same init path as the MODE_MWORKER with some exceptions. It allows the master proxy and the master CLI to be used during the waitpid mode.	2018-11-21 17:05:30 +01:00
William Lallemand	16dd1b3ead	MINOR: cli: show master information in 'show proc' Displays the master information in show proc.	2018-11-20 04:43:54 +01:00
William Lallemand	e368330128	MINOR: cli: displays uptime in `show proc` Displays the uptime of the workers in `show proc`	2018-11-20 04:43:54 +01:00
Willy Tarreau	3a1f5fda10	REORG: config: extract the proxy parser into cfgparse-listen.c This was the largest function of the whole file, taking a rough second to build alone. Let's move it to a distinct file along with a few dependencies. Doing so saved about 2 seconds on the total build time.	2018-11-19 06:47:09 +01:00
Willy Tarreau	36b9e222bb	REORG: config: extract the global section parser into cfgparse-global The config parser is the largest file to build and its build dominates the total project's build time. Let's start to split it into multiple smaller pieces by extracting the "global" section parser into a new file called "cfgparse-global.c". This removes 1/4th of the file's build time.	2018-11-19 06:41:57 +01:00
Joseph Herlant	32b8327266	CLEANUP: Fix typos in the standard subsystem Fix typos in the code comments of the standard subsystem.	2018-11-18 22:26:42 +01:00
Joseph Herlant	f7f6031184	CLEANUP: Fix typos in the spoe subsystem Fix typos in the code comments of the spoe subsystem.	2018-11-18 22:26:42 +01:00
Joseph Herlant	757f5ad73a	CLEANUP: Fix typos in the sample subsystem Fix some typos in the code comment of the sample subsystem.	2018-11-18 22:26:42 +01:00
Joseph Herlant	85b4059b82	CLEANUP: Fix typos in the log subsystem Fix some misspells in the code comments of the log subsystem.	2018-11-18 22:26:42 +01:00
Joseph Herlant	b35ea68081	CLEANUP: Fix typos in the filters subsystem Fix typos in the code comments of the filters subsystems.	2018-11-18 22:26:42 +01:00
Joseph Herlant	59dd295926	CLEANUP: fix typos in the proxy subsystem Fix typos in the code comments of the proxy subsystem.	2018-11-18 22:23:15 +01:00
Joseph Herlant	5ba8025976	CLEANUP: fix typos in the proto_http subsystem Fixes typos in the code comments of the proto_http subsystem.	2018-11-18 22:23:15 +01:00
Joseph Herlant	44466826b1	CLEANUP: fix a few typos in the comments of the server subsystem A few misspellings were detected in the server subsystem. This commit fixes them.	2018-11-18 22:23:15 +01:00
Joseph Herlant	42cf6395c4	CLEANUP: Fix typos in the dns subsystem Fix misspells in the code comments of the dns subsystem.	2018-11-18 22:23:15 +01:00
Christopher Faulet	ef453ed9b0	MINOR: http_fetch: Add smp_prefetch_htx It does the same as smp_prefetch_http but for HTX messages. It can be called from an HTTP proxy or a TCP proxy. For HTTP proxies, the parsing is handled by the mux, so it does nothing but wait. For TCP proxies, it tries to parse an HTTP message and to convert it into a temporary HTX message. Sample fetches will use this temporary variable to do their job.	2018-11-18 22:09:00 +01:00
Christopher Faulet	fefc73da34	MINOR: proto_htx: Add functions htx_perform_server_redirect It is more or less the same as the legacy version but adapted to be called from HTX analyzers. In the legacy version of this function, we switch to the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	64159df1fb	MINOR: proto_htx: Add functions htx_send_name_header It is more or less the same as the legacy version but adapted to be called from HTX analyzers. In the legacy version of this function, we switch to the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	25a02f65b1	MINOR: proto_htx: Add functions to check the cacheability of HTX messages It is more or less the same as the legacy versions but adapted to be called from HTX analyzers. In the legacy versions of these functions, we switch to the HTX code when applicable.	2018-11-18 22:08:58 +01:00
Christopher Faulet	8d8ac191a7	MINOR: proto_htx: Add functions htx_req_replace_stline and htx_res_set_status It is more or less the same as the legacy versions but adapted to be called from HTX analyzers. In the legacy versions of these functions, we switch to the HTX code when applicable.	2018-11-18 22:08:56 +01:00
Christopher Faulet	7233352fe4	MINOR: proto_htx: Add functions htx_transform_header and htx_transform_header_str It is more or less the same as the legacy versions but adapted to be called from HTX analyzers.	2018-11-18 22:08:56 +01:00
Christopher Faulet	7ff1ceaa5e	MINOR: http_htx: Add functions to retrieve a specific occurrence of a header There are 2 functions. The first one considers any comma as a delimiter for distinct values. The second one considers full-line headers.	2018-11-18 22:08:55 +01:00
Christopher Faulet	e010c80753	MINOR: http_htx: Add functions to replace part of the start-line	2018-11-18 22:08:54 +01:00
Christopher Faulet	0f226958b7	MINOR: proto_htx: Add some functions to handle HTX messages More functions will come, but it is the minimum to switch HTX analyzers on the HTX internal representation.	2018-11-18 22:08:54 +01:00
Christopher Faulet	47596d3787	MINOR: http_htx: Add functions to manipulate HTX messages in http_htx.c This file will host all functions to manipulate HTTP messages using the HTX representation. Functions in this file will be able to be called from anywhere and are mainly related to the HTTP semantics.	2018-11-18 22:08:53 +01:00
Christopher Faulet	a3d2a16fad	MEDIUM: htx: Add API to deal with the internal representation of HTTP messages The internal representation of an HTTP message, called HTX, is a structured representation, unlike the old one which is a raw representation of messages. The idea is to have a version-agnostic representation of the HTTP messages, which can easily be used to handle HTTP/1, HTTP/2 and hopefully QUIC messages, and communication from one of them to another. In this patch, we add types to define the internal representation itself and the main functions to manipulate them.	2018-11-18 22:08:53 +01:00
Christopher Faulet	f2824e6e10	MAJOR: mux-h1/proto_htx: Handle keep-alive connections in the mux Now, the connection mode is detected in the mux and not in HTX analyzers anymore. Keep-alive connections are now managed by the mux. A new stream is created for each transaction. This removes the most important part of the synchronization between channels and the HTTP transaction cleanup. These changes only affect the HTX part (proto_htx.c). Legacy HTTP analyzers remain untouched for now. On the client-side, the mux is responsible for creating new streams when a new request starts. It is also responsible for parsing and updating the "Connection:" header of the response. On the server-side, the mux is responsible for parsing and updating the "Connection:" header of the request. Muxes on each side are independent. For now, there is no connection pool on the server-side, so it always closes the server connection.	2018-11-18 22:02:42 +01:00
Christopher Faulet	e0768ebabc	MEDIUM: proto_htx: Add HTX analyzers and use it when the mux H1 is used For now, these analyzers are just copies of the legacy HTTP analyzers. But, during the HTTP refactoring, it will be the main place where it will be visible. And in legacy analyzers, the macro IS_HTX_STRM is used to know if the HTX version should be called or not. Note: the following commits were applied to proto_http.c after this patch was developed and need to be studied to see if an adaptation to htx is required : `fd9b68c` BUG/MINOR: only mark connections private if NTLM is detected	2018-11-18 21:45:50 +01:00
Christopher Faulet	1d5b85aba2	MINOR: http: Add macros to check if a stream uses the HTX representation To prepare the refactoring of the code handling HTTP messages, these macros will help to use HTX functions instead of legacy ones when the new HTX internal representation is in use. To do so, for a given stream, we will check if its frontend has the option PR_O2_USE_HTX. It is useless to test backend options because it is not possible to mix the HTX representation and the legacy one (i.e, having an HTX frontend and a legacy backend or vice versa).	2018-11-18 21:45:50 +01:00
Christopher Faulet	effc3750cc	MINOR: conn_stream: Add a flag to notify the SI some data were received The flag CS_FL_READ_PARTIAL can be set by the mux on the conn_stream to notify the stream interface that some data were received. It is used in si_cs_recv to re-arm read timeout on the channel.	2018-11-18 21:45:49 +01:00
Christopher Faulet	27a3dc8fb2	MINOR: http: Call http_send_name_header with the stream instead of the txn This is just a minor change to ease integration of the HTX.	2018-11-18 21:45:49 +01:00
Christopher Faulet	8277ca72b1	MINOR: http: Add standalone functions to parse a start-line or a header These 2 functions are pretty naive. They only split a start-line into its 3 substrings or a header line into its name and value. Spaces before and after each part are skipped. No CRLF at the end are expected.	2018-11-18 21:45:49 +01:00
Christopher Faulet	72d9125efb	MINOR: conn_stream: Add a flag to notify the mux it must respect the reserve By setting the flag CO_RFL_KEEP_RSV when calling mux->rcv_buf, the stream-interface notifies the mux it must keep some space to preserve the buffer's reserve. This flag is only useful for multiplexers handling structured data, because in such case, the stream-interface cannot know the real amount of free space in the channel's buffer.	2018-11-18 21:45:48 +01:00
Christopher Faulet	c6618d6835	MINOR: conn_stream: Add a flag to notify the mux it should flush its buffers By setting the flag CO_RFL_BUF_FLUSH when calling mux->rcv_buf, the stream-interface notifies the mux it should flush its buffers without reading more data. This flag is set when the SI wants to use the kernel TCP splicing to forward data. Of course, the mux can respect it or not, depending on its state. It's only informational.	2018-11-18 21:45:48 +01:00
Olivier Houchard	7c6f8b146d	MAJOR: connections: Detach connections from streams. Do not destroy the connection when we're about to destroy a stream. This prevents us from doing keepalive on server connections when the client is using HTTP/2, as a new stream is created for each request. Instead, the session is now responsible for destroying connections. When reusing connections, the attach() mux method is now used to create a new conn_stream.	2018-11-18 21:45:45 +01:00
Olivier Houchard	131fd89d5a	MINOR: sessions: Start to store the outgoing connection in sessions. Introduce a new field in session, "srv_conn", and a linked list of sessions in the connection. It will be used later when we'll switch connections from being managed by the stream, to being managed by the session.	2018-11-18 21:44:56 +01:00
Olivier Houchard	060ed43361	MINOR: mux: Add a destroy() method. Add a new method to muxes, destroy(), that is responsible for destroying the mux and the associated connection, to be used for server connections.	2018-11-18 21:44:53 +01:00
Olivier Houchard	d540b36e8a	MINOR: mux: Add a new "avail_streams" method. Add a new method for mux, avail_streams, that returns the number of streams still available for a mux. For the mux_pt, it'll return 1 if the connection is in idle, or 0. For the H2 mux, it'll return the max number of streams allowed, minus the number of streams currently in use.	2018-11-18 21:44:06 +01:00
Willy Tarreau	db398435aa	MINOR: stream-int: replace si_cant_put() with si_rx_room_{blk,rdy}() Remaining calls to si_cant_put() were all for lack of room and were turned to si_rx_room_blk(). A few places where SI_FL_RXBLK_ROOM was cleared by hand were converted to si_rx_room_rdy(). The now unused si_cant_put() function was removed.	2018-11-18 21:41:50 +01:00
Willy Tarreau	b26a6f9708	MEDIUM: stream-int: make use of si_rx_chan_{rdy,blk} to control the stream-int from the channel The channel can disable reading from the stream-interface using various methods, such as : - CF_DONT_READ - !channel_may_recv() - and possibly others Till now this was done by mangling SI_FL_RX_WAIT_EP which is not appropriate at all since it's not the stream interface which decides whether it wants to deliver data or not. Some places were also wrongly relying on SI_FL_RXBLK_ROOM since it was the only other alternative, but it's not suitable for CF_DONT_READ. Let's use the SI_FL_RXBLK_CHAN flag for this instead. It will properly prevent the stream interface from being woken up and reads from subscribing to more receipt without being accidentally removed. It is automatically reset if CF_DONT_READ is not set in stream_int_notify(). The code is not trivial because it splits the logic between everything related to buffer contents (channel_is_empty(), CF_WRITE_PARTIAL, etc) and buffer policy (CF_DONT_READ). Also it now needs to decide timeouts based on any blocking flag and not just SI_FL_RXBLK_ROOM anymore. It looks like this patch has caused a minor performance degradation on connection rate, which possibly deserves being investigated deeper as the test conditions are uncertain (e.g. slightly more subscribe calls?).	2018-11-18 21:41:49 +01:00
Willy Tarreau	abb5d4202f	MEDIUM: stream-int: use si_rx_shut_blk() to indicate the SI is closed Till now we were using si_done_put() upon shutr, but these flags could be reset upon next activity. Now let's switch to SI_FL_RXBLK_SHUT which doesn't go away. It's also set in stream_int_update() in case a shutr condition is detected. The now unused si_done_put() was removed.	2018-11-18 21:41:49 +01:00
Willy Tarreau	7f494d0c5e	MINOR: stream-int: make si_sync_recv() simply check ENDP before si_cs_recv() Instead of checking complex conditions to call si_cs_recv() upon first call, let's simply use si_rx_endp_ready() now that si_cs_recv() reports it accurately, and add si_rx_blocked() to cover any blocking situation.	2018-11-18 21:41:48 +01:00
Willy Tarreau	8bb2ffb831	MINOR: stream-int: replace si_{want,stop}_put() with si_rx_endp_{more,done}() Here it's only a 1-to-1 replacement.	2018-11-18 21:41:47 +01:00
Willy Tarreau	8be7cd7b92	MEDIUM: stream-int: use si_rx_buff_{rdy,blk} to report buffer readiness The stream interface used to conflate a missing buffer and lack of buffer space into SI_FL_WAIT_ROOM but this causes difficulties as these cannot be checked at the same moment and are not resolved at the same moment either. Now we instead mark the buffer as presumably available using si_rx_buff_rdy() and mark it as unavailable+requested using si_rx_buff_blk(). The call to si_alloc_buf() was moved after si_stop_put(). This makes sure that the SI_FL_RX_WAIT_EP flag is cleared on allocation failure so that the function is called again if the callee fails to do its work.	2018-11-18 21:41:47 +01:00
Willy Tarreau	32742fdf45	MINOR: stream-int: use si_rx_blocked()/si_tx_blocked() to check readiness This way we don't limit ourselves to random flags only and the code is more readable and safer for the long term.	2018-11-18 21:41:46 +01:00
Willy Tarreau	05b9b64afb	MINOR: stream-int: replace SI_FL_WANT_PUT with !SI_FL_RX_WAIT_EP The SI_FL_WANT_PUT flag is used in an awkward way, sometimes it's set by the stream-interface to mean "I have something to deliver", sometimes it's cleared by the channel to say "I don't want you to send what you have", and it has to be set back once CF_DONT_READ is cleared. This will have to be split between SI_FL_RX_WAIT_EP and SI_FL_RXBLK_CHAN. This patch only replaces all uses of the flag with its natural (but negated) replacement SI_FL_RX_WAIT_EP. The code is expected to be strictly equivalent. The now unused flag was completely removed.	2018-11-18 21:41:46 +01:00
Willy Tarreau	78dcacef5c	MINOR: stream-int: add new functions si_{rx,tx}_{blocked,endp_ready}() The first ones are used to figure if a direction is blocked on the stream interface for anything but the end point. The second ones are used to detect if the end point is ready to receive/transmit. They should be used instead of directly fiddling with the existing bits.	2018-11-18 21:41:46 +01:00
Willy Tarreau	94f7907d65	MINOR: stream-int: introduce new SI_FL_RXBLK flags The plan is to have the following flags to describe why a stream interface doesn't produce data : - SI_FL_RXBLK_CHAN : the channel doesn't want it to receive - SI_FL_RXBLK_BUFF : waiting for a buffer allocation to complete - SI_FL_RXBLK_ROOM : more room is required in the channel to receive - SI_FL_RXBLK_SHUT : input now closed, nothing new will come - SI_FL_RX_WAIT_EP : waiting for the endpoint to produce more data Applets like the CLI which consume complete commands at once and produce large chunks of responses will for example be able to stop being woken up by clearing SI_FL_WANT_GET and setting SI_FL_RXBLK_ROOM when the rx buffer is full. Once called they will unblock WANT_GET. The flags were moved together in readable form with the Rx bits using 2 hex digits and still have some room to do a similar operation on the Tx path later, with the WAIT_EP flag being represented alone on a digit.	2018-11-18 21:41:45 +01:00
Willy Tarreau	d0f5bbcd64	MINOR: stream-int: rename SI_FL_WAIT_ROOM to SI_FL_RXBLK_ROOM This flag is not enough to describe all blocking situations, as can be seen in each case we remove it. The muxes have taught us that using multiple blocking flags in parallel will be much easier, so let's start to do this now. This patch only renames this flag in order to make next changes more readable.	2018-11-18 21:41:45 +01:00
Willy Tarreau	a44e576f62	MINOR: stream-int: expand the flags to 32-bit We used to have enough of 16 bits, with 3 still available but it's not possible to add the rx/tx blocking bits there. Let's extend the format to 32 bits and slightly reorder the fields to maintain the struct size to 64 bytes. Nothing else was changed.	2018-11-18 21:41:45 +01:00
Willy Tarreau	fafd3984b9	MINOR: mux: implement a get_first_cs() method This method is used to retrieve the first known good conn_stream from the mux. It will be used to find the other end of a connection when dealing with the proxy protocol for example.	2018-11-18 21:29:20 +01:00
Willy Tarreau	ade6478a8c	MINOR: stream: move the conn_stream specific calls to the stream-int There are still some unwelcome synchronous calls to si_cs_recv() in process_stream(). Let's have a new function si_sync_recv() to perform a synchronous receive call on a stream interface regardless of the type of its endpoint, and move these calls there. For now it only implements conn_streams since it doesn't seem useful to support applets there. The function implements an extra check for the stream interface to be in an established state before attempting anything.	2018-11-17 19:53:45 +01:00
William Lallemand	c59f9884d7	MEDIUM: listeners: support unstoppable listener An unstoppable listener is a listener which won't be stopped during a soft stop. The unstoppable_jobs variable is incremented and the listener won't prevent the process from exiting properly. It is not a good idea to use this feature (the LI_O_NOSTOP flag) with a listener that needs to be bound again on another process during a soft reload.	2018-11-16 17:05:40 +01:00
William Lallemand	a719926cf8	MEDIUM: jobs: support unstoppable jobs for soft stop This patch allows a process to properly quit when some jobs are still active. This feature is handled by the unstoppable_jobs variable, which must be atomically incremented. During each new iteration of run_poll_loop() the break condition of the loop is now (jobs - unstoppable_jobs) == 0. The only usage of this at the moment is to handle the socketpair CLI of the worker during the stopping of the process. During the soft stop, we could mark the CLI listener as an unstoppable job and still handle new connections until all other jobs are stopped.	2018-11-16 17:05:40 +01:00
Frédéric Lécaille	9ca51aa288	MINOR: http: Implement "early-hint" http request rules. This patch implements the http_apply_early_hint_rule() function, which is responsible for building HTTP 103 Early Hint responses each time an "early-hint" rule is matched.	2018-11-12 21:08:55 +01:00
Frédéric Lécaille	0ebbcb663c	MINOR: http: Make new "early-hint" http-request action really be parsed. This patch adds a "early_hint" struct to "arg" union of "act_rule" struct and parse "early-hint" http-request keyword with it using the same code as for "(add|set)-header" parser.	2018-11-12 21:08:55 +01:00
Frédéric Lécaille	a985e3875b	MINOR: http: Add new "early-hint" http-request action. This patch adds the new "early-hint" action to "http-request" rules parser. This action should be parsed the same way as "(add|set)-header" actions.	2018-11-12 21:08:55 +01:00
Willy Tarreau	7520e4ff57	MINOR: namespaces: don't build namespace.c if disabled When namespaces are disabled, support is still reported because the file is built with almost nothing in it but built anyway. Instead of extending the scope of the numerous ifdefs in this file, better avoid building it when namespaces are disabled. In this case we define my_socketat() as an inline function mapping directly to socket(). The struct netns_entry still needs to be defined because it's used by various other functions in the code.	2018-11-12 19:15:15 +01:00
Willy Tarreau	c1b0645dac	MEDIUM: log: add a new "raw" format This format is pretty similar to the previous "short" format except that it also removes the severity level. Thus only the raw message is sent. This is suitable for use in containers, where only the raw information is expected and where the severity is supposed to come from the file descriptor used.	2018-11-12 18:37:55 +01:00
Willy Tarreau	e8746a08b2	MEDIUM: log: support a new "short" format This format is meant to be used with local file descriptors. It emits messages only prefixed with a level, removing all the process name, system name, date and so on. It is similar to the printk() format used on Linux. It's suitable to be sent to a local logger compatible with systemd's output format. Note that the facility is still required but not used, hence it is suggested to use "daemon" to remind that it's a local logger. Example : log stdout format short daemon # send everything to stdout log stderr format short daemon notice # send important events to stderr	2018-11-12 18:37:55 +01:00
Willy Tarreau	13ef773722	MINOR: log: report the number of dropped logs in the stats It's easy to detect when logs on some paths are lost as sendmsg() will return EAGAIN. This is particularly true when sending to /dev/log, which often doesn't support a big logging capacity. Let's keep track of these and report the total number of dropped messages in "show info".	2018-11-12 18:37:55 +01:00
Willy Tarreau	d0d40ebf5e	CLEANUP: stream-int: remove the now unused si->update() function We exclusively use stream_int_update() now, the lower layers are not called anymore so let's remove them, as well as si_update() which used to be their wrapper.	2018-11-11 10:18:37 +01:00
Willy Tarreau	d14844a734	MINOR: stream-int: replace si_update() with si_update_both() The function used to be called in turn for each side of the stream, but since it's called exclusively from process_stream(), it prevents us from making use of the knowledge we have of the operations in progress for each side, resulting in having to go all the way through functions like stream_int_notify() which are not appropriate there. That patch creates a new function, si_update_both() which takes two stream interfaces expected to belong to the same stream, and processes their flags in a more suitable order, but for now doesn't change the logic at all. The next step will consist in trying to reinsert the rest of the socket layer-specific update code to ultimately update the flags correctly at the end of the operation.	2018-11-11 10:18:37 +01:00
Willy Tarreau	8fe516f08a	MEDIUM: stream-int: make si_chk_rcv() check that SI_FL_WAIT_ROOM is cleared After careful inspection, it now seems OK to call si_chk_rcv() only when SI_FL_WAIT_ROOM is cleared and SI_FL_WANT_PUT is set, since all identified call places have already taken care of this.	2018-11-11 10:18:37 +01:00
Willy Tarreau	abf531caa0	MEDIUM: stream-int: always call si_chk_rcv() when we make room in the buffer Instead of clearing the SI_FL_WAIT_ROOM flag and losing the information about the need from the producer to be woken up, we now call si_chk_rcv() immediately. This is cheap to do and it could possibly be further improved by only doing it when SI_FL_WAIT_ROOM was still set, though this will require some extra auditing of the code paths. The only remaining place where the flag was cleared without a call to si_chk_rcv() is si_alloc_ibuf(), but since this one is called from a receive path woken up from si_chk_rcv() or not having failed, the clearing was not necessary anymore either. And there was one place in stream_int_notify() where si_chk_rcv() was called with SI_FL_WAIT_ROOM still explicitly set so this place was adjusted in order to clear the flag prior to calling si_chk_rcv(). Now we don't have any situation where we randomly clear SI_FL_WAIT_ROOM without trying to wake the other side up, nor where we call si_chk_rcv() with the flag set, so this flag should accurately represent a failed attempt at putting data into the buffer.	2018-11-11 10:18:37 +01:00
Willy Tarreau	1f9de21c38	MEDIUM: stream-int: make SI_FL_WANT_PUT reflect CF_DONT_READ When CF_DONT_READ is set, till now we used to set SI_FL_WAIT_ROOM, which is not appropriate since it would lose the subscribe status. Instead let's clear SI_FL_WANT_PUT (just like applets do), and set the flag only when CF_DONT_READ is cleared. We have to do this in stream_int_update(), and in si_cs_io_cb() after returning from si_cs_recv() since it would be a bit invasive to hack this one for now. It must not be done in stream_int_notify() otherwise it would re-enable blocked applets. Last, when si_chk_rcv() is called, it immediately clears the flag before calling ->chk_rcv() so that we are not tempted to uselessly loop on the same call until the receive function is called. This is the same principle as what is done with the applet scheduler.	2018-11-11 10:18:37 +01:00
Willy Tarreau	1bdb598a55	MINOR: stream-int: factor the SI_ST_EST state test into si_chk_rcv() This test is made in each implementation of the function, better to merge it.	2018-11-11 10:18:37 +01:00
Willy Tarreau	96aadd5c55	MEDIUM: stream-int: temporarily make si_chk_rcv() take care of SI_FL_WAIT_ROOM This flag should already be cleared before calling the *chk_rcv() functions. Before adapting all call places, let's first make sure si_chk_rcv() clears it before calling them so that these functions do not have to check it again and so that they do not adjust it. This function will only call the lower layers if the SI_FL_WANT_PUT flag is present so that the endpoint can decide not to be called (as done with applets).	2018-11-11 10:18:37 +01:00
Willy Tarreau	57f08bb63b	MINOR: stream-int: make it clear that si_ops cannot be null There was an ambiguity in which functions of the si_ops struct could be null or not. only ->update doesn't exist in one of the si_ops (the embedded one), all others are always defined. ->shutr and ->shutw were never tested. However ->chk_rcv() and ->chk_snd() were tested, causing confusion about the proper way to wake the other side up if undefined (which never happens). Let's update the comments to state these functions are mandatory and remove the offending checks.	2018-11-11 10:18:37 +01:00
Willy Tarreau	af4f6f6d2f	MINOR: stream-int: use si_cant_put() instead of setting SI_FL_WAIT_ROOM We now do this on the si_cs_recv() path so that we always have SI_FL_WANT_PUT properly set when there's a need to receive and SI_FL_WAIT_ROOM upon failure.	2018-11-11 10:18:37 +01:00
Willy Tarreau	394970c297	MINOR: stream-int: add si_done_{get,put} to indicate that we won't do it anymore This is useful on close or stream aborts as it saves us from having to manipulate the (sometimes confusing) flags.	2018-11-11 10:18:37 +01:00
Willy Tarreau	0cd3bd628a	MINOR: stream-int: rename si_applet_{want|stop|cant}_{get|put} It doesn't make sense to limit this code to applets, as any stream interface can use it. Let's rename it by simply dropping the "applet_" part of the name. No other change was made except updating the comments.	2018-11-11 10:18:37 +01:00
Willy Tarreau	21028b5e7f	MEDIUM: appctx: check for allocation attempts in buffer allocation callbacks The buffer allocation callback appctx_res_wakeup() used to rely on old tricks to detect if a buffer was already granted to an appctx, namely by checking the task's state. Not only is this test not valid anymore, but it's inaccurate. Let's rely solely on SI_FL_WAIT_ROOM that is now set on allocation failure by the functions trying to allocate a buffer. The buffer is now allocated on the fly and the flag removed so that the consistency between the two remains granted. The patch also fixes minor issues such as the function being improperly declared inline(!) and the fact that using appctx_wakeup() sets the wakeup reason to TASK_WOKEN_OTHER while we try to use TASK_WOKEN_RES when waking up consecutive to a resource allocation such as a buffer.	2018-11-11 10:18:37 +01:00
Willy Tarreau	b882dd88cc	MEDIUM: stream: implement stream_buf_available() This function replaces stream_res_available(), which is used as a callback for the buffer allocator. It now carefully checks which stream interface was blocked on a buffer allocation, tries to allocate the input buffer to this stream interface, and wakes the task up once such a buffer was found. It will automatically remove the SI_FL_WAIT_ROOM flag upon success since the info this flag indicates becomes wrong as soon as the buffer is allocated. The code is still far from being perfect because if a call to si_cs_recv() fails to allocate a buffer, we'll still end up passing via process_stream() again, but this could be improved in the future by using finer-grained wake-up notifications.	2018-11-11 10:18:37 +01:00
Willy Tarreau	2d372c2aa1	MINOR: stats: report the number of currently connected peers The active peers output indicates both the number of established peers connections and the number of peers connection attempts. The new counter "ConnectedPeers" also indicates the number of currently connected peers. This helps detect that some peers cannot be reached for example. It's worth mentioning that this value changes over time because unused peers are often disconnected and reconnected. Most of the time it should be equal to ActivePeers.	2018-11-05 17:15:21 +01:00
Willy Tarreau	199ad24661	MINOR: stats: report the number of active peers in "show info" Peers are the last type of activity which can maintain a job present, so it's important to report that such an entity is still active to explain why the job count may be higher than zero. Here by "ActivePeers" we report peers sessions, which include both established connections and outgoing connection attempts.	2018-11-05 17:15:21 +01:00
Willy Tarreau	00098ea034	MINOR: stats: report the number of active jobs and listeners in "show info" When an haproxy process doesn't stop after a reload, it's because it still has some active "jobs", which mainly are active sessions, listeners, peers or other specific activities. Sometimes it's difficult to troubleshoot the cause of these issues (which generally are the result of a bug) only because some indicators are missing. This patch adds the number of listeners, the number of jobs, and the stopping status to the output of "show info". This way it becomes a bit easier to try to narrow down the cause of such an issue should it happen. A typical use case is to connect to the CLI before reloading, then issue the "show info" command to see what happens. In the normal situation, stopping should equal 1, jobs should equal 1 (meaning only the CLI is still active) and listeners should equal zero. The patch is so trivial that it could make sense to backport it to 1.8 in order to help with troubleshooting.	2018-11-05 17:15:21 +01:00
Willy Tarreau	4698adf68f	MINOR: compat: automatically detect support for crypt_r() glibc >= 2.2 and FreeBSD >= 12.0 support crypt_r(), let's detect this and set a macro HA_HAVE_CRYPT_R for this.	2018-10-29 19:14:14 +01:00
Willy Tarreau	34d4b525a1	BUG/MEDIUM: auth/threads: use of crypt() is not thread-safe It was reported here that authentication may fail when threads are enabled : https://bugzilla.redhat.com/show_bug.cgi?id=1643941 While I couldn't reproduce the issue, it's obvious that there is a problem with the use of the non-reentrant crypt() function there. On Linux systems there's crypt_r() but not on the vast majority of other ones. Thus a first approach consists in placing a lock around this crypt() call. Another patch may relax it when crypt_r() is available. This fix must be backported to 1.8. Thanks to Ryan O'Hara for the quick notification.	2018-10-29 18:06:02 +01:00
Willy Tarreau	ce487aab46	BUG/MEDIUM: tools: fix direction of my_ffsl() Commit `27346b01a` ("OPTIM: tools: optimize my_ffsl() for x86_64") optimized my_ffsl() for intensive use cases in the scheduler, but as half of the times I got it wrong so it counted bits the reverse way. It doesn't matter for the scheduler nor fd cache but it broke cpu-map with threads which heavily relies on proper ordering. We should probably consider dropping support for gcc < 3.4 and switching to builtins for these ones, though often they are as ambiguous. No backport is needed.	2018-10-29 16:09:57 +01:00
Willy Tarreau	8e9f4531cb	BUG/MINOR: memory: make the thread-local cache allocator set the debugging link When building with DEBUG_MEMORY_POOLS, an element returned from the cache would not have its pool link initialized unless it's allocated using pool_alloc(). This is problematic for buffer allocators which use pool_alloc_dirty(), as freeing this object will make the code think it was allocated from another pool. This patch does two things : - make __pool_get_from_cache() set the link - remove the extra initialization from pool_alloc() since it's always done in either __pool_get_first() or __pool_refill_alloc() This patch is marked MINOR since it only affects code explicitly built for debugging. No backport is needed.	2018-10-28 20:12:31 +01:00
William Lallemand	90b1ca1ff5	MEDIUM: channel: reorder the channel analyzers for the cli Reorder the channel analyzers so the CLI analyzers are defined before the XFER_DATA ones.	2018-10-28 14:13:31 +01:00
William Lallemand	309dc9adec	MEDIUM: mworker: stop the master proxy in the workers The master proxy which handles the CLI should not be used or shown in the stats of the workers. This proxy is now disabled after the fork.	2018-10-28 14:03:31 +01:00
William Lallemand	cf62f7e3cb	MEDIUM: cli: implement 'mode cli' proxy analyzers This patch implements analysers for parsing the CLI and extra features for the master's CLI. For each command (sent alone, or separated by ; or \n) the request analyser will determine to which server it should send the request. The 'mode cli' proxy is able to parse a prefix for each command which is used to select the appropriate server. The prefix starts with @ and is followed by "master", the PID preceded by ! or the relative PID. (e.g. @master, @1, @!1234). The servers are not round-robined anymore. The command is sent with a SHUTW which forces the server to close the connection after sending its response. However the proxy allows a keepalive connection on the client side and does not close. The response analyser does not do much stuff, it only reinits the connection when it received a close from the server, and forwards the response. It does not analyze the response data. The only guarantee of the end of the response is the close of the server, we can't rely on the double \n since it's not sent by every command. This could be reimplemented later as a filter.	2018-10-28 14:03:06 +01:00
William Lallemand	291810d8f8	MEDIUM: mworker: find the server ptr using a CLI prefix Add a struct server pointer in the mworker_proc struct so we can easily use it as a target for the mworker proxy. pcli_prefix_to_pid() is used to find the right PID of the worker when using a prefix in the CLI. (@master, @#<relative pid> , @<pid>) pcli_pid_to_server() is used to find the right target server for the CLI proxy.	2018-10-28 13:51:39 +01:00
William Lallemand	14721be11f	MEDIUM: cli: disable some keywords in the master The master process does not need all the keywords of the cli, add 2 flags to choose which keyword to use. It might be useful to activate some of them in a debug mode later...	2018-10-28 13:51:39 +01:00
William Lallemand	e736115d3a	MEDIUM: mworker: create CLI listeners from argv[] This patch introduces mworker_cli_proxy_new_listener() which allows the creation of new listeners for the CLI proxy. Using this function it is possible to create new listeners from the program arguments with -Sa <unix_socket>. It is allowed to create multiple listeners with several -Sa.	2018-10-28 13:51:39 +01:00
William Lallemand	8a02257d88	MEDIUM: mworker: proxy for the master CLI This patch implements a listen proxy within the master. It uses the sockpair of all the workers as servers. In the current state of the code, the proxy is only doing round robin on the CLI of the workers. A CLI mode will be needed to know to which CLI send the requests.	2018-10-28 13:51:39 +01:00
William Lallemand	6e0db2fa99	MEDIUM: mworker: add proc_list in global.h Add the process list in types/global.h so it could be accessed from anywhere.	2018-10-28 13:51:39 +01:00
William Lallemand	313bfd18c1	MINOR: server: export new_server() function The new_server() function will be useful to create a proxy for the master-worker.	2018-10-28 13:51:38 +01:00
William Lallemand	7e1299bb3a	REORG: mworker: move struct mworker_proc to global.h Move the definition of the mworker_proc structure in types/global.h.	2018-10-28 13:51:38 +01:00
William Lallemand	ce83b4a5dd	MEDIUM: mworker: each worker socketpair is a CLI listener The init code of the mworker_proc structs has been moved before the init of the listeners. Each socketpair is now connected to a CLI within the workers, which allows the master to access their CLI. The inherited flag of the worker side socketpair is removed so the socket can be closed in the master.	2018-10-28 13:51:38 +01:00
Willy Tarreau	85f890174a	MEDIUM: stream-int: make si_update() synchronize flag changes before the I/O With the new synchronous si_cs_send() at the end of process_stream(), we're seeing re-appear the I/O layer specific part of the stream interface which is supposed to deal with I/O event subscription. The only difference is that now we subscribe to I/Os only after having attempted (and failed) them. This patch brings a cleanup in this by reintroducing stream_int_update_conn() with the send code from process_stream(). However this alone would not be enough because the flags which are cleared afterwards would result in the loss of the possible events (write events only at the moment). So the flags clearing and stream-int state updates are also performed inside si_update() between the generic code and the I/O specific code. This definitely makes sense as after this call we can simply check again for channel and SI flag changes and decide to loop once again or not.	2018-10-28 13:47:00 +01:00
Willy Tarreau	0979916d3b	MINOR: stream-int: add si_alloc_ibuf() to ease input buffer allocation This will supersede channel_alloc_buffer() while relying on it. It will automatically adjust SI_FL_WAIT_ROOM on the stream-int depending on success or failure to allocate this buffer. It's worth noting that it could make sense to also set SI_FL_WANT_PUT each time we do this to further simplify the code at user places such as applets, but it would possibly not be easy to clean this flag everywhere an rx operation stops.	2018-10-28 13:47:00 +01:00
Willy Tarreau	ede3d884fc	MEDIUM: channel: merge back flags CF_WRITE_PARTIAL and CF_WRITE_EVENT The behaviour of the flag CF_WRITE_PARTIAL was modified by commit `95fad5ba4` ("BUG/MAJOR: stream-int: don't re-arm recv if send fails") due to a situation where it could trigger an immediate wake up of the other side, both acting in loops via the FD cache. This loss has caused the need to introduce CF_WRITE_EVENT as commit `c5a9d5bf`, to replace it, but both flags express more or less the same thing and this distinction creates a lot of confusion and complexity in the code. Since the FD cache now acts via tasklets, the issue worked around in the first patch no longer exists, so it's more than time to kill this hack and to restore CF_WRITE_PARTIAL's semantics (i.e.: there has been some write activity since we last left process_stream). This patch mostly reverts the two commits above. Only the part making use of CF_WROTE_DATA instead of CF_WRITE_PARTIAL to detect the loss of data upon connection setup was kept because it's more accurate and better suited.	2018-10-26 08:32:57 +02:00
Ioannis Cherouvim	1ff7633dd7	CLEANUP: tools: fix misleading comment above function LIM2A The function produces ASCII, but its comment was copied from U2H which produces HTML.	2018-10-26 05:00:48 +02:00
Frédéric Lécaille	b80bc273a3	MINOR: shctx: Change max. object size type to unsigned int. This change is there to prevent implicit conversions when comparing shctx maximum object sizes with other unsigned values.	2018-10-26 04:54:40 +02:00
Frédéric Lécaille	b7838afe6f	MINOR: shctx: Add a maximum object size parameter. This patch adds a new parameter to shctx_init() function to be used to limit the size of each shared object, -1 value meaning "no limit".	2018-10-24 04:39:44 +02:00
Frédéric Lécaille	8df65ae5e2	MINOR: cache: Larger HTTP objects caching. This patch makes the cache capable of storing HTTP objects larger than a buffer. It makes use of the new "block by block shared object allocation" shctx API. A new pointer to struct shared_block has been added to the cache applet context to memorize the next block to be used by the HTTP cache I/O handler http_cache_io_handler() to emit the data. Another member, named "sent", memorizes the number of bytes already sent by this handler. So, to send an object from cache, http_cache_io_handler() must be called until the "sent" counter reaches the size of this object.	2018-10-24 04:37:12 +02:00
Frédéric Lécaille	0bec807e08	MINOR: shctx: Shared objects block by block allocation. This patch makes shctx capable of storing objects in several parts, each part being made of several blocks. There is no more need to walk through until reaching the end of a row to append new blocks. A new pointer to a struct shared_block member, named last_reserved, has been added to struct shared_block to memorize the last block which was reserved by shctx_row_reserve_hot(). Same thing about the "last_append" pointer which is used to memorize the last block used by shctx_row_data_append() to store the data.	2018-10-24 04:35:53 +02:00
Willy Tarreau	68ad3a42f7	MINOR: proxy: add a new option "http-use-htx" This option makes a proxy use only HTX-compatible muxes instead of the HTTP-compatible ones for HTTP modes. It must be set on both ends, this is checked at parsing time.	2018-10-23 10:22:36 +02:00
Christopher Faulet	55d6be7d83	MINOR: h1: Export some functions parsing the value of some HTTP headers Functions parsing the value of "Connection:", "Transfer-encoding:" and "Content-length:" headers are now exported to be used by the mux-h1.	2018-10-23 10:22:36 +02:00
Willy Tarreau	627505d36a	MINOR: freq_ctr: add swrate_add_scaled() to work with large samples Some samples representing time will cover more than one sample at once if they are units of time per time. For this we'd need to have the ability to loop over swrate_add() multiple times but that would be inefficient. By developing the function elevated to power N, it's visible that some coefficients quickly disappear and that those which remain at the first order more or less compensate each other. Thus a simplified version of this function was added to provide a single value for a given number of samples. Tests with multiple values, window sizes and sample sizes have shown that it is possible to make it remain surprisingly accurate (typical error < 0.2% over various large window and sample sizes, even samples representing up to 1/4 of the window).	2018-10-22 08:13:57 +02:00
Olivier Houchard	3f03ab5b15	MINOR: connection: Add a SUB_CALL_UNSUBSCRIBE event. Add a SUB_CALL_UNSUBSCRIBE event, to let the caller know that the unsubscribe method should be called before destroying the object.	2018-10-21 06:00:04 +02:00
Olivier Houchard	53216e7db9	MEDIUM: connections: Don't directly mess with the polling from the upper layers. Avoid using conn_xprt_want_send/recv, and totally nuke cs_want_send/recv, from the upper layers. The polling is now directly handled by the connection layer, it is activated on subscribe(), and unactivated once we got the event and we woke the related task.	2018-10-21 05:58:40 +02:00
Olivier Houchard	1fddc9b7bb	BUG/MEDIUM: connections: Remove subscription if going in idle mode. Make sure we don't have any subscription when the connection is going in idle mode, otherwise there's a race condition when the connection is reused, if there are still old subscriptions, new ones won't be done. No backport is needed.	2018-10-21 05:55:20 +02:00
Olivier Houchard	62975a7740	BUG/MEDIUM: pools: Fix the usage of mmap() with DEBUG_UAF. When mapping memory with mmap(), we should use a fd of -1, not 0. 0 may work on linux, but it doesn't work on FreeBSD, and probably other OSes. It would be nice to backport this to 1.8 to help debugging there.	2018-10-21 05:43:33 +02:00
Willy Tarreau	4e7cc3381b	BUILD: compiler: rename __unreachable() to my_unreachable() Olivier reported that on FreeBSD __unreachable is already defined and causes build warnings. Let's rename it then.	2018-10-20 17:45:48 +02:00
Willy Tarreau	7a6ad88b02	BUILD: memory: fix free_list pointer declaration again for atomic CAS Commit `ac6c880` ("BUILD: memory: fix pointer declaration for atomic CAS") attempted to fix a build warning affecting the lock-free version of the pool allocator. But the fix tried to hide the cause instead of addressing it, thus clang still complains about (void \*\*) not matching (void \*\*\*). The real solution is to declare free_list (void \*\*) and not to use a cast. Now this builds fine with gcc/clang with and without threads. No backport is needed.	2018-10-20 17:37:38 +02:00
Willy Tarreau	ed72d82827	MEDIUM: time: measure the time stolen by other threads The purpose is to detect if threads or processes are competing for the same CPU. This can happen when threads are incorrectly bound, or after a reload if the previous process still has an important activity. With threads this situation is problematic because a preempted thread holding a lock will block other ones waiting for this lock to be released. A first attempt consisted in measuring the cumulated lost time more precisely but the system's scheduler is smart enough to try to limit the thread preemption rate by mostly context switching during poll()'s blank periods, so most of the time lost is not seen. In essence this is good because it means a thread is not preempted with a lock held, and even regarding the rendez-vous point it cannot prevent the other ones from making progress. But still it happens tens to hundreds of times per second that a thread might be preempted, so it's still possible to detect that the situation is happening, thus it's interesting to measure and report its frequency. Each time we enter the poller, we check the CPU time spent working and see if we've lost time doing something else. To limit false positives, we're only interested in losses of 500 microseconds or more (i.e. half a clock tick on a 1 kHz system). If so, it indicates that some time was stolen by another thread or process. Note that we purposely store some sub-millisecond counters so that under heavy traffic with a 1 kHz clock, it's still possible to measure something without being subject to the risk of rounding errors (i.e. if exactly 1 ms is stolen it's possible that the time difference could often be slightly lower). This counter of lost CPU time slots time is reported in "show activity" in numbers of milliseconds of CPU lost per second, per 15s, and total over the process' life. 
By definition, the per-second counter cannot report values larger than 1000 per thread per second and the 15s one will be limited to 15000/s in the worst case, but it's possible that peak values exceed such thresholds after long pauses.	2018-10-19 08:51:59 +02:00
Willy Tarreau	5ceeb15002	MINOR: time: add now_mono_time() and now_cpu_time() These two functions retrieve respectively the monotonic clock time and the per-thread CPU time when available on the platform, or return zero. These syscalls may require to link with -lrt on certain libc, which is enabled in the Makefile with USE_RT=1 (default on Linux systems).	2018-10-18 16:39:48 +02:00
Willy Tarreau	ac6c8805be	BUILD: memory: fix pointer declaration for atomic CAS The calls to HA_ATOMIC_CAS() on the lockfree version of the pool allocator were mistakenly done on (void) for the old value instead of (void *). While this has no impact on "recent" gcc, it does have one for gcc < 4.7 since the CAS was open coded and it's not possible to assign a temporary variable of type "void". No backport is needed, this only affects 1.9.	2018-10-18 16:12:28 +02:00
Willy Tarreau	7e9c4ae4de	MINOR: poller: move time and date computation out of the pollers By placing this code into time.h (tv_entering_poll() and tv_leaving_poll()) we can remove the logic from the pollers and prepare for extending this to offer more accurate time measurements.	2018-10-17 19:59:43 +02:00
Willy Tarreau	f37ba94768	MINOR: fd: centralize poll timeout computation in compute_poll_timeout() The 4 pollers all contain the same code used to compute the poll timeout. This is pointless, let's centralize this into fd.h. This also gets rid of the useless SCHEDULER_RESOLUTION macro which used to work around a very old linux 2.2 bug causing select() to wake up slightly before the timeout.	2018-10-17 19:59:43 +02:00
Willy Tarreau	e18db9e984	MEDIUM: pools: implement a thread-local cache for pool entries Each thread now keeps the last ~512 kB of freed objects into a local cache. There are some heuristics involved so that a specific pool cannot use more than 1/8 of the total cache in number of objects. Tests have shown that 512 kB is an optimal size on a 24-thread test running on a dual-socket machine, resulting in an overall 7.5% performance increase and a cache miss ratio reducing from 19.2 to 17.7%. Anyway it seems pointless to keep more than an L2 cache, which probably explains why sizes between 256 and 512 kB are optimal. Cached objects appear in two lists, one per pool and one LRU to help with fair eviction. Currently there is no way to check each thread's cache state nor to flush it. This cache cannot be disabled and is enabled as soon as the lockless pools are enabled (i.e.: threads are enabled, no pool debugging is in use and the CPU supports a double word CAS).	2018-10-16 13:46:08 +02:00
Willy Tarreau	146794dc4f	MINOR: pools: split pool_free() in the lockfree variant This separates the validity tests from the code committing the object to the pool, in order to ease insertion of the thread-local cache.	2018-10-16 10:29:28 +02:00
Willy Tarreau	0a93b6413f	MINOR: pools: allocate most memory pools from an array For caching it will be convenient to have indexes associated with pools, without having to dereference the pool itself. One solution could consist in replacing all pool pointers with integers but this would limit the number of allocatable pools. Instead here we allocate the 32 first pools from a pre-allocated array whose base address is known so that it's trivial to convert a pool to an index in this array. Pools that cannot fit there will be allocated normally.	2018-10-16 10:29:26 +02:00
Bertrand Jacquin	d5e4de8e5f	DOC: Fix a few typos these are mostly spelling mistakes, some of them might be candidate for backporting as well.	2018-10-15 19:38:15 +02:00
Willy Tarreau	8d8747abe0	OPTIM: tasks: group all tree roots per cache line Currently we have per-thread arrays of trees and counts, but these ones unfortunately share cache lines and are accessed very often. This patch moves the task-specific stuff into a structure taking a multiple of a cache line, and has one such per thread. Just doing this has reduced the cache miss ratio from 19.2% to 18.7% and increased the 12-thread test performance by 3%. It starts to become visible that we really need a process-wide per-thread storage area that would cover more than just these parts of the tasks. The code was arranged so that it's easy to move the pieces elsewhere if needed.	2018-10-15 19:06:13 +02:00
Willy Tarreau	b20aa9eef3	MAJOR: tasks: create per-thread wait queues Now we still have a main contention point with the timers in the main wait queue, but the vast majority of the tasks are pinned to a single thread. This patch creates a per-thread wait queue and queues a task to the local wait queue without any locking if the task is bound to a single thread (the current one) otherwise to the shared queue using locking. This significantly reduces contention on the wait queue. A test with 12 threads showed 11 ms spent in the WQ lock compared to 4.7 seconds in the same test without this change. The cache miss ratio decreased from 19.7% to 19.2% on the 12-thread test, and its performance increased by 1.5%. Another indirect benefit is that the average queue size is divided by the number of threads, which roughly removes log(nbthreads) levels in the tree and further speeds up lookups.	2018-10-15 19:04:40 +02:00
Willy Tarreau	87d54a9a6d	MEDIUM: fd/threads: only grab the fd's lock if the FD has more than one thread The vast majority of FDs are only seen by one thread. Currently the lock on FDs costs a lot because it's touched often, though there should be very little contention. This patch ensures that the lock is only grabbed if the FD is shared by more than one thread, since otherwise the situation is safe. Doing so resulted in a 15% performance boost on a 12-threads test.	2018-10-15 13:25:06 +02:00
Willy Tarreau	98d334bd94	MINOR: tools: add a new function atleast2() to test masks for more than 1 bit For threads it's common to have to check if a mask contains more than one bit set. Let's have this "atleast2()" function report this.	2018-10-15 13:25:06 +02:00
Willy Tarreau	d944344f01	BUILD: peers: check allocation error during peers_init_sync() peers_init_sync() doesn't check task_new()'s return value and doesn't return any result to indicate success or failure. Let's make it return an int and check it from the caller. This can be backported as far as 1.6.	2018-10-15 13:24:43 +02:00
Willy Tarreau	8d26f02e69	BUILD: compiler: add a new statement "__unreachable()" This statement is used as a hint for the compiler so that it knows that the location where it's placed cannot be reached. It will mostly be used after longjmp() or equivalent statements that deal with error processing and that the compiler doesn't know will not return on certain conditions, so that it doesn't complain about null dereferences on error paths.	2018-10-15 13:24:43 +02:00
Willy Tarreau	c1f40b38a6	MINOR: chunk: add chunk_cpy() and chunk_cat() Sometimes we need to concatenate constant chunks to existing ones, but no function currently exists to do this easily, hence these two new ones.	2018-10-12 16:58:01 +02:00
Christopher Faulet	25da9e34f1	MINOR: h1: Add the flag H1_MF_NO_PHDR to not add pseudo-headers during parsing Some pseudo-headers are added during the headers parsing, mainly for the mux H2. With this flag, it is possible to not add them. This avoids some boring filtering in the mux H1.	2018-10-12 16:15:18 +02:00
Christopher Faulet	1dc2b49556	MINOR: h1: Change the union h1_sl to use indirect strings to store infos Instead of using offsets relating to the parsed buffer to store start line infos, we now use indirect strings. So now, these infos remain valid only if the origin buffer remains untouched. But it's not a real problem because this union is used during the parsing and never stored to a later use.	2018-10-12 16:14:57 +02:00
Christopher Faulet	08088e77c6	MINOR: conn-stream: Add CL_FL_NOT_FIRST flag This flag will be used by multiplexers to warn a conn-stream (and, by transitivity, a stream) it is not the first one created by the mux. It will help mux H1 to handle keep-alive connections.	2018-10-12 16:09:26 +02:00
Christopher Faulet	315b39c391	MINOR: http: Use same flag for httpclose and forceclose options Since keep-alive mode is the default mode, the passive close has disappeared, and in the code, httpclose and forceclose options are handled the same way: connections with the client and the server are closed as soon as the request and the response are received and missing "Connection: close" header is added in each direction. So to make things clearer, forceclose is now an alias for httpclose. And httpclose is explicitly an active close. So the old passive close does not exist anymore. Internally, the flag PR_O_HTTP_PCL has been removed and PR_O_HTTP_FCL has been replaced by PR_O_HTTP_CLO. In HTTP analyzers, the checks done to find the right mode to use, depending on proxies options and "Connection: " header value, have been simplified. This should only be a cleanup and no changes are expected.	2018-10-12 16:07:56 +02:00
Christopher Faulet	10079f59b7	MINOR: http: Export some functions and do cleanup to prepare HTTP refactoring To ease the refactoring, the function "http_header_add_tail" has been removed. Now, "http_header_add_tail2" is always used. And the function "capture_headers" has been renamed to "http_capture_headers". Finally, some functions have been exported.	2018-10-12 16:00:45 +02:00
Christopher Faulet	702226c827	MINOR: stats: Add missing include "proto/stats.h" must include "types/stats.h".	2018-10-12 16:00:32 +02:00
Christopher Faulet	7e266c7936	MINOR: http: Move comment about some HTTP macros in the right header file HTTP_FLG_* and HTTP_IS_* were moved from "proto/proto_http.h" to "common/http.h" but the associated comment was forgotten during the move. This is 1.9-specific and should not be backported.	2018-10-12 16:00:24 +02:00
Olivier Houchard	4fdec7aafa	BUG/MEDIUM: stream: Make sure to unsubscribe before si_release_endpoint. Make sure we unsubscribe from events before si_release_endpoint destroys the conn_stream, or it will be never called. To do so, move the call to unsubscribe to si_release_endpoint() directly. This is 1.9-specific and shouldn't be backported.	2018-10-11 17:16:43 +02:00
Olivier Houchard	fa8aa867b9	MEDIUM: connections: Change struct wait_list to wait_event. When subscribing, we don't need to provide a list element, only the h2 mux needs it. So instead, Add a list element to struct h2s, and use it when a list is needed. This forces us to use the unsubscribe method, since we can't just unsubscribe by using LIST_DEL anymore. This patch is larger than it should be because it includes some renaming.	2018-10-11 15:34:39 +02:00
Olivier Houchard	83a0cd8a36	MINOR: connections: Introduce an unsubscribe method. As we don't know how subscriptions are handled, we can't just assume we can use LIST_DEL() to unsubscribe, so introduce a new method to mux and connections to do so.	2018-10-11 15:34:21 +02:00
Willy Tarreau	27346b01aa	OPTIM: tools: optimize my_ffsl() for x86_64 This call is now used quite a bit in the fd cache, to decide which cache to add/remove the fd to/from, when waking up a task for a single thread in __task_wakeup(), in fd_cant_recv() and in fd_process_cached_events(), and we can replace it with a single instruction, removing ~30 instructions and ~80 bytes from the inner loop of some of these functions. In addition the test for zero value was replaced with a comment saying that it is illegal and leads to an undefined behaviour. The code does not make use of this useless case today.	2018-10-10 19:24:23 +02:00
Willy Tarreau	2325d8af93	BUG/MINOR: threads: move declaration of capabilities to config.h In commit `f161d0f51` ("BUG/MINOR: pools/threads: don't ignore DEBUG_UAF on double-word CAS capable archs") I moved some defines and accidentally messed up with lockfree pools. The problem is that the HA_HAVE_CAS_DW macro is not defined anymore where the CONFIG_HAP_LOCKLESS_POOLS macro is set, so this fix implicitly disabled lockfree pools. This patch fixes this by moving the capabilities definition to config.h (probably that we'd benefit from having an "arch.h" file to declare the capabilities offered by the architecture). In a test on a 12-core machine, we used to measure 19s spent in the pool lock for 1M requests without this patch, and 0 with it so that's definitely a net saving. No backport is required, this is only for 1.9.	2018-10-10 18:29:23 +02:00
Dirkjan Bussink	c26c72d89b	CLEANUP: h1: Fix debug warnings for h1 headers The wrong method was used to debug the h1m state here. This fixes both the signature of the h1m method and also fixes the invocation to be correct.	2018-10-09 15:09:29 +02:00
Dirkjan Bussink	415150f764	MEDIUM: ssl: add support for ciphersuites option for TLSv1.3 OpenSSL released support for TLSv1.3. It also added a separate function SSL_CTX_set_ciphersuites that is used to set the ciphers used in the TLS 1.3 handshake. This change adds support for that new configuration option by adding a ciphersuites configuration variable that works essentially the same as the existing ciphers setting. Note that it should likely be backported to 1.8 in order to ease usage of the now released openssl-1.1.1.	2018-10-08 19:20:13 +02:00
Olivier Houchard	363c745569	BUG/MEDIUM: buffers: Make sure we don't wrap in ci_insert_line2/b_rep_blk. In ci_insert_line2() and b_rep_blk(), we can't afford to wrap, so don't use b_tail() to check if we do, use __b_tail() instead. This should be backported to previous versions.	2018-10-08 16:11:54 +02:00
Emmanuel Hocdet	747ca61693	MINOR: ssl: generate-certificates for BoringSSL	2018-10-08 09:42:34 +02:00
Willy Tarreau	491cec20be	CLEANUP: http: remove some leftovers from recent cleanups The prototypes of functions find_hdr_value_end(), extract_cookie_value() and http_header_match2() were still in proto_http.h while some of them don't exist anymore and the others were just moved. Let's remove them. In addition, da.c was updated to use http_extract_cookie_value() which is the correct one.	2018-10-02 18:37:27 +02:00
Willy Tarreau	61c112aa5b	REORG: http: move HTTP rules parsing to http_rules.c These ones are mostly called from cfgparse.c for the parsing and do not depend on the HTTP representation. The functions's prototypes were moved to proto/http_rules.h, making this file work exactly like tcp_rules. Ideally we should stop calling these functions directly from cfgparse and register keywords, but there are a few cases where that wouldn't work (stats http-request) so it's probably not worth trying to go this far.	2018-10-02 18:28:05 +02:00
Willy Tarreau	79e57336b5	REORG: http: move the code to different files The current proto_http.c file is huge and contains different processing domains making it very difficult to work on an alternative representation. This commit moves some parts to other files : - ACL registration code => http_acl.c This code only creates some ACL mappings and doesn't know anything about HTTP nor about the representation. This code could even have moved to acl.c but it was not worth polluting it again. - HTTP sample conversion => http_conv.c This code doesn't depend on the internal representation but definitely manipulates some HTTP elements, such as dates. It also has access to captures. - HTTP sample fetching => http_fetch.c This code does depend entirely on the internal representation but is totally independent on the analysers. Placing it into a different file will ease the transition to the new representation and the creation of a wrapper if required. An include file was created due to CHECK_HTTP_MESSAGE_FIRST() being used at various places. - HTTP action registration => http_act.c This code doesn't directly interact with the messages nor the transaction but it does so via some exported http functions like http_replace_req_line() or http_set_status() so it will be easier to change only this after the conversion. - a few very generic parts were found and moved to http.{c,h} as relevant. It is worth noting that the functions moved to these new files are not referenced anywhere outside of the files and are only called as registered callbacks, so these files do not even require associated include files.	2018-10-02 18:26:59 +02:00
Adis Nezirovic	8878f8eb3d	MEDIUM: lua: Add stick table support for Lua. This adds support for accessing stick tables from Lua. The supported operations are reading general table info, lookup by string/IP key, and dumping the table. Similar to "show table", a data filter is available during dump, and as an improvement over "show table" it's possible to use up to 4 filter expressions instead of just one (with implicit AND clause binding the expressions). Dumping with/without filters can take a long time for large tables, and should be used sparingly.	2018-09-29 20:15:01 +02:00
Olivier Houchard	0e367bbb01	BUG/MEDIUM: process_stream: Don't use si_cs_io_cb() in process_stream(). Instead of using si_cs_io_cb() in process_stream() use si_cs_send/si_cs_recv instead, as si_cs_io_cb() may lead to process_stream being woken up when it shouldn't be, and thus timeout would never get triggered.	2018-09-26 14:21:54 +02:00
Willy Tarreau	7f2a44d319	BUG/CRITICAL: hpack: fix improper sign check on the header index value Tim Düsterhus found using afl-fuzz that some parts of the HPACK decoder use incorrect bounds checking which do not catch negative values after a type cast. The first culprit is hpack_valid_idx() which takes a signed int and is fed with an unsigned one, but a few others are affected as well due to being designed to work with an uint16_t as in the table header, thus not being able to detect the high offset bits, though they are not exposed if hpack_valid_idx() is fixed. The impact is that the HPACK decoder can be crashed by an out-of-bounds read. The only work-around without this patch is to disable H2 in the configuration. CVE-2018-14645 was assigned to this bug. This patch addresses all of these issues at once. It must be backported to 1.8.	2018-09-20 11:45:56 +02:00
Willy Tarreau	55e0da664e	BUILD: connection: silence a couple of null-deref build warnings at -Wextra These ones don't need to be checked either.	2018-09-20 11:42:15 +02:00
Willy Tarreau	4ae4923c3e	MINOR: stream-int: make si_appctx() never fail Callers of si_appctx() always use the result without checking it because they know by construction that it's valid. This results in unchecked null pointer warnings at -Wextra, so let's remove this test and make it clear that it's up to the caller to check validity first.	2018-09-20 11:42:15 +02:00
Willy Tarreau	babc15e8cf	MINOR: stktable: provide an unchecked version of stktable_data_ptr() stktable_data_ptr() currently performs null pointer checks but most callers don't check the result since they know by construction that it cannot be null. This causes valid warnings when building with -Wextra which are worth addressing since it will result in better code. Let's provide an unguarded version of this function for use where the check is known to be useless and untested.	2018-09-20 11:42:15 +02:00
Willy Tarreau	4c0fcc2314	BUG/MINOR: tools: fix set_net_port() / set_host_port() on IPv4 These two functions were apparently written on the same model as their parents when added by commit `11bcb6c4f` ("[MEDIUM] IPv6 support for syslog") except that they perform an assignment instead of a return, and as a result fall through the next case where the assigned value may possibly be partially overwritten. At least under Linux the port offset is the same in both sockaddr_in and sockaddr_in6 so the value is written twice without side effects. This needs to be backported as far as 1.5.	2018-09-20 10:52:48 +02:00
Willy Tarreau	2557f6a3e2	MEDIUM: h1: better handle transfer-encoding vs content-length The transfer-encoding header processing was a bit lenient in this part because it was made to read messages already validated by haproxy. We absolutely need to reinstate the strict processing defined in RFC7230 as is currently being done in proto_http.c. That is, transfer-encoding presence alone is enough to cancel content-length, and must be terminated by the "chunked" token, except in the response where we can fall back to the close mode if it's not last. For this we now use a specific parsing function which updates the flags and we introduce a new flag H1_MF_XFER_ENC indicating that the transfer-encoding header is present. Last, if such a header is found, we delete all content-length header fields found in the message.	2018-09-14 17:40:35 +02:00
Willy Tarreau	e2c418e94b	MINOR: http: add http_hdr_del() to remove a header from a list This one removes all occurrences of the specified header field name from a complete list and returns the new count.	2018-09-14 17:40:35 +02:00
Christopher Faulet	c4e53f4ad7	MINOR: h1: Add H1_MF_XFER_LEN flag This flag is useful to handle cases where there is no body, regardless of CL or TE headers (for instance, responses to HEAD requests). It will not be set by the parser itself.	2018-09-14 16:02:40 +02:00
Willy Tarreau	98f5cf7a59	MINOR: h1: parse the Connection header field The new function h1_parse_connection_header() is called when facing a connection header in the generic parser, and it will set up to 3 bits in h1m->flags indicating if at least one "close", "keep-alive" or "upgrade" token was seen.	2018-09-13 14:52:31 +02:00
Willy Tarreau	ba5fbca33f	MINOR: h1: report in the h1m struct if the HTTP version is 1.1 or above This will be needed for the mux to know how to process the Connection header, and will save it from having to re-parse the request line since it's captured on the fly.	2018-09-13 14:34:09 +02:00
Willy Tarreau	175a2bb507	MINOR: connection: pass the proxy when creating a connection Till now it was very difficult for a mux to know what proxy it was working for. Let's pass the proxy when the mux is instantiated at init() time. It's not yet used but the H1 mux will definitely need it, just like the H2 mux when dealing with backend connections.	2018-09-12 17:39:22 +02:00
Willy Tarreau	eb528db60b	MINOR: h1: add H1_MF_TOLOWER to decide when to turn header names to lower case The h1 parser used to systematically turn header field names to lower case because it was designed for H2. Let's add a flag which is off by default to condition this behaviour so that when using it from an H1 parser it will not affect the message.	2018-09-12 17:38:26 +02:00
Willy Tarreau	11da5674c3	MINOR: h1: remove the HTTP status from the H1M struct It has nothing to do there and is not used from there anymore, let's get rid of it.	2018-09-12 17:38:25 +02:00
Willy Tarreau	001823c304	MEDIUM: h1: remove the useless H1_MSG_BODY state This state was only a delimiter between headers and body but it now causes more harm than good because it requires someone to change it. Since the H1 parser knows if we're in DATA or CHUNK_SIZE, simply let it set the right next state so that h1m->state constantly matches what is expected afterwards.	2018-09-12 17:38:25 +02:00
Willy Tarreau	a41393fc61	MEDIUM: h1: make the parser support a pointer to a start line This will allow the parser to fill some extra fields like the method or status without having to store them permanently in the HTTP message. At this point however the parser cannot restart from an interrupted read.	2018-09-12 17:38:25 +02:00
Willy Tarreau	bbf3823f82	MINOR: h1: properly pre-initialize err_pos to -2 This way we maintain the old mechanism stating that -2 means we block on errors, -1 means we only capture them, and a positive value indicates the position of the first error.	2018-09-12 17:38:25 +02:00
Willy Tarreau	ccaf233741	MINOR: h1: add a message flag to indicate that a message carries a response This flag is H1_MF_RESP. It will be used by the parser during restarts when it supports requests.	2018-09-12 17:38:25 +02:00
Willy Tarreau	7f437ff81c	MINOR: h1: provide a distinct init() function for request and response h1m_init() used to handle response only since it was used by the H1 client code. Let's have one init per direction.	2018-09-12 17:38:25 +02:00
Willy Tarreau	acc295cab3	MINOR: h1: remove the unused states from h1m_state States ERROR, 100_SENT, ENDING, CLOSE, CLOSING are not used at all for the parsers. It's possible that a few others may disappear as well.	2018-09-12 17:38:25 +02:00
Willy Tarreau	b3b0152b6f	MINOR: h1: add the restart offsets into struct h1m Currently the only user of struct h1m is the h2 mux when it has to parse an H1 message coming from the channel. Unfortunately this is not enough to efficiently parse HTTP/1 messages like those coming from the network as we don't want to restart from scratch at every byte received. This patch reintroduces the "next" offset into the H1 message so that any H1 parser can use it to restart when called with a state that is not the initial state.	2018-09-12 17:38:25 +02:00
Willy Tarreau	801250e07d	REORG: h1: create a new h1m_state This is the parsing state of an HTTP/1 message. Currently the h1_state is composite as it's made both of parsing and control (100SENT, BODY, DONE, TUNNEL, ENDING etc). The purpose here is to have a purely H1 state that can be used by H1 parsers. For now it's equivalent to h1_state.	2018-09-12 17:38:25 +02:00
Olivier Houchard	71384551fe	MINOR: conn_streams: Remove wait_list from conn_streams. The conn_streams won't be used for subscribing/waiting for I/O events, after all, so just remove its wait_list, and send/recv/_wait_list.	2018-09-12 17:37:55 +02:00
Olivier Houchard	26e1a8f2bf	MINOR: checks: Give checks their own wait_list. Instead of (ab)using the conn_stream's wait_list, which should disappear, give the checks their own wait_list.	2018-09-12 17:37:55 +02:00
Olivier Houchard	cb1f49ff93	MINOR: connections: Add a "handle" field to wait_list. Add a new field to struct wait_list, "handle", that can be used by the entity in charge of subscribing.	2018-09-12 17:37:55 +02:00
Olivier Houchard	af4021e680	MEDIUM: connections: Get rid of the recv() method. Remove the recv() method from mux and conn_stream. The goal is to always receive from the upper layers, instead of waiting for the connection later. For now, recv() is still called from the wake() method, but that should change soon.	2018-09-12 17:37:55 +02:00
Olivier Houchard	4cf7fb148f	MEDIUM: connections/mux: Add a recv and a send+recv wait list. For struct connection, struct conn_stream, and for the h2 mux, add 2 new lists, one that handles waiters for recv, and one that handles waiters for recv and send. That way we can ask to subscribe for either recv or send.	2018-09-12 17:37:55 +02:00
Olivier Houchard	931624a00b	BUG/MEDIUM: tasks: Don't forget to decrement task_list_size in tasklet_free(). In tasklet_free(), if we're currently in the runnable task list, don't forget to decrement task_list_size, or it'll end up being too big, and we may not process tasks in the global runqueue.	2018-09-12 17:37:55 +02:00
William Lallemand	2fe7dd0b2e	MEDIUM: protocol: sockpair protocol This protocol is based on the uxst one, but it uses socketpair and FD passing instead of a connect()/accept(). The "sockpair@" prefix has been implemented for both bind and server keywords. When HAProxy wants to connect through a sockpair@, it creates 2 new sockets using the socketpair() syscall and passes one of the sockets through the FD specified on the server line. On the bind side, haproxy will receive the FD, and will use it like it was the FD of an accept() syscall. This protocol was designed for internal communication within HAProxy between the master and the workers, but it's possible to use it externally with a wrapper and pass the FD through environment variables.	2018-09-12 07:20:17 +02:00
William Lallemand	2d3f8a411f	MEDIUM: protocol: use a custom AF_MAX to help protocol parser It's possible to have several protocols per family which is a problem with the current way the protocols are stored. This allows to register a new protocol in HAProxy which is not a protocol in the strict socket definition. It will be used to register a SOCK_STREAM protocol using socketpair().	2018-09-12 07:12:27 +02:00
Willy Tarreau	ab813a4b05	REORG: http: move some header value processing functions to http.c The following functions only deal with header field values and are agnostic to the HTTP version so they were moved to http.c : http_header_match2(), find_hdr_value_end(), find_cookie_value_end(), extract_cookie_value(), parse_qvalue(), http_find_url_param_pos(), http_find_next_url_param(). Those lacking the "http_" prefix were modified to have it.	2018-09-11 10:30:25 +02:00
Willy Tarreau	04f1e2d202	REORG: http: move error codes production and processing to http.c These error codes and messages are agnostic to the version, even if they are represented as HTTP/1.0 messages. Ultimately they will have to be transformed into internal HTTP messages to be used everywhere. The HTTP/1.1 100 Continue message was turned to an IST and the local copy in the Lua code was removed.	2018-09-11 10:30:25 +02:00
Willy Tarreau	6b952c8101	REORG: http: move http_get_path() to http.c This function is purely HTTP once http_txn is put aside. So the original one was renamed to http_txn_get_path() and it extracts the relevant offsets from the txn to pass them to http_get_path(). One benefit of the new version is that it returns the length at the same time so that allowed to slightly simplify http_get_path_from_string() which had to look up the end pointer previously and which is not needed anymore.	2018-09-11 10:30:25 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
William Lallemand	e22f11ff47	MINOR: mworker: keep and clean the listeners Keep the listeners that should be used in the master process and clean them in the workers.	2018-09-11 10:23:24 +02:00
William Lallemand	d3801c1c21	MEDIUM: startup: unify signal init between daemon and mworker mode The signals are now unblocked only once the configuration have been parsed.	2018-09-11 10:21:58 +02:00
Willy Tarreau	4bc7d90d3b	MEDIUM: snapshot: merge the captured data after the descriptor Instead of having a separate area for the captured data, we now have a contiguous block made of the descriptor and the data. At the moment, since the area is dynamically allocated, we can adjust its size to what is needed, but the idea is to quickly switch to a pool and an LRU list.	2018-09-07 20:07:17 +02:00
Willy Tarreau	c55015ee5b	MEDIUM: snapshots: dynamically allocate the snapshots Now upon error we dynamically allocate the snapshot instead of overwriting it. This way there is no more memory wasted in the proxy to hold the two error snapshot descriptors. Also an appreciable side effect of this is that the proxy's lock is only taken during the pointer swap, no more while copying the buffer's contents. This saves 480 bytes of memory per proxy.	2018-09-07 19:59:58 +02:00
Willy Tarreau	fd9419d560	MINOR: http: remove the pointer to the error snapshot in http_capture_bad_message() It's not needed anymore as we know the side thanks to the channel. This will allow the proxy generic code to better manage the error snapshots.	2018-09-07 18:36:04 +02:00
Willy Tarreau	75fb65a51f	MINOR: proxy: add a new generic proxy_capture_error() This function now captures an error regardless of its side and protocol. The caller must pass a number of elements and may pass a protocol-specific structure and a callback to display it. Later this function may deal with more advanced allocation techniques to avoid allocating as many buffers as proxies.	2018-09-07 18:36:04 +02:00
Willy Tarreau	7ccdd8dad9	MEDIUM: snapshot: implement a show() callback and use it for HTTP The HTTP dumps are now configurable in the code : "show errors" now calls a protocol-specific function to emit the decoded output. For now only HTTP is implemented.	2018-09-07 18:36:01 +02:00
Willy Tarreau	7480f323ff	MINOR: snapshot: split the error snapshots into common and proto-specific parts The idea will be to make the error snapshot feature accessible to other protocols than just HTTP. This patch only introduces an "http_snapshot" structure and renames a few fields to make things more explicit. The HTTP part was installed inside a union so that we can easily add more protocols in the future.	2018-09-07 16:13:45 +02:00
Willy Tarreau	5865a8fe69	MINOR: snapshot: restart on the event ID and not the stream ID The snapshots have the ability to restart a partial dump and they use the stream ID as the restart point. Since it's purely HTTP, let's use the event ID instead.	2018-09-07 15:00:43 +02:00
Olivier Houchard	54620523e2	MINOR: log: One const should be enough. "const const" doesn't bring much more constness, so only use one.	2018-09-06 18:52:15 +02:00
Willy Tarreau	57f8185625	MINOR: connection: add new function conn_is_back() This function returns true if the connection is a backend connection and false if it's a frontend connection.	2018-09-06 14:52:21 +02:00
Willy Tarreau	6ac98ac1be	MINOR: connection: add new function conn_get_proxy() This function returns the proxy associated to a connection. For front connections it returns the frontend, and for back connections it returns the backend. This will be used to retrieve some configuration parameters from within a mux.	2018-09-06 11:48:44 +02:00
Willy Tarreau	be373150c7	MINOR: connection: make the initialization more consistent Sometimes a connection is prepared before the target is set, sometimes after. There's no real rule since the few functions involved operate on different and independent fields. Soon we'll benefit from knowing the target at the connection layer, in order to figure the associated proxy and retrieve the various parameters (timeouts etc). This patch slightly reorders a few calls to conn_prepare() so that we can make sure that the target is always known to the mux.	2018-09-06 11:45:30 +02:00
Willy Tarreau	5383935856	MINOR: log: provide a function to emit a log for a session The new function sess_log() only needs a session to emit a log. It will ignore the parts that depend on the stream. It is usable to emit a log to report early errors in muxes. These ones will typically mention "<BADREQ>" for the request and 0 for the HTTP status code.	2018-09-06 09:43:41 +02:00
Willy Tarreau	26ffa8544d	CLEANUP: log: make the low_level lf_{ip,port,text,text_len} functions take consts These ones were abusively relying on variables making it hard to integrate with const arguments.	2018-09-05 20:01:23 +02:00
Willy Tarreau	43c538eab6	MINOR: log: move the log code to sess_build_logline() to add extra arguments The current build_logline() can only be used with valid streams, which means it is not suitable for use from muxes. We start by moving it into another more generic function which takes the session as an argument, to avoid complexifying all the internal API for just a few use cases. This new function is not supposed to be called directly from outside so we'll be able to instrument it to support several calling conventions. For now the behaviour and conditions remain unchanged.	2018-09-05 20:01:23 +02:00
Willy Tarreau	ec3750c590	BUG/MAJOR: buffer: fix incorrect check in __b_putblk() This function was split in two at commit `f7d0447` ("MINOR: buffers: split b_putblk() into __b_putblk()") but it's wrong, the first half's length is not adjusted to the requested size so it copies more than desired. This is purely 1.9-specific, no backport is needed.	2018-09-05 20:01:14 +02:00
Willy Tarreau	590a0514f2	BUG/MEDIUM: session: fix reporting of handshake processing time in the logs The handshake processing time used to be stored per stream, which was valid when there was exactly one stream per session. With H2 and multiplexing it's not the case anymore and the reported handshake times are wrong in the logs as it's computed between the TCP accept() and the stream creation. Let's first move the handshake where it belongs, which is the session. However, this is not enough because we don't want to report an excessive idle time either for H2 (since many requests use the connection). So the solution used here is to have the stream retrieve sess->tv_accept and the handshake duration when the stream is created, and let the mux immediately reset them. This way, the handshake time becomes zero for the second and subsequent requests in H2 (which was already the case in H1), and the idle time exactly counts how long the connection remained unused while it could be used, so in H1 it runs from the end of the previous response and in H2 it runs from the end of the previous request since the channel is already available. This patch will need to be backported to 1.8.	2018-09-05 16:30:23 +02:00
Willy Tarreau	9378df89f6	MINOR: thread: implement HA_ATOMIC_XADD() We've been missing it several times and now we'll need it to increment a request counter. Let's do it once for all. This patch will need to be backported to 1.8 with the associated fix.	2018-09-05 16:30:17 +02:00
Willy Tarreau	f16cb41d19	MINOR: tools: make date2str_log() take some consts The "tm" and "date" field are not modified, they can be const instead of forcing their callers to use vars.	2018-09-05 16:30:11 +02:00
Baptiste Assmann	6d0f38f00d	BUG/MEDIUM: dns/server: fix incompatibility between SRV resolution and server state file Server state file has no indication that a server is currently managed by a DNS SRV resolution. And thus, both features (DNS SRV resolution and server state), when used together, do not provide the expected behavior: a smooth experience... This patch introduces the "SRV record name" in the server state file and loads and applies it if found and wherever required. This patch applies to haproxy-dev branch only. For backport, a specific patch is provided for 1.8.	2018-09-04 17:40:22 +02:00
Willy Tarreau	e215bba956	MINOR: connection: make conn_sock_drain() work for all socket families This patch improves the previous fix by implementing the socket draining code directly in conn_sock_drain() so that it always applies regardless of the protocol's family. Thus it gets rid of tcp_drain().	2018-08-24 14:45:46 +02:00
Willy Tarreau	b509232eb8	MINOR: sample: remove impossible tests on negative smp->data.u.str.data Since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct") a chunk length is unsigned so we can remove negative size checks.	2018-08-22 05:28:33 +02:00
Willy Tarreau	bba81563cf	MINOR: chunk: remove impossible tests on negative chunk->data Since commit `843b7cb` ("MEDIUM: chunks: make the chunk struct's fields match the buffer struct") a chunk length is unsigned so we can remove negative size checks.	2018-08-22 05:28:32 +02:00
Willy Tarreau	1b13bfd646	BUG/MEDIUM: connection: don't forget to always delete the list's head During a test it happened that a connection was deleted before the stream it's attached to, resulting in a crash related to the fix `18a85fe` ("BUG/MEDIUM: streams: Don't forget to remove the si from the wait list.") during the LIST_DEL(). Make sure to always delete the list's head in this case so that other elements can safely detach later. This is purely 1.9, no backport is needed.	2018-08-21 18:33:20 +02:00
Olivier Houchard	abedf5f6c3	BUG/MEDIUM: tasklets: Add the thread as active when waking a tasklet. Set the flag for the current thread in active_threads_mask when waking a tasklet, or we will never run it if no tasks are available. This is 1.9-specific, no backport is needed.	2018-08-21 18:06:33 +02:00
Olivier Houchard	6aab737835	MINOR: fd cache: And the thread_mask with all_threads_mask. When we choose to insert a fd in either the global or the local fd update list, AND the thread_mask against all_threads_mask before checking if it's tid_bit; that way, if we run with nbthreads==1, we will always use the local list, which is cheaper than the global one.	2018-08-17 14:50:47 +02:00
Olivier Houchard	8f0b4c66f5	MINOR: stream_interface: Give stream_interface its own wait_list. Instead of just using the conn_stream wait_list, give the stream_interface its own. When the conn_stream will have its own buffers, the stream_interface may have to wait on it.	2018-08-16 17:29:54 +02:00
Olivier Houchard	91894cbf4c	MINOR: stream_interface: Don't use si_cs_send() as a task handler. Instead of using si_cs_send() as a task handler, define a new function, si_cs_io_cb(), and give si_cs_send() its original prototype. Right now si_cs_io_cb() just handles send, but later it'll handle recv() too.	2018-08-16 17:29:54 +02:00
Olivier Houchard	e1c6dbcd70	MINOR: connections/mux: Add the wait reason(s) to wait_list. Add a new element to the wait_list, that let us know which event(s) we are waiting on.	2018-08-16 17:29:53 +02:00
Olivier Houchard	5d18718c8f	MINOR: tasks: Allow tasklet_wakeup() to wakeup a task. Modify tasklet_wakeup() so that it handles a task as well, and inserts it directly into the tasklet list, making it effectively a tasklet. This should make future developments easier.	2018-08-16 17:29:53 +02:00
Olivier Houchard	ed0f207ef5	MINOR: connections: Get rid of txbuf. Remove txbuf from conn_stream. It is not used yet, and its only user will probably be the mux_h2, so it will be better suited in the struct h2s.	2018-08-16 17:29:51 +02:00
Olivier Houchard	638b799b09	MINOR: connections: Move rxbuf from the conn_stream to the h2s. As the mux_h2 is the only user of rxbuf, move it to the struct h2s, instead of conn_stream.	2018-08-16 17:28:11 +02:00
Olivier Houchard	511efeae7e	MINOR: connections: Make rcv_buf mandatory and nuke cs_recv(). Reintroduce h2_rcv_buf(), right now it just does what cs_recv() did, but should be modified later.	2018-08-16 17:23:44 +02:00
Patrick Hemmer	268a707a3d	MEDIUM: add set-priority-class and set-priority-offset This adds the set-priority-class and set-priority-offset actions to http-request and tcp-request content. At this point they are not used yet, which is the purpose of the next commit, but all the logic to set and clear the values is there.	2018-08-10 15:06:31 +02:00
Patrick Hemmer	0355dabd7c	MINOR: queue: replace the linked list with a tree We'll need trees to manage the queues by priorities. This change replaces the list with a tree based on a single key. It's effectively a list but allows us to get rid of the list management right now.	2018-08-10 15:06:27 +02:00
Patrick Hemmer	da282f4a8f	MINOR: queue: store the queue index in the stream when enqueuing We store the queue index in the stream and check it on dequeueing to figure how many entries were processed in between. This way we'll be able to count the elements that may later be added before ours.	2018-08-10 15:06:25 +02:00
Patrick Hemmer	ffe5e8c638	MINOR: stream: rename {srv,prx}_queue_size to *_queue_pos The current name is misleading as it implies a queue size, but the value instead indicates a position in the queue. The value is only the queue size at the exact moment the element is enqueued. Soon we will gain the ability to insert anywhere into the queue, upon which clarity of the name is more important.	2018-08-10 15:04:14 +02:00
Willy Tarreau	287527a176	BUG/MEDIUM: connection/mux: take care of serverless proxies Commit `7ce0c89` ("MEDIUM: mux: Use the mux protocol specified on bind/server lines") assumed a bit too strongly that we could only have servers on the connect side :-) It segfaults under this config : defaults contimeout 5s clitimeout 5s srvtimeout 5s mode http listen test1 bind :8001 dispatch 127.0.0.1:8002 frontend test2 mode http bind :8002 redirect location / No backport needed.	2018-08-08 18:44:43 +02:00
Christopher Faulet	7ce0c891ab	MEDIUM: mux: Use the mux protocol specified on bind/server lines To do so, mux choices are split to handle incoming and outgoing connections in a different way. The protocol specified on the bind/server line is used in priority. Then, for frontend connections, the ALPN is retrieved and used to choose the best mux. For backend connections, there is no ALPN. Finally, if no protocol is specified and no protocol matches the ALPN, we fall back on a default mux, choosing in priority the first mux with exactly the same mode.	2018-08-08 10:42:08 +02:00
Christopher Faulet	8ed0a3e32a	MINOR: mux/server: Add 'proto' keyword to force the multiplexer's protocol For now, it is parsed but not used. Tests are done on it to check if the side and the mode are compatible with the server's definition.	2018-08-08 10:42:08 +02:00
Christopher Faulet	a717b99284	MINOR: mux/frontend: Add 'proto' keyword to force the mux protocol For now, it is parsed but not used. Tests are done on it to check if the side and the mode are compatible with the proxy's definition.	2018-08-08 10:41:11 +02:00
Christopher Faulet	9c9ef03bf4	MINOR: mux: Improve the message with the list of existing mux protocols Because there can be several default multiplexers (without name), they are now reported with the name "<default>". And a message warns they cannot be referenced with the "proto" keyword on a bind line or a server line.	2018-08-08 10:41:11 +02:00
Christopher Faulet	e15c6c48ef	MINOR: mux: Change get_mux_proto to get an ist as parameter It simplifies the API and eases comparisons with the multiplexers token (which is an ist too).	2018-08-08 10:41:11 +02:00
Christopher Faulet	259e473ecc	BUG/MINOR: threads: Remove the unexisting lock label "UPDATED_SERVERS_LOCK" The update lock was removed by the commit `91c2826e1` ("CLEANUP: server: remove the update list and the update lock"). But the lock label was not which makes the compilation fail in debug mode.	2018-08-08 10:41:11 +02:00
Willy Tarreau	91c2826e1d	CLEANUP: server: remove the update list and the update lock These ones are no longer used, let's get rid of them.	2018-08-08 09:57:45 +02:00
Willy Tarreau	3ff577e165	MAJOR: server: make server state changes synchronous again Now we try to synchronously push updates as they come using the new rdv point, so that the call to the server update function from the main poll loop is not needed anymore. It further reduces the apparent latency in the health checks as the response time almost always appears as 0 ms, resulting in a slightly higher check rate of ~1960 conn/s. Despite this, the CPU consumption has slightly dropped again to ~32% for the same test. The only trick is that the checks code is built with a bit of recursivity because srv_update_status() calls server_recalc_eweight(), and the latter needs to signal srv_update_status() in case of updates. Thus we added an extra argument to this function to indicate whether or not it must propagate updates (no if it comes from srv_update_status).	2018-08-08 09:57:45 +02:00
Willy Tarreau	647c70b681	MINOR: threads: remove the previous synchronization point It's not needed anymore as it is fully covered by the new rendez-vous point. This also removes the pipe and its polling.	2018-08-08 09:57:45 +02:00
Christopher Faulet	98d9fe21e0	MINOR: mux: Print the list of existing mux protocols during HA startup This is done in verbose/debug mode and when build options are reported.	2018-08-08 09:54:22 +02:00
Christopher Faulet	32f61c0421	MINOR: mux: Unlink ALPN and multiplexers to rather speak of mux protocols Multiplexers are not necessarily associated to an ALPN. ALPN is a TLS extension, so it is not always defined or used. Instead, we now rather speak of multiplexer's protocols. So in this patch, there are no significant changes, some structures and functions are just renamed.	2018-08-08 09:54:22 +02:00
Christopher Faulet	2d5292a412	MINOR: mux: Add info about the supported side in alpn_mux_list structure Now, a multiplexer can specify if it can be installed on incoming connections (ALPN_SIDE_FE), on outgoing connections (ALPN_SIDE_BE) or both (ALPN_SIDE_BOTH). These flags are compatible with proxies' ones.	2018-08-08 09:54:22 +02:00
Christopher Faulet	063f786553	MINOR: conn_stream: add cs_send() as a default snd_buf() function This function is generic and is able to automatically transfer data from a buffer to the conn_stream's tx buffer. It does this automatically if the mux doesn't define another snd_buf() function. It cannot yet be used as-is with the conn_stream's txbuf without risking to lose data on close since conn_streams need to be orphaned for this.	2018-08-08 09:53:58 +02:00
Christopher Faulet	3c51802fb9	MINOR: conn_stream: add a tx buffer to the conn_stream To be symmetrical with the recv() part, we now handle retryable and partial transmission using an intermediary buffer in the conn_stream. For now it's only set to BUF_NULL and never allocated nor used. It cannot yet be used as-is without risking to lose data on close since conn_streams need to be orphaned for this.	2018-08-08 09:53:01 +02:00
Christopher Faulet	d44a9b3627	MEDIUM: mux: Remove const on the buffer in mux->snd_buf() This is a partial revert of the commit `deccd1116` ("MEDIUM: mux: make mux->snd_buf() take the byte count in argument"). It is a requirement to do zero-copy transfers. This will be mandatory when the TX buffer of the conn_stream will be used. So, now, data are consumed by mux->snd_buf() and not only sent. So it needs to update the buffer state. On its side, the caller must be aware the buffer can be replaced by an empty or unallocated one. As a side effect of this change, the function co_set_data() is now only responsible to update the channel set, by updating the ->output field.	2018-08-07 14:36:52 +02:00
Christopher Faulet	ad4e1a4735	BUG/MINOR: buffers: Fix b_slow_realign when a buffer is realigned without output When b_slow_realign is called with the <output> parameter equal to 0, the buffer's head, after the realign, must be set to 0. It was erroneously set to the buffer's size, because there was no test on the value of <output>.	2018-08-06 15:56:40 +02:00
Willy Tarreau	60b639ccbe	MEDIUM: hathreads: implement a more flexible rendez-vous point The current synchronization point enforces certain restrictions which are hard to work around in certain areas of the code. The fact that the critical code can only be called from the sync point itself is a problem for some callback-driven parts. The "show fd" command for example is fragile regarding this. Also it is expensive in terms of CPU usage because it wakes every other thread just to be sure all of them join the rendez-vous point. It's a problem because the sleeping threads would not need to be woken up just to know they're doing nothing. Here we implement a different approach. We keep track of harmless threads, which are defined as those either doing nothing, or doing harmless things. The rendez-vous is used "for others" as a way for a thread to isolate itself. A thread then requests to be alone using thread_isolate() when approaching the dangerous area, and then waits until all other threads are either doing the same or are doing something harmless (typically polling). The function only returns once the thread is guaranteed to be alone, and the critical section is terminated using thread_release().	2018-08-02 17:51:45 +02:00
Willy Tarreau	0c026f49e7	MINOR: threads: add more consistency between certain variables in no-thread case When threads are disabled, some variables such as tid and tid_bit are still checked everywhere, the MAX_THREADS_MASK macro is ~0UL while MAX_THREADS is 1, and the all_threads_mask variable is replaced with a macro forced to zero. The compiler cannot optimize away all this code involving checks on tid and tid_bit, and we end up in special cases where all_threads_mask has to be specifically tested for being zero or not. It is not even certain the code paths are always equivalent when testing without threads and with nbthread 1. Let's change this to make sure we always present a single thread when threads are disabled, and have the relevant values declared as constants so that the compiler can optimize all the tests away. Now we have MAX_THREADS_MASK set to 1, all_threads_mask set to 1, tid set to zero and tid_bit set to 1. Doing just this has removed 4 kB of code in the no-thread case. A few checks for all_threads_mask==0 have been removed since it never happens anymore.	2018-08-02 17:48:09 +02:00
Willy Tarreau	c03ea40763	BUILD/MINOR: compiler: fix offsetof() on older compilers An offsetof() macro was introduced with commit `928fbfa` ("MINOR: compiler: introduce offsetoff().") with a fallback for older compilers. But this breaks gcc 3.4 because __size_t and __uintptr_t are not defined there. However size_t and uintptr_t are, so let's fix it this way. No backport needed.	2018-07-30 11:49:35 +02:00
Willy Tarreau	0ccd32285f	MINOR: threads: move "nbthread" parsing to hathreads.c The purpose is to make sure that all variables which directly depend on this nbthread argument are set at the right moment. For now only all_threads_mask needs to be set. It used to be set while calling thread_sync_init() which is called too late for certain checks. The same function handles threads and non-threads, which removes the need for some thread-specific knowledge from cfgparse.c.	2018-07-30 11:10:46 +02:00
Olivier Houchard	3e12304ae0	BUG/MINOR: threads: Handle nbthread == MAX_THREADS. If nbthread is MAX_THREADS, the shift operation needed to compute all_threads_mask fails in thread_sync_init(). Instead pass a number of threads to this function and let it compute the mask without overflowing. This should be backported to 1.8.	2018-07-27 17:18:22 +02:00
Emmanuel Hocdet	ebabd8768a	MINOR: ssl: BoringSSL matches OpenSSL 1.1.0 Since BoringSSL 3b2ff028, the API now correctly matches OpenSSL 1.1.0. The patch reverts part of haproxy `019f9b10`: "Fix BoringSSL call and openssl-compat.h/#define occordingly.". This will not break openssl/libressl compat.	2018-07-27 09:43:40 +02:00
Olivier Houchard	79321b95a8	MINOR: pollers: Add a way to wake a thread sleeping in the poller. Add a new pipe, one per thread, so that we can write on it to wake a thread sleeping in a poller, and use it to wake threads supposed to take care of a task, if they are all sleeping.	2018-07-26 19:09:50 +02:00
Olivier Houchard	9b03c0c9a7	MINOR: tasks: Make active_tasks_mask volatile. To be sure we have the relevant information, make active_tasks_mask volatile.	2018-07-26 19:09:50 +02:00
Willy Tarreau	3201e4e428	MEDIUM: queue: get rid of the pendconn lock This lock was necessary to manipulate the pendconn element between concurrent places, but was causing great difficulties in the list walk by having to iterate over multiple entries instead of being able to safely pick the first one (in fact the first element was always the right one but the locking model was hard to prove). Here since we know we can always rely on the queue's locks, we take the queue's lock every time we need to modify the element. In practice it was already the case everywhere except in pendconn_dequeue() which only works on an element that was already detached. This function had to be protected against the risk of meeting an incompletely detached element (which could be unlinked but not yet assigned). By taking the queue lock around the LIST_ISEMPTY test, it's enough to ensure that a concurrent thread either didn't begin or had completed the operation. The true benefit really is in pendconn_process_next_strm() where we can again safely work with the first element of each queue. This will significantly simplify next updates to this code.	2018-07-26 17:32:51 +02:00
Willy Tarreau	88930dd364	MINOR: queue: use a distinct variable for the assigned server and the queue The pendconn struct uses ->px and ->srv to designate where the element is queued. There is something confusing regarding threads though, because we have to lock the appropriate queue before inserting/removing elements, and this queue may only be determined by looking at ->srv (if it's not NULL it's the server, otherwise use the proxy). But pendconn_grab_from_px() and pendconn_process_next_strm() both assign this ->srv field, making it complicated to know what queue to lock before manipulating the element, which is exactly why we have the pendconn_lock in the first place. This commit introduces pendconn->target which is the target server that the two aforementioned functions will set when assigning the server. Thanks to this, the server pointer may always be relied on to determine what queue to use.	2018-07-26 17:32:51 +02:00
Willy Tarreau	d0ad4a87f0	MEDIUM: queue: make pendconn_free() work on the stream instead Now pendconn_free() takes a stream, checks that pend_pos is set, clears it, and uses pendconn_unlink() to complete the job. It's cleaner and centralizes all the bookkeeping work in pendconn_unlink() only and ensures that there's a single place where the stream's position in the queue is manipulated.	2018-07-26 17:32:51 +02:00
Willy Tarreau	9624faec86	MINOR: queue: centralize dequeuing code a bit better For now the pendconns may be dequeued at two places : - pendconn_unlink(), which operates on a locked queue - pendconn_free(), which operates on an unlocked queue and frees everything. Some changes are coming to the queue and we'll need to be able to be a bit stricter regarding the places where we dequeue to keep the accounting accurate. This first step renames the locked function __pendconn_unlink() as it's for use by those aware of it, and introduces a new general purpose pendconn_unlink() function which automatically grabs the necessary locks before calling the former, and pendconn_cond_unlink() which additionally checks the pointer and the presence in the queue.	2018-07-26 17:32:48 +02:00
Olivier Houchard	77551ee8a7	BUG/MEDIUM: tasks: make __task_unlink_rq responsible for the rqueue size. As __task_wakeup() is responsible for increasing rqueue_local[tid]/global_rqueue_size, make __task_unlink_rq responsible for decreasing it, as process_runnable_tasks() isn't the only one that removes tasks from runqueues.	2018-07-26 16:33:29 +02:00
Olivier Houchard	76e45181b2	MINOR: tasks: Add a flag that tells if we're in the global runqueue. Now that we have bits available in task->state, add a flag that tells if we're in the global runqueue or not.	2018-07-26 16:33:10 +02:00
Willy Tarreau	f0cea1ee3f	MINOR: tasks: extend the state bits from 8 to 16 and remove the reason By removing the reason code for the wakeup we can gain 8 extra bits to encode the task's state. The reason code was never used at all and is wrong by design since subsequent calls will OR this value anyway. Let's say it goodbye and leave the room for more precious bits. The woken bits were moved to the higher byte so that the most important bits can stay grouped together.	2018-07-26 16:13:00 +02:00
Willy Tarreau	7999bfbfd3	MEDIUM: buffers: make b_xfer() automatically swap buffers when possible Whenever it's possible to avoid a copy, b_xfer() will simply swap the buffer's heads without touching the data. This has brought the performance back from 140 kH/s to 202 kH/s on the test case.	2018-07-20 19:21:43 +02:00
Willy Tarreau	11c9aa424e	MEDIUM: conn_stream: add cs_recv() as a default rcv_buf() function This function is generic and is able to automatically transfer data from a conn_stream's rx buffer to the destination buffer. It does this automatically if the mux doesn't define another rcv_buf() function.	2018-07-20 19:21:43 +02:00
Willy Tarreau	5e1cc5ea83	MINOR: conn_stream: add an rx buffer to the conn_stream In order to reorganize the connection layers, recv() operations will need to be retryable and to support partial transfers. This requires an intermediary buffer to hold the data coming from the mux. After a few attempts, it turns out that this buffer is best placed inside the conn_stream itself. For now it's only set to buf_empty and it will be up to the caller to allocate it if required.	2018-07-20 19:21:43 +02:00
Willy Tarreau	a3f7efe009	MINOR: conn_stream: add a new CS_FL_REOS flag This flag indicates that the mux layer has already detected an end of stream which will become CS_FL_EOS during a recv() once the rx buffer is empty.	2018-07-20 19:21:43 +02:00
Willy Tarreau	f148888d19	MINOR: buffers: add b_xfer() to transfer data between buffers Instead of open-coding buffer-to-buffer transfers using blocks, let's have a dedicated function for this. It also adjusts the buffer counts.	2018-07-20 19:21:43 +02:00
Willy Tarreau	f7d0447376	MINOR: buffers: split b_putblk() into __b_putblk() The latter function is more suited to operations that don't require any check because the check has already been performed. It will be used by other b_* functions.	2018-07-20 19:21:43 +02:00
Willy Tarreau	ab322d4fd4	MINOR: buffers: simplify b_contig_space() This function is used a lot in block copies and is needlessly complicated since it still uses pointer arithmetic. Let's fall back to regular offsets and simplify it. This removed around 23 bytes from b_putblk() and it removed any conditional jump.	2018-07-20 19:21:43 +02:00
Christopher Faulet	ddb6c16576	BUG/MEDIUM: threads: Fix the exit condition of the thread barrier In thread_sync_barrier, we exit when all threads have set their own bit in the barrier mask. It is done by comparing it to all_threads_mask. But we must not use a simple equality to do so, because all_threads_mask may change. Since commit `ba86c6c25` ("MINOR: threads: Be sure to remove threads from all_threads_mask on exit"), when a thread exits, its bit is removed from all_threads_mask. Instead, we must use a bitwise AND to test if all bits of all_threads_mask are set. This also requires that all_threads_mask is set to volatile if we want to catch changes. This patch must be backported in 1.8.	2018-07-20 14:24:41 +02:00
Christopher Faulet	20761453fb	MINOR: ist: Add the function isteqi This new function does the same as isteq, but ignoring the case.	2018-07-20 13:39:30 +02:00
Willy Tarreau	8318885487	MINOR: connection: simplify subscription by adding a registration function This new function wl_set_waitcb() prepopulates a wait_list with a tasklet and a context and returns it so that it can be passed to ->subscribe() to be added to a connection or conn_stream's wait_list. The caller doesn't need to know all the insiders details anymore this way.	2018-07-19 18:31:07 +02:00
Olivier Houchard	910b2bc829	MEDIUM: connections/mux: Revamp the send direction. Totally nuke the "send" method, instead, the upper layer decides when it's time to send data, and if it's not possible, uses the new subscribe() method to be called when it can send data again.	2018-07-19 18:31:07 +02:00
Olivier Houchard	6ff2039d13	MINOR: connections/mux: Add a new "subscribe" method. Add a new "subscribe" method for connection, conn_stream and mux, so that upper layer can subscribe to them, to be called when the event happens. Right now, the only event implemented is "SUB_CAN_SEND", where the upper layer can register to be called back when it is possible to send data. The connection and conn_stream got a new "send_wait_list" entry, which required to move a few struct members around to maintain an efficient cache alignment (and actually this slightly improved performance).	2018-07-19 16:23:43 +02:00
Olivier Houchard	e17c2d3e57	MINOR: tasklets: Don't attempt to add a tasklet in the list twice. Don't try to add a tasklet to the run queue if it's already in there, or we might get an infinite loop.	2018-07-19 16:23:43 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Willy Tarreau	c9fa0480af	MAJOR: buffer: finalize buffer detachment Now the buffers only contain the header and a pointer to the storage area which can be anywhere. This will significantly simplify buffer swapping and will make it possible to map chunks on buffers as well. The buf_empty variable was removed, as now it's enough to have size==0 and area==NULL to designate the empty buffer (thus a non-allocated head is the empty buffer by default). buf_wanted for now is indicated by size==0 and area==(void *)1. The channels and the checks now embed the buffer's head, and the only pointer is to the storage area. This slightly increases the unallocated buffer size (3 extra ints for the empty buffer) but considerably simplifies dynamic buffer management. It will also later permit to detach unused checks. The way the struct buffer is arranged has proven quite efficient on a number of tests, which makes sense given that size is always accessed and often first, followed by the other ones.	2018-07-19 16:23:43 +02:00
Willy Tarreau	bd1dba8a89	MINOR: buffer: rename the data length member to '->data' It used to be called 'len' during the reorganisation but strictly speaking it's not a length since it wraps. Also we already use '_data' as the suffix to count available data, and data is also what we use to indicate the amount of data in a pipe so let's improve consistency here. It was important to do this in two operations because data used to be the name of the pointer to the storage area.	2018-07-19 16:23:43 +02:00
Willy Tarreau	e3128024bf	MINOR: buffer: replace buffer_replace2() with b_rep_blk() This one is more generic and designed to work on a random block. It may later get a b_rep_ist() variant since many strings are already available as (ptr,len).	2018-07-19 16:23:43 +02:00
Willy Tarreau	4d893d440c	MINOR: buffers/channel: replace buffer_insert_line2() with ci_insert_line2() There was no point keeping that function in the buffer part since it's exclusively used by HTTP at the channel level, since it also automatically appends the CRLF. This further cleans up the buffer code.	2018-07-19 16:23:43 +02:00
Willy Tarreau	7b04cc4467	CLEANUP: buffer: minor cleanups to buffer.h Remove a few unused functions and add some comments to split the file parts in sections.	2018-07-19 16:23:43 +02:00
Willy Tarreau	911f7dd893	MINOR: buffers: remove b_putstr() It's not needed anymore.	2018-07-19 16:23:43 +02:00
Willy Tarreau	ea1b06d5bb	MINOR: buffer: add a new file for ist + buffer manipulation functions The new file istbuf.h links the indirect strings (ist) with the buffers. The purpose is to encourage addition of more standard buffer manipulation functions that rely on this in order to improve the overall ease of use along all the code. Just like ist.h and buf.h, this new file is not expected to depend on anything beyond these two files. A few functions were added and/or converted from buffer.h : - b_isteq() : indicates if a buffer and a string match - b_isteat() : consumes a string from the buffer if it matches - b_istput() : appends a small string to a buffer (all or none) - b_putist() : appends part of a large string to a buffer The equivalent functions were removed from buffer.h and changed at the various call places.	2018-07-19 16:23:43 +02:00
Willy Tarreau	55372f646f	MINOR: buffer: replace b{i,o}_put* with b_put* The two variants now do exactly the same (appending at the tail of the buffer) so let's not keep the distinction between these classes of functions and have generic ones for this. It's also worth noting that b{i,o}_putchk() wasn't used at all and was removed.	2018-07-19 16:23:43 +02:00
Willy Tarreau	72a100b386	MINOR: buffer: replace bi_fast_delete() with b_del() There's no distinction between in and out data now. The latter covers the needs of the former and supports wrapping. The extra cost is negligible given the locations where it's used.	2018-07-19 16:23:43 +02:00
Olivier Houchard	08afac0fd7	MEDIUM: buffers: move "output" from struct buffer to struct channel Since we never access this field directly anymore, but only through the channel's wrappers, it can now move to the channel. The buffers are now completely free from the distinction between input and output data.	2018-07-19 16:23:43 +02:00
Willy Tarreau	892f1dbe4f	MINOR: buffer: rename the "data" field to "area" Since we use "_data" for the amount of data at many places, as opposed to "_space" for the amount of space, let's rename the "data" field to "area" so that we can reuse "data" later for the amount of data in the buffer (currently called "len" despite not being contiguous).	2018-07-19 16:23:43 +02:00
Willy Tarreau	f6dfd88a92	MINOR: buffer: b_set_data() doesn't truncate output data anymore b_set_data() is used : - in proto_http and hlua to trim input data (b_set_data(co_data())) - in SPOE to append data to a buffer while building a message In no case will this truncate a buffer so we can safely remove the test for len < b->output.	2018-07-19 16:23:43 +02:00
Willy Tarreau	abed1e7f34	MINOR: buffer: remove the check for output on b_del() b_del() is used in : - mux_h2 with the demux buffer : always processes input data - checks with output data though output is not considered at all there - b_eat() which is not used anywhere - co_skip() where the len is always <= output Thus the distinction for output data is not needed anymore and the decrement can be made unconditionally in co_skip().	2018-07-19 16:23:43 +02:00
Willy Tarreau	d54a8ceb97	MAJOR: start to change buffer API This is intentionally the minimal and safest set of changes, some cleanups are still required. These changes are quite tricky and cannot be independently tested, so it's important to keep this patch as bisectable as possible. buf_empty and buf_wanted were changed and are now exactly similar since there's no <p> member in the structure anymore. Given that no test is ever made in the code to check that buf == &buf_wanted, it may be possible that we don't need to have two anymore, unless some buf_empty tests have precedence. This will have to be investigated. A significant part of this commit affects the HTTP compression code, which used to deeply manipulate the input and output buffers without any reasonable solution for a better abstraction. For this reason, if any regression is met and designates this patch as the culprit, it is important to run tests which specifically involve compression or which definitely don't use it in order to spot the issue. Cc: Olivier Houchard <ohouchard@haproxy.com>	2018-07-19 16:23:42 +02:00
Willy Tarreau	523cc5d506	MINOR: buffer: convert part of bo_putblk() and bi_putblk() to the new API These functions are pretty similar and will be merged at the end of the migration. For now they still need to remain distinct.	2018-07-19 16:23:42 +02:00
Willy Tarreau	fdabbe243d	MINOR: buffer: remove unused bo_add() We don't need this function anymore.	2018-07-19 16:23:42 +02:00
Willy Tarreau	cd9e60db00	MEDIUM: channel: adapt to the new buffer API Also, ci_swpbuf() was removed (unused).	2018-07-19 16:23:42 +02:00
Olivier Houchard	d4251a7e98	MINOR: channel: Add co_set_data(). Add a new function that lets one set the channel's output amount.	2018-07-19 16:23:42 +02:00
Willy Tarreau	3ee8344b7b	MINOR: channel: remove almost all references to buf->i and buf->o We use ci_data() and co_data() instead now everywhere we read these values.	2018-07-19 16:23:42 +02:00
Willy Tarreau	591d445049	MINOR: buffer: use b_orig() to replace most references to b->data This patch updates most users of b->data to use b_orig().	2018-07-19 16:23:42 +02:00
Willy Tarreau	50227f9b88	MINOR: buffer: use c_head() instead of buffer_wrap_sub(c->buf, p-o) This way we don't need o anymore.	2018-07-19 16:23:42 +02:00
Willy Tarreau	144c5c4d21	MINOR: buffer: replace buffer_flush() with c_adv(chn, ci_data(chn)) It used to forward some input into output.	2018-07-19 16:23:41 +02:00
Willy Tarreau	5ba65521a3	MINOR: buffer: replace buffer_pending() with ci_data() It used to return b->i for channels, which is what ci_data() does.	2018-07-19 16:23:41 +02:00
Willy Tarreau	3f6799975f	MINOR: buffer: replace bi_space_for_replace() with ci_space_for_replace() This one computes the size that can be overwritten over the input part of the buffer, so it's channel-specific.	2018-07-19 16:23:41 +02:00
Willy Tarreau	2375233ef0	MINOR: buffer: replace buffer_full() with channel_full() It's only used by channels since we need to know the amount of output data.	2018-07-19 16:23:41 +02:00
Willy Tarreau	271e2a503d	MINOR: buffer: make bo_putchar() use b_tail() It's possible because we can't call bo_putchar() with i != 0.	2018-07-19 16:23:41 +02:00
Willy Tarreau	0c7ed5d264	MINOR: buffer: replace buffer_empty() with b_empty() or c_empty() For the same consistency reasons, let's use b_empty() at the few places where an empty buffer is expected, or c_empty() if it's done on a channel. Some of these places were there to realign the buffer so {b,c}_realign_if_empty() was used instead.	2018-07-19 16:23:41 +02:00
Willy Tarreau	d760eecf61	MINOR: buffer: replace buffer_not_empty() with b_data() or c_data() It's mostly for consistency as many places already use one of these instead.	2018-07-19 16:23:41 +02:00
Willy Tarreau	eac5259888	MINOR: buffer: use b_room() to determine available space in a buffer We used to have variations around buffer_total_space() and size-buffer_len() or size-b_data(). Let's simplify all this. buffer_len() was also removed as not used anymore.	2018-07-19 16:23:41 +02:00
Willy Tarreau	bc59f359dc	MINOR: buffer: get rid of b_ptr() and convert its last users Now the new API functions are being used everywhere, we can get rid of b_ptr(). A few last users like bi_istput() and bo_istput() appear to only differ by what part of the buffer they're increasing, but that should quickly be merged.	2018-07-19 16:23:41 +02:00
Willy Tarreau	337ea57cfc	MINOR: connection: add a new receive flag : CO_RFL_BUF_WET With this flag we introduce the notion of "dry" vs "wet" buffers : some demultiplexers like the H2 mux require as much room as possible for some operations that are not retryable like decoding a headers frame. For this they need to know if the buffer is congested with data scheduled for leaving soon or not. Since the new API will not provide this information in the buffer itself, the caller must indicate it. We never need to know the amount of such data, just the fact that the buffer is not in its optimal condition to be used for receipt. This "CO_RFL_BUF_WET" flag is used to mention that such outgoing data are still pending in the buffer and that a sensitive receiver should better let it "dry" before using it.	2018-07-19 16:23:41 +02:00
Willy Tarreau	7f3225f251	MINOR: connection: add a flags argument to rcv_buf() The mux and transport rcv_buf() now takes a "flags" argument, just like the snd_buf() one or like the equivalent syscall lower part. The upper layers will use this to pass some information such as indicating whether the buffer is free from outgoing data or if the lower layer may allocate the buffer itself.	2018-07-19 16:23:41 +02:00
Willy Tarreau	d9cf540457	MEDIUM: mux: make mux->rcv_buf() take a size_t for the count It also returns a size_t. This is in order to clean the API. Note that the H2 mux still uses some ints in the functions called from h2_rcv_buf(), though it's not really a problem given that H2 frames are smaller. It may deserve a general cleanup later though.	2018-07-19 16:23:41 +02:00
Willy Tarreau	bfc4d77ad3	MEDIUM: connection: make xprt->rcv_buf() use size_t for the count Just like we have a size_t for xprt->snd_buf(), we adjust to use size_t for rcv_buf()'s count argument and return value. It also removes the ambiguity related to the possibility to see a negative value there.	2018-07-19 16:23:41 +02:00
Willy Tarreau	deccd1116d	MEDIUM: mux: make mux->snd_buf() take the byte count in argument This way the mux doesn't need to modify the buffer's metadata anymore nor to know the output's size. The mux->snd_buf() function now takes a const buffer and it's up to the caller to update the buffer's state. The return type was updated to return a size_t to comply with the count argument.	2018-07-19 16:23:41 +02:00
Willy Tarreau	787db9a6a4	MEDIUM: connection: make xprt->snd_buf() take the byte count in argument This way the senders don't need to modify the buffer's metadata anymore nor to know about the output's split point. This way the functions can take a const buffer and it's clearer who's in charge of updating the buffer after a send. That's why the buffer realignment is now performed by the caller of the transport's snd_buf() functions. The return type was updated to return a size_t to comply with the count argument.	2018-07-19 16:23:41 +02:00
Willy Tarreau	55f3ce1c91	MINOR: buffer: make b_getblk_nc() take size_t for the block sizes Till now we used to reimplement it using ints to limit external changes but we must adjust it and the various users to switch to size_t.	2018-07-19 16:23:41 +02:00
Willy Tarreau	206ba834ef	MINOR: buffer: make b_getblk_nc() take const pointers Now that there are no more users requiring to modify the buffer anymore, switch these ones to const char and const buffer. This will make it more obvious next time send functions are tempted to modify the buffer's output count. Minor adaptations were necessary at a few call places which were using char due to the function's previous prototype.	2018-07-19 16:23:41 +02:00
Willy Tarreau	5d7d1bbd0e	MINOR: buffer: get rid of b_end() and b_to_end() These ones are not used anymore.	2018-07-19 16:23:41 +02:00
Willy Tarreau	f40e68227b	MINOR: h1: make h1_measure_trailers() use an offset and a count This will be needed by the H2 encoder to restart after wrapping.	2018-07-19 16:23:41 +02:00
Willy Tarreau	84d6b7af87	MINOR: h1: make h1_parse_chunk_size() not depend on b_ptr() anymore It's similar to the previous commit so that the function doesn't rely on buf->p anymore.	2018-07-19 16:23:41 +02:00
Willy Tarreau	c0973c6742	MINOR: h1: make h1_skip_chunk_crlf() not depend on b_ptr() anymore It now takes offsets relative to the buffer's head. It's up to the callers to add this offset which corresponds to the buffer's output size.	2018-07-19 16:23:41 +02:00
Willy Tarreau	7314be8e2c	MINOR: h1: make h1_measure_trailers() take the byte count in argument The principle is that it should not have to take this value from the buffer itself anymore.	2018-07-19 16:23:40 +02:00
Willy Tarreau	e5f12ce7f2	MINOR: buffer: replace bi_del() and bo_del() with b_del() Till now the callers had to know which one to call for specific use cases. Let's fuse them now since a single one will remain after the API migration. Given that bi_del() may only be used where o==0, just combine the two tests by first removing output data then only input.	2018-07-19 16:23:40 +02:00
Willy Tarreau	a1f78fb652	MINOR: buffer: replace bo_getblk_nc() with b_getblk_nc() which takes an offset This will be important so that we can parse a buffer without touching it. Now we indicate where from the buffer's head we plan to start to copy, and for how many bytes. This will be used by send functions to loop at the end of the buffer without having to update the buffer's output byte count.	2018-07-19 16:23:40 +02:00
Willy Tarreau	90ed3836db	MINOR: buffer: replace bo_getblk() with direction agnostic b_getblk() This new function limits itself to the amount of data available in the buffer and doesn't care about the direction anymore. It's only called from co_getblk() which already checks that no more than the available output bytes is requested.	2018-07-19 16:23:40 +02:00
Willy Tarreau	e4d5a036ed	MINOR: buffer: merge b{i,o}_contig_space() These ones were merged into a single b_contig_space() that covers both (the bo_ case was a simplified version of the other one). The function doesn't use ->i nor ->o anymore.	2018-07-19 16:23:40 +02:00
Willy Tarreau	0e11d59af6	MINOR: buffer: remove bo_contig_data() The two call places now make use of b_contig_data(0) and check by themselves that the returned size is no larger than the scheduled output data.	2018-07-19 16:23:40 +02:00
Willy Tarreau	8f9c72d301	MINOR: buffer: remove bi_end() It was replaced by ci_tail() when the channel is known, or b_tail() in other cases.	2018-07-19 16:23:40 +02:00
Willy Tarreau	41e38ac0ee	MINOR: buffer: remove bo_end() It was replaced by either b_tail() when the buffer has no input data, or b_peek(b, b->o).	2018-07-19 16:23:40 +02:00
Willy Tarreau	89faf5d7c3	MINOR: buffer: remove bo_ptr() It was replaced by co_head() when a channel was known, otherwise b_head().	2018-07-19 16:23:40 +02:00
Willy Tarreau	dda2e41881	MINOR: buffer: remove bi_ptr() It's now been replaced by b_head() when b->o is null, ci_head() when the channel is known, or b_peek(b, b->o) in other situations.	2018-07-19 16:23:40 +02:00
Willy Tarreau	7194d3cc3b	MINOR: buffer: split bi_contig_data() into ci_contig_data() and b_contig_data() This function was sometimes used from a channel and sometimes from a buffer. In both cases it requires knowledge of the size of the output data (to skip them). Here the split ensures the channel can deal with this point, and that other places not having output data can continue to work.	2018-07-19 16:23:40 +02:00
Willy Tarreau	d55fe397a0	MINOR: buffer: remove bi_getblk() and bi_getblk_nc() These ones were relying on bi_ptr() and are not used. They may be reimplemented later in the channel if needed.	2018-07-19 16:23:40 +02:00
Willy Tarreau	aa7af7213d	MINOR: buffer: replace calls to buffer_space_wraps() with b_space_wraps() And remove the unused function.	2018-07-19 16:23:40 +02:00
Willy Tarreau	bcbd39370f	MINOR: channel/buffer: replace b_{adv,rew} with c_{adv,rew} These ones manipulate the output data count which will be specific to the channel soon, so prepare the call points to use the channel only. The b_* functions are now unused and were removed.	2018-07-19 16:23:40 +02:00
Willy Tarreau	c0a51c51b1	MINOR: buffer: remove buffer_slow_realign() and the swap_buffer allocation code Since all call places can use the trash now, this is not needed anymore.	2018-07-19 16:23:40 +02:00
Willy Tarreau	fd8d42f496	MEDIUM: channel: make channel_slow_realign() take a swap buffer The few call places where it's used can use the trash as a swap buffer, which is made for this exact purpose. This way we can rely on the generic b_slow_realign() call.	2018-07-19 16:23:40 +02:00
Willy Tarreau	4cf1300e6a	MINOR: channel/buffer: replace buffer_slow_realign() with channel_slow_realign() and b_slow_realign() Where relevant, the channel version is used instead. The buffer version was ported to be more generic and now takes a swap buffer and the output byte count to know where to set the alignment point. The H2 mux still uses buffer_slow_realign() with buf->o but it will change later.	2018-07-19 16:23:40 +02:00
Willy Tarreau	d5b343bf9e	MINOR: channel/buffer: use c_realign_if_empty() instead of buffer_realign() This patch removes buffer_realign() and replaces it with c_realign_if_empty() instead.	2018-07-19 16:23:40 +02:00
Willy Tarreau	08d5ac8f27	MINOR: channel: add a few basic functions for the new buffer API This adds : - c_orig() : channel buffer's origin - c_size() : channel buffer's size - c_wrap() : channel buffer's wrapping location - c_data() : channel buffer's total data count - c_room() : room left in channel buffer's - c_empty() : true if channel buffer is empty - c_full() : true if channel buffer is full - c_ptr() : pointer to an offset relative to input data in the buffer - c_adv() : advances the channel's buffer (bytes become part of output) - c_rew() : rewinds the channel's buffer (output bytes not output anymore) - c_realign_if_empty() : realigns the buffer if it's empty - co_data() : # of output data - co_head() : beginning of output data - co_tail() : end of output data - ci_data() : # of input data - ci_head() : beginning of input data - ci_tail() : end of input data - ci_stop() : location after ci_tail() - ci_next() : pointer to next input byte And for the ci_* / co_* functions above, the "__*" variants which disable wrapping checks, and the "_ofs" variants which return an offset relative to the buffer's origin instead.	2018-07-19 16:23:39 +02:00
Willy Tarreau	f17f19f1a7	MINOR: buffer: introduce b_realign_if_empty() Many places deal with buffer realignment after data removal. The method is always the same : if the buffer is empty, set its pointer to the origin. Let's have a function for this so that we have less code to change with the new API.	2018-07-19 16:23:39 +02:00
Olivier Houchard	a04e40d578	MINOR: buffer: Add b_set_data(). Add a new function that lets you set the amount of input in a buffer. For now it extends/truncates b->i except if the total length is below b->o in which case it clears i and adjusts o.	2018-07-19 16:23:39 +02:00
Olivier Houchard	09138ecc49	MINOR: buffer: Introduce b_sub(), b_add(), and bo_add() Instead of doing b->i -= directly, introduce b_sub(), that does the job, to make it easier to switch to the future API. Also add b_add(), that increases b->i, instead of using it directly, and bo_add(), that does increase b->o.	2018-07-19 16:23:39 +02:00
Willy Tarreau	bbc68df330	MINOR: buffer: add a few basic functions for the new API Here's the list of newly introduced functions : - b_data(), returning the total amount of data in the buffer (currently i+o) - b_orig(), returning the origin of the storage area, that is, the place of position 0. - b_wrap(), pointer to wrapping point (currently data+size) - b_size(), returning the size of the buffer - b_room(), returning the amount of bytes left available - b_full(), returning true if the buffer is full, otherwise false - b_stop(), pointer to end of data mark (currently p+i), used to compute distances or a stop pointer for a loop. - b_peek(), this one will help make the transition to the new buffer model. It returns a pointer to a position in the buffer known from an offset relative to the beginning of the data in the buffer. Thus, we can replace the following occurrences : bo_ptr(b) => b_peek(b, 0); bo_end(b) => b_peek(b, b->o); bi_ptr(b) => b_peek(b, b->o); bi_end(b) => b_peek(b, b->i + b->o); b_ptr(b, ofs) => b_peek(b, b->o + ofs); - b_head(), pointer to the beginning of data (currently bo_ptr()) - b_tail(), pointer to first free place (currently bi_ptr()) - b_next() / b_next_ofs(), pointer to the next byte, taking wrapping into account. - b_dist(), returning the distance between two pointers belonging to a buffer - b_reset(), which resets the buffer - b_space_wraps(), indicating if the free space wraps around the buffer - b_almost_full(), indicating if 3/4 or more of the buffer are used Some of these are provided with the unchecked variants using the "__" prefix, or with the "_ofs" suffix indicating they return a relative position to the buffer's origin instead of a pointer. Cc: Olivier Houchard <ohouchard@haproxy.com>	2018-07-19 16:23:39 +02:00
Willy Tarreau	506a29ac6e	MINOR: buffer: switch buffer sizes and offsets to size_t Passing unsigned ints everywhere is painful, and will cause some headache later when we'll want to integrate better with struct ist which already uses size_t. Let's switch buffers to use size_t instead.	2018-07-19 16:23:39 +02:00
Willy Tarreau	41806d1c52	MINOR: buffer: implement a new file for low-level buffer manipulation functions The buffer code currently depends on pools and other stuff and is not really autonomous anymore. The rewrite of the new API is an opportunity to clean this up. This patch creates a new file (buf.h) which does not depend on other elements and which will only contain what is needed to perform the most basic buffer operations. The new API will be introduced in this file and the conversion will be finished once buffer.h is empty. The definition of struct buffer was moved to this new file, using more explicit stdint types for the sizes and offsets. Most new functions will be implemented in two variants : __b_something() : unchecked variant, no wrapping is expected b_something() : wrapping-checked variant This way callers will be able to select which one to use depending on the use cases.	2018-07-19 16:23:39 +02:00
Olivier Houchard	9ddaf794a8	MINOR: tasklet: Set process to NULL. Some consumers expect the process to be NULL when a tasklet is created, so do so.	2018-07-19 16:23:08 +02:00
Willy Tarreau	17b4aa1adc	BUG/MINOR: ssl: properly ref-count the tls_keys entries Commit `200b0fa` ("MEDIUM: Add support for updating TLS ticket keys via socket") introduced support for updating TLS ticket keys from the CLI, but missed a small corner case : if multiple bind lines reference the same tls_keys file, the same reference is used (as expected), but during the clean shutdown, it will lead to a double free when destroying the bind_conf contexts since none of the lines knows if others still use it. The impact is very low however, mostly a core and/or a message in the system's log upon old process termination. Let's introduce some basic refcounting to prevent this from happening, so that only the last bind_conf frees it. Thanks to Janusz Dziemidowicz and Thierry Fournier for both reporting the same issue with an easy reproducer. This fix needs to be backported from 1.6 to 1.8.	2018-07-18 08:59:50 +02:00
Baptiste Assmann	8e2d9430c0	MINOR: dns: new DNS options to allow/prevent IP address duplication By default, HAProxy's DNS resolution at runtime ensures that there is no IP address duplication in a backend (for servers being resolved by the same hostname). There are a few cases where people want, on purpose, to disable this feature. This patch introduces a couple of new server side options for this purpose: "resolve-opts allow-dup-ip" or "resolve-opts prevent-dup-ip".	2018-07-12 17:56:44 +02:00
Dave Chiluk	8618a6a5e2	MINOR: Some spelling cleanup in the comments. Signed-off-by: Dave Chiluk <chiluk+haproxy@indeed.com>	2018-06-21 20:43:52 +02:00
Olivier Houchard	dcd6f3a597	MINOR: tasks: Make sure we correctly init and deinit a tasklet. Up until now, a tasklet couldn't be free'd while it was in the list, it is no longer the case, so make sure we remove it from the list before freeing it. To do so, we have to make sure we correctly initialize it, so use LIST_INIT, instead of setting the pointers to NULL.	2018-06-14 18:57:13 +02:00
William Lallemand	6e1796e85d	BUG/MINOR: signals: ha_sigmask macro for multithreading The behavior of sigprocmask in a multithreaded environment is undefined. The new macro ha_sigmask() calls either pthread_sigmask() or sigprocmask() depending on whether haproxy was built with thread support or not. This should be backported to 1.8.	2018-06-08 18:24:53 +02:00
Olivier Houchard	b1ca58b245	MINOR: tasks: Don't define rqueue if we're building without threads. To make sure we don't inadvertently insert task in the global runqueue, while only the local runqueue is used without threads, make its definition and usage conditional on USE_THREAD.	2018-06-06 16:35:12 +02:00
Olivier Houchard	e13ab8b3c6	BUG/MEDIUM: tasks: Use the local runqueue when building without threads. When building without threads enabled, instead of just using the global runqueue, just use the local runqueue associated with the only thread, as that's what is now expected for a single thread in process_runnable_tasks(). This should fix haproxy when built without threads.	2018-06-06 16:34:52 +02:00
Willy Tarreau	10d81b8757	MINOR: applet: assign the same nice value to a new appctx as its owner task When an applet is created, let's assign it the same nice value as the task of the stream which owns it. It ensures that fairness is properly propagated to applets, and that the CLI can regain a low latency behaviour again. Huge differences have been seen under extreme loads, with the CLI being called every 200 microseconds instead of 11 milliseconds.	2018-06-05 11:18:21 +02:00
David Carlier	caa8a37ffe	MINOR: task: Fix a compiler warning by adding a cast. When calling HA_ATOMIC_CAS with a pointer as the target, the compiler expects a pointer as the new value, so give it one by casting 0x1 to (void *).	2018-06-04 17:43:12 +02:00
Thierry FOURNIER	9d5422a4b7	MINOR: task/notification: Is notifications registered ? This function returns true if some notifications are registered. This function is useful for the following patch BUG/MEDIUM: lua/socket: Sheduling error on write: may dead-lock It should be backported in 1.6, 1.7 and 1.8	2018-05-31 10:58:41 +02:00
Olivier Houchard	09eeb7684d	BUG/MEDIUM: tasks: Don't forget to increase/decrease tasks_run_queue. Don't forget to increase tasks_run_queue when we're adding a task to the tasklet list, and to decrease it when we remove a task from a runqueue, or its value won't be accurate, and could lead to tasks not being executed when put in the global run queue. 1.9-dev only, no backport is needed.	2018-05-28 15:20:55 +02:00
Tim Duesterhus	3fd1973d37	MINOR: http: Log warning if (add|set)-header fails This patch adds a warning if an http-(request|response) (add|set)-header rewrite fails to change the respective header in a request or response. This usually happens when tune.maxrewrite is not sufficient to hold all the headers that should be added.	2018-05-28 14:53:59 +02:00
Olivier Houchard	673867c357	MAJOR: applets: Use tasks, instead of rolling our own scheduler. There's no real reason to have a specific scheduler for applets anymore, so nuke it and just use tasks. This comes with some benefits, the first one being that applets cannot induce high latencies anymore since they share nice values with other tasks. Later it will be possible to configure the applets' nice value. The second benefit is that the applet scheduler was not very thread-friendly, having a big lock around it in anticipation of this change. Thus applet-intensive workloads should now scale much better with threads. Some more improvement is possible now : some applets also use a task to handle timers and timeouts. These ones could now be simplified to use only one task.	2018-05-26 20:03:30 +02:00
Olivier Houchard	1599b80360	MINOR: tasks: Make the number of tasks to run at once configurable. Instead of hardcoding 200, make the number of tasks to be run configurable using tune.runqueue-depth. 200 is still the default.	2018-05-26 20:03:24 +02:00
Olivier Houchard	b0bdae7b88	MAJOR: tasks: Introduce tasklets. Introduce tasklets, lightweight tasks. They have no notion of priority, they are just run as soon as possible, and will probably be used for I/O later. For the moment they're used to replace the temporary thread-local list that was used in the scheduler. The first part of the struct is common with tasks so that tasks can be cast to tasklets and queued in this list. Once a task is in the tasklet list, it has its leaf_p set to 0x1 so that it cannot accidentally be confused as not in the queue. Pure tasklets are identifiable by their nice value of -32768 (which is normally not possible).	2018-05-26 20:03:19 +02:00
Olivier Houchard	f6e6dc12cd	MAJOR: tasks: Create a per-thread runqueue. A lot of tasks are run on one thread only, so instead of having them all in the global runqueue, create a per-thread runqueue which doesn't require any locking, and add all tasks belonging to only one thread to the corresponding runqueue. The global runqueue is still used for non-local tasks, and is visited by each thread when checking its own runqueue. The nice parameter is thus used both in the global runqueue and in the local ones. The rare tasks that are bound to multiple threads will have their nice value used twice (once for the global queue, once for the thread-local one).	2018-05-26 19:27:29 +02:00
Olivier Houchard	9f6af33222	MINOR: tasks: Change the task API so that the callback takes 3 arguments. In preparation for thread-specific runqueues, change the task API so that the callback takes 3 arguments, the task itself, the context, and the state, those were retrieved from the task before. This will allow these elements to change atomically in the scheduler while the application uses the copied value, and even to have NULL tasks later.	2018-05-26 19:23:57 +02:00
Willy Tarreau	0cd82e883e	BUG/BUILD: threads: unbreak build without threads A few users reported that building without threads was accidentally broken after commit `6b96f72` ("BUG/MEDIUM: pollers: Use a global list for fd shared between threads.") due to all_threads_mask not being defined. It's OK to set it to zero as other code parts do when threads are enabled but only one thread is used. This needs to be backported to 1.8.	2018-05-23 19:54:43 +02:00
Thierry Fournier	d5b073cf1f	MINOR: lua: Improve error message The function hlua_ctx_resume returns fewer text messages and more error codes. These error codes allow the caller to return an appropriate message to the user.	2018-05-22 18:57:46 +02:00
Christopher Faulet	68db0235fd	CLEANUP: spoe: Remove unused variables in the agent structure applets_act and applets_idle were used for debugging purposes. Now, these values are part of the agent's counters.	2018-05-18 15:04:46 +02:00
Olivier Houchard	cb92f5cae4	MINOR: pollers: move polled_mask outside of struct fdtab. The polled_mask is only used in the pollers, and removing it from the struct fdtab makes it fit in one 64B cacheline again, on a 64bits machine, so make it a separate array.	2018-05-06 06:27:34 +02:00
Olivier Houchard	6b96f7289c	BUG/MEDIUM: pollers: Use a global list for fd shared between threads. With the old model, any fd shared by multiple threads, such as listeners or dns sockets, would only be updated on one thread, so that could lead to missed events, or spurious wakeups. To avoid this, add a global list for fd that are shared, using the same implementation as the fd cache, and only remove entries from this list when every thread has updated its poller. [wt: this will need to be backported to 1.8 but differently so this patch must not be backported as-is]	2018-05-06 06:27:09 +02:00
Olivier Houchard	6a2cf8752c	MINOR: fd: Make the lockless fd list work with multiple lists. Modify fd_add_to_fd_list() and fd_rm_from_fd_list() so that they take an offset in the fdtab to the list entry, instead of hardcoding the fd cache, so we can use them with other lists.	2018-05-06 06:25:49 +02:00
Olivier Houchard	9b36cb4a41	BUG/MEDIUM: task: Don't free a task that is about to be run. While running a task, we may try to delete and free a task that is about to be run, because it's part of the local tasks list, or because rq_next points to it. So flag any task that is in the local tasks list to be deleted, instead of run, by setting t->process to NULL, and re-make rq_next a global, thread-local variable, that is modified if we attempt to delete that task. Many thanks to PiBa-NL for reporting this and analysing the problem. This should be backported to 1.8.	2018-05-04 20:11:04 +02:00
Willy Tarreau	760e81d356	MINOR: backend: implement random-based load balancing For large farms where servers are regularly added or removed, picking a random server from the pool can ensure faster load transitions than when using round-robin and less traffic surges on the newly added servers than when using leastconn. This commit introduces "balance random". It internally uses a random as the key to the consistent hashing mechanism, thus all features available in consistent hashing such as weights and bounded load via hash-balance-factor are usable. It is extremely convenient because one common concern when using random is what happens when a server is hammered a bit too much. Here that can trivially be avoided, like in the configuration below : backend bk0 balance random hash-balance-factor 110 server-template s 1-100 127.0.0.1:8000 check inter 1s Note that while "balance random" internally relies on a hash algorithm, it holds the same properties as round-robin and as such is compatible with reusing an existing server connection with "option prefer-last-server".	2018-05-03 07:20:40 +02:00
Tim Duesterhus	e2b10bf491	MINOR: http: Add support for 421 Misdirected Request This makes haproxy aware of HTTP 421 Misdirected Request, which is defined in RFC 7540, section 9.1.2.	2018-04-28 07:03:39 +02:00
Aurélien Nephtali	abbf607105	MEDIUM: cli: Add payload support In order to use arbitrary data in the CLI (multiple lines or group of words that must be considered as a whole, for example), it is now possible to add a payload to the commands. To do so, the first line needs to end with a special pattern: <<\n. Everything that follows will be left untouched by the CLI parser and will be passed to the commands parsers. Per-command support will need to be added to take advantage of this feature. Signed-off-by: Aurélien Nephtali <aurelien.nephtali@corp.ovh.com>	2018-04-26 14:19:33 +02:00
Willy Tarreau	174b06a572	MINOR: h2: detect presence of CONNECT and/or content-length We'll need this in order to support uploading chunks. The h2 to h1 converter checks for the presence of the content-length header field as well as the CONNECT method and returns this information to the caller. The caller indicates whether or not a body is detected for the message (presence of END_STREAM or not). No transfer-encoding header is emitted yet.	2018-04-26 10:15:14 +02:00
Olivier Houchard	302f9ef055	BUG/MEDIUM: connection: Make sure we have a mux before calling detach(). In some cases, we call cs_destroy() very early, so early the connection doesn't yet have a mux, so we can't call mux->detach(). In this case, just destroy the associated connection. This should be backported to 1.8.	2018-04-13 16:02:21 +02:00
Christopher Faulet	48aa13f286	BUG/MEDIUM: threads: Fix the max/min calculation because of name clashes With gcc < 4.7, when HAProxy is built with threads, the macros HA_ATOMIC_CAS/XCHG/STORE rely on the legacy __sync builtins. These macros are slightly more complicated than the versions relying on the '__atomic' builtins. Internally, some local variables are defined, prefixed with '__' to avoid name clashes with the caller. On the other hand, the macros HA_ATOMIC_UPDATE_MIN/MAX call HA_ATOMIC_CAS. Some local variables are also defined in these macros, following the same naming rule as above. The problem is that '__new' variable is used in HA_ATOMIC_MIN/_MAX and in HA_ATOMIC_CAS. Obviously, the behaviour is undefined because '__new' in HA_ATOMIC_CAS is left uninitialized. Unfortunately gcc fails to detect this error. To fix the problem, all internal variables to macros are now suffixed with name of the macros to avoid clashes (for instance, '__new_cas' in HA_ATOMIC_CAS). This patch must be backported in 1.8.	2018-04-10 11:07:56 +02:00
Christopher Faulet	caf2feca62	MINOR: spoe: Add counters to log info about SPOE agents In addition to metrics about time spent in the SPOE, following counters have been added: * applets : number of SPOE applets. * idles : number of idle applets. * nb_sending : number of streams waiting to send data. * nb_waiting : number of streams waiting for an ack. * nb_processed : number of events/groups processed by the SPOE (from the stream point of view). * nb_errors : number of errors during the processing (from the stream point of view). Log messages have been updated to report these counters. Following pattern has been added at the end of the log message: ... <idles>/<applets> <nb_sending>/<nb_waiting> <nb_error>/<nb_processed>	2018-04-05 15:13:54 +02:00
Christopher Faulet	7250b8fb5c	MINOR: spoe: Add loggers dedicated to the SPOE agent Now it is possible to configure a logger in a spoe-agent section using a "log" line, as for a proxy. "no log", "log global" and "log <address> ..." syntaxes are supported.	2018-04-05 15:13:54 +02:00
Christopher Faulet	28ac099907	MINOR: log: Keep the ref when a log server is copied to avoid duplicate entries With "log global" line, the global list of loggers is copied into the proxy's struct. The list coming from the default section is also copied when a frontend or a backend section is parsed. So it is possible to have duplicate entries in the proxy's list. For instance, with this following config, all messages will be logged twice: global log 127.0.0.1 local0 debug daemon defaults mode http log global option httplog frontend front-http log global bind *:8888 default_backend back-http backend back-http server www 127.0.0.1:8000	2018-04-05 15:13:54 +02:00
Christopher Faulet	4b0b79dd56	MINOR: log: move 'log' keyword parsing in dedicated function Now, the function parse_logsrv should be used to parse a "log" line. This function will update the list of loggers passed in argument. It can release all log servers when "no log" line was parsed (by the caller) or it can parse "log global" or "log <address> ... " lines. It takes care of checking the caller context (global or not) to prohibit "log global" usage in the global section.	2018-04-05 15:13:54 +02:00
Christopher Faulet	36bda1cd4a	MINOR: spoe: Add options to store processing times in variables "set-process-time" and "set-total-time" options have been added to store processing times in the transaction scope, at each event and group processing, the current one and the total one. So it is possible to get them. TODO: documentation	2018-04-05 15:13:54 +02:00
Christopher Faulet	b2dd1e034c	MINOR: spoe: Add metrics to know time spent in the SPOE Following metrics are added for each event or group of messages processed in the SPOE: * processing time: the delay to process the event or the group. From the stream point of view, it is the latency added by the SPOE processing. * request time : It is the encoding time. It includes ACLs processing, if any. For fragmented frames, it is the sum of all fragments. * queue time : the delay before the request gets out the sending queue. For fragmented frames, it is the sum of all fragments. * waiting time: the delay before the response is received. No fragmentation supported here. * response time: the delay to process the response. No fragmentation supported here. * total time: (unused for now). It is the sum of all events or groups processed by the SPOE for a specific thread. Log messages have been updated. Before, only errors were logged (status_code != 0). Now every processing is logged, following this format: SPOE: [AGENT] <TYPE:NAME> sid=STREAM-ID st=STATUS-CODE reqT/qT/wT/resT/pT where: AGENT is the agent name TYPE is EVENT or GROUP NAME is the event or the group name STREAM-ID is an integer, the unique id of the stream STATUS-CODE is the processing's status code reqT/qT/wT/resT/pT are delays described above For all these delays, -1 means the processing was interrupted before the end. So -1 for the queue time means the request was never dequeued. For fragmented frames it is harder to know when the interruption happened. For now, messages are logged using the same logger as the backend of the stream which initiated the request.	2018-04-05 15:13:53 +02:00
Olivier Houchard	8ef1a6b0d8	BUG/MINOR: fd: Don't clear the update_mask in fd_insert. Clearing the update_mask bit in fd_insert may lead to duplicate insertion of fd in fd_updt, that could lead to a write past the end of the array. Instead, make sure the update_mask bit is cleared by the pollers no matter what. This should be backported to 1.8. [wt: warning: 1.8 doesn't have the lockless fdcache changes and will require some careful changes in the pollers]	2018-04-03 19:38:15 +02:00
Willy Tarreau	b011d8f4c4	MINOR: mux: add a "show_fd" function to dump debugging information for "show fd" This function will be called from the CLI's "show fd" command to append some extra mux-specific information that only the mux handler can decode. This is supposed to help collect various hints about what is happening when facing certain anomalies.	2018-03-30 14:41:19 +02:00
Willy Tarreau	4037a3f904	MINOR: cli/threads: make "show fd" report thread_sync_io_handler instead of "unknown" The output was confusing when the sync point's dummy handler was shown. This patch should be backported to 1.8 to help with troubleshooting.	2018-03-28 18:06:47 +02:00
Emmanuel Hocdet	4952985b71	REORG: compact "struct server" Move use_ssl (bool value) in "struct server" hole.	2018-03-21 05:04:01 +01:00
Emmanuel Hocdet	4399c75f6c	MINOR: proxy-v2-options: add crc32c This patch adds option crc32c (PP2_TYPE_CRC32C) to proxy protocol v2. It computes the checksum of proxy protocol v2 header as described in "doc/proxy-protocol.txt".	2018-03-21 05:04:01 +01:00
Emmanuel Hocdet	6afd898988	MINOR: hash: add new function hash_crc32c This function will be used to perform CRC32c computations. This is required to compute proxy protocol v2 CRC32C tlv (PP2_TYPE_CRC32C).	2018-03-21 05:04:01 +01:00
Willy Tarreau	26fb5d8449	BUG/MEDIUM: fd/threads: ensure the fdcache_mask always reflects the cache contents Commit `4815c8c` ("MAJOR: fd/threads: Make the fdcache mostly lockless.") made the fd cache lockless, but after a few iterations, a subtle part was lost, consisting in setting the bit on the fd_cache_mask immediately when adding an event. Now it was done only when the cache started to process events, but the problem it causes is that fd_cache_mask isn't reliable anymore as an indicator of presence of events to be processed with no delay outside of fd_process_cached_events(). This results in some spurious delays when processing inter-thread wakeups between tasks. Just restoring the flag when the event is added is enough to fix the problem. Kudos to Christopher for spotting this one! No backport is needed as this is only in the development version.	2018-03-20 19:14:24 +01:00
Christopher Faulet	5cd4bbd7ab	BUG/MAJOR: threads/queue: Fix thread-safety issues on the queues management The management of the servers and the proxies queues was not thread-safe at all. First, the accesses to <strm>->pend_pos were not protected. So it was possible to release it on a thread (for instance because the stream is released) and to use it at the same time on another one (because we redispatch pending connections for a server). Then, the accesses to stream's information (flags and target) from anywhere is forbidden. To be safe, the stream's state must always be updated in the context of process_stream. So to fix these issues, the queue module has been refactored. A lock has been added in the pendconn structure. And now, when we try to dequeue a pending connection, we start by unlinking it from the server/proxy queue and we wake up the stream. Then, it is the stream's responsibility to really dequeue it (or release it). This way, we are sure that only the stream can create and release its <pend_pos> field. However, be careful. This new implementation should be thread-safe (hopefully...). But it is not optimal and in some situations, it could be really slower in multi-threaded mode than in single-threaded one. The problem is that, when we try to dequeue pending connections, we process it from the older one to the newer one independently to the thread's affinity. So we need to wait the other threads' wakeup to really process them. If threads are blocked in the poller, this will add a significant latency. This problem happens when maxconn values are very low. This patch must be backported in 1.8.	2018-03-19 10:03:06 +01:00
Christopher Faulet	510c0d67ef	BUG/MEDIUM: threads/unix: Fix a deadlock when a listener is temporarily disabled When a listener is temporarily disabled, we start by locking it and then we call .pause callback of the underlying protocol (tcp/unix). For TCP listeners, this is not a problem. But listeners bound on an unix socket are in fact closed instead. So .pause callback relies on unbind_listener function to do its job. Unfortunately, unbind_listener holds the listener's lock and then calls an internal function to unbind it. So, there is a deadlock here. This happens during a reload. To fix the problem, the function do_unbind_listener, which is lockless, is now exported and is called when a listener bound on an unix socket is temporarily disabled. This patch must be backported in 1.8.	2018-03-16 11:19:07 +01:00
Willy Tarreau	c41b3e8dff	DOC: buffers: clarify the purpose of the <from> pointer in offer_buffers() This one is only used to compare pointers and NULL is permitted though this is far from being clear.	2018-03-08 18:33:48 +01:00
Emmanuel Hocdet	253c3b7516	MINOR: connection: add proxy-v2-options authority This patch adds option PP2_TYPE_AUTHORITY to proxy protocol v2 when a TLS connection was negotiated. In this case, authority corresponds to the sni.	2018-03-01 11:38:32 +01:00
Emmanuel Hocdet	fa8d0f1875	MINOR: connection: add proxy-v2-options ssl-cipher,cert-sig,cert-key This patch implements proxy protocol v2 options related to crypto information: ssl-cipher (PP2_SUBTYPE_SSL_CIPHER), cert-sig (PP2_SUBTYPE_SSL_SIG_ALG) and cert-key (PP2_SUBTYPE_SSL_KEY_ALG).	2018-03-01 11:38:28 +01:00
Emmanuel Hocdet	283e004a85	MINOR: ssl: add ssl_sock_get_cert_sig function ssl_sock_get_cert_sig can be used to report cert signature short name to log and ppv2 (RSA-SHA256).	2018-03-01 11:34:08 +01:00
Emmanuel Hocdet	96b7834e98	MINOR: ssl: add ssl_sock_get_pkey_algo function ssl_sock_get_pkey_algo can be used to report pkey algorithm to log and ppv2 (RSA2048, EC256,...). Extracting pkey information is not free in ssl api (lock/alloc/free): haproxy can use the pkey information computed in load_certificate. Store and use this information in a SSL ex_data when available, compute it if not (SSL multicert bundled and generated cert).	2018-03-01 11:34:05 +01:00
Emmanuel Hocdet	ddc090bc55	MINOR: ssl: extract full pkey info in load_certificate Private key information is used in switchctx to implement native multicert selection (ecdsa/rsa/anonymous). This patch extracts and stores full pkey information: dsa type and pkey size in bits. This can be used for switchctx or to report pkey information in ppv2 and log.	2018-03-01 11:33:18 +01:00
Christopher Faulet	ca6ef50661	BUG/MEDIUM: buffer: Fix the wrapping case in bi_putblk When the block of data needs to be split to support the wrapping, the start of the second block of data was wrong. We must be sure to skip data copied during the first memcpy. This patch must be backported to 1.8.	2018-02-27 15:45:03 +01:00
Christopher Faulet	b2b279464c	BUG/MEDIUM: buffer: Fix the wrapping case in bo_putblk When the block of data needs to be split to support the wrapping, the start of the second block of data was wrong. We must be sure to skip data copied during the first memcpy. This patch must be backported to 1.8, 1.7, 1.6 and 1.5.	2018-02-27 15:45:03 +01:00
Yves Lafon	95317289e9	MINOR: stats: display the number of threads in the statistics. Add the nbthread global variable to the output, matching nbproc. This may be backported to 1.8	2018-02-26 11:53:46 +01:00
Willy Tarreau	364d745106	MINOR: debug/pools: make DEBUG_UAF also detect underflows Since we use padding before the allocated page, it's trivial to place the allocated address there and see if it gets mangled once we release it. This may be backported to stable releases already using DEBUG_UAF.	2018-02-22 14:18:45 +01:00
Willy Tarreau	5a9cce4653	BUG/MINOR: debug/pools: properly handle out-of-memory when building with DEBUG_UAF Commit `158fa75` ("MINOR: pools: implement DEBUG_UAF to detect use after free") implemented pool use-after-free detection, but the mmap() return value isn't properly checked, preventing the call to pool_alloc_area() from returning NULL. So on out-of-memory a mangled pointer is returned, causing a crash on the pool_alloc() site instead of forcing a GC. It doesn't affect regular operations however, just complicates complex bug investigations. This fix should be backported to 1.8 and to 1.7.	2018-02-22 14:18:45 +01:00
Willy Tarreau	f161d0f51e	BUG/MINOR: pools/threads: don't ignore DEBUG_UAF on double-word CAS capable archs Since commit `cf975d4` ("MINOR: pools/threads: Implement lockless memory pools."), we support lockless pools. However the parts dedicated to detecting use-after-free are not present in this part, making DEBUG_UAF useless in this situation. The present patch sets a new define CONFIG_HAP_LOCKLESS_POOLS when such a compatible architecture is detected, and when pool debugging is not requested, then makes use of this everywhere in pools and buffers functions. This way enabling DEBUG_UAF will automatically disable the lockless version. No backport is needed as this is purely 1.9-dev.	2018-02-22 14:18:45 +01:00
Tim Duesterhus	5e64286bab	CLEANUP: standard: Fix typo in IPv6 mask example IPv6 addresses with two double colons are invalid. This typo was introduced in commit `471851713a`.	2018-02-21 05:07:35 +01:00
Tim Duesterhus	05f6a43bd4	CLEANUP: pools: Remove unused end label in memory.h This removes the end label from memory.h. The labels are unused as of `cf975d46bc` which is unreleased (and incidentally the first commit containing those labels, thus they never have been used).	2018-02-20 08:30:13 +01:00
Christopher Faulet	16f45c87d5	BUG/MINOR: ssl/threads: Make management of the TLS ticket keys files thread-safe A TLS ticket keys file can be updated on the CLI and used in same time. So we need to protect it to be sure all accesses are thread-safe. Because updates are infrequent, a R/W lock has been used. This patch must be backported in 1.8	2018-02-19 14:15:38 +01:00
David Carlier	4ee76d0281	BUILD/MINOR: memory: stdint is needed for uintptr_t stdint.h is needed on OpenBSD for uintptr_t type.	2018-02-19 07:58:50 +01:00
Willy Tarreau	41ccb194d1	BUG/MEDIUM: threads: fix the double CAS implementation for ARMv7 Commit `f61f0cb` ("MINOR: threads: Introduce double-width CAS on x86_64 and arm.") introduced the double CAS. But the ARMv7 version is bogus, it uses the value of the pointers instead of dereferencing them. When lucky, it simply doesn't build due to impossible registers combinations. Otherwise it will immediately crash at run time when facing traffic. No backport is needed, this bug was introduced in 1.9-dev.	2018-02-14 14:16:28 +01:00
Willy Tarreau	4cc67a2782	MINOR: fd: move the fd_{add_to,rm_from}_fdlist functions to fd.c There's no point inlining these huge functions, better move them to real functions in fd.c.	2018-02-05 17:19:40 +01:00
Willy Tarreau	4d84186337	MEDIUM: fd: make updt_fd_polling() use atomics It only needed a test-and-set and an atomic increment so we can take it out of the fd lock now.	2018-02-05 16:02:22 +01:00
Willy Tarreau	1b76a6d1a6	CLEANUP: fd: remove the now unused fd_compute_new_polled_status() function It's not used anymore since the new state is calculated on the fly during every update. Let's remove this function.	2018-02-05 16:02:22 +01:00
Willy Tarreau	7ac0e35f23	MAJOR: fd: compute the new fd polling state out of the fd lock Each fd_{may\|cant\|stop\|want}_{recv\|send} function sets or resets a single bit at once, then recomputes the need for updates, and then the new cache state. Later, pollers will compute the new polling state based on the resulting operations here. In fact the conditions are so simple that they can be performed by a single "if", or sometimes even optimized away. This means that in practice a simple compare-and-swap operation is often enough to set the new value including the new polling state, and that only the cache and fdupdt have to be performed under the lock. Better, for the most common operations (fd_may_{recv,send}, used by the pollers), a simple atomic OR is needed. This patch does this for the fd_* functions above and it doesn't yet remove the now useless fd_compute_new_polling_status() because it's still used by other pollers. A pure connection rate test shows a 1% performance increase.	2018-02-05 16:02:22 +01:00
Olivier Houchard	1256836ebf	MEDIUM: fd/threads: Make sure we don't miss a fd cache entry. An fd cache entry might be removed and added at the end of the list while another thread is parsing it. If that happens, we may miss fd cache entries. To avoid that, add a new field in the struct fdtab, "added_mask", which contains a mask for potentially affected threads. If it is set, the corresponding thread will set its bit in fd_cache_mask, to avoid waiting in poll while it may have more work to do.	2018-02-05 16:02:22 +01:00
Olivier Houchard	4815c8cbfe	MAJOR: fd/threads: Make the fdcache mostly lockless. Create a local, per-thread, fdcache, for file descriptors that only belongs to one thread, and make the global fd cache mostly lockless, as we can get a lot of contention on the fd cache lock.	2018-02-05 16:02:22 +01:00
Olivier Houchard	cf975d46bc	MINOR: pools/threads: Implement lockless memory pools. On CPUs that support a double-width compare-and-swap, implement lockless pools.	2018-02-05 16:02:22 +01:00
Willy Tarreau	5266b3e12d	MINOR: threads: add test and set/reset operations This just adds a set of naive bts/btr operations based on OR/AND. Later it could rely on pl_bts/btr to use arch-specific versions if needed.	2018-02-05 14:24:50 +01:00
Olivier Houchard	f61f0cb95f	MINOR: threads: Introduce double-width CAS on x86_64 and arm. Introduce double-width compare-and-swap on arches that support it, right now x86_64, arm, and aarch64. Also introduce functions to do memory barriers.	2018-02-05 14:24:50 +01:00
Olivier Houchard	928fbfa8b7	MINOR: compiler: introduce offsetoff(). Add an offsetof() macro, if it is not there already.	2018-02-05 14:24:50 +01:00
Olivier Houchard	6fa63d9852	MINOR: early data: Don't rely on CO_FL_EARLY_DATA to wake up streams. Instead of looking for CO_FL_EARLY_DATA to know if we have to try to wake up a stream, because it is waiting for a SSL handshake, add a new conn_stream flag, CS_FL_WAIT_FOR_HS. This way we don't have to rely on CO_FL_EARLY_DATA, and we will only wake streams that are actually waiting.	2018-02-05 14:24:50 +01:00
Christopher Faulet	b077cdc012	MEDIUM: spoe: Use an ebtree to manage idle applets Instead of using a list of applets with idle ones in front, we now use an ebtree. Applets in the tree are idle by definition. And the key is the applet's weight. When a new frame is queued, the first idle applet (with the lowest weight) is woken up and its weight is increased by one. And when an applet sends a frame to a SPOA, its weight is decremented by one. This is empirical, but it should avoid overusing a very small number of applets and improve the balancing between idle applets.	2018-02-02 16:00:32 +01:00
Christopher Faulet	8f82b203d5	MINOR: spoe: Count the number of frames waiting for an ack for each applet So it is easier to respect the max_fpa value. This is no more the maximum frames processed by an applet at each loop but the maximum frames waiting for an ack for a specific applet. The function spoe_handle_processing_appctx has been rewritten accordingly.	2018-02-02 16:00:32 +01:00
Christopher Faulet	6f9ea4f87b	MINOR: spoe: Replace sending_rate by a frequency counter sending_rate was a counter used to evaluate the SPOE capacity to process frames. Because it was not really accurate, it has been replaced by a frequency counter representing the number of frames handled by the SPOE per second. We just check this counter is higher than the number of streams waiting for a reply. If not, a new applet is created.	2018-02-02 16:00:32 +01:00
Christopher Faulet	fce747bbaa	MINOR: spoe: Always link a SPOE context with the applet processing it This was already done for fragmented frames. Now, this is true for all frames.	2018-02-02 16:00:32 +01:00
Christopher Faulet	420977903b	MINOR: spoe: Remove check on min_applets number when a SPOE context is queued The calculation of a minimal number of active applets was really empirical and finally useless. On heavy load, there are always many active applets (most of time, more than the minimal required) and when the load is low, there is no reason to keep unused applets opened. Because of this change, the flag SPOE_APPCTX_FL_PERSIST is now unused. So it has been removed.	2018-02-02 16:00:32 +01:00
Frédéric Lécaille	6778b27542	MINOR: stick-tables: Adds support for new "gpc1" and "gpc1_rate" counters. Implement exactly the same code as this has been done for "gpc0" and "gpc0_rate" counters.	2018-01-31 09:40:05 +01:00
Christopher Faulet	f51bac2ba8	BUG/MINOR: threads: Update labels array because of changes in lock_label enum Recent changes to the enum were not synchronized with the lock debugging code. Now we use a switch/case instead of an array so that the compiler throws a warning if there is any inconsistency. To be backported to 1.8 (at least to add the START entry).	2018-01-30 14:35:24 +01:00
Willy Tarreau	a9786b6f04	MINOR: fd: pass the iocb and owner to fd_insert() fd_insert() is currently called just after setting the owner and iocb, but proceeding like this prevents the operation from being atomic and requires a lock to protect the maxfd computation in another thread from meeting an incompletely initialized FD and computing a wrong maxfd. Fortunately for now all fdtab[].owner are set before calling fd_insert(), and the first lock in fd_insert() enforces a memory barrier so the code is safe. This patch moves the initialization of the owner and iocb to fd_insert() so that the function will be able to properly arrange its operations and remain safe even when modified to become lockless. There's no other change beyond the internal API.	2018-01-29 16:07:25 +01:00
Willy Tarreau	82b37d74d2	MEDIUM: fd: use atomic ops for hap_fd_{clr,set} and remove poll_lock Now that we can use atomic ops to set/clear an fd occurrence in an fd_set, we don't need the poll_lock anymore. Let's remove it.	2018-01-29 16:03:15 +01:00
Willy Tarreau	322e6c7e73	MINOR: fd: move the hap_fd_{clr,set,isset} functions to fd.h These functions were created for poll() in 1.5-dev18 (commit `80da05a4`) to replace the previous FD_{CLR,SET,ISSET} that were shared with select() because some libcs enforce a limit on FD_SET. But FD_SET doesn't seem to be universally MT-safe, requiring locks in the select() code that are not needed in the poll code. So let's move back to the initial situation where we used to only use bit fields, since that has been in use since day one without a problem, and let's use these hap_fd_* functions instead of FD_*. This patch only moves the functions to fd.h and revives hap_fd_isset() that was recently removed to kill an "unused" warning.	2018-01-29 16:03:15 +01:00
Willy Tarreau	745c60eac6	CLEANUP: fd: remove the unused "new" field This field has been unused since 1.6, it's only updated and never tested. Let's remove it.	2018-01-29 16:02:59 +01:00
Willy Tarreau	f2b5c99b4c	CLEANUP: fd/threads: remove the now unused fdtab_lock It was only used to protect maxfd computation and is not needed anymore.	2018-01-29 15:25:35 +01:00
Willy Tarreau	173d9951e2	MEDIUM: polling: start to move maxfd computation to the pollers Since only select() and poll() still make use of maxfd, let's move its computation right there in the pollers themselves, and only during each fd update pass. The computation doesn't need a lock anymore, only a few atomic ops. It will be accurate, be done much less often and will not be required anymore in the FD's fast path. This provides a small performance increase of about 1% in connection rate when using epoll since we get rid of this computation which was performed under a lock.	2018-01-29 15:22:57 +01:00
Frédéric Lécaille	a41d531e4e	MINOR: config: Enable tracking of up to MAX_SESS_STKCTR stick counters. This patch really adds support for up to MAX_SESS_STKCTR stick counters.	2018-01-29 13:53:56 +01:00
Tim Duesterhus	471851713a	MINOR: standard: Add str2mask6 function This new function mirrors the str2mask() function for IPv4 addresses. This commit is in preparation to support ARGT_MSK6.	2018-01-25 22:25:40 +01:00
Tim Duesterhus	92bb034209	CLEANUP: Fix typo in ARGT_MSK6 comment The incorrect comment was introduced in commit: `2ac5718dbd` v1.5-dev9 is the first tag containing this comment, the fix should be backported to haproxy 1.5 and newer.	2018-01-25 22:25:40 +01:00
Willy Tarreau	1605c7ae61	BUG/MEDIUM: threads/mworker: fix a race on startup Marc Fournier reported an interesting case when using threads with the master-worker mode : sometimes, a listener would have its FD closed during startup. Sometimes it could even be health checks seeing this. What happens is that after the threads are created, and the pollers enabled on each thread, the master-worker pipe is registered, and at the same time a close() is performed on the write side of this pipe since the children must not use it. But since this is replicated in every thread, what happens is that the first thread closes the pipe, thus releases the FD, and the next thread starting a listener in parallel gets this FD reassigned. Then another thread closes the FD again, which this time corresponds to the listener. It can also happen with the health check sockets if they're started early enough. This patch splits the mworker_pipe_register() function in two, so that the close() of the write side of the FD is performed very early after the fork() and long before threads are created (we don't need to delay it anyway). Only the pipe registration is done in the threaded code since it is important that the pollers are properly allocated for this. The mworker_pipe_register() function now takes care of registering the pipe only once, and this is guaranteed by a new surrounding lock. The call to protocol_enable_all() looks fragile in theory since it scans the list of proxies and their listeners, though in practice all threads scan the same list and take the same locks for each listener so it's not possible that any of them escapes the process and finishes before all listeners are started. And the operation is idempotent. This fix must be backported to 1.8. Thanks to Marc for providing very detailed traces clearly showing the problem.	2018-01-23 19:18:57 +01:00
Willy Tarreau	c9c8378c2b	MINOR: fd: add a bitmask to indicate that an FD is known by the poller Some pollers like epoll() need to know if the fd is already known or not in order to compute the operation to perform (add, mod, del). For now this is performed based on the difference between the previous FD state and the new state but this will not be usable anymore once threads become responsible for their own polling. Here we come with a different approach : a bitmask is stored with the fd to indicate which pollers already know it, and the pollers will be able to simply perform the add/mod/del operations based on this bit combined with the new state. This patch only adds the bitmask declaration and initialization, it is not yet used. It will be needed by the next two fixes and will need to be backported to 1.8.	2018-01-23 15:42:57 +01:00
Willy Tarreau	ebc78d78a2	BUG/MEDIUM: fd: maintain a per-thread update mask Since the fd update tables are per-thread, we need to have a bit per thread to indicate whether an update exists, otherwise this can lead to lost update events every time multiple threads want to update the same FD. In practice for now, it only happens at start time when listeners are enabled and ask for polling after facing their first EAGAIN. But since the pollers are still shared, a lost event is still recovered by a neighbor thread. This will not reliably work anymore with per-thread pollers, where it has been observed a few times on startup that a single-threaded listener would not always accept incoming connections upon startup. It's worth noting that during this code review it appeared that the "new" flag in the fdtab isn't used anymore. This fix should be backported to 1.8.	2018-01-23 15:41:19 +01:00
Christopher Faulet	69553fe62c	MINOR: threads/fd: Use a bitfield to know if there are FDs for a thread in the FD cache A bitfield has been added to know if there are some FDs processable by a specific thread in the FD cache. When a FD is inserted in the FD cache, the bits corresponding to its thread_mask are set. On each thread, the bitfield is updated when the FD cache is processed. If there is no FD processed, the thread is removed from the bitfield by unsetting its tid_bit. Note that this bitfield is updated but not checked in fd_process_cached_events. So, when this function is called, the FDs cache is always processed. [wt: should be backported to 1.8 as it will help fix a design limitation]	2018-01-23 15:39:10 +01:00
Willy Tarreau	d80cb4ee13	MINOR: global: add some global activity counters to help debugging A number of counters have been added at special places helping better understanding certain bug reports. These counters are maintained per thread and are shown using "show activity" on the CLI. The "clear counters" commands also reset these counters. The output is sent as a single write(), which currently produces up to about 7 kB of data for 64 threads. If more counters are added, it may be necessary to write into multiple buffers, or to reset the counters. To backport to 1.8 to help collect more detailed bug reports.	2018-01-23 15:38:33 +01:00
Willy Tarreau	421f02e738	MINOR: threads: add a MAX_THREADS define instead of LONGBITS This one allows not to inflate some structures when threads are disabled. Now struct global is 1.4 kB instead of 33 kB. Should be backported to 1.8 for ease of backporting of upcoming patches.	2018-01-23 15:28:20 +01:00
Willy Tarreau	f4571a027f	MINOR: global/threads: move cpu_map at the end of the global struct The "thread" part is 32kB long, better move it at the end of the structure since it's only used during initialization, to keep the rest grouped together. Should be backported to 1.8 to ease backporting of upcoming patches, no functional impact.	2018-01-23 15:27:52 +01:00
Christopher Faulet	336d3ef0e7	MINOR: spoe: add register-var-names directive in spoe-agent configuration In addition to "option force-set-var", recently added, this directive can be used to selectively register unknown variable names, without totally relaxing their registration during the runtime, like "option force-set-var" does. So there is no way for a malicious agent to exhaust memory by defining too many variable names. On the other hand, you need to enumerate all variable names. This could be painful in some circumstances. Remember, this directive is only useful when the variable names are not referenced anywhere in the HAProxy configuration or the SPOE one. Thanks to Etienne Carrière for his help on this part.	2018-01-15 13:47:27 +01:00
David Carlier	ec5e84552a	BUILD/MINOR: ancient gcc versions atomic fix Commit `1a69af6d38` introduced code for atomic prior to 4.7. Unfortunately clang also uses those constants, which is misleading.	2018-01-11 15:31:07 +01:00
Willy Tarreau	1a69af6d38	MINOR: hathreads: add support for gcc < 4.7 Till now the use of __atomic_* gcc builtins required gcc >= 4.7. Since some supported and quite common operating systems like CentOS 6 still come with older versions (4.4) and the mapping to the older builtins is reasonably simple, let's implement it. This code is only used for gcc < 4.7. It has been quickly tested on a machine using gcc 4.4.4 and provided expected results. This patch should be backported to 1.8.	2018-01-10 07:51:56 +01:00
Olivier Houchard	2ec2db9725	MINOR: dns: Handle SRV record weight correctly. A SRV record weight can range from 0 to 65535, while haproxy weight goes from 0 to 256, so we have to divide it by 256 before handing it to haproxy. Also, a SRV record with a weight of 0 doesn't mean the server shouldn't be used, so use a minimum weight of 1. This should probably be backported to 1.8.	2018-01-09 15:43:11 +01:00
Olivier Houchard	e2a34967a9	CLEANUP: rbtree: remove Remove the rbtree implementation. It's not used, it's not even connected to the build, and we probably have no use for it.	2018-01-05 10:56:32 +01:00
Willy Tarreau	3083276187	MINOR: h2: add a function to report pseudo-header names For debugging we need to be able to dump pseudo headers when we know their name, let's put this there as we already have the other way around.	2017-12-30 17:17:07 +01:00
Willy Tarreau	a48c141f44	BUG/MAJOR: connection: refine the situations where we don't send shutw() Since commit `f9ce57e` ("MEDIUM: connection: make conn_sock_shutw() aware of lingering"), we refrain from performing the shutw() on the socket if there is no lingering risk. But there is a problem with this in tunnel and in TCP modes where a client is explicitly allowed to send a shutw to the server, even though it is risky. Not doing it creates this situation reported by Ricardo Fraile and diagnosed by Christopher : a typical HTTP client (eg: curl) connecting via the config below to an HTTP server would receive its response, immediately close while the server remains in keep-alive mode. The shutr() received by haproxy from the client is "propagated" to the server side but not acted upon because fdtab[fd].linger_risk is set, so we expect that the next close will immediately complete this operation. listen proxy-tcp bind 127.0.0.1:8888 mode tcp timeout connect 5s timeout server 10s timeout client 10s server server1 127.0.0.1:8000 But since the whole stream will not end until the server closes in turn, the server doesn't close and haproxy expires on server timeout. This problem has already struck by waking up an older bug and was partially fixed with commit `8059351` ("BUG/MEDIUM: http: don't disable lingering on requests with tunnelled responses") though it was not enough. The problem is that linger_risk is not suited here. In fact we need to know whether or not it is desired to close normally or silently, and whether or not a shutr() has already been received on this connection. This is the approach this patch takes, and it solves the problem for the various difficult modes (tcp, http-server-close, pretend-keepalive). This fix needs to be backported to 1.8. Many thanks to Ricardo for providing very detailed traces and configurations.	2017-12-22 18:54:05 +01:00
Willy Tarreau	0ad8e0dfea	MINOR: http: add a function to check request's cache-control header field The new function check_request_for_cacheability() is used to check if a request may be served from the cache, and/or allows the response to be stored into the cache. For this it checks the cache-control and pragma header fields, and adjusts the existing TX_CACHEABLE and a new TX_CACHE_IGNORE flags. For now, just like its response side counterpart, it only checks the first value of the header field. These functions should be reworked to improve their parsers and validate all elements.	2017-12-22 17:56:17 +01:00
Willy Tarreau	984fca9363	MINOR: stream-int: set flag SI_FL_CLEAN_ABRT when mux supports clean aborts By copying the info in the stream interface that the mux cleanly reports aborts, we'll have the ability to check this flag wherever needed regardless of the presence of a mux or not.	2017-12-20 16:56:32 +01:00
Willy Tarreau	28f1cb9da2	MINOR: mux: add flags to describe a mux's capabilities This new field will be used to describe certain properties of some muxes. For now we only add MX_FL_CLEAN_ABRT to indicate that a mux is able to unambiguously report aborts using CS_FL_ERROR contrary to others who may only report it via a read0. This will be used to improve handling of the abortonclose option with H2. Other flags may come later to report multiplexing capabilities or not, support of client/server sides etc.	2017-12-20 16:31:30 +01:00
Etienne Carriere	aec8989e53	MINOR: spoe: add force-set-var option in spoe-agent configuration For security reasons, the spoe filter was only able to change values of existing variables. In specific cases (ex : with LUA code), the names of variables are unknown at the configuration parsing phase. The force-set-var option can be enabled to register all variables.	2017-12-20 08:55:18 +01:00
Willy Tarreau	3c8294b607	MINOR: conn_stream: add new flag CS_FL_RCV_MORE to indicate pending data Due to the nature of multiplexed protocols, it will often happen that some operations are only performed on full frames, preventing any partial operation from being performed. HTTP/2 is one such example. The current MUX API causes a problem here because the rcv_buf() function has no way to let the stream layer know that some data could not be read due to a lack of room in the buffer, but that data are definitely present. The problem with this is that the stream layer might not know it needs to call the function again after it has made some room. And if the frame in the buffer is not followed by any other, nothing will move anymore. This patch introduces a new conn_stream flag CS_FL_RCV_MORE whose purpose is to indicate on the stream that more data than what was received are already available for reading as soon as more room will be available in the buffer. This patch doesn't make use of this flag yet, it only declares it. It is expected that other similar flags may come in the future, such as reports of pending end of stream, errors or any such event that might save the caller from having to poll, or simply let it know that it can take some actions after having processed data.	2017-12-10 21:13:25 +01:00
Thierry FOURNIER	cb14688496	BUG/MEDIUM: lua/notification: memory leak The thread patches added a refcount for notifications. The notifications are used with the Lua cosocket. This refcount frees the notifications when the session is cleared. A Lua task has no session, so its notifications are never cleared. This patch adds a garbage collector for signals. The garbage collector just cleans the notifications for which the end point is disconnected. This patch should be backported to 1.8.	2017-12-10 19:38:58 +01:00
Thierry FOURNIER	d5b79835f8	DOC: notifications: add precisions about thread usage Clarify the terms of use of the notification functions.	2017-12-10 19:38:55 +01:00
Emeric Brun	ece0c334bd	BUG/MEDIUM: ssl engines: Fix async engines fds were not considered to fix fd limit automatically. The number of async fd is computed considering the maxconn, the number of sides using ssl and the number of engines using async mode. This patch should be backported on haproxy 1.8	2017-12-06 14:17:41 +01:00
Willy Tarreau	6c71e4696b	BUG/MAJOR: hpack: don't pretend large headers fit in empty table In hpack_dht_make_room(), we try to fulfill this rule from RFC7541#4.4 : "It is not an error to attempt to add an entry that is larger than the maximum size; an attempt to add an entry larger than the maximum size causes the table to be emptied of all existing entries and results in an empty table." Unfortunately it is not consistent with the way it's used in hpack_dht_insert() as this last one will consider a success as a confirmation it can copy the header into the table, and a failure as an indexing error. This results in the two following issues : - if a client sends too large a header into an empty table, this header may overflow the table. Fortunately, most clients send small headers like :authority first, and never mark headers that don't fit into the table as indexable since it is counter-productive ; - if a client sends too large a header into a populated table, the operation fails after the table is totally flushed and the request is not processed. This patch fixes the two issues at once : - a header not fitting into an empty table is always a sign that it will never fit ; - not fitting into the table is not an error Thanks to Yves Lafon for reporting detailed traces demonstrating this issue. This fix must be backported to 1.8.	2017-12-04 18:06:51 +01:00
Willy Tarreau	d85ba4e092	BUG/MINOR: hpack: reject invalid header index If the hpack decoder sees an invalid header index, it emits value "### ERR ###" that was used during debugging instead of rejecting the block. This is harmless, and was detected by h2spec. To backport to 1.8.	2017-12-03 21:08:39 +01:00
Emeric Brun	0fed0b0a38	BUG/MEDIUM: peers: fix some track counter rules don't register entries for sync. This BUG was introduced with: 'MEDIUM: threads/stick-tables: handle multithreads on stick tables' The API was reviewed to handle stick table entry updates asynchronously and the caller must now call a 'stktable_touch_' function each time the content of an entry is modified to register the entry to be synced. A call to stktable_touch_ was missing, so entries were not propagated to remote peers (or to the local one during a reload).	2017-11-29 19:16:22 +01:00
Willy Tarreau	ec7464726f	BUILD: checks: don't include server.h server.h needs checks.h since it references the struct check, but depending on the include order it will fail if check.h is included first due to this one including server.h in turn while it doesn't need it.	2017-11-29 10:54:05 +01:00
Willy Tarreau	b306650c2a	[RELEASE] Released version 1.9-dev0 Released version 1.9-dev0 with the following main changes : - BUG/MEDIUM: stream: don't automatically forward connect nor close - BUG/MAJOR: stream: ensure analysers are always called upon close - BUG/MINOR: stream-int: don't try to read again when CF_READ_DONTWAIT is set - MEDIUM: mworker: Add systemd `Type=notify` support - BUG/MEDIUM: cache: free callback to remove from tree - CLEANUP: cache: remove unused struct - MEDIUM: cache: enable the HTTP analysers - CLEANUP: cache: remove wrong comment - MINOR: threads/atomic: rename local variables in macros to avoid conflicts - MINOR: threads/plock: rename local variables in macros to avoid conflicts - MINOR: threads/atomic: implement pl_mb() in asm on x86 - MINOR: threads/atomic: implement pl_bts() on non-x86 - MINOR: threads/build: atomic: replace the few inlines with macros - BUILD: threads/plock: fix a build issue on Clang without optimization - BUILD: ebtree: don't redefine types u32/s32 in scope-aware trees - BUILD: compiler: add a new type modifier __maybe_unused - BUILD: h2: mark some inlined functions "unused" - BUILD: server: check->desc always exists - BUG/MEDIUM: h2: properly report connection errors in headers and data handlers - MEDIUM: h2: add a function to emit an HTTP/1 request from a headers list - MEDIUM: h2: change hpack_decode_headers() to only provide a list of headers - BUG/MEDIUM: h2: always reassemble the Cookie request header field - BUG/MINOR: systemd: ignore daemon mode - CONTRIB: spoa_example: allow to compile outside HAProxy. 
- CONTRIB: spoa_example: remove bref, wordlist, cond_wordlist - CONTRIB: spoa_example: remove last dependencies on type "sample" - CONTRIB: spoa_example: remove SPOE enums that are useless for clients - CLEANUP: cache: reorder includes - MEDIUM: shctx: use unsigned int for len and block_count - MEDIUM: cache: "show cache" on the cli - BUG/MEDIUM: cache: use key=0 as a condition for freeing - BUG/MEDIUM: cache: refcount forbids to free the objects - BUG/MEDIUM: cache fix cli_kws structure - BUG/MEDIUM: deinit: correctly deinitialize the proxy and global listener tasks - BUG/MINOR: ssl: Always start the handshake if we can't send early data. - MINOR: ssl: Don't disable early data handling if we could not write. - MINOR: pools: prepare functions to override malloc/free in pools - MINOR: pools: implement DEBUG_UAF to detect use after free - BUG/MEDIUM: threads/time: fix time drift correction - BUG/MEDIUM: threads/time: maintain a common time reference between all threads - MINOR: sample: Add "thread" sample fetch - BUG/MINOR: Use crt_base instead of ca_base when crt is parsed on a server line - BUG/MINOR: stream: fix tv_request calculation for applets - BUG/MAJOR: h2: always remove a stream from the send list before freeing it - BUG/MAJOR: threads/task: dequeue expired tasks under the WQ lock - MINOR: ssl: Handle reading early data after writing better. - MINOR: mux: Make sure every string is woken up after the handshake. 
- MEDIUM: cache: store sha1 for hashing the cache key - MINOR: http: implement the "http-request reject" rule - MINOR: h2: send RST_STREAM before GOAWAY on reject - MEDIUM: h2: don't gracefully close the connection anymore on Connection: close - MINOR: h2: make use of client-fin timeout after GOAWAY - MEDIUM: config: ensure that tune.bufsize is at least 16384 when using HTTP/2 - MINOR: ssl: Handle early data with BoringSSL - BUG/MEDIUM: stream: always release the stream-interface on abort - BUG/MEDIUM: cache: free ressources in chn_end_analyze - MINOR: cache: move the refcount decrease in the applet release - BUG/MINOR: listener: Allow multiple "process" options on "bind" lines - MINOR: config: Support a range to specify processes in "cpu-map" parameter - MINOR: config: Slightly change how parse_process_number works - MINOR: config: Export parse_process_number and use it wherever it's applicable - MINOR: standard: Add my_ffsl function to get the position of the bit set to one - MINOR: config: Add auto-increment feature for cpu-map - MINOR: config: Support partial ranges in cpu-map directive - MINOR:: config: Remove thread-map directive - MINOR: config: Add the threads support in cpu-map directive - MINOR: config: Add threads support for "process" option on "bind" lines - MEDIUM: listener: Bind listeners on a thread subset if specified - CLEANUP: debug: Use DPRINTF instead of fprintf into #ifdef DEBUG_FULL/#endif - CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning - MINOR/CLEANUP: proxy: rename "proxy" to "proxies_list" - CLEANUP: pools: rename all pool functions and pointers to remove this "2" - DOC: update the roadmap file with the latest changes merged in 1.8 - DOC: fix mangled version in peers protocol documentation - DOC: add initial peers protovol v2.0 documentation. 
- DOC: mention William as maintainer of the cache and master-worker - DOC: add Christopher and Emeric as maintainers of the threads - MINOR: cache: replace a fprint() by an abort() - MEDIUM: cache: max-age configuration keyword - DOC: explain HTTP2 timeout behavior - DOC: cache: configuration and management - MAJOR: mworker: exits the master on failure - BUG/MINOR: threads: don't drop "extern" on the lock in include files - MINOR: task: keep a pointer to the currently running task - MINOR: task: align the rq and wq locks - MINOR: fd: cache-align fdtab and fdcache locks - MINOR: buffers: cache-align buffer_wq_lock - CLEANUP: server: reorder some fields in struct server to save 40 bytes - CLEANUP: proxy: slightly reorder the struct proxy to reduce holes - CLEANUP: checks: remove 16 bytes of holes in struct check - CLEANUP: cache: more efficiently pack the struct cache - CLEANUP: fd: place the lock at the beginning of struct fdtab - CLEANUP: pools: align pools on a cache line - DOC: config: add a few bits about how to configure HTTP/2 - BUG/MAJOR: threads/queue: avoid recursive locking in pendconn_get_next_strm() - BUILD: Makefile: reorder object files by size	2017-11-26 19:50:17 +01:00
Willy Tarreau	103e5663c8	BUG/MAJOR: threads/queue: avoid recursive locking in pendconn_get_next_strm() pendconn_get_next_strm() is called from process_srv_queue() under the server lock, and calls stream_add_srv_conn() with this lock held, while the latter tries to take it again. This results in a deadlock when a server's maxconn is reached and haproxy is built with thread support.	2017-11-26 18:50:30 +01:00
Willy Tarreau	1ca1b70cf9	CLEANUP: pools: align pools on a cache line There are just a few pools, and they're stressed a lot, so it makes sense to dedicate them a cache line to avoid contention and to place the lock at the beginning.	2017-11-26 11:10:53 +01:00
Willy Tarreau	5809052ae1	CLEANUP: fd: place the lock at the beginning of struct fdtab The struct is not cache line aligned but at least, every time the lock will appear in the same cache line as the fd it will benefit from being accessed first. This improves the performance by about 2% on fd-intensive workloads with 4 threads.	2017-11-26 11:10:53 +01:00
Willy Tarreau	08eaa78739	CLEANUP: checks: remove 16 bytes of holes in struct check These ones were easily recovered by swapping two members.	2017-11-26 11:10:52 +01:00
Willy Tarreau	a51108443e	CLEANUP: proxy: slightly reorder the struct proxy to reduce holes 16 bytes were recovered from the struct doing minimal reordering.	2017-11-26 11:10:52 +01:00
Willy Tarreau	d7e33bbe2f	CLEANUP: server: reorder some fields in struct server to save 40 bytes In 1.8 many holes were introduced in struct server, so let's slightly reorder a few fields to plug most of them. This saves 40 bytes in the struct.	2017-11-26 11:10:52 +01:00
Willy Tarreau	8b94969054	MINOR: fd: cache-align fdtab and fdcache locks These locks are highly contended, let's not make them share cache lines.	2017-11-26 11:10:51 +01:00
Willy Tarreau	53bae85b8e	BUG/MINOR: threads: don't drop "extern" on the lock in include files Commit `9dcf9b6` ("MINOR: threads: Use __decl_hathreads to declare locks") accidentally lost a few "extern" in certain lock declarations, possibly causing certain entries to be declared at multiple places. Apparently it hasn't caused any harm though. The offending ones were : - fdtab_lock - fdcache_lock - poll_lock - buffer_wq_lock	2017-11-26 11:10:50 +01:00
William Lallemand	4cfede87a3	MAJOR: mworker: exits the master on failure This patch changes the behavior of the master during the exit of a worker. When a worker exits with an error code, for example in the case of a segfault, all workers are now killed and the master leaves. If you don't want this behavior you can use the option "master-worker no-exit-on-failure".	2017-11-24 22:48:27 +01:00
Willy Tarreau	bafbe01028	CLEANUP: pools: rename all pool functions and pointers to remove this "2" During the migration to the second version of the pools, the new functions and pool pointers were all called "pool_something2()" and "pool2_something". Now there's no more pool v1 code and it's a real pain to still have to deal with this. Let's clean this up now by removing the "2" everywhere, and by renaming the pool heads "pool_head_something".	2017-11-24 17:49:53 +01:00
Olivier Houchard	fbc74e8556	MINOR/CLEANUP: proxy: rename "proxy" to "proxies_list" Rename the global variable "proxy" to "proxies_list". There's been multiple proxies in haproxy for quite some time, and "proxy" is a potential source of bugs, a number of functions have a "proxy" argument, and some code used "proxy" when it really meant "px" or "curproxy". It worked by pure luck, because it usually happened while parsing the config, and thus "proxy" pointed to the currently parsed proxy, but we should probably not rely on this. [wt: some of these are definitely fixes that are worth backporting]	2017-11-24 17:21:27 +01:00
Christopher Faulet	767a84bcc0	CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning	2017-11-24 17:19:12 +01:00
Christopher Faulet	c644fa9bf5	MINOR: config: Add threads support for "process" option on "bind" lines It is now possible on a "bind" line (or a "stats socket" line) to specify the thread set allowed to process listener's connections. For instance: # HTTPS connections will be processed by all threads but the first and HTTP # connection will be processed on the first thread. bind :80 process 1/1 bind :443 ssl crt mycert.pem process 1/2-	2017-11-24 15:38:50 +01:00
Christopher Faulet	cb6a94510d	MINOR: config: Add the threads support in cpu-map directive Now, it is possible to bind CPU at the thread level instead of the process level by defining a thread set in "cpu-map" directives. Thus, its format is now: cpu-map [auto:]<process-set>[/<thread-set>] <cpu-set>... where <process-set> and <thread-set> must follow the format: all \| odd \| even \| number[-[number]] Having a process range and a thread range in same time with the "auto:" prefix is not supported. Only one range is supported, the other one must be a fixed number. But it is allowed when there is no "auto:" prefix. Because it is possible to define a mapping for a process and another for a thread on this process, threads will be bound on the intersection of their mapping and the one of the process on which they are attached. If the intersection is null, no specific binding will be set for the threads.	2017-11-24 15:38:50 +01:00
Christopher Faulet	26028f6209	MINOR: config: Add auto-increment feature for cpu-map The prefix "auto:" can be added before the process set to let HAProxy automatically bind a process to a CPU by incrementing process and CPU sets. To be valid, both sets must have the same size. No matter the declaration order of the CPU sets, it will be bound from the lower to the higher bound. Examples: # all these lines bind the process 1 to the cpu 0, the process 2 to cpu 1 # and so on. cpu-map auto:1-4 0-3 cpu-map auto:1-4 0-1 2-3 cpu-map auto:1-4 3 2 1 0 # bind each process to exactly one CPU using all/odd/even keyword cpu-map auto:all 0-63 cpu-map auto:even 0-31 cpu-map auto:odd 32-63 # invalid cpu-map because process and CPU sets have different sizes. cpu-map auto:1-4 0 # invalid cpu-map auto:1 0-3 # invalid	2017-11-24 15:38:49 +01:00
Christopher Faulet	ff8131861f	MINOR: standard: Add my_ffsl function to get the position of the bit set to one	2017-11-24 15:38:49 +01:00
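The commit message above only says that my_ffsl() returns the position of the bit set to one. As a rough illustration of a find-first-set helper for an unsigned long, here is a minimal sketch. The name ffsl_sketch() is hypothetical, and the 1-based numbering with 0 for "no bit set" is an assumption borrowed from the usual ffs() convention, not something confirmed by this log.

```c
/* Hypothetical helper, not HAProxy's my_ffsl(): returns the 1-based
 * position of the least significant bit set in <x>, or 0 when no bit
 * is set (assumed convention, same as the classic ffs()).
 */
static inline unsigned int ffsl_sketch(unsigned long x)
{
	unsigned int pos = 1;

	if (!x)
		return 0;

	/* shift right until the lowest set bit reaches bit 0 */
	while (!(x & 1UL)) {
		x >>= 1;
		pos++;
	}
	return pos;
}
```

With this convention, a mask of 8 (binary 1000) yields position 4, which is the kind of result a parser needs when turning a process or thread mask back into a number.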
Christopher Faulet	f1f0c5f591	MINOR: config: Export parse_process_number and use it wherever it's applicable This function is used when "bind-process" directive is parsed and when "process" parameter on a "bind" or a "stats socket" line is parsed.	2017-11-24 15:38:49 +01:00
William Lallemand	f528fff46b	MEDIUM: cache: store sha1 for hashing the cache key The cache was relying on the txn->uri for creating its key, which was a big problem when there was no log activated. This patch does a sha1 of the host + uri, and stores it in the txn. When an object is stored, the eb32node uses the first 32 bits of the hash as a key, and the whole hash is stored in the cache entry. During a lookup, the truncated hash is used, and when it matches an entry we check the real sha1.	2017-11-23 20:20:04 +01:00
Olivier Houchard	90084a133d	MINOR: ssl: Handle reading early data after writing better. It can happen that we want to read early data, write some, and then continue reading them. To do so, we can't reuse tmp_early_data to store the amount of data sent, so introduce a new member. If we read early data, then ssl_sock_to_buf() is now the only responsible for getting back to the handshake, to make sure we don't miss any early data.	2017-11-23 19:35:28 +01:00
Willy Tarreau	158fa75811	MINOR: pools: implement DEBUG_UAF to detect use after free This code has been used successfully a few times in the past to detect that a pool was used after being freed. Its main goal is to allocate a full page for each object so that they are always released individually and unmapped from memory. This way if any part of the code references the object after it was freed and before it is reallocated, a segv occurs at the exact offending location. It does a few extra things such as writing to the memory area before freeing to detect double-frees and free of read-only areas, and placing the data at the end of the page instead of the beginning so that out of bounds accesses are easier to spot. The amount of memory used with this is huge (about 10 times the regular usage) but it can be useful sometimes.	2017-11-22 19:43:57 +01:00
Willy Tarreau	f13322ede1	MINOR: pools: prepare functions to override malloc/free in pools This will be useful to add some debugging capabilities. For now it changes nothing.	2017-11-22 19:27:44 +01:00
William Lallemand	111bfef33c	MEDIUM: shctx: use unsigned int for len and block_count Allows bigger objects to be cached in the shctx, the first implementation was only storing small ssl session, but we want to store bigger HTTP response.	2017-11-21 21:35:04 +01:00
Willy Tarreau	59a10fb53d	MEDIUM: h2: change hpack_decode_headers() to only provide a list of headers The current H2 to H1 protocol conversion presents some issues which will require to perform some processing on certain headers before writing them so it's not possible to convert HPACK to H1 on the fly. This commit modifies the headers decoding so that it now works in two phases : hpack_decode_headers() only decodes the HPACK stream in the HEADERS frame and puts the result into a list. Headers which require storage (huffman-compressed or from the dynamic table) are stored in a chunk allocated by the H2 demuxer. Then once the headers are properly decoded into this list, h2_make_h1_request() is called with this list to produce the HTTP/1.1 request into the destination buffer. The list necessarily enforces a limit. Here we use 2*MAX_HTTP_HDR, which means that we can have as many individual cookies as we have regular headers if a client decides to break their cookies into multiple values. This seems reasonable and will allow the H1 parser to decide whether it's too much or not. Thus the output stream is not produced on the fly anymore and this will permit to deal with certain corner cases like repairing the Cookie header (which for now is not done). In order to limit header duplication and parsing, the known pseudo headers continue to be passed by their index : the name element in the list then has a NULL pointer and the value is the pseudo header's index. Given that these ones represent about half of the incoming requests and need to be found quickly, it maintains an acceptable level of performance. The code was significantly reduced by doing this because the original code had to deal with HPACK and H1 combinations (eg: index vs not indexed, etc) and now the HPACK decoding is totally focused on the decompression, and the H1 encoding doesn't have to deal with the issue of wrapping input for example. One bug was addressed here (though it couldn't happen at the moment). 
The H2 demuxer used to detect a failure to write the request into the H1 buffer and would then detect if the output buffer wraps, realign it and try again. The problem by doing so was that the HPACK context was already modified and not rewindable. Thus the size check is now performed first and a failure is reported if it doesn't fit.	2017-11-21 21:13:36 +01:00
Willy Tarreau	f24ea8e45e	MEDIUM: h2: add a function to emit an HTTP/1 request from a headers list The current H2 to H1 protocol conversion presents some issues which will require to perform some processing on certain headers before writing them so it's not possible to convert HPACK to H1 on the fly. Here we introduce a function which performs half of what hpack_decode_header() used to do, which is to take a list of headers on input and emit the corresponding request in HTTP/1.1 format. The code is the same and functions were renamed to be prefixed with "h2" instead of "hpack", though it ends up being simpler as the various HPACK-specific cases could be fused into a single one (ie: add header). Moving this part here makes a lot of sense as now this code is specific to what is documented in HTTP/2 RFC 7540 and will be able to deal with special cases related to H2 to H1 conversion enumerated in section 8.1. Various error codes which were previously assigned to HPACK were never used (aside being negative) and were all replaced by -1 with a comment indicating what error was detected. The code could be further factored thanks to this but this commit focuses on compatibility first. This code is not yet used but builds fine.	2017-11-21 21:13:33 +01:00
Willy Tarreau	dbd25fc75a	BUILD: compiler: add a new type modifier __maybe_unused While gcc only emits warnings about unused static functions, Clang also emits such a warning when the functions are inlined. This is a bit annoying at certain places where functions are provided to manipulate multiple data types and are not yet used. Let's have a type modifier "__maybe_unused" which sets the "unused" attribute like the Linux kernel does. It's elegant as it allows the code author to indicate that it knows that this element might be unused. It works on variables as well, which is convenient to remove ifdefs around local variables in certain functions, but doesn't work on labels.	2017-11-20 21:27:27 +01:00
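To make the modifier described above concrete, here is a minimal sketch. The macro definition is an assumption modelled on the Linux kernel's (the commit only says it sets the "unused" attribute), and add_one_sketch() is a hypothetical function used purely for illustration. It relies on the GCC/Clang `__attribute__((unused))` extension.

```c
/* Assumed definition: the commit says the modifier sets the "unused"
 * attribute like the Linux kernel does; the exact macro may differ.
 */
#ifndef __maybe_unused
#define __maybe_unused __attribute__((unused))
#endif

/* Hypothetical inline helper: without the modifier, Clang may warn
 * when no caller exists in the translation unit.
 */
static inline __maybe_unused int add_one_sketch(int v)
{
	/* also usable on a local variable that only some builds need */
	__maybe_unused int debug_copy = v;

	return v + 1;
}
```

The same annotation on a local variable is what lets an #ifdef around its declaration be dropped, as the commit message mentions.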
Willy Tarreau	2532bd2f81	BUILD: threads/plock: fix a build issue on Clang without optimization [ plock commit 4c53fd3a0b2b1892817cebd0db012a52f4087850 ] Pieter Baauw reported a build issue affecting haproxy after plock was included. It happens that expressions of the form : if ((const) ? (expr1) : (expr2)) do_something() always produce code for both expr1 and expr2 on Clang when building without optimization. The resulting asm code is even funny, basically doing : mov reg, 1 cmp reg, 1 ... This causes our sizeof() tests to fail to build because we purposely dereference a fake function that reports the location and nature of the inconsistency, but this fake function appears in the object code despite all conditions being there to avoid it. However the compiler is still smart enough to optimize away code doing if (const) do_something() So we simply repeat the condition before do_something(), and the dummy function is not referenced anymore unless really required.	2017-11-20 21:06:35 +01:00
Willy Tarreau	b5f271555e	MINOR: threads/build: atomic: replace the few inlines with macros [ plock commit 61e255286ae32e83e1a3174dd7c49eda99880a8b] There are a few inlines such as pl_barrier() and pl_cpu_relax() which are used a lot. Unfortunately, while building test code at -O0, inlining is disabled and these ones are called a lot and show up a lot in any profile, are traced into when single-stepping with a debugger, etc, thus they are polluting the landscape. Since they're single-asm statements, there is no reason for not turning them into macros. The result becomes fairly visible here at -O0 : $ size latency.inline latency.macro text data bss dec hex filename 11431 692 656 12779 31eb treelock.inline 10967 692 656 12315 301b treelock.macro And it was verified that regularly optimized code remains strictly identical.	2017-11-20 21:06:35 +01:00
Willy Tarreau	d0d8ba59d3	MINOR: threads/atomic: implement pl_bts() on non-x86 [ plock commit da17ba320aad3a8faf08e36fca604de9cad21fdd ] This one was missing, it can be done using sync_fetch_and_or().	2017-11-20 21:06:03 +01:00
Willy Tarreau	01b8398b9e	MINOR: threads/atomic: implement pl_mb() in asm on x86 [ plock commit 44081ea493dd78dab48076980e881748e9b33db5 ] Older compilers (eg: gcc 3.4) don't provide __sync_synchronize() so let's do it by hand on this platform.	2017-11-20 20:45:47 +01:00
Willy Tarreau	f7ba77eb80	MINOR: threads/plock: rename local variables in macros to avoid conflicts [ plock commit b155d5c762fb9a9793911881f80e61faa6b0e889 ] Local variables "l", "i" and "ret" were renamed "__pl_l", "__pl_i" and "__pl_r" respectively, to limit the risk of conflicts with existing variables in application code.	2017-11-20 20:45:43 +01:00
Willy Tarreau	98409e34ca	MINOR: threads/atomic: rename local variables in macros to avoid conflicts [ plock commit bfac5887ebabb8ef753b0351f162265767eb219b ] Local variable "t" was renamed "__pl_t" to limit the risk of conflicts with existing variables in application code.	2017-11-20 20:45:38 +01:00
William Lallemand	71bd11a1f3	MEDIUM: cache: enable the HTTP analysers Enable the same analysers as the stats applet. Allows keepalive and termination flags to work.	2017-11-20 19:22:27 +01:00
William Lallemand	44e259c0b7	CLEANUP: cache: remove unused struct Remove unused structure which remain from old dev.	2017-11-20 19:22:27 +01:00
Tim Duesterhus	d6942c8297	MEDIUM: mworker: Add systemd `Type=notify` support This patch adds support for `Type=notify` to the systemd unit. Supporting `Type=notify` improves both starting as well as reloading of the unit, because systemd is told when the action has completed. See this quote from `systemd.service(5)`: > Note however that reloading a daemon by sending a signal (as with the > example line above) is usually not a good choice, because this is an > asynchronous operation and hence not suitable to order reloads of > multiple services against each other. It is strongly recommended to > set ExecReload= to a command that not only triggers a configuration > reload of the daemon, but also synchronously waits for it to complete. By making systemd aware of a reload in progress it is able to wait until the reload actually succeeded. This patch introduces both a new `USE_SYSTEMD` build option which controls including the sd-daemon library as well as a `-Ws` runtime option which runs haproxy in master-worker mode with systemd support. When haproxy is running in master-worker mode with systemd support it will send status messages to systemd using `sd_notify(3)` in the following cases: - The master process forked off the worker processes (READY=1) - The master process entered the `mworker_reload()` function (RELOADING=1) - The master process received the SIGUSR1 or SIGTERM signal (STOPPING=1) Change the unit file to specify `Type=notify` and replace master-worker mode (`-W`) with master-worker mode with systemd support (`-Ws`). Future evolutions of this feature could include making use of the `STATUS` feature of `sd_notify()` to send information about the number of active connections to systemd. This would require bidirectional communication between the master and the workers and thus is left for future work.	2017-11-20 18:39:41 +01:00
Olivier Houchard	e6060c5d87	MINOR: SSL: Store the ASN1 representation of client sessions. Instead of storing the SSL_SESSION pointer directly in the struct server, store the ASN1 representation, otherwise, session resumption is broken with TLS 1.3, when multiple outgoing connections want to use the same session.	2017-11-16 19:03:32 +01:00
Christopher Faulet	595d7b72a6	MINOR: applets: Use a bitfield to track applets activity per-thread A bitfield has been added to know if there are runnable applets for a thread. When an applet is woken up, the bits corresponding to its thread_mask are set. When all active applets for a thread get processed, the thread is removed from active ones by unsetting its tid_bit from the bitfield.	2017-11-16 11:19:46 +01:00
Christopher Faulet	3911ee85df	MINOR: tasks: Use a bitfield to track tasks activity per-thread A bitfield has been added to know if there are runnable tasks for a thread. When a task is woken up, the bits corresponding to its thread_mask are set. When all tasks for a thread have been evaluated without any wakeup, the thread is removed from active ones by unsetting its tid_bit from the bitfield.	2017-11-16 11:19:46 +01:00
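The per-thread activity bitfield described in the two commits above can be pictured with the following minimal sketch. All names are hypothetical, and the model is deliberately single-threaded: real code shared between threads would need atomic operations, which are left out here.

```c
/* Hypothetical, single-threaded model of the activity bitfield; bit N
 * is set when thread N has at least one runnable task. Real
 * multithreaded code would need atomic updates (omitted here).
 */
static unsigned long active_mask_sketch;

/* waking a task marks every thread allowed to run it as active */
static void wakeup_sketch(unsigned long thread_mask)
{
	active_mask_sketch |= thread_mask;
}

/* a thread that evaluated all its tasks without any wakeup clears
 * its own bit
 */
static void mark_idle_sketch(unsigned long tid_bit)
{
	active_mask_sketch &= ~tid_bit;
}

/* cheap test a thread can do before scanning the run queue */
static int has_work_sketch(unsigned long tid_bit)
{
	return (active_mask_sketch & tid_bit) != 0;
}
```

The point of such a mask is that a thread can skip the run queue entirely with a single bit test when nothing is runnable for it.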
William Lallemand	75ea0a06b0	BUG/MEDIUM: mworker: does not close inherited FD At the end of the master initialisation, a call to protocol_unbind_all() was made, in order to close all the FDs. Unfortunately, this function closes the inherited FDs (fd@), so upon reload the master wasn't able to reload a configuration with those FDs. The create_listeners() function now stores a flag to specify if the fd was inherited or not. Replace the protocol_unbind_all() by mworker_cleanlisteners() + deinit_pollers()	2017-11-15 19:53:33 +01:00
Willy Tarreau	9c1e15d8cd	MINOR: tools: emphasize the node being worked on in the tree dump Now we can show in dotted red the node being removed or surrounded in red a node having been inserted, and add a description on the graph related to the operation in progress for example.	2017-11-15 19:43:05 +01:00
Willy Tarreau	ed3cda02ae	MINOR: tools: add a function to dump a scope-aware tree to a file It emits a dump in DOT format for graphing purposes during debugging sessions. It's convenient to dump the run queue.	2017-11-15 16:07:15 +01:00
Christopher Faulet	99bca65f53	BUG/MEDIUM: standard: itao_str/idx and quote_str/idx must be thread-local This bug has an impact on the stats applet and easily leads to a crash of HAProxy. This is specific to threads, no backport is needed.	2017-11-14 18:11:57 +01:00
Christopher Faulet	e9a896e09e	BUG/MINOR: threads: tid_bit must be a unsigned long This is specific to threads, no backport is needed.	2017-11-14 18:11:28 +01:00
Christopher Faulet	fa5c812a6b	BUG/MINOR: buffers: Fix b_alloc_margin to be "fonctionnaly" thread-safe b_alloc_margin is, strictly speaking, thread-safe. It will not crash HAProxy. But its contract is not respected anymore in a multithreaded environment. In this function, we need to be sure to have <margin> buffers available in the pool after the allocation. So to have this guarantee, we must lock the memory pool during the whole operation. This also means we must call internal and lockless memory functions (prefixed with '__'). For the record, this patch fixes a pernicious bug that happens after a soft reload, where some streams can be blocked indefinitely, waiting for a buffer in the buffer_wq list. This happens because, during a soft reload, pool_gc2 is called, making some calls to b_alloc_fast fail. This is specific to threads, no backport is needed.	2017-11-13 11:42:48 +01:00
Christopher Faulet	9dcf9b6f03	MINOR: threads: Use __decl_hathreads to declare locks This macro should be used to declare variables or struct members depending on the USE_THREAD compile option. It avoids the encapsulation of such declarations between #ifdef/#endif. It is used to declare all lock variables.	2017-11-13 11:38:17 +01:00
Willy Tarreau	387bd4f69f	CLEANUP: global: introduce variable pid_bit to avoid shifts with relative_pid At a number of places, bitmasks are used for process affinity and to map listeners to processes. Every time 1UL<<(relative_pid-1) is used. Let's create a "pid_bit" variable corresponding to this value to clean this up.	2017-11-10 19:08:14 +01:00
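The cleanup above replaces a repeated shift with a precomputed value. A minimal sketch of the idea follows; the function names are hypothetical, and only the expression 1UL<<(relative_pid-1) comes from the commit message.

```c
/* Computes the bit for a 1-based relative process number, as in the
 * expression quoted by the commit: 1UL << (relative_pid - 1).
 * The caller must pass 1 <= relative_pid <= number of bits in a long.
 */
static unsigned long pid_bit_sketch(int relative_pid)
{
	return 1UL << (relative_pid - 1);
}

/* Hypothetical check: is this process part of a listener's mask? */
static int mask_has_pid_sketch(unsigned long proc_mask, unsigned long pid_bit)
{
	return (proc_mask & pid_bit) != 0;
}
```

Computing the bit once at startup and reusing it keeps every affinity test down to a single AND, and removes the off-by-one risk of rewriting the shift at each call site.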
Willy Tarreau	28b55c6fed	CLEANUP: mux: remove the unused "release()" function In commit `53a4766` ("MEDIUM: connection: start to introduce a mux layer between xprt and data") we introduced a release() function which ends up never being used. Let's get rid of it now.	2017-11-10 16:43:05 +01:00
Willy Tarreau	aa39860aef	MINOR: tools: don't use unlikely() in hex2i() This small inline function causes some pain to the compiler when used inside other functions due to its use of the unlikely() hint for non-digits. It causes the letters to be processed far away in the calling function and makes the code less efficient. Removing these unlikely() hints has increased the chunk size parsing by around 5%.	2017-11-10 11:19:54 +01:00
Willy Tarreau	b15e3fefc9	BUG/MEDIUM: h1: ensure the chunk size parser can deal with full buffers The HTTP/1 code always has the reserve left available so the buffer is never full there. But with HTTP/2 we have to deal with full buffers, and it happens that the chunk size parser cannot tell the difference between a full buffer and an empty one since it compares the start and the stop pointer. Let's change this to instead deal with the number of bytes left to process. As a side effect, this code ends up being about 10% faster than the previous one, even on HTTP/1.	2017-11-10 11:17:08 +01:00
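The ambiguity described above (a wrapping buffer whose start and stop pointers are equal may be either full or empty) disappears once the parser is driven by a byte count. The following is a minimal sketch of that approach, not the actual HAProxy parser: the function names, the ring-buffer layout and the absence of overflow or chunk-extension handling are all simplifying assumptions.

```c
#include <stddef.h>

/* Returns the value of hex digit <c>, or -1 if it is not one. */
static int hex_digit_sketch(char c)
{
	if (c >= '0' && c <= '9')
		return c - '0';
	if (c >= 'a' && c <= 'f')
		return c - 'a' + 10;
	if (c >= 'A' && c <= 'F')
		return c - 'A' + 10;
	return -1;
}

/* Hypothetical parser: reads hex digits from a wrapping buffer <buf>
 * of <size> bytes (size > 0), starting at offset <start>, with <len>
 * bytes left to process. Stores the value in <*value> and returns the
 * number of digits consumed. Overflow is not checked in this sketch.
 */
static size_t parse_chunk_size_sketch(const char *buf, size_t size,
                                      size_t start, size_t len,
                                      unsigned long *value)
{
	unsigned long val = 0;
	size_t used = 0;

	/* loop on the remaining byte count, never on pointer equality */
	while (used < len) {
		int d = hex_digit_sketch(buf[(start + used) % size]);

		if (d < 0)
			break;
		val = (val << 4) | (unsigned long)d;
		used++;
	}
	*value = val;
	return used;
}
```

With a 4-byte buffer holding "2b1a" and data starting at offset 2, a full buffer (len 4) and an empty one (len 0) have the same start and stop offsets, yet the count-based loop tells them apart.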
Christopher Faulet	c5a9d5bf23	BUG/MEDIUM: stream-int: Don't loss write's notifs when a stream is woken up When a write activity is reported on a channel, it is important to keep this information for the stream because it takes part in the triggering of the analyzers. When some data are written, the flag CF_WRITE_PARTIAL is set. It participates in the task's timeout updates and in the stream's waking. It is also used in the CF_MASK_ANALYSER mask to trigger channel analyzers. In the past, it was cleared by process_stream. Because of a bug (fixed in commit `95fad5ba4` ["BUG/MAJOR: stream-int: don't re-arm recv if send fails"]), it is now cleared before each send and in stream_int_notify. So it is possible to lose this information when process_stream is called, preventing analyzers from being called, and possibly leading to a stalled stream. Today, this happens in HTTP2 when you call the stats page or when you use the cache filter. In fact, this happens when the response is sent by an applet. In HTTP1, everything seems to work as expected. To fix the problem, we need to distinguish between the write activity reported to lower layers and the one reported to the stream. So the flag CF_WRITE_EVENT has been added to notify the stream of the write activity on a channel. It is set when a send succeeded and reset by process_stream. It is also used in CF_MASK_ANALYSER. Finally, it is checked in stream_int_notify to wake up a stream and in channel_check_timeouts. This bug is probably present in 1.7 but it seems to have no effect. So for now, there is no need to backport it.	2017-11-09 15:16:05 +01:00
Willy Tarreau	1b4cf9b754	BUG/MINOR: h1: the HTTP/1 make status code parser check for digits The H1 parser used by the H2 gateway was a bit lax and could validate non-numbers in the status code. Since it computes the code on the fly it's problematic, as "30:" is read as status code 310. Let's properly check that it's a number now. No backport needed.	2017-11-09 11:15:45 +01:00
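The "30:" example above follows directly from computing the code on the fly: ':' comes right after '9' in ASCII, so ':' - '0' equals 10 and the total becomes 3*100 + 0*10 + 10 = 310. The sketch below contrasts an unchecked computation with a checked one. Both functions are hypothetical illustrations of the problem and of the kind of check the fix adds, not the actual parser code.

```c
/* Lax version: accumulates three characters without validating them.
 * The caller must provide at least three readable characters.
 */
static int status_lax_sketch(const char *s)
{
	int code = 0;
	int i;

	for (i = 0; i < 3; i++)
		code = code * 10 + (s[i] - '0');
	return code;
}

/* Strict version: returns -1 as soon as a non-digit is met, which
 * also stops safely on a string shorter than three characters.
 */
static int status_strict_sketch(const char *s)
{
	int code = 0;
	int i;

	for (i = 0; i < 3; i++) {
		if (s[i] < '0' || s[i] > '9')
			return -1;
		code = code * 10 + (s[i] - '0');
	}
	return code;
}
```

Feeding "30:" to the lax version yields 310, while the strict version rejects it and still accepts a well-formed code such as "304".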
Olivier Houchard	522eea7110	MINOR: ssl: Handle sending early data to server. This adds a new keyword on the "server" line, "allow-0rtt", if set, we'll try to send early data to the server, as long as the client sent early data, as in case the server rejects the early data, we no longer have them, and can't resend them, so the only option we have is to send back a 425, and we need to be sure the client knows how to interpret it correctly.	2017-11-08 14:11:10 +01:00
Emeric Brun	d8b3b65faa	BUG/MEDIUM: splice/threads: pipe reuse list was not protected. The list is now protected using a global spinlock.	2017-11-07 14:47:28 +01:00
Christopher Faulet	2a944ee16b	BUILD: threads: Rename SPIN/RWLOCK macros using HA_ prefix This remove any name conflicts, especially on Solaris.	2017-11-07 11:10:24 +01:00
Olivier Houchard	55dcdf4c39	BUG/MINOR: dns: Don't try to get the server lock if it's already held. dns_link_resolution() can be called with the server lock already held, so don't attempt to lock it again in that case.	2017-11-06 18:34:24 +01:00
Willy Tarreau	88ac59be4d	MINOR: threads: use faster locks for the spin locks The spin locks used to rely on W locks, which involve a loop waiting for readers to leave, and this doesn't happen here. It's more efficient to use S locks instead, which are also mutually exclusive and do not have this loop. This saves one test per spinlock and a few tens of bytes allowing certain functions to be inlined.	2017-11-06 11:20:11 +01:00
Willy Tarreau	8d38805d3d	MAJOR: task: make use of the scope-aware ebtree functions Currently the task scheduler suffers from an O(n) lookup when skipping tasks that are not for the current thread. The reason is that eb32_lookup_ge() has no information about the current thread so it always revisits many tasks for other threads before finding its own tasks. This is particularly visible with HTTP/2 since the number of concurrent streams created at once causes long series of tasks for the same stream in the scheduler. With only 10 connections and 100 streams each, by running on two threads, the performance drops from 640kreq/s to 11.2kreq/s! Lookup metrics show that for only 200000 task lookups, 430 million skips had to be performed, which means that on average, each lookup leads to 2150 nodes to be visited. This commit backports the principle of scope lookups for ebtrees from the ebtree_v7 development tree. The idea is that each node contains a mask indicating the union of the scopes for the nodes below it, which is fed during insertion, and used during lookups. Then during lookups, branches that do not contain any leaf matching the requested scope are simply ignored. This perfectly matches a thread mask, allowing a thread to only extract the tasks it cares about from the run queue, and to always find them in O(log(n)) instead of O(n). Thus the scheduler uses tid_bit and task->thread_mask as the ebtree scope here. Doing this has recovered most of the performance, as can be seen on the test below with two threads, 10 connections, 100 streams each, and 1 million requests total : Before After Gain test duration : 89.6s 4.73s x19 HTTP requests/s (DEBUG) : 11200 211300 x19 HTTP requests/s (PROD) : 15900 447000 x28 spin_lock time : 85.2s 0.46s /185 time per lookup : 13us 40ns /325 Even when going to 6 threads (on 3 hyperthreaded CPU cores), the performance stays around 284000 req/s, showing that the contention is much lower. 
A test showed that there's no benefit in using this for the wait queue though.	2017-11-06 11:20:11 +01:00
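The scope principle described in this commit can be illustrated on a much simpler structure. The sketch below uses a plain unbalanced binary search tree, not ebtree, and all names (`struct node`, `insert`, `lookup_ge`) are hypothetical. Each node carries the union of the scope masks found in its subtree, fed during insertion, and a lookup skips any branch whose mask does not intersect the requested scope.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical node: <scope> is this node's own mask (e.g. a thread mask),
 * <subtree_scope> is the union of the masks of this node and all nodes
 * below it.
 */
struct node {
    struct node *left, *right;
    uint32_t key;
    unsigned long scope;
    unsigned long subtree_scope;
};

/* Insert <n>, merging its scope into every node visited on the way down.
 * Duplicate keys go to the right.
 */
static void insert(struct node **root, struct node *n)
{
    n->left = n->right = NULL;
    n->subtree_scope = n->scope;

    while (*root) {
        (*root)->subtree_scope |= n->scope;
        root = (n->key < (*root)->key) ? &(*root)->left : &(*root)->right;
    }
    *root = n;
}

/* Return the node with the smallest key >= <key> whose scope intersects
 * <scope>, or NULL. Branches with no matching leaf are ignored entirely.
 */
static struct node *lookup_ge(struct node *n, uint32_t key, unsigned long scope)
{
    struct node *r;

    if (!n || !(n->subtree_scope & scope))
        return NULL;                    /* nothing for this scope below */

    if (n->key >= key) {
        r = lookup_ge(n->left, key, scope);
        if (r)
            return r;                   /* a smaller matching key exists */
        if (n->scope & scope)
            return n;
    }
    return lookup_ge(n->right, key, scope);
}
```

In the scheduler's terms, the requested scope would be the calling thread's bit and each node's scope the task's thread mask, so a thread never walks through tasks that belong only to other threads.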
Willy Tarreau	62a124977b	MINOR: applets: no need to check for runqueue's emptiness in appctx_res_wakeup() The __appctx_wakeup() function already does it. It matters with threads enabled because it simplifies the code in appctx_res_wakeup() to get rid of this test.	2017-11-05 12:01:11 +01:00
Willy Tarreau	bbd09b9306	BUG/MAJOR: thread/listeners: enable_listener must not call unbind_listener() unbind_listener() takes the listener lock, which is already held by enable_listener(). This situation happens when starting with nbproc > 1 with some bind lines limited to a certain process, because in this case enable_listener() tries to stop unneeded listeners. This commit introduces __do_unbind_listeners() which must be called with the lock held, and makes enable_listener() use this one. Given that the only return code has never been used and that it starts to make the code more complicated to propagate it before throwing it to the trash, the function's return type was changed to void.	2017-11-05 11:38:44 +01:00
David Carlier	5222d8eb25	BUG/MINOR: stdarg.h inclusion Needed for the memvprintf part, the va_list type. Spotted during OpenBSD build.	2017-11-03 15:04:09 +01:00
Willy Tarreau	4b75fffa2b	BUG/MAJOR: buffers: fix get_buffer_nc() for data at end of buffer This function incorrectly dealt with the case where data doesn't wrap but lies at the end of the buffer, resulting in Lukas' reported data corruption with HTTP/2. No backport is needed, it was introduced for HTTP/2 in 1.8-dev.	2017-11-02 17:16:07 +01:00
Willy Tarreau	7c2a2ad65c	BUG/MINOR: thread: fix a typo in the debug code __spin_unlock() used to call RWLOCK_WRUNLOCK() to unlock in the debug code. It's harmless as they happen to be identical.	2017-11-02 16:26:02 +01:00
William Lallemand	77c1197bfb	MEDIUM: cache: deliver objects from cache Lookup objects in the cache and deliver them using the http-request action "cache-use".	2017-10-31 21:17:19 +01:00
William Lallemand	41db46035e	MEDIUM: cache: configuration parsing and initialization Parse a configuration section "cache" and a http-{response,request} actions. Example: listen frt mode http http-response cache-store foobar http-request cache-use foobar cache foobar total-max-size 4 # size in megabytes	2017-10-31 21:17:19 +01:00
Willy Tarreau	ffca736401	MINOR: h2: centralize all HTTP/2 protocol elements and constants These constants from RFC7540 will be centralized into common/h2.h for use by the future h2 mux and other places.	2017-10-31 18:03:24 +01:00
Willy Tarreau	1be4f3d8af	MEDIUM: hpack: implement basic hpack encoding For now it only supports literals and a bit of static header table references for the 9 most common header field names (date, server, content-type, content-length, last-modified, accept-ranges, etag, cache-control, location). A previous incarnation of this commit used to strip the forbidden H2 header names (connection, proxy-connection, upgrade, transfer-encoding, keep-alive) but this is no longer the case as this filtering is irrelevant to HPACK encoding and is specific to H2, so this will have to be done by the caller. It's quite not optimal but works fine enough to prepare some valid and partially compressed responses during development.	2017-10-31 18:03:24 +01:00
Willy Tarreau	679790baae	MINOR: hpack: implement the decoder The decoder is now fully functional. It makes use of the dynamic header table. Dynamic header table size updates are currently ignored, as our initially advertised value is the highest we support. Strictly speaking, the impact is that a client referencing a header field after such an update wouldn't observe an error instead of the connection being dropped if it was implemented. Decoded header fields are copied into a target buffer in HTTP/1 format using HTTP/1.1 as the version. The Host header field is automatically appended if a ":authority" header field is present. All decoded header fields can be displayed if the file is compiled with DEBUG_HPACK.	2017-10-31 18:03:24 +01:00
Willy Tarreau	ce04094c4a	MINOR: hpack: implement the header tables management This code deals with header insertion, retrieval and eviction, as well as with dynamic header table defragmentation. It is functional for use as a decoder and was heavily tested in this context. There's still some room for optimization (eg: the defragmentation code currently does it in place using a memcpy). Also for now the dynamic header table is allocated using malloc() while a pool needs to be created instead. This code was mostly imported from https://github.com/wtarreau/http2-exp with "hpack_" prepended in front of most names to avoid risks of conflicts. Some small cleanups and renamings were applied during the import. This version must be considered more recent. Some HPACK error codes were placed here (HPACK_ERR_*), not exactly because they're needed by the decoder but they'll be needed by all callers. Maybe a different location should be found.	2017-10-31 18:03:24 +01:00
Willy Tarreau	a004ade512	MINOR: hpack: implement the HPACK Huffman table decoder The code was borrowed from the HPACK experimental implementations available here : https://github.com/wtarreau/http2-exp It contains the Huffman table as specified in RFC7541 Appendix B, and a set of reverse tables used to decode a Huffman byte stream, and produced by contrib/h2/gen-rht. The encoder is not finalized, it doesn't emit the byte stream but this is not needed for now.	2017-10-31 18:03:24 +01:00
Willy Tarreau	436d333124	MEDIUM: connection: add a destroy callback This callback will be used to release upper layers when a mux is in use. Given that the mux can be asynchronously deleted, we need a way to release the extra information such as the session. This callback will be called directly by the mux upon releasing everything and before the connection itself is released, so that the callee can find its information inside the connection if needed. The way it currently works is not perfect, and most likely this should instead become a mux release callback, but for now we have no easy way to add mux-specific stuff, and since there's one mux per connection, it works fine this way.	2017-10-31 18:03:24 +01:00
Willy Tarreau	2c52a2b9ee	MEDIUM: connection: make mux->detach() release the connection For H2, only the mux's timeout or other conditions might cause a release of the mux and the connection, no stream should be allowed to kill such a shared connection. So a stream will only detach using cs_destroy() which will call mux->detach() then free the cs. For now it's only handled by mux_pt. The goal is that the data layer never has to care about the connection, which will have to be released depending on the mux's mood.	2017-10-31 18:03:24 +01:00
Willy Tarreau	6978db35e9	MINOR: connection: add cs_close() to close a conn_stream This basically calls cs_shutw() followed by cs_shutr(). Both of them are called in the most conservative mode so that any previous call is still respected. The CS flags are cleared so that it can be reused (this is important for connection retries when conn and CS are reused without being reallocated).	2017-10-31 18:03:24 +01:00
Willy Tarreau	ecdb3fe9f4	MINOR: conn_stream: modify cs_shut{r,w} API to pass the desired mode Now we can specify how we want to shutdown (drain vs reset, and normal vs silent), and this propagates to the mux then the transport layer.	2017-10-31 18:03:23 +01:00
Willy Tarreau	79dadb5335	MINOR: conn_stream: new shutr/w status flags In order to support all shutdown modes on the CS, we introduce the following flags : CS_FL_SHRD : shut read, drain extra data CS_FL_SHRR : shut read, reset extra data CS_FL_SHWN : shut write, normal notification CS_FL_SHWS : shut write, silent mode (no notification) And the following modes for shutr/shutw : CS_SHR_DRAIN, CS_SHR_RESET, CS_SHW_NORMAL, CS_SHW_SILENT. Note: it's possible that we won't need to distinguish the two shutw above as they're only an action. For now they are not used.	2017-10-31 18:03:23 +01:00
Olivier Houchard	9aaf778129	MAJOR: connection : Split struct connection into struct connection and struct conn_stream. All the references to connections in the data path from streams and stream_interfaces were changed to use conn_streams. Most functions named "something_conn" were renamed to "something_cs" for this. Sometimes the connection still is what matters (eg during a connection establishment) and were not always renamed. The change is significant and minimal at the same time, and was quite thoroughly tested now. As of this patch, all accesses to the connection from upper layers go through the pass-through mux.	2017-10-31 18:03:23 +01:00
Willy Tarreau	63dd75d934	MINOR: connection: introduce the conn_stream manipulation functions Most of the functions dealing with conn_streams are here. They act at the data layer and interact with the mux. For now they are not used yet but everything builds.	2017-10-31 18:03:23 +01:00
Olivier Houchard	8e6147292e	MINOR: mux: add more methods to mux_ops We'll need to support reading/writing from both sides, with buffers and pipes, as well as retrieving/updating flags.	2017-10-31 18:03:23 +01:00
Olivier Houchard	e2b40b9eab	MINOR: connection: introduce conn_stream This patch introduces a new struct conn_stream. It's the stream-side of a multiplexed connection. A pool is created and destroyed on exit. For now the conn_streams are not used at all.	2017-10-31 18:03:23 +01:00
Willy Tarreau	2e0b2b5f83	MEDIUM: session: use the ALPN token and proxy mode to select the mux When an incoming connection is made on an HTTP mode frontend, the session now looks up the mux to use based on the ALPN token and the proxy mode. This will allow easier mux registration, and we don't need to hard-code the mux_pt_ops anymore.	2017-10-31 18:03:23 +01:00
Willy Tarreau	2386be64ba	MINOR: connection: implement alpn registration of muxes Selecting a mux based on ALPN and the proxy mode will quickly become a pain. This commit provides new functions to register/lookup a mux based on the ALPN string and the proxy mode to make this easier. Given that we're not supposed to support a wide range of muxes, the lookup should not have any measurable performance impact.	2017-10-31 18:03:23 +01:00
Willy Tarreau	53a4766e40	MEDIUM: connection: start to introduce a mux layer between xprt and data For HTTP/2 and QUIC, we'll need to deal with multiplexed streams inside a connection. After quite a long brainstorming, it appears that the connection interface to the existing streams is appropriate just like the connection interface to the lower layers. In fact we need to have the mux layer in the middle of the connection, between the transport and the data layer. A mux can exist on two directions/sides. On the inbound direction, it instantiates new streams from incoming connections, while on the outbound direction it muxes streams into outgoing connections. The difference is visible on the mux->init() call : in one case, an upper context is already known (outgoing connection), and in the other case, the upper context is not yet known (incoming connection) and will have to be allocated by the mux. The session doesn't have to create the new streams anymore, as this is performed by the mux itself. This patch introduces this and creates a pass-through mux called "mux_pt" which is used for all new connections and which only calls the data layer's recv,send,wake() calls. One incoming stream is immediately created when init() is called on the inbound direction. There should not be any visible impact. Note that the connection's mux is purposely not set until the session is completed so that we don't accidentally run with the wrong mux. This must not cause any issue as the xprt_done_cb function is always called prior to using mux's recv/send functions.	2017-10-31 18:03:23 +01:00
Willy Tarreau	b29dc95a97	MINOR: threads: add a portable barrier for threads and non-threads HA_BARRIER() is just a simple memory barrier to prevent the compiler from reordering our code.	2017-10-31 18:01:18 +01:00
Willy Tarreau	2510f702f9	MINOR: h1: add a function to measure the trailers length This is needed in the H2->H1 gateway so that we know how long the trailers block is in chunked encoding. It returns the number of bytes, or 0 if some are missing, or -1 in case of parse error.	2017-10-31 17:18:10 +01:00
Willy Tarreau	f65610a83d	CLEANUP: threads: rename process_mask to thread_mask It was a leftover from the last cleaning session; this mask applies to threads and calling it process_mask is a bit confusing. It's the same in fd, task and applets.	2017-10-31 16:06:06 +01:00
Olivier Houchard	d16bfe6c01	BUG/MINOR: dns: Fix SRV records with the new thread code. srv_set_fqdn() may be called with the DNS lock already held, but tries to lock it anyway. So, add a new parameter to let it know if it was already locked or not;	2017-10-31 15:47:55 +01:00
Willy Tarreau	a5e0590b80	BUILD: stick-tables: silence an uninitialized variable warning Commit `819fc6f` ("MEDIUM: threads/stick-tables: handle multithreads on stick tables") introduced a valid warning about an uninitialized return value in stksess_kill_if_expired(). It just happens that this result is never used, so let's turn the function back to void as previously.	2017-10-31 15:45:42 +01:00
Emeric Brun	6e0128630b	BUG/MAJOR: threads/freq_ctr: fix lock on freq counters. The wrong bit was set to keep the lock on freq counter update. And the read functions were re-worked to use volatile. Moreover, when a freq counter is updated, it is now rotated only if the current counter is in the past (now.tv_sec > ctr->curr_sec). It is important with threads because the current time (now) is thread-local. So, rounded to the second, the time may vary by plus or minus 1 second. So a freq counter rotated by one thread may be seen 1 second in the future by another. In this case, it is updated but not rotated.	2017-10-31 13:58:33 +01:00
Christopher Faulet	cd7879adc2	BUG/MEDIUM: threads: Run the poll loop on the main thread too There was a flaw in the way the threads were created. The main one was just used to create all the others and then waited to exit. Now, it is used to run a poll loop. So we only create nbthread-1 threads. This also fixes a bug about the compression filter when there is only 1 thread (nbthread == 1 or no threads support). The bug was in the way thread-local resources were initialized. Per-thread init/deinit callbacks were never called for the main process. So, with nbthread set to 1, some buffers remained uninitialized.	2017-10-31 13:58:33 +01:00
Emeric Brun	9f0b458525	MEDIUM: threads/server: Use the server lock to protect health check and cli concurrency	2017-10-31 13:58:33 +01:00
Christopher Faulet	c2a89a6aed	MINOR: threads/mailers: Add a lock to protect queues of email alerts	2017-10-31 13:58:33 +01:00
Christopher Faulet	cfda847643	MINOR: threads/checks: Add a lock to protect the pid list used by external checks	2017-10-31 13:58:33 +01:00
Christopher Faulet	6251902e67	MINOR: threads: Add thread-map config parameter in the global section By default, no affinity is set for threads. To bind threads to CPUs, you must define a "thread-map" in the global section. The format is the same as for the "cpu-map" parameter, with a small difference. The process number must be defined, with the same format as cpu-map ("all", "even", "odd" or a number between 1 and 31/63). A thread will be bound on the intersection of its mapping and the one of the process on which it is attached. If the intersection is null, no specific bind will be set for the thread.	2017-10-31 13:58:33 +01:00
Christopher Faulet	b2812a6240	MEDIUM: thread/dns: Make DNS thread-safe	2017-10-31 13:58:33 +01:00
Christopher Faulet	24289f2e07	MEDIUM: thread/spoe: Make the SPOE thread-safe Because there is no migration mechanism yet, all runtime information about an SPOE agent is thread-local and async exchanges with agents are disabled when we have several threads. However, pipelining is still available. So for now, the thread part of the SPOE is pretty simple.	2017-10-31 13:58:33 +01:00
Thierry FOURNIER	738a6d76f6	MEDIUM: threads/tasks: Add lock around notifications This patch adds locks around some notification calls.	2017-10-31 13:58:32 +01:00
Thierry FOURNIER	952939d294	MEDIUM: threads/xref: Convert xref function to a thread safe model Ensure that the unlink is done safely between threads and that the peer struct will not be destroyed while the peer is in use.	2017-10-31 13:58:32 +01:00
Thierry FOURNIER	94a6bfce9b	MEDIUM: threads/lua: Cannot acces to the socket if we try to access from another thread. We have two ways of ensuring that the data is not concurrently manipulated: locks, or running the task on the same thread. Locks are expensive, so it is better to avoid them. This patch checks that the Lua task runs on the same thread as the stream associated with the coprocess. TODO: in a future version, the error should be replaced by a yield and a thread migration request.	2017-10-31 13:58:32 +01:00
Thierry FOURNIER	61ba0e2b6d	MEDIUM: threads/lua: Add locks around the Lua execution parts. Note that Lua processing is not really thread-safe. Lua provides a heavyweight mechanism which consists in adding our own lock functions to the Lua code and recompiling the library. This mechanism will probably not be accepted by the maintainers of the various distributions. Our main execution point for Lua is the function lua_resume(). A quick look at the Lua sources shows a lua_lock() at the start of the function and a lua_unlock() at the end. So I conclude that the Lua thread-safe mode just places a mutex around all execution. I therefore prefer to do this in the HAProxy code, which will be easier for distro maintainers. Note that the HAProxy Lua functions surrounded by the macros SET_SAFE_LJMP and RESET_SAFE_LJMP manipulate the Lua stack, so care must be taken to set the mutex around these functions.	2017-10-31 13:58:32 +01:00
Christopher Faulet	8ca3b4bc46	MEDIUM: threads/compression: Make HTTP compression thread-safe	2017-10-31 13:58:32 +01:00
Christopher Faulet	71a6a8efaa	MEDIUM: threads/filters: Add init/deinit callback per thread Now, it is possible to define init_per_thread and deinit_per_thread callbacks to deal with resource allocation for each thread. It is the filter's responsibility to deal with concurrency. It is also the filter's responsibility to know if HAProxy is started with some threads. A good way to do so is to check the "global.nbthread" value. If it is greater than 1, then _per_thread callbacks will be called.	2017-10-31 13:58:32 +01:00
Christopher Faulet	e95f2c3ef5	MEDIUM: thread/vars: Make vars thread-safe A RW lock has been added to the vars structure to protect each list of variables. And a global RW lock is used to protect registered names. When a variable is fetched, we duplicate sample data because the variable could be modified by another thread.	2017-10-31 13:58:32 +01:00
Christopher Faulet	94b712337d	MEDIUM: threads/freq_ctr: Make the frequency counters thread-safe When a frequency counter must be updated, we use the curr_sec/curr_tick fields as a lock, by setting the MSB to 1 in a compare-and-swap to lock and by resetting it to unlock. And when we need to read it, we loop until the counter is unlocked. This way, the frequency counters are thread-safe without any external lock. This is important to avoid increasing the size of many structures (global, proxy, server, stick_table).	2017-10-31 13:58:32 +01:00
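The locking scheme described here, using the most significant bit of the seconds field as an embedded lock, might look like the following sketch. It uses C11 `<stdatomic.h>` rather than HAProxy's own atomic macros, and `struct ctr`, `ctr_update`, `ctr_read` and `CTR_LOCK_BIT` are hypothetical names. The rotation rule (rotate only when the caller's time is strictly ahead of the stored second) is borrowed from the later fix 6e0128630b listed above.

```c
#include <stdatomic.h>
#include <stdint.h>

#define CTR_LOCK_BIT 0x80000000u    /* MSB of the seconds field = lock */

/* Hypothetical, simplified counter; not HAProxy's struct freq_ctr. */
struct ctr {
    _Atomic uint32_t curr_sec;      /* current second, MSB set while locked */
    _Atomic uint32_t curr_ctr;      /* events counted during curr_sec */
    _Atomic uint32_t prev_ctr;      /* events counted during the second before */
};

static void ctr_update(struct ctr *c, uint32_t now_sec, uint32_t inc)
{
    uint32_t sec;

    /* Lock: compare-and-swap the MSB from 0 to 1. The expected value always
     * has the MSB cleared, so the swap fails while another thread holds it.
     */
    for (;;) {
        sec = atomic_load(&c->curr_sec) & ~CTR_LOCK_BIT;
        if (atomic_compare_exchange_weak(&c->curr_sec, &sec, sec | CTR_LOCK_BIT))
            break;
    }

    /* Rotate only if the stored second is in the past for this thread. */
    if (now_sec > sec) {
        uint32_t last = (now_sec - sec == 1) ? atomic_load(&c->curr_ctr) : 0;

        atomic_store(&c->prev_ctr, last);
        atomic_store(&c->curr_ctr, 0);
        sec = now_sec;
    }
    atomic_fetch_add(&c->curr_ctr, inc);

    /* Unlock: store the second back with the MSB cleared. */
    atomic_store(&c->curr_sec, sec);
}

static uint32_t ctr_read(struct ctr *c)
{
    /* Loop until the counter is unlocked, then read it. */
    while (atomic_load(&c->curr_sec) & CTR_LOCK_BIT)
        ;
    return atomic_load(&c->curr_ctr);
}
```

No separate lock object is needed, so the structure keeps its original size, which is the motivation given in the commit message.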
Emeric Brun	b5997f740b	MAJOR: threads/map: Make acls/maps thread safe Locks have been added in pat_ref and pattern_expr structures to protect all accesses to an instance of one of them. Moreover, a global lock has been added to protect the LRU cache used for pattern matching. Patterns are now duplicated after a successful match, to avoid modification by other threads when the result is used. Finally, the function reloading a pattern list has been modified to be thread-safe.	2017-10-31 13:58:32 +01:00
Emeric Brun	821bb9beaa	MAJOR: threads/ssl: Make SSL part thread-safe First, OpenSSL is now initialized to be thread-safe. This is done by setting 2 callbacks. The first one is ssl_locking_function. It handles the locks and unlocks. The second one is ssl_id_function. It returns the current thread id. During the init step, we create as many R/W locks as needed, i.e. the number returned by the CRYPTO_num_locks function. Next, the reusable SSL session in the server context is now thread-local. Shctx is now also initialized if HAProxy is started with several threads. And finally, a global lock has been added to protect the LRU cache used to store generated certificates. The function ssl_sock_get_generated_cert is now deprecated because the retrieved certificate can be removed by another thread at the same time. Instead, a new function has been added, ssl_sock_assign_generated_cert. It must be used to search for a certificate in the cache and set it immediately if found.	2017-10-31 13:58:32 +01:00
Emeric Brun	6b35e9bfbf	MEDIUM: threads/stream: Make streams list thread safe Adds a global lock to protect the full streams list used to dump sessions on stats socket.	2017-10-31 13:58:32 +01:00
Emeric Brun	a1dd243adb	MAJOR: threads/buffer: Make buffer wait queue thread safe Adds a global lock to protect the buffer wait queue.	2017-10-31 13:58:31 +01:00
Emeric Brun	80527f5bb6	MAJOR: threads/peers: Make peers thread safe A lock is used to protect accesses to a peer structure. The lock is taken in the applet handler when the peer is identified and released when leaving the applet handler. In the scheduling task for the peers section, the lock is taken for every listed peer and released at the end of the process task function. The peer 'force shutdown' function was also re-worked.	2017-10-31 13:58:31 +01:00
Emeric Brun	1138fd0c57	MAJOR: threads/applet: Handle multithreading for applets A global lock has been added to protect accesses to the list of active applets. A process mask has also been added on each applet. Like for FDs and tasks, it is used to know which threads are allowed to process an applet. Because applets are, most of time, linked to a session, it should be sticky on the same thread. But in all cases, it is the responsibility of the applet handler to lock what have to be protected in the applet context.	2017-10-31 13:58:31 +01:00
Emeric Brun	272e252e61	MINOR: threads/regex: Change Regex trash buffer into a thread local variable	2017-10-31 13:58:31 +01:00
Emeric Brun	8c1aaa201a	MEDIUM: threads/http: Make http_capture_bad_message thread-safe This is done by passing the right stream's proxy (the frontend or the backend, depending on the context) to lock the error snapshot used to store the error info.	2017-10-31 13:58:31 +01:00
Emeric Brun	819fc6f563	MEDIUM: threads/stick-tables: handle multithreads on stick tables The stick table API was slightly reworked: A global spin lock on stick table was added to perform lookup and insert in a thread safe way. The handling of refcount on entries is now handled directly by stick tables functions under protection of this lock and was removed from the code of callers. The "stktable_store" function is no more externalized and users should now use "stktable_set_entry" in any case of insertion. This last one performs a lookup followed by a store if not found. So the code using "stktable_store" was re-worked. Lookup, and set_entry functions automatically increase the refcount of the returned/stored entry. The function "sticktable_touch" was renamed "sticktable_touch_local" and is now able to decrease the refcount if last arg is set to true. It is allowing to release the entry without taking the lock twice. A new function "sticktable_touch_remote" is now used to insert entries coming from remote peers at the right place in the update tree. The code of peer update was re-worked to use this new function. This function is also able to decrease the refcount if wanted. The function "stksess_kill" also handle a parameter to decrease the refcount on the entry. A read/write lock is added on each entry to protect the data content updates of the entry.	2017-10-31 13:58:31 +01:00
Christopher Faulet	5b51755aef	MEDIUM: threads/lb: Make LB algorithms (lb_*.c) thread-safe A lock for LB parameters has been added inside the proxy structure and atomic operations have been used to update server variables related to lb. The only significant change is about lb_map. Because the servers' statuses are updated in the sync-point, we can call the recalc_server_map function synchronously in the map_set_server_status_up/down functions.	2017-10-31 13:58:31 +01:00
Christopher Faulet	5d42e099c5	MINOR: threads/server: Add a lock to deal with insert in updates_servers list This list is used to save changes on the servers' state. So when several threads are used, it must be locked. The changes are then applied in the sync-point. To do so, servers_update_status has been moved into the sync-point. So it is useless to lock it at this step because the sync-point is a protected area by itself.	2017-10-31 13:58:31 +01:00
Christopher Faulet	29f77e846b	MEDIUM: threads/server: Add a lock per server and atomically update server vars The server's lock is used, among other things, to lock access to the active connection list of a server.	2017-10-31 13:58:31 +01:00
Christopher Faulet	40a007cf2a	MEDIUM: threads/server: Make connection list (priv/idle/safe) thread-safe For now, we have a list of each type per thread. So there is no need to lock them. This is the easiest solution for now, but not the best one because there is no sharing between threads. An idle connection on a thread will not be able be used by a stream on another thread. So it could be a good idea to rework this patch later.	2017-10-31 13:58:30 +01:00
Christopher Faulet	ff8abcd31d	MEDIUM: threads/proxy: Add a lock per proxy and atomically update proxy vars Now, each proxy contains a lock that must be used when necessary to protect it. Moreover, all proxy's counters are now updated using atomic operations.	2017-10-31 13:58:30 +01:00
Christopher Faulet	8d8aa0d681	MEDIUM: threads/listeners: Make listeners thread-safe First, we use atomic operations to update jobs/totalconn/actconn variables, listener's nbconn variable and listener's counters. Then we add a lock on listeners to protect access to their information. And finally, listener queues (global and per proxy) are also protected by a lock. Here, because accesses to these queues are unusual, we use the same lock for all queues instead of a global one for the global queue and a lock per proxy for others.	2017-10-31 13:58:30 +01:00
Christopher Faulet	b79a94c9f3	MEDIUM: threads/signal: Add a lock to make signals thread-safe A global lock has been added to protect the signal processing. So when a signal is triggered, only one thread will catch it.	2017-10-31 13:58:30 +01:00
Emeric Brun	c60def8368	MAJOR: threads/task: handle multithread on task scheduler 2 global locks have been added to protect, respectively, the run queue and the wait queue. And a process mask has been added on each task. Like for FDs, this mask is used to know which threads are allowed to process a task. For many tasks, all threads are granted. This should be your first intention when you create a new task, unless you have a good reason to make a task sticky on some threads. It is then the responsibility of the process callback to lock what has to be locked in the task context. Nevertheless, all tasks linked to a session must be sticky on the thread creating the session. It is important that I/O handlers processing session FDs and these tasks run on the same thread to avoid conflicts.	2017-10-31 13:58:30 +01:00
Christopher Faulet	36716a7fec	MEDIUM: threads/fd: Initialize the process mask during the call to fd_insert Listeners will allow any threads to process the corresponding fd. But for other FDs, we limit the processing to the current thread.	2017-10-31 13:58:30 +01:00
Christopher Faulet	a7c5d43085	MINOR: threads/fd: Add a mask of threads allowed to process on each fd in fdtab array	2017-10-31 13:58:30 +01:00
Christopher Faulet	d4604adeaa	MAJOR: threads/fd: Make fd stuffs thread-safe Many changes have been made to do so. First, the fd_updt array, where all pending FDs for polling are stored, is now a thread-local array. Then 3 locks have been added to protect, respectively, the fdtab array, the fd_cache array and poll information. In addition, a lock for each entry in the fdtab array has been added to protect all accesses to a specific FD or its information. For pollers, the way to manage concurrency depends on the poller. There is a poller loop on each thread. So the set of monitored FDs may need to be protected. epoll and kqueue are thread-safe per se, so there are few things to do to protect these pollers. This is not possible with select and poll, so there is no sharing between the threads. The poller on each thread is independent from the others. Finally, per-thread init/deinit functions are used for each poller and for the FD part to manage thread-local resources. Now, you must be careful when an FD is created during the HAProxy startup. All updates to the FD state must be made in the threads' context and never before their creation. This is mandatory because the fd_updt array is thread-local and initialized only for threads. Because there is no poller for the main one, this array remains uninitialized in this context. For this reason, listeners are now enabled in the run_thread_poll_loop function, just like the worker pipe.	2017-10-31 13:58:30 +01:00
Christopher Faulet	b349e48ede	MEDIUM: threads/pool: Make pool thread-safe by locking all access to a pool A lock has been added for each memory pool. It is used to protect the pool during allocations and releases. It is also used when pool info are dumped.	2017-10-31 13:58:30 +01:00
Christopher Faulet	f8188c69fa	MEDIUM: threads/logs: Make logs thread-safe log buffers and static variables used in log functions are now thread-local. So there is no need to lock anything to log messages. Moreover, per-thread init/deinit functions are now used to initialize these buffers.	2017-10-31 13:58:30 +01:00
Christopher Faulet	9a65571781	MEDIUM: threads/time: Many global variables from time.h are now thread-local	2017-10-31 13:58:30 +01:00
Christopher Faulet	6adad11283	MEDIUM: threads/chunks: Transform trash chunks in thread-local variables So, per-thread init/deinit functions are registered to allocate/release them.	2017-10-31 13:58:30 +01:00
Christopher Faulet	339fff8a18	MEDIUM: threads: Adds a set of functions to handle sync-point A sync-point is a protected area where you have the guarantee that no concurrent access is possible. It is implemented as a thread barrier to enter in the sync-point and another one to exit from it. Inside the sync-point, all threads that must do some synchronous processing will be called one after the other while all other threads will wait. All threads will then exit from the sync-point at the same time. A sync-point will be evaluated only when necessary because it is a costly operation. To limit the waiting time of each thread, we must have a mechanism to wake up all threads. This is done with a pipe shared by all threads. By writing in this pipe, we will interrupt all threads blocked on a poller. The pipe is then flushed before exiting from the sync-point.	2017-10-31 13:58:29 +01:00
Christopher Faulet	be0faa2e47	MINOR: threads: Add nbthread parameter It is only parsed and initialized for now. It will be used later. This parameter is only available when support for threads was built in.	2017-10-31 13:58:29 +01:00
Christopher Faulet	415f611ff4	MINOR: threads: Add mechanism to register per-thread init/deinit functions hap_register_per_thread_init and hap_register_per_thread_deinit functions have been added to register functions to do, for each thread, respectively, some initialization and deinitialization. These functions are added in the global lists per_thread_init_list and per_thread_deinit_list. These functions are called only when HAProxy is started with more than 1 thread (global.nbthread > 1).	2017-10-31 13:58:29 +01:00
Christopher Faulet	1a2b56ea8e	MEDIUM: threads: Add hathreads header file This file contains all functions and macros used to deal with concurrency in HAProxy. It contains all high-level functions to do atomic operations (HA_ATOMIC_*). Note, for now, we rely on "__atomic" GCC builtins to do atomic operations. So HAProxy can be compiled with the thread support iff these builtins are available. It also contains wrappers around plocks to use spin or read/write locks. These wrappers are used to abstract the internal representation of the locking system and to add information to help debugging, when compiled with suitable options. To add extra info on locks, you need to add DEBUG=-DDEBUG_THREAD or DEBUG=-DDEBUG_FULL compilation option. In addition to timing info on locks, we keep info on where a lock was acquired the last time (function name, file and line). There are also the thread id and a flag to know if it is still locked or not. This will be useful to debug deadlocks.	2017-10-31 13:58:23 +01:00
Emeric Brun	7122ab31b1	MINOR: threads: Add atomic-ops and plock includes in import dir atomic-ops header contains some low-level functions to do atomic operations. These operations are used by the progressive locks (plock).	2017-10-31 11:36:13 +01:00
Christopher Faulet	e9bd686b68	MINOR: threads: Add THREAD_LOCAL macro When compiled with threads support, this macro is set to __thread. Else it is empty.	2017-10-31 11:36:13 +01:00
Christopher Faulet	93a518f02a	MINOR: standard: Add memvprintf function Now memprintf relies on memvprintf. This new function does exactly what memprintf did before, but it must be called with a va_list instead of a variable number of arguments. So there is no change for functions using memprintf. But it is now also possible to get the same functionality from any function with variadic arguments.	2017-10-31 11:36:12 +01:00
Christopher Faulet	0108bb3e40	MEDIUM: mailers: Init alerts during conf parsing and refactor their processing Email alerts rely on checks to send emails. The link between a mailers section and a proxy was resolved during the configuration parsing, but initialization was done when the first alert was triggered. This implied memory allocations and task creations. With this patch, everything is now initialized during the configuration parsing. So when an alert is triggered, only the memory required by this alert is dynamically allocated. Moreover, alerts processing had a flaw. The task handler used to process alerts to be sent to the same mailer, process_email_alert, was designed to give back the control to the scheduler when an alert was sent. So there was a delay between the sending of 2 consecutive alerts (the min of "proxy->timeout.connect" and "mailer->timeout.mail"). To fix this problem, now, we try to process as many queued alerts as possible when the task is woken up.	2017-10-31 11:36:12 +01:00
Christopher Faulet	67957bd59e	MAJOR: dns: Refactor the DNS code This is a huge patch with many changes, all about the DNS. Initially, the idea was to update the DNS part to ease the threads support integration. But quickly, I started to refactor some parts. And after several iterations, it was impossible for me to commit the different parts atomically. So, instead of adding tens of patches, often reworking the same parts, it was easier to merge all my changes into a single patch. Here are all changes made on the DNS. First, the DNS initialization has been refactored. The DNS configuration parsing remains untouched, in cfgparse.c. But all checks have been moved in a post-check callback. In the function dns_finalize_config, for each resolvers, the nameservers configuration is tested and the task used to manage DNS resolutions is created. The links between the backend's servers and the resolvers are also created at this step. Here no connections are kept alive. So there is no need anymore to reopen them after HAProxy fork. Connections used to send DNS queries will be opened on demand. Then, the way DNS requesters are linked to a DNS resolution has been reworked. The resolution used by a requester is now referenced into the dns_requester structure and the resolution pointers in server and dns_srvrq structures have been removed. wait and curr list of requesters, for a DNS resolution, have been replaced by a single list. And finally, the way a requester is removed from a DNS resolution has been simplified. Now everything is done in dns_unlink_resolution. srv_set_fqdn function has been simplified. Now, there is only 1 way to set the server's FQDN, regardless of whether it is done by the CLI or when a SRV record is resolved. The static DNS resolutions pool has been replaced by a dynamic pool. This part has been modified by Baptiste Assmann. The way the DNS resolutions are triggered by the task or by a health-check has been totally refactored. Now, all timeouts are respected. 
Especially hold.valid. The default frequency to wake up a resolvers is now configurable using "timeout resolve" parameter. Now, as documented, as long as invalid responses are received, we really wait for all name servers' responses before retrying. As far as possible, resources allocated during DNS configuration parsing are released when HAProxy is shut down. Besides all these changes, the code has been cleaned to ease code review and the doc has been updated.	2017-10-31 11:36:12 +01:00
Christopher Faulet	344c4ab6a9	MEDIUM: spoe/rules: Process "send-spoe-group" action The messages processing is done using existing functions. So here, the main task is to find the SPOE engine to use. To do so, we loop on all filter instances attached to the stream. For each, we check if it is a SPOE filter and, if yes, if its name is the one used to declare the "send-spoe-group" action. We also take care to return an error if the action processing is interrupted by HAProxy (because of a timeout or an error at the HAProxy level). This is done by checking if the flag ACT_FLAG_FINAL is set. The function spoe_send_group is the action_ptr callback ot	2017-10-31 11:36:12 +01:00
Christopher Faulet	c718b82dfe	MINOR: spoe: Add a type to qualify the message list during encoding Because we can have messages chained by event or by group, we need to have a way to know which kind of list we manipulate during the encoding. So 2 types of list have been added, SPOE_MSGS_BY_EVENT and SPOE_MSGS_BY_GROUP. And the right type is passed when spoe_encode_messages is called.	2017-10-31 11:36:12 +01:00
Christopher Faulet	76c09ef8de	MEDIUM: spoe/rules: Add "send-spoe-group" action for tcp/http rules This action is used to trigger sending of a group of SPOE messages. To do so, the SPOE engine used to send messages must be defined, as well as the SPOE group to send. Of course, the SPOE engine must refer to an existing SPOE filter. If no engine name is provided on the SPOE filter line, the SPOE agent name must be used. For example: http-request send-spoe-group my-engine some-group This action is available for "tcp-request content", "tcp-response content", "http-request" and "http-response" rulesets. It cannot be used for tcp connection/session rulesets because actions for these rulesets cannot yield. For now, the action keyword is parsed and checked. But it does nothing. Its processing will be added in another patch.	2017-10-31 11:36:12 +01:00
Christopher Faulet	11610f3b5a	MEDIUM: spoe: Parse new "spoe-group" section in SPOE config file For now, this section is only parsed. It should have the following format: spoe-group <grp-name> messages <msg-name> ... And then SPOE groups must be referenced in spoe-agent section: spoe-agent <name> ... groups <grp-name> ... The purpose of these groups is to trigger messages sending from TCP or HTTP rules, directly from HAProxy configuration, and not on specific event. This part will be added in another patch. It is important to note that a message belongs to at most one group.	2017-10-31 11:36:12 +01:00
Christopher Faulet	7ee8667c99	MINOR: spoe: Check uniqueness of SPOE engine names during config parsing The engine name is now kept in "spoe_config" structure. Because a SPOE filter can be declared without engine name, we use the SPOE agent name by default. Then, its uniqueness is checked against all other SPOE engines configured for the same proxy. * TODO: Add documentation	2017-10-31 11:36:12 +01:00
Christopher Faulet	57583e474e	MEDIUM: spoe: Add support of ACLS to enable or disable sending of SPOE messages Now, it is possible to conditionally send a SPOE message by adding an ACL-based condition on the "event" line, in a "spoe-message" section. Here is the example coming from the SPOE documentation: spoe-message get-ip-reputation args ip=src event on-client-session if ! { src -f /etc/haproxy/whitelist.lst } To avoid mixing with proxy's ACLs, each SPOE message has its private ACL list. It is possible to declare named ACLs in "spoe-message" section, using the same syntax as for proxies. So we can rewrite the previous example to use a named ACL: spoe-message get-ip-reputation args ip=src acl ip-whitelisted src -f /etc/haproxy/whitelist.lst event on-client-session if ! ip-whitelisted ACL-based conditions are executed in the context of the stream that handles the client and the server connections.	2017-10-31 11:36:12 +01:00
Christopher Faulet	1b421eab87	MINOR: acl: Pass the ACLs as an explicit parameter of build_acl_cond So it is possible to use ACLs other than those of proxies to build ACL conditions.	2017-10-31 11:36:12 +01:00
Christopher Faulet	78880fb196	MINOR: action: Add function to check rules using an action ACT_ACTION_TRK_* The function "check_trk_action" has been added to find and check the target table for rules using an action ACT_ACTION_TRK_*.	2017-10-31 11:36:12 +01:00
Christopher Faulet	6d950b92cd	MINOR: action: Add a function pointer in act_rule struct to check its validity It is possible to define the field "act_rule.check_ptr" if you want to check the validity of a tcp/http rule.	2017-10-31 11:36:12 +01:00
Christopher Faulet	4fce0d8447	MINOR: action: Use trk_idx instead of tcp/http_trk_idx So tcp_trk_idx and http_trk_idx have been removed.	2017-10-31 11:36:12 +01:00
Christopher Faulet	7421b14c22	MINOR: action: Add trk_idx inline function It returns tracking index corresponding to an action ACT_ACTION_TRK_SC*. It will replace http_trk_idx and tcp_trk_idx.	2017-10-31 11:36:12 +01:00
Willy Tarreau	d22e83abd9	MINOR: h1: store the status code in the H1 message It was painful not to have the status code available, especially when it was computed. Let's store it and ensure we don't claim content-length anymore on 1xx, only 0 body bytes.	2017-10-31 08:43:29 +01:00
William Lallemand	a3c77cfdd7	MINOR: shctx: rename lock functions Rename lock functions to shctx_lock() and shctx_unlock() to be coherent with the new API.	2017-10-31 03:49:44 +01:00
William Lallemand	4f45bb9c46	MEDIUM: shctx: separate ssl and shctx This patch reorganizes the shctx API into a generic storage API, separating the shared SSL session handling from its core. The shctx API only handles the generic data part, it does not know what kind of data you use with it. A shared_context is a storage structure allocated in a shared memory, allowing its usage in a multithread or a multiprocess context. The structure uses 2 linked lists, one containing the available blocks, and another for the hot locked blocks. At initialization the available list is filled with <maxblocks> blocks of size <blocksize>. An <extra> space is initialized outside the list in case you need some specific storage. +-----------------------+--------+--------+--------+--------+---- | struct shared_context | extra | block1 | block2 | block3 | ... +-----------------------+--------+--------+--------+--------+---- <-------- maxblocks ---------> * blocksize The API allows to store content on several linked blocks. For example, if you allocated blocks of 16 bytes, and you want to store an object of 60 bytes, the object will be allocated in a row of 4 blocks. The API was made for LRU usage, each time you get an object, it pushes the object at the end of the list. When it needs more space, it discards The function names have been renamed in a more logical way, the part regarding shctx has been prefixed by shctx_ and the functions for the shared ssl session cache have been prefixed by sh_ssl_sess_.	2017-10-31 03:49:40 +01:00
William Lallemand	ed0b5ad1aa	REORG: shctx: move ssl functions to ssl_sock.c Move the ssl callback functions of the ssl shared session cache to ssl_sock.c. The shctx functions still need to be separated from the ssl tree and data.	2017-10-31 03:48:39 +01:00
William Lallemand	3f85c9aec8	MEDIUM: shctx: allow the use of multiple shctx Add an shctx argument which permits to create new independent shctx area.	2017-10-31 03:44:11 +01:00
William Lallemand	24a7a75be6	REORG: shctx: move lock functions and struct Move lock functions to proto/shctx.h, and structures to types/shctx.h in order to simplify the split ssl/shctx.	2017-10-31 03:44:11 +01:00
William Lallemand	83215a44b8	MEDIUM: lists: list_for_each_entry{_safe}_from functions Add list_for_each_entry_from and list_for_each_entry_safe_from which allows to iterate in a list starting from a specific item.	2017-10-31 03:44:11 +01:00
Emmanuel Hocdet	01da571e21	MINOR: merge ssl_sock_get calls for log and ppv2 Merge ssl_sock_get_version and ssl_sock_get_proto_version. Change ssl_sock_get_cipher to be used in ppv2.	2017-10-27 19:32:36 +02:00
Emmanuel Hocdet	58118b43b1	MINOR: update proxy-protocol-v2 #define Report #define from doc/proxy-protocol.txt.	2017-10-27 19:32:36 +02:00
Olivier Houchard	9679ac997a	MINOR: ssl: Don't abuse ssl_options. A bind_conf does contain a ssl_bind_conf, which already has a flag to know if early data are activated, so use that, instead of adding a new flag in the ssl_options field.	2017-10-27 19:26:52 +02:00
Olivier Houchard	c2aae74f01	MEDIUM: ssl: Handle early data with OpenSSL 1.1.1 When compiled with Openssl >= 1.1.1, before attempting to do the handshake, try to read any early data. If any early data is present, then we'll create the session, read the data, and handle the request before we're doing the handshake. For this, we add a new connection flag, CO_FL_EARLY_SSL_HS, which is not part of the CO_FL_HANDSHAKE set, allowing to proceed with a session even before an SSL handshake is completed. As early data do have security implication, we let the origin server know the request comes from early data by adding the "Early-Data" header, as specified in this draft from the HTTP working group : https://datatracker.ietf.org/doc/html/draft-ietf-httpbis-replay	2017-10-27 10:54:05 +02:00
Olivier Houchard	51a76d84e4	MINOR: http: Mark the 425 code as "Too Early". This adds a new status code for use with the "http-request deny" ruleset. The use case for this code is currently handled by this draft dedicated to 0-RTT processing : https://datatracker.ietf.org/doc/html/draft-ietf-httpbis-replay	2017-10-27 10:53:32 +02:00
Thierry FOURNIER	31904278dc	MINOR: hlua: Add regex class This patch simply brings HAProxy internal regex system to the Lua API. Lua doesn't embed regexes, now it inherits from the regexes compiled with haproxy.	2017-10-27 10:30:44 +02:00
William Lallemand	48b4bb4b09	MEDIUM: cfgparse: post parsing registration Allow to register a function which will be called after the configuration file parsing, at the end of the check_config_validity(). It's useful for checking dependencies between sections or for resolving keywords, pointers or values.	2017-10-27 10:15:56 +02:00
William Lallemand	d2ff56d2a3	MEDIUM: cfgparse: post section callback This commit implements a post section callback. This callback will be used at the end of a section parsing. Every call to cfg_register_section must be modified to use the new prototype: int cfg_register_section(char *section_name, int (*section_parser)(const char *, int, char **, int), int (*post_section_parser)());	2017-10-27 10:14:51 +02:00
Willy Tarreau	145746c2d5	MINOR: buffer: add the buffer input manipulation functions We used to have bo_{get,put}_{chr,blk,str} to retrieve/send data to the output area of a buffer, but not the equivalent ones for the input area. This will be needed to copy uploaded data frames in HTTP/2.	2017-10-27 10:00:17 +02:00
Willy Tarreau	7b271b214f	MEDIUM: connection: make use of CO_FL_WILL_UPDATE in conn_sock_shutw() This one may be called by upper layers (eg: si_shutw()) or lower layers (si_shutw() as well during stream_int_notify()) so we want it to take care of updating the connection's flags if it's not going to be done by the caller.	2017-10-25 15:52:41 +02:00
Willy Tarreau	916e12dcfb	MINOR: connection: add flag CO_FL_WILL_UPDATE to indicate when updates are granted In transport-layer functions (snd_buf/rcv_buf), it's very problematic never to know if polling changes made to the connection will be propagated or not. This has led to some conn_cond_update_polling() calls being placed at a few places to cover both the cases where the function is called from the upper layer and when it's called from the lower layer. With the arrival of the MUX, this becomes even more complicated, as the upper layer will not have to manipulate anything from the connection layer directly and will not have to push such updates directly either. But the snd_buf functions will need to see their updates committed when called from upper layers. The solution here is to introduce a connection flag set by the connection handler (and possibly any other similar place) indicating that the caller is committed to applying such changes on return. This way, the called functions will be able to apply such changes by themselves before leaving when the flag is not set, and the upper layer will not have to care about that anymore.	2017-10-25 15:52:41 +02:00
Willy Tarreau	bc97cc4fd1	MINOR: connection: move the cleanup of flag CO_FL_WAIT_ROOM This flag is only used when reading using splicing for now, and is only set when a pipe full condition is met, so we can simplify its reset condition in conn_refresh_polling_flags so that it's cleared at the same time as the other ones, only when the control layer is ready. This flag could be used more, to mark that a buffer full condition was met with any receive method in order to simplify polling management. This should probably be revisited after 1.8.	2017-10-25 15:52:41 +02:00
Dragan Dosen	7389dd086c	IMPORT: sha1: import SHA1 functions This is based on the git SHA1 implementation and optimized to do word accesses rather than byte accesses, and to avoid unnecessary copies into the context array.	2017-10-25 04:45:48 +02:00
Emmanuel Hocdet	019f9b10ef	MINOR: ssl: build with recent BoringSSL library BoringSSL switched OPENSSL_VERSION_NUMBER to 1.1.0 for compatibility. Fix BoringSSL call and openssl-compat.h/#define accordingly. This will not break openssl/libressl compat.	2017-10-24 19:57:16 +02:00
Willy Tarreau	1296382d0b	CONTRIB: trace: add the possibility to place trace calls in the code Now any call to trace() in the code will automatically appear interleaved with the call sequence and timestamped in the trace file. They appear with a '#' on the 3rd argument (caller's pointer) in order to make them easy to spot. If the trace functionality is not used, a dummy weak function is used instead so that it doesn't require to recompile every time traces are enabled/disabled. The trace decoder knows how to deal with these messages, detects them and indents them similarly to the currently traced function. This can be used to print function arguments for example. Note that we systematically flush the log when calling trace() to ensure we never miss important events, so this may impact performance. The trace() function uses the same format as printf() so it should be easy to setup during debugging sessions.	2017-10-24 19:54:25 +02:00
Willy Tarreau	cbc6524a19	MINOR: connection: remove conn_force_close() Now only conn_full_close() will be used. It will become more obvious when the tracking is in place or not and will make it easier to convert remaining call places to conn_streams.	2017-10-22 09:54:19 +02:00
Willy Tarreau	3b737c9894	MINOR: stream-int: use conn_full_close() instead of conn_force_close() We simply disable tracking before calling it.	2017-10-22 09:54:18 +02:00
Willy Tarreau	dc42acddb6	MINOR: connection: add conn_stop_tracking() to disable tracking This will be used before conn_full_close() instead of using conn_force_close(), resulting in a clearer exit path in various situations.	2017-10-22 09:54:16 +02:00
Willy Tarreau	6a0a80adaf	MINOR: connection: ensure conn_ctrl_close() also resets the fd The connection's fd was reset to DEAD_FD_MAGIC on conn_force_close() but not on conn_full_close(), which is a bit strange. Let's do it on both.	2017-10-22 09:54:16 +02:00
Willy Tarreau	f9ce57e86c	MEDIUM: connection: make conn_sock_shutw() aware of lingering Instead of having to manually handle lingering outside, let's make conn_sock_shutw() check for it before calling shutdown(). We simply don't want to emit the FIN if we're going to reset the connection due to lingering. It's particularly important for silent-drop where it's absolutely mandatory that no packet leaves the machine.	2017-10-22 09:54:16 +02:00
Olivier Houchard	1a0545f3d7	REORG: connection: rename CO_FL_DATA_* -> CO_FL_XPRT_* These flags are not exactly for the data layer, they instead indicate what is expected from the transport layer. Since we're going to split the connection between the transport and the data layers to insert a mux layer, it's important to have a clear idea of what each layer does. All function conn_data_* used to manipulate these flags were renamed to conn_xprt_*.	2017-10-22 09:54:15 +02:00
Willy Tarreau	794f9af894	MEDIUM: h1: reimplement the http/1 response parser for the gateway The HTTP/2->HTTP/1 gateway will need to process HTTP/1 responses. We cannot sanely rely on the HTTP/1 txn to parse a response because : 1) responses generated by haproxy such as error messages, redirects, stats or Lua are neither parsed nor indexed ; this could be addressed over the long term but will take time. 2) the http txn is useless to parse the body : the states present there are only meaningful to received bytes (ie next bytes to parse) and not at all to sent bytes. Thus chunks cannot be followed at all. Even when implementing this later, it's unsure whether it will be possible when dealing with compression. So using the HTTP txn is now out of the equation and the only remaining solution is to call an HTTP/1 message parser. We already have one, it was slightly modified to avoid keeping states by benefitting from the fact that the response was produced by haproxy and this is entirely available. It assumes the following rules are true, or that incurring an extra cost to work around them is acceptable : - the response buffer is read-write and supports modifications in place - headers sent through / by haproxy are not folded. Folding is still implemented by replacing CR/LF/tabs/spaces with spaces if encountered - HTTP/0.9 responses are never sent by haproxy and have never been supported at all - haproxy will not send partial responses, the whole headers block will be sent at once ; this means that we don't need to keep expensive states and can afford to restart the parsing from the beginning when facing a partial response ; - response is contiguous (does not wrap). This was already the case with the original parser and ensures we can safely dereference all fields with (ptr,len) The parser replaces all of the http_msg fields that were necessary with local variables. The parser is not called on an http_msg but on a string with a start and an end. 
The HTTP/1 states were reused for ease of use, though the request-specific ones have not been implemented for now. The error position and error state are supported and optional ; these ones may be used later for bug hunting. The parser issues the list of all the headers into a caller-allocated array of struct ist. The content-length/transfer-encoding header are checked and the relevant info fed the h1 message state (flags + body_len).	2017-10-22 09:54:15 +02:00
Willy Tarreau	306924ecb8	MINOR: http: add very simple header management based on double strings This will be used initially by the hpack table and hopefully later by a new native http processor. These headers are made of name and value, both an immediate string (ie: pointer and length).	2017-10-22 09:54:14 +02:00
Willy Tarreau	4093a4dc01	MINOR: h1: add struct h1m for basic HTTP/1 messages This one is much simpler than http_msg and will be used in the HTTP parsers involved in the H2 to H1 gateway.	2017-10-22 09:54:14 +02:00
Willy Tarreau	b28925675d	MEDIUM: http: make the chunk crlf parser only depend on the buffer The chunk crlf parser used to depend on the channel and on the HTTP message, even though it's not really needed. Let's remove this dependency so that it can be used within the H2 to H1 gateway. As part of this small API change, it was renamed to h1_skip_chunk_crlf() to mention that it doesn't depend on http_msg anymore.	2017-10-22 09:54:14 +02:00
Willy Tarreau	e56cdd3629	MEDIUM: http: make the chunk size parser only depend on the buffer The chunk parser used to depend on the channel and on the HTTP message but it's not really needed as they're only used to retrieve the buffer as well as to return the number of bytes parsed and the chunk size. Here instead we pass the (few) relevant information in arguments so that the function may be reused without a channel nor an HTTP message (ie from the H2 to H1 gateway). As part of this API change, it was renamed to h1_parse_chunk_size() to mention that it doesn't depend on http_msg anymore.	2017-10-22 09:54:14 +02:00
Willy Tarreau	8740c8b1b2	REORG: http: move the HTTP/1 header block parser to h1.c Since it still depends on http_msg, it was not renamed yet.	2017-10-22 09:54:13 +02:00
Willy Tarreau	db4893d6a4	REORG: http: move the HTTP/1 chunk parser to h1.{c,h} Functions http_parse_chunk_size(), http_skip_chunk_crlf() and http_forward_trailers() were moved to h1.h and h1.c respectively so that they can be called from outside. The parts that were inline remained inline as it's critical for performance (+41% perf difference reported in an earlier test). For now the "http_" prefix remains in their name since they still depend on the http_msg type.	2017-10-22 09:54:13 +02:00
Willy Tarreau	0da5b3bddc	REORG: http: move some very http1-specific parts to h1.{c,h} Certain types and enums are very specific to the HTTP/1 parser, and we'll need to share them with the HTTP/2 to HTTP/1 translation code. Let's move them to h1.c/h1.h. Those with very few occurrences or only used locally were renamed to explicitly mention the relevant HTTP version : enum ht_state -> h1_state. http_msg_state_str -> h1_msg_state_str HTTP_FLG_* -> H1_FLG_* http_char_classes -> h1_char_classes Others like HTTP_IS_, HTTP_MSG_ are left to be done later.	2017-10-22 09:54:13 +02:00
Willy Tarreau	0621da5f5b	MINOR: buffer: make bo_getblk_nc() not return 2 for a full buffer This function returns the number of blocks. When a buffer is full and properly aligned, buf->p loops back to the beginning, and the test in the code doesn't cover that specific case, so it returns two chunks, a full one and an empty one. It's harmless but can sometimes have a small impact on performance and definitely makes the code hard to debug.	2017-10-22 09:54:12 +02:00
Emeric Brun	5a1335110c	BUG/MEDIUM: log: check result details truncated. Fix regression introduced by commit: 'MAJOR: servers: propagate server status changes asynchronously.' The building of the log line was re-worked to be done at the postponed point without lack of data. [wt: this only affects 1.8-dev, no backport needed]	2017-10-19 18:51:32 +02:00
Willy Tarreau	e67c4e5744	MINOR: ist: add ist0() to add a trailing zero to a string. This function modifies the string to add a zero after the end, and returns the start pointer. The purpose is to use it on strings extracted by parsers from larger strings cut with delimiters that are not important and can be destroyed. It allows any such string to be used with regular string functions. It's also convenient to use with printf() to show data extracted from writable areas.	2017-10-19 15:01:08 +02:00
Willy Tarreau	41ab86898e	MINOR: channel: make the channel be a const in all {ci,co}_get* functions There's no point having the channel marked writable as these functions only extract data from the channel. The code was retrieved from their ci/co ancestors.	2017-10-19 15:01:08 +02:00
Willy Tarreau	e0e734ccc5	MINOR: buffer: add bo_getblk() and bo_getblk_nc() These functions respectively extract a block from an output buffer by copying it or by just passing pointers and lengths for zero copy operation.	2017-10-19 15:01:08 +02:00
Willy Tarreau	06d80a9a9c	REORG: channel: finally rename the last bi_* / bo_* functions For HTTP/2 we'll need some buffer-only equivalent functions to some of the ones applying to channels and still squatting the bi_* / bo_* namespace. Since these names have kept being misleading for quite some time now and are really getting annoying, it's time to rename them. This commit will use "ci/co" as the prefix (for "channel in", "channel out") instead of "bi/bo". The following ones were renamed : bi_getblk_nc, bi_getline_nc, bi_putblk, bi_putchr, bo_getblk, bo_getblk_nc, bo_getline, bo_getline_nc, bo_inject, bi_putchk, bi_putstr, bo_getchr, bo_skip, bi_swpbuf	2017-10-19 15:01:08 +02:00
Willy Tarreau	5b9834f12a	MINOR: buffer: add buffer_space_wraps() This function returns true if the available buffer space wraps. This will be used to detect if it's worth realigning a buffer when it lacks contiguous space.	2017-10-19 15:01:08 +02:00
Willy Tarreau	e5676e7103	MINOR: buffer: add two functions to inject data into buffers bi_istput() injects the ist string into the input region of the buffer, it will be used to feed small data chunks into the conn_stream. bo_istput() does the same into the output region of the buffer, it will be used to send data via the transport layer and assumes there's no input data.	2017-10-19 15:01:08 +02:00
Willy Tarreau	6634b63c78	MINOR: buffer: add a function to match against string patterns In order to match known patterns in wrapping buffer, we'll introduce new string manipulation functions for buffers. The new function b_isteq() relies on an ist string for the pattern and compares it against any location in the buffer relative to <p>. The second function bi_eat() is specially designed to match input contents.	2017-10-19 15:01:07 +02:00
Willy Tarreau	7f564d2b60	MINOR: buffer: add bo_del() to delete a number of characters from output This simply reduces the amount of output data from the buffer after they have been transferred, in a way that is more natural than by fiddling with buf->o. b_del() was renamed to bi_del() to avoid any ambiguity (it's not yet used).	2017-10-19 15:01:07 +02:00
Willy Tarreau	dea7c5c03d	BUG/MINOR: tools: fix my_htonll() on x86_64 Commit `36eb3a3` ("MINOR: tools: make my_htonll() more efficient on x86_64") brought an incorrect asm statement missing the input constraints, causing the input value not necessarily to be placed into the same register as the output one, resulting in random output. It happens to work when building at -O0 but not above. This was only detected in the HTTP/2 parser, but in mainline it could only affect the integer to binary sample cast. No backport is needed since this bug was only introduced in the development branch.	2017-10-18 11:46:17 +02:00
Olivier Houchard	9130a9605d	MINOR: checks: Add a new keyword to specify a SNI when doing SSL checks. Add a new keyword, "check-sni", to be able to specify the SNI to be used when doing health checks over SSL.	2017-10-17 18:10:24 +02:00
Emeric Brun	64cc49cf7e	MAJOR: servers: propagate server status changes asynchronously. In order to prepare multi-thread development, code was re-worked to propagate changes asynchronously. Servers with pending status changes are registered in a list and this one is processed and emptied only once per 'run poll' loop. Operational status changes are performed before administrative status changes. In case of multiple operational status changes or admin status changes in the same 'run poll' loop iteration, those changes are merged to reach only the targeted status.	2017-10-13 12:00:27 +02:00
Willy Tarreau	bf08beb2a3	MINOR: session: remove the list of streams from struct session Commit `bcb86ab` ("MINOR: session: add a streams field to the session struct") added this list of streams that is not needed anymore. Let's get rid of it now.	2017-10-08 22:32:05 +02:00
Willy Tarreau	c939835f77	MINOR: compiler: restore the likely() wrapper for gcc 5.x After some tests, gcc 5.x produces better code with likely() than without, contrary to gcc 4.x where it was better to disable it. Let's re-enable it for 5 and above.	2017-10-08 22:32:05 +02:00
Willy Tarreau	2ba672726c	MINOR: ist: add a macro to ease const array initialization It's not possible to use strlen() in const arrays even with const strings, but we can use sizeof-1 via a macro. Let's provide this in the IST() macro, as it saves the developer from having to count the characters.	2017-09-21 15:32:31 +02:00
Willy Tarreau	82967bf9b3	MINOR: connection: adjust CO_FL_NOTIFY_DATA after removal of flags After the removal of CO_FL_DATA_RD_SH and CO_FL_DATA_WR_SH, the aggregate mask CO_FL_NOTIFY_DATA was not updated. It happens that now CO_FL_NOTIFY_DATA and CO_FL_NOTIFY_DONE are similar, which may reveal some overlap between the ->wake and ->xprt_done callbacks. We'll see after the mux changes if both are still required.	2017-09-21 06:28:52 +02:00
Willy Tarreau	5531d5732d	MINOR: net_helper: add 64-bit read/write functions These ones are the same as the previous ones but for 64 bit values. We're using my_ntohll() and my_htonll() from standard.h for the byte order conversion.	2017-09-21 06:27:08 +02:00
Willy Tarreau	2888c08346	MINOR: net_helper: add write functions These ones are the equivalent of the read_* functions. They support writing unaligned words, possibly wrapping, in host and network order. The write_i*() functions were not implemented since the caller can already use the unsigned version.	2017-09-21 06:25:10 +02:00
Willy Tarreau	d5370e1d6c	MINOR: net_helper: add functions to read from vectors This patch adds the ability to read from a wrapping memory area (ie: buffers). The new functions are called "readv_<type>". The original ones were renamed to start with "read_" to make the difference more obvious between the read method and the returned type. It's worth noting that the memory barrier in readv_bytes() is critical, as otherwise gcc decides that it doesn't need the resulting data, but even worse, removes the length checks in readv_u64() and happily performs an out-of-bounds unaligned read using read_u64()! Such "optimizations" are a bit borderline, especially when they impact security like this...	2017-09-20 11:27:31 +02:00
Willy Tarreau	26488ad358	MINOR: buffer: add b_end() and b_to_end() These ones return respectively the pointer to the end of the buffer and the distance between b->p and the end. These will simplify a bit some new code needed to parse directly from a wrapping buffer.	2017-09-20 11:27:31 +02:00
Willy Tarreau	4a6425d373	MINOR: buffer: add b_del() to delete a number of characters This will be used by code which directly parses buffers with no channel in the middle (eg: h2, might be used by checks as well).	2017-09-20 11:27:31 +02:00
Willy Tarreau	36eb3a3ac8	MINOR: tools: make my_htonll() more efficient on x86_64 The current construct was made when developing on a 32-bit machine. Having a simple bswap operation replaced with 2 bswap, 2 shift and 2 or is quite a waste of precious cycles... Let's provide a trivial asm-based implementation for x86_64.	2017-09-20 11:27:31 +02:00
Willy Tarreau	05f5047d40	MINOR: listener: new function listener_release Instead of duplicating some sensitive listener-specific code in the session and in the stream code, let's call listener_release() when releasing a connection attached to a listener.	2017-09-15 11:49:52 +02:00
Willy Tarreau	2cc5bae0b8	MINOR: listeners: make listeners count consistent with reality Some places call delete_listener() then decrement the number of listeners and jobs. At least one other place calls delete_listener() without doing so, but since it's in deinit(), it's harmless and cannot risk to cause zombie processes to survive. Given that the number of listeners and jobs is incremented when creating the listeners, it's much more logical to symmetrically decrement them when deleting such listeners.	2017-09-15 11:49:52 +02:00
Willy Tarreau	0de59fd53a	MINOR: listeners: new function create_listeners This function is used to create a series of listeners for a specific address and a port range. It automatically calls the matching protocol handlers to add them to the relevant lists. This way cfgparse doesn't need to manipulate listeners anymore. As an added bonus, the memory allocation is checked.	2017-09-15 11:49:52 +02:00
Willy Tarreau	31794892af	MINOR: unix: remove the now unused proto_uxst.h file Since everything is self contained in proto_uxst.c there's no need to export anything. The same should be done for proto_tcp.c but the file contains other stuff that's not related to the TCP protocol itself and which should first be moved somewhere else.	2017-09-15 11:49:52 +02:00
Willy Tarreau	9d5be5c823	MINOR: protocols: register the ->add function and stop calling them directly cfgparse has no business directly calling each individual protocol's 'add' function to create a listener. Now that they're all registered, better perform a protocol lookup on the family and have a standard ->add method for all of them.	2017-09-15 11:49:52 +02:00
Willy Tarreau	3228238c73	MINOR: protocols: always pass a "port" argument to the listener creation It's a shame that cfgparse() has to make special cases of each protocol just to cast the port to the target address family. Let's pass the port in argument to the function. The unix listener simply ignores it.	2017-09-15 11:49:52 +02:00
Andjelko Iharos	c4df59e914	MINOR: cli: add socket commands and config to prepend informational messages with severity Adds cli commands to change at runtime whether informational messages are prepended with severity level or not, with support for numeric and worded severity in line with syslog severity level. Adds stats socket config keyword severity-output to set default behavior per socket on startup.	2017-09-13 13:37:59 +02:00
Olivier Houchard	ed0d96cac4	MINOR: net_helper: Inline functions meant to be inlined.	2017-09-13 13:35:35 +02:00
Thierry FOURNIER	d697596c6c	MINOR: tasks: Move Lua notification from Lua to tasks These notification management functions and structs are generic, so it is better to move them into the common parts. The notification management functions and structs have names containing some "lua" references because they were written for Lua. This patch also removes these references.	2017-09-11 18:59:40 +02:00
Thierry FOURNIER	2da788e755	MEDIUM: xref/lua: Use xref for referencing cosocket relation between stream and lua This relation will ensure that each one is informed about the death of the other.	2017-09-11 18:59:40 +02:00
Thierry FOURNIER	3c65b7a916	MINOR: xref: Add a new xref system xref is used to create a relation between two elements. Once an element is released, it breaks the relation. If the relation is already broken, it frees the xref struct. The pointer between two elements is a sort of refcount with max value 1. The relation is only between two elements. The pointer and the type of element a and b are conventional. Note that xref is initialised from Lua files because Lua is the only user.	2017-09-11 18:59:40 +02:00
Emmanuel Hocdet	ddcde195eb	MINOR: ssl: rework smp_fetch_ssl_fc_cl_str without internal ssl use smp_fetch_ssl_fc_cl_str has very limited usage (it only works with openssl == 1.0.2 compiled with the option enable-ssl-trace). It uses the internal cipher.algorithm_ssl attribute and SSL_CIPHER_standard_name (available with ssl-trace). This patch implements this (debug) function in a standard way. It uses the common SSL_CIPHER_get_name to display the cipher name. It works with openssl >= 1.0.2 and boringssl.	2017-09-09 08:36:22 +02:00
Christopher Faulet	21e9267ac3	MINOR: fd: Add fd_update_events function This function should be called by the poller to set FD_POLL_* flags on an FD and update its state if needed. This function has been added to ease threads support integration.	2017-09-05 15:43:09 +02:00
Emeric Brun	52a91d3d48	MEDIUM: check: server states and weight propagation re-work The server state and weight were reworked to handle "pending" values updated by checks/CLI/LUA/agent. These values are committed to be propagated to the LB stack. In further dev related to multi-thread, the commit will be handled into a sync point. Pending values are named using the prefix 'next_' Current values used by the LB stack are named 'cur_'	2017-09-05 15:23:16 +02:00
Christopher Faulet	de2075fd21	MINOR: freq_ctr: Return the new value after an update This will ease threads support integration.	2017-09-05 11:55:07 +02:00
Christopher Faulet	d82b180d6b	MINOR: fd: Use inlined functions to check fd state in fd__send/recv functions In these functions, the test is inverted and we rely on fd_recv/send_ function to check the fd state. This will ease threads support integration.	2017-09-05 10:47:32 +02:00
Christopher Faulet	8db2fdfaba	MINOR: fd: Add fd_active function This inlined function is used to check if a fd is active for receive or send. It will ease threads support integration.	2017-09-05 10:39:46 +02:00
Christopher Faulet	6988f678cd	MINOR: http: Use a trash chunk to store decoded string of the HTTP auth header This string is used in sample fetches so it is safe to use a preallocated trash chunk instead of a buffer dynamically allocated during HAProxy startup.	2017-09-05 10:36:28 +02:00
Christopher Faulet	ca20d02ea8	MINOR: stick-tables: Make static_table_key a struct variable instead of a pointer First, this variable does not need to be publicly exposed because it is only used by stick_table functions. So we declare it as a global static in stick_table.c file. Then, it is useless to use a pointer. Using a plain struct variable avoids any dynamic allocation.	2017-09-05 10:35:07 +02:00
Christopher Faulet	ad405f1714	MINOR: buffers: Move swap_buffer into buffer.c and add deinit_buffer function swap_buffer is a global variable only used by buffer_slow_realign. So it has been moved from global.h to buffer.c and it is allocated by init_buffer function. deinit_buffer function has been added to release it. It is also used to destroy the buffers' pool.	2017-09-05 10:34:30 +02:00
Christopher Faulet	0132d06f68	MINOR: logs: Use dedicated function to init/deinit log buffers Now, we use init_log_buffers and deinit_log_buffers to, respectively, initialize and deinitialize log buffers used for syslog messages. These functions have been introduced to be used by threads, to deal with thread-local log buffers.	2017-09-05 10:29:31 +02:00
Christopher Faulet	748919a4c7	MINOR: chunks: Use dedicated function to init/deinit trash buffers Now, we use init_trash_buffers and deinit_trash_buffers to, respectively, initialize and deinitialize trash buffers (trash, trash_buf1 and trash_buf2). These functions have been introduced to be used by threads, to deal with thread-local trash buffers.	2017-09-05 10:22:20 +02:00
Christopher Faulet	576c5aa25c	MINOR: fd: Set owner and iocb field before inserting a new fd in the fdtab This will be needed for concurrent accesses.	2017-09-05 10:17:10 +02:00
Christopher Faulet	d531f88622	MINOR: fd: Don't forget to reset fdtab[fd].update when a fd is added/removed It used to be guaranteed by the polling functions on a later call but with concurrent accesses it cannot be granted anymore.	2017-09-05 10:16:42 +02:00
Christopher Faulet	f5b8adc5c0	MINOR: listeners: Change enable_listener and disable_listener into private functions These functions are only used in listener.c.	2017-09-05 10:14:16 +02:00
Christopher Faulet	5580ba2e11	MINOR: listeners: Change listener_full and limit_listener into private functions These functions are only used in listener_accept. So there is no need to export them.	2017-09-05 10:13:55 +02:00
Christopher Faulet	ae459fd206	CLEANUP: memory: Remove unused function pool_destroy This one was never used.	2017-09-05 10:13:20 +02:00
Emmanuel Hocdet	4366476852	MINOR: ssl: remove duplicate ssl_methods in struct bind_conf Patch "MINOR: ssl: support ssl-min-ver and ssl-max-ver with crt-list" introduced ssl_methods in struct ssl_bind_conf. struct bind_conf now has ssl_methods and ssl_conf.ssl_methods (unused). It's error-prone. This patch removes the duplicate structure to avoid any confusion.	2017-09-05 09:42:30 +02:00
Willy Tarreau	bbae3f0170	MEDIUM: connection: remove useless flag CO_FL_DATA_WR_SH After careful inspection, this flag is set at exactly two places : - once in the health-check receive callback after receipt of a response - once in the stream interface's shutw() code where CF_SHUTW is always set on chn->flags The flag was checked in the checks before deciding to send data, but when it is set, the wake() callback immediately closes the connection so the CO_FL_SOCK_WR_SH flag is also set. The flag was also checked in si_conn_send(), but checking the channel's flag instead is enough and even reveals that one check involving it could never match. So it's time to remove this flag and replace its check with a check of CF_SHUTW in the stream interface. This way each layer is responsible for its shutdown, this will ease insertion of the mux layer.	2017-08-30 10:05:49 +02:00
Willy Tarreau	cde5651c4d	CLEANUP: connection: remove the unused conn_sock_shutw_pending() This has never been used anywhere.	2017-08-30 08:18:53 +02:00
Willy Tarreau	54e917cfa1	MEDIUM: connection: remove useless flag CO_FL_DATA_RD_SH This flag is both confusing and wrong. It is supposed to report the fact that the data layer has received a shutdown, but in fact this is reported by CO_FL_SOCK_RD_SH which is set by the transport layer after this condition is detected. The only case where the flag above is set is in the stream interface where CF_SHUTR is also set on the receiving channel. In addition, it was checked in the health checks code (while never set) and was always tested jointly with CO_FL_SOCK_RD_SH everywhere, except in conn_data_read0_pending() which incorrectly doesn't match the second time it's called and is fortunately protected by an extra check on (ic->flags & CF_SHUTR). This patch gets rid of the flag completely. Now conn_data_read0_pending() accurately reports the fact that the transport layer has detected the end of the stream, regardless of the fact that this state was already consumed, and the stream interface watches ic->flags&CF_SHUTR to know if the channel was already closed by the upper layer (which it already used to do). The now unused conn_data_read0() function was removed.	2017-08-30 08:18:50 +02:00
Willy Tarreau	5790eb0a76	MINOR: stream: provide a new stream creation function for connections The purpose will be to create new streams for a given connection so that we can later abstract this from a mux.	2017-08-30 07:06:39 +02:00
Willy Tarreau	0b74eae1f1	MEDIUM: session: add a pointer to a struct task in the session The session may need to enforce a timeout when waiting for a handshake. Till now we used a trick to avoid allocating a pointer, we used to set the connection's owner to the task and set the task's context to the session, so that it was possible to circle between all of them. The problem is that we'll really need to pass the pointer to the session to the upper layers during initialization and that the only place to store it is conn->owner, which is squatted for this trick. So this patch moves the struct task* into the session where it should always have been and ensures conn->owner points to the session until the data layer is properly initialized.	2017-08-30 07:05:49 +02:00
Willy Tarreau	ca3610251b	CLEANUP: listener: remove the unused handler field Historically listeners used to have a handler depending on the upper layer. But now it's exclusively process_stream() and nothing uses it anymore so it can safely be removed.	2017-08-30 07:05:08 +02:00
Willy Tarreau	87787acf72	MEDIUM: stream: make stream_new() allocate its own task Currently a task is allocated in session_new() and serves two purposes : - either the handshake is complete and it is offered to the stream via the second arg of stream_new() - or the handshake is not complete and it's diverted to be used as a timeout handler for the embryonic session and repurposed once we land into conn_complete_session() Furthermore, the task's process() function was taken from the listener's handler in conn_complete_session() prior to being replaced by a call to stream_new(). This will become a serious mess with the mux. Since it's impossible to have a stream without a task, this patch removes the second arg from stream_new() and makes this function allocate its own task. In session_accept_fd(), we now only allocate the task if needed for the embryonic session and delete it later.	2017-08-30 07:05:04 +02:00
Willy Tarreau	8e3c6ce75a	MEDIUM: connection: get rid of data->init() which was not for data The ->init() callback of the connection's data layer was only used to complete the session's initialisation since sessions and streams were split apart in 1.6. The problem is that it creates a big confusion in the layers' roles as the session has to register a dummy data layer when waiting for a handshake to complete, then hand it off to the stream which will replace it. The real need is to notify that the transport has finished initializing. This should enable a better splitting between these layers. This patch thus introduces a connection-specific callback called xprt_done_cb() which informs about handshake successes or failures. With this, data->init() can disappear, CO_FL_INIT_DATA as well, and we don't need to register a dummy data->wake() callback to be notified of errors.	2017-08-30 07:04:04 +02:00
Willy Tarreau	585744bf2e	REORG/MEDIUM: connection: introduce the notion of connection handle Till now connections used to rely exclusively on file descriptors. It was planned in the past that alternative solutions would be implemented, leading to member "union t" presenting sock.fd only for now. With QUIC, the connection will need to continue to exist but will not rely on a file descriptor but a connection ID. So this patch introduces a "connection handle" which is either a file descriptor or a connection ID, to replace the existing "union t". We've now removed the intermediate "struct sock" which was never used. There is no functional change at all, though the struct connection was inflated by 32 bits on 64-bit platforms due to alignment.	2017-08-24 19:30:04 +02:00
Willy Tarreau	0c219be3df	BUG/MEDIUM: dns: fix accepted_payload_size parser to avoid integer overflow Since commit `9d8dbbc` ("MINOR: dns: Maximum DNS udp payload set to 8192") it's possible to specify a packet size, but passing too large a size or a negative size is not detected and results in memset() being performed over a 2GB+ area upon receipt of the first DNS response, causing runtime crashes. We now check that the size is not smaller than the smallest packet which is the DNS header size (12 bytes). No backport is needed.	2017-08-22 12:03:46 +02:00
Baptiste Assmann	9d8dbbc56b	MINOR: dns: Maximum DNS udp payload set to 8192 Following up DNS extension introduction, this patch aims at making the computation of the maximum number of records in DNS response dynamic. This computation is based on the announced payload size accepted by HAProxy.	2017-08-22 11:39:57 +02:00
Baptiste Assmann	747359eeca	BUG/MINOR: dns: server set by SRV records stay in "no resolution" status This patch fixes a bug where some servers managed by SRV record query types never recover from a "no resolution" status. The problem is due to a wrong function being called when breaking the server/resolution (A/AAAA) relationship: this is performed when a server's SRV record disappears from the SRV response.	2017-08-22 11:34:49 +02:00
Frédéric Lécaille	6ca71a9297	BUG/MINOR: Wrong type used as argument for spoe_decode_buffer(). Contrary to 64-bits libCs where size_t type size is 8, on systems with 32-bits size of size_t is 4 (the size of a long) which does not equal to size of uint64_t type. This was revealed by such GCC warnings on 32bits systems: src/flt_spoe.c:2259:40: warning: passing argument 4 of spoe_decode_buffer from incompatible pointer type if (spoe_decode_buffer(&p, end, &str, &sz) == -1) ^ As the already existing code using spoe_decode_buffer() already use such pointers to uint64_t, in place of pointer to size_t ;), most of this code is in contrib directory, this simple patch modifies the prototype of spoe_decode_buffer() so that to use a pointer to uint64_t in place of a pointer to size_t, uint64_t type being the type finally required for decode_varint().	2017-08-22 11:27:20 +02:00
Willy Tarreau	a5480694bf	MINOR: http: export some of the HTTP parser macros The two macros EXPECT_LF_HERE and EAT_AND_JUMP_OR_RETURN were exported for use outside the HTTP parser. They now take extra arguments to avoid implicit pointers and jump labels. These will be used to reimplement a minimalist HTTP/1 parser in the H1->H2 gateway.	2017-08-18 13:38:47 +02:00
Willy Tarreau	e11f727c95	MINOR: ist: implement very simple indirect strings For HPACK we'll need to perform a lot of string manipulation between the dynamic headers table and the output stream, and we need an efficient way to deal with that, considering that the zero character is not an end of string marker here. It turns out that gcc supports returning structs from functions and is able to place up to two words directly in registers when -freg-struct is used, which is the case by default on x86 and armv8. On other architectures the caller reserves some stack space where the callee can write, which is equivalent to passing a pointer to the return value. So let's implement a few functions to deal with this as the resulting code will be optimized on certain architectures where retrieving the length of a string will simply consist in reading one of the two returned registers. Extreme care was taken to ensure that the compiler gets maximum opportunities to optimize out every bit of unused code. This is also the reason why no call to regular string functions (such as strlen(), memcmp(), memcpy() etc) were used. The code involving them is often larger than when they are open coded. Given that strings are usually very small, especially when manipulating headers, the time spent calling a function optimized for large vectors often ends up being higher than the few cycles needed to count a few bytes. An issue was met with __builtin_strlen() which can automatically convert a constant string to its constant length. It doesn't accept NULLs and there is no way to hide them using expressions as the check is made before the optimizer is called. On gcc 4 and above, using an intermediary variable is enough to hide it. On older versions, calls to ist() with an explicit NULL argument will issue a warning. There is normally no reason to do this but taking care of it the best possible still seems important.	2017-08-18 13:38:47 +02:00
Willy Tarreau	2bfd35885e	MINOR: stream: link the stream to its session Now each stream is added to the session's list of streams, so that it will be possible to know all the streams belonging to a session, and to know if any stream is still attached to a session.	2017-08-18 13:26:35 +02:00
Willy Tarreau	bcb86abaca	MINOR: session: add a streams field to the session struct This will be used to hold the list of streams belonging to a given session.	2017-08-18 13:26:35 +02:00
Willy Tarreau	82032f1223	MINOR: chunks: add chunk_memcpy() and chunk_memcat() These two functions respectively copy a memory area onto the chunk, and append the contents of a memory area over a chunk. They are convenient to prepare binary output data to be sent and will be used for HTTP/2.	2017-08-18 13:26:20 +02:00
Baptiste Assmann	2af08fe3de	MINOR: dns: enabled edns0 extension and make accepted payload size tunable Edns extensions may be used to negotiate some settings between a DNS client and a server. For now we only use it to announce the maximum response payload size accepted by HAProxy. This size can be set through a configuration parameter in the resolvers section. If not set, it defaults to 512 bytes.	2017-08-18 11:25:56 +02:00
Baptiste Assmann	572ab8b269	MINOR: dns: new dns record type (RTYPE) for OPT DNS record type OPT is required to send additional records. OPT has been assigned ID 41.	2017-08-18 11:25:49 +02:00
Emmanuel Hocdet	15969297af	BUILD: ssl: replace SSL_CTX_get0_privatekey for openssl < 1.0.2 Commit `48a8332a` introduced SSL_CTX_get0_privatekey in openssl-compat.h, but SSL_CTX_get0_privatekey accesses an internal structure and can't be a candidate for openssl-compat.h. The workaround with openssl < 1.0.2 is to use SSL_new then SSL_get_privatekey.	2017-08-11 11:35:26 +02:00
Olivier Houchard	8da5f98fbe	MINOR: dns: Handle SRV records. Make it so for each server, instead of specifying a hostname, one can use a SRV label. When doing so, haproxy will first resolve the SRV label, then use the resulting hostnames, as well as port and weight (priority is ignored right now), to each server using the SRV label. It is resolved periodically, and any server disappearing from the SRV records will be removed, and any server appearing will be added, assuming there're free servers in haproxy.	2017-08-09 16:32:49 +02:00
Olivier Houchard	e962fd880d	Add a few functions to do unaligned access. Add a few functions to read 16bits and 32bits integers that may be unaligned, both in host and network order.	2017-08-09 16:32:49 +02:00
Olivier Houchard	e2c222b12f	MINOR: obj: Add a new type of object, OBJ_TYPE_SRVRQ. dns_srvrq will be objects used for dealing with SRV records.	2017-08-09 16:32:49 +02:00
Olivier Houchard	a8c6db8d2d	MINOR: dns: Cache previous DNS answers. As DNS servers may not return all IPs in one answer, we want to cache the previous entries. Those entries are removed when considered obsolete, which happens when the IP hasn't been returned by the DNS server for a time defined in the "hold obsolete" parameter of the resolver section. The default is 30s.	2017-08-09 16:32:49 +02:00
Frédéric Lécaille	3169471964	MINOR: Add server port field to server state file. This patch adds server ports to server state file at the end of each line for backward compatibility.	2017-08-03 14:31:46 +02:00
Christopher Faulet	48a8332a4a	BUG/MEDIUM: ssl: Fix regression about certificates generation Since the commit `f6b37c67` ["BUG/MEDIUM: ssl: in bind line, ssl-options after 'crt' are ignored."], the certificates generation is broken. To generate a certificate, we retrieved the private key of the default certificate using the SSL object. But since the commit `f6b37c67`, the SSL object is created with a dummy certificate (initial_ctx). So to fix the bug, we use directly the default certificate in the bind_conf structure. We use SSL_CTX_get0_privatekey function to do so. Because this function does not exist for OpenSSL < 1.0.2 and for LibreSSL, it has been added in openssl-compat.h with the right #ifdef.	2017-07-28 18:25:18 +02:00
Willy Tarreau	6d0d3f6546	MINOR: listener: add a function to return a listener's state as a string This will be used in debugging output, so it's a short 3-character string.	2017-07-28 17:03:12 +02:00
Emmanuel Hocdet	174dfe55a0	MINOR: ssl: add "no-ca-names" parameter for bind This option prevents sending CA names in the server hello message when ca-file is used. This parameter is also available in "crt-list".	2017-07-28 15:20:48 +02:00
Willy Tarreau	71d058c288	MINOR: ssl: add a new error codes for wrong server certificates If a server presents an unexpected certificate to haproxy, that is, a certificate that doesn't match the expected name as configured in verifyhost or as requested using SNI, we want to store that precious information. Fortunately we have access to the connection in the verification callback so it's possible to store an error code there. For this purpose we use CO_ER_SSL_MISMATCH_SNI (for when the cert name didn't match the one requested using SNI) and CO_ER_SSL_MISMATCH for when it doesn't match verifyhost.	2017-07-28 11:50:16 +02:00
Christopher Faulet	96c7b8dbd2	BUG/MINOR: ssl: Fix check against SNI during server certificate verification This patch fixes the commit `2ab8867` ("MINOR: ssl: compare server certificate names to the SNI on outgoing connections") When we check the certificate sent by a server, in the verify callback, we get the SNI from the session (SSL_SESSION object). In OpenSSL, tlsext_hostname value for this session is copied from the ssl connection (SSL object). But the copy is done only if the "server_name" extension is found in the server hello message. This means the server has found a certificate matching the client's SNI. When the server returns a default certificate not matching the client's SNI, it doesn't set any "server_name" extension in the server hello message. So no SNI is set on the SSL session and SSL_SESSION_get0_hostname always returns NULL. To fix the problem, we get the SNI directly from the SSL connection. It is always defined with the value set by the client. If the commit `2ab8867` is backported in 1.7 and/or 1.6, this one must be backported too. Note: it's worth mentioning that by making the SNI check work, we introduce another problem by which failed SNI checks can cause long connection retries on the server, and in certain cases the SNI value used comes from the client. So this patch series must not be backported until this issue is resolved.	2017-07-26 19:43:33 +02:00
Willy Tarreau	f42199975c	MINOR: task: always preinitialize the task's timeout in task_init() task_init() is called exclusively by task_new() which is the only way to create a task. Most callers set t->expire to TICK_ETERNITY, some set it to another value and a few like Lua don't set it at all as they don't need a timeout, causing random values to be used in case the task gets queued. Let's always set t->expire to TICK_ETERNITY in task_init() so that all tasks are now initialized in a clean state. This patch can be backported as it will definitely make the code more robust (at least the Lua code, possibly other places).	2017-07-24 17:52:58 +02:00
Christopher Faulet	5db105e8b2	MINOR: samples: Handle the type SMP_T_METH in smp_is_safe and smp_is_rw For all known methods, samples are considered as safe and rewritable. For unknowns, we handle them like strings (SMP_T_STR).	2017-07-24 17:16:00 +02:00
David Carlier	b781dbede3	MINOR: memory: remove macros We finally get rid of the macros and use usual memory management functions directly.	2017-07-21 09:54:03 +02:00
Willy Tarreau	cb1949b8b3	MINOR: tools: add a portable timegm() alternative timegm() is not provided everywhere and the documentation on how to replace it is bogus as it proposes an inefficient and non-thread safe alternative. Here we reimplement everything needed to compute the number of seconds since Epoch based on the broken down fields in struct tm. It is only guaranteed to return correct values for correct inputs. It was successfully tested with all possible 32-bit values of time_t converted to struct tm using gmtime() and back to time_t using the legacy timegm() and this function, and both functions always produced the same result. Thanks to Benoît Garnier for an instructive discussion and detailed explanations of the various time functions, leading to this solution.	2017-07-19 19:15:06 +02:00
Emmanuel Hocdet	8c2ddc20de	BUILD: ssl: fix compatibility with openssl without TLSEXT_signature_* In openssl < 1.0.1, TLSEXT_signature_* is undefined. Add TLSEXT signatures (RFC 5246) when TLSEXT_signature_anonymous is undefined.	2017-07-19 17:19:33 +02:00
Thierry FOURNIER	b13b20a19a	BUG/MAJOR: lua/socket: resources not destroyed when the socket is aborted In some cases, the socket is misused. The user can open a socket and never close it, or open the socket and close it without sending data. This causes a leak of all resources associated with the stream (buffer, spoe, ...). This is caused by the stream_shutdown function, which is called outside of the stream execution process. Sometimes the shutdown is required while the stream is not started, so the cleanup is ignored. This patch changes the shutdown mode of the session. Now if the session is no longer used and Lua wants to destroy it, it just sets a destroy flag and the session kills itself. This patch should be backported in 1.6 and 1.7	2017-07-18 06:41:33 +02:00
Willy Tarreau	106f631280	CLEANUP: hdr_idx: make some function arguments const where possible Functions hdr_idx_first_idx() and hdr_idx_first_pos() were missing a "const" qualifier on their arguments which are not modified, causing a warning in some experimental H2 code.	2017-07-17 21:11:30 +02:00
Frédéric Lécaille	ed2b4a6b79	BUG/MINOR: peers: peer synchronization issue (with several peers sections). When several stick-tables were configured with several peers sections, only a part of them could be synchronized: the ones attached to the last parsed 'peers' section. This was due to the fact that, at least, the peer I/O handler referred to the wrong peer section list, in fact always the same: the last one parsed. The fact that the global peer section list was named "struct peers *peers" led to this issue. This variable name is dangerous ;). So this patch renames global 'peers' variable to 'cfg_peers' to ensure that no such wrong references are still in use, then all the functions which used old 'peers' variable have been modified to refer to the correct peer list. Must be backported to 1.6 and 1.7.	2017-07-13 09:39:29 +02:00
Willy Tarreau	2ab88675ec	MINOR: ssl: compare server certificate names to the SNI on outgoing connections When support for passing SNI to the server was added in 1.6-dev3, there was no way to validate that the certificate presented by the server would really match the name requested in the SNI, which is quite a problem as it allows other (valid) certificates to be presented instead (when hitting the wrong server or due to a man in the middle). This patch adds the missing check against the value passed in the SNI. The "verifyhost" value keeps precedence if set. If no SNI is used and no verifyhost directive is specified, then the certificate name is not checked (this is unchanged). In order to extract the SNI value, it was necessary to make use of SSL_SESSION_get0_hostname(), which appeared in openssl 1.1.0. This is a trivial function which returns the value of s->tlsext_hostname, so it was provided in the compat layer for older versions. After some refinements from Emmanuel, it now builds with openssl 1.0.2, openssl 1.1.0 and boringssl. A test file was provided to ease testing all cases. After some careful observation period it may make sense to backport this to 1.7 and 1.6 as some users rightfully consider this limitation as a bug. Cc: Emmanuel Hocdet <manu@gandi.net> Signed-off-by: Willy Tarreau <w@1wt.eu>	2017-07-06 15:15:28 +02:00
Emeric Brun	7d27f3c12d	BUG/MEDIUM: map/acl: fix unwanted flags inheritance. The bug: Maps/ACLs using the same file/id can mistakenly inherit their flags from the last declared one. i.e. $ cat haproxy.conf listen mylistener mode http bind 0.0.0.0:8080 acl myacl1 url -i -f mine.acl acl myacl2 url -f mine.acl acl myacl3 url -i -f mine.acl redirect location / if myacl2 $ cat mine.acl foobar Shows an unexpected redirect for request 'GET /FOObAR HTTP/1.0\n\n'. This fix should be backported on mainline branches v1.6 and v1.7.	2017-07-04 10:45:53 +02:00
Emeric Brun	8d85aa44da	BUG/MAJOR: map: fix segfault during 'show map/acl' on cli. The reference of the current map/acl element to dump could be destroyed if the map is updated from an 'http-request del-map' configuration rule or through a 'del map/acl' on the CLI. We use a 'back_refs' chaining element to fix this, as is done to dump sessions. This patch also needs the fix 'BUG/MAJOR: cli: fix custom io_release was crushed by NULL.' to clean the back_ref and avoid a crash on a further del/clear map operation. Those fixes should be backported on mainline branches 1.7 and 1.6. This patch won't directly apply on 1.6.	2017-06-30 06:49:42 +02:00
Emeric Brun	c730606879	MAJOR: applet: applet scheduler rework. In order to authorize call of appctx_wakeup on running task: - from within the task handler itself. - in the future, from another thread. The appctx is considered paused by default after running the handler. The handler should explicitly call appctx_wakeup to be re-called. When appctx_free is called on a running handler, the real free is postponed to the end of the handler process.	2017-06-27 14:38:02 +02:00
Willy Tarreau	a9c1741820	MINOR: connection: add a .get_alpn() method to xprt_ops This will be used to retrieve the ALPN negotiated over SSL (or possibly via the proxy protocol later). It's likely that this information should be stored in the connection itself, but it requires adding an extra pointer and an extra integer. Thus better rely on the transport layer to pass this info for now.	2017-06-27 14:38:02 +02:00
Christopher Faulet	f3a55dbd22	MINOR: queue: Change pendconn_from_srv/pendconn_from_px into private functions	2017-06-27 14:38:02 +02:00
Christopher Faulet	f0614e8111	MINOR: backends: Change get_server_sh/get_server_uh into private function	2017-06-27 14:38:02 +02:00
Christopher Faulet	87566c923b	MINOR: queue: Change pendconn_get_next_strm into private function	2017-06-27 14:38:02 +02:00
Emeric Brun	0194897e54	MAJOR: task: task scheduler rework. In order to authorize call of task_wakeup on running task: - from within the task handler itself. - in the future, from another thread. The lookups on runqueue and waitqueue are re-worked to prepare multithread stuff. If task_wakeup is called on a running task, the woken message flags are saved in the 'pending_state' attribute of the state. The real wakeup is postponed to the end of the handler process and the woken messages are copied from pending_state to the state attribute of the task. It's important to note that this change will cause a very minor (though measurable) performance loss but it is necessary to make forward progress on a multi-threaded scheduler. Most users won't ever notice.	2017-06-27 14:38:02 +02:00
Emeric Brun	ff4491726f	BUG/MINOR: stream: flag TASK_WOKEN_RES not set if task in runqueue Under certain circumstances, if a stream's task is first woken up (eg: I/O event) then notified of the availability of a buffer it was waiting for via stream_res_wakeup(), this second event is lost because the flags are only merged after seeing that the task is running. At the moment it seems that the TASK_WOKEN_RES event is not explicitly checked for, but better fix this before getting reports of lost events. This fix removes this "task running" test which is properly performed in task_wakeup(), while the flags are properly merged. It must be backported to 1.7 and 1.6.	2017-06-27 14:37:52 +02:00
Christopher Faulet	a36b311b9f	BUG/MINOR: buffers: Fix bi/bo_contig_space to handle full buffers These functions were added in commit `637f8f2c` ("BUG/MEDIUM: buffers: Fix how input/output data are injected into buffers"). This patch fixes hidden bugs. When a buffer is full (buf->i + buf->o == buf->size), instead of returning 0, these functions can return buf->size. Today, this never happens because callers already check if the buffer is full before calling bi/bo_contig_space. But to avoid possible bugs if calling conditions changed, we slightly refactored these functions.	2017-06-14 16:20:20 +02:00
Emmanuel Hocdet	df701a2adb	MINOR: ssl: support ssl-min-ver and ssl-max-ver with crt-list SSL/TLS version can be changed per certificate if and only if the openssl lib supports an earlier callback on handshake and, of course, it is implemented in haproxy. It's ok for BoringSSL. For OpenSSL, version 1.1.1 has such a callback and could support it.	2017-06-02 16:42:09 +02:00
Willy Tarreau	2686dcad1e	CLEANUP: connection: remove unused CO_FL_WAIT_DATA Very early in the connection rework process leading to v1.5-dev12, commit `56a77e5` ("MEDIUM: connection: complete the polling cleanups") marked the end of use for this flag which since was never set anymore, but it continues to be tested. Let's kill it now.	2017-06-02 15:50:27 +02:00
Willy Tarreau	ed936c5d37	MINOR: tools: make debug_hexdump() take a string prefix When dumping data at various places in the code, it's hard to figure what is present where. To make this easier, this patch slightly modifies debug_hexdump() to take a prefix string which is prepended in front of each output line.	2017-06-02 15:49:31 +02:00
Willy Tarreau	9faef1e391	MINOR: tools: make debug_hexdump() use a const char for the string There's no reason the string to be dumped should be a char *, it's a const.	2017-06-02 15:49:31 +02:00
Jarno Huuskonen	577d5ac8ae	CLEANUP: str2mask return code comment: non-zero -> zero.	2017-06-02 15:43:46 +02:00
Baptiste Assmann	201c07f681	MAJOR/REORG: dns: DNS resolution task and requester queues This patch is a major upgrade of the internal run-time DNS resolver in HAProxy and it brings the following 3 main changes: 1. DNS resolution task Up to now, DNS resolution was triggered by the health check task. From now, DNS resolution task is autonomous. It is started by HAProxy right after the scheduler is available and it is woken either when a network IO occurs for one of its nameservers or when a timeout is matched. From now, this means we can enable DNS resolution for a server without enabling health checking. 2. Introduction of a dns_requester structure Up to now, DNS resolution was purposely made for resolving server hostnames. The idea is to ensure that any HAProxy internal object should be able to trigger a DNS resolution. For this purpose, 2 things have to be done: - clean up the DNS code from the server structure (this was already quite clean actually) and clean up the server's callbacks from manipulating too much DNS resolution - create an agnostic structure which allows linking a DNS resolution and a requester of any type (using obj_type enum) 3. Manage requesters through queues Up to now, there was a unique relationship between a resolution and its owner (aka the requester now). It's a shame, because in some cases, multiple objects may share the same hostname and may benefit from a resolution being performed by a third party. This patch introduces the notion of queues, which are basically lists of either currently running resolutions or waiting ones. The resolutions are now available as a pool, which belongs to the resolvers. The pool has a default size of 64 resolutions per resolvers and is allocated at configuration parsing.	2017-06-02 11:58:54 +02:00
Baptiste Assmann	fa4a663095	MINOR: dns: implement an LRU cache for DNS resolutions Introduction of a DNS response LRU cache in HAProxy. When a positive response is received from a DNS server, HAProxy stores it in the struct resolution and then also populates an LRU cache with the response. For now, the key in the cache is a XXHASH64 of the hostname in the domain name format concatenated to the query type in string format.	2017-06-02 11:40:01 +02:00
Baptiste Assmann	729c901c3f	MAJOR: dns: save a copy of the DNS response in struct resolution Prior to this patch, the DNS responses were stored in a pre-allocated memory area (allocated at HAProxy's startup). The problem is that this memory is erased for each new DNS response received and processed. This patch removes the global memory allocation (which was not thread safe by the way) and introduces a storage of the dns response in the struct resolution. The memory in the struct resolution is also reserved at start up and is thread safe, since each resolution structure will have its own memory area. For now, we simply store the response and use it atomically per response per server.	2017-06-02 11:30:21 +02:00
Baptiste Assmann	fb7091e213	MINOR: dns: new snr_check_ip_callback function In the process of breaking links between dns_* functions and other structures (mainly server and a bit of resolution), the function dns_get_ip_from_response needs to be reworked: it now can call "callback" functions based on resolution's owner type to allow modifying the way the response is processed. For now, the main purpose of the callback function is to check that an IP address is not already assigned to an element of the same type. For now, only server type has a callback.	2017-06-02 11:28:14 +02:00
Baptiste Assmann	42746373eb	REORG: dns: dns_option structure, storage of hostname_dn This patch introduces some re-organisation around the DNS code in HAProxy. 1. make the dns_* functions less dependent on 'struct server' and 'struct resolution'. With this in mind, the following changes were performed: - 'struct dns_options' has been removed from 'struct resolution' (well, we might need it back at some point later, we'll see) ==> we'll use the 'struct dns_options' from the owner of the resolution - dns_get_ip_from_response(): takes a 'struct dns_options' instead of 'struct resolution' ==> so the caller can pass its own dns options to get the most appropriate IP from the response - dns_process_resolve(): struct dns_option is deduced from new resolution->requester_type parameter 2. add hostname_dn and hostname_dn_len into struct server In order to avoid recomputing a server's hostname into its domain name format (and use a trash buffer to store the result), it is safer to compute it once at configuration parsing and to store it into the struct server. In the meantime, the struct resolution linked to the server no longer needs to store the hostname in domain name format. A simple pointer to the server one will do the trick. The function srv_alloc_dns_resolution() properly manages everything for us: memory allocation, pointer updates, etc... 3. move resolvers pointer into struct server This patch makes the pointer to struct dns_resolvers from struct dns_resolution obsolete. Purpose is to make the resolution as "neutral" as possible and since the requester is already linked to the resolvers, then we don't need this information anymore in the resolution itself.	2017-06-02 11:26:48 +02:00
Baptiste Assmann	81ed1a0516	MINOR: dns: functions to manage memory for a DNS resolution structure A couple of new functions to allocate and free memory for a DNS resolution structure. Main purpose is to make the code related to DNS more consistent. They allocate or free memory for the structure itself. Later, if needed, they should also allocate / free the buffers, etc, used by this structure. They don't set/unset any parameters, this is the role of the caller. This patch also implements calls to these functions everywhere it is required.	2017-06-02 11:20:29 +02:00
Baptiste Assmann	d0aa6d2399	MINOR: dns: smallest DNS fqdn size global variable used to define the size of the smallest fqdn possible.	2017-06-02 11:20:07 +02:00
Stéphane Cottin	23e9e93128	MINOR: log: Add logurilen tunable. The default len of request uri in log messages is 1024. In some use cases, you need to keep the long trail of GET parameters. The only way to increase this len is to recompile with DEFINE=-DREQURI_LEN=2048. This commit introduces a tune.http.logurilen configuration directive, allowing to tune this at runtime.	2017-06-02 11:06:36 +02:00
William Lallemand	69f9b3bfa4	MEDIUM: mworker: exit-on-failure option This option exits all workers when one of the current workers dies. It allows you to monitor the master process in order to relaunch everything on a failure. For example it can be used with systemd and Restart=on-failure in a spec file.	2017-06-02 10:56:32 +02:00
William Lallemand	095ba4c242	MEDIUM: mworker: replace systemd mode by master worker mode This commit removes the -Ds systemd mode in HAProxy in order to replace it by a more generic master worker system. It aims to replace entirely the systemd wrapper in the near future. The master worker mode implements a new way of managing HAProxy processes. The master is in charge of parsing the configuration file and is responsible for spawning child processes. The master worker mode can be invoked by using the -W flag. It can be used either in background mode (-D) or foreground mode. When used in background mode, the master will fork to daemonize. In master worker background mode, chroot, setuid and setgid are done in each child rather than in the master process, because the master process will still need access to the filesystem to reload the configuration.	2017-06-02 10:56:32 +02:00
Emeric Brun	3854e0102b	MEDIUM: ssl: handle multiple async engines This patch adds the support of a maximum of 32 engines in async mode. Some tests have been done using 2 engines simultaneously. This patch also removes specific 'async' attribute from the connection structure. All the code relies only on Openssl functions.	2017-05-27 07:12:27 +02:00
Grant Zhang	fa6c7ee702	MAJOR: ssl: add openssl async mode support ssl-mode-async is a global configuration parameter which enables asynchronous processing in OPENSSL for all SSL connections haproxy handles. With SSL_MODE_ASYNC set, TLS I/O operations may indicate a retry with SSL_ERROR_WANT_ASYNC if an asynchronous capable engine is used to perform cryptographic operations. Currently async mode only supports one async-capable engine. This is the latest version of the patchset which includes Emeric's updates : - improved async fd cleaning when openssl reports an fd to delete - prevent conn_fd_handler from calling SSL_{read,write,handshake} until the async fd is ready, as these operations are very slow and waste CPU - postpone of SSL_free to ensure the async operation can complete and does not cause a dereference of a released SSL. - proper removal of async fd from the fdtab and removal of the unused async flag.	2017-05-27 07:05:54 +02:00
Grant Zhang	872f9c2139	MEDIUM: ssl: add basic support for OpenSSL crypto engine This patch adds the global 'ssl-engine' keyword. First arg is an engine identifier followed by a list of default_algorithms the engine will operate. If the openssl version is too old, an error is reported when the option is used.	2017-05-27 07:05:00 +02:00
William Lallemand	f6975e9f76	MINOR: cli: add 'expose-fd listeners' to pass listeners FDs This patch changes the stats socket rights for allowing the sending of listening sockets. The previous behavior was to allow any unix stats socket with admin level to send sockets. It's not possible anymore, you have to set this option to activate the socket sending. Example: stats socket /var/run/haproxy4.sock mode 666 expose-fd listeners level user process 4	2017-05-27 07:02:17 +02:00
William Lallemand	07a62f7a7e	MINOR: cli: add ACCESS_LVL_MASK to store the access level The current level variable uses only 2 bits for storing the 3 access levels (user, oper and admin). This patch adds a bitmask which allows using the remaining bits for other purposes.	2017-05-27 07:02:06 +02:00
Emmanuel Hocdet	5db33cbdc4	MEDIUM: ssl: ssl_methods implementation is reworked and factored for min/max tlsxx Plan is to add min-tlsxx max-tlsxx configuration, more consistent than no-tlsxx. This patch introduces internal min/max and replaces the force-tlsxx implementation. SSL method configuration is stored in 'struct tls_version_filter'. SSL method configuration to openssl setting is abstracted in the 'methodVersions' table. With openssl < 1.1.0, SSL_CTX_set_ssl_version is used for force (min == max). With openssl >= 1.1.0, SSL_CTX_set_min/max_proto_version is used.	2017-05-12 15:49:04 +02:00
Lukas Tribus	53ae85c38e	MINOR: ssl: add prefer-client-ciphers Currently we unconditionally set SSL_OP_CIPHER_SERVER_PREFERENCE [1], which may not always be a good thing. The benefit of server side cipher prioritization may not apply to all cases out there, and it appears that the various SSL libs are going away from this recommendation ([2], [3]), as insecure cipher suites are properly blacklisted/removed and honoring the client's preference is more likely to improve user experience (for example using SW-friendly ciphers on devices without HW AES support). This is especially true for TLSv1.3, which will restrict the cipher suites to just AES-GCM and Chacha20/Poly1305. Apache [4], nginx [5] and others give admins full flexibility, we should as well. The initial proposal to change the current default and add a "prefer-server-ciphers" option (as implemented in `e566ecb`) has been declined due to the possible security impact. This patch implements prefer-client-ciphers without changing the defaults. [1] https://www.openssl.org/docs/man1.0.2/ssl/SSL_CTX_set_options.html [2] https://github.com/openssl/openssl/issues/541 [3] https://github.com/libressl-portable/portable/issues/66 [4] https://httpd.apache.org/docs/2.0/en/mod/mod_ssl.html#sslhonorcipherorder [5] https://nginx.org/en/docs/http/ngx_http_ssl_module.html#ssl_prefer_server_ciphers	2017-05-12 15:49:04 +02:00
Frédéric Lécaille	b418c1228c	MINOR: server: cli: Add server FQDNs to server-state file and stats socket. This patch adds a new stats socket command to modify server FQDNs at run time. Its syntax: set server <backend>/<server> fqdn <FQDN> This patch also adds FQDNs to server state file at the end of each line for backward compatibility ("-" if not present).	2017-05-03 06:58:53 +02:00
Lukas Tribus	23953686da	DOC: update RFC references A few doc and code comment updates bumping RFC references to the new ones.	2017-04-28 18:58:11 +02:00
Thierry FOURNIER	6ab2bae084	REORG: spoe: move spoe_encode_varint / spoe_decode_varint from spoe to common These encoding functions do general stuff and can be used in contexts other than spoe. This patch moves the functions spoe_encode_varint and spoe_decode_varint from spoe to common. It also removes the spoe prefix. These functions will be used for encoding values in a new binary sample fetch.	2017-04-27 11:50:41 +02:00
Thierry FOURNIER	f4128a9981	BUG/MINOR: change header-declared function to static inline When we include the header proto/spoe.h in other files in the same project, the compiler claims that the symbol has multiple definitions: src/flt_spoe.o: In function `spoe_encode_varint': ~/git/haproxy/include/proto/spoe.h:45: multiple definition of `spoe_encode_varint' src/proto_http.o:~/git/haproxy/include/proto/spoe.h:45: first defined here	2017-04-27 11:50:07 +02:00
Frédéric Lécaille	b82f742b78	MINOR: server: Add 'server-template' new keyword supported in backend sections. This patch makes backend sections support 'server-template' new keyword. Such 'server-template' objects are parsed similarly to a 'server' object by parse_server() function, but its first arguments are as follows: server-template <ID prefix> <nb | range> <ip | fqdn>:<port> ... The remaining arguments are the same as for 'server' lines. With such server template declarations, servers may be allocated with IDs built from <ID prefix> and <nb | range> arguments. For instance declaring: server-template foo 1-5 google.com:80 ... or server-template foo 5 google.com:80 ... would be equivalent to declaring: server foo1 google.com:80 ... server foo2 google.com:80 ... server foo3 google.com:80 ... server foo4 google.com:80 ... server foo5 google.com:80 ...	2017-04-21 15:42:10 +02:00
Olivier Houchard	1fc0516516	MINOR: proxy: Don't close FDs if not our proxy. When running with multiple processes, if some proxies are just assigned to some processes, the other processes will just close the file descriptors for the listening sockets. However, we may still have to provide those sockets when reloading, so instead we just try hard to pretend those proxies are dead, while keeping the sockets opened. A new global option, "no-reused-socket", has been added, to restore the old behavior of closing the sockets not bound to this process.	2017-04-13 19:15:17 +02:00
Olivier Houchard	f73629d23a	MINOR: global: Add an option to get the old listening sockets. Add the "-x" flag, that takes a path to a unix socket as an argument. If used, haproxy will connect to the socket and ask to get all the listening sockets from the old process. Any failure is fatal. This is needed to get seamless reloads on linux.	2017-04-13 19:15:17 +02:00
Olivier Houchard	f886e3478d	MINOR: cli: Add a command to send listening sockets. Add a new command that will send all the listening sockets, via the stats socket, and their properties. This is a first step to work around the linux problem when reloading haproxy.	2017-04-13 19:15:17 +02:00
Willy Tarreau	7b677265fd	[RELEASE] Released version 1.8-dev1 Released version 1.8-dev1 with the following main changes : - BUG/MEDIUM: proxy: return "none" and "unknown" for unknown LB algos - BUG/MINOR: stats: make field_str() return an empty string on NULL - DOC: Spelling fixes - BUG/MEDIUM: http: Fix tunnel mode when the CONNECT method is used - BUG/MINOR: http: Keep the same behavior between 1.6 and 1.7 for tunneled txn - BUG/MINOR: filters: Protect args in macros HAS_DATA_FILTERS and IS_DATA_FILTER - BUG/MINOR: filters: Invert evaluation order of HTTP_XFER_BODY and XFER_DATA analyzers - BUG/MINOR: http: Call XFER_DATA analyzer when HTTP txn is switched in tunnel mode - BUG/MAJOR: stream: fix session abort on resource shortage - OPTIM: stream-int: don't disable polling anymore on DONT_READ - BUG/MINOR: cli: allow the backslash to be escaped on the CLI - BUG/MEDIUM: cli: fix "show stat resolvers" and "show tls-keys" - DOC: Fix map table's format - DOC: Added 51Degrees conv and fetch functions to documentation. 
- BUG/MINOR: http: don't send an extra CRLF after a Set-Cookie in a redirect - DOC: mention that req_tot is for both frontends and backends - BUG/MEDIUM: variables: some variable name can hide another ones - MINOR: lua: Allow argument for actions - BUILD: rearrange target files by build time - CLEANUP: hlua: just indent functions - MINOR: lua: give HAProxy variable access to the applets - BUG/MINOR: stats: fix be/sessions/max output in html stats - MINOR: proxy: Add fe_name/be_name fetchers next to existing fe_id/be_id - DOC: lua: Documentation about some entry missing - DOC: lua: Add documentation about variable manipulation from applet - MINOR: Do not forward the header "Expect: 100-continue" when the option http-buffer-request is set - DOC: Add undocumented argument of the trace filter - DOC: Fix some typo in SPOE documentation - MINOR: cli: Remove useless call to bi_putchk - BUG/MINOR: cli: be sure to always warn the cli applet when input buffer is full - MINOR: applet: Count number of (active) applets - MINOR: task: Rename run_queue and run_queue_cur counters - BUG/MEDIUM: stream: Save unprocessed events for a stream - BUG/MAJOR: Fix how the list of entities waiting for a buffer is handled - BUILD/MEDIUM: Fixing the build using LibreSSL - BUG/MEDIUM: lua: In some case, the return of sample-fetches is ignored (2) - SCRIPTS: git-show-backports: fix a harmless typo - SCRIPTS: git-show-backports: add -H to use the hash of the commit message - BUG/MINOR: stream-int: automatically release SI_FL_WAIT_DATA on SHUTW_NOW - CLEANUP: applet/lua: create a dedicated ->fcn entry in hlua_cli context - CLEANUP: applet/table: add an "action" entry in ->table context - CLEANUP: applet: remove the now unused appctx->private field - DOC: lua: documentation about time parser functions - DOC: lua: improve links - DOC: lua: section declared twice - MEDIUM: cli: 'show cli sockets' list the CLI sockets - BUG/MINOR: cli: "show cli sockets" wouldn't list all processes - BUG/MINOR: cli: 
"show cli sockets" would always report process 64 - CLEANUP: lua: rename one of the lua appctx union - BUG/MINOR: lua/cli: bad error message - MEDIUM: lua: use memory pool for hlua struct in applets - MINOR: lua/signals: Remove Lua part from signals. - DOC: cli: show cli sockets - MINOR: cli: automatically enable a CLI I/O handler when there's no parser - CLEANUP: memory: remove the now unused cli_parse_show_pools() function - CLEANUP: applet: group all CLI contexts together - CLEANUP: stats: move a misplaced stats context initialization - MINOR: cli: add two general purpose pointers and integers in the CLI struct - MINOR: appctx/cli: remove the cli_socket entry from the appctx union - MINOR: appctx/cli: remove the env entry from the appctx union - MINOR: appctx/cli: remove the "be" entry from the appctx union - MINOR: appctx/cli: remove the "dns" entry from the appctx union - MINOR: appctx/cli: remove the "server_state" entry from the appctx union - MINOR: appctx/cli: remove the "tlskeys" entry from the appctx union - CONTRIB: tcploop: add limits.h to fix build issue with some compilers - MINOR/DOC: lua: just precise one thing - DOC: fix small typo in fe_id (backend instead of frontend) - BUG/MINOR: Fix the sending function in Lua's cosocket - BUG/MINOR: lua: memory leak executing tasks - BUG/MINOR: lua: bad return code - BUG/MINOR: lua: memleak when Lua/cli fails - MEDIUM: lua: remove Lua struct from session, and allocate it with memory pools - CLEANUP: haproxy: statify unexported functions - MINOR: haproxy: add a registration for build options - CLEANUP: wurfl: use the build options list to report it - CLEANUP: 51d: use the build options list to report it - CLEANUP: da: use the build options list to report it - CLEANUP: namespaces: use the build options list to report it - CLEANUP: tcp: use the build options list to report transparent modes - CLEANUP: lua: use the build options list to report it - CLEANUP: regex: use the build options list to report the regex 
type - CLEANUP: ssl: use the build options list to report the SSL details - CLEANUP: compression: use the build options list to report the algos - CLEANUP: auth: use the build options list to report its support - MINOR: haproxy: add a registration for post-check functions - CLEANUP: checks: make use of the post-init registration to start checks - CLEANUP: filters: use the function registration to initialize all proxies - CLEANUP: wurfl: make use of the late init registration - CLEANUP: 51d: make use of the late init registration - CLEANUP: da: make use of the late init registration code - MINOR: haproxy: add a registration for post-deinit functions - CLEANUP: wurfl: register the deinit function via the dedicated list - CLEANUP: 51d: register the deinitialization function - CLEANUP: da: register the deinitialization function - CLEANUP: wurfl: move global settings out of the global section - CLEANUP: 51d: move global settings out of the global section - CLEANUP: da: move global settings out of the global section - MINOR: cfgparse: add two new functions to check arguments count - MINOR: cfgparse: move parsing of "ca-base" and "crt-base" to ssl_sock - MEDIUM: cfgparse: move all tune.ssl.* keywords to ssl_sock - MEDIUM: cfgparse: move maxsslconn parsing to ssl_sock - MINOR: cfgparse: move parsing of ssl-default-{bind,server}-ciphers to ssl_sock - MEDIUM: cfgparse: move ssl-dh-param-file parsing to ssl_sock - MEDIUM: compression: move the zlib-specific stuff from global.h to compression.c - BUG/MEDIUM: ssl: properly reset the reused_sess during a forced handshake - BUG/MEDIUM: ssl: avoid double free when releasing bind_confs - BUG/MINOR: stats: fix be/sessions/current out in typed stats - MINOR: tcp-rules: check that the listener exists before updating its counters - MEDIUM: spoe: don't create a dummy listener for outgoing connections - MINOR: listener: move the transport layer pointer to the bind_conf - MEDIUM: move listener->frontend to bind_conf->frontend - MEDIUM: 
ssl: remote the proxy argument from most functions - MINOR: connection: add a new prepare_bind_conf() entry to xprt_ops - MEDIUM: ssl_sock: implement ssl_sock_prepare_bind_conf() - MINOR: connection: add a new destroy_bind_conf() entry to xprt_ops - MINOR: ssl_sock: implement ssl_sock_destroy_bind_conf() - MINOR: server: move the use_ssl field out of the ifdef USE_OPENSSL - MINOR: connection: add a minimal transport layer registration system - CLEANUP: connection: remove all direct references to raw_sock and ssl_sock - CLEANUP: connection: unexport raw_sock and ssl_sock - MINOR: connection: add new prepare_srv()/destroy_srv() entries to xprt_ops - MINOR: ssl_sock: implement and use prepare_srv()/destroy_srv() - CLEANUP: ssl: move tlskeys_finalize_config() to a post_check callback - CLEANUP: ssl: move most ssl-specific global settings to ssl_sock.c - BUG/MINOR: backend: nbsrv() should return 0 if backend is disabled - BUG/MEDIUM: ssl: for a handshake when server-side SNI changes - BUG/MINOR: systemd: potential zombie processes - DOC: Add timings events schemas - BUILD: lua: build failed on FreeBSD. - MINOR: samples: add xx-hash functions - MEDIUM: regex: pcre2 support - BUG/MINOR: option prefer-last-server must be ignored in some case - MINOR: stats: Support "select all" for backend actions - BUG/MINOR: sample-fetches/stick-tables: bad type for the sample fetches sc_get_gpt0 - BUG/MAJOR: channel: Fix the definition order of channel analyzers - BUG/MINOR: http: report real parser state in error captures - BUILD: scripts: automatically update the branch in version.h when releasing - MINOR: tools: add a generic hexdump function for debugging - BUG/MAJOR: http: fix risk of getting invalid reports of bad requests - MINOR: http: custom status reason. 
- MINOR: connection: add sample fetch "fc_rcvd_proxy" - BUG/MINOR: config: emit a warning if http-reuse is enabled with incompatible options - BUG/MINOR: tools: fix off-by-one in port size check - BUG/MEDIUM: server: consider AF_UNSPEC as a valid address family - MEDIUM: server: split the address and the port into two different fields - MINOR: tools: make str2sa_range() return the port in a separate argument - MINOR: server: take the destination port from the port field, not the addr - MEDIUM: server: disable protocol validations when the server doesn't resolve - BUG/MEDIUM: tools: do not force an unresolved address to AF_INET:0.0.0.0 - BUG/MINOR: ssl: EVP_PKEY must be freed after X509_get_pubkey usage - BUG/MINOR: ssl: assert on SSL_set_shutdown with BoringSSL - MINOR: Use "500 Internal Server Error" for 500 error/status code message. - MINOR: proto_http.c 502 error txt typo. - DOC: add deprecation notice to "block" - MINOR: compression: fix -vv output without zlib/slz - BUG/MINOR: Reset errno variable before calling strtol(3) - MINOR: ssl: don't show prefer-server-ciphers output - OPTIM/MINOR: config: Optimize fullconn automatic computation loading configuration - BUG/MINOR: stream: Fix how backend-specific analyzers are set on a stream - MAJOR: ssl: bind configuration per certificat - MINOR: ssl: add curve suite for ECDHE negotiation - MINOR: checks: Add agent-addr config directive - MINOR: cli: Add possiblity to change agent config via CLI/socket - MINOR: doc: Add docs for agent-addr configuration variable - MINOR: doc: Add docs for agent-addr and agent-send CLI commands - BUILD: ssl: fix to build (again) with boringssl - BUILD: ssl: fix build on OpenSSL 1.0.0 - BUILD: ssl: silence a warning reported for ERR_remove_state() - BUILD: ssl: eliminate warning with OpenSSL 1.1.0 regarding RAND_pseudo_bytes() - BUILD: ssl: kill a build warning introduced by BoringSSL compatibility - BUG/MEDIUM: tcp: don't poll for write when connect() succeeds - BUG/MINOR: unix: fix 
connect's polling in case no data are scheduled - MINOR: server: extend the flags to 32 bits - BUG/MINOR: lua: Map.end are not reliable because "end" is a reserved keyword - MINOR: dns: give ability to dns_init_resolvers() to close a socket when requested - BUG/MAJOR: dns: restart sockets after fork() - MINOR: chunks: implement a simple dynamic allocator for trash buffers - BUG/MEDIUM: http: prevent redirect from overwriting a buffer - BUG/MEDIUM: filters: Do not truncate HTTP response when body length is undefined - BUG/MEDIUM: http: Prevent replace-header from overwriting a buffer - BUG/MINOR: http: Return an error when a replace-header rule failed on the response - BUG/MINOR: sendmail: The return of vsnprintf is not cleanly tested - BUG/MAJOR: ssl: fix a regression in ssl_sock_shutw() - BUG/MAJOR: lua segmentation fault when the request is like 'GET ?arg=val HTTP/1.1' - BUG/MEDIUM: config: reject anything but "if" or "unless" after a use-backend rule - MINOR: http: don't close when redirect location doesn't start with "/" - MEDIUM: boringssl: support native multi-cert selection without bundling - BUG/MEDIUM: ssl: fix verify/ca-file per certificate - BUG/MEDIUM: ssl: switchctx should not return SSL_TLSEXT_ERR_ALERT_WARNING - MINOR: ssl: removes SSL_CTX_set_ssl_version call and cleanup CTX creation. - BUILD: ssl: fix build with -DOPENSSL_NO_DH - MEDIUM: ssl: add new sample-fetch which captures the cipherlist - MEDIUM: ssl: remove ssl-options from crt-list - BUG/MEDIUM: ssl: in bind line, ssl-options after 'crt' are ignored. 
- BUG/MINOR: ssl: fix cipherlist captures with sustainable SSL calls - MINOR: ssl: improved cipherlist captures - BUG/MINOR: spoe: Fix soft stop handler using a specific id for spoe filters - BUG/MINOR: spoe: Fix parsing of arguments in spoe-message section - MAJOR: spoe: Add support of pipelined and asynchronous exchanges with agents - MINOR: spoe: Add support for pipelining/async capabilities in the SPOA example - MINOR: spoe: Remove SPOE details from the appctx structure - MINOR: spoe: Add status code in error variable instead of hardcoded value - MINOR: spoe: Send a log message when an error occurred during event processing - MINOR: spoe: Check the scope of sample fetches used in SPOE messages - MEDIUM: spoe: Be sure to wakeup the good entity waiting for a buffer - MINOR: spoe: Use the min of all known max_frame_size to encode messages - MAJOR: spoe: Add support of payload fragmentation in NOTIFY frames - MINOR: spoe: Add support for fragmentation capability in the SPOA example - MAJOR: spoe: refactor the filter to clean up the code - MINOR: spoe: Handle NOTIFY frames cancellation using ABORT bit in ACK frames - REORG: spoe: Move struct and enum definitions in dedicated header file - REORG: spoe: Move low-level encoding/decoding functions in dedicated header file - MINOR: spoe: Improve implementation of the payload fragmentation - MINOR: spoe: Add support of negation for options in SPOE configuration file - MINOR: spoe: Add "pipelining" and "async" options in spoe-agent section - MINOR: spoe: Rely on alertif_too_many_arg during configuration parsing - MINOR: spoe: Add "send-frag-payload" option in spoe-agent section - MINOR: spoe: Add "max-frame-size" statement in spoe-agent section - DOC: spoe: Update SPOE documentation to reflect recent changes - MINOR: config: warn when some HTTP rules are used in a TCP proxy - BUG/MEDIUM: ssl: Clear OpenSSL error stack after trying to parse OCSP file - BUG/MEDIUM: cli: Prevent double free in CLI ACL lookup - BUG/MINOR: Fix 
"get map <map> <value>" CLI command - MINOR: Add nbsrv sample converter - CLEANUP: Replace repeated code to count usable servers with be_usable_srv() - MINOR: Add hostname sample fetch - CLEANUP: Remove comment that's no longer valid - MEDIUM: http_error_message: txn->status / http_get_status_idx. - MINOR: http-request tarpit deny_status. - CLEANUP: http: make http_server_error() not set the status anymore - MEDIUM: stats: Add JSON output option to show (info\|stat) - MEDIUM: stats: Add show json schema - BUG/MAJOR: connection: update CO_FL_CONNECTED before calling the data layer - MINOR: server: Add dynamic session cookies. - MINOR: cli: Let configure the dynamic cookies from the cli. - BUG/MINOR: checks: attempt clean shutw for SSL check - CONTRIB: tcploop: make it build on FreeBSD - CONTRIB: tcploop: fix time format to silence build warnings - CONTRIB: tcploop: report action 'K' (kill) in usage message - CONTRIB: tcploop: fix connect's address length - CONTRIB: tcploop: use the trash instead of NULL for recv() - BUG/MEDIUM: listener: do not try to rebind another process' socket - BUG/MEDIUM server: Fix crash when dynamic is defined, but not key is provided. - CLEANUP: config: Typo in comment. - BUG/MEDIUM: filters: Fix channels synchronization in flt_end_analyze - TESTS: add a test configuration to stress handshake combinations - BUG/MAJOR: stream-int: do not depend on connection flags to detect connection - BUG/MEDIUM: connection: ensure to always report the end of handshakes - MEDIUM: connection: don't test for CO_FL_WAKE_DATA - CLEANUP: connection: completely remove CO_FL_WAKE_DATA - BUG: payload: fix payload not retrieving arbitrary lengths - BUILD: ssl: simplify SSL_CTX_set_ecdh_auto compatibility - BUILD: ssl: fix OPENSSL_NO_SSL_TRACE for boringssl and libressl - BUG/MAJOR: http: fix typo in http_apply_redirect_rule - MINOR: doc: 2.4. Examples should be 2.5. 
Examples - BUG/MEDIUM: stream: fix client-fin/server-fin handling - MINOR: fd: add a new flag HAP_POLL_F_RDHUP to struct poller - BUG/MINOR: raw_sock: always perfom the last recv if RDHUP is not available - OPTIM: poll: enable support for POLLRDHUP - MINOR: kqueue: exclusively rely on the kqueue returned status - MEDIUM: kqueue: take care of EV_EOF to improve polling status accuracy - MEDIUM: kqueue: only set FD_POLL_IN when there are pending data - DOC/MINOR: Fix typos in proxy protocol doc - DOC: Protocol doc: add checksum, TLV type ranges - DOC: Protocol doc: add SSL TLVs, rename CHECKSUM - DOC: Protocol doc: add noop TLV - MEDIUM: global: add a 'hard-stop-after' option to cap the soft-stop time - MINOR: dns: improve DNS response parsing to use as many available records as possible - BUG/MINOR: cfgparse: loop in tracked servers lists not detected by check_config_validity(). - MINOR: server: irrelevant error message with 'default-server' config file keyword. - MINOR: server: Make 'default-server' support 'backup' keyword. - MINOR: server: Make 'default-server' support 'check-send-proxy' keyword. - CLEANUP: server: code alignement. - MINOR: server: Make 'default-server' support 'non-stick' keyword. - MINOR: server: Make 'default-server' support 'send-proxy' and 'send-proxy-v2 keywords. - MINOR: server: Make 'default-server' support 'check-ssl' keyword. - MINOR: server: Make 'default-server' support 'force-sslv3' and 'force-tlsv1[0-2]' keywords. - CLEANUP: server: code alignement. - MINOR: server: Make 'default-server' support 'no-ssl' and 'no-tlsv' keywords. - MINOR: server: Make 'default-server' support 'ssl' keyword. - MINOR: server: Make 'default-server' support 'send-proxy-v2-ssl' keywords. - CLEANUP: server: code alignement. - MINOR: server: Make 'default-server' support 'verify' keyword. - MINOR: server: Make 'default-server' support 'verifyhost' setting. - MINOR: server: Make 'default-server' support 'check' keyword. 
- MINOR: server: Make 'default-server' support 'track' setting. - MINOR: server: Make 'default-server' support 'ca-file', 'crl-file' and 'crt' settings. - MINOR: server: Make 'default-server' support 'redir' keyword. - MINOR: server: Make 'default-server' support 'observe' keyword. - MINOR: server: Make 'default-server' support 'cookie' keyword. - MINOR: server: Make 'default-server' support 'ciphers' keyword. - MINOR: server: Make 'default-server' support 'tcp-ut' keyword. - MINOR: server: Make 'default-server' support 'namespace' keyword. - MINOR: server: Make 'default-server' support 'source' keyword. - MINOR: server: Make 'default-server' support 'sni' keyword. - MINOR: server: Make 'default-server' support 'addr' keyword. - MINOR: server: Make 'default-server' support 'disabled' keyword. - MINOR: server: Add 'no-agent-check' server keyword. - DOC: server: Add docs for "server" and "default-server" new "no-*" and other settings. - MINOR: doc: fix use-server example (imap vs mail) - BUG/MEDIUM: tcp: don't require privileges to bind to device - BUILD: make the release script use shortlog for the final changelog - BUILD: scripts: fix typo in announce-release error message - CLEANUP: time: curr_sec_ms doesn't need to be exported - BUG/MEDIUM: server: Wrong server default CRT filenames initialization. - BUG/MEDIUM: peers: fix buffer overflow control in intdecode. 
- BUG/MEDIUM: buffers: Fix how input/output data are injected into buffers - BUG/MINOR: http: Fix conditions to clean up a txn and to handle the next request - CLEANUP: http: Remove channel_congested function - CLEANUP: buffers: Remove buffer_bounce_realign function - CLEANUP: buffers: Remove buffer_contig_area and buffer_work_area functions - MINOR: http: remove useless check on HTTP_MSGF_XFER_LEN for the request - MINOR: http: Add debug messages when HTTP body analyzers are called - BUG/MEDIUM: http: Fix blocked HTTP/1.0 responses when compression is enabled - BUG/MINOR: filters: Don't force the stream's wakeup when we wait in flt_end_analyze - DOC: fix parenthesis and add missing "Example" tags - DOC: update the contributing file - DOC: log-format/tcplog/httplog update - MINOR: config parsing: add warning when log-format/tcplog/httplog is overriden in "defaults" sections	2017-04-03 09:27:49 +02:00
Christopher Faulet	a545569f1e	CLEANUP: buffers: Remove buffer_contig_area and buffer_work_area functions Not used anymore since last commit.	2017-03-31 14:38:30 +02:00
Christopher Faulet	aaf4a325ca	CLEANUP: buffers: Remove buffer_bounce_realign function Not used anymore since last commit.	2017-03-31 14:38:22 +02:00
Christopher Faulet	533182f1c8	CLEANUP: http: Remove channel_congested function Not used anymore since last commit.	2017-03-31 14:38:08 +02:00
Christopher Faulet	637f8f2ca7	BUG/MEDIUM: buffers: Fix how input/output data are injected into buffers
The function buffer_contig_space is buggy and could lead to pernicious bugs (never hit until now, AFAIK). This function should return the number of bytes that can be written into the buffer at once (without wrapping).
First, this function is used both to inject input data (bi_putblk) and to inject output data (bo_putblk and bo_inject), but it has no context, so it cannot decide where the contiguous space should be placed. For input data, it should be after bi_end(buf) (ie, buf->p + buf->i modulo wrapping calculation). For output data, it should be after bo_end(buf) (ie, buf->p) and input data are assumed to not exist (else there is no space at all).
Then, considering we need to inject input data, this function does not always return the right value. And when we need to inject output data, we must be sure to have no input data at all (buf->i == 0), else the result can also be wrong (but this is the caller's responsibility, so everything should be fine here).
The buffer can be in 3 different states:
  1) no wrapping
                 <---- o ----><----- i ----->
    +------------+------------+-------------+------------+
    |            |oooooooooooo|iiiiiiiiiiiii|xxxxxxxxxxxx|
    +------------+------------+-------------+------------+
                              ^             <contig_space>
                              p             ^            ^
                                            l            r
  2) input wrapping
    ...--->            <---- o ----><-------- i -------...
    +-----+------------+------------+--------------------+
    |iiiii|xxxxxxxxxxxx|oooooooooooo|iiiiiiiiiiiiiiiiiiii|
    +-----+------------+------------+--------------------+
          <contig_space>            ^
          ^            ^            p
          l            r
  3) output wrapping
    ...------ o ------><----- i ----->            <----...
    +------------------+-------------+------------+------+
    |oooooooooooooooooo|iiiiiiiiiiiii|xxxxxxxxxxxx|oooooo|
    +------------------+-------------+------------+------+
                       ^             <contig_space>
                       p             ^            ^
                                     l            r
buffer_contig_space returns (r - l). Cases 1 and 3 are correctly handled. But for the second case, r is wrong. It points to the buffer's end (buf->data + buf->size). It should be bo_ptr(buf) (ie, buf->p - buf->o).
To fix the bug, the function has been split. Now, bi_contig_space and bo_contig_space should be used to know the contiguous space available to insert, respectively, input data and output data. For bo_contig_space, input data are assumed to not exist. And the right version is used, depending on what we want to do.
In addition, to clarify the buffer's API, buffer_realign does not return a value anymore. So it has the same API as buffer_slow_realign.
This patch can be backported in 1.7, 1.6 and 1.5.	2017-03-31 14:36:04 +02:00
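The split described in the commit above can be sketched in a few lines of C. This is only an illustration under simplifying assumptions: the `struct ring` type, the `ring_*` function names and the use of offsets rather than pointers are hypothetical and are not haproxy's actual buffer API. Output data is assumed to occupy `[p - o, p)` and input data `[p, p + i)`, both modulo the buffer size.

```c
#include <stddef.h>

/* Simplified ring buffer using offsets (hypothetical, not haproxy's struct).
 * Output data occupies [p - o, p) and input data [p, p + i), modulo size. */
struct ring {
    size_t size; /* total capacity in bytes */
    size_t p;    /* offset where input data starts, 0 <= p < size */
    size_t i;    /* number of pending input bytes */
    size_t o;    /* number of pending output bytes */
};

static size_t min_sz(size_t a, size_t b)
{
    return a < b ? a : b;
}

/* Contiguous room available to append input data after the end of input.
 * The free area starts at the end of input and may wrap, so the result is
 * bounded both by the free byte count and by the distance to the end of
 * the storage area. */
size_t ring_bi_contig_space(const struct ring *r)
{
    size_t free_bytes = r->size - r->i - r->o;
    size_t in_end = (r->p + r->i) % r->size;

    return min_sz(free_bytes, r->size - in_end);
}

/* Contiguous room available to append output data at p.
 * The caller must guarantee that there is no input data (i == 0). */
size_t ring_bo_contig_space(const struct ring *r)
{
    size_t free_bytes = r->size - r->o;

    return min_sz(free_bytes, r->size - r->p);
}
```

With a 16-byte buffer in the "input wrapping" state (`p = 8`, `i = 10`, `o = 3`), the input ends at offset 2 and only 3 bytes are free before the output starts at offset 5; measuring up to the end of the storage area instead would report 14 bytes, which is the kind of overestimate the commit describes.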
Frédéric Lécaille	6e0843c0e0	MINOR: server: Add 'no-agent-check' server keyword. This patch adds the 'no-agent-check' setting, supported both by 'default-server' and 'server' directives, to disable an agent check for a specific server which would have 'agent-check' set as default value (inherited from the 'default-server' 'agent-check' setting), or, on 'default-server' lines, to disable the 'agent-check' setting as default value for any further 'server' declarations. For instance, provided this configuration:
    default-server agent-check
    server srv1
    server srv2 no-agent-check
    server srv3
    default-server no-agent-check
    server srv4
srv1 and srv3 would have an agent check enabled, contrary to srv2 and srv4. We no longer allocate anything when parsing the 'default-server' 'agent-check' setting.	2017-03-27 14:37:01 +02:00
Frédéric Lécaille	9a146de934	MINOR: server: Make 'default-server' support 'sni' keyword. This patch makes 'default-server' directives support 'sni' settings. A field 'sni_expr' has been added to 'struct server' to temporarily store SNI expressions as strings during both 'default-server' and 'server' line parsing. So, to duplicate SNI expressions from the 'default-server' 'sni' setting for new 'server' instances, we only have to "strdup" these strings, as is done for most of the 'server' settings. Then, sample expressions are computed by calling sample_parse_expr() (only for 'server' instances). A new function, display_parser_err(), has been added to produce the same error output as before in case of any error during 'sni' settings parsing. Should not break anything.	2017-03-27 14:37:01 +02:00
Frédéric Lécaille	65aa356c0b	MINOR: server: Make 'default-server' support 'check' keyword. Before this patch, the 'check' setting was only supported by 'server' directives. This patch also makes 'default-server' directives support this setting. A new 'no-check' keyword parser has been implemented to disable this setting both in 'default-server' and 'server' directives. Should not break anything.	2017-03-27 14:37:01 +02:00
Cyril Bonté	203ec5a2b5	MEDIUM: global: add a 'hard-stop-after' option to cap the soft-stop time When SIGUSR1 is received, haproxy enters soft-stop and quits when no connection remains. It can happen that the instance remains alive for a long time, depending on timeouts and traffic. This option ensures that soft-stop won't run for too long. Example:
    global
        hard-stop-after 30s  # Once in soft-stop, the instance will remain
                             # alive for at most 30 seconds.	2017-03-23 23:03:57 +01:00
Willy Tarreau	5a767693b5	MINOR: fd: add a new flag HAP_POLL_F_RDHUP to struct poller We'll need to differentiate between pollers which can report hangup at the same time as read (POLL_RDHUP) from the other ones, because only these ones may benefit from the fd_done_recv() optimization. Epoll has had support for EPOLLRDHUP since Linux 2.6.17 and has always been used this way in haproxy, so now we only set the flag once we've observed it once in a response. It means that some initial requests may try to perform a second recv() call, but after the first closed connection it will be enough to know that the second call is not needed anymore. Later we may extend these flags to designate event-triggered pollers.	2017-03-21 16:30:35 +01:00
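To illustrate what "report hangup at the same time as read" means for epoll, here is a minimal Linux-only sketch. It is not haproxy code: the helper name `peer_half_closed()` is hypothetical, and creating a throwaway epoll instance per call is done purely to keep the example short.

```c
#include <sys/epoll.h>
#include <sys/socket.h>
#include <unistd.h>

/* Returns 1 if the peer has shut down its writing side (EPOLLRDHUP is
 * reported), 0 if not, and -1 on error. Never blocks (timeout of 0). */
int peer_half_closed(int fd)
{
    struct epoll_event ev = { .events = EPOLLIN | EPOLLRDHUP };
    struct epoll_event out;
    int result = -1;
    int ep = epoll_create1(0);

    if (ep < 0)
        return -1;

    ev.data.fd = fd;
    if (epoll_ctl(ep, EPOLL_CTL_ADD, fd, &ev) == 0) {
        int n = epoll_wait(ep, &out, 1, 0);

        if (n >= 0)
            result = (n > 0 && (out.events & EPOLLRDHUP)) ? 1 : 0;
    }
    close(ep);
    return result;
}
```

When the poller reports the hangup together with readability, a caller that has just drained the socket already knows that no further data will arrive, so it does not need an extra recv() call just to observe the end of the stream.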
Hongbo Long	e39683c4d4	BUG/MEDIUM: stream: fix client-fin/server-fin handling
A half-closed TCP connection can cause 100% CPU on expiration. First reproduced with this haproxy configuration:
    global
        tune.bufsize 10485760
    defaults
        timeout server-fin 90s
        timeout client-fin 90s
    backend node2
        mode tcp
        timeout server 900s
        timeout connect 10s
        server def 127.0.0.1:3333
    frontend fe_api
        mode tcp
        timeout client 900s
        bind :1990
        use_backend node2
I.e. timeout server-fin is shorter than timeout server. The backend server sends data, which is left in haproxy's buffer; the backend server then sends a FIN packet, which haproxy receives. At this time the session information is as follows:
    0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2 srv=def ts=08 age=1s calls=3 rq[f=848000h,i=0,an=00h,rx=14m58s,wx=,ax=] rp[f=8004c020h,i=0,an=00h,rx=,wx=14m58s,ax=] s0=[7,0h,fd=6,ex=] s1=[7,18h,fd=7,ex=] exp=14m58s
rp has the CF_SHUTR flag set. Next, the client sends a FIN packet, and the session information is as follows:
    0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2 srv=def ts=08 age=38s calls=4 rq[f=84a020h,i=0,an=00h,rx=,wx=,ax=] rp[f=8004c020h,i=0,an=00h,rx=1m11s,wx=14m21s,ax=] s0=[7,0h,fd=6,ex=] s1=[9,10h,fd=7,ex=] exp=1m11s
After waiting 90s, the session information is as follows:
    0x2373470: proto=tcpv4 src=127.0.0.1:39513 fe=fe_api be=node2 srv=def ts=04 age=4m11s calls=718074391 rq[f=84a020h,i=0,an=00h,rx=,wx=,ax=] rp[f=8004c020h,i=0,an=00h,rx=?,wx=10m49s,ax=] s0=[7,0h,fd=6,ex=] s1=[9,10h,fd=7,ex=] exp=? run(nice=0)
CPU information:
    6899 root 20 0 112224 21408 4260 R 100.0 0.7 3:04.96 haproxy
The large buffer ensures that data remains in the haproxy buffer while haproxy receives the FIN packet and sets the CF_SHUTR flag. Once CF_SHUTR is set, the following code does not clear the timeout, causing 100% CPU:
    stream.c:process_stream:
        if (unlikely((res->flags & (CF_SHUTR|CF_READ_TIMEOUT)) == CF_READ_TIMEOUT)) {
            if (si_b->flags & SI_FL_NOHALF)
                si_b->flags |= SI_FL_NOLINGER;
            si_shutr(si_b);
        }
Once the read side has been closed, setting a read timeout does not make sense. Yet the read timeout is set with or without CF_SHUTR:
        if (tick_isset(s->be->timeout.serverfin)) {
            res->rto = s->be->timeout.serverfin;
            res->rex = tick_add(now_ms, res->rto);
        }
After discussion on the mailing list, setting half-closed timeouts the hard way here doesn't make sense. They should be set only at the moment the shutdown() is performed. It will also solve a special case which was already reported of some half-closed timeouts not working when the shutw() is performed directly at the stream-interface layer (no analyser involved). Since the stream interface layer cannot know the timeout values, we'll have to store them directly in the stream interface so that they are used upon shutw(). This patch does this, fixing the problem.
An easier reproducer to validate the fix is to keep the huge buffer and shorten all timeouts, then call it under tcploop server and client, and wait 3 seconds to see haproxy run at 100% CPU:
    global
        tune.bufsize 10485760
    listen px
        bind :1990
        timeout client 90s
        timeout server 90s
        timeout connect 1s
        timeout server-fin 3s
        timeout client-fin 3s
        server def 127.0.0.1:3333
    $ tcploop 3333 L W N20 A P100 F P10000 &
    $ tcploop 127.0.0.1:1990 C S10000000 F	2017-03-21 15:04:43 +01:00
Emmanuel Hocdet	9490cedb4e	BUILD: ssl: fix OPENSSL_NO_SSL_TRACE for boringssl and libressl The "sample-fetch which captures the cipherlist" patch introduces a #define to deal with trace functions only available in openssl > 1.0.2. Add this #define to the libressl and boringssl environments. Thanks to Piotr Kubaj for postponing and testing with libressl.	2017-03-20 12:01:44 +01:00
Emmanuel Hocdet	a52bb15cc7	BUILD: ssl: simplify SSL_CTX_set_ecdh_auto compatibility SSL_CTX_set_ecdh_auto is declared (when present) with #define. A simple #ifdef avoids listing all cases of ssllibs. It's a placebo in new ssllibs. It's ok with openssl 1.0.1, 1.0.2, 1.1.0, libressl and boringssl. Thanks to Piotr Kubaj for postponing and testing with libressl.	2017-03-20 12:01:34 +01:00
Willy Tarreau	de40d798de	CLEANUP: connection: completely remove CO_FL_WAKE_DATA Since it's only set and never tested anymore, let's remove it.	2017-03-19 12:18:27 +01:00
Willy Tarreau	3c0cc49d30	BUG/MEDIUM: connection: ensure to always report the end of handshakes
Despite the previous commit working fine on all tests, it's still not sufficient to completely address the problem. If the connection handler is called with an event validating an L4 connection but some handshakes remain (eg: accept-proxy), it will still wake the function up, which will not report the activity, and will not detect a change once the handshake is complete so it will not notify the ->wake() handler.
In fact the only reason why the ->wake() handler is still called here is because after dropping the last handshake, we try to call ->recv() and ->send() in turn and change the flags in order to detect a data activity. But if for any reason the data layer is not interested in reading nor writing, it will not get these events.
A cleaner way to address this is to call the ->wake() handler only on definitive status changes (shut, error), on real data activity, and on a complete connection setup, measured as CONNECTED with no more handshake pending.
It could be argued that the handshake flags have to be made part of the condition to set CO_FL_CONNECTED but that would currently break a part of the health checks. Also a handshake could appear at any moment even after a connection is established so we'd lose the ability to detect a second end of handshake.
For now the situation around CO_FL_CONNECTED is not clean :
  - session_accept() only sets CO_FL_CONNECTED if there's no pending handshake ;
  - conn_fd_handler() will set it once L4 and L6 are complete, which will do what session_accept() above refrained from doing even if an accept_proxy handshake is still pending ;
  - ssl_sock_infocbk() and ssl_sock_handshake() consider that a handshake performed with CO_FL_CONNECTED set is a renegotiation ; => they should instead filter on CO_FL_WAIT_L6_CONN
  - all ssl_fc_* sample fetch functions wait for CO_FL_CONNECTED before accepting to fetch information => they should also get rid of any pending handshake
  - smp_fetch_fc_rcvd_proxy() uses !CO_FL_CONNECTED instead of CO_FL_ACCEPT_PROXY
  - health checks (standard and tcp-checks) don't check for HANDSHAKE and may report a successful check based on CO_FL_CONNECTED while not yet done (eg: send buffer full on send_proxy).
This patch aims at solving some of these side effects in a backportable way before this is reworked in depth :
  - we need to call ->wake() to report connection success, measure connection time, notify that the data layer is ready and update the data layer after activity ; this has to be done either if we switch from pending {L4,L6}_CONN to nothing with no handshakes left, or if we notice some handshakes were pending and are now done.
  - we document that CO_FL_CONNECTED exactly means "L4 connection setup confirmed at least once, L6 connection setup confirmed at least once or not necessary, all this regardless of any possibly remaining handshakes or future L6 negotiations".
This patch also renames CO_FL_CONN_STATUS to the more explicit CO_FL_NOTIFY_DATA, and works around the previous flags trick consisting in setting an impossible combination of flags to notify the data layer, by simply clearing the current flags.
This fix should be backported to 1.7, 1.6 and 1.5.	2017-03-19 12:06:18 +01:00
Christopher Faulet	e6006245de	BUG/MEDIUM: filters: Fix channels synchronization in flt_end_analyze When a filter is used, there are 2 channel analyzers that surround all the others, flt_start_analyze and flt_end_analyze. This is the right place to acquire and release resources used by filters, when needed. In addition, the last one is used to synchronize both channels, especially for HTTP streams. We must wait until the analysis is finished on both channels for an HTTP transaction before restarting it for the next one. But this part was buggy, leading to unexpected behaviours. First, depending on which channel ends first, the request or the response can be switched to a "forward forever" mode. Then, the HTTP transaction can be cleaned up too early, while processing is still in progress on a channel. To fix the bug, the flag CF_FLT_ANALYZE has been added. It is set on channels in flt_start_analyze and is kept if at least one filter is still analyzing the channel. So, we can trigger the channel synchronization if this flag was removed on both channels. In addition, the flag TX_WAIT_CLEANUP has been added on the transaction to know if the transaction must be cleaned up or not during channel synchronization. This way, we are sure to reset everything once all the processing is finished. This patch should be backported in 1.7.	2017-03-15 19:09:06 +01:00
Olivier Houchard	614f8d7d56	MINOR: cli: Let configure the dynamic cookies from the cli. This adds 3 new commands to the cli :
  - enable dynamic-cookie backend <backend>, which enables dynamic cookies for a specified backend
  - disable dynamic-cookie backend <backend>, which disables dynamic cookies for a specified backend
  - set dynamic-cookie-key backend <backend>, which lets one change the dynamic cookie secret key for a specified backend.	2017-03-15 11:38:29 +01:00
Olivier Houchard	4e694049fa	MINOR: server: Add dynamic session cookies. This adds a new "dynamic" keyword for the cookie option. If set, a cookie will be generated for each server (assuming one isn't already provided on the "server" line), from the IP of the server, the TCP port, and a secret key provided. To provide the secret key, a new keyword has been added, "dynamic-cookie-key", for backends. Example :
    backend bk_web
        balance roundrobin
        dynamic-cookie-key "bla"
        cookie WEBSRV insert dynamic
        server s1 127.0.0.1:80 check
        server s2 192.168.56.1:80 check
This is a first step to be able to dynamically add and remove servers, without modifying the configuration file, and still have all the load balancers redirect the traffic to the right server. Provide a way to generate session cookies, based on the IP address of the server, the TCP port, and a secret key provided.	2017-03-15 11:37:30 +01:00
Simon Horman	6f6bb380ef	MEDIUM: stats: Add show json schema This may be used to output the JSON schema which describes the output of show info json and show stats json. The JSON output is without any extra whitespace in order to reduce the volume of output. For human consumption passing the output through a pretty printer may be helpful. e.g.:
    $ echo "show schema json" | socat /var/run/haproxy.stat stdio | \
      python -m json.tool
The implementation does not generate the schema. Some consideration could be given to integrating the output of the schema with the output of typed and json info and stats. In particular the types (u32, s64, etc...) and tags. A sample verification of show info json and show stats json using the schema is as follows. It uses the jsonschema python module:
    cat > jschema.py << __EOF__
    import json
    from jsonschema import validate
    from jsonschema.validators import Draft3Validator
    with open('schema.txt', 'r') as f:
        schema = json.load(f)
    Draft3Validator.check_schema(schema)
    with open('instance.txt', 'r') as f:
        instance = json.load(f)
    validate(instance, schema, Draft3Validator)
    __EOF__
    $ echo "show schema json" | socat /var/run/haproxy.stat stdio > schema.txt
    $ echo "show info json" | socat /var/run/haproxy.stat stdio > instance.txt
    $ python ./jschema.py
    $ echo "show stats json" | socat /var/run/haproxy.stat stdio > instance.txt
    $ python ./jschema.py
Signed-off-by: Simon Horman <horms@verge.net.au>	2017-03-14 11:14:03 +01:00
Simon Horman	05ee213f8b	MEDIUM: stats: Add JSON output option to show (info|stat) Add a json parameter to show (info|stat) which will output information in JSON format. A follow-up patch will add a JSON schema which describes the format of the JSON output of these commands. The JSON output is without any extra whitespace in order to reduce the volume of output. For human consumption passing the output through a pretty printer may be helpful. e.g.:
    $ echo "show info json" | socat /var/run/haproxy.stat stdio | \
      python -m json.tool
STAT_STARTED has been added in order to track if show output has begun or not. This is used in order to allow the JSON output routines to only insert a "," between elements when needed. I would value any feedback on how this might be done better. Signed-off-by: Simon Horman <horms@verge.net.au>	2017-03-14 11:14:03 +01:00
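The "only insert a comma once output has begun" idea from the commit above can be sketched as follows. This is a standalone illustration, not the haproxy implementation: `json_emit_array()` and its local `started` flag are hypothetical stand-ins for the role the commit assigns to STAT_STARTED.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Writes the items as a compact JSON array into dst. Each item must already
 * be a valid JSON value. Returns the number of characters written (excluding
 * the trailing NUL), or -1 if dst is too small. */
int json_emit_array(char *dst, size_t len, const char *const *items, size_t n)
{
    bool started = false; /* becomes true once the first element is out */
    size_t off;
    int w;

    w = snprintf(dst, len, "[");
    if (w < 0 || (size_t)w >= len)
        return -1;
    off = (size_t)w;

    for (size_t k = 0; k < n; k++) {
        /* The separator is only emitted between elements, never first. */
        w = snprintf(dst + off, len - off, "%s%s",
                     started ? "," : "", items[k]);
        if (w < 0 || (size_t)w >= len - off)
            return -1;
        off += (size_t)w;
        started = true;
    }

    w = snprintf(dst + off, len - off, "]");
    if (w < 0 || (size_t)w >= len - off)
        return -1;
    off += (size_t)w;
    return (int)off;
}
```

Keeping the state in a flag rather than testing the loop index matters when the output is produced across several calls, as with a CLI dump that can be interrupted and resumed: the flag can be stored alongside the dump context.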
Jarno Huuskonen	9e6906b9ec	MEDIUM: http_error_message: txn->status / http_get_status_idx. This commit removes second argument(msgnum) from http_error_message and changes http_error_message to use s->txn->status/http_get_status_idx for mapping status code from 200..504 to HTTP_ERR_200..HTTP_ERR_504(enum). This is needed for http-request tarpit deny_status commit.	2017-03-14 10:41:41 +01:00
Nenad Merdanovic	b7e7c4720a	MINOR: Add nbsrv sample converter This is like the nbsrv() sample fetch function except that it works as a converter so it can count the number of available servers of a backend name retrieved using a sample fetch or an environment variable. Signed-off-by: Nenad Merdanovic <nmerdan@haproxy.com>	2017-03-13 18:26:05 +01:00
Christopher Faulet	cecd8527b3	MINOR: spoe: Add "send-frag-payload" option in spoe-agent section This option can be used to enable or to disable (prefixing the option line with the "no" keyword) the sending of fragmented payload to agents. By default, this option is enabled.	2017-03-09 15:32:55 +01:00
Christopher Faulet	305c6079d4	MINOR: spoe: Add "pipelining" and "async" options in spoe-agent section These options can be used to enable or to disable (prefixing the option line with the "no" keyword), respectively, pipelined and asynchronous exchanges between HAProxy and agents. By default, pipelining and async options are enabled.	2017-03-09 15:32:55 +01:00
Christopher Faulet	f032c3ec09	MINOR: spoe: Improve implementation of the payload fragmentation Now, when a payload is fragmented, the first frame must define the frame type and the following ones must use the special type SPOE_FRM_T_UNSET. This way, it is easy to know if a fragment is the first one or not. Of course, all frames must still share the same stream-id and frame-id. Update SPOA example accordingly.	2017-03-09 15:32:55 +01:00
Christopher Faulet	4ff3e574ac	REORG: spoe: Move low-level encoding/decoding functions in dedicated header file So, it will be easier to anyone to develop external services using these functions. SPOA example has been updated accordingly.	2017-03-09 15:32:55 +01:00
Christopher Faulet	1f40b91a83	REORG: spoe: Move struct and enum definitions in dedicated header file SPOA example has been updated accordingly.	2017-03-09 15:32:55 +01:00
Christopher Faulet	8ef75251e3	MAJOR: spoe: refactor the filter to clean up the code The SPOE code is now pretty big and it was a good time to clean it up. It is not perfect, some parts remain a bit ugly. But it is far better now.	2017-03-09 15:32:55 +01:00
Christopher Faulet	42bfa46234	MINOR: spoe: Remove SPOE details from the appctx structure Now, as for peers, we use an opaque pointer to store information related to the SPOE filter in appctx structure. This information is now stored in a dedicated structure (spoe_appctx) and allocated, using a pool, when the applet is created. This removes the dependency between applets and the SPOE filter and avoids eventually inflating the appctx structure.	2017-03-09 15:32:55 +01:00
Christopher Faulet	a1cda02995	MAJOR: spoe: Add support of pipelined and asynchronous exchanges with agents Now, HAProxy and agents can announce the support for "pipelining" and/or "async" capabilities during the HELLO handshake. For now, HAProxy always announces the support of both. In addition, in its HELLO frames, HAProxy adds the "engine-id" key. It is a unique string that identifies a SPOE engine. The "pipelining" capability is the ability for a peer to decouple NOTIFY and ACK frames. This is a symmetrical capability. To be used, it must be supported by HAProxy and agents. Unlike HTTP pipelining, the ACK frames can be sent in any order, but always on the same TCP connection used for the corresponding NOTIFY frame. The "async" capability is similar to the pipelining, but here any TCP connection established between HAProxy and the agent can be used to send ACK frames. If an agent accepts connections from multiple HAProxy instances, it can use the "engine-id" value to group TCP connections.	2017-03-09 15:32:55 +01:00
Emmanuel Hocdet	f6b37c67be	BUG/MEDIUM: ssl: in bind line, ssl-options after 'crt' are ignored. Bug introduced with "removes SSL_CTX_set_ssl_version call and cleanup CTX creation": ssl_sock_new_ctx is called before all the bind line is parsed. The fix consists of separating the use of default_ctx as the initialization context of the SSL connection via bind_conf->initial_ctx. Initial_ctx contains all the necessary parameters before performing the selection of the CTX: default_ctx is processed as others ctx without unnecessary parameters.	2017-03-07 10:42:43 +01:00
Emmanuel Hocdet	4608ed9511	MEDIUM: ssl: remove ssl-options from crt-list ssl-options are linked to the initial negotiation environment carried by default_ctx. Remove them from crt-list to avoid any confusion.	2017-03-07 10:33:16 +01:00
Emmanuel Hocdet	0594211987	MEDIUM: boringssl: support native multi-cert selection without bundling This patch uses boringssl's callback to analyse the ClientHello before any handshake to extract key signature capabilities. The certificate with the better signature (ECDSA before RSA) is chosen transparently, if the client can support it. RSA and ECDSA certificates can be declared in a row (in any order). This makes it possible to set different ssl and filter parameters with crt-list.	2017-03-02 18:31:05 +01:00
Willy Tarreau	b686afd568	MINOR: chunks: implement a simple dynamic allocator for trash buffers The trash buffers are becoming increasingly complex to deal with due to the code's modularity allowing some functions to be chained and causing the same chunk buffers to be used multiple times along the chain, possibly corrupting each other. In fact the trash buffers were designed from scratch for explicitly not surviving a function call but string manipulation makes this impossible most of the time while not fulfilling the need for reliable temporary chunks. Here we introduce the ability to allocate a temporary trash chunk which is reserved, so that it will not conflict with the trash chunks other functions use, and will even support reentrant calls (eg: build_logline). For this, we create a new pool which is exactly the size of a usual chunk buffer plus the size of the chunk struct so that these chunks when allocated are exactly the same size as the ones returned by get_trash_buffer(). These allocations may fail so the caller must check them, and the caller is also responsible for freeing them. The code focuses on minimal changes and ease of reliable backporting because it will be needed in stable versions in order to support next patch.	2017-02-08 11:16:29 +01:00
Baptiste Assmann	5cd1b9222e	MINOR: dns: give ability to dns_init_resolvers() to close a socket when requested The function dns_init_resolvers() is used to initialize socket used to send DNS queries. This patch gives the function the ability to close a socket before re-opening it. [wt: this needs to be backported to 1.7 for next fix]	2017-02-03 07:21:32 +01:00
Willy Tarreau	c0752565fe	MINOR: server: extend the flags to 32 bits Right now not only are we limited to 8 bits, but it's mentioned nowhere and the limit was already reached. In addition, pp_opts (proxy protocol options) were set to 32 bits while only 3 are needed. So let's swap these two and group them together to avoid leaving two holes in the structure, saving 64 bits on 64-bit machines.	2017-01-25 18:49:27 +01:00
Willy Tarreau	e3e326d9f0	BUILD: ssl: kill a build warning introduced by BoringSSL compatibility A recent patch to support BoringSSL caused this warning to appear on OpenSSL 1.1.0 : src/ssl_sock.c:3062:4: warning: statement with no effect [-Wunused-value] It's caused by SSL_CTX_set_ecdh_auto() which is now only a macro testing that the last argument is zero, and the result is not used here. Let's just kill it for both versions. Tested with 0.9.8, 1.0.0, 1.0.1, 1.0.2, 1.1.0. This fix may be backported to 1.7 if the boringssl fix is as well.	2017-01-19 17:56:20 +01:00
Willy Tarreau	77d88da7e1	BUILD: ssl: eliminate warning with OpenSSL 1.1.0 regarding RAND_pseudo_bytes() This function was deprecated in 1.1.0 causing this warning : src/ssl_sock.c:551:3: warning: 'RAND_pseudo_bytes' is deprecated (declared at /opt/openssl-1.1.0/include/openssl/rand.h:47) [-Wdeprecated-declarations] The man suggests to use RAND_bytes() instead. While the return codes differ, it turns out that the function was already misused and was relying on RAND_bytes() return code instead. The patch was tested on 0.9.8, 1.0.0, 1.0.1, 1.0.2 and 1.1.0. This fix must be backported to 1.7 and the return code check should be backported to earlier versions if relevant.	2017-01-19 17:28:08 +01:00
Willy Tarreau	a4fb8ed1f2	BUILD: ssl: silence a warning reported for ERR_remove_state() In 1.0.0, this function was replaced with ERR_remove_thread_state(). As of openssl 1.1.0, both are now deprecated and do nothing at all. Thus we simply make this call do nothing in 1.1.0 to silence the warning. The change was tested with 0.9.8, 1.0.0, 1.0.1, 1.0.2 and 1.1.0. This kills the following warning on 1.1.0 : src/ssl_sock.c:7266:9: warning: 'ERR_remove_state' is deprecated (declared at /dev/shm/openssl-1.1.0b/include/openssl/err.h:247) [-Wdeprecated-declarations] This fix should be backported to 1.7.	2017-01-19 17:28:08 +01:00
Willy Tarreau	2b3205b5fa	BUILD: ssl: fix build on OpenSSL 1.0.0 After the code was ported to support 1.1.0, this one broke on 1.0.0 : src/shctx.c:406: undefined reference to `SSL_SESSION_set1_id_context' The function was indeed introduced only in 1.0.1. The build was validated with 0.9.8, 1.0.0, 1.0.1, 1.0.2 and 1.1.0. This fix must be backported to 1.7.	2017-01-19 17:27:57 +01:00
Emmanuel Hocdet	fdec7897fd	BUILD: ssl: fix to build (again) with boringssl Limitations: . disable force-ssl/tls (need more work) should be set earlier with SSL_CTX_new (SSL_CTX_set_ssl_version is removed) . disable generate-certificates (need more work) introduce SSL_NO_GENERATE_CERTIFICATES to disable generate-certificates. Cleanup some #ifdef and type related to boringssl env.	2017-01-16 12:40:35 +01:00
Emmanuel Hocdet	e7f2b7301c	MINOR: ssl: add curve suite for ECDHE negotiation Add 'curves' parameter on 'bind' and for 'crt-list' to set curve suite. (ex: curves X25519:P-256)	2017-01-13 11:41:01 +01:00
Emmanuel Hocdet	98263291cc	MAJOR: ssl: bind configuration per certificate crt-list is extended to support ssl configuration. You can now have such a line in crt-list <file>: mycert.pem [npn h2,http/1.1] Supported settings include "npn", "alpn", "verify", "ca_file", "crl_file", "ecdhe", "ciphers" configuration and ssl options. "crt-base" is also supported to fetch certificates.	2017-01-13 11:40:34 +01:00
Emeric Brun	3f78357066	OPTIM/MINOR: config: Optimize fullconn automatic computation loading configuration The previous version used an O(number of proxies)^2 algo to get the sum of the number of maxconns of frontends which reference a backend at least once. This new version adds the frontend's maxconn number to the backend's struct proxy member 'tot_fe_maxconn' when the backend name is resolved for switching rules or default_backend statement. At the end, the final backend's fullconn is computed by looping only once over all proxies, O(n). The load of a configuration using a large amount of backends (10 thousand) without configured fullconn was reduced from several minutes to a few seconds.	2017-01-12 17:36:09 +01:00
Willy Tarreau	48ef4c95b6	MINOR: tools: make str2sa_range() return the port in a separate argument This will be needed so that we don't have to extract it from the returned address where it will not always be anymore (eg: for unresolved servers).	2017-01-06 19:29:34 +01:00
Willy Tarreau	04276f3d6e	MEDIUM: server: split the address and the port into two different fields Keeping the address and the port in the same field causes a lot of problems, specifically on the DNS part where we're forced to cheat on the family to be able to keep the port. This causes some issues such as some families not being resolvable anymore. This patch first moves the service port to a new field "svc_port" so that the port field is never used anymore in the "addr" field (struct sockaddr_storage). All call places were adapted (there aren't that many).	2017-01-06 19:29:33 +01:00
Emeric Brun	4f60301235	MINOR: connection: add sample fetch "fc_rcvd_proxy" fc_rcvd_proxy : boolean Returns true if the client initiated the connection with a PROXY protocol header. A flag is added on the struct connection if a PROXY header is successfully parsed.	2017-01-06 11:59:17 +01:00
Robin H. Johnson	52f5db2a44	MINOR: http: custom status reason. The older 'rsprep' directive allows modification of the status reason. Extend 'http-response set-status' to take an optional string of the new status reason. http-response set-status 418 reason "I'm a coffeepot" Matching updates in Lua code: - AppletHTTP.set_status - HTTP.res_set_status Signed-off-by: Robin H. Johnson <robbat2@gentoo.org>	2017-01-06 11:57:44 +01:00
Willy Tarreau	0ebb511b3e	MINOR: tools: add a generic hexdump function for debugging debug_hexdump() prints to the requested output stream (typically stdout or stderr) an hex dump of the blob passed in argument. This is useful to help debug binary protocols.	2017-01-05 20:12:20 +01:00
Willy Tarreau	10e61cbf41	BUG/MINOR: http: report real parser state in error captures Error captures almost always report a state 26 (MSG_ERROR) making it very hard to know what the parser was expecting. The reason is that we have to switch to MSG_ERROR to trigger the dump, and then during the dump we capture the current state which is already MSG_ERROR. With this change we now copy the current state into an err_state field that will be reported as the faulty state. This patch looks a bit large because the parser doesn't update the current state until it runs out of data so the current state is never known when jumping to the error label! Thus the code had to be updated to take copies of the current state before switching to MSG_ERROR based on the switch/case values. As a bonus, it now shows the current state in human-readable form and not only in numeric form ; in the past it was not an issue since it was always 26 (MSG_ERROR). At least now we can get exploitable invalid request/response reports : [05/Jan/2017:19:28:57.095] frontend f (#2): invalid request backend <NONE> (#-1), server <NONE> (#-1), event #1 src 127.0.0.1:39894, session #4, session flags 0x00000080 HTTP msg state MSG_RQURI(4), msg flags 0x00000000, tx flags 0x00000000 HTTP chunk len 0 bytes, HTTP body len 0 bytes buffer flags 0x00908002, out 0 bytes, total 20 bytes pending 20 bytes, wrapping at 16384, error at position 5: 00000 GET /\e HTTP/1.0\r\n 00017 \r\n 00019 \n [05/Jan/2017:19:28:33.827] backend b (#3): invalid response frontend f (#2), server s1 (#1), event #0 src 127.0.0.1:39718, session #0, session flags 0x000004ce HTTP msg state MSG_HDR_NAME(17), msg flags 0x00000000, tx flags 0x08300000 HTTP chunk len 0 bytes, HTTP body len 0 bytes buffer flags 0x80008002, out 0 bytes, total 59 bytes pending 59 bytes, wrapping at 16384, error at position 31: 00000 HTTP/1.1 200 OK\r\n 00017 Content-length : 10\r\n 00038 \r\n 00040 0a\r\n 00044 0123456789\r\n 00056 0\r\n This should be backported to 1.7 and 1.6 at least to help with bug reports.	2017-01-05 19:48:50 +01:00
Christopher Faulet	0184ea71a6	BUG/MAJOR: channel: Fix the definition order of channel analyzers It is important to define analyzers (AN_REQ_* and AN_RES_*) in the same order they are evaluated in process_stream. This order is really important because during analyzers evaluation, we run them in the order of the lower bit to the higher one. This way, when an analyzer adds/removes another one during its evaluation, we know if it is located before or after it. So, when it adds an analyzer which is located before it, we can switch to it immediately, even if it has already been called once but removed since. With time, and the introduction of new analyzers, this order was broken. The main problems come from the filter analyzers. We used values not related to their evaluation order. Furthermore, we used the same values for request and response analyzers. So, to fix the bug, filter analyzers have been split into 2 distinct lists to have different analyzers for the request channel than those for the response channel. And of course, we have moved them to the right place. Some other analyzers have been reordered to respect the evaluation order: * AN_REQ_HTTP_TARPIT has been moved just before AN_REQ_SRV_RULES * AN_REQ_PRST_RDP_COOKIE has been moved just before AN_REQ_STICKING_RULES * AN_RES_STORE_RULES has been moved just after AN_RES_WAIT_HTTP Note today we have 29 analyzers, all stored into a 32 bits bitfield. So we can still add 4 more analyzers before having a problem. A good way to fend off the problem for a while could be to have a different bitfield for request and response analyzers. [wt: all of this must be backported to 1.7, and part of it must be backported to 1.6 and 1.5]	2017-01-05 17:58:22 +01:00
David Carlier	f2592b29f1	MEDIUM: regex: pcre2 support This adds support for the newest pcre2 library, more secure than its older sibling at the cost of a more complex API. It works pretty similarly to pcre's part to keep the overall change smooth, except : - we define the string class supported at compile time. - after matching the ovec data is properly sized, although we do not take advantage of it here. - the lack of jit support is treated less 'dramatically' as pcre2_jit_compile in this case is 'no-op'.	2016-12-28 12:51:51 +01:00
Willy Tarreau	ef934603c0	CLEANUP: ssl: move most ssl-specific global settings to ssl_sock.c Historically a lot of SSL global settings were stored into the global struct, but we've reached a point where there are 3 ifdefs in it just for this, and others in haproxy.c to initialize it. This patch moves all the private fields to a new struct "global_ssl" stored in ssl_sock.c. This includes : char *crt_base; char *ca_base; char *listen_default_ciphers; char *connect_default_ciphers; int listen_default_ssloptions; int connect_default_ssloptions; int tune.sslprivatecache; /* Force to use a private session cache even if nbproc > 1 */ unsigned int tune.ssllifetime; /* SSL session lifetime in seconds */ unsigned int tune.ssl_max_record; /* SSL max record size */ unsigned int tune.ssl_default_dh_param; /* SSL maximum DH parameter size */ int tune.ssl_ctx_cache; /* max number of entries in the ssl_ctx cache. */ The "tune" part was removed (useless here) and the occasional "ssl" prefixes were removed as well. Thus for example instead of global.tune.ssl_default_dh_param we now have : global_ssl.default_dh_param A few initializers were present in the constructor, they could be brought back to the structure declaration. A few other entries had to stay in global for now. They concern memory calculation (used in haproxy.c) and stats (used in stats.c). The code is already much cleaner now, especially for global.h and haproxy.c which become readable.	2016-12-22 23:26:38 +01:00
Willy Tarreau	d1c5750370	CLEANUP: ssl: move tlskeys_finalize_config() to a post_check callback tlskeys_finalize_config() was the only reason for haproxy.c to still require ifdef and includes for ssl_sock. This one fits perfectly well in the late initializers so it was changed to be registered with hap_register_post_check().	2016-12-22 23:26:38 +01:00
Willy Tarreau	d84dab733a	MINOR: connection: add new prepare_srv()/destroy_srv() entries to xprt_ops These one will be used by the SSL layer to prepare and destroy a server-side SSL context.	2016-12-22 23:26:38 +01:00
Willy Tarreau	d9f5cca3d5	CLEANUP: connection: unexport raw_sock and ssl_sock This way we're sure not to reuse them by accident.	2016-12-22 23:26:38 +01:00
Willy Tarreau	13e1410f8a	MINOR: connection: add a minimal transport layer registration system There are still a lot of #ifdef USE_OPENSSL in the code (still 43 occurrences) because we never know if we can directly access ssl_sock or not. This patch attacks the problem differently by providing a way for transport layers to register themselves and for users to retrieve the pointer. Unregistered transport layers will point to NULL so it will be easy to check if SSL is registered or not. The mechanism is very inexpensive as it relies on a two-entries array of pointers, so the performance will not be affected.	2016-12-22 23:26:38 +01:00
Willy Tarreau	141ad85d10	MINOR: server: move the use_ssl field out of the ifdef USE_OPENSSL Having it in the ifdef complicates certain operations which require additional ifdefs just to access a member which could remain zero in non-ssl cases. Let's move it out, it will not even increase the struct size on 64-bit machines due to alignment.	2016-12-22 23:26:38 +01:00
Willy Tarreau	fa983d3caa	MINOR: connection: add a new destroy_bind_conf() entry to xprt_ops This one will be set by the transport layers which want to destroy a bind_conf. It will typically be used by SSL to release certificates, CAs and so on.	2016-12-22 23:26:38 +01:00
Willy Tarreau	55d3791b46	MEDIUM: ssl_sock: implement ssl_sock_prepare_bind_conf() Instead of hard-coding all SSL preparation in cfgparse.c, we now register this new function as the transport layer's prepare_bind_conf() and call it only when defined. This removes some non-obvious SSL-specific code from cfgparse.c as well as a #ifdef.	2016-12-22 23:26:38 +01:00
Willy Tarreau	5aacf78a34	MINOR: connection: add a new prepare_bind_conf() entry to xprt_ops This one will be set by the transport layers which want to initialize a bind_conf. It will typically be used by SSL to load certificates, CAs and so on.	2016-12-22 23:26:38 +01:00
Willy Tarreau	0320934f7e	MEDIUM: ssl: remove the proxy argument from most functions Most of the SSL functions used to have a proxy argument which was mostly used to be able to emit clean errors using Alert(). First, many of them were converted to memprintf() and don't require this pointer anymore. Second, the rare ones which still need it also have either a bind_conf argument or a server argument, both of which carry a pointer to the relevant proxy. So let's now get rid of it, it needlessly complicates the API and certain functions already have many arguments.	2016-12-22 23:26:38 +01:00
Willy Tarreau	c95bad5013	MEDIUM: move listener->frontend to bind_conf->frontend Historically, all listeners have a pointer to the frontend. But since the introduction of SSL, we now have an intermediary layer called bind_conf corresponding to a "bind" line. It makes no sense to have the frontend on each listener given that it's the same for all listeners belonging to a same bind_conf. Also certain parts like SSL can only operate on bind_conf and need the frontend. This patch fixes this by moving the frontend pointer from the listener to the bind_conf. The extra indirection is quite cheap and the places where this is used are very scarce.	2016-12-22 23:26:38 +01:00
Willy Tarreau	71a8c7c49e	MINOR: listener: move the transport layer pointer to the bind_conf A mistake was made when the socket layer was cut into proto and transport, the transport was attached to the listener while all listeners in a single "bind" line always have exactly the same transport. It doesn't seem obvious but this is the reason why there are so many #ifdefs USE_OPENSSL in cfgparse : a lot of operations have to be open-coded because cfgparse only manipulates bind_conf and we don't have the information of the transport layer here. Very little code makes use of the transport layer, mainly session setup and log. These places can afford an extra pointer indirection (the listener points to the bind_conf). This change is thus very small, it saves a little bit of memory (8B per listener) and makes the code more flexible.	2016-12-22 23:26:37 +01:00
Willy Tarreau	368780334c	MEDIUM: compression: move the zlib-specific stuff from global.h to compression.c This finishes to clean up the zlib-specific parts. It also unbreaks recent commit `b97c6fb` ("CLEANUP: compression: use the build options list to report the algos") which broke USE_ZLIB due to MAXWBITS not being defined anymore in haproxy.c.	2016-12-22 20:00:46 +01:00
Willy Tarreau	ece9b07c71	MINOR: cfgparse: add two new functions to check arguments count We already had alertif_too_many_args{,_idx}(), but these ones are specifically designed for use in cfgparse. Outside of it we're trying to avoid calling Alert() all the time so we need an equivalent using a pointer to an error message. These new functions called too_many_args{,_idx}() do exactly this. They don't take the file name nor the line number which they have no use for but instead they take an optional pointer to an error message and the pointer to the error code is optional as well. With (NULL, NULL) they'll simply check the validity and return a verdict. They are quite convenient for use in isolated keyword parsers. These two new functions as well as the previous ones have all been exported.	2016-12-21 23:39:26 +01:00
Willy Tarreau	bee9dde31f	CLEANUP: da: move global settings out of the global section We replaced global.deviceatlas with global_deviceatlas since there's no need to store all this into the global section. This removes the last #ifdefs, and now the code is 100% self-contained in da.c. The file da.h was now removed because it was only used to load dac.h, which is more easily loaded directly from da.c. It provides another good example of how to integrate code in the future without touching the core parts.	2016-12-21 21:30:54 +01:00
Willy Tarreau	b7a671477f	CLEANUP: 51d: move global settings out of the global section We replaced global._51degrees with global_51degrees since there's no need to store all this into the global section. This removes the last #ifdefs, and now the code is 100% self-contained in 51d.c. The file 51d.h was now removed because it was only used to load 51Degrees.h, which is more easily loaded from 51d.c. It provides a good example of how to integrate code in the future without touching the core parts.	2016-12-21 21:30:54 +01:00
Willy Tarreau	350c1c6886	CLEANUP: wurfl: move global settings out of the global section We replaced global.wurfl with global_wurfl since there's no need to store all this into the global section. This removes the last #ifdefs, and now the code is 100% self-contained in wurfl.c. It provides a good example of how to integrate code in the future without touching the core parts.	2016-12-21 21:30:54 +01:00
Willy Tarreau	b149eedd5a	CLEANUP: da: register the deinitialization function deinit_deviceatlas() is not called anymore from haproxy.c, removing 2 #ifdefs and one include. The function was made static. The include file still includes other parts of the Deviceatlas library so it was not touched.	2016-12-21 21:30:54 +01:00
Willy Tarreau	7ac4c20509	CLEANUP: 51d: register the deinitialization function deinit_51degrees() is not called anymore from haproxy.c, removing 2 #ifdefs and one include. The function was made static. The include file still includes 51Degrees.h which is needed by global.h and 51d.c so it was not touched beyond this last function removal.	2016-12-21 21:30:54 +01:00
Willy Tarreau	800f93f375	CLEANUP: wurfl: register the deinit function via the dedicated list By registering the deinit function we avoid another #ifdef in haproxy.c. The ha_wurfl_deinit() function has been made static and unexported. Now proto/wurfl.h is totally empty, the code being self-contained in wurfl.c, so the useless .h has been removed.	2016-12-21 21:30:54 +01:00
Willy Tarreau	05554e6bf1	MINOR: haproxy: add a registration for post-deinit functions The 3 device detection engines stop at the same place in deinit() with the usual #ifdefs. Similar to the other functions we can have some late deinitialization functions. These functions do not return anything however so we have to use a different type.	2016-12-21 21:30:54 +01:00
Willy Tarreau	876054df96	CLEANUP: da: make use of the late init registration code Instead of having a #ifdef in the main init code we now use the registered init functions. Doing so also enables error checking as errors were previously reported as alerts but ignored. Also they were incorrect as the 'status' variable was hidden by a second one and was always reporting DA_SYS (which is apparently an error) in every case including the case where no file was loaded. The init_deviceatlas() function was unexported since it's not used outside of this place anymore.	2016-12-21 21:30:54 +01:00
Willy Tarreau	9f3f2549fb	CLEANUP: 51d: make use of the late init registration This removes some #ifdefs from the main haproxy code path. Function init_51degrees() now returns ERR_* instead of exit(1) on error, and this function was made static and is not exported anymore.	2016-12-21 21:30:54 +01:00
Willy Tarreau	dc2ed47163	CLEANUP: wurfl: make use of the late init registration This removes some #ifdefs from the main haproxy code path and enables error checking. The current code only makes use of warnings even for some errors that look serious. While this choice is questionable, it has been kept as-is, and only the return codes were adapted to ERR_WARN to at least report that some warnings were emitted. ha_wurfl_init() was unexported as it's not needed anymore.	2016-12-21 21:30:54 +01:00
Willy Tarreau	64bca599d9	CLEANUP: filters: use the function registration to initialize all proxies Function flt_init() was called in the main init code path, now we move it to the list of initializers and we can unexport flt_init().	2016-12-21 21:30:54 +01:00
Willy Tarreau	865c5148e6	CLEANUP: checks: make use of the post-init registration to start checks Instead of calling the checks directly from the init code, we now register the start_checks() function to be run at this point. This also allows to unexport the check init function and to remove one include from haproxy.c.	2016-12-21 21:30:54 +01:00
Willy Tarreau	e694573fa0	MINOR: haproxy: add a registration for post-check functions There's a significant amount of late initialization calls which are performed after the point where we exit in check mode. These calls are used to allocate resource and perform certain slow operations. Let's have a way to register some functions which need to be called there instead of having this multitude of #ifdef in the init path.	2016-12-21 21:30:54 +01:00
Willy Tarreau	cdb737e5a2	MINOR: haproxy: add a registration for build options Many extensions now report some build options to ease debugging, but this is now being done at the expense of code maintainability. Let's provide a registration function to do this so that we can start to remove most of the #ifdefs from haproxy.c (18 currently just for a single function).	2016-12-21 21:30:54 +01:00
Thierry FOURNIER	2c8b54e7be	MEDIUM: lua: remove Lua struct from session, and allocate it with memory pools This patch uses memory pools for allocating the Lua struct. This saves 128B of memory in the session if Lua is unused.	2016-12-21 15:24:56 +01:00
Willy Tarreau	f5f26e824a	MINOR: appctx/cli: remove the "tlskeys" entry from the appctx union This one now migrates to the general purpose cli.p0 for the ref pointer, cli.i0 for the dump_all flag and cli.i1 for the dump_keys_index. A few comments were added. The applet.h file doesn't depend on openssl anymore. It's worth noting that the previous dependency was accidental and only used to work because all files including this one used to have openssl included prior to loading this file.	2016-12-16 19:40:14 +01:00
Willy Tarreau	3c92f2aca4	MINOR: appctx/cli: remove the "server_state" entry from the appctx union This one now migrates to the general purpose cli.p0 for the proxy pointer, cli.p1 for the server pointer, and cli.i0 for the proxy's instance if only one has to be dumped.	2016-12-16 19:40:14 +01:00
Willy Tarreau	777b560d04	MINOR: appctx/cli: remove the "dns" entry from the appctx union This one now migrates to the general purpose cli.p0.	2016-12-16 19:40:14 +01:00
Willy Tarreau	608ea5921a	MINOR: appctx/cli: remove the "be" entry from the appctx union This one now migrates to the general purpose cli.p0. The parsing function was removed since it was only used to set the pointer to NULL.	2016-12-16 19:40:14 +01:00
Willy Tarreau	f6710f8811	MINOR: appctx/cli: remove the env entry from the appctx union This one now migrates to the general purpose cli.p0.	2016-12-16 19:40:14 +01:00
Willy Tarreau	3af9d832e8	MINOR: appctx/cli: remove the cli_socket entry from the appctx union This one now migrates to the general purpose cli.p0.	2016-12-16 19:40:13 +01:00
Willy Tarreau	a2d5872297	MINOR: cli: add two general purpose pointers and integers in the CLI struct Most of the keywords don't need to have their own entry in the appctx union, they just need to reuse some generic pointers like we've been used to do in the appctx with st{0,1,2}. This patch adds p0, p1, i0, i1 and initializes them to zero before calling the parser. This way some of the simplest existing keywords will be able to disappear from the union. It's worth noting that this is an extension to what was initially attempted via the "private" member that I removed a few patches ago by not understanding how it was supposed to be used. Here the fact that we share the same union will force us to be stricter: the code either uses the general purpose variables or it uses its own fields but not both.	2016-12-16 19:40:13 +01:00
Willy Tarreau	ebec3519b0	CLEANUP: applet: group all CLI contexts together The appctx storage became a real mess along the years. It now contains mostly CLI-specific parts that share the same storage as the "cli" part which in fact only contains the fields needed to pass an error message to the caller, and it also has room a few other regular applets which may become more and more common. This first patch moves the parts around in the union so that all standard applet parts are grouped together and the CLI-specific ones are grouped together. It also adds a few comments to indicate what certain parts are used for since it's sometimes a bit confusing.	2016-12-16 19:40:13 +01:00
Thierry FOURNIER	ebed6e908a	MEDIUM: lua: use memory pool for hlua struct in applets The struct hlua size is 128 bytes. The size is the biggest of all the elements of the union embedded in the appctx struct. With HTTP2, it is possible that this appctx struct will be used many times for each connection, so the 128 bytes are a little bit heavy for the global memory consumption. This patch replaces the embedded hlua struct by a pointer and an associated memory pool. Now, the memory for lua is allocated only if it is required. [wt: the appctx is now down to 160 bytes]	2016-12-16 16:31:45 +01:00
Thierry FOURNIER	18d0990a5d	CLEANUP: lua: rename one of the lua appctx union It is named hlua, which does not represent the usage of this variable. this patch renames this one to "hlua_cosocket".	2016-12-16 12:59:00 +01:00
William Lallemand	eceddf7225	MEDIUM: cli: 'show cli sockets' list the CLI sockets 'show cli sockets' from the CLI socket displays the list of CLI sockets available, with their level and process number.	2016-12-15 23:00:51 +01:00
Willy Tarreau	90f19e3c68	CLEANUP: applet: remove the now unused appctx->private field Nobody uses it anymore. This shrinks the appctx size by 8 bytes.	2016-12-14 16:48:16 +01:00
Willy Tarreau	a24bc78ad4	CLEANUP: applet/table: add an "action" entry in ->table context Just like previous patch, this was the only other user of the "private" field of the applet. It used to store a copy of the keyword's action. Let's just put it into ->table->action and use it from there. It also slightly simplifies the code by removing a few pointer to integer casts.	2016-12-14 16:48:16 +01:00
Willy Tarreau	8ae4f7533d	CLEANUP: applet/lua: create a dedicated ->fcn entry in hlua_cli context We have very few users of the appctx's private field which was introduced prior to the split of the CLI. Unfortunately it was not removed after the end. This commit simply introduces hlua_cli->fcn which is the pointer to the Lua function that the Lua code used to store in this private pointer.	2016-12-14 16:48:16 +01:00
Thierry FOURNIER	11cfb3daec	BUG/MEDIUM: lua: In some case, the return of sample-fetches is ignored (2) This problem was already detected here: `8dc7316a6f` Another case has arisen. Now HAProxy sends a final message (typically with "http-request deny"). Once the message is sent, the response channel flags are not modified. HAProxy executes a Lua sample-fetch for building logs, and the result is ignored because the response flag remains set to the value HTTP_MSG_RPBEFORE. So the Lua function hlua_check_proto() wants to guarantee the valid state of the buffer and asks for aborting the request. The function check_proto() is not the right way to ensure request consistency. The real question is not "Is the message valid?", but "Is the validity of the message unchanged?" This patch memorizes the parser state before entering the Lua code, and performs a check when leaving the Lua code. If the parser state has gone backwards, the request is aborted because the HTTP message is degraded. This patch should be backported to versions 1.6 and 1.7	2016-12-14 12:52:47 +01:00
Luca Pizzamiglio	578b169dcb	BUILD/MEDIUM: Fixing the build using LibreSSL Fixing the build using LibreSSL as OpenSSL implementation. Currently, LibreSSL 2.4.4 provides the same API as OpenSSL 1.0.1x, but it redefines the OpenSSL version number as 2.0.x, breaking all checks with OpenSSL 1.1.x. The patch solves the issue by checking the definition of the symbol LIBRESSL_VERSION_NUMBER when OpenSSL 1.1.x features are requested.	2016-12-12 22:57:04 +01:00
Christopher Faulet	a73e59b690	BUG/MAJOR: Fix how the list of entities waiting for a buffer is handled When an entity tries to get a buffer, if it cannot be allocated, for example because the number of buffers which may be allocated per process is limited, this entity is added to a list (called <buffer_wq>) and waits for an available buffer. Historically, the <buffer_wq> list was logically attached to streams because they were the only entities likely to be added to it. Now, applets can also be waiting for a free buffer. And with filters, we could imagine having other entities waiting for a buffer. So it makes sense to have a generic list. Anyway, with the current design there is a bug. When an applet fails to get a buffer, it will wait. But we add the stream attached to the applet to <buffer_wq>, instead of the applet itself. So when a buffer is available, we wake up the stream and not the waiting applet. So, it is possible to have waiting applets that are never awakened. So, now, <buffer_wq> is independent of streams. And we really add the waiting entity to <buffer_wq>. To be generic, the entity is responsible for defining the callback used to awaken it. In addition, applets will still request an input buffer when they become active. But they will not be put to sleep anymore if no buffer is available. So it is the responsibility of the applet I/O handler to check whether this buffer is allocated or not. This way, an applet can decide if this buffer is required or not and can do additional processing if not. [wt: backport to 1.7 and 1.6]	2016-12-12 19:11:04 +01:00
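The generic wait list described in a73e59b690 can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: only `<buffer_wq>` is named in the message, so every type and function name below (`buffer_wait_sketch`, `buffer_wq_add`, `buffer_wq_offer`, `applet_wakeup`) is a hypothetical stand-in, and a simple singly linked FIFO is assumed.

```c
#include <assert.h>
#include <stddef.h>

/* One waiting entity: the owner (stream, applet, filter...) plus the
 * callback that knows how to wake it up. Names are illustrative only. */
struct buffer_wait_sketch {
	void *target;
	int (*wakeup_cb)(void *target); /* non-zero if the entity took the buffer */
	struct buffer_wait_sketch *next;
};

/* FIFO of waiting entities, independent of any stream. */
struct buffer_wq_sketch {
	struct buffer_wait_sketch *head;
	struct buffer_wait_sketch *tail;
};

/* Append the waiting entity itself (not its stream) to the queue. */
static void buffer_wq_add(struct buffer_wq_sketch *q, struct buffer_wait_sketch *w)
{
	assert(q != NULL && w != NULL);
	w->next = NULL;
	if (q->tail)
		q->tail->next = w;
	else
		q->head = w;
	q->tail = w;
}

/* A buffer became available: wake waiters in order until one accepts it.
 * Returns 1 if some entity took the buffer, 0 if nobody did. */
static int buffer_wq_offer(struct buffer_wq_sketch *q)
{
	assert(q != NULL);
	while (q->head) {
		struct buffer_wait_sketch *w = q->head;

		q->head = w->next;
		if (!q->head)
			q->tail = NULL;
		w->next = NULL;
		if (w->wakeup_cb(w->target))
			return 1;
	}
	return 0;
}

/* Example waiter: an applet-like entity that just records the wake-up. */
struct applet_sketch {
	int woken;
};

static int applet_wakeup(void *target)
{
	((struct applet_sketch *)target)->woken = 1;
	return 1;
}
```

Because each entry carries its own callback, the queue never needs to know whether it is waking a stream, an applet or a filter, which is the point of detaching the list from streams.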
Christopher Faulet	9d810cae11	BUG/MEDIUM: stream: Save unprocessed events for a stream A stream can be awakened for different reasons. During its processing, it can be stopped early if no buffer is available. In this situation, the reason why the stream was awakened is lost, because we rely on the task state, which is reset after each processing loop. In many cases, that's not a big deal. But it can be useful to accumulate the task states if the stream processing is interrupted, especially if some filters need to be called. To be clearer, here is a simple example: 1) A stream is awakened with the reason TASK_WOKEN_MSG. 2) Because no buffer is available, the processing is interrupted, the stream is back to sleep. And the task state is reset. 3) Some buffers become available, so the stream is awakened with the reason TASK_WOKEN_RES. At this step, the previous reason (TASK_WOKEN_MSG) is lost. Now, the task states are saved for a stream and reset only when the stream processing is not interrupted. The corresponding bitfield represents the pending events for a stream. And we use this one instead of the task state during the stream processing. Note that TASK_WOKEN_TIMER and TASK_WOKEN_RES are always removed because these events are always handled during the stream processing. [wt: backport to 1.7 and 1.6]	2016-12-12 19:10:58 +01:00
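The three-step example in 9d810cae11 can be sketched as a small accumulator. This is a minimal illustration under stated assumptions: the flag values are made up (only the names echo the `TASK_WOKEN_*` reasons in the message), `stream_sketch` and `process_stream_sketch` are hypothetical, and the exact point where the timer and resource reasons are cleared is assumed rather than taken from the source.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical flag values; only the names mirror the wake-up reasons
 * mentioned in the commit message. */
#define WOKEN_TIMER 0x01u
#define WOKEN_MSG   0x02u
#define WOKEN_RES   0x04u

struct stream_sketch {
	unsigned int pending_events; /* wake-up reasons not yet processed */
};

/* Called each time the stream's task runs. Returns the events that were
 * processed, or 0 when processing was interrupted for lack of a buffer. */
static unsigned int process_stream_sketch(struct stream_sketch *s,
                                          unsigned int task_state,
                                          int buffer_available)
{
	unsigned int events;

	assert(s != NULL);
	s->pending_events |= task_state; /* accumulate instead of overwriting */

	if (!buffer_available) {
		/* Interrupted: timer and resource wake-ups are dropped (assumed
		 * to happen here), the other reasons are kept for the next run. */
		s->pending_events &= ~(WOKEN_TIMER | WOKEN_RES);
		return 0;
	}

	events = s->pending_events;
	s->pending_events = 0; /* full pass done: reset */
	return events;
}
```

Replaying the example: a first call with `WOKEN_MSG` and no buffer leaves `WOKEN_MSG` pending; a second call with `WOKEN_RES` and a buffer available then reports both reasons, so the earlier one is no longer lost.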
Christopher Faulet	34c5cc98da	MINOR: task: Rename run_queue and run_queue_cur counters <run_queue> is used to track the number of tasks in the run queue and <run_queue_cur> is a copy used for reporting purposes. These counters have been renamed, respectively, <tasks_run_queue> and <tasks_run_queue_cur>. So the naming is consistent between tasks and applets. [wt: needed for next fixes, backport to 1.7 and 1.6]	2016-12-12 19:10:54 +01:00
Christopher Faulet	1cbe74cd83	MINOR: applet: Count number of (active) applets As for tasks, 2 counters have been added to track : * the total number of applets : nb_applets * the number of active applets : applets_active_queue [wt: needed for next fixes, to backport to 1.7 and 1.6]	2016-12-12 19:10:46 +01:00
Thierry FOURNIER / OZON.IO	4b123bebe4	MINOR: lua: Allow argument for actions (http\|tcp)-(request\|response) action cannot take arguments from the configuration file. Arguments are useful for executing the action with a special context. This patch adds the possibility of passing arguments to an action. It runs exactly like sample fetches and other Lua wrappers. Note that this patch implements a 'TODO'.	2016-12-12 14:34:56 +01:00
Willy Tarreau	796c5b7997	OPTIM: stream-int: don't disable polling anymore on DONT_READ Commit `5fddab0` ("OPTIM: stream_interface: disable reading when CF_READ_DONTWAIT is set") improved the connection layer's efficiency back in 1.5-dev13 by avoiding successive read attempts on an active FD. But by disabling this on a polled FD, it causes an unpleasant side effect which is that the FD that was subscribed to polling is suddenly stopped and may need to be re-enabled once the kernel starts to slow down on data eviction (eg: saturated server at the other end, bursty traffic caused by too large maxpollevents). This behaviour is observable with persistent connections when there is a large enough connection count so that there's no data in the early connection and polling is required, because there are then up to 4 epoll_ctl() calls per request. It's important that the server is slower than haproxy to cause some delays when reading response. The current connection layer as designed in 1.6 with the FD cache doesn't require this trick anymore, though it still benefits from it when it saves an FD from being uselessly polled. But compared to the increased cost of enabling and disabling poll all the time, it's still better to disable it. In some cases it's possible to observe a performance increase as high as 30% by avoiding this epoll_ctl() dance. In the end we only want to disable it when the FD is speculatively read and not when it's polled. For this we introduce a new function __conn_data_done_recv() which is used to indicate that we're done with recv() and not interested in new attempts. If/when we later support event-triggered epoll, this function will have to change a bit to do the same even in the polled case. 
A quick test with keep-alive requests run on a dual-core / dual- thread Atom shows a significant improvement : single process, 0 bytes : before: Requests per second: 12243.20 [#/sec] (mean) after: Requests per second: 13354.54 [#/sec] (mean) single process, 4k : before: Requests per second: 9639.81 [#/sec] (mean) after: Requests per second: 10991.89 [#/sec] (mean) dual process, 0 bytes (unstable) : before: Requests per second: 16900-19800 ~ 17600 [#/sec] (mean) after: Requests per second: 18600-21400 ~ 20500 [#/sec] (mean)	2016-12-05 13:49:57 +01:00
Christopher Faulet	4aad833924	BUG/MINOR: filters: Protect args in macros HAS_DATA_FILTERS and IS_DATA_FILTER [wt: backport needed in 1.7]	2016-11-29 17:03:04 +01:00
Willy Tarreau	ddc3e9e55d	BUG/MINOR: stats: make field_str() return an empty string on NULL It already returns an empty string when the field is empty, but as a preventive measure we should do the same when the string itself is a NULL. While it is not supposed to happen, it will make the code more resistant against failed allocations and unexpected results. This fix should be backported to 1.7.	2016-11-26 15:58:37 +01:00
Willy Tarreau	7d56221d57	REORG: stkctr: move all the stick counters processing to stick-tables.c Historically we used to have the stick counters processing put into session.c which became stream.c. But a big part of it is now in stick-table.c (eg: converters) but despite this we still have all the sample fetch functions in stream.c These parts do not depend on the stream anymore, so let's move the remaining chunks to stick-table.c and have cleaner files. What remains in stream.c is everything needed to attach/detach trackers to the stream and to update the counters while the stream is being processed.	2016-11-25 16:10:05 +01:00
Willy Tarreau	397131093f	REORG: tcp-rules: move tcp rules processing to their own file There's no more reason to keep tcp rules processing inside proto_tcp.c given that there is nothing in common there except these 3 letters : tcp. The tcp rules are in fact connection, session and content processing rules. Let's move them to "tcp-rules" and let them live their life there.	2016-11-25 15:57:38 +01:00
Willy Tarreau	ae9bea0591	CLEANUP: counters: move from 3 types to 2 types We used to have 3 types of counters with a huge overlap : - listener counters : stats collected for each bind line - proxy counters : union of the frontend and backend counters - server counters : stats collected per server It happens that quite a good part was common between listeners and proxies due to the frontend counters being updated at the two locations, and that similarly the server and proxy counters were overlapping and being updated together. This patch cleans this up to propose only two types of counters : - fe_counters: used by frontends and listeners, related to incoming connections activity - be_counters: used by backends and servers, related to outgoing connections activity This allowed to remove some non-sensical counters from both parts. For frontends, the following entries were removed : cum_lbconn, last_sess, nbpend_max, failed_conns, failed_resp, retries, redispatches, q_time, c_time, d_time, t_time For backends, this one was removed : intercepted_req. While doing this it was discovered that we used to incorrectly report intercepted_req for backends in the HTML stats, which was always zero since it's never updated. Also it revealed a few inconsistencies (which were not fixed as they are harmless). For example, backends count connections (cum_conn) instead of sessions while servers count sessions and not connections. Over the long term, some extra cleanups may be performed by having some counters update functions touching both the server and backend at the same time, as well as both the frontend and listener, to ensure that all sides have all their stats properly filled. The stats dump will also be able to factor the dump functions by counter types.	2016-11-25 15:03:12 +01:00
Willy Tarreau	3758581e19	BUG/MINOR: freq-ctr: make swrate_add() support larger values Reinhard Vicinus reported that the reported average response times cannot be larger than 16s due to the double multiply being performed by swrate_add() which causes an overflow very quickly. Indeed, with N=512, the highest average value is 16448. One solution proposed by Reinhard is to turn to long long, but this involves 64x64 multiplies and 64->32 divides, which are extremely expensive on 32-bit platforms. There is in fact another way to avoid the overflow without using larger integers, it consists in avoiding the multiply using the fact that x*(n-1)/N = x-(x/N). Now it becomes possible to store average values as large as 8.4 millions, which is around 2h18mn. Interestingly, this improvement also makes the code cheaper to execute both on 32 and on 64 bit platforms : Before : 00000000 <swrate_add>: 0: 8b 54 24 04 mov 0x4(%esp),%edx 4: 8b 0a mov (%edx),%ecx 6: 89 c8 mov %ecx,%eax 8: c1 e0 09 shl $0x9,%eax b: 29 c8 sub %ecx,%eax d: 8b 4c 24 0c mov 0xc(%esp),%ecx 11: c1 e8 09 shr $0x9,%eax 14: 01 c8 add %ecx,%eax 16: 89 02 mov %eax,(%edx) After : 00000020 <swrate_add>: 20: 8b 4c 24 04 mov 0x4(%esp),%ecx 24: 8b 44 24 0c mov 0xc(%esp),%eax 28: 8b 11 mov (%ecx),%edx 2a: 01 d0 add %edx,%eax 2c: 81 c2 ff 01 00 00 add $0x1ff,%edx 32: c1 ea 09 shr $0x9,%edx 35: 29 d0 sub %edx,%eax 37: 89 01 mov %eax,(%ecx) This fix may be backported to 1.6.	2016-11-25 11:55:10 +01:00
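The rewrite described in 3758581e19 can be sketched in C as below. This is a minimal illustration reconstructed from the message and the disassembly it quotes, not a copy of the HAProxy source: the `_old` and `_new` suffixes are hypothetical names, and a 32-bit `unsigned int` is assumed.

```c
#include <assert.h>

/* Sliding-window sum over n samples: *sum holds roughly n times the average.
 * Old form: multiply first, which wraps around as soon as
 * *sum * (n - 1) no longer fits in 32 bits. */
static unsigned int swrate_add_old(unsigned int *sum, unsigned int n, unsigned int v)
{
	assert(sum && n > 0);
	return *sum = *sum * (n - 1) / n + v;
}

/* New form: x*(n-1)/n rewritten as x - x/n, with the division rounded up,
 * so no intermediate value is larger than *sum + n - 1. */
static unsigned int swrate_add_new(unsigned int *sum, unsigned int n, unsigned int v)
{
	assert(sum && n > 0);
	return *sum = *sum - (*sum + n - 1) / n + v;
}
```

With N=512, a stored sum of 10,000,000 corresponds to an average of about 19.5 s, above the old 16 s ceiling: the multiply in the old form wraps around there, while the new form only subtracts roughly one 512th of the sum before adding the new sample. For small sums the two forms give the same result.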
Willy Tarreau	35069f84af	MINOR: cli: make "show errors" capable of dumping only request or response When dealing with many proxies, it's hard to spot response errors because all internet-facing frontends constantly receive attacks. This patch now makes it possible to demand that only request or response errors are dumped by appending "request" or "response" to the show errors command.	2016-11-25 09:16:37 +01:00
Thierry FOURNIER / OZON.IO	8a4e4420fb	MEDIUM: log-format: Use standard HAProxy log system to report errors The log-format function emits its own error message using Alert(). This patch replaces this behavior and uses the standard HAProxy error system (with memprintf). The benefits are: - cleaning the log system - the logformat can ignore the caller (currently the caller must set a flag designating the caller function). - Make the usage of the logformat function easy for future components.	2016-11-25 07:32:58 +01:00
Thierry FOURNIER / OZON.IO	4ed1c9585d	MINOR: http/conf: store the use_backend configuration file and line for logs The error log of the directive use_backend doesn't provide the file and line containing the declaration. This patch stores this information.	2016-11-25 07:15:09 +01:00
Thierry FOURNIER / OZON.IO	7f3aa8b62f	MINOR: lua: add utility function for check boolean argument Strangely, the Lua API doesn't provide a function like luaL_checkboolean(). This little function adds this one.	2016-11-24 21:35:10 +01:00
Willy Tarreau	80ebacfc82	BUILD: ssl: make the SSL layer build again with openssl 0.9.8 Commit `1866d6d` ("MEDIUM: ssl: Add support for OpenSSL 1.1.0") introduced support for openssl 1.1.0 and temporarily broke 0.9.8. In the end the port was not very hard given that the only cause of build failures were functions supposedly absent from 0.9.8 that in fact did exist. Thus, adding a new #if to move these functions for versions older than 0.9.8 was enough to fix the trouble. It received very light testing, basically only an SSL bridge decrypting and re-encrypting traffic, and checking that everything looks right. That said, the functions specific to 0.9.8 here compared to 1.0.x are only SSL_SESSION_set1_id_context(), EVP_PKEY_base_id(), and X509_PUBKEY_get0_param().	2016-11-24 20:18:21 +01:00
Thierry FOURNIER / OZON.IO	a2c38d7904	MEDIUM: log-format: strict parsing and enable fail Until now, the function parse_logformat_string() never failed. It sent warnings when it parsed a bad format, and returned the expression on a best-effort basis. This patch replaces warnings by alerts and returns a fail code. Maybe the warning mode is designed for compatibility with old configuration versions. If that is the case, this compatibility is now broken. [wt: no, the reason is that an alert must cause a startup failure, but this will be OK with next patch]	2016-11-24 18:54:26 +01:00
Thierry FOURNIER / OZON.IO	6fe0e1b977	CLEANUP: log-format: remove unused arguments The log-format function parse_logformat_string() takes file and line for building parsing logs. These two parameters are embedded in the struct proxy curproxy, which is the current parsing context. This patch removes these two unused arguments.	2016-11-24 18:54:26 +01:00
Thierry FOURNIER / OZON.IO	eb51e16306	CLEANUP/MINOR: log-format: unexport functions parse_logformat_var_args() and parse_logformat_var() Remove the export of the functions parse_logformat_var_args() and parse_logformat_var(). These functions are a part of the logformat parser, and this export is useless.	2016-11-24 18:54:25 +01:00
Willy Tarreau	df4399fcb6	BUILD: server: remove a build warning introduced by latest series We get this when Lua is disabled, just a missing include. In file included from src/queue.c:18:0: include/proto/server.h:51:39: warning: 'struct appctx' declared inside parameter list [enabled by default]	2016-11-24 17:32:01 +01:00
Willy Tarreau	3b6e547be8	CLEANUP: cli: rename STAT_CLI_* to CLI_ST_* These are in CLI states, not stats states anymore. STAT_CLI_O_CUSTOM was more appropriately renamed CLI_ST_CALLBACK.	2016-11-24 16:59:28 +01:00
Willy Tarreau	0a73929dc8	REORG: cli: make "show env" also use the generic keyword registration This way we don't have any more state specific to a given yieldable command. The other commands should be easier to move as they only involve a parser.	2016-11-24 16:59:28 +01:00
Willy Tarreau	12207b360a	REORG: cli: move "show errors" out of cli.c It really belongs to proto_http.c since it's a dump for HTTP request and response errors. Note that it's possible that some parts do not need to be exported anymore since it really is the only place where errors are manipulated.	2016-11-24 16:59:28 +01:00
Willy Tarreau	f13ebdf286	REORG: cli: move table dump/clear/set to stick_table.c The table dump code was a horrible mess, with common parts interleaved all the way to deal with the various actions (set/clear/show). A few error messages were still incorrect, as the "set" operation did not update them so they would still report "unknown action" (now fixed). The action was now passed as a private argument to the CLI keyword which itself is copied into the appctx private field. It's just an int cast to a pointer. Some minor issues were noticed while doing this, for example when dumping an entry by key, if the key doesn't exist, nothing is printed, not even the table's header. It's unclear whether this was intentional but it doesn't really match what is done for data-based dumps. It was left unchanged for now so that a later fix can be backported if needed. Enum entries STAT_CLI_O_TAB, STAT_CLI_O_CLR and STAT_CLI_O_SET were removed.	2016-11-24 16:59:28 +01:00
Willy Tarreau	97c2ae13bc	REORG: cli: move dump_text(), dump_text_line(), and dump_binary() to standard.c These are general purpose functions, move them away.	2016-11-24 16:59:27 +01:00
Willy Tarreau	0baac8cf1f	REORG: cli: move "show info" to stats.c Move the "show info" command to stats.c using the CLI keyword API to register it on the CLI. The stats_dump_info_to_buffer() function is now static again. Note, we don't need proto_ssl anymore in cli.c.	2016-11-24 16:59:27 +01:00
Willy Tarreau	2b812e29f6	REORG: cli: move "show stat" to stats.c Move the "show stat" command to stats.c using the CLI keyword API to register it on the CLI. The stats_dump_stat_to_buffer() function is now static again.	2016-11-24 16:59:27 +01:00
William Lallemand	933efcd01a	REORG: cli: move 'show backend' to proxy.c Move 'show backend' CLI functions to proxy.c and use the cli keyword API to register it on the CLI.	2016-11-24 16:59:27 +01:00
William Lallemand	4c5b4d531c	REORG: cli: move 'show sess' to stream.c Move 'show sess' CLI functions to stream.c and use the cli keyword API to register it on the CLI. [wt: the choice of stream vs session makes sense because since 1.6 these really are streams that we're dumping and not sessions anymore]	2016-11-24 16:59:27 +01:00
William Lallemand	a6c5f3372d	REORG: cli: move 'show servers' to proxy.c Move 'show servers' CLI functions to proxy.c and use the cli keyword API to register it on the CLI.	2016-11-24 16:59:27 +01:00
William Lallemand	e7ed8855de	REORG: cli: move 'show pools' to memory.c Move 'show pools' CLI functions to memory.c and use the cli keyword API to register it on the CLI.	2016-11-24 16:59:27 +01:00
Willy Tarreau	960f2cb056	MINOR: proxy: create new function cli_find_frontend() to find a frontend Several CLI commands require a frontend, so let's have a function to look this one up and prepare the appropriate error message and the appctx's state in case of failure.	2016-11-24 16:59:27 +01:00
Willy Tarreau	21b069dca8	MINOR: server: create new function cli_find_server() to find a server Several CLI commands require a server, so let's have a function to look this one up and prepare the appropriate error message and the appctx's state in case of failure.	2016-11-24 16:59:27 +01:00
Willy Tarreau	de57a578ba	MINOR: cli: create new function cli_has_level() to validate permissions This function is used to check that the CLI features the appropriate level of permissions or to prepare the adequate error message.	2016-11-24 16:59:27 +01:00
William Lallemand	69e9644e35	REORG: cli: move show stat resolvers to dns.c Move dns CLI functions to dns.c and use the cli keyword API to register actions on the CLI.	2016-11-24 16:59:27 +01:00
William Lallemand	ad8be61c7e	REORG: cli: move map and acl code to map.c Move map and acl CLI functions to map.c and use the cli keyword API to register actions on the CLI. Then remove the now unused individual "add" and "del" keywords.	2016-11-24 16:59:27 +01:00
William Lallemand	32af203b75	REORG: cli: move ssl CLI functions to ssl_sock.c Move ssl CLI functions to ssl_sock.c and use the cli keyword API to register ssl actions on the CLI.	2016-11-24 16:59:27 +01:00
William Lallemand	9ed6203aef	REORG: cli: split dumpstats.h in stats.h and cli.h proto/dumpstats.h has been split in 4 files: * proto/cli.h contains prototypes for the CLI * proto/stats.h contains prototypes for the stats * types/cli.h contains definitions for the CLI * types/stats.h contains definitions for the stats	2016-11-24 16:59:27 +01:00
William Lallemand	74c24fb071	REORG: cli: split dumpstats.c in src/cli.c and src/stats.c dumpstats.c contained both the stats code and the CLI code. The cli code has been moved to cli.c and the stats code to stats.c	2016-11-24 16:59:27 +01:00
Willy Tarreau	8e0bb0ae16	MINOR: connection: add names for transport and data layers This makes debugging easier and avoids having to put ugly checks against certain well-known internal struct pointers.	2016-11-24 16:58:12 +01:00
Willy Tarreau	2dc770c641	MINOR: connection: add a few functions to report the data and xprt layers' names These functions will be needed by "show sess" on the CLI, let's make them globally available. It's important to note that due to the fact that we still do not set the data and transport layers' names in the structures, we still have to rely on some exports just to match the pointers. This is ugly but is preferable to adding many includes since the short-term goal is to get rid of these tests by having proper names in place.	2016-11-24 16:49:40 +01:00
David Carlier	327298c215	BUILD: fix build on Solaris 10/11 uint16_t instead of u_int16_t Non-ISO fields of struct tm are not present, but by zeroing it, on GNU and BSD systems the tm_gmtoff field will be set. [wt: moved the memset into each of the date functions]	2016-11-22 12:04:19 +01:00
Christopher Faulet	a00d817aba	MINOR: filters: Add check_timeouts callback to handle timers expiration on streams A filter can now be notified when a stream is woken up because of an expired timer. The documentation and the TRACE filter have been updated.	2016-11-21 15:29:58 +01:00
Willy Tarreau	733b1327a6	DEBUG: connection: mark the closed FDs with a value that is easier to detect Setting an FD to -1 when closed isn't the most easily noticeable thing to do when we're chasing accidental reuse of a stale file descriptor. Instead set it to that large a negative value that it will overflow the fdtab and provide an analysable core at the moment the issue happens. Care was taken to ensure it doesn't overflow nor change sign on 32-bit machines when multiplied by fdtab, and that it also remains negative for the various checks that exist. The value equals 0xFDDEADFD which happens to be easily spotted in a debugger.	2016-11-18 15:00:42 +01:00
Willy Tarreau	350135cf49	BUG/MEDIUM: connection: check the control layer before stopping polling The bug described in commit `568743a` ("BUG/MEDIUM: stream-int: completely detach connection on connect error") was not a stream-interface layer bug but a connection layer bug. There was exactly one place in the code where we could change a file descriptor's status without first checking whether it is valid or not, it was in conn_stop_polling(). This one is called when the polling status is changed after an update, and calls fd_stop_both even if we had already closed the file descriptor : 1479388298.484240 ->->->->-> conn_fd_handler > conn_cond_update_polling 1479388298.484240 ->->->->->-> conn_cond_update_polling > conn_stop_polling 1479388298.484241 ->->->->->->-> conn_stop_polling > conn_ctrl_ready 1479388298.484241 conn_stop_polling < conn_ctrl_ready 1479388298.484241 ->->->->->->-> conn_stop_polling > fd_stop_both 1479388298.484242 ->->->->->->->-> fd_stop_both > fd_update_cache 1479388298.484242 ->->->->->->->->-> fd_update_cache > fd_release_cache_entry 1479388298.484242 fd_update_cache < fd_release_cache_entry 1479388298.484243 fd_stop_both < fd_update_cache 1479388298.484243 conn_stop_polling < fd_stop_both 1479388298.484243 conn_cond_update_polling < conn_stop_polling 1479388298.484243 conn_fd_handler < conn_cond_update_polling The problem with the previous fix above is that it breaks the http_proxy mode and possibly even some Lua parts and peers to a certain extent ; all outgoing connections where the target address is initially copied into the outgoing connection which experience a retry would use a random outgoing address after the retry because closing and detaching the connection causes the target address to be lost. This was attempted to be addressed by commit `0857d7a` ("BUG/MAJOR: stream: properly mark the server address as unset on connect retry") but it used to only solve the most visible effect and not the root cause. 
Prior to this fix, it was possible to cause this config to keep CLOSE_WAIT for as long as it takes to expire a client or server timeout (note the missing client timeout) : listen test mode http bind :8002 server s1 127.0.0.1:8001 $ tcploop 8001 L0 W N20 A R P100 S:"HTTP/1.1 200 OK\r\nContent-length: 0\r\n\r\n" & $ tcploop 8002 N200 C T W S:"GET / HTTP/1.0\r\n\r\n" O P10000 K With this patch, these CLOSE_WAIT properly vanish when both processes leave. This commit reverts the two fixes above and replaces them with the proper fix in connection.h. It must be backported to 1.6 and 1.5. Thanks to Robson Roberto Souza Peixoto for providing very detailed traces showing some obvious inconsistencies leading to finding this bug.	2016-11-18 14:48:52 +01:00
Thierry FOURNIER / OZON.IO	a44fdd95f9	MEDIUM: lua: Add cli handler for Lua Now, HAProxy allows to register some keys in the "cli". This patch allows to handle these keys with Lua code.	2016-11-18 14:32:03 +01:00
Thierry FOURNIER / OZON.IO	6a22dcbe27	MINOR: cli: add private pointer and release function This pointer will be used for storing private context. With this, the same executed function can handle more than one keyword. This will be very useful for creating Lua cli bindings. The release function is called when the command is terminated (control is handed back to the prompt) or when the session is broken (timeout or client closed).	2016-11-18 14:32:03 +01:00
Thierry FOURNIER / OZON.IO	62fec75183	MINOR: lua: add ip addresses and network manipulation function Add two functions core.parse_addr() and core.match_addr() which are used for matching networks.	2016-11-12 10:42:30 +01:00
Willy Tarreau	f07741d0d5	BUILD: http: include types/sample.h in proto_http.h Commit `d7c9196` ("MAJOR: filters: Add filters support") removed sample.h from proto_http.h, but it has become necessary as of commit `fd7edd3` ("MINOR: Move http method enum from proto_http to sample") in order to have HTTP_METH_*. Due to this, the "debug/flags" utility doesn't build anymore.	2016-11-11 07:56:48 +01:00
Christopher Faulet	ba7bc164f7	MINOR: spoe/checks: Add support for SPOP health checks A new "option spop-check" statement has been added to enable server health checks based on SPOP HELLO handshake. SPOP is the protocol used by SPOE filters to talk to servers.	2016-11-09 22:57:02 +01:00
Christopher Faulet	f7e4e7e096	MAJOR: spoe: Add an experimental Stream Processing Offload Engine SPOE makes possible the communication with external components to retrieve some info using an in-house binary protocol, the Stream Processing Offload Protocol (SPOP). In the long term, its aim is to allow any kind of offloading on the streams. This first version, besides being experimental, won't do a lot of things. The most important today is to validate the protocol design and lay the foundations of what will, one day, be a full offload engine for the stream processing. So, for now, the SPOE can offload the stream processing before "tcp-request content", "tcp-response content", "http-request" and "http-response" rules. And it only supports variables creation/suppression. But, in spite of these limited features, we can easily imagine implementing an SSO solution, an ip reputation service or an ip geolocation service. Internally, the SPOE is implemented as a filter. So, to use it, you must use the following line in a proxy section: frontend my-front ... filter spoe [engine <name>] config <file> ... It uses its own configuration file to keep the HAProxy configuration clean. It is also an easy way to disable it by commenting out the filter line. See "doc/SPOE.txt" for all details about the SPOE configuration.	2016-11-09 22:57:01 +01:00
Christopher Faulet	85d79c94a9	MINOR: vars: Add 'unset-var' action/converter It does the opposite of 'set-var' action/converter. It is really useful for per-process variables. But, it can be used for any scope. The lua function 'unset_var' has also been added.	2016-11-09 22:57:01 +01:00
Christopher Faulet	ff2613ed7a	MEDIUM: vars: Add a per-process scope for variables Now it is possible to use variables attached to a process. The scope name is 'proc'. These variables are released only when HAProxy is stopped. 'tune.vars.proc-max-size' directive has been added to configure the maximum amount of memory used by "proc" variables. And because memory accounting is hierarchical for variables, memory for "proc" vars includes memory for "sess" vars.	2016-11-09 22:57:00 +01:00
Christopher Faulet	09c9df286b	MINOR: vars: Add vars_set_by_name_ifexist function This function, unsurprisingly, sets a variable value only if it already exists. In other words, this function will succeed only if the variable was found somewhere in the configuration during HAProxy startup. It will be used by SPOE filter. So an agent will be able to set a value only for existing variables. This prevents an agent from creating a very large number of unused variables to flood HAProxy and exhaust the memory reserved to variables.	2016-11-09 22:57:00 +01:00
Christopher Faulet	476e5d0e03	REORG: sample: move code to release a sample expression in sample.c This code has been moved from haproxy.c to sample.c and the function release_sample_expr can now be called from anywhere to release a sample expression. This function will be used by the stream processing offload engine (SPOE).	2016-11-09 22:57:00 +01:00
Christopher Faulet	79bdef3cad	MINOR: cfgparse: Parse scope lines and save the last one parsed A scope is a section name between square brackets, alone on its line, i.e.: [scope-name] ... The spaces at the beginning and at the end of the line are skipped. Comments at the end of the line are also skipped. When a scope is parsed, its name is saved in the global variable cfg_scope. Initially, cfg_scope is NULL and it remains NULL until a valid scope line is parsed. This feature remains unused in the HAProxy configuration file and undocumented. However, it will be used during SPOE configuration parsing.	2016-11-09 22:56:59 +01:00
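The scope-line rule described above (name between square brackets, surrounding spaces skipped, trailing comment skipped) can be sketched as follows. This is a minimal illustration, not HAProxy's parser: the function name parse_scope_line() and its buffer-based interface are hypothetical, and rejecting an empty "[]" is an assumption.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not HAProxy code: if <line> is a scope line such as
 * "  [my-scope]  # comment", copy the name into <out> and return 1.
 * Return 0 for any other line, or if the name does not fit in <out>.
 */
static int parse_scope_line(const char *line, char *out, size_t outlen)
{
	const char *start, *end, *p;
	size_t len;

	assert(line != NULL && out != NULL);

	while (isspace((unsigned char)*line))   /* skip leading spaces */
		line++;
	if (*line != '[')
		return 0;

	start = line + 1;
	end = strchr(start, ']');
	if (!end || end == start)               /* no closing bracket, or "[]" */
		return 0;

	/* only spaces or a comment may follow the closing bracket */
	for (p = end + 1; *p && *p != '#'; p++)
		if (!isspace((unsigned char)*p))
			return 0;

	len = (size_t)(end - start);
	if (len >= outlen)
		return 0;
	memcpy(out, start, len);
	out[len] = '\0';
	return 1;
}
```

A caller would typically keep the last name returned, in the same way the commit keeps the last parsed scope in cfg_scope.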
Christopher Faulet	7110b40d06	MINOR: cfgparse: Add functions to backup and restore registered sections This feature will be used by the stream processing offload engine (SPOE) to parse dedicated configuration files without mixing HAProxy sections with SPOE sections. So, here we can back up all sections known by HAProxy, unregister all of them and add new ones, dedicated to the SPOE. Once the SPOE configuration file is parsed, we can roll back all changes by restoring HAProxy sections.	2016-11-09 22:56:59 +01:00
Christopher Faulet	31ed32dce4	MEDIUM: filters: Add attach/detach and stream_set_backend callbacks New callbacks have been added to handle creation and destruction of filter instances: * 'attach' callback is called after a filter instance creation, when it is attached to a stream. This happens when the stream is started for filters defined on the stream's frontend and when the backend is set for filters declared on the stream's backend. It is possible to ignore the filter, if needed, by returning 0. This could be useful to have conditional filtering. * 'detach' callback is called when a filter instance is detached from a stream, before its destruction. This happens when the stream is stopped for filters defined on the stream's frontend and when the analysis ends for filters defined on the stream's backend. In addition, the callback 'stream_set_backend' has been added to know when a backend is set for a stream. It is only called when the frontend and the backend are not the same. And it is called for all filters attached to a stream (frontend and backend). Finally, the TRACE filter has been updated.	2016-11-09 22:50:54 +01:00
Christopher Faulet	898566e7e6	CLEANUP: remove last references to 'ruleset' section	2016-11-09 22:50:54 +01:00
Willy Tarreau	3eed10e54b	MINOR: init: add -dr to ignore server address resolution failures It is very common when validating a configuration out of production not to have access to the same resolvers and to fail on server address resolution, making it difficult to test a configuration. This option simply appends the "none" method to the list of address resolution methods for all servers, ensuring that even if the libc fails to resolve an address, the startup sequence is not interrupted.	2016-11-09 15:33:52 +01:00
Willy Tarreau	4310d36a7e	MINOR: server: add support for explicit numeric address in init-addr This will allow a server to automatically fall back to an explicit numeric IP address when all other methods fail. The address is simply specified in the address list.	2016-11-09 15:30:47 +01:00
Willy Tarreau	25e515235a	MEDIUM: server: make use of init-addr It is now supported. If not set, we default to the legacy methods list which is "last,libc".	2016-11-09 15:30:47 +01:00
Baptiste Assmann	25938278b7	MEDIUM: server: add a new init-addr server line setting This new setting supports a comma-delimited list of methods used to resolve the server's FQDN to an IP address. Currently supported methods are "libc" (use the regular libc's resolver) and "last" (use the last known valid address found in the state file). The list is implemented in a 32-bit integer, because each init-addr method only requires 3 bits. The last one must always be SRV_IADDR_END (0), allowing to store up to 10 methods in a single 32 bit integer. Note: the doc is provided at the end of this series.	2016-11-09 15:30:47 +01:00
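The packing described above (3 bits per method, terminated by SRV_IADDR_END = 0, up to 10 methods in a 32-bit integer) can be sketched as below. Only the value 0 for the end marker comes from the commit message; the names IADDR_LIBC and IADDR_LAST, their numeric values, and both helper functions are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Illustrative method codes: only the end marker (0) is taken from the
 * commit message, the other values are assumptions. */
enum { IADDR_END = 0, IADDR_LIBC = 1, IADDR_LAST = 2 };

#define IADDR_BITS 3
#define IADDR_MASK 7u
#define IADDR_MAX  10   /* 10 slots * 3 bits = 30 bits, fits in 32 bits */

/* Append <method> to the packed <list>. Returns 1 on success, 0 if full. */
static int iaddr_append(uint32_t *list, unsigned int method)
{
	int slot;

	assert(method != IADDR_END && method <= IADDR_MASK);

	for (slot = 0; slot < IADDR_MAX; slot++) {
		if (((*list >> (slot * IADDR_BITS)) & IADDR_MASK) == IADDR_END) {
			*list |= (uint32_t)method << (slot * IADDR_BITS);
			return 1;
		}
	}
	return 0;
}

/* Pop the first method from the list; returns IADDR_END once it is empty. */
static unsigned int iaddr_next(uint32_t *list)
{
	unsigned int method = *list & IADDR_MASK;

	*list >>= IADDR_BITS;
	return method;
}
```

Because the two unused top bits are always zero, a full list of 10 methods still reads back an end marker after the tenth shift.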
Baptiste Assmann	987e16d6f4	MINOR: dns: implement extra 'hold' timers. This adds new "hold" timers : nx, refused, timeout, other. These timers will be used to tell HAProxy to keep an erroneous response as valid for the corresponding period. For now they're only configured, not enforced.	2016-11-09 15:30:47 +01:00
Willy Tarreau	8b42848a44	MINOR: server: make srv_set_admin_state() capable of telling why this happens It will be important to help debugging some DNS resolution issues to know why a server was marked down, so let's make the function support a 3rd argument with an indication of the reason. Passing NULL will keep the message as-is.	2016-11-09 15:30:47 +01:00
Baptiste Assmann	89aa7f30da	MINOR: new srv_admin flag: SRV_ADMF_RMAINT This flag has to be set when an IP address resolution fails (either using libc at start up or using HAProxy's runtime resolver). This will automatically trigger the administrative status "MAINT", through the global mask SRV_ADMF_MAINT.	2016-11-09 15:23:16 +01:00
Baptiste Assmann	83cbaa531f	MAJOR: server: postpone address resolution Server addresses are not resolved anymore upon the first pass so that we don't fail if an address cannot be resolved by the libc. Instead they are processed all at once after the configuration is fully loaded, by the new function srv_init_addr(). This function only acts on the server's address if this address uses an FQDN, which appears in server->hostname. For now the function does two things, to follow HAProxy's historical default behavior: 1. apply server IP address found in server-state file if runtime DNS resolution is enabled for this server 2. use the DNS resolver provided by the libc If none of the 2 options above can find an IP address, then an error is returned. All of this will be needed to support the new server parameter "init-addr". For now, the biggest user-visible change is that all server resolution errors are dumped at once instead of causing a startup failure one by one.	2016-11-09 14:24:20 +01:00
Dirkjan Bussink	1866d6d8f1	MEDIUM: ssl: Add support for OpenSSL 1.1.0 In the last release a lot of the structures have become opaque for an end user. This means the code using these needs to be changed to use the proper functions to interact with these structures instead of trying to manipulate them directly. This does not fix any deprecations yet that are part of 1.1.0, it only ensures that it can be compiled against that version and is still compatible with older ones. [wt: openssl-0.9.8 doesn't build with it, there are conflicts on certain function prototypes which we declare as inline here and which are defined differently there. But openssl-0.9.8 is not supported anymore so probably it's OK to go without it for now and we'll see later if some users still need it. Emeric has reviewed this change and didn't spot anything obvious which requires special care. Let's try it for real now]	2016-11-08 20:54:41 +01:00
Willy Tarreau	e5d3169e1c	CLEANUP: wurfl: reduce exposure in the rest of the code The only reason wurfl/wurfl.h was needed outside of wurfl.c was to expose wurfl_handle which is a pointer to a structure, referenced by global.h. By just storing a void* there instead, we can confine all wurfl code to wurfl.c, which is really nice.	2016-11-08 18:47:25 +01:00
scientiamobile	d0027ed5b1	MEDIUM: wurfl: add Scientiamobile WURFL device detection module WURFL is a high-performance and low-memory footprint mobile device detection software component that can quickly and accurately detect over 500 capabilities of visiting devices. It can differentiate between portable mobile devices, desktop devices, SmartTVs and any other types of devices on which a web browser can be installed. In order to add WURFL device detection support, you would need to download Scientiamobile InFuze C API and install it on your system. Refer to www.scientiamobile.com to obtain a valid InFuze license. Any useful information on how to configure HAProxy working with WURFL may be found in: doc/WURFL-device-detection.txt doc/configuration.txt examples/wurfl-example.cfg Please find more information about WURFL device detection API detection at https://docs.scientiamobile.com/documentation/infuze/infuze-c-api-user-guide	2016-11-08 14:21:43 +01:00
Willy Tarreau	757478e900	BUG/MEDIUM: servers: properly propagate the maintenance states during startup Right now there is an issue with the way the maintenance flags are propagated upon startup. They are not propagated, just copied from the tracked server. This implies that depending on the server's order, some tracking servers may not be marked down. For example this configuration does not work as expected : server s1 1.1.1.1:8000 track s2 server s2 1.1.1.1:8000 track s3 server s3 1.1.1.1:8000 track s4 server s4 wtap:8000 check inter 1s disabled It results in s1/s2 being up, and s3/s4 being down, while all of them should be down. The only clean way to process this is to run through all "root" servers (those not tracking any other server), and to propagate their state down to all their trackers. This is the same algorithm used to propagate the state changes. It has to be done both to compute the IDRAIN flag and the IMAINT flag. However, doing so requires that tracking servers are not marked as inherited maintenance anymore while parsing the configuration (and given that it is wrong, better drop it). This fix also addresses another side effect of the bug above which is that the IDRAIN/IMAINT flags are stored in the state files, and if restored while the tracked server doesn't have the equivalent flag, the servers may end up in a situation where it's impossible to remove these flags. For example in the configuration above, after removing "disabled" on server s4, the other servers would have remained down, and not anymore with this fix. Similarly, the combination of IMAINT or IDRAIN with their respective forced modes was not accepted on reload, which is wrong as well. This bug has been present at least since 1.5, maybe even 1.4 (it came with tracking support). The fix needs to be backported there, though the srv-state parts are irrelevant. This commit relies on previous patch to silence warnings on startup.	2016-11-07 14:31:52 +01:00
Willy Tarreau	2235b261b6	OPTIM: http: move all http character class tables into a single one We used to have 7 different character classes, each was 256 bytes long, resulting in almost 2kB being used in the L1 cache. It's as cheap to test a bit as to check that the byte is not null, so let's store a 7-bit composite value and check for the respective bits there instead. The executable is now 4 kB smaller and the performance on small objects increased by about 1% to 222k requests/second with a config involving 4 http-request rules including 1 header lookup, one header replacement, and 2 variable assignments.	2016-11-05 15:58:08 +01:00
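The idea above, one 256-byte table holding a composite bit value instead of one table per class, can be sketched as follows. The three class bits and the function names are hypothetical (the commit mentions 7 classes but does not list them here), and the initialisation assumes an ASCII-compatible character set.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical class bits; HAProxy's real class names and values may differ */
#define CLS_DIGIT 0x01
#define CLS_ALPHA 0x02
#define CLS_SPACE 0x04

/* One shared table: each byte holds the OR of all classes of that character */
static uint8_t char_classes[256];

static void init_char_classes(void)
{
	int c;

	for (c = '0'; c <= '9'; c++)
		char_classes[c] |= CLS_DIGIT;
	for (c = 'a'; c <= 'z'; c++)   /* assumes contiguous letters (ASCII) */
		char_classes[c] |= CLS_ALPHA;
	for (c = 'A'; c <= 'Z'; c++)
		char_classes[c] |= CLS_ALPHA;
	char_classes[' ']  |= CLS_SPACE;
	char_classes['\t'] |= CLS_SPACE;
}

/* Returns non-zero if <c> belongs to at least one class set in <cls> */
static int is_class(unsigned char c, uint8_t cls)
{
	assert(cls != 0);
	return (char_classes[c] & cls) != 0;
}
```

A single lookup can also test several classes at once, for example is_class(c, CLS_ALPHA | CLS_DIGIT), which separate tables would need two loads for.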
Willy Tarreau	9df94c2b25	MINOR: peers: remove the pointer to the stream There's no reason to use the stream anymore, only the appctx should be used by a peer. This was a leftover from the migration to appctx and it caused some confusion, so let's totally drop it now. Note that half of the patch are just comment updates.	2016-10-31 20:07:01 +01:00
Andrew Rodland	13d5ebb913	MINOR: server: compute a "cumulative weight" to allow chash balancing to hit its target For active servers, this is the sum of the eweights of all active servers before this one in the backend, and [srv->cumulative_weight .. srv->cumulative_weight + srv->eweight) is a space occupied by this server in the range [0 .. lbprm.tot_wact), and likewise for backup servers with tot_wbck. This allows choosing a server or a range of servers proportional to their weight, by simple integer comparison. Signed-off-by: Andrew Rodland <andrewr@vimeo.com>	2016-10-25 20:21:32 +02:00
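A minimal sketch of the cumulative weight idea, assuming a plain array of servers: each server owns the half-open range starting at the sum of the weights before it, so a point in [0 .. total) maps to a server by integer comparison. The struct and function names are hypothetical and only mirror the eweight and cumulative_weight fields named above.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the two server fields named in the commit */
struct srv_sketch {
	unsigned int eweight;
	unsigned int cumulative_weight;
};

/* Fill cumulative_weight for each server and return the total weight */
static unsigned int set_cumulative_weights(struct srv_sketch *srv, size_t n)
{
	unsigned int total = 0;
	size_t i;

	assert(srv != NULL || n == 0);

	for (i = 0; i < n; i++) {
		srv[i].cumulative_weight = total;
		total += srv[i].eweight;
	}
	return total;
}

/* Return the index of the server whose range contains <point>, or -1 */
static int srv_for_point(const struct srv_sketch *srv, size_t n,
                         unsigned int point)
{
	size_t i;

	for (i = 0; i < n; i++) {
		if (point >= srv[i].cumulative_weight &&
		    point - srv[i].cumulative_weight < srv[i].eweight)
			return (int)i;
	}
	return -1;
}
```

With weights 10, 30 and 20 the servers own [0 .. 10), [10 .. 40) and [40 .. 60), so each one receives a share of points proportional to its weight.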
Andrew Rodland	b1f48e3161	MINOR: backend: add hash-balance-factor option for hash-type consistent 0 will mean no balancing occurs; otherwise it represents the ratio between the highest-loaded server and the average load, times 100 (i.e. a value of 150 means a 1.5x ratio), assuming equal weights. Signed-off-by: Andrew Rodland <andrewr@vimeo.com>	2016-10-25 20:21:32 +02:00
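The factor described above can be read as an upper bound on a server's load relative to the average. The check below is a simplified sketch under the commit's own assumption of equal weights; the function name and parameters are hypothetical and this is not the eligibility test HAProxy actually implements.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical check, assuming equal weights: returns 1 when <srv_served>
 * exceeds <factor>/100 times the average load, 0 otherwise.
 * A factor of 0 means no balancing, so nothing is ever over the limit.
 */
static int over_balance_limit(uint32_t srv_served, uint32_t total_served,
                              uint32_t nb_servers, uint32_t factor)
{
	assert(nb_servers > 0);

	if (factor == 0)
		return 0;

	/* srv_served > (factor / 100) * (total_served / nb_servers),
	 * rearranged so that no integer division is needed */
	return (uint64_t)srv_served * 100 * nb_servers >
	       (uint64_t)factor * total_served;
}
```

For example, with 4 servers, 40 sessions in total (average 10) and a factor of 150, a server holding 15 sessions is exactly at the 1.5x limit and one holding 16 is above it.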
Andrew Rodland	e168feb4a8	MINOR: proxy: add 'served' field to proxy, equal to total of all servers' This will allow lb_chash to determine the total active sessions for a proxy without any computation. Signed-off-by: Andrew Rodland <andrewr@vimeo.com>	2016-10-25 20:21:32 +02:00
Willy Tarreau	a5bc36b31c	MINOR: stats: emit dses This is the number of denied sessions, blocked by "tcp-request session reject".	2016-10-21 18:19:48 +02:00
Willy Tarreau	620408f406	MEDIUM: tcp: add registration and processing of TCP L5 rules This commit introduces "tcp-request session" rules. These are very much like "tcp-request connection" rules except that they're processed after the handshake, so it is possible to consider SSL information and addresses rewritten by the proxy protocol header in actions. This is particularly useful to track proxied sources as this was not possible before, given that tcp-request content rules are processed after each HTTP request. Similarly it is possible to assign the proxied source address or the client's cert to a variable.	2016-10-21 18:19:24 +02:00
Willy Tarreau	27df66e8d9	MINOR: counters: add new fields for denied_sess This will be used with denied sessions. This counter is not exposed yet.	2016-10-21 18:19:24 +02:00
Willy Tarreau	7d9736fb5d	CLEANUP: tcp rules: mention everywhere that tcp-conn rules are L4 This is in order to make integration of tcp-request-session cleaner : - tcp_exec_req_rules() was renamed tcp_exec_l4_rules() - LI_O_TCP_RULES was renamed LI_O_TCP_L4_RULES (LI_O_*'s horrible indent was also fixed and a provision was left for L5 rules).	2016-10-21 18:19:24 +02:00
Willy Tarreau	8a90b8ea19	MINOR: stats: output dcon These are denied conns. Strangely this wasn't emitted while it used to be available for a while. It corresponds to the number of connections blocked by "tcp-request connection reject".	2016-10-21 18:17:56 +02:00
William Lallemand	1e08cd819a	MEDIUM: cli: register CLI keywords with cli_register_kw() To register a new cli keyword, you need to declare a cli_kw_list structure in your source file: static struct cli_kw_list cli_kws = {{ },{ { { "test", "list", NULL }, "test list : do some tests on the cli", test_parsing, NULL }, { { NULL }, NULL, NULL, NULL, NULL } }}; And then register it: cli_register_kw(&cli_kws); The first field is an array of 5 elements, where you declare the keywords combination which will match, it must be ended by a NULL element. The second field is used as a usage message, it will appear in the help of the cli, you can set it to NULL if you don't want to show it, it's a good idea if you want to overwrite some existing keywords. The last two fields are callbacks. The first one is used at parsing time, you can use it to parse the arguments of your keywords and print small messages. The function must return 1 in case of a failure, otherwise 0: #include <proto/dumpstats.h> static int test_parsing(char **args, struct appctx *appctx) { struct chunk out; if (!args[2]) { appctx->ctx.cli.msg = "Error: the 3rd argument is mandatory !"; appctx->st0 = STAT_CLI_PRINT; return 1; } chunk_reset(&trash); chunk_printf(&trash, "arg[3]: %s\n", args[2]); chunk_init(&out, NULL, 0); chunk_dup(&out, &trash); appctx->ctx.cli.err = out.str; appctx->st0 = STAT_CLI_PRINT_FREE; /* print and free in the default cli_io_handler */ return 0; } The last field is the IO handler callback, it can be set to NULL if you want to use the default cli_io_handler() otherwise you can write your own. You can use the private pointer in the appctx if you need to store a context or some data. stats_dump_sess_to_buffer() is a good example of IO handler, IO handlers often use the appctx->st2 variable for the state machine. The handler must return 0 if it has to be called again later, otherwise 1.	2016-10-19 19:03:40 +02:00
Frédéric Lécaille	523cc9e858	MEDIUM: peers: Fix a peer stick-tables synchronization issue. During the stick-table teaching process which occurs at reloading/restart time, expiration dates of stick-tables entries were not synchronized between peers. This patch adds two new stick-table messages to provide such a synchronization feature. As these new messages are not supported by older haproxy peers protocol versions, this patch increments peers protocol version, from 2.0 to 2.1, to help in detecting/supporting such older peers protocol implementations so that new versions might still be able to transparently communicate with a newer one. [wt: technically speaking it would be nice to have this backported into 1.6 as some people who reload often are affected by this design limitation, but it's not a totally transparent change that may make certain users feel reluctant to upgrade older versions. Let's let it cook in 1.7 first and decide later]	2016-10-17 19:44:35 +02:00
Lukas Tribus	a0bcbdcb04	MEDIUM: make SO_REUSEPORT configurable With Linux officially introducing SO_REUSEPORT support in 3.9 and its mainstream adoption we have seen more people running into strange SO_REUSEPORT related issues (a process management issue turning into hard to diagnose problems because the kernel load-balances between the new and an obsolete haproxy instance). Also some people simply want the guarantee that the bind fails when the old process is still bound. This change makes SO_REUSEPORT configurable, introducing the command line argument "-dR" and the noreuseport configuration directive. A backport to 1.6 should be considered.	2016-09-13 07:56:03 +02:00
Lukas Tribus	dcbc5c5ecf	MINOR: show Built with PCRE version Inspired by PCRE's pcre_version.c and improved with Willy's suggestions. Reusable parts have been added to include/common/standard.h.	2016-09-13 07:55:51 +02:00
Baptiste Assmann	83b0a17678	MINOR: dns: comments in types/dns.h about structures endianness To avoid issues when porting code to some architecture, we need to know which endianness the structures currently use. This patch simply adds a short notice before those structures to report endianness and ease contributors' job.	2016-09-12 20:02:22 +02:00
Baptiste Assmann	c1ce5f358e	MEDIUM: dns: new DNS response parser New DNS response parser function which turns the DNS response from a network buffer into a DNS structure, much easier for later analysis by upper layer. Memory is pre-allocated at start-up in a chunk dedicated to DNS response store. New error code to report a wrong number of queries in a DNS response.	2016-09-12 19:54:23 +02:00
Baptiste Assmann	5748f73ff8	MINOR: dns: new DNS structures to store received packets struct dns_query_item: describes a DNS query record struct dns_answer_item: describes a DNS answer record struct dns_response_packet: describes a DNS response packet	2016-09-12 19:54:11 +02:00
Baptiste Assmann	ed97c95331	MINOR: dns: new macro to compute DNS header size macro to compute in a simple way the size of the dns_header structure. Makes the code more readable where used.	2016-09-12 19:53:59 +02:00
Baptiste Assmann	4ec076f8ed	MINOR: dns: new MAX values DNS_MIN_RECORD_SIZE: minimal size of a DNS record DNS_MAX_QUERY_RECORDS: maximum number of query records we allow. For now, we send one DNS query per request. DNS_MAX_ANSWER_RECORDS: maximum number of records we may find in a response WIP dns: new MAX values	2016-09-12 19:53:41 +02:00
Baptiste Assmann	d20bbaf093	MINOR: dns: wrong DNS_MAX_UDP_MESSAGE value Current implementation of HAProxy's DNS resolution expects only 512 bytes of data in the response. Update DNS_MAX_UDP_MESSAGE to match this. Backport: can be backported to 1.6	2016-09-12 19:53:25 +02:00
Baptiste Assmann	7819c125c2	MINOR: chunk: new strncat function Purpose of this function is to append data to the end of a chunk when we know only the pointer to the beginning of the string and the string length.	2016-09-12 19:51:59 +02:00
Baptiste Assmann	d458adcc52	MINOR: new update_server_addr_port() function to change both server's ADDR and service PORT This function can replace update_server_addr() where the need to change the server's port as well as the IP address is required. It performs some validation before performing each type of change.	2016-09-11 08:13:11 +02:00
Baptiste Assmann	6b453f166f	MINOR: server: introduction of 3 new server flags Introduction of 3 new server flags to remember if some parameters were set during configuration parsing. * SRV_F_CHECKADDR: this server has a check addr configured * SRV_F_CHECKPORT: this server has a check port configured * SRV_F_AGENTADDR: this server has an agent addr configured	2016-09-11 08:12:42 +02:00
Baptiste Assmann	95db2bcfee	MAJOR: check: find out which port to use for health check at run time HAProxy used to deduce port used for health checks when parsing configuration at startup time. Because of this way of working, it makes it complicated to change the port at run time. The current patch changes this behavior and makes HAProxy choose the port used for health checking when preparing the check task itself. A new type of error is introduced and reported when no port can be found. There won't be any impact on performance, since the process to find out the port value is made of a few 'if' statements. This patch also introduces a new check state CHK_ST_PORT_MISS: this flag is used to report an error in the case when HAProxy needs to establish a TCP connection to a server, to perform a health check but no TCP ports can be found for it. And last, it also introduces a new stream termination condition: SF_ERR_CHK_PORT. Purpose of this flag is to report an error in the event when HAProxy has to run a health check but no port can be found to perform it.	2016-09-11 08:12:13 +02:00
ben51degrees	1f077ebff2	BUILD/MAJOR: updated 51d Trie implementation to incorporate latest update to 51Degrees.c Trie now uses a dataset structure just like Pattern, so this has been defined in includes/types/global.h for both Pattern and Trie where it was just Pattern. In src/51d.c all functions used by the Trie implementation which need a dataset as an argument now use the global dataset. The fiftyoneDegreesDestroy method has now been replaced with fiftyoneDegreesDataSetFree which is common to Pattern and Trie. In addition, two extra dataset init statuses have been added to the switch statement in init_51degrees.	2016-08-24 20:29:31 +02:00
Willy Tarreau	17494bc695	CLEANUP: logs: remove unused log format field definitions A few log format fields were declared but never used, so let's drop them, the whole list is confusing enough already : LOG_FMT_VARIABLE, LOG_FMT_T, LOG_FMT_CONN, LOG_FMT_QUEUES.	2016-08-23 15:25:28 +02:00
Thierry FOURNIER / OZON.IO	4cac359a39	MEDIUM: log: Decompose %Tq in %Th %Ti %TR Tq is the time between the instant the connection is accepted and a complete valid request is received. This time includes the handshake (SSL / Proxy-Protocol), the idle time when the browser preconnects and the request reception. This patch decomposes %Tq into 3 measurements named %Th, %Ti, and %TR, which respectively return the handshake time, the idle time and the duration of valid request reception. It also adds %Ta which reports the request's active time, which is the total time without %Th nor %Ti. It replaces %Tt as the total time, reporting accurate measurements for HTTP persistent connections. %Th is available for TCP and HTTP sessions, %Ti, %TR and %Ta are only available for HTTP connections. In addition to this, we have new timestamps %tr, %trg and %trl, which log the date of start of receipt of the request, respectively in the default format, in GMT time and in local time (by analogy with %t, %T and %Tl). All of them are obviously only available for HTTP. These values are more relevant as they more accurately represent the request date without being skewed by a browser's preconnect nor a keep-alive idle time. The HTTP log format and the CLF log format have been modified to use %tr, %TR, and %Ta respectively instead of %t, %Tq and %Tt. This way the default log formats now produce the expected output for users who don't want to manually fiddle with the log-format directive.
Example with the following log-format :

    log-format "%ci:%cp [%tr] %ft %b/%s h=%Th/i=%Ti/R=%TR/w=%Tw/c=%Tc/r=%Tr/a=%Ta/t=%Tt %ST %B %CC %CS %tsc %ac/%fc/%bc/%sc/%rc %sq/%bq %hr %hs %{+Q}r"

The request was sent by hand using "openssl s_client -connect" :

    Aug 23 14:43:20 haproxy[25446]: 127.0.0.1:45636 [23/Aug/2016:14:43:20.221] test~ test/test h=6/i=2375/R=261/w=0/c=1/r=0/a=262/t=2643 200 145 - - ---- 1/1/0/0/0 0/0 "GET / HTTP/1.1"

=> 6 ms of SSL handshake, 2375 ms waiting before sending the first char (in fact the time to type the first line), 261 ms before the end of the request, no time spent in queue, 1 ms spent connecting to the server, immediate response, total active time for this request = 262 ms. Total time from accept to close : 2643 ms.

The timing now decomposes like this :

              first request               2nd request
       |<-------------------------------->|<-------------- ...
       t         tr                       t    tr ...
    ---|----|----|----|----|----|----|----|----|--
       : Th   Ti   TR   Tw   Tc   Tr   Td : Ti   ...
       :<---- Tq ---->:                   :
       :<-------------- Tt -------------->:
                 :<--------- Ta --------->:
	2016-08-23 15:18:08 +02:00
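The relations between the timers can be checked against the logged values from the example (h=6, i=2375, R=261, a=262, t=2643): Tq covers Th, Ti and TR, and Ta is the total time without Th and Ti. The struct and function names below are hypothetical, and the sketch assumes all timers are known and non-negative.

```c
#include <assert.h>

/* Hypothetical container for the timers of one request, in milliseconds */
struct http_timers_sketch {
	long th;     /* %Th: handshake time */
	long ti;     /* %Ti: idle time before the request */
	long t_req;  /* %TR: time to receive the full request */
	long tt;     /* %Tt: total time from accept to close */
};

/* %Tq as decomposed by the commit: Th + Ti + TR */
static long timer_tq(const struct http_timers_sketch *t)
{
	assert(t->th >= 0 && t->ti >= 0 && t->t_req >= 0);
	return t->th + t->ti + t->t_req;
}

/* %Ta: total time without the handshake and idle times */
static long timer_ta(const struct http_timers_sketch *t)
{
	assert(t->tt >= t->th + t->ti);
	return t->tt - t->th - t->ti;
}
```

With the values of the example log line, timer_ta() gives 2643 - 6 - 2375 = 262, which matches the logged a=262.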
Baptiste Assmann	08396c87d0	MINOR: standard.c: ipcpy() function to copy an IP address from a struct sockaddr_storage into another one The function ipcpy() simply duplicates the IP address found in one struct sockaddr_storage into another struct sockaddr_storage. It also updates the family on the destination structure. Memory of destination structure must be allocated and cleared by the caller.	2016-08-14 12:16:43 +02:00
Baptiste Assmann	08b24cfdb2	MINOR: standard.c: ipcmp() function to compare 2 IP addresses stored in 2 struct sockaddr_storage new ipcmp() function to compare 2 IP addresses stored in struct sockaddr_storage. Returns 0 if both addresses don't match and 1 if they do.	2016-08-14 12:16:27 +02:00
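A comparison with the return convention stated above (1 on match, 0 otherwise) can be sketched with the standard socket structures. The name ip_equal() is hypothetical and this is not the ipcmp() code from standard.c; ports are deliberately ignored, and treating unknown families as a mismatch is an assumption.

```c
#include <assert.h>
#include <arpa/inet.h>
#include <netinet/in.h>
#include <string.h>
#include <sys/socket.h>

/* Hypothetical sketch: returns 1 if both structures hold the same IPv4 or
 * IPv6 address, 0 if they differ or if the family is not an IP family.
 */
static int ip_equal(const struct sockaddr_storage *a,
                    const struct sockaddr_storage *b)
{
	assert(a != NULL && b != NULL);

	if (a->ss_family != b->ss_family)
		return 0;

	switch (a->ss_family) {
	case AF_INET:
		return ((const struct sockaddr_in *)a)->sin_addr.s_addr ==
		       ((const struct sockaddr_in *)b)->sin_addr.s_addr;
	case AF_INET6:
		return memcmp(&((const struct sockaddr_in6 *)a)->sin6_addr,
		              &((const struct sockaddr_in6 *)b)->sin6_addr,
		              sizeof(struct in6_addr)) == 0;
	}
	return 0;
}
```

The addresses can be filled with inet_pton() after clearing the structures and setting ss_family, as the caller is expected to do for ipcpy() destinations.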
Willy Tarreau	9d87ca0685	BUILD: tcp: define SOL_TCP when only IPPROTO_TCP exists FreeBSD prefers to use IPPROTO_TCP over SOL_TCP, just like it does with their *_IP counterparts. It's worth noting that there are a few inconsistencies between SOL_TCP and IPPROTO_TCP in the code, eg on TCP_QUICKACK. The two values are the same but it's worth applying what implementations recommend. No backport is needed, this was uncovered by the recent tcp_info stuff.	2016-08-10 21:11:38 +02:00
Willy Tarreau	d2629f293e	BUILD: connection: fix build breakage on openbsd due to missing in_systm.h Recent commit `93b227d` ("MINOR: listener: add the "accept-netscaler-cip" option to the "bind" keyword") introduced an include of netinet/ip.h which requires in_systm.h on OpenBSD. No backport is needed.	2016-08-10 19:32:33 +02:00
Willy Tarreau	16e015635c	MINOR: tcp: add dst_is_local and src_is_local It is sometimes needed in application server environments to easily tell if a source is local to the machine or a remote one, without necessarily knowing all the local addresses (dhcp, vrrp, etc). Similarly in transparent proxy configurations it is sometimes desired to tell the difference between local and remote destination addresses. This patch adds two new sample fetch functions for this : dst_is_local : boolean Returns true if the destination address of the incoming connection is local to the system, or false if the address doesn't exist on the system, meaning that it was intercepted in transparent mode. It can be useful to apply certain rules by default to forwarded traffic and other rules to the traffic targeting the real address of the machine. For example the stats page could be delivered only on this address, or SSH access could be locally redirected. Please note that the check involves a few system calls, so it's better to do it only once per connection. src_is_local : boolean Returns true if the source address of the incoming connection is local to the system, or false if the address doesn't exist on the system, meaning that it comes from a remote machine. Note that UNIX addresses are considered local. It can be useful to apply certain access restrictions based on where the client comes from (eg: require auth or https for remote machines). Please note that the check involves a few system calls, so it's better to do it only once per connection.	2016-08-09 16:50:08 +02:00
Willy Tarreau	77128f585c	MINOR: sample: provide smp_is_rw() and smp_make_rw() At some places, smp_dup() is inappropriately called to ensure a modification is possible while in fact we only need to ensure the sample may be modified in place. Let's provide smp_is_rw() to check for this capability and smp_make_rw() to perform the smp_dup() when it is not the case. Note that smp_is_rw() will also try to add the trailing zero on strings when needed if possible, to avoid a useless duplication.	2016-08-09 14:30:57 +02:00
Willy Tarreau	2c594794dd	MINOR: sample: implement smp_is_safe() and smp_make_safe() These functions ensure that the designated sample is "safe for use", which means that its size is known, its length is correct regarding its size, and that strings are properly zero-terminated. smp_is_safe() only checks (and optionally sets the trailing zero when needed and possible). smp_make_safe() will call smp_dup() after smp_is_safe() fails.	2016-08-09 14:03:36 +02:00
Willy Tarreau	ad63582eb9	BUG/MEDIUM: samples: make smp_dup() always duplicate the sample Vedran Furac reported a strange problem where the "base" sample fetch would not always work for tracking purposes. In fact, it happens that commit `bc8c404` ("MAJOR: stick-tables: use sample types in place of dedicated types") merged in 1.6 exposed a fundamental bug related to the way samples use chunks as strings. The problem is that chunks convey a base pointer, a length and an optional size, which may be zero when unknown or when the chunk is allocated from a read-only location. The sole purpose of this size is to know whether or not the chunk may be appended new data. This size cause some semantics issue in the sample, which has its own SMP_F_CONST flag to indicate read-only contents. The problem was emphasized by the commit above because it made use of new calls to smp_dup() to convert a sample to a table key. And since smp_dup() would only check the SMP_F_CONST flag, it would happily return read-write samples indicating size=0. So some tests were added upon smp_dup() return to ensure that the actual length is smaller than size, but this in fact made things even worse. For example, the "sni" server directive does some bad stuff on many occasions because it limits len to size-1 and effectively sets it to -1 and writes the zero byte before the beginning of the string! It is therefore obvious that smp_dup() needs to be modified to take this nature of the chunks into account. It's not enough but is needed. The core of the problem comes from the fact that smp_dup() is called for 5 distinct needs which are not always fulfilled : 1) duplicate a sample to keep a copy of it during some operations 2) ensure that the sample is rewritable for a converter like upper() 3) ensure that the sample is terminated with a \0 4) set a correct size on the sample 5) grow the sample in case it was extracted from a partial chunk Case 1 is not used for now, so we can ignore it. 
Case 2 indicates the wish to modify the sample, so its R/O status must be removed if any, but there's no implied requirement that the chunk becomes larger. Case 3 is used when the sample has to be made compatible with libc's str* functions. There's no need to make it R/W nor to duplicate it if it is already correct. Case 4 can happen when the sample's size is required (eg: before performing some changes that must fit in the buffer). Case 5 is more or less similar but will happen when the sample may be grown but we want to ensure we're not bound by the current small size. So the proposal is to have different functions for various operations. One will ensure a sample is safe for use with str* functions. Another one will ensure it may be rewritten in place. And smp_dup() will have to perform an unconditional duplication to guarantee at least #5 above, and implicitly all other ones. This patch only modifies smp_dup() to make the duplication unconditional. It is enough to fix both the "base" sample fetch and the "sni" server directive, and all use cases in general though not always optimally. More patches will follow to address them more optimally and even better than the current situation (eg: avoid a dup just to add a \0 when possible). The bug comes from an ambiguous design, so its roots are old. 1.6 is affected and a backport is needed. In 1.5, the function already existed but was only used by two converters modifying the data in place, so the bug has no effect there.	2016-08-09 14:03:23 +02:00
Dragan Dosen	1a5d06032b	MINOR: standard: add function "escape_string" Similar to "escape_chunk", this function tries to prefix all characters tagged in the <map> with the <escape> character. The specified <string> contains the input to be escaped.	2016-07-26 15:25:32 +02:00
Ruoshan Huang	e4edc6b628	MEDIUM: http: implement http-response track-sc* directive This enables tracking of sticky counters from current response. The only difference from "http-request track-sc" is the <key> sample expression can only make use of samples in response (eg. res.*, status etc.) and samples below Layer 6.	2016-07-26 14:31:14 +02:00
Thierry FOURNIER	9bd52d478b	BUG/MEDIUM: lua: the function txn_done() from action wrapper can crash If an action wrapper stops the processing of the transaction with a txn_done() function, the return code of the action is "continue". This "continue" can imply the processing of other actions, such as adding headers. However, the HTTP content is already flushed, so a segfault occurs. This patch adds a flag indicating that the Lua code wants to stop the processing; this flag is forwarded to the haproxy core, and other actions are ignored. Must be backported to 1.6.	2016-07-14 16:14:32 +02:00
Thierry FOURNIER	ab00df6cf6	BUG/MEDIUM: lua: the function txn_done() from sample fetches can crash The function txn_done() ends a transaction. It does not make sense to call this function from a Lua sample-fetch wrapper, because the role of a sample-fetch is not to terminate a transaction. This patch modifies the behaviour of the function txn_done() when it is called from a sample-fetch wrapper: it now just ends the execution of the Lua code, like the done() function. Must be backported to 1.6.	2016-07-14 16:14:24 +02:00
Nenad Merdanovic	8ab79420ba	BUG/MINOR: Fix endianness issue in DNS header creation code Alexander Lebedev reported that the response bit is set on SPARC when DNS queries are sent. This has been tracked down to an endianness issue, so this patch makes the code portable. Signed-off-by: Nenad Merdanovic <nmerdan@anine.io>	2016-07-13 14:47:58 +02:00
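One portable way to avoid this class of bug is to assemble the 16-bit flags word with shifts and write it out byte by byte, rather than relying on compiler bitfield ordering. The following is only a minimal sketch under that assumption, using the bit layout of RFC 2535, section 6.1; the struct and function names are hypothetical and are not HAProxy's.

```c
#include <stdint.h>

/* Hypothetical container for the DNS header flag fields (illustration only). */
struct dns_flags_sketch {
    unsigned qr, opcode, aa, tc, rd, ra, z, ad, cd, rcode;
};

/* Serialize the flags into two bytes in network (big-endian) order.
 * Shifts make the result independent of host endianness and of the
 * compiler's bitfield layout. */
static void dns_flags_to_wire_sketch(const struct dns_flags_sketch *f, uint8_t out[2])
{
    uint16_t v = 0;

    v |= (uint16_t)((f->qr     & 0x1u) << 15); /* query (0) / response (1) */
    v |= (uint16_t)((f->opcode & 0xFu) << 11);
    v |= (uint16_t)((f->aa     & 0x1u) << 10);
    v |= (uint16_t)((f->tc     & 0x1u) << 9);
    v |= (uint16_t)((f->rd     & 0x1u) << 8);
    v |= (uint16_t)((f->ra     & 0x1u) << 7);
    v |= (uint16_t)((f->z      & 0x1u) << 6);
    v |= (uint16_t)((f->ad     & 0x1u) << 5);
    v |= (uint16_t)((f->cd     & 0x1u) << 4);
    v |= (uint16_t)(f->rcode   & 0xFu);

    out[0] = (uint8_t)(v >> 8);   /* most significant byte first */
    out[1] = (uint8_t)(v & 0xFF);
}
```

With only `rd` set, as in a typical recursive query, the first wire byte is 0x01 and the response bit stays clear on any architecture.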
Willy Tarreau	eec1d3869d	BUG/MEDIUM: dns: fix alignment issues in the DNS response parser Alexander Lebedev reported that the DNS parser crashes in 1.6 with a bus error on Sparc when it receives a response. This is obviously caused by some alignment issues. The issue can also be reproduced on ARMv5 when setting /proc/cpu/alignment to 4 (which helps debugging). Two places cause this crash in turn, the first one is when the IP address from the packet is compared to the current one, and the second place is when the address is assigned because an unaligned address is passed to update_server_addr(). This patch modifies these places to properly use memcpy() and memcmp() to manipulate the unaligned data. Nenad Merdanovic found another set of places specific to 1.7 in functions in_net_ipv4() and in_net_ipv6(), which are used to compare networks. 1.6 has the functions but does not use them. There we perform a temporary copy to a local variable to fix the problem. The type of the function's argument is wrong since it's not necessarily aligned, so we change it for a const void * instead. This fix must be backported to 1.6. Note that in 1.6 the code is slightly different, there's no rec[] array, the pointer is used directly from the buffer.	2016-07-13 12:13:24 +02:00
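The memcpy()/memcmp() approach described above can be illustrated as follows. This is a minimal sketch, not the actual patch: the helper names and the raw packet layout are hypothetical, and the address is kept in network byte order as it appears in the buffer.

```c
#include <stdint.h>
#include <string.h>

/* Load a 4-byte IPv4 address from an arbitrary (possibly unaligned)
 * offset in a packet buffer. memcpy() never performs a misaligned
 * word access, unlike dereferencing a cast pointer. */
static uint32_t load_addr_sketch(const unsigned char *pkt, size_t off)
{
    uint32_t addr;

    memcpy(&addr, pkt + off, sizeof(addr));
    return addr;
}

/* Compare the address found in the packet with the current one,
 * again without assuming any alignment of the packet data. */
static int addr_equals_sketch(const unsigned char *pkt, size_t off, uint32_t cur)
{
    return memcmp(pkt + off, &cur, sizeof(cur)) == 0;
}
```

On strict-alignment CPUs such as SPARC, a construct like `*(uint32_t *)(pkt + off)` may raise a bus error when `off` is not a multiple of 4, which is the failure mode reported here.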
David Carlier	3015a2eebd	CLEANUP: connection: using internal struct to hold source and dest port. Originally, tcphdr's source and dest from Linux were used to get the source and dest ports, which led to a build issue on BSD OSes. To avoid side problems related to the network headers, we just use an internal struct as we need only those two fields.	2016-07-05 14:43:05 +02:00
Hubert Verstraete	2eae3a0497	MINOR: new function my_realloc2 = realloc + free upon failure When realloc fails to allocate memory, the original pointer is not freed. Sometimes people overwrite the original pointer with the pointer returned by realloc, which is NULL in case of failure. This results in a memory leak because the memory pointed to by the original pointer can no longer be freed.	2016-06-29 10:45:15 +02:00
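The "realloc + free upon failure" idea can be sketched as below. The function name is hypothetical and the body is an illustration of the pattern, not necessarily the code of my_realloc2() itself.

```c
#include <stdlib.h>

/* Resize <ptr> to <size> bytes. If realloc() fails, release the
 * original area so that the common idiom "p = f(p, n)" cannot leak.
 * The <size> test avoids a double free on platforms where
 * realloc(ptr, 0) frees <ptr> and returns NULL. */
static void *realloc_or_free_sketch(void *ptr, size_t size)
{
    void *ret = realloc(ptr, size);

    if (!ret && size)
        free(ptr);
    return ret;
}
```

A caller can then write `buf = realloc_or_free_sketch(buf, new_len);` and only has to test `buf` for NULL, since the old area is already released in the failure case.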
Bertrand Jacquin	9075968356	MINOR: tcp: add "tcp-request connection expect-netscaler-cip layer4" This configures the client-facing connection to receive a NetScaler Client IP insertion protocol header before any byte is read from the socket. This is equivalent to having the "accept-netscaler-cip" keyword on the "bind" line, except that using the TCP rule allows the PROXY protocol to be accepted only for certain IP address ranges using an ACL. This is convenient when multiple layers of load balancers are passed through by traffic coming from public hosts.	2016-06-20 23:02:47 +02:00
Bertrand Jacquin	93b227db95	MINOR: listener: add the "accept-netscaler-cip" option to the "bind" keyword When a NetScaler application switch is used as an L3+ switch, information regarding the original IP and TCP headers is lost as a new TCP connection is created between the NetScaler and the backend server. NetScaler provides a feature to insert in the TCP data the original data that can then be consumed by the backend server. Specifications and documentation from NetScaler: https://support.citrix.com/article/CTX205670 https://www.citrix.com/blogs/2016/04/25/how-to-enable-client-ip-in-tcpip-option-of-netscaler/ When CIP is enabled on the NetScaler, then a TCP packet is inserted just after the TCP handshake. This is composed as: - CIP magic number : 4 bytes Both sender and receiver have to agree on a magic number so that they both handle the incoming data as a NetScaler Client IP insertion packet. - Header length : 4 bytes Defines the length of the remaining data. - IP header : >= 20 bytes if IPv4, 40 bytes if IPv6 Contains the header of the last IP packet sent by the client during TCP handshake. - TCP header : >= 20 bytes Contains the header of the last TCP packet sent by the client during TCP handshake.	2016-06-20 23:02:47 +02:00
Emmanuel Hocdet	5e0e6e409b	MINOR: ssl: crt-list parsing factor LINESIZE and MAX_LINE_ARGS are too low for parsing crt-list.	2016-06-20 17:29:56 +02:00
William Lallemand	72a8a18e89	MEDIUM: dumpstats: make stats_tlskeys_list() yield-aware during tls-keys dump The previous dump algorithm was not trying to yield when the buffer is full, it's not a problem with the TLS_TICKETS_NO which is 3 by default but it can become one if the buffer size is lowered and if the TLS_TICKETS_NO is increased. The index of the latest ticket dumped is now stored to ensure we can resume the dump after a yield.	2016-06-14 19:42:08 +02:00
William Lallemand	cf9e788790	BUG/MEDIUM: dumpstats: undefined behavior in stats_tlskeys_list() The function stats_tlskeys_list() can trigger undefined behavior when called with appctx->st2 == STAT_ST_LIST, because the ref pointer is used uninitialized. However this function was using NULL in appctx->ctx.tlskeys.ref as a flag to dump every ticket from every reference. A real flag appctx->ctx.tlskeys.dump_all is now used for this behavior. This patch deletes the 'ref' variable and uses appctx->ctx.tlskeys.ref directly.	2016-06-14 19:41:58 +02:00
Dragan Dosen	e984a0e4fb	MINOR: stream: export the function 'smp_create_src_stkctr' Could be useful outside of this file.	2016-06-13 21:21:51 +02:00
William Lallemand	2e785f23cb	MEDIUM: tcp: add 'set-src' to 'tcp-request connection' The 'set-src' action was not available for tcp actions The action code has been converted into a function in proto_tcp.c to be used for both 'http-request' and 'tcp-request connection' actions. Both http and tcp keywords are registered in proto_tcp.c	2016-06-01 11:44:11 +02:00
Willy Tarreau	5f6e9054b9	BUILD: fix build on Solaris 11 htonll()/ntohll() already exist on Solaris 11 with a different declaration, causing a build error as reported by Jonathan Fisher. They used to exist on OSX with a #define which allowed us to detect them. It was a bad idea to give these functions a name subject to conflicts like this. Simply rename them my_htonll()/my_ntohll() to definitely get rid of the conflict. This patch must be backported to 1.6.	2016-05-26 07:15:57 +02:00
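For readers unfamiliar with these helpers, a 64-bit host/network conversion can be written without any endianness detection by going through a byte array. The sketch below uses hypothetical names and is not the implementation of my_htonll()/my_ntohll().

```c
#include <stdint.h>
#include <string.h>

/* Convert a 64-bit value from host order to network (big-endian) order
 * by emitting the most significant byte first. */
static uint64_t htonll_sketch(uint64_t host)
{
    unsigned char b[8];
    uint64_t net;
    int i;

    for (i = 0; i < 8; i++)
        b[i] = (unsigned char)(host >> (56 - 8 * i));
    memcpy(&net, b, sizeof(net));
    return net;
}

/* Convert a 64-bit value from network order back to host order. */
static uint64_t ntohll_sketch(uint64_t net)
{
    unsigned char b[8];
    uint64_t host = 0;
    int i;

    memcpy(b, &net, sizeof(b));
    for (i = 0; i < 8; i++)
        host = (host << 8) | b[i];
    return host;
}
```

Using a project-specific prefix or suffix on such names, as the commit does, avoids clashing with system headers that already declare htonll()/ntohll().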
Lukas Tribus	f2ebcb47cb	BUG/MEDIUM: dns: unbreak DNS resolver after header fix DNS requests (using the internal resolver) are corrupted since commit `e2f8497716` ("BUG/MINOR: dns: fix DNS header definition"). Fix it by defining the struct in network byte order, while complying with RFC 2535, section 6.1. First reported by Eduard Vopicka on discourse. This must be backported to 1.6 (1.6.5 is affected).	2016-05-25 22:39:37 +02:00
Willy Tarreau	58727ec088	BUG/MAJOR: http: fix breakage of "reqdeny" causing random crashes Commit `108b1dd` ("MEDIUM: http: configurable http result codes for http-request deny") introduced in 1.6-dev2 was incomplete. It introduced a new field "rule_deny_status" into struct http_txn, which is filled only by actions "http-request deny" and "http-request tarpit". It's then used in the deny code path to emit the proper error message, but is used uninitialized when the deny comes from a "reqdeny" rule, causing random behaviours ranging from returning a 200, an empty response, or crashing the process. Often upon startup only 200 was returned but after the fields are used the crash happens. This can be sped up using -dM. There's no need at all for storing this status in the http_txn struct anyway since it's used immediately after being set. Let's store it in a temporary variable instead which is passed as an argument to function http_req_get_intercept_rule(). As an extra benefit, removing it from struct http_txn reduced the size of this struct by 8 bytes. This fix must be backported to 1.6 where the bug was detected. Special thanks to Falco Schmutz for his detailed report including an exploitable core and a reproducer.	2016-05-25 16:23:59 +02:00
Vincent Bernat	6e61589573	BUG/MAJOR: fix listening IP address storage for frontends When compiled with GCC 6, the IP address specified for a frontend was ignored and HAProxy was listening on all addresses instead. This is caused by an incomplete copy of a "struct sockaddr_storage". With the GNU Libc, "struct sockaddr_storage" is defined as this: struct sockaddr_storage { sa_family_t ss_family; unsigned long int __ss_align; char __ss_padding[(128 - (2 * sizeof (unsigned long int)))]; }; Doing an aggregate copy (ss1 = ss2) is different than using memcpy(): only members of the aggregate have to be copied. Notably, padding can be or not be copied. In GCC 6, some optimizations use this fact and if a "struct sockaddr_storage" contains a "struct sockaddr_in", the port and the address are part of the padding (between sa_family and __ss_align) and can be not copied over. Therefore, we replace any aggregate copy by a memcpy(). There is another place using the same pattern. We also fix a function receiving a "struct sockaddr_storage" by copy instead of by reference. Since it only needs a read-only copy, the function is converted to request a reference.	2016-05-19 10:43:24 +02:00
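The replacement of an aggregate copy by memcpy() can be illustrated as follows. This is a minimal sketch with a hypothetical helper name; it also takes the source by reference, in the spirit of the second change mentioned above.

```c
#include <string.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <arpa/inet.h>

/* Copy a whole sockaddr_storage byte for byte. Unlike "*dst = *src",
 * memcpy() is guaranteed to copy the bytes that the C type system
 * regards as padding, which is where the port and address of an
 * embedded sockaddr_in actually live. */
static void copy_addr_sketch(struct sockaddr_storage *dst,
                             const struct sockaddr_storage *src)
{
    memcpy(dst, src, sizeof(*dst));
}
```

After such a copy, viewing `dst` as a `struct sockaddr_in` yields the same family, port and address as in the source, regardless of compiler optimizations on aggregate assignment.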
Christopher Faulet	3a394fa7cd	MEDIUM: filters: Add pre and post analyzer callbacks 'channel_analyze' callback has been removed. Now, there are 2 callbacks to surround calls to analyzers: * channel_pre_analyze: Called BEFORE all filterable analyzers. it can be called many times for the same analyzer, once at each loop until the analyzer finishes its processing. This callback is resumable, it returns a negative value if an error occurs, 0 if it needs to wait, any other value otherwise. * channel_post_analyze: Called AFTER all filterable analyzers. Here, AFTER means when an analyzer finishes its processing. This callback is NOT resumable, it returns a negative value if an error occurs, any other value otherwise. Pre and post analyzer callbacks are not automatically called. 'pre_analyzers' and 'post_analyzers' bit fields in the filter structure must be set to the right value using AN_* flags (see include/types/channel.h). The flag AN_RES_ALL has been added (AN_REQ_ALL already exists) to ease the life of filter developers. AN_REQ_ALL and AN_RES_ALL include all filterable analyzers.	2016-05-18 15:11:54 +02:00
Christopher Faulet	a9215b7206	MINOR: filters: Simplify calls to analyzers using 2 new macros Now, to call an analyzer in the 'process_stream' function, we should use the FLT_ANALYZE or ANALYZE macros, depending on whether this is a filterable analyzer or not.	2016-05-18 15:11:54 +02:00
Christopher Faulet	1339d744d5	MEDIUM: filters: Move HTTP headers filtering in its own callback Instead of calling 'channel_analyze' callback with the flag AN_FLT_HTTP_HDRS, now we use the new callback 'http_headers'. This change is done because 'channel_analyze' callback will be removed in a next commit.	2016-05-18 15:11:54 +02:00
Willy Tarreau	27b639d37f	MINOR: log: add the %Td log-format specifier As suggested by Pavlos, it's too bad that we didn't have a %Td log format tag given that there are a few mentions of Td corresponding to the data transmission time already in the doc, so this is now done. Just like the other specifiers, we report -1 if the connection failed before reaching the data transmission state.	2016-05-17 18:04:30 +02:00
Maxime de Roucy	dc88785f9c	MINOR: add list_append_word function int list_append_word(struct list *li, const char *str, char **err) Append a copy of string <str> (inside a wordlist) at the end of the list <li>. The caller is responsible for freeing the <err> and <str> copy memory area using free(). On failure, 0 is returned and <err> is filled with an error message.	2016-05-14 00:00:54 +02:00
Vincent Bernat	e2f8497716	BUG/MINOR: dns: fix DNS header definition Conforming to RFC 2535, section 6.1. This is not an important bug as those fields don't seem to be set to something else than 0 and to be checked on answers.	2016-05-09 11:01:08 +02:00
Cyril Bonté	6ca9e01ab2	BUG/MEDIUM: stats: show backend may show an empty or incomplete result This is the same issue as "show servers state", where the result is incorrect if the data can't fit in one buffer. A similar fix is applied, to restart the data processing where it stopped as buffers are sent to the client. This fix should be backported to haproxy 1.6	2016-05-06 12:28:43 +02:00
Cyril Bonté	76a99784f4	BUG/MEDIUM: stats: show servers state may show an empty or incomplete result It was reported that the unix socket command "show servers state" returned an empty response while "show servers state <backend>" worked. In fact, both cases can reproduce the issue. It happens when the response can't fit in one buffer. The fix consists in processing the response in several steps, as is done in some other commands, by restarting where it was stopped after the buffer is sent to the client. This fix should be backported to haproxy 1.6	2016-05-06 12:28:43 +02:00
Willy Tarreau	8bf242b764	BUG/MEDIUM: channel: fix inconsistent handling of 4GB-1 transfers In 1.4-dev3, commit `31971e5` ("[MEDIUM] add support for infinite forwarding") made it possible to configure the lower layer to forward data indefinitely by setting the forward size to CHN_INFINITE_FORWARD (4GB-1). By then larger chunk sizes were not supported so there was no confusion in the usage of the function. Since 1.5 we support 64-bit content-lengths and chunk sizes and the function has grown to support 64-bit arguments, though it still limits a single pass to 32-bit quantities (what fit in the channel's to_forward field). The issue now becomes that a 4GB-1 content-length can be confused with infinite forwarding (in fact it's 4GB-1+what was already in the buffer). It causes a visible effect when transferring this exact size because the transfer rate is lower than with other sizes due in part to the disabling of the Nagle algorithm on the sendto() call. In theory with keep-alive it should prevent a second request from being processed after such a transfer, but since the analysers are still present, the forwarding analyser properly counts down the remaining size to transfer and ultimately the transaction gets correctly reset so there is no visible effect. Since the root cause of the issue is an API problem (lack of distinction between a real valid length and a magic value), this patch modifies the API to have a new dedicated function called channel_forward_forever() to program a permanent forwarding. The existing function __channel_forward() was modified to properly take care of the requested sizes and ensure it 1) never overflows and 2) never reaches CHN_INFINITE_FORWARD by accident. It is worth noting that the function used to have a bug causing a 2GB forward to be scheduled if it was called with less data than what is present in buf->i. Fortunately this bug couldn't be triggered with existing code. This fix should be backported to 1.6 and 1.5. 
While it also theoretically affects 1.4, it's better not to backport it there, as the risk of breaking large object transfers due to significant API differences is high, compared to the fact that the largest supported objects (4GB-1) are just slower to transfer.	2016-05-04 15:26:37 +02:00
Willy Tarreau	ef907fee12	BUG/MAJOR: channel: fix miscalculation of available buffer space (4th try) Unfortunately, commit `169c470` ("BUG/MEDIUM: channel: fix miscalculation of available buffer space (3rd try)") was still not enough to completely address the issue. It fell into an integer comparison trap. Contrary to expectations, chn->to_forward may also have the sign bit set when forwarding regular data having a large content-length, resulting in an incomplete check of the result and of the reserve because the with to_forward very large, to_forward+o could become very small and also the reserve could become positive again and make channel_recv_limit() return a negative value. One way to reproduce this situation is to transfer a large file (> 2GB) with http-keep-alive or http-server-close, without splicing, and ensure that the server uses content-length instead of chunks. The transfer should stall very early after the first buffer has been transferred to the client. This fix now properly checks 1) for an overflow caused by summing o and to_forward, and 2) for o+to_forward being smaller or larger than maxrw before performing the subtract, so that all sensitive operations are properly performed on 33-bit arithmetics. The code was subjected again to a series of tests using inject+httpterm scanning a wide range of object sizes (+10MB after each new request) : $ printf "new page 1\nget 127.0.0.1:8002 / s=%%s0m\n" \| \ inject64 -o 1 -u 1 -f /dev/stdin With previous fix, the transfer would suddenly stop when reaching 2GB : hits ^hits hits/s ^h/s bytes kB/s last errs tout htime sdht ptime 203 1 2 1 216816173354 2710202 3144892 0 0 685.0 0.0 685.0 205 2 2 2 219257283186 2706880 2441109 0 0 679.5 6.5 679.5 205 0 2 0 219257283186 2673836 0 0 0 0.0 0.0 0.0 205 0 2 0 219257283186 2641622 0 0 0 0.0 0.0 0.0 205 0 2 0 219257283186 2610174 0 0 0 0.0 0.0 0.0 Now it's fine even past 4 GB. 
Many thanks to Vedran Furac for reporting this issue early with a common access pattern helping to troubleshoot this. This fix must be backported to 1.6 and 1.5 where the commit above was already backported.	2016-05-03 17:58:03 +02:00
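The two checks listed in this fix (overflow when summing o and to_forward, then comparison against maxrw before subtracting) can be sketched as follows. This is a simplified illustration only: the function name and the plain unsigned arguments are hypothetical, and the real code operates on channel and buffer structures.

```c
/* Return how many bytes of the <maxrw> reserve must still be kept
 * free, given <o> bytes scheduled for output and <to_forward> bytes
 * still to be forwarded. Both checks are done before subtracting so
 * that the result can never wrap around. */
static unsigned int reserve_left_sketch(unsigned int o,
                                        unsigned int to_forward,
                                        unsigned int maxrw)
{
    unsigned int transit = o + to_forward;

    /* 1) the unsigned sum wrapped: far more than maxrw is in transit */
    if (transit < o)
        return 0;

    /* 2) enough data is leaving the buffer: the reserve is not needed */
    if (transit >= maxrw)
        return 0;

    /* here transit < maxrw, so the subtraction cannot underflow */
    return maxrw - transit;
}
```

Keeping every quantity unsigned and testing for wrap-around explicitly avoids the signed comparison trap described above, where a very large to_forward made an intermediate value look small or negative.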
Willy Tarreau	55e58f2334	MINOR: channel: add new function channel_congested() This function returns non-zero if the channel is congested with data in transit waiting for leaving, indicating to the caller that it should wait for the reserve to be released before starting to process new data in case it needs the ability to append data. This is meant to be used while waiting for a clean response buffer before processing a request.	2016-05-02 16:39:22 +02:00
Thierry Fournier	3610c39c8c	MINOR: filters: add opaque data Add opaque data between the filter keyword registration and the parsing function. This opaque data allows the same parser to be used with different registered keywords. The opaque data carries the information that distinguishes one keyword from another. It will be used with Lua keyword registration.	2016-04-27 10:48:15 +02:00
Nenad Merdanovic	174dd37d88	MINOR: Add ability for agent-check to set server maxconn This is very useful in complex architecture systems where HAproxy is balancing DB connections for example. We want to keep the maxconn high in order to avoid issues with queueing on the LB level when there is slowness on another part of the system. Example is a case of an architecture where each thread opens multiple DB connections, which if get stuck in queue cause a snowball effect (old connections aren't closed, new ones cannot be established). These connections are mostly idle and the DB server has no problem handling thousands of them. Allowing us to dynamically set maxconn depending on the backend usage (LA, CPU, memory, etc.) enables us to have high maxconn for situations like above, but lowering it in case there are real issues where the backend servers become overloaded (cache issues, DB gets hit hard).	2016-04-25 17:23:50 +02:00
Willy Tarreau	169c47028a	BUG/MEDIUM: channel: fix miscalculation of available buffer space (3rd try) Latest fix `8a32106` ("BUG/MEDIUM: channel: fix miscalculation of available buffer space (2nd try)") did happen to fix some observable issues but not all of them in fact, some corner cases still remained and at least one user reported a busy loop that appeared possible, though not easily reproducible under experimental conditions. The remaining issue is that we still consider min(i, to_fwd) as the number of bytes in transit, but in fact <i> is not relevant here. Indeed, what matters is that we can read everything we want at once provided that at the end, <i> cannot be larger than <size-maxrw> (if it was not already). This is visible in two cases : - let's have i=o=max/2 and to_fwd=0. Then i+o >= max indicates that the buffer is already full, while it is not since once <o> is forwarded, some space remains. - when to_fwd is much larger than i, it's obvious that we can fill the buffer. The only relevant part in fact is o + to_fwd. to_fwd will ensure that at least this many bytes will be moved from <i> to <o> hence will leave the buffer, whatever the number of rounds it takes. Interestingly, the fix applied here ensures that channel_recv_max() will now equal (size - maxrw - i + to_fwd), which is indeed what remains available below maxrw after to_fwd bytes are forwarded from i to o and leave the buffer. Additionally, the latest fix made it possible to meet an integer overflow that was not caught by the range test when forwarding in TCP or tunnel mode due to to_forward being added to an existing value, causing the buffer size to be limited when it should not have been, resulting in 2 to 3 recv() calls when a single one was enough. The first one was limited to the unreserved buffer size, the second one to the size of the reserve minus 1, and the last one to the last byte. 
Eg with a 2kB buffer : recvfrom(22, "HTTP/1.1 200\r\nConnection: close\r"..., 1024, 0, NULL, NULL) = 1024 recvfrom(22, "23456789.123456789.123456789.123"..., 1023, 0, NULL, NULL) = 1023 recvfrom(22, "5", 1, 0, NULL, NULL) = 1 This bug is still present in 1.6 and 1.5 so the fix should be backported there.	2016-04-21 18:06:08 +02:00
Willy Tarreau	93dc478a04	BUG/MEDIUM: channel: incorrect polling condition may delay event delivery The condition to poll for receive as implemented in channel_may_recv() is still incorrect. If buf->o is null and buf->i is slightly larger than chn->to_forward and at least as large as buf->size - maxrewrite, then reading will be disabled. It may slightly delay some data delivery by having first to forward pending bytes, but may also cause some random issues with analysers that wait for some data before starting to forward what they correctly parsed. For instance, a body analyser may be prevented from seeing the data that only fits in the reserve. This bug may also prevent an applet's chk_rcv() function from being called when part of a buffer is released. It is possible (though not verified) that this participated to some peers frozen session issues some people have been facing. This fix should be backported to 1.6 and 1.5 to ensure better coherency with channel_recv_limit().	2016-04-21 17:03:46 +02:00
Willy Tarreau	4b46a3e8cc	BUG/MEDIUM: channel: don't allow to overwrite the reserve until connected Commit `9c06ee4` ("BUG/MEDIUM: channel: don't schedule data in transit for leaving until connected") took care of an issue involving POST in conjunction with http-send-name-header, where we absolutely never want to touch the reserve until we're sure not to touch the buffer contents anymore, which is indicated by the output stream-interface being connected. But channel_may_recv() was not equipped with such a test, so in some situations it might decide that it is possible to poll for reads, and later channel_recv_limit() will decide it's not possible to read, causing a loop. So we must add a similar test there. Since the fix above was backported to 1.6 and 1.5, this fix must as well.	2016-04-21 15:31:22 +02:00
Christopher Faulet	b3f4e14932	MINOR: filters: Print the list of existing filters during HA startup This is done in verbose/debug mode and when build options are reported.	2016-04-21 06:58:08 +02:00
Willy Tarreau	7a798e5d6b	CLEANUP: fix inconsistency between fd->iocb, proto->accept and accept() There's quite some inconsistency in the internal API. listener_accept() which is the main accept() function returns void but is declared as int in the include file. It's assigned to proto->accept() for all stream protocols where an int is expected but the result is never checked (nor is it documented by the way). This proto->accept() is in turn assigned to fd->iocb() which is supposed to return an int composed of FD_WAIT_* flags, but which is never checked either. So let's fix all this mess : - nobody checks accept()'s return - nobody checks iocb()'s return - nobody sets a return value => let's mark all these functions void and keep the current ones intact. Additionally we now include listener.h from listener.c to ensure we won't silently hide this incoherency in the future. Note that this patch could/should be backported to 1.6 and even 1.5 to simplify debugging sessions.	2016-04-14 11:18:22 +02:00
Willy Tarreau	8a32106fff	BUG/MEDIUM: channel: fix miscalculation of available buffer space (2nd try) Commit `999f643` ("BUG/MEDIUM: channel: fix miscalculation of available buffer space.") introduced a bug which made output data to be ignored when computing the remaining room in a buffer. The problem is that channel_may_recv() properly considers them and may declare that the FD may be polled for read events, but once the even strikes, channel_recv_limit() called before recv() says the opposite. In 1.6 and later this case is automatically caught by polling loop detection at the connection level and is harmless. But the backport in 1.5 ends up with a busy polling loop as soon as it becomes possible to have a buffer with this conflict. In order to reproduce it, it is necessary to have less than [maxrewrite] bytes available in a buffer, no forwarding enabled (end of transfer) and [buf->o >= maxrewrite - free space]. Since this heavily depends on socket buffers, it will randomly strike users. On 1.5 with 8kB buffers it was possible to reproduce it with httpterm using the following command line : $ (printf "GET /?s=675000 HTTP/1.0\r\n\r\n"; sleep 60) \| \ nc6 --rcvbuf-size 1 --send-only 127.0.0.1 8002 This bug is only medium in 1.6 and later but is major in the 1.5 backport, so it must be backported there. Thanks to Nenad Merdanovic and Janusz Dziemidowicz for reporting this issue with enough elements to help understand it.	2016-04-11 17:13:35 +02:00
Willy Tarreau	f3764b7993	MEDIUM: proxy: use dynamic allocation for error dumps There are two issues with error captures. The first one is that the capture size is still hard-coded to BUFSIZE regardless of any possible tune.bufsize setting and of the fact that frontends only capture request errors and that backends only capture response errors. The second is that captures are allocated in both directions for all proxies, which start to count a lot in configs using thousands of proxies. This patch changes this so that error captures are allocated only when needed, and of the proper size. It also refrains from dumping a buffer that was not allocated, which still allows to emit all relevant info such as flags and HTTP states. This way it is possible to save up to 32 kB of RAM per proxy in the default configuration.	2016-03-31 13:49:23 +02:00
Thierry Fournier	ff480424ab	MINOR: lua: add class listener This class provides access to the listener struct. It allows some manipulations and retrieves information.	2016-03-30 18:43:47 +02:00
Thierry Fournier	f2fdc9dc39	MINOR: lua: add class server This class provides access to the server struct. It allows some manipulations and retrieves information.	2016-03-30 18:43:47 +02:00
Thierry Fournier	f61aa6356e	MINOR: lua: add class proxy This class provides access to the proxy struct. It allows some manipulations and retrieves information.	2016-03-30 18:43:42 +02:00
Thierry Fournier	d0a56c2953	MINOR: dumpstats: split stats_dump_be_stats() in two parts This patch splits the function stats_dump_be_stats() in two parts. The extracted part is called stats_fill_be_stats(), and just fills the stats buffer. This split allows the use of preformatted stats in other parts of HAProxy such as Lua.	2016-03-30 17:26:19 +02:00
Thierry Fournier	61fe6c0adb	MINOR: dumpstats: split stats_dump_sv_stats() in two parts This patch splits the function stats_dump_sv_stats() in two parts. The extracted part is called stats_fill_sv_stats(), and just fills the stats buffer. This split allows the use of preformatted stats in other parts of HAProxy such as Lua.	2016-03-30 17:26:09 +02:00
Thierry Fournier	c4456856b0	MINOR: dumpstats: split stats_dump_li_stats() in two parts This patch splits the function stats_dump_li_stats() in two parts. The extracted part is called stats_fill_li_stats(), and just fills the stats buffer. This split allows the use of preformatted stats in other parts of HAProxy such as Lua.	2016-03-30 17:26:02 +02:00
Thierry Fournier	23d2d64185	MINOR: dumpstats: split stats_dump_fe_stats() in two parts This patch splits the function stats_dump_fe_stats() in two parts. The extracted part is called stats_fill_fe_stats(), and just fills the stats buffer. This split allows the use of preformatted stats in other parts of HAProxy such as Lua.	2016-03-30 17:21:59 +02:00
Thierry Fournier	cb2c767681	MINOR: dumpstats: split stats_dump_info_to_buffer() in two parts This patch splits the function stats_dump_info_to_buffer() in two parts. The extracted part is called stats_fill_info(), and just fills the stats buffer. This split allows the use of preformatted stats in other parts of HAProxy such as Lua.	2016-03-30 17:21:37 +02:00
Thierry Fournier	31e64ca301	MINOR: dumpstats: extract stats fields enum and names These field names can be used outside of the dumpstats file. This will be useful for exporting stats in Lua.	2016-03-30 17:21:09 +02:00
Thierry Fournier	3d4a675f24	MINOR: lua: post initialization This patch adds a Lua post-initialisation wrapper. It already exists for pure Lua functions; now it also executes C functions. It is useful for doing things once the configuration is ready to use. For example, we can browse and register all the proxies.	2016-03-30 15:44:58 +02:00
Thierry Fournier	45e78d7aa9	MINOR: lua: refactor the Lua object registration All the HAProxy Lua objects are declared with the same pattern: - Add the function __tostring which dumps the object name - Register the name in the Lua REGISTRY - Register the reference ID These actions are refactored into one function. This removes some lines of code.	2016-03-30 15:43:52 +02:00
Thierry Fournier	ddd8988fe5	MINOR: lua: move class registration facilities The functions - hlua_class_const_int() - hlua_class_const_str() - hlua_class_function() are used for common class registration actions. The function 'hlua_dump_object()' is a generic name-dumping function. These functions can be used by all the HAProxy objects, so I move them into the safe functions file.	2016-03-30 15:42:20 +02:00
Thierry Fournier	9ba1d024db	CLEANUP: map: it seems that the map were planed to be chained It seems that the maps were planned to be chained, but obviously this is not the case. This patch removes the struct which would have been used for the chain.	2016-03-30 15:41:15 +02:00
David Carlier	8ab1043c6b	CLEANUP: chunk: adding NULL check to chunk_dup allocation. Avoid a harmful memcpy() call if the allocation failed. Reset the size, which avoids later freeing an invalid pointer. This is closer to the behavior described in the comment.	2016-03-24 10:18:44 +01:00
Benoit GARNIER	e2e5bde3f2	BUG/MINOR: log: Don't use strftime() which can clobber timezone if chrooted The strftime() function can call tzset() internally on some platforms. When haproxy is chrooted, the /etc/localtime file is not found, and some implementations will clobber the content of the current timezone. The GMT offset is computed by diffing the times returned by gmtime_r() and localtime_r(). These variants are guaranteed to not call tzset() and were already used in haproxy while chrooted, so they should be safe. This patch must be backported to 1.6 and 1.5.	2016-03-17 05:30:03 +01:00
David Carlier	15073a3393	MINOR: sample: Moves ARGS underlying type from 32 to 64 bits. ARG# macros allow creating a list of up to 7 arguments in theory but 5 in practice. Changing to a guaranteed 64-bit type increases this to up to 12.	2016-03-15 22:11:52 +01:00
Benoit GARNIER	b413c2a759	BUG/MINOR: log: GMT offset not updated when entering/leaving DST GMT offset used in local time formats was computed at startup, but was not updated when DST status changed while running. For example these two RFC5424 syslog traces were emitted 5 seconds apart, just before and after DST changed: <14>1 2016-03-27T01:59:58+01:00 bunch-VirtualBox haproxy 2098 - - Connect ... <14>1 2016-03-27T03:00:03+01:00 bunch-VirtualBox haproxy 2098 - - Connect ... It looked like they were emitted more than 1 hour apart, unlike with the fix: <14>1 2016-03-27T01:59:58+01:00 bunch-VirtualBox haproxy 3381 - - Connect ... <14>1 2016-03-27T03:00:03+02:00 bunch-VirtualBox haproxy 3381 - - Connect ... This patch should be backported to 1.6 and partially to 1.5 (no fix needed in log.c).	2016-03-13 23:48:05 +01:00
Willy Tarreau	cb80912001	MEDIUM: stats: support "show info typed" on the CLI This emits the field positions, names and types. It is more convenient than the default output for a parser that doesn't know all the fields. It simply relies on stats_emit_typed_data_field() and stats_emit_field_tags() added by previous patch for the output. A new stats format flag was added, STAT_FMT_TYPED, which is set when the "typed" keyword is specified on the CLI.	2016-03-11 17:08:06 +01:00
Willy Tarreau	b47785f862	MINOR: stats: add functions to emit typed fields into a chunk New function stats_emit_typed_data_field() does exactly like stats_emit_raw_data_field() except that it also prints the data type after a colon. This will be used to print using the typed format. And function stats_emit_field_tags() appends a 3-letter code describing the origin, nature, and scope, followed by an optional delimiter. This will be particularly convenient to dump typed data.	2016-03-11 17:08:05 +01:00
Willy Tarreau	508a63fb96	MINOR: stats: add ST_SHOWADMIN to pass the admin info in the regular flags It's easier to have a new flag in <flags> to indicate whether or not we want to display the admin column in HTML dumps. We already have similar flags to show the version or the legends.	2016-03-11 17:08:05 +01:00
Willy Tarreau	8e62c05af2	MINOR: stats: create fields types suitable for all CSV output data We're preparing for various data types for each stats field as they appear in the CSV output. For now we only cover the regular types handled by printf, so we have 32 and 64 bit ints and counters, strings, and of course "empty" to indicate that there's nothing in the field and which guarantees that any accessed entry will return 0. More types will surely come later so that some fields are properly represented. For example, we could see limits where only the value 0 doesn't show up, or human time, etc.	2016-03-11 17:08:04 +01:00
Willy Tarreau	6204cd9f27	BUG/MAJOR: vars: always retrieve the stream and session from the sample This is the continuation of previous patch called "BUG/MAJOR: samples: check smp->strm before using it". It happens that variables may have a session-wide scope, and that their session is retrieved by dereferencing the stream. But nothing prevents them from being used from a streamless context such as tcp-request connection, thus crashing the process. Example : tcp-request connection accept if { src,set-var(sess.foo) -m found } In order to fix this, we have to always ensure that variable manipulation only happens via the sample, which contains the correct owner and context, and that we never use one from a different source. This results in quite a large change since a lot of functions are indirectly involved in the call chain, but the change is easy to follow. This fix must be backported to 1.6, and requires the last two patches.	2016-03-10 17:28:04 +01:00
Willy Tarreau	1777ea63e0	MINOR: sample: add a new helper to initialize the owner of a sample Since commit `6879ad3` ("MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers") merged in 1.6-dev2, the sample contains the pointer to the stream and sample fetch functions as well as converters use it heavily. This requires a lot of call places to initialize 4 fields, and it was even forgotten at a few places. This patch provides a convenient helper to initialize all these fields at once, making it easy to prepare a new sample from a previous one for example. A few call places were cleaned up to make use of it. It will be needed by further fixes. At one place in the Lua code, it was moved earlier because we used to call sample casts with a non completely initialized sample, which is not clean even though at the moment there are no consequences.	2016-03-10 16:42:58 +01:00
Willy Tarreau	be508f1580	BUG/MAJOR: samples: check smp->strm before using it Since commit `6879ad3` ("MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers") merged in 1.6-dev2, the sample contains the pointer to the stream and sample fetch functions as well as converters use it heavily. The problem is that earlier commit `87b0966` ("REORG/MAJOR: session: rename the "session" entity to "stream"") had split the session and stream resulting in the possibility for smp->strm to be NULL before the stream was initialized. This is what happens in tcp-request connection rulesets, as discovered by Baptiste. The sample fetch functions must now check that smp->strm is valid before using it. An alternative could consist in using a dummy stream with nothing in it to avoid some checks but it would only result in deferring them to the next step anyway, and making it harder to detect that a stream is valid or the dummy one. There is still an issue with variables which requires a complete independent fix. They use strm->sess to find the session with strm possibly NULL and passed as an argument. All call places indirectly use smp->strm to build strm. So the problem is there but the API needs to be changed to remove this duplicate argument that makes it much harder to know what pointer to use. This fix must be backported to 1.6, as well as the next one fixing variables.	2016-03-10 16:42:58 +01:00
Willy Tarreau	320ec2a745	BUG/MEDIUM: chunks: always reject negative-length chunks The recent addition of "show env" on the CLI has revealed an interesting design bug. Chunks are supposed to support a negative length to indicate that they carry no data. chunk_printf() sets this size to -1 if the string is too large for the buffer. At a few places in the http engine we may end up with trash.len = -1. But bi_putchk(), chunk_appendf() and a few other chunks consumers don't consider this case as possible and will use such a chunk, possibly restoring an invalid string or trying to copy -1 bytes. This fix takes care of clarifying the situation in a backportable way where such sizes are used, so that a negative length indicating an error remains present until the chunk is reinitialized or overwritten. But a cleaner design adjustment needs to be done so that there's a clear contract on how to use these chunks. At first glance it doesn't seem that useful to support negative sizes, so probably this is what should change. This fix must be backported to 1.6 and 1.5.	2016-02-25 16:24:14 +01:00
Thierry Fournier	09a9178311	MINOR: server: generalize the "updater" source The function server_parse_addr_change_request() contains a hardcoded updater source "stats command". This function can be called from sources other than the "stats command", so this patch makes this argument generic.	2016-02-24 23:37:39 +01:00
Thierry Fournier	d35b7a6d93	CLEANUP: server: add "const" to some message strings "updater" is used in "read only" mode, so I add a const qualifier to the variable declaration.	2016-02-24 23:37:39 +01:00
Thierry Fournier	9f72555b65	BUG/MINOR: server: some prototypes are renamed The commit 87b096 renames the functions srv_shutdown_backup_sessions() and srv_shutdown_sessions() to srv_shutdown_backup_streams() and srv_shutdown_streams(). The header file <proto/servers.h> does not reflect these changes. This fix should be backported to the 1.6 branch, even if it is useless because new developments are frozen there.	2016-02-23 22:42:47 +01:00
Pieter Baauw	235fcfcf14	MINOR: mailers: make it possible to configure the connection timeout This patch introduces a configurable connection timeout for mailers with a new "timeout mail <time>" directive. Acked-by: Simon Horman <horms@verge.net.au>	2016-02-20 15:33:06 +01:00
Thierry Fournier	ac88cfe452	MEDIUM: dns: add a "resolve-net" option which allow to prefer an ip in a network This option prioritizes the choice of an IP address matching a network. This is useful with clouds to prefer a local IP. In some cases, a cloud high availability service can be announced with many IP addresses in many different datacenters. The latency between datacenters is not negligible, so this patch makes it possible to prefer a local datacenter. If no address matches the configured network, another address is selected.	2016-02-19 14:37:49 +01:00
Thierry Fournier	ada348459f	MEDIUM: dns: extract options DNS selection preferences are currently declared inline in the struct server. They are copied from the server struct to the dns_resolution struct for each resolution. The next patches add new preference options, and copying all the configuration information before each DNS resolution is not a good approach. This patch extracts the configuration preferences from the struct server and declares a new dedicated struct. Only a pointer to this new struct will be copied before each DNS resolution.	2016-02-19 14:37:46 +01:00
Thierry Fournier	70473a5f8c	MINOR: common: mask conversion Add function which converts network mask from bit length form to struct in*_addr form.	2016-02-19 14:37:41 +01:00
Thierry Fournier	49d4842e98	BUG/MAJOR: lua: segfault using Concat object Concat object is based on "luaL_Buffer". The luaL_Buffer documentation says: During its normal operation, a string buffer uses a variable number of stack slots. So, while using a buffer, you cannot assume that you know where the top of the stack is. You can use the stack between successive calls to buffer operations as long as that use is balanced; that is, when you call a buffer operation, the stack is at the same level it was immediately after the previous buffer operation. (The only exception to this rule is luaL_addvalue.) After calling luaL_pushresult the stack is back to its level when the buffer was initialized, plus the final string on its top. So, the stack cannot be manipulated between the first call to the function "luaL_buffinit()" and the last call to the function "luaL_pushresult()" because we cannot know the stack status. In addition, the memory used by these functions seems to be collected by the GC, so if the GC is triggered while the Concat object is in use, some released memory may be used. This patch rewrites the Concat class without the "luaL_Buffer" system. It uses "userdata()" for the memory allocation of the buffer strings.	2016-02-19 13:24:09 +01:00
Pieter Baauw	46af170e41	MINOR: mailers: increase default timeout to 10 seconds This allows the TCP connection to send multiple SYN packets, so 1 lost packet does not cause the mail to be lost. It changes the socket timeout from 2 to 10 seconds, which allows 3 SYN packets to be sent, plus a short wait for their reply. This patch should be backported to 1.6. Acked-by: Simon Horman <horms@verge.net.au>	2016-02-17 10:19:08 +01:00
Willy Tarreau	ae79572f89	MINOR: cli: add a new "show env" command Using environment variables in configuration files can make troubleshooting complicated because there's no easy way to verify that the variables are correct. This patch introduces a new "show env" command which displays the whole environment on the CLI, one variable per line. The socket must at least have level operator to display the environment.	2016-02-16 11:43:03 +01:00
Dragan Dosen	835b9212f6	MEDIUM: log: add a new log format flag "E" The +E mode escapes characters '"', '\' and ']' with '\' as prefix. It mostly makes sense to use it in the RFC5424 structured-data log formats. Example: log-format-sd %{+Q,+E}o\ [exampleSDID@1234\ header=%[capture.req.hdr(0)]]	2016-02-12 13:36:47 +01:00
Dragan Dosen	0edd10925d	MINOR: standard: add function "escape_chunk" This function tries to prefix all characters tagged in the <map> with the <escape> character. The specified <chunk> contains the input to be escaped.	2016-02-12 13:36:47 +01:00
Thierry Fournier	9e7e3ea991	MINOR: lua: move common function This patch moves the function hlua_checkudata which check that an object contains the expected class_reference as metatable. This function is commonly used by all the lua functions. The function hlua_metatype is also moved.	2016-02-12 11:08:53 +01:00
Thierry Fournier	9312794ed7	MINOR: standard: add RFC HTTP date parser This parser takes a string containing an HTTP date. It returns a broken-down time struct. We must consider this time as GMT. Maybe later the timezone will be taken into account.	2016-02-12 11:08:53 +01:00
Thierry Fournier	fb0b5467ca	MINOR: lua: file dedicated to unsafe functions When Lua executes functions from its API, these can throw an error. These functions must be executed in a special environment which catches these errors, otherwise a critical error (like a segfault) can occur. This patch adds a C file called "hlua_fcn.c" which collects all the Lua/C functions needing a safe environment for their execution.	2016-02-12 11:08:53 +01:00
Thierry Fournier	8feaa661b6	MINOR: map: Add regex matching replacement This patch declares a new map which provides a string based on a string with back references replaced by the content matched by the regex.	2016-02-10 23:38:34 +01:00
Christopher Faulet	443ea1a242	MINOR: filters: Extract proxy stuff from the struct filter Now, filter's configuration (.id, .conf and .ops fields) is stored in the structure 'flt_conf'. So proxies own a flt_conf list instead of a filter list. When a filter is attached to a stream, it gets a pointer on its configuration. This avoids mixing the filter's context (owned by a stream) and its configuration (owned by a proxy). It also saves 2 pointers per filter instance.	2016-02-09 14:53:15 +01:00
Christopher Faulet	113f7decfc	MINOR: filters/http: Slightly update the parsing of chunks Now, http_parse_chunk_size and http_skip_chunk_crlf return the number of bytes parsed on success. http_skip_chunk_crlf does not use msg->sol anymore. On the other hand, http_forward_trailers is unchanged. It returns >0 if the end of trailers is reached and 0 if not. In all cases (except if an error is encountered), msg->sol contains the length of the last parsed part of the trailer headers. Internal doc and comments about msg->sol have been updated accordingly.	2016-02-09 14:53:15 +01:00
Christopher Faulet	3e7bc67722	MINOR: filters: Remove unused or useless stuff and do small optimizations	2016-02-09 14:53:15 +01:00
Christopher Faulet	da02e17d42	MAJOR: filters: Require explicit registration to filter HTTP body and TCP data Before, functions to filter HTTP body (and TCP data) were called from the moment at least one filter was attached to the stream. If no filter is interested in these data, this uselessly slows data parsing. A good example is the HTTP compression filter. Depending on request and response headers, the response compression can be enabled or not. So it is much better to call it only when enabled. So, now, to filter HTTP/TCP data, a filter must use the function register_data_filter. For TCP streams, this function can be called only once. But for HTTP streams, when needed, it must be called for each HTTP request or HTTP response. Only registered filters will be called during data parsing. At any time, a filter can be unregistered by calling the function unregister_data_filter.	2016-02-09 14:53:15 +01:00
Christopher Faulet	fcf035cb5a	MINOR: filters: Add stream_filters structure to hide filters info From the stream point of view, this new structure is opaque. It hides filters implementation details. So, impact for future optimizations will be reduced (well, we hope so...). Some small improvements have been made in filters.c to avoid useless checks.	2016-02-09 14:53:15 +01:00
Christopher Faulet	309c6418b0	MEDIUM: filters: Replace filter_http_headers callback by an analyzer This new analyzer will be called for each HTTP request/response, before the parsing of the body. It is identified by AN_FLT_HTTP_HDRS. Special care was taken about the following condition : * the frontend is a TCP proxy * filters are defined in the frontend section * the selected backend is a HTTP proxy So, this patch explicitly adds the AN_FLT_HTTP_HDRS analyzer on the request and the response channels when the backend is a HTTP proxy and when there are filters attached to the stream. This patch simplifies http_request_forward_body and http_response_forward_body functions.	2016-02-09 14:53:15 +01:00
Christopher Faulet	2fb2880caf	MEDIUM: filters: remove http_start_chunk, http_last_chunk and http_chunk_end For chunked HTTP requests/responses, the body filtering can be really expensive. In the worst case (many chunks of 1 byte), the filter overhead is 3 calls per chunk. While the http_data callback is useful, the others are just informative. So these callbacks have been removed. Of course, existing filters (trace and compression) have been updated accordingly. For the HTTP compression filter, the update is quite large. Its implementation is closer to the old one.	2016-02-09 14:53:15 +01:00
Christopher Faulet	3e34429515	MEDIUM: filters: Use macros to call filters callbacks to speed-up processing When no filter is attached to the stream, the CPU footprint due to the calls to filters_* functions is huge, especially for chunk-encoded messages. Using macros to check if we have some filters or not is a great improvement. Furthermore, instead of checking the filter list emptiness, we introduce a flag to know if filters are attached or not to a stream.	2016-02-09 14:53:15 +01:00
Christopher Faulet	92d3638d2d	MAJOR: filters/http: Rewrite the HTTP compression as a filter HTTP compression has been rewritten to use the filter API. This is more of a PoC than anything else for now. It allocates memory to work. So, if only for that, it should be rewritten. In the meantime, the implementation has been refactored to allow its use with other filters. However, there are limitations that should be respected: - No filter placed after the compression one is allowed to change input data (in 'http_data' callback). - No filter placed before the compression one is allowed to change forwarded data (in 'http_forward_data' callback). For now, these limitations are informal, so you should be careful when you use several filters. About the configuration, 'compression' keywords are still supported and must be used to configure the HTTP compression behavior. In absence of a 'filter' line for the compression filter, it is added in the filter chain when the first 'compression' line is parsed. This is an easy way to proceed when you do not use other filters. But if another filter exists, an error is reported so that the user must explicitly declare the filter. For example: listen tst ... compression algo gzip compression offload ... filter flt_1 filter compression filter flt_2 ...	2016-02-09 14:53:15 +01:00
Christopher Faulet	3d97c90974	REORG: filters: Prepare creation of the HTTP compression filter HTTP compression will be moved in a true filter. To prepare the ground, some functions have been moved in a dedicated file. Idea is to keep everything about compression algos in compression.c and everything related to the filtering in flt_http_comp.c. For now, a header has been added to help during the transition. It will be removed later. Unused empty ACL keyword list was removed. The "compression" keyword parser was moved from cfgparse.c to flt_http_comp.c.	2016-02-09 14:53:15 +01:00
Christopher Faulet	d7c9196ae5	MAJOR: filters: Add filters support This patch adds the support of filters in HAProxy. The main idea is to have a way to "easily" extend HAProxy by adding some "modules", called filters, that will be able to change HAProxy behavior in a programmatic way. To do so, many entry points have been added in code to let filters hook up to different steps of the processing. A filter must define a flt_ops structure (see include/types/filters.h for details). This structure contains all available callbacks that a filter can define: struct flt_ops { /* Callbacks to manage the filter lifecycle */ int (*init)(struct proxy *p); void (*deinit)(struct proxy *p); int (*check)(struct proxy *p); /* Stream callbacks */ void (*stream_start)(struct stream *s); void (*stream_accept)(struct stream *s); void (*session_establish)(struct stream *s); void (*stream_stop)(struct stream *s); /* HTTP callbacks */ int (*http_start)(struct stream *s, struct http_msg *msg); int (*http_start_body)(struct stream *s, struct http_msg *msg); int (*http_start_chunk)(struct stream *s, struct http_msg *msg); int (*http_data)(struct stream *s, struct http_msg *msg); int (*http_last_chunk)(struct stream *s, struct http_msg *msg); int (*http_end_chunk)(struct stream *s, struct http_msg *msg); int (*http_chunk_trailers)(struct stream *s, struct http_msg *msg); int (*http_end_body)(struct stream *s, struct http_msg *msg); void (*http_end)(struct stream *s, struct http_msg *msg); void (*http_reset)(struct stream *s, struct http_msg *msg); int (*http_pre_process)(struct stream *s, struct http_msg *msg); int (*http_post_process)(struct stream *s, struct http_msg *msg); void (*http_reply)(struct stream *s, short status, const struct chunk *msg); }; To declare and use a filter, in the configuration, the "filter" keyword must be used in a listener/frontend section: frontend test ... filter <FILTER-NAME> [OPTIONS...] 
The filter referenced by the <FILTER-NAME> must declare a configuration parser on its own name to fill flt_ops and filter_conf field in the proxy's structure. An example will be provided later to make it perfectly clear. For now, filters cannot be used in backend section. But this is only a matter of time. Documentation will also be added later. This is the first commit of a long list about filters. It is possible to have several filters on the same listener/frontend. These filters are stored in an array of at most MAX_FILTERS elements (defined in include/types/filters.h). Again, this will be replaced later by a list of filters. The filter API has been highly refactored. Main changes are: * Now, HA supports an infinite number of filters per proxy. To do so, filters are stored in a list. * Because filters are stored in a list, filters state has been moved from the channel structure to the filter structure. This is cleaner because there is no more info about filters in channel structure. * It is possible to define filters on backends only. For such filters, stream_start/stream_stop callbacks are not called. Of course, it is possible to mix frontend and backend filters. * Now, TCP streams are also filtered. All callbacks without the 'http_' prefix are called for all kind of streams. In addition, 2 new callbacks were added to filter data exchanged through a TCP stream: - tcp_data: it is called when new data are available or when old unprocessed data are still waiting. - tcp_forward_data: it is called when some data can be consumed. * New callbacks attached to channel were added: - channel_start_analyze: it is called when a filter is ready to process data exchanged through a channel. 2 new analyzers (a frontend and a backend) are attached to channels to call this callback. For a frontend filter, it is called before any other analyzer. For a backend filter, it is called when a backend is attached to a stream. So some processing cannot be filtered in that case. 
- channel_analyze: it is called before each analyzer attached to a channel, except analyzers responsible for data sending. - channel_end_analyze: it is called when all other analyzers have finished their processing. A new analyzer is attached to channels to call this callback. For a TCP stream, this is always the last one called. For a HTTP one, the callback is called when a request/response ends, so it is called one time for each request/response. * 'session_established' callback has been removed. Everything that is done in this callback can be handled by 'channel_start_analyze' on the response channel. * 'http_pre_process' and 'http_post_process' callbacks have been replaced by 'channel_analyze'. * 'http_start' callback has been replaced by 'http_headers'. This new one is called just before headers sending and parsing of the body. * 'http_end' callback has been replaced by 'channel_end_analyze'. * It is possible to set a forwarder for TCP channels. It was already possible to do it for HTTP ones. * Forwarders can partially consume forwardable data. For this reason a new HTTP message state was added before HTTP_MSG_DONE : HTTP_MSG_ENDING. Now all filters can define corresponding callbacks (http_forward_data and tcp_forward_data). Each filter owns 2 offsets relative to buf->p, next and forward, to track, respectively, input data already parsed but not forwarded yet by the filter and parsed data considered as forwarded by the filter. At any time, we have the guarantee that a filter cannot parse or forward more input than previous ones. And, of course, it cannot forward more input than it has parsed. 2 macros have been added to retrieve these offsets: FLT_NXT and FLT_FWD. In addition, 2 functions have been added to change the 'next size' and the 'forward size' of a filter. When a filter parses input data, it can alter these data, so the size of these data can vary. This action has an effect on all previous filters that must be handled. 
To do so, the function 'filter_change_next_size' must be called, passing the size variation. In the same spirit, if a filter alters forwarded data, it must call the function 'filter_change_forward_size'. 'filter_change_next_size' can be called in 'http_data' and 'tcp_data' callbacks and only these ones. And 'filter_change_forward_size' can be called in 'http_forward_data' and 'tcp_forward_data' callbacks and only these ones. The data changes are the filter's responsibility, but with some limitations. It must not change already parsed/forwarded data or data that previous filters have not parsed/forwarded yet. Because filters can be used on backends, when the backend is set for a stream, we add filters defined for this backend in the filter list of the stream. But we must only do that when the backend and the frontend of the stream are not the same. Else same filters are added a second time leading to undefined behavior. The HTTP compression code had to be moved. So it simplifies http_response_forward_body function. To do so, the way the data are forwarded has changed. Now, a filter (and only one) can forward data. In a commit to come, this limitation will be removed to let all filters take part in data forwarding. There are 2 new functions that filters should use to deal with this feature: * flt_set_http_data_forwarder: This function sets the filter (using its id) that will forward data for the specified HTTP message. It is possible if it was not already set by another filter _AND_ if no data was yet forwarded (msg->msg_state <= HTTP_MSG_BODY). It returns -1 if an error occurs. * flt_http_data_forwarder: This function returns the filter id that will forward data for the specified HTTP message. If there is no forwarder set, it returns -1. When an HTTP data forwarder is set for the response, the HTTP compression is disabled. Of course, this is not definitive.	2016-02-09 14:53:15 +01:00
Christopher Faulet	635c0adec2	BUG/MINOR: ssl: Be sure to use unique serial for regenerated certificates The serial number for a generated certificate was computed using the requested servername, without any variable/random part. It is not a problem as long as the certificate is not regenerated. But if the cache is disabled or when the certificate is evicted from the cache, we may need to regenerate it. It is important to not reuse the same serial number for the new certificate. Otherwise clients (especially browsers) trigger a warning because 2 certificates issued by the same CA have the same serial number. So now, the serial is a static variable initialized with now_ms (internal date in milliseconds) and incremented at each new certificate generation. (Ref MPS-2031)	2016-02-09 09:04:53 +01:00
Christopher Faulet	c34d19fc3c	BUG: stream_interface: Reuse connection even if the output channel is empty In function 'si_connect', an existing connection is reused (and considered as established) only when there are some pending data in the output channel. This can be a problem when filters are used, because a filter can choose to not forward data immediately. So when we try to initiate a connection to a server, the output channel can be empty. In this situation, if the connection already exists, it is not considered as established and nothing happens. If the stream interface is in the state SI_ST_ASS, this leads to an infinite loop in process_stream because it remains in this state. This patch fixes this problem. Now, in 'si_connect', we always reuse an existing connection, whether or not there are pending data in the output channel.	2016-02-03 14:22:55 +01:00
Willy Tarreau	581bf81d34	MEDIUM: pools: add a new flag to avoid rounding pool size up Usually it's desirable to merge similarly sized pools, which is the reason why their size is rounded up to the next multiple of 16. But for the buffers this is problematic because we add the size of struct buffer to the user-requested size, and the rounding results in 8 extra bytes that are usable in the end. So the user gets more bytes than asked for, and in case of SSL it results in short writes for the extra bytes that are sent above multiples of 16 kB. So we add a new flag MEM_F_EXACT to request that the size is not rounded up when creating the entry. Thus it doesn't disable merging.	2016-01-25 02:31:18 +01:00
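The rounding behaviour described in this commit can be illustrated with a minimal sketch. The function name `pool_entry_size` and the numeric value given to `MEM_F_EXACT` are assumptions made for illustration only; the flag name and the multiple of 16 come from the commit message.

```c
#include <stddef.h>

/* Hypothetical value for this sketch; only the flag name comes from the
 * commit message. */
#define MEM_F_EXACT 0x2u

/* Return the size a pool entry would be created with. Unless MEM_F_EXACT
 * is set, the size is rounded up to the next multiple of 16 so that
 * similarly sized pools can be merged. */
static size_t pool_entry_size(size_t requested, unsigned int flags)
{
    const size_t align = 16;

    if (flags & MEM_F_EXACT)
        return requested;
    return (requested + align - 1) & ~(align - 1);
}
```

With an illustrative request of 16408 bytes, the rounded size is 16416 (8 extra bytes), whereas the same request with `MEM_F_EXACT` stays at 16408.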
Willy Tarreau	999f643ed2	BUG/MEDIUM: channel: fix miscalculation of available buffer space. The function channel_recv_limit() relies on channel_reserved() which itself relies on channel_in_transit(). Individually they're OK but combined they're doing the wrong thing. The problem is that we refrain from filling buffers while to_forward is even much larger than the buffer because of a semantic issue along the call chain. This is particularly visible when offloading SSL on moderately large files (1 MB), though it is also visible on clear text. Twice the number of recv() calls are made compared to what is needed, and the typical performance drops by 15-20% in SSL in 1.6 and later, and no directly measurable drop in 1.5 except when using strace. There's no need for all these intermediate functions, so let's get rid of them and reimplement channel_recv_limit() from scratch in a safer way. This fix needs to be backported to 1.6 and 1.5 (at least). Note that in 1.5 the function is called buffer_recv_limit() and it may differ a bit.	2016-01-25 02:31:18 +01:00
Thiago Farina	b1af23ebea	MINOR: fix the return type for dns_response_get_query_id() function This function should return a 16-bit type as that is the type for dns header id. Also because it is doing an uint16 unpack big-endian operation. Backport: can be backported to 1.6 Signed-off-by: Thiago Farina <tfarina@chromium.org> Signed-off-by: Baptiste Assmann <bedis9@gmail.com>	2016-01-20 23:51:24 +01:00
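As a rough illustration of why a 16-bit return type fits here, the sketch below unpacks a big-endian 16-bit value from the start of a DNS message, where RFC 1035 places the query ID. The helper names `read_be16` and `dns_query_id_sketch` are hypothetical and are not the functions touched by the commit.

```c
#include <stdint.h>

/* Read a 16-bit big-endian value from two consecutive bytes. */
static uint16_t read_be16(const unsigned char *p)
{
    return (uint16_t)(((unsigned int)p[0] << 8) | p[1]);
}

/* Hypothetical helper: the query ID is the first 16 bits of the DNS
 * header (RFC 1035), so it is returned as a uint16_t. */
static uint16_t dns_query_id_sketch(const unsigned char *resp)
{
    return read_be16(resp);
}
```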
Baptiste Assmann	22c4ed6937	MINOR: lru: new function to delete <nb> least recently used keys Introduction of a new function in the LRU cache source file. The purpose of this function is to delete a number of entries from the cache. The number is defined by the caller, and the keys removed are taken from the tail of the tree.	2016-01-11 07:31:35 +01:00
Willy Tarreau	898529b4a8	MEDIUM: tools: add csv_enc_append() to preserve the original chunk We have csv_enc() but there's no way to append some CSV-encoded data to an existing chunk, so here we modify the existing function for this and create an inlined version of csv_enc() which first resets the output chunk. It will be handy to append data to an existing chunk without having to use an extra temporary chunk, or to encode multiple strings into a single chunk with chunk_newstr(). The patch is quite small, in fact most changes are typo fixes in the comments.	2016-01-06 20:58:55 +01:00
Willy Tarreau	70af633ebe	MINOR: chunk: make chunk_initstr() take a const string chunk_initstr() prepares a read-only chunk from a string of fixed length. Thus it must be prepared to accept a read-only string on the input, otherwise the caller has to force-cast some const char* and that's not a good idea.	2016-01-06 20:58:55 +01:00
Willy Tarreau	601360b41d	MINOR: chunks: add chunk_strcat() and chunk_newstr() These two new functions will make it easier to manipulate small strings from within functions, because at many places, multiple short strings are needed which do not deserve a malloc() nor a free(), and alloca() is often discouraged. Since we already have trash chunks, it's convenient to be able to allocate substrings from a chunk and use them later since our functions already perform all the length checks. chunk_newstr() adds a trailing zero at the end of a chunk and returns the pointer to the next character, which can be used as an independent string. chunk_strcat() does what it says.	2016-01-06 13:53:37 +01:00
Willy Tarreau	0b6044fa24	MINOR: chunks: ensure that chunk_strcpy() adds a trailing zero Since this function bears the name of a well-known string function, it must at least promise compatible semantics. Here it means always adding the trailing zero so that anyone willing to use chunk->str as a regular string can do it. Of course the zero is not counted in the chunk's length.	2016-01-06 13:53:37 +01:00
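The idea of carving several short strings out of one chunk can be sketched as follows. This is a simplified illustration under stated assumptions, not the project's code: `struct chunk_sketch`, `sketch_newstr()` and `sketch_strcat()` are hypothetical stand-ins that only follow the behaviour described in the commit message.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical, simplified chunk used only for this sketch. */
struct chunk_sketch {
    char *str;   /* storage area */
    size_t size; /* total bytes available in str */
    size_t len;  /* bytes currently used */
};

/* Terminate the current content with a zero and return a pointer to the
 * next free byte, where a new independent string may be built. Returns
 * NULL if there is no room left. */
static char *sketch_newstr(struct chunk_sketch *chk)
{
    if (chk->len + 1 >= chk->size)
        return NULL;
    chk->str[chk->len++] = '\0';
    return chk->str + chk->len;
}

/* Append the zero-terminated string <s>. The trailing zero is written but
 * not counted in the length. Returns 1 on success, 0 if it does not fit. */
static int sketch_strcat(struct chunk_sketch *chk, const char *s)
{
    size_t n = strlen(s);

    if (chk->len + n >= chk->size)
        return 0;
    memcpy(chk->str + chk->len, s, n + 1);
    chk->len += n;
    return 1;
}
```

A caller would append a first string, call `sketch_newstr()` to obtain the start of a second one, then append again; both strings then live in the same storage area without any malloc() or free().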
Willy Tarreau	f9476a5a30	BUG/MINOR: chunk: make chunk_dup() always check and set dst->size chunk_dup() was affected by two bugs at once related to dst->size : - first, it didn't check dst->size to know if it could free(dst->str), so using it on a statically allocated chunk would cause a free(constant) and crash the process ; - second, it didn't properly set dst->size, possibly causing smaller strings not to be properly reported in a chunk that was previously used for something else. Fortunately, neither of these situations ever happened since the function is rarely used. In the process of doing this, we even allocate one more byte for a trailing zero if the input chunk was not full, so that the copied string can safely be reused by standard string functions. The bug was introduced in 1.3.4 nine years ago with this commit : `0f77253` ("[MINOR] store HTTP error messages into a chunk array") It's better to backport this fix in case a future fix relies on it.	2016-01-04 20:47:27 +01:00
Christopher Faulet	a94e5a548c	MINOR: filters/http: Use a wrapper function instead of stream_int_retnclose The function http_reply_and_close has been added in proto_http.c to wrap calls to stream_int_retnclose. This function will be modified when the filters are added.	2015-12-28 16:49:36 +01:00
Thierry FOURNIER	ec9a58c709	BUILD/MINOR: regex: missing header When HAProxy is compiled with pcre, strlen() is used, but <string.h> is not included. This patch must be backported in 1.6	2015-12-22 13:36:01 +01:00
Thierry FOURNIER	ca98866bcf	BUG/MEDIUM: lua: Lua applets must not fetch samples using http_txn If a sample fetch needing http_txn is called from an HTTP Lua applet, the result will be invalid and may even cause a crash because some HTTP data can be forwarded and the HTTP txn is no longer valid. Here the solution is to ensure that a fetch called from Lua never needs http_txn. This is done thanks to a new flag HLUA_F_MAY_USE_HTTP which indicates whether or not it is safe to call a fetch which needs HTTP. This fix needs to be backported to 1.6.	2015-12-20 23:13:00 +01:00
Thierry FOURNIER	7fa0549a2b	REORG/MINOR: lua: convert boolean "int" to bitfield This patch converts a boolean "int" to a bitfield. The main reason is to save space in the struct if another flag is required. Note that this patch is required for the next fix and will need to be backported to 1.6.	2015-12-20 23:13:00 +01:00
Willy Tarreau	7006045e48	BUG/MEDIUM: config: properly adjust maxconn with nbproc when memmax is forced When memmax is forced using "-m", the per-process memory limit is enforced using setrlimit(), but this value is not used to compute the automatic maxconn limit. In addition, the per-process memory limit didn't consider the fact that the shared SSL cache only needs to be accounted once. The doc was also fixed to clearly state that "-m" is global and not per process. It makes sense because people who use -m want to protect the system's resources regardless of whatever appears in the configuration.	2015-12-14 13:03:09 +01:00
Willy Tarreau	9579d12f2e	BUILD/MINOR: http: proto_http.h needs sample.h Since commit `fd7edd3` ("MINOR: Move http method enum from proto_http to sample") proto_http.h needs to include sample.h. This can be backported to 1.6 though it doesn't affect existing code.	2015-11-26 10:24:48 +01:00
Thierry FOURNIER	1db96672c4	BUILD: freebsd: double declaration On FreeBSD, the macro LIST_PREV already exists in the header file <sys/queue.h>, and this causes a build error. This patch removes the macro before declaring it. This ensures that the error doesn't occur.	2015-11-06 01:15:02 +01:00
Baptiste Assmann	e9544935e8	BUG/MINOR: http rule: http capture 'id' rule points to a non existing id It is possible to create an http capture rule which points to a capture slot id which does not exist. The current patch prevents this when parsing the configuration and prevents running a configuration which contains such rules. This configuration is now invalid: frontend f bind :8080 http-request capture req.hdr(User-Agent) id 0 default_backend b this one as well: frontend f bind :8080 declare capture request len 32 # implicit id is 0 here http-request capture req.hdr(User-Agent) id 1 default_backend b It applies of course to both http-request and http-response rules.	2015-11-04 08:47:55 +01:00
James Brown	55f9ff11b5	MINOR: check: add agent-send server parameter Causes HAProxy to emit a static string to the agent on every check, so that you can independently control multiple services running behind a single agent port.	2015-11-04 07:26:51 +01:00
Thierry FOURNIER	c4eebc8157	BUG/MEDIUM: lua: sample fetches based on response doesn't work The direction (request or response) is not propagated in the sample fetches called through Lua. This patch adds the direction status in some structs (hlua_txn and hlua_smp) to make sure that the sample fetches will be called with all the information. The converters cannot access a TXN object, so they are not impacted by the direction. However, the samples used as input of the Lua converter wrapper are initialized with the direction. Thereby, the struct smp stays consistent. [wt: needs to be backported to 1.6]	2015-11-03 10:50:14 +01:00
Willy Tarreau	58102cf30b	MEDIUM: memory: add accounting for failed allocations We now keep a per-pool counter of failed memory allocations and we report that, as well as the amount of memory allocated and used on the CLI.	2015-10-28 16:24:21 +01:00
Willy Tarreau	de30a684ca	DEBUG/MEDIUM: memory: add optional control pool memory operations When DEBUG_MEMORY_POOLS is used, we now use the link pointer at the end of the pool to store a pointer to the pool, and to control it during pool_free2() in order to serve four purposes : - at any instant we can know what pool an object was allocated from when examining memory, hence how we should possibly decode it ; - it serves to detect double free when they happen, as the pointer cannot be valid after the element is linked into the pool ; - it serves to detect if an element is released in the wrong pool ; - it serves as a canary, to detect if some buffers experienced an overflow before being released. All these elements will definitely help better troubleshoot strange situations, or at least confirm that certain conditions did not happen.	2015-10-28 15:28:05 +01:00
Willy Tarreau	ac421118db	DEBUG/MEDIUM: memory: optionally protect free data in pools When debugging a core file, it's sometimes convenient to be able to visit the released entries in the pools (typically last released session). Unfortunately the first bytes of these entries are destroyed by the link elements of the pool. And of course, most structures have their most accessed elements at the beginning of the structure (typically flags). Let's add a build-time option DEBUG_MEMORY_POOLS which allocates an extra pointer in each pool to put the link at the end of each pool item instead of the beginning.	2015-10-28 15:27:59 +01:00
Willy Tarreau	a84dcb8440	DEBUG/MINOR: memory: add a build option to disable memory pools sharing Sometimes analysing a core file isn't easy due to shared memory pools. Let's add a build option to disable this. It's not enabled by default, it could be backported to older versions.	2015-10-28 15:27:55 +01:00
Andrew Hayworth	e6a4a329b8	MEDIUM: dns: Don't use the ANY query type Basically, it's ill-defined and shouldn't really be used going forward. We can't guarantee that resolvers will do the 'legwork' for us and actually resolve CNAMES when we request the ANY query-type. Case in point (obfuscated, clearly): PRODUCTION! ahayworth@secret-hostname.com:~$ dig @10.11.12.53 ANY api.somestartup.io ; <<>> DiG 9.8.4-rpz2+rl005.12-P1 <<>> @10.11.12.53 ANY api.somestartup.io ; (1 server found) ;; global options: +cmd ;; Got answer: ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 62454 ;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 4, ADDITIONAL: 0 ;; QUESTION SECTION: ;api.somestartup.io. IN ANY ;; ANSWER SECTION: api.somestartup.io. 20 IN CNAME api-somestartup-production.ap-southeast-2.elb.amazonaws.com. ;; AUTHORITY SECTION: somestartup.io. 166687 IN NS ns-1254.awsdns-28.org. somestartup.io. 166687 IN NS ns-1884.awsdns-43.co.uk. somestartup.io. 166687 IN NS ns-440.awsdns-55.com. somestartup.io. 166687 IN NS ns-577.awsdns-08.net. ;; Query time: 1 msec ;; SERVER: 10.11.12.53#53(10.11.12.53) ;; WHEN: Mon Oct 19 22:02:29 2015 ;; MSG SIZE rcvd: 242 HAProxy can't handle that response correctly. Rather than try to build in support for resolving CNAMEs presented without an A record in an answer section (which may be a valid improvement further on), this change just skips ANY record types altogether. A and AAAA are much more well-defined and predictable. Notably, this commit preserves the implicit "Prefer IPV6 behavior." Furthermore, ANY query type by default is a bad idea: (from Robin on HAProxy's ML): Using ANY queries for this kind of stuff is considered by most people to be a bad practice since besides all the things you named it can lead to incomplete responses. Basically a resolver is allowed to just return whatever it has in cache when it receives an ANY query instead of actually doing an ANY query at the authoritative nameserver. 
Thus if it only received queries for an A record before you do an ANY query you will not get an AAAA record even if it is actually available since the resolver doesn't have it in its cache. Even worse if before it only got MX queries, you won't get either A or AAAA	2015-10-20 22:31:01 +02:00
Willy Tarreau	a5c51ac6a6	BUILD: properly report when USE_ZLIB and USE_SLZ are used together Use #error here otherwise the errors are hard to spot for the casual user.	2015-10-13 16:47:16 +02:00
Willy Tarreau	163d4620c6	MEDIUM: server: implement TCP_USER_TIMEOUT on the server This is equivalent to commit `2af207a` ("MEDIUM: tcp: implement tcp-ut bind option to set TCP_USER_TIMEOUT") except that this time it works on the server side. The purpose is to detect dead server connections even when checks are rare, disabled, or after a soft reload (since checks are disabled there as well), and to ensure client connections will get killed faster.	2015-10-13 16:18:27 +02:00
Neale Ferguson	5e98e3e998	BUILD: enable build on Linux/s390x I would like to contribute the following fix to enable the Linux s390x platform. The fix was built against today's git master. I've attached the patch for review. Depending on your buildbot/jenkins/? requirements I can set up a virtual machine for automated building/testing of the package in this environment.	2015-10-12 20:58:51 +02:00
Christopher Faulet	7969a33a01	MINOR: ssl: Add support for EC for the CA used to sign generated certificates This is done by adding EVP_PKEY_EC type in supported types for the CA private key when we get the message digest used to sign a generated X509 certificate. So now, we support DSA, RSA and EC private keys. And to be sure, when the type of the private key is not directly supported, we get its default message digest using the function 'EVP_PKEY_get_default_digest_nid'. We also use the key of the default certificate instead of generating one. So we are sure to use the same key type instead of always using an RSA key.	2015-10-09 12:13:12 +02:00
Christopher Faulet	77fe80c0b4	MINOR: ssl: Release Servers SSL context when HAProxy is shut down [wt: could be backported to 1.5 as well]	2015-10-09 10:33:00 +02:00
Christopher Faulet	d2cab92e75	BUG/MINOR: ssl: fix management of the cache where forged certificates are stored First, the LRU cache must be initialized after the configuration parsing to correctly set its size. Next, the function 'ssl_sock_set_generated_cert' returns -1 when an error occurs (0 if success). In that case, the caller is responsible for freeing the memory allocated for the certificate. Finally, when an SSL certificate is generated by HAProxy but cannot be inserted in the cache, it must be freed when the SSL connection is closed. This happens when 'tune.ssl.ssl-ctx-cache-size' is set to 0.	2015-10-09 10:20:53 +02:00
Joseph Lynch	ffaf30b689	BUILD: Fix the build on OSX (htonll/ntohll) htonll and ntohll were defined in `5b4dd683cb` but on osx they are already defined in sys/_endian.h. So, we check if they are defined before declaring them. [wt: no backport needed]	2015-10-09 10:11:59 +02:00
Willy Tarreau	067ac9f4b6	MINOR: debug: enable memory poisonning to use byte 0 When debugging an issue, sometimes it can be useful to be able to use byte 0 to poison memory areas, resulting in the same effect as a calloc(). This patch changes the default mem_poison_byte to -1 to disable it so that all positive values are usable.	2015-10-08 14:12:13 +02:00
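The convention described here (a negative value disables poisoning, any value from 0 upward is a usable poison byte) can be sketched as below. The variable name `mem_poison_byte` and its default of -1 come from the commit message; the allocator wrapper `alloc_maybe_poisoned()` is a hypothetical helper written for this illustration.

```c
#include <stdlib.h>
#include <string.h>

/* -1 disables poisoning; any value in 0..255 is used as the fill byte. */
static int mem_poison_byte = -1;

/* Hypothetical wrapper: allocate <size> bytes and, when poisoning is
 * enabled, fill the area with the configured byte. With byte 0 the result
 * is the same as calloc(). */
static void *alloc_maybe_poisoned(size_t size)
{
    void *p = malloc(size);

    if (p && mem_poison_byte >= 0)
        memset(p, mem_poison_byte, size);
    return p;
}
```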
Thierry FOURNIER	ab95e656ea	MINOR: http/tcp: fill the avalaible actions This patch adds a function that generates the list of available actions for the error message.	2015-10-02 22:56:11 +02:00
Dragan Dosen	43885c728e	BUG/MEDIUM: logs: segfault writing to log from Lua Michael Ezzell reported a bug causing haproxy to segfault during startup when trying to send syslog message from Lua. The function __send_log() can be called with *p that is NULL and/or when the configuration is not fully parsed, as is the case with Lua. This patch fixes this problem by using individual vectors instead of the pre-generated strings log_htp and log_htp_rfc5424. Also, this patch fixes a problem causing haproxy to write the wrong pid in the logs -- the log_htp(_rfc5424) strings were generated at the haproxy start, but "pid" value would be changed after haproxy is started in daemon/systemd mode.	2015-10-02 00:57:45 +02:00
Thierry FOURNIER	10770faf8e	MEDIUM: lua: change the timeout execution Now, the Lua timeout is relative to the effective run time. When Lua is waiting for I/O, that time is not counted in the Lua run time.	2015-09-29 19:13:49 +02:00
Willy Tarreau	ae459f3b9f	BUILD: tcp: use IPPROTO_IP when SOL_IP is not available Dmitry Sivachenko reported a build failure on FreeBSD due to SOL_IP not being defined. IPPROTO_IP must be used there instead.	2015-09-29 18:19:32 +02:00
Dragan Dosen	5b78d9b437	MEDIUM: logs: pass the trailing "\n" as an iovec This patch passes the trailing "\n" as an iovec in the function __send_log(), so that we don't need to modify the original log message.	2015-09-28 18:31:09 +02:00
Dragan Dosen	c8cfa7b4f3	MEDIUM: logs: have global.log_send_hostname not contain the trailing space This patch unifies global.log_send_hostname addition in the log header processing.	2015-09-28 18:27:45 +02:00
David Carlier	5801a8247a	MINOR: global: Few new struct fields for da module The name and length of the client cookie, useful for the cookie value extraction function, and a simple bitfield to indicate whether it is set or not.	2015-09-28 14:01:27 +02:00
David Carlier	4686f792b4	MINOR: proto_http: Externalisation of previously internal functions Needs to expose the HTTP headers 'iterator' and the client's cookie value extraction functions.	2015-09-28 14:01:27 +02:00
Dragan Dosen	0b85ecee53	MEDIUM: logs: add a new RFC5424 log-format for the structured-data This patch adds a new RFC5424-specific log-format for the structured-data that is automatically sent by __send_log() when the sender is in RFC5424 mode. A new statement "log-format-sd" should be used in order to set log-format for the structured-data part in RFC5424 formatted syslog messages. Example: log-format-sd [exampleSDID@1234\ bytes=\"%B\"\ status=\"%ST\"]	2015-09-28 14:01:27 +02:00
Dragan Dosen	1322d09a6f	MEDIUM: logs: add support for RFC5424 header format per logger The function __send_log() iterates over senders and passes the header as the first vector to sendmsg(), thus it can send a logger-specific header in each message. A new logger argument "format rfc5424" should be used in order to enable RFC5424 header format. For example: log 10.2.3.4:1234 len 2048 format rfc5424 local2 info	2015-09-28 14:01:27 +02:00
Dragan Dosen	68d2e3a742	MEDIUM: logs: remove the hostname, tag and pid part from the logheader At the moment we have to call snprintf() for every log line just to rebuild a constant. Thanks to sendmsg(), we send the message in 3 parts: time-based header, proxy-specific hostname+log-tag+pid, session-specific message.	2015-09-28 14:01:27 +02:00
Dragan Dosen	59cee973cd	MEDIUM: log: use a separate buffer for the header and for the message Make sendmsg() use two vectors, one for the message header that is updated by update_log_hdr() and one for the message buffer.	2015-09-28 14:01:27 +02:00
Dragan Dosen	609ac2ab6c	MEDIUM: log: replace sendto() with sendmsg() in __send_log() This patch replaces sendto() with sendmsg() in __send_log() and makes use of an iovec to send the log message.	2015-09-28 14:01:27 +02:00
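The move from sendto() to sendmsg() with an iovec lets a log line be assembled from separate pieces without copying them into one buffer. The sketch below shows the general technique only: `send_log_sketch()` is a hypothetical function, and splitting the line into a header, a message and a trailing newline vector is an assumption inspired by the related log commits in this list.

```c
#include <string.h>
#include <sys/types.h>
#include <sys/socket.h>
#include <sys/uio.h>

/* Hypothetical helper: send <hdr>, <msg> and a trailing newline as a single
 * message on socket <fd>, using one iovec entry per part so that neither
 * string has to be modified or copied. Returns the number of bytes sent,
 * or -1 on error (as reported by sendmsg()). */
static ssize_t send_log_sketch(int fd, const char *hdr, const char *msg)
{
    struct iovec iov[3];
    struct msghdr mh;

    iov[0].iov_base = (void *)hdr;
    iov[0].iov_len  = strlen(hdr);
    iov[1].iov_base = (void *)msg;
    iov[1].iov_len  = strlen(msg);
    iov[2].iov_base = (void *)"\n";
    iov[2].iov_len  = 1;

    memset(&mh, 0, sizeof(mh));
    mh.msg_iov    = iov;
    mh.msg_iovlen = 3;

    return sendmsg(fd, &mh, 0);
}
```

On a connected datagram socket, the three vectors arrive as one datagram, which is what makes a per-logger header practical: only the first vector needs to change between destinations.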
David Carlier	60deeba090	MINOR: chunk: New function free_trash_buffers() This new function is meant to be called in the general deinit phase, to free those two internal chunks.	2015-09-28 14:00:00 +02:00
David Carlier	845efb53c7	MINOR: cfgparse: New function cfg_unregister_sections() This new function is meant to be called during the general deinit phase. During the configuration parsing, the section entries are all allocated. This new function frees them.	2015-09-28 14:00:00 +02:00
Willy Tarreau	270978492c	MEDIUM: config: set tune.maxrewrite to 1024 by default The tune.maxrewrite parameter used to be pre-initialized to half of the buffer size since the very early days when buffers were very small. It has grown to absurdly large values over the years to reach 8kB for a 16kB buffer. This prevents large requests from being accepted, which is the opposite of the initial goal. Many users fix it to 1024 which is already quite large for header addition. So let's change the default setting policy : - pre-initialize it to 1024 - let the user tweak it - in any case, limit it to tune.bufsize / 2 This results in 15kB usable to buffer HTTP messages instead of 8kB, and doesn't affect existing configurations which already force it.	2015-09-28 13:59:41 +02:00
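The default policy listed in this commit (start from 1024, let the user override it, never exceed half the buffer size) can be sketched as a small function. The name `effective_maxrewrite()` and the convention that a value of 0 or less means "not configured" are assumptions made for this illustration.

```c
/* Hypothetical helper: compute the rewrite reserve for a given buffer size.
 * <configured> is the user-supplied value; 0 or less is assumed here to
 * mean "not set", in which case the default of 1024 applies. In all cases
 * the result is limited to half of the buffer size. */
static int effective_maxrewrite(int configured, int bufsize)
{
    int value = configured > 0 ? configured : 1024;

    if (value > bufsize / 2)
        value = bufsize / 2;
    return value;
}
```

With a 16384-byte buffer and no explicit setting, the reserve is 1024 bytes, leaving 15360 bytes (15 kB) for the message, compared with 8192 bytes under the old half-buffer default.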
Thierry FOURNIER	a30b5dbf85	MINOR: lua: add AppletHTTP class and service This class is used by Lua code for running as an applet called in HTTP mode. It also defines the associated Lua service.	2015-09-28 01:03:48 +02:00
Thierry FOURNIER	f0a64b676f	MINOR: lua: add AppletTCP class and service This class is used by Lua code for running as an applet called in TCP mode. It also defines the Lua service.	2015-09-28 01:03:48 +02:00
Thierry FOURNIER	5a363e71b2	MINOR: stream/applet: add use-service action This new target can be called from the frontend or the backend. It is evaluated just before the backend choice and just before the server choice. So, the input stream or HTTP request can be forwarded to a server or to an internal service.	2015-09-28 01:03:48 +02:00
Thierry FOURNIER	1245a8318a	MINOR: applet: add an execution timeout This patch only declares the execution timeout variable. The applet may respect or ignore it. For example, an applet which performs some network accesses and doesn't control its execution time should use this timeout.	2015-09-28 01:03:48 +02:00
Thierry FOURNIER	c069cfdfea	MINOR: applet: add init function This patch adds an init function for the applet. The standard HAProxy applets (peers, stats, lua) have no standard way to be initialised. This new function pointer aims to standardize the applet initialization.	2015-09-28 01:03:48 +02:00
Willy Tarreau	acc980036f	MEDIUM: action: add a new flag ACT_FLAG_FIRST This flag is used by custom actions to know that they're called for the first time. The only case where it's not set is when they're resuming from a yield. It will be needed to let them know when they have to allocate some resources.	2015-09-27 23:34:39 +02:00
Thierry FOURNIER	7c39ab4ac2	OPTIM/MEDIUM: lua: executes the garbage collector only when using cosocket The garbage collector is a little bit heavy to run, and it was added only for cosockets. This patch prevents useless executions when no cosockets are used.	2015-09-27 22:56:40 +02:00
Thierry FOURNIER	7fe3be7281	MINOR: standard: avoid DNS resolution from the function str2sa_range() This patch blocks the DNS resolution in the function str2sa_range(), this is useful if the function is used during the HAProxy runtime.	2015-09-27 15:04:32 +02:00
Willy Tarreau	c1b10d38d7	MEDIUM: actions: add new flag ACT_FLAG_FINAL to notify about last call This new flag indicates to a custom action that it must not yield because it will not be called anymore. This addresses an issue introduced by commit `bc4c1ac` ("MEDIUM: http/tcp: permit to resume http and tcp custom actions"), which made it possible to yield even after the last call and causes Lua actions not to be stopped when the session closes. Note that the Lua issue is not fixed yet at this point. Also only TCP rules were handled, for now HTTP rules continue to let the action yield since we don't know whether or not it is a final call.	2015-09-27 11:04:06 +02:00
Willy Tarreau	658b85b68d	MEDIUM: actions: pass a new "flags" argument to custom actions Since commit `bc4c1ac` ("MEDIUM: http/tcp: permit to resume http and tcp custom actions"), some actions may yield and be called back when new information is available. Unfortunately some of them may continue to yield because they simply don't know that it's the last call from the rule set. For this reason we'll need to pass a flag to the custom action to pass such information and possibly others at the same time.	2015-09-27 11:04:06 +02:00
Willy Tarreau	1895428ef4	DEBUG: add p_malloc() to return a poisonned memory area This one is useful to detect improperly initialized memory areas when some suspicious malloc() are involved in random behaviours.	2015-09-26 01:28:43 +02:00
Thierry FOURNIER	fd50f0bcc8	MINOR: http: split initialization The goal is to export the http txn initialisation functions for using it in the Lua code.	2015-09-25 23:39:48 +02:00
Thierry FOURNIER	8d16de0ad0	MINOR: http: export function http_msg_analyzer() Exports the function http_msg_analyzer().	2015-09-25 23:39:43 +02:00
Thierry FOURNIER	127169e7f5	MINOR: http: export the get_reason() function This patch exports the get_reason() function.	2015-09-25 23:39:36 +02:00
Thierry FOURNIER	3c3317849f	MINOR: http: export http_get_path() function This patch simply exports the http_get_path() function from the proto_http.c file.	2015-09-25 23:39:27 +02:00
Thierry FOURNIER	0efc94cbb4	CLEANUP: lua: align defines Align the defined values	2015-09-25 23:28:02 +02:00
Thierry FOURNIER	27929fbfd7	MINOR: channel: rename function chn_sess to chn_strm The name of the function chn_sess is no longer appropriate. This patch renames it to chn_strm.	2015-09-25 23:27:33 +02:00
Willy Tarreau	958f0742a2	BUG/MEDIUM: stream-int: avoid double-call to applet->release While the SI_ST_DIS state is set after doing the close on a connection, it was set before calling release on an applet. Applets have no internal flags contrary to connections, so they have no way to detect they were already released. Because of this it happened that applets were closed twice, once via si_applet_release() and once via si_release_endpoint() at the end of a transaction. The CLI applet could perform a double free in this case, though the situation to cause it is quite hard because it requires that the applet is stuck on output in states that produce very few data. In order to solve this, we now assign the SI_ST_DIS state after calling ->release, and we refrain from doing so if the state is already assigned. This makes applets work much more like connections and definitely avoids this double release. In the future it might be worth making applets have their own flags like connections to carry their own state regardless of the stream interface's state, especially when dealing with connection reuse. No backport is needed since this issue was caused by the rearchitecture in 1.6.	2015-09-25 21:16:03 +02:00
Willy Tarreau	aa977ba205	MINOR: stream-int: rename si_applet_done() to si_applet_wake_cb() This function is a callback made only for calls from the applet handler. Rename it to remove confusion. It's currently called from the Lua code but that's not correct, we should call the notify and update functions instead otherwise it will not enable the applet again.	2015-09-25 21:16:02 +02:00
Willy Tarreau	335520305c	MEDIUM: stream-int: completely remove stream_int_update_embedded() This one is not needed anymore as what it used to do is either completely covered by the new stream_int_notify() function, or undesired and inherited from the past as a side effect of introducing the connections. This update is theoretically never called since it's assigned only when nothing is connected to the stream interface. However a test has been added to si_update() to stay safe if some foreign code decides to call si_update() in unsafe situations.	2015-09-25 21:16:02 +02:00
Willy Tarreau	615f28bec1	MINOR: stream-int: implement the stream_int_notify() function stream_int_notify() was taken from the common part between si_conn_wake_cb() and si_applet_done(). It is designed to report activity to a stream from outside its handler. It'll generally be used by lower layers to report I/O completion but may also be used by remote streams if the buffer processing is shared.	2015-09-25 21:16:02 +02:00
Willy Tarreau	cbc32601a6	MINOR: stream-int: export stream_int_update_* Not only these functions were not static, but we'll also want to export them.	2015-09-25 18:07:16 +02:00
Willy Tarreau	5d5b2fecac	MEDIUM: stream-int: call stream_int_update() from si_update() Now the call to stream_int_update() is moved to si_update(), which is exclusively called from the stream, so that the socket layer may be updated without updating the stream layer. This will later permit to call it individually from other places (other tasks or applets for example).	2015-09-25 18:07:16 +02:00
Willy Tarreau	25f1310f33	MINOR: stream-int: implement a new stream_int_update() function This function is designed to be called from within the stream handler to update the channels' expiration timers and the stream interface's flags based on the channels' flags. It needs to be called only once after the channels' flags have settled down, and before they are cleared, though it doesn't harm to call it as often as desired (it just slightly hurts performance). It must not be called from outside of the stream handler, as what it does will be used to compute the stream task's expiration. The code was taken directly from stream_int_update_applet() and stream_int_update_conn() which had exactly the same one except for applet-specific or connection-specific status update.	2015-09-25 18:07:16 +02:00
Willy Tarreau	64bca9d36a	MINOR: applet: rename applet_runq to applet_active_queue This is not a real run queue and we're facing ugly bugs because of this: if an applet removes another applet from the queue, typically the next one after itself, the list iterator loops forever because the list's backup pointer is not valid anymore. Before creating a run queue, let's rename this list.	2015-09-25 18:02:44 +02:00
Thierry FOURNIER	85c6c97830	MINOR: action: add reference to the original keyword matched for the called parser. This is useful because the keyword can contain some configuration data set during the keyword registration.	2015-09-23 21:44:23 +02:00
Thierry FOURNIER	7ea160cb5f	MINOR: action: add private configuration This private configuration pointer is used for storing some configuration data associated with the keyword, so that many keywords can use the same parse function, and this one can use a discriminator.	2015-09-23 21:44:23 +02:00
Willy Tarreau	0fd99094c3	CLEANUP: stream-int: remove obsolete function si_applet_call() This one is not used anymore and is bogus due to the way applets now work. Remove it instead of fixing it before someone finds it fun to use it.	2015-09-23 13:42:08 +02:00
Willy Tarreau	f7ead61388	BUG/MINOR: args: add name for ARGT_VAR Commit `4834bc7` ("MEDIUM: vars: adds support of variables") introduced ARGT_VAR but forgot to put it in the names array. No backport needed.	2015-09-21 20:57:12 +02:00
Willy Tarreau	a68f7629dd	BUG/MEDIUM: stick-tables: fix double-decrement of tracked entries Mailing list participant "mlist" reported negative conn_cur values in stick tables as the result of "tcp-request connection track-sc". The reason is that after the stick entry is copied from the session to the stream, both the session and the stream grab a reference to the entry and when the stream ends, it decrements one reference and one connection, then the same is done for the session. In fact this problem was already encountered slightly differently in the past and addressed by Thierry using the patch below as it was believed by then to be only a refcount issue since it was the observable symptom : `827752e` "BUG/MEDIUM: stick-tables: refcount error after copying SC..." In reality the problem is that the stream must touch neither the refcount nor the connection count for entries it inherits from the session. While we have no way to tell whether a track entry was inherited from the session (since they're simply memcpy'd), it is possible to prevent the stream from touching an entry that already exists in the session because that's a guarantee that it was inherited from it. Note that it may be a temporary fix. Maybe in the future when a session gives birth to multiple streams we'll face a situation where a session may be updated to add more tracked entries after instantiating some streams. The correct long-term fix is to mark some tracked entries as shared or private (or RO/RW). That will allow the session to track more entries even after the same trackers are being used by early streams. No backport is needed, this is only caused by the session/stream split in 1.6.	2015-09-21 17:48:24 +02:00
James Rosewell	10790964b0	MINOR: global: Added new fields for 51Degrees device detection Added support for version 3.2 of 51Degrees C library. Added fields to store HTTP header names important to device detection other than User-Agent. Included a pool of worksets for use with Pattern device detection.	2015-09-21 12:10:41 +02:00
James Rosewell	91a41cb32d	MINOR: http: made CHECK_HTTP_MESSAGE_FIRST accessible to other functions Added the definition of CHECK_HTTP_MESSAGE_FIRST and the declaration of smp_prefetch_http to the header. Changed smp_prefetch_http implementation to remove the static qualifier.	2015-09-21 12:05:26 +02:00
Baptiste Assmann	e11cfcd2c9	MINOR: config: new backend directives: load-server-state-from-file and server-state-file-name This directive gives HAProxy the ability to use either the global server-state-file directive or a local one using server-state-file-name to load server states. The state can be saved right before the reload by the init script, using the "show servers state" command on the stats socket and redirecting its output into a file.	2015-09-19 17:05:28 +02:00
Baptiste Assmann	e0882263e0	MINOR: config: new global section directive: server-state-file This new global section directive is used to store the path to the file where HAProxy will be able to retrieve server states across reloads. The file pointed to by this path stores the state of all servers from all backends.	2015-09-19 17:05:27 +02:00
Baptiste Assmann	6bc89366bb	MINOR: config: new global directive server-state-base This new global directive can be used to provide a base directory where all the server state files could be loaded. If a server state file name starts with a slash '/', then this directive must not be applied.	2015-09-19 17:05:26 +02:00
Baptiste Assmann	2828946cb5	MINOR: cli: new stats socket command: show servers state New command 'show servers state' which dumps all variable parameters of a server during an HAProxy process life. The purpose is to dump the current server state at run time in order to read it back right after the reload. The format of the output is versioned and we support version 1 for now.	2015-09-19 16:52:46 +02:00
Baptiste Assmann	41472f4d98	MINOR: server: Macro definition for server-state Introduces a few new macros used by server state save and application across reloads: - currently used state server file format version - currently used state server file header fields - MIN and MAX value for version number - maximum number of fields that could be found in a server-state file - an arbitrary state-file max line length	2015-09-19 11:34:41 +02:00
Baptiste Assmann	54a4730c65	BUG/MAJOR: can't enable a server through the stat socket When a server is disabled in the configuration using the "disabled" keyword, a single flag is set: SRV_ADMF_CMAINT (it used to be SRV_ADMF_FMAINT). That said, when providing the first version of this code, we also changed the SRV_ADMF_MAINT mask to match any of the possible MAINT cases: SRV_ADMF_FMAINT, SRV_ADMF_IMAINT, SRV_ADMF_CMAINT. Since SRV_ADMF_CMAINT is never (and is not supposed to be) altered at run time, once a server has this flag set up, it can never ever be enabled again using the stats socket. In order to fix this, we should: - consider SRV_ADMF_CMAINT as a simple flag to report the state in the old configuration file (will be used after a reload to deduce the state of the server in a new running process) - enable both SRV_ADMF_CMAINT and SRV_ADMF_FMAINT when the keyword "disabled" is in use in the configuration - update the mask SRV_ADMF_MAINT as it was before, to only match SRV_ADMF_FMAINT and SRV_ADMF_IMAINT. The following patch performs the changes above. It allows fixing the regression without breaking the way the upcoming feature (seamless server state across reloads) is going to work. Note: this is 1.6-only, no backport needed.	2015-09-18 12:38:23 +02:00
Thierry FOURNIER	007ff4ce0f	MINOR: build: missing header Regex header file is missing in types/action.h Reported by Conrad Hoffmann: I cannot build the current dev's master HEAD (`ec3c37d`) because of this error: > In file included from include/proto/proto_http.h:26:0, > from src/stick_table.c:26: > include/types/action.h:102:20: error: field ‘re’ has incomplete type > struct my_regex re; /* used by replace-header and replace-value */ > ^ > Makefile:771: recipe for target 'src/stick_table.o' failed > make: *** [src/stick_table.o] Error 1 The struct act_rule defined in action.h includes a full struct my_regex without #include-ing regex.h. Both gcc 5.2.0 and clang 3.6.2 do not allow this.	2015-09-11 17:34:56 +02:00
Thierry FOURNIER	ec3c37db3b	CLEANUP: actions: misplaced includes The includes of action.h are misplaced.	2015-09-10 21:17:04 +02:00
Baptiste Assmann	0453a1dd45	MINOR: dns: new flag to report that no IP can be found in a DNS response packet Some DNS response may be valid from a protocol point of view but may not contain any IP addresses. This patch gives a new flag to the function dns_get_ip_from_response to report such case. It's up to the upper layer to decide what to do with this information.	2015-09-10 15:42:55 +02:00
Baptiste Assmann	96972bcd36	MINOR: dns: no expected DNS record type found Some DNS responses may be valid from a protocol point of view, but may not contain any information considered interesting by the requester. The purpose of the flag DNS_RESP_NO_EXPECTED_RECORD introduced by this patch is to allow reporting such a situation. When this happens, a new DNS query is sent with a new query type. For now, the function only expects A and AAAA query types, which is enough to cover current cases. In the near future, it will be up to the caller to tell the function which query types are expected.	2015-09-10 15:41:53 +02:00
Willy Tarreau	72b8c1f0aa	MEDIUM: tools: make str2sa_range() optionally return the FQDN The function does a bunch of things among which resolving environment variables, skipping address family specifiers and trimming port ranges. It is the only one which sees the complete host name before trying to resolve it. The DNS resolving code needs to know the original hostname, so we modify this function to optionally provide it to the caller. Note that the function itself doesn't know if the host part was a host or an address, but str2ip() knows that and can be asked not to try to resolve. So we first try to parse the address without resolving and try again with resolving enabled. This way we know if the address is explicit or needs some kind of resolution.	2015-09-08 15:50:19 +02:00
Baptiste Assmann	0df5d9669a	MINOR: dns: New DNS response analysis code: DNS_RESP_TRUNCATED This patch introduces a new internal response state about the analysis of a DNS response received by a server. It is dedicated to report to above layer that the response is 'truncated'.	2015-09-08 14:58:07 +02:00
Baptiste Assmann	6cdea9359b	MINOR: dns: dns_nameserver structure update: new counter for truncated response This patch updates the dns_nameserver structure to integrate a counter dedicated to 'truncated' responses sent by servers. Such responses are important to track, since HAProxy is supposed to replay its request.	2015-09-08 14:57:28 +02:00
Baptiste Assmann	042d0a1f8c	MINOR: dns: new bitmasks to use against DNS flags Current DNS client code implementation doesn't take care of response flags setup by the server. This patch introduces a couple of bitmasks one can use to retrieve the truncated flag and the reply code available in the 2-bytes flag field.	2015-09-08 12:14:03 +02:00
Baptiste Assmann	189363e35a	MINOR: dns: dns_resolution structure update: time_t to unsigned int 3 variables of the dns_resolution structure are set to 'time_t' type. Since they are all set by 'now_ms' and used as 'ticks' in HAProxy's internals, it is safer to set them to the same type as now_ms: 'unsigned int'.	2015-09-08 10:50:17 +02:00
Willy Tarreau	b7ce424be2	BUG/MINOR: http: remove stupid HTTP_METH_NONE entry When converting the "method" fetch to a string, we used to get an empty string if the first character was not an upper case. This was caused by the lookup function which returns HTTP_METH_NONE when a lookup is not possible, and this method being mapped to an empty string in the array. This is a totally stupid mechanism, there's no reason for having the result depend on the first char. In fact the message parser already checks that the syntax matches an HTTP token so we can only land there with a valid token, hence only HTTP_METH_OTHER should be returned. This fix should be backported to all actively supported branches.	2015-09-03 17:15:21 +02:00
Thierry FOURNIER	42148735bc	MEDIUM: actions: remove ACTION_STOP Before this patch, two types of custom actions existed: ACT_ACTION_CONT and ACT_ACTION_STOP. ACT_ACTION_CONT is a non-terminal action and ACT_ACTION_STOP is a terminal action. Note that ACT_ACTION_STOP is not used in HAProxy. This patch removes this distinction. Only one type of custom action now exists, and it is called ACT_CUSTOM. The custom action can now return a code indicating the required behavior: ACT_RET_CONT asks HAProxy to continue the current rule list evaluation, and ACT_RET_STOP asks HAProxy to stop the current rule list evaluation.	2015-09-02 18:36:38 +02:00
Thierry FOURNIER	35d70efc33	MINOR: http: Action for manipulating the returned status code. This patch is inspired by Bowen Ni's proposal and it is based on his first implementation: With Lua integration in HAProxy 1.6, one can change the request method, path, uri, header, response header etc except response line. I'd like to contribute the following methods to allow modification of the response line. [...] There are two new keywords in 'http-response' that allows you to rewrite them in the native HAProxy config. There are also two new APIs in Lua that allows you to do the same rewriting in your Lua script. Example: Use it in HAProxy config: http-response set-code 404 Or use it in Lua script: txn.http:res_set_reason("Redirect") I don't take the full patch because the manipulation of the "reason" is useless: standard reasons are associated with each returned code, and unknown codes can take a generic reason. So this patch can set the status code, and the reason is automatically adapted.	2015-08-27 14:29:44 +02:00
Thierry FOURNIER	0a99b89531	MINOR: lua: add core.done() function This function immediately gives control back to the HAProxy core.	2015-08-27 11:27:29 +02:00
Willy Tarreau	29fbe51490	MAJOR: tproxy: remove support for cttproxy This was the first transparent proxy technology supported by haproxy circa 2005 but it was obsoleted in 2007 by Tproxy 4.0 which removed a lot of the earlier versions' shortcomings and was finally merged into the kernel. Since nobody has been using cttproxy for many years now and nobody has even just tried to compile the files, it's time to remove it. The doc was updated as well.	2015-08-20 19:35:14 +02:00
Thierry FOURNIER	e0627bdae0	MINOR: stick-tables: Add GPC0 actions This patch adds access to GPC0 through http and tcp actions	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	236657b5e0	MINOR: stick-tables: Add GPT0 access This patch adds access to GPT0. The access can be done with http and tcp actions, and through a converter.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	3cf1111eb3	MEDIUM: stick-tables: Add GPT0 in the stick tables This patch adds GPT0 (General Purpose Tag 0). It is useful for storing a simple state.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	7e25df3f17	MAJOR: stick-tables: remove key storage from the key struct Now, the key struct only points to the storage provided by the sample as input.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	5d24ebc3d7	MEDIUM: stick-tables: use the sample type names This patch removes the special stick table type names and uses the standard sample type names. This avoids the maintenance of two sets of types and removes the switch/case for matching a sample type for each stick table type.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	bc8c404449	MAJOR: stick-tables: use sample types in place of dedicated types This patch is the first step for sample integration. Currently the stick tables use their own data types, and some converters must be called to convert sample types to stick-table types. This patch removes the stick-table types and replaces them by the sample types. This: - avoids the maintenance of two types of converters - reduces the code using the sample converters	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	afa80496db	MEDIUM: actions: Normalize the return code of the configuration parsers This patch normalizes the return code of the configuration parsers. Before these changes, the tcp action parser returned -1 on failure and 0 on success. The http action parser returned 0 on failure and 1 on success. The normalization uses: - ACT_RET_PRS_OK for success - ACT_RET_PRS_ERR for failure	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	322a124867	MINOR: actions: mutualise the action keyword lookup Each (http|tcp)-(request|response) action uses the same method for looking up the action keyword during the configuration parsing. This patch mutualizes the code.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	c2bb050f7f	MINOR: proto_tcp: proto_tcp.h is now useless After removing the keyword register from types/proto_tcp.h, the header file remains empty. This patch removes it.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	36481b8667	MEDIUM: actions: Merge (http|tcp)-(request|response) keywords structs This patch merges the configuration keyword structs. Each declared configuration keyword struct is similar to the others. This patch simplifies the code.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	24ff6c6fce	MEDIUM: actions: Add standard return code for the action API An action function can return 3 statuses: - error if the action encounters a fatal error (like out of memory) - yield if the action must finish its work later - continue in other cases	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	0ea5c7fafa	MINOR: actions: change actions names For performance considerations, some actions are not processed by a remote function. They are directly processed by the function. Some of these actions do the same things but for different processing parts (request / response). This patch gives the same name to the same actions, and changes the normalization of the other action names. This patch is ONLY a rename, it doesn't modify the code.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	91f6ba0f2c	MINOR: actions: Declare all the embedded actions in the same header file This patch groups the action names in one file. Some actions are called many times and need an action embedded in the action caller. The main goal is to have only one header file grouping all definitions.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	22e49011b1	MINOR: actions: remove the mark indicating the last entry in enum This mark makes it possible to detect whether the action tag is over the allowed range. - Normally, this case doesn't appear - If it appears, it is processed by the default case of the switch	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	5563e4b469	MINOR: actions: add "from" information This struct member is used to specify who the rule caller is. It makes it possible to use one function for different callers.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	d0d65aeab6	MEDIUM: capture: Move the capture configuration storage in the union This patch moves the capture configuration struct (capture_prm) into the main "arg" union. This reduces the size of the struct.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	5ec63e008d	MEDIUM: track-sc: Move the track-sc configuration storage in the union This patch moves the track-sc configuration struct (track_ctr_prm) into the main "arg" union. This reduces the size of the struct.	2015-08-20 17:13:47 +02:00
Thierry FOURNIER	e209797ef0	MINOR: proto_http: replace generic opaque types by real used types in "http_capture" by id This patch removes the generic opaque type for storing the configuration of the action "http_capture" by id.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	32b15003fe	MINOR: proto_http: replace generic opaque types by real used types in "http_capture" This patch removes the generic opaque type for storing the configuration of the action "http_capture".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	8855a92d8c	MINOR: proto_http: replace generic opaque types by real used types for the actions on the request line This patch removes the generic opaque type for storing the configuration of the actions "set-method", "set-path", "set-query" and "set-uri".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	a002dc9df8	MINOR: proto_http: use an "expr" type in place of generic opaque type. This patch removes the generic opaque type for storing the configuration of the action "set-src" (HTTP_REQ_ACT_SET_SRC), and uses the dedicated type "struct expr".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	f8c1dcea01	MINOR: vars: use the vars types as argument in place of opaque type The (http|tcp)-(request|response) action rules use a common opaque type. For the HAProxy embedded features, the types are known, so it is better to add these types to the action union and use them.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	231ef1d99c	MINOR: lua: use the hlua_rule type in place of opaque type The (http|tcp)-(request|response) action rules use a common opaque type. For the HAProxy embedded features, the types are known, so it is better to add these types to the action union and use them.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	7677f400f5	MINOR: actions: Remove the data opaque pointer This patch removes the "data" opaque pointer and replaces it by the generic opaque pointer "p[0]".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	a28a9429b2	MEDIUM: actions: Merge (http|tcp)-(request|response) action structs This patch is the first of a series which merges all the action structs. The functions "tcp-request content", "tcp-response-content", "http-request" and "http-response" have the same values and the same process for some defined actions, but the struct and the prototype of the declared function are different. This patch tries to unify all of these entries.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	7d4335c935	MEDIUM: pattern/map: Maps can returns various types A map can store and return various types as output. The only example is the IPv4 and IPv6 types. The previous patch removed the type from the sample storage struct and used the converter output type, expecting that all entries of the map have the same type. This will be wrong when the maps support both IPv4 and IPv6 as output.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	136f9d34a9	MINOR: samples: rename union from "data" to "u" The union name "data" is a bit heavy when reading the source code because it produces "data.data.sint". Renaming it from "data" to "u" makes it easier to read, as in "data.u.sint".	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	654398a937	MINOR: samples: extract the anonymous union and create the union sample_value This extraction is not really required, but it may be useful later. An upcoming patch about the simplification of stick table values will use this union.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	8c542cac07	MEDIUM: samples: Use the "struct sample_data" in the "struct sample" This patch removes the information stored both in the struct sample_data and in the struct sample. Now, only the struct sample_data contains data, and the struct sample uses the struct sample_data for storing its own data.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	503bb09873	MINOR: samples: rename some struct member from "smp" to "data" These members contain data and not samples.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	12ba0c29d6	MINOR: samples: rename a struct from sample_storage to sample_data This is a first step of the sample reorganization.	2015-08-20 17:13:46 +02:00
Thierry FOURNIER	a123ad886a	MINOR: sample/proto_tcp: export "smp_fetch_src" This patch exports the sample fetch "smp_fetch_src()".	2015-08-11 14:14:11 +02:00
Thierry FOURNIER	fd7edd3968	MINOR: Move http method enum from proto_http to sample This is useful to prevent cross includes. The header file sample.h needs to include proto_http, stick_tables.h will need to include sample.h and proto_http includes stick_tables.h. I choose to move the known http method define because this enum is mainly used in sample.h. This enum is used for the sample type method.	2015-08-11 14:14:10 +02:00
Thierry FOURNIER	422a3af4ce	MINOR: proto_tcp: add session in the action prototype Some actions require the "struct session" while the "struct stream" is not available. This patch adds a pointer to the session.	2015-08-11 14:08:29 +02:00
Thierry FOURNIER	a6b6343cff	CLEANUP: http/tcp actions: remove the scope member The scope member is not used. This patch removes this entry.	2015-08-11 13:44:53 +02:00
Willy Tarreau	de39c9b10f	CLEANUP: appsession: remove the last include files These ones were include/common/appsession.h and include/common/sessionhash.h.	2015-08-10 19:42:30 +02:00
Willy Tarreau	3580b03fc0	CLEANUP: proxy: remove last references to appsession The code entirely builds without using these elements, let's get rid of them now.	2015-08-10 19:42:30 +02:00
Willy Tarreau	53a09d520e	MAJOR: http: remove references to appsession appsessions started to be deprecated with the introduction of stick tables, and the latter are much more powerful and flexible, and in addition they are replicated between nodes and maintained across reloads. Let's now remove appsession completely.	2015-08-10 19:16:18 +02:00
Andrew Hayworth	e63ac871f8	MINOR: log: Add log-format variable %HQ, to log HTTP query strings Since sample fetches are not always available in the response phase, this patch implements %HQ such that: GET /foo?bar=baz HTTP/1.0 ...would be logged as: ?bar=baz	2015-08-09 10:16:49 +02:00
Baptiste Assmann	9f5ada32e4	MINOR: server: add new SRV_ADMF_CMAINT flag The purpose of SRV_ADMF_CMAINT flag is to keep in mind the server was forced to maintenance status because of the configuration file.	2015-08-08 18:18:17 +02:00
Willy Tarreau	7017cb040c	MINOR: server: add a list of safe, already reused idle connections These ones are considered safe as they have already been reused. They will be useful in "aggressive" and "always" http-reuse modes in order to place the first request of a connection with the least risk.	2015-08-06 16:29:01 +02:00
Willy Tarreau	d8fecee291	MINOR: config: add new setting "http-reuse" For now it only supports "never", meaning that we never want to reuse a shared connection, and "always", meaning that we can use any connection that was not marked private. When "never" is set, this also implies that no idle connection may become a shared one.	2015-08-06 11:14:20 +02:00
Willy Tarreau	387ebf84dd	MINOR: connection: add a new flag CO_FL_PRIVATE This flag is set on an outgoing connection when this connection gets some properties that must not be shared with other connections, such as dynamic transparent source binding, SNI or a proxy protocol header, or an authentication challenge from the server. This will be needed later to implement connection reuse.	2015-08-06 11:14:17 +02:00
Willy Tarreau	173a1c6b43	MINOR: server: add a list of already used idle connections There's a difference with the other idle conns in that these new ones have already been used and may be reused by other streams.	2015-08-06 11:13:47 +02:00
Willy Tarreau	4320eaac62	MINOR: stream-int: make si_idle_conn() only accept valid connections This function is now dedicated to idle connections only, which means that it must not be used without any endpoint nor anything not a connection. The connection remains attached to the stream interface.	2015-08-06 11:11:10 +02:00
Willy Tarreau	323a2d925c	MEDIUM: stream-int: queue idle connections at the server Now we get a per-server list of all idle connections. That way we'll be able to reclaim them upon shortage later.	2015-08-06 11:06:25 +02:00
Willy Tarreau	d75d40e9a8	MINOR: connection: add a new list member in the connection struct This list member will be used to attach a connection to a list of idle, reusable or queued connections. It's unused for now. Given that it's not expected to be used more than a few times per session, the member was put after the target, in the area starting at the second cache line of the structure.	2015-08-06 11:00:34 +02:00
Willy Tarreau	600802aef0	MINOR: server: add a list of private idle connections For now it's not populated but we have the list entry. It will carry all idle connections that sessions don't want to share. They may be used later to reclaim connections upon socket shortage for example.	2015-08-06 10:59:08 +02:00
Willy Tarreau	41b09f4bbc	MINOR: stream-int: add new function si_detach_endpoint() This function only detaches the endpoint from the stream-int and optionally returns the original pointer. This will be needed to steal idle connections from other connections.	2015-08-05 21:54:25 +02:00
Willy Tarreau	973a54235f	MEDIUM: stream-int: simplify si_alloc_conn() Since we now always call this function with the reuse parameter cleared, let's simplify the function's logic as it cannot return the existing connection anymore. The savings on this inline function are appreciable (240 bytes) : $ size haproxy.old haproxy.new text data bss dec hex filename 1020383 40816 36928 1098127 10c18f haproxy.old 1020143 40816 36928 1097887 10c09f haproxy.new	2015-08-05 21:51:09 +02:00
Willy Tarreau	2100b49122	CLEANUP/MINOR: dns: dns_str_to_dn_label() only needs a const char The string is an input, let's constify it.	2015-07-22 16:42:43 +02:00
Thierry FOURNIER	fd77e05f65	MINOR: vars: returns variable content This patch copies the content of a variable into a sample. The function returns 0 if the variable is not found.	2015-07-22 00:48:24 +02:00
Thierry FOURNIER	bf65cd4d77	MAJOR: arg: converts uint and sint in sint This patch removes the 32-bit unsigned integer and the 32-bit signed integer. It replaces these types by a single 64-bit signed type.	2015-07-22 00:48:23 +02:00
Thierry FOURNIER	07ee64ef4d	MAJOR: sample: converts uint and sint in 64 bits signed integer This patch removes the 32-bit unsigned integer and the 32-bit signed integer. It replaces these types by a single 64-bit signed type. This makes integers easier to use and clarifies signed and unsigned usage. With the previous version, signed and unsigned were used interchangeably, and sometimes the converter lost the sign. For example, divisions are processed as "unsigned", so if one entry is negative, the result is wrong. Note that the integer pattern matching and dotted version pattern matching are already working with signed 64-bit integer values. There is one user-visible change : the "uint()" and "sint()" sample fetch functions which used to return a constant integer have been replaced with a new more natural, unified "int()" function. These functions were only introduced in the latest 1.6-dev2 so there's no impact on regular deployments.	2015-07-22 00:48:23 +02:00
Willy Tarreau	5b4dd683cb	MINOR: standard: provide htonll() and ntohll() These are the 64-bit equivalent of htonl() and ntohl(). They're a bit tricky in order to avoid expensive operations. The principle consists in letting the compiler detect we're playing with a union and simplify most or all operations. The asm-optimized htonl() version involving bswap (x86) / rev (arm) / other is a single operation on little endian, or a NOP on big-endian. In both cases, this lets the compiler "see" that we're rebuilding a 64-bit word from two 32-bit quantities that fit into a 32-bit register. In big endian, the whole code is optimized out. In little endian, with a decent compiler, a few bswap and 2 shifts are left, which is the minimum acceptable.	2015-07-21 23:50:06 +02:00
Thierry FOURNIER	763a5d85f7	MINOR: standard: add 64 bits conversion functions This patch adds 3 functions for 64-bit integer conversion. * lltoa_r : converts a signed 64-bit integer to a string * read_uint64 : converts from a string to an unsigned 64-bit integer with capping * read_int64 : converts from a string to a signed 64-bit integer with capping	2015-07-21 23:27:10 +02:00
Baptiste Assmann	19a106d24a	MINOR: server: server_find functions: id, name, best_match This patch introduces three new functions which can be used to find a server in a farm using different server information: - server unique id (srv->puid) - server name - find best match using either name or unique id When performing best matching, the following applies: - use the server name first (if provided) - use the server id if provided. In any case, the function can update the caller about mismatches encountered.	2015-07-21 23:24:16 +02:00
Baptiste Assmann	7cc419ae1d	MINOR: server: new server flag: SRV_F_FORCED_ID This flag aims at reporting whether the server unique id (srv->puid) has been forced by the administrator in HAProxy's configuration. If not set, it means HAProxy has generated automatically the server's unique id.	2015-07-21 23:24:16 +02:00
Baptiste Assmann	8a027ccb38	MINOR: proxy: bit field for proxy_find_best_match diff status function proxy_find_best_match can update the caller by updating an int provided in argument. For now, proxy_find_best_match hardcode bit values 0x01, 0x02 and 0x04, which is not understandable when reading a code exploiting them. This patch defines 3 macros with a more explicit wording, so further reading of a code exploiting the magic bit values will be understandable more easily.	2015-07-21 23:24:16 +02:00
Willy Tarreau	d7c7475ad6	MINOR: stream-int: only use si_release_endpoint() to release a connection Change si_alloc_conn() to call si_release_endpoint() instead of open-coding the connection releasing code when reuse is disabled. This fuses the code with the one already dealing with applets, makes it shorter and helps centralizing the connection freeing logic at a single place.	2015-07-19 19:14:10 +02:00
Willy Tarreau	7a08d3b2d7	CLEANUP: stream-int: remove stream_int_unregister_handler() and si_detach() The former was not used anymore and the latter was only used by the former. They were only aliases to other existing functions anyway.	2015-07-19 18:48:20 +02:00
Thierry FOURNIER	25f4e3ef33	CLEANUP: vars: remove unused struct The commit "MEDIUM: vars: move the session variables to the session, not the stream" (ebcd4844e82a4198ea5d98fe491a46267da1d1ec) moved the variables from the stream to the session, but forgot to remove the stream definition of "vars_sess".	2015-07-10 16:30:08 +02:00
Willy Tarreau	732eac41f4	MEDIUM: ssl: add sni support on the server lines The new "sni" server directive takes a sample fetch expression and uses its return value as a hostname sent as the TLS SNI extension. A typical use case consists in forwarding the front connection's SNI value to the server in a bridged HTTPS forwarder : sni ssl_fc_sni	2015-07-10 11:43:15 +02:00
Willy Tarreau	630764188b	MINOR: ssl: provide a function to set the SNI extension on a connection ssl_sock_set_servername() is used to set the SNI hostname on an outgoing connection. This function comes from code originally provided by Christopher Faulet of Qualys.	2015-07-10 11:37:29 +02:00
Willy Tarreau	28d976d5ee	MINOR: args: add new context for servers We'll have to support fetch expressions and args on server lines for "usesrc", "usedst", "sni", etc...	2015-07-09 11:39:33 +02:00
Willy Tarreau	646b864fe5	CLEANUP: ssl: make ssl_sock_generated_cert_serial() take a const It doesn't modify the input.	2015-07-07 18:09:15 +02:00
Adis Nezirovic	2fbcafc9ce	MEDIUM: http: Add new 'set-src' option to http-request This option enables overriding the source IP address in an HTTP request. It is useful when we want to set a custom source IP (e.g. a front proxy rewrites the address, but provides the correct one in headers) or when we want to mask the source IP address for privacy or compliance. It acts on any expression which produces a correct IP address.	2015-07-06 16:17:28 +02:00
Adis Nezirovic	79beb248b9	CLEANUP: sample: generalize sample_fetch_string() as sample_fetch_as_type() This modification makes possible to use sample_fetch_string() in more places, where we might need to fetch sample values which are not plain strings. This way we don't need to fetch string, and convert it into another type afterwards. When using aliased types, the caller should explicitly check which exact type was returned (e.g. SMP_T_IPV4 or SMP_T_IPV6 for SMP_T_ADDR). All usages of sample_fetch_string() are converted to use new function.	2015-07-06 16:17:25 +02:00
David Carlier	e6c3941668	BUILD/MINOR: tools: rename popcount to my_popcountl This is in order to avoid conflicting with the NetBSD popcount* functions present since the 6.x release; the final l indicates that the argument is a long, as NetBSD does. This patch could be backported to 1.5 to fix the build issue there as well.	2015-07-02 11:32:25 +02:00
Dragan Dosen	105c8e6368	MEDIUM: 51d: add LRU-based cache on User-Agent string detection This cache is used by 51d converter. The input User-Agent string, the converter args and a random seed are used as a hashing key. The cached entries contains a pointer to the resulting string for specific User-Agent string detection. The cache size can be tuned using 51degrees-cache-size parameter.	2015-06-30 10:43:03 +02:00
Dragan Dosen	93b38d9191	MEDIUM: 51Degrees code refactoring and cleanup Moved 51Degrees code from src/haproxy.c, src/sample.c and src/cfgparse.c into a separate files src/51d.c and include/import/51d.h. Added two new functions init_51degrees() and deinit_51degrees(), updated Makefile and other code reorganizations related to 51Degrees.	2015-06-30 10:43:03 +02:00
Willy Tarreau	ebcd4844e8	MEDIUM: vars: move the session variables to the session, not the stream It's important that the session-wide variables are in the session and not in the stream.	2015-06-19 11:59:02 +02:00
Willy Tarreau	ce7b4def74	BUILD/MINOR: lua: ensure that hlua_ctx_destroy is properly defined When Lua is disabled, the alternate functions must have the same prototype as the original ones, otherwise we get such warnings : src/stream.c:278:27: warning: too many arguments in call to 'hlua_ctx_destroy' hlua_ctx_destroy(&s->hlua); ~~~~~~~~~~~~~~~~ ^ No backport is needed.	2015-06-17 20:18:54 +02:00
Emeric Brun	aaf5860fd6	MINOR: peers: avoid re-scheduling of pending stick-table's updates still not pushed.	2015-06-16 16:11:12 +02:00
Thierry FOURNIER	c365d99a07	MINOR: vars: adds get and set functions This patch adds two functions used for variable access using the variable's full name. If the variable doesn't exist in the variable pool name, it is created.	2015-06-13 23:01:37 +02:00
Thierry FOURNIER	4834bc773c	MEDIUM: vars: adds support of variables This patch adds support of variables during the processing of each stream. The variables scope can be set as 'session', 'transaction', 'request' or 'response'. The variable type is the type returned by the assignment expression. The type can change while the processing. The allocated memory can be controlled for each scope and each request, and for the global process.	2015-06-13 23:01:37 +02:00
Thierry FOURNIER	a9ff994461	MINOR: build: fix build dependency Fix include dependency. The header file sample.h doesn't need to know the content of the struct arg, so I remove the include and replace it with a simple pointer declaration. This prevents an include dependency issue with the next patch.	2015-06-13 23:01:37 +02:00
Thierry FOURNIER	0e11863a6f	MINOR: tcp/http/conf: extends the keyword registration options This patch makes it possible to register a new keyword for "tcp-request content", "tcp-request connection", "tcp-response content", "http-request" and "http-response" that is identified only by matching the start of the keyword. For example, if we register the keyword "set-var" with the option "match_pfx", the configuration keyword "set-var(var_name)" matches this entry.	2015-06-13 23:01:37 +02:00
Thierry FOURNIER	fbdb77582d	MINOR: tcp: increase the opaque data array This patch increases the opaque data array for the tcp_rules. It is used by the "store" action (committed next) which deals with variables.	2015-06-13 23:01:37 +02:00
Thierry FOURNIER	9c627e84b2	MEDIUM: sample: Add type any This type is used to accept any type of sample as input, and prevent any automatic "cast". It runs like the type "ADDR" which accept the type "IPV4" and "IPV6".	2015-06-13 22:59:14 +02:00
Thierry FOURNIER	1480bd8dd2	MINOR: standard: add function that converts signed int to a string This function is the same as "ultoa_r", but it takes a signed value as input.	2015-06-13 22:59:14 +02:00
Baptiste Assmann	3863f97349	MINOR: dns: add DNS statistics add a new command on the stats socket to print a DNS resolvers section (including per server) statistics: "show stats resolvers <id>"	2015-06-13 22:07:35 +02:00
Baptiste Assmann	a68ca96375	MAJOR: server: add DNS-based server name resolution Relies on the DNS protocol freshly implemented in HAProxy. It performs a server IP addr resolution based on a server hostname.	2015-06-13 22:07:35 +02:00
Baptiste Assmann	325137d603	MEDIUM: dns: implement a DNS resolver Implementation of a DNS client in HAProxy to perform name resolution to IP addresses. It relies on the freshly created UDP client to perform the DNS resolution. For now, all UDP socket calls are performed in the DNS layer, but this might change later when the protocols are extended to be more suited to datagram mode. A new section called 'resolvers' is introduced thanks to this patch. It is used to describe DNS servers IP address and also many parameters.	2015-06-13 22:07:35 +02:00
Baptiste Assmann	5d4e4f7a57	MEDIUM: protocol: add minimalist UDP protocol client Basic introduction of a UDP layer in HAProxy. It can be used as a client only and manages UDP exchanges with servers. It can't be used to load-balance UDP protocols, but only used by internal features such as DNS resolution.	2015-06-13 22:07:35 +02:00
Baptiste Assmann	3d8f831f13	MEDIUM: server: change server ip address from stats socket New command available on the stats socket to change a server addr using the command "set server <backend>/<server> addr <ip4\|ip6>"	2015-06-13 22:07:35 +02:00
Baptiste Assmann	14e4014a48	MEDIUM: server: add support for changing a server's address Ability to change a server IP address during HAProxy run time. For now this is provided via function update_server_addr() which currently is not called. A log is emitted on each change. For now we do it unconditionally, but later we'll want to do it only in certain circumstances, which explains why the logging block is enclosed in if(1).	2015-06-13 22:07:35 +02:00
Joris Dedieu	9dd44ba5d6	BUG/MEDIUM: compat: fix segfault on FreeBSD Since commit `65d805fd`, which removed standard.h from compat.h, some values were not properly set on FreeBSD. This caused a segfault at startup when smp_resolve_args is called. As FreeBSD has IP_BINDANY, CONFIG_HAP_TRANSPARENT is defined. This causes struct conn_src to be extended with some fields, and the size of this structure was incorrect. Including netinet/in.h fixes this issue. While diving into code preprocessing, I found that limits.h was required to properly set MAX_HOSTNAME_LEN, ULONG_MAX, USHRT_MAX and other system limits on FreeBSD.	2015-06-13 08:25:36 +02:00
Christopher Faulet	3054880e09	MINOR: ssl: Export functions to manipulate generated certificates Following functions are now available in the SSL public API: * ssl_sock_create_cert * ssl_sock_get_generated_cert * ssl_sock_set_generated_cert * ssl_sock_generated_cert_serial These functions could be used to create a certificate by hand, set it in the cache used to store generated certificates and retrieve it. Here is an example (pseudo code): X509 cacert = ...; EVP_PKEY capkey = ...; char servername = ...; unsigned int serial; serial = ssl_sock_generated_cert_serial(servername, strlen(servername)); if (!ssl_sock_get_generated_cert(serial, cacert)) { SSL_CTX ctx = ssl_sock_create_cert(servername, serial, cacert, capkey); ssl_sock_set_generated_cert(ctx, serial, cacert); }	2015-06-12 18:06:59 +02:00
Christopher Faulet	31af49d62b	MEDIUM: ssl: Add options to forge SSL certificates With this patch, it is possible to configure HAProxy to forge the SSL certificate sent to a client using the SNI servername. We do it in the SNI callback. To enable this feature, you must pass following BIND options: * ca-sign-file <FILE> : This is the PEM file containing the CA certificate and the CA private key to create and sign server's certificates. * (optionally) ca-sign-pass <PASS>: This is the CA private key passphrase, if any. * generate-certificates: Enable the dynamic generation of certificates for a listener. Because generating certificates is expensive, there is a LRU cache to store them. Its size can be customized by setting the global parameter 'tune.ssl.ssl-ctx-cache-size'.	2015-06-12 18:06:59 +02:00
Christopher Faulet	92939d20fa	MINOR: lru: Add lru64_lookup function It looks up a key in an LRU cache for use with the specified domain and revision. It differs from lru64_get as it does not create missing keys. The function returns NULL if an error or a cache miss occurs.	2015-06-12 18:06:59 +02:00
Christopher Faulet	f90ac55d9e	MINOR: lru: Add the possibility to free data when an item is removed Now, when an item is committed in an LRU tree, you can define a function to free data owned by this item. This function will be called when the item is removed from the LRU tree or when the tree is destroyed.	2015-06-12 18:06:59 +02:00
Thomas Holmes	4d441a759c	MEDIUM: sample: add trie support to 51Degrees Trie or pattern algorithm is used depending on what 51Degrees source files are provided to MAKE.	2015-06-02 19:30:53 +02:00
Thomas Holmes	eef3f9fbd3	MINOR: global: add several 51Degrees members to global These are : fiftyoneDegreesDataSet _51d_data_set char *_51d_data_file_path char _51d_property_seperator struct list _51d_property_names	2015-06-02 14:00:22 +02:00
Willy Tarreau	a397e76ac7	CLEANUP: adjust the envelope name in da.h to reflect the file name Since the file is import/da.h, the define should be _IMPORT_DA_H, not MOBI_DA_DAC_HAPROXY_H.	2015-06-02 13:42:12 +02:00
Willy Tarreau	0d74f77ee7	CLEANUP: da: register the config keywords in da.c No need to export the registration to haproxy.c, it can be done locally.	2015-06-02 13:42:12 +02:00
Willy Tarreau	f63386ad27	CLEANUP: da: move the converter registration to da.c There's no reason to put it into sample.c, it's better to register it locally in da.c, it removes a number of ifdefs and exports.	2015-06-02 13:42:12 +02:00
Willy Tarreau	ba3dd2b7e0	CLEANUP: global: remove one ifdef USE_DEVICEATLAS The include file already has the ifdef, let's remove it from the global file.	2015-06-02 13:42:12 +02:00
David Carlier	88d13cbf4d	MEDIUM: global: add the DeviceAtlas required elements to struct global This diff is the raw C struct definition of all DeviceAtlas module data needed added to the main global struct haproxy configuration. The three first members are needed for both init and deinit phases as some dynamic memory allocations are done. The useragentid serves to hold during the whole lifecycle of the module the User-Agent HTTP Header identifier from the DeviceAtlas data during the init process.	2015-06-02 13:24:49 +02:00
David Carlier	8167f30661	MEDIUM: config: add DeviceAtlas global keywords This diff is for the DeviceAtlas convertor. This patch adds the following converters : deviceatlas-json-file deviceatlas-log-level deviceatlas-property-separator First, the configuration keywords handling (only the log level configuration part does not end the haproxy process if it is wrongly set, it fallbacks to the default level). Furthermore, init, deinit phases and the API lookup phase, the da_haproxy function which is fed by the input provided and set all necessary properties chosen via the configuration to the output, separated by the separator.	2015-06-02 13:24:44 +02:00
Remi Gacogne	47783ef05b	MEDIUM: ssl: add the possibility to use a global DH parameters file This patch adds the ssl-dh-param-file global setting. It sets the default DH parameters that will be used during the SSL/TLS handshake when ephemeral Diffie-Hellman (DHE) key exchange is used, for all "bind" lines which do not explicitly define theirs.	2015-05-31 22:02:00 +02:00
Thierry FOURNIER	561a0f989d	MINOR: tcp: add custom actions that can continue tcp-(request\|response) processing Currently, the tcp-request and tcp-response custom actions are always final actions. This patch creates a new type of action that allows the evaluation of tcp-request and tcp-response processing to continue.	2015-05-29 17:49:48 +02:00
Emeric Brun	b3971ab062	MAJOR: peers: peers protocol version 2.0 This patch doesn't add any new feature: the functional behavior is the same as in version 1.0. Technical differences: In this version all updates on different stick tables are multiplexed on the same tcp session. There is only one established tcp session per peer, whereas in the first version there was one established tcp session per peer and per stick table. The message format was reworked to be more extensible and to support further types of data exchange such as SSL sessions or other stick-table data types (currently only the stick table's server id is supported).	2015-05-29 15:50:33 +02:00
Thierry FOURNIER	ddea626de4	MINOR: common: escape CSV strings This function prepares a string for use in a CSV output format. If the string contains one of the following four chars <">, <,>, CR or LF, the string is enclosed between <"> and each <"> is escaped by a <""> sequence. The enclosing <"> quoting is optional: it can be disabled, forced, or chosen automatically by the function.	2015-05-28 17:47:19 +02:00
Willy Tarreau	51d861a44f	MEDIUM: http: implement http-response redirect rules Sometimes it's problematic not to have "http-response redirect" rules, for example to perform a browser-based redirect based on certain server conditions (eg: match of a header). This patch adds "http-response redirect location <fmt>" which gives enough flexibility for most imaginable operations. The connection to the server is closed when this is performed so that we don't risk to forward any pending data from the server. Any pending response data are trimmed so that we don't risk to forward anything pending to the client. It's harmless to also do that for requests so we don't need to consider the direction.	2015-05-28 17:45:43 +02:00
Willy Tarreau	be4653b6d4	MINOR: http: prepare support for parsing redirect actions on responses In order to support http-response redirect, the parsing needs to be adapted a little bit to only support the "location" type, and to adjust the log-format parser so that it knows the direction of the sample fetch calls.	2015-05-28 17:43:11 +02:00
Thierry FOURNIER	cdb67cad01	MINOR: http: add array of generic pointers in http_res_rules This patch adds an array of generic pointers in the http_res_rules struct. It is used later by the "response capture" functions.	2015-05-28 13:50:29 +02:00
Willy Tarreau	eb3e34854d	MEDIUM: proxy: add a new proxy_find_best_match() function This function tries to spot a proxy by its name, ID and type, and in case some elements don't match, it tries to determine which ones could be ignored and reports which ones were ignored so that the caller can decide whether or not it wants to pick this proxy. This will be used for maintaining the status across reloads where the config might have changed a bit.	2015-05-27 16:51:28 +02:00
Willy Tarreau	0d1fdf7df5	MINOR: proxy: add a flag to memorize that the proxy's ID was forced This will be used to know if proxy's ID should be considered when names mismatch upon check status reload.	2015-05-27 16:51:28 +02:00
Willy Tarreau	3c56a7d94f	MINOR: proxy: add a new function proxy_find_by_id() It does the same as the other one except that it only focuses on the numeric ID and the capabilities. It's used by proxy_find_by_name() for numeric names.	2015-05-27 16:49:44 +02:00
Willy Tarreau	98d0485a90	MAJOR: config: remove the deprecated reqsetbe / reqisetbe actions These ones were already obsoleted in 1.4, marked for removal in 1.5, and not documented anymore. They used to emit warnings, and do still require quite some code to stay in place. Let's remove them now.	2015-05-26 12:18:29 +02:00
Willy Tarreau	e2dc1fa8ca	MEDIUM: stick-table: remove the now duplicate find_stktable() function Since proxy_tbl_by_name() already does the same job, let's not keep duplicate functions and use this one only.	2015-05-26 12:08:07 +02:00
Willy Tarreau	63d38fda4a	CLEANUP: proxy: remove now unused function findproxy_mode() It's not used anymore.	2015-05-26 12:05:53 +02:00
Willy Tarreau	9e0bb1013e	CLEANUP: proxy: make the proxy lookup functions more user-friendly First, findproxy() was renamed proxy_find_by_name() so that its explicit that a name is required for the lookup. Second, we give this function the ability to search for tables if needed. Third we now provide inline wrappers to pass the appropriate PR_CAP_* flags and to explicitly look up a frontend, backend or table.	2015-05-26 11:24:42 +02:00
Joseph Lynch	726ab7145c	MEDIUM: backend: Allow redispatch on retry intervals For backend load balancing it sometimes makes sense to redispatch rather than retrying against the same server. For example, when machines or routers fail you may not want to waste time retrying against a dead server and would instead prefer to immediately redispatch against other servers. This patch allows backend sections to specify that they want to redispatch on a particular interval. If the interval N is positive the redispatch occurs on every Nth retry, and if the interval N is negative then the redispatch occurs on the Nth retry prior to the last retry (-1 is the default and maintains backwards compatibility). In low latency environments tuning this setting can save a few hundred milliseconds when backends fail.	2015-05-22 07:07:40 +02:00
Nenad Merdanovic	200b0facde	MEDIUM: Add support for updating TLS ticket keys via socket Until now, HAproxy needed to be restarted to change the TLS ticket keys. With this patch, the TLS keys can be updated on a per-file basis using the admin socket. Two new socket commands have been introduced: "show tls-keys" and "set ssl tls-keys". Signed-off-by: Nenad Merdanovic <nmerdan@anine.io>	2015-05-16 11:28:04 +02:00
Nenad Merdanovic	146defaff4	MINOR: Add TLS ticket keys reference and use it in the listener struct Within the listener struct we need to use a reference to the TLS ticket keys which binds the actual keys with the filename. This will make it possible to update the keys through the socket Signed-off-by: Nenad Merdanovic <nmerdan@anine.io>	2015-05-16 11:28:04 +02:00
Baptiste Assmann	22b09d2393	MINOR: include comment in tcpcheck error log tcpcheck error messages include the step id where the error occurs. In some cases, this is not enough. Now, HAProxy also uses the comment field of the latest tcpcheck rule which has been run. This commit allows HAProxy to parse a new directive in the tcpcheck ruleset: 'comment'. It is used to set up comments on the current tcpcheck rules.	2015-05-12 11:04:39 +02:00
Baptiste Assmann	aa12b47b06	MINOR: tcpcheck_rule structure update A new field is added into the tcpcheck_rule structure. This field will host a string used as a comment to describe the rule. Then this comment can be used in logs to report a more user friendly message on the step which failed during the tcpcheck ruleset.	2015-05-12 10:44:49 +02:00
Thierry FOURNIER	0786d05a04	MEDIUM: sample: change the prototype of sample-fetches functions This patch removes the "opt" entry from the prototype of the sample-fetches functions. This removes some weight from the prototype call.	2015-05-11 20:03:08 +02:00
Thierry FOURNIER	1d33b882d2	MINOR: sample: fill the struct sample with the options. Options are relative to the sample. Each sample fetched is associated with fetch options or fetch flags. This patch adds the 'opt' value to the sample struct. This makes it possible to reduce the sample-fetch function prototype. In addition, the converters will have more detail about the origin of the sample.	2015-05-11 20:02:11 +02:00
Thierry FOURNIER	0a9a2b8cec	MEDIUM: sample change the prototype of sample-fetches and converters functions This patch removes the structs "session", "stream" and "proxy" from the sample-fetches and converters function prototypes. This permits to remove some weight in the prototype call.	2015-05-11 20:01:42 +02:00
Thierry FOURNIER	6879ad31a5	MEDIUM: sample: fill the struct sample with the session, proxy and stream pointers Some sample analyzers (sample-fetches or converters) need to know the proxy, session and stream attached to the sample. The sample-fetch and converter function pointers cannot be called without these 3 pointers filled. This patch makes it possible to reduce the sample-fetch and converter prototypes, and provides a new means to add information for this type of function.	2015-05-11 20:00:03 +02:00
Willy Tarreau	9fbe18e174	MEDIUM: http: add a new option http-buffer-request It is sometimes desirable to wait for the body of an HTTP request before taking a decision. This is what is being done by "balance url_param" for example. The first use case is to buffer requests from slow clients before connecting to the server. Another use case consists in taking the routing decision based on the request body's contents. This option placed in a frontend or backend forces the HTTP processing to wait until either the whole body is received, or the request buffer is full, or the first chunk is complete in case of chunked encoding. It can have undesired side effects with some applications abusing HTTP by expecting unbuffered transmissions between the frontend and the backend, so this should definitely not be used by default. Note that it would not work for the response because we don't reset the message state before starting to forward. For the response we need to 1) reset the message state to MSG_100_SENT or BODY , and 2) to reset body_len in case of chunked encoding to avoid counting it twice.	2015-05-02 00:10:44 +02:00
Willy Tarreau	748179eb5a	MEDIUM: stream: move HTTP request body analyser before process_common Since 1.5, the request body analyser has become independent of any other element and does not even disturb the message forwarder anymore. And since it's disabled by default, we can place it before most analysers so that it can preempt any other one if an intermediary one enables it.	2015-05-02 00:10:44 +02:00
Willy Tarreau	30fe818979	DOC: fix the comments about the meaning of msg->sol in HTTP It has a meaning while parsing a body when using chunked encoding. This must be backported to 1.5 since it caused a bug there as well.	2015-05-01 23:24:31 +02:00
Willy Tarreau	aa729784e1	MINOR: peers: store the pointer to the signal handler We'll need it to unregister stopped peers sections.	2015-05-01 20:16:31 +02:00
Willy Tarreau	0f228a037a	MEDIUM: http: add option-ignore-probes to get rid of the floods of 408 Recently some browsers started to implement a "pre-connect" feature consisting in speculatively connecting to some recently visited web sites just in case the user would like to visit them. This results in many connections being established to web sites, which end up in 408 Request Timeout if the timeout strikes first, or 400 Bad Request when the browser decides to close them first. These ones pollute the log and feed the error counters. There was already "option dontlognull" but it's insufficient in this case. Instead, this option does the following things : - prevent any 400/408 message from being sent to the client if nothing was received over a connection before it was closed ; - prevent any log from being emitted in this situation ; - prevent any error counter from being incremented That way the empty connection is silently ignored. Note that it is better not to use this unless it is clear that it is needed, because it will hide real problems. The most common reason for not receiving a request and seeing a 408 is due to an MTU inconsistency between the client and an intermediary element such as a VPN, which blocks too large packets. These issues are generally seen with POST requests as well as GET with large cookies. The logs are often the only way to detect them. This patch should be backported to 1.5 since it avoids false alerts and makes it easier to monitor haproxy's status.	2015-05-01 15:39:23 +02:00
Willy Tarreau	f3045d2a06	MAJOR: pattern: add LRU-based cache on pattern matching The principle of this cache is to have a global cache for all pattern matching operations which rely on lists (reg, sub, dir, dom, ...). The input data, the expression and a random seed are used as a hashing key. The cached entries contain a pointer to the expression and a revision number for that expression so that we don't accidentally use obsolete data after a pattern update or a very unlikely hash collision. Regarding the risk of collisions, 10k entries at 10k req/s mean 1% risk of a collision after 60 years, that's already much less than the memory's reliability in most machines and more durable than most admin's life expectancy. A collision will result in a valid result to be returned for a different entry from the same list. If this is not acceptable, the cache can be disabled using tune.pattern.cache-size. A test on a file containing 10k small regex showed that the regex matching was limited to 6k/s instead of 70k with regular strings. When enabling the LRU cache, the performance was back to 70k/s.	2015-04-29 19:15:24 +02:00
Willy Tarreau	72f073b6c7	MEDIUM: pattern: add a revision to all pattern expressions This will be used to detect any change on the pattern list between two operations, ultimately making it possible to implement a cache which immediately invalidates obsolete keys after an update. The revision is simply taken from the timestamp counter to ensure that even upon a pointer reuse we cannot accidentally come back to the same (expr,revision) tuple.	2015-04-29 19:15:24 +02:00
Willy Tarreau	b5684e0081	IMPORT: hash: import xxhash-r39 The xxhash library provides a very fast and excellent hash algorithm suitable for many purposes. It excels at hashing large blocks but is also extremely fast on small ones. It's distributed under a 2-clause BSD license (GPL-compatible) so it can be included here. Updates are distributed here : https://github.com/Cyan4973/xxHash	2015-04-29 19:15:21 +02:00
Willy Tarreau	69c696c138	IMPORT: lru: import simple ebtree-based LRU functions This will be usable to implement some maps/acl caches for heavy datasets loaded from files (mostly regex-based but in general anything that cannot be indexed in a tree).	2015-04-29 19:14:43 +02:00
Willy Tarreau	e6e49cfa93	MINOR: tools: provide an rdtsc() function for time comparisons This one returns a timestamp, either the one from the CPU or from gettimeofday() in 64-bit format. The purpose is to be able to compare timestamps on various entities to make it easier to detect updates. It can also be used for benchmarking in certain situations during development.	2015-04-29 19:14:03 +02:00
Andrew Hayworth	0ebc55f6b4	MEDIUM: logs: Add HTTP request-line log format directives This commit adds 4 new log format variables that parse the HTTP Request-Line for more specific logging than "%r" provides. For example, we can parse the following HTTP Request-Line with these new variables: "GET /foo?bar=baz HTTP/1.1" - %HM: HTTP Method ("GET") - %HV: HTTP Version ("HTTP/1.1") - %HU: HTTP Request-URI ("/foo?bar=baz") - %HP: HTTP Request-URI without query string ("/foo")	2015-04-28 21:03:05 +02:00
Willy Tarreau	e5843b383d	BUG/MEDIUM: peers: recent applet changes broke peers updates scheduling Since appctx are scheduled out of streams, it's pointless to wake up the task managing the stream to push updates, they won't be seen. In fact unit tests work because silent sessions are restarted after 5s of idle and the exchange is correctly scheduled during startup! So we need to notify the appctx instead. For this we add a pointer to the appctx in the peer session. No backport is needed of course.	2015-04-27 18:42:17 +02:00
Willy Tarreau	eb406dc73c	MINOR: stream-int: add two flags to indicate an applet's wishes regarding I/O Currently we have a problem. There are some cases where a sleeping applet is not woken up (eg: show sess during an injection). The reason is that the applet is marked WAIT_DATA and is not woken up when WAIT_ROOM leaves, because we wait for both flags to be cleared in order to call it. And if we wait for either flag, then we have the opposite situation, which is that we're not waiting for room in the output buffer so we're spinning calling the applet to do nothing. What is missing is an indication of what the applet needs. Since it only manipulates the WAIT_ROOM/WAIT_DATA which are overwritten later, that cannot work. In the case of connections, the problem doesn't happen because the connection maintains these extra states. Ideally we'd need to have similar states for each appctx and to store those information there. But it would be overcomplicated given that an applet doesn't exist alone without a stream-int, so we can safely put these information into the stream int and make the code simpler. With this patch we introduce two new flags in the stream interface : - SI_FL_WANT_PUT : the applet wants to put something into the buffer - SI_FL_WANT_GET : the applet wants to get something from the buffer We also have the new functions si_applet_{stop|want|cant}_{get|put} to make the code look similar to the connection code. For now these flags are not used yet.	2015-04-23 17:56:17 +02:00
Willy Tarreau	e5f8649102	MEDIUM: stream-int: add a new function si_applet_done() This is the equivalent of si_conn_wake() but for applets. It will be called after changes to the stream interface are brought by the applet I/O handler. Ultimately it will release buffers and maybe even wake the stream's task up if some important changes are detected. It would be nice to be able to merge it with the connection's wake function since it mostly manipulates the stream interface, but there are minor differences (such as how to enable/disable polling on a fd vs applet) and some specificities to applets (eg: don't wake the applet up until the output is empty) which would require abstract functions which would slow down everything.	2015-04-23 17:56:16 +02:00
Willy Tarreau	3c595ac3ad	MEDIUM: applet: implement a run queue for active appctx The new function is called for each round of polling in order to call any active appctx. For now we pick the stream interface from the appctx's owner. At the moment there's no appctx queued yet, but we have everything needed to queue them and remove them.	2015-04-23 17:56:16 +02:00
Willy Tarreau	81f38d6f57	MEDIUM: applet: add basic support for an applet run queue This will be needed so that we can schedule applets out of the streams. For now nothing calls the queue yet.	2015-04-23 17:56:16 +02:00
Willy Tarreau	d45b9f8991	REORG: stream-int: create si_applet_ops dedicated to applets These functions are dedicated to applets so that we don't use the default ones anymore in this case.	2015-04-23 17:56:16 +02:00
Willy Tarreau	3057645b37	CLEANUP: applet: rename struct si_applet to applet Since this one does not depend on stream_interface anymore, remove the "si_" prefix.	2015-04-23 17:56:16 +02:00
Willy Tarreau	8a8d83b85c	REORG: applet: move the applet definitions out of stream_interface We're tidying the definitions so that appctx lives on its own. A new set of applet.h files has been added for this purpose.	2015-04-23 17:56:16 +02:00
Willy Tarreau	00a37f0029	MEDIUM: applet: make the applet not depend on a stream interface anymore Now the applet's functions only take an appctx as an argument, not a stream interface. This slightly simplifies the code and will be needed to take the appctx out of the stream interface.	2015-04-23 17:56:16 +02:00
Willy Tarreau	19c8161b3d	MINOR: applet: add a new "owner" pointer in the appctx This pointer indicates what stream-interface the appctx belongs to, just like we have for the connections.	2015-04-23 17:56:16 +02:00

6744 Commits