haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-13 18:46:57 +02:00

Author	SHA1	Message	Date
Amaury Denoyelle	46f041d7f8	MEDIUM: fcgi: remove conn from session on detach FCGI mux is marked with HOL blocking. In safe reuse mode, connections using it are placed on the session instead of the available list to avoid sharing them between several clients. On detach, if there are no more streams, remove the connection from the session before adding it to the idle list. If there are still streams in use, do not add it to the available list as it should already be on the session list.	2020-10-15 15:19:34 +02:00
Amaury Denoyelle	6b8daef56b	MEDIUM: h2: remove conn from session on detach H2 mux is marked with HOL blocking. In safe reuse mode, connections using it are placed on the session instead of the available list to avoid sharing them between several clients. On detach, if there are no more streams, remove the connection from the session before adding it to the idle list. If there are still streams in use, do not add it to the available list as it should already be on the session list.	2020-10-15 15:19:34 +02:00
Amaury Denoyelle	0d21deaded	MEDIUM: backend: add reused conn to sess if mux marked as HOL blocking If a connection is using a mux protocol subject to HOL blocking, add it to the session instead of the available list to avoid sharing it with other clients on connection reuse.	2020-10-15 15:19:34 +02:00
Amaury Denoyelle	00464ab8f4	MEDIUM: backend: add new conn to session if mux marked as HOL blocking When allocating a new session on connect_server, if the mux protocol is marked as subject to HOL blocking, add it to the session instead of the available list to avoid sharing it with other clients.	2020-10-15 15:19:34 +02:00
Amaury Denoyelle	3d3c0918dc	MINOR: mux/connection: add a new mux flag for HOL risk This flag is used to indicate if the mux protocol is subject to head-of-line blocking problem.	2020-10-15 15:19:34 +02:00
Amaury Denoyelle	9c13b62b47	BUG/MEDIUM: connection: fix srv idle count on conn takeover On server connection migration from one thread to another, the wrong idle thread-specific counter is decremented. This bug was introduced by commit `3d52f0f1f8` due to the factorization with srv_use_idle_conn. However, this statement is only executed from conn_backend_get. Extract the decrement from srv_use_idle_conn in conn_backend_get and use the correct thread-specific counter. Rename the function to srv_use_conn to better reflect its purpose as it is also used with a newly initialized connection not in the idle list. As a side change, the connection insertion to the available list has also been extracted to conn_backend_get. This will be useful to be able to specify an alternative list for protocols subject to HOL risk that should not be shared between several clients. This bug is only present in this release and thus does not need a backport.	2020-10-15 15:19:34 +02:00
Amaury Denoyelle	5f1ded5629	BUG/MINOR: connection: fix loop iter on connection takeover The loop always missed one iteration due to the incrementation done on the for check. Move the incrementation on the loop last statement to fix this behaviour. This bug has a very limited impact, not at all visible to the user, but could be backported to 2.2.	2020-10-15 15:19:25 +02:00
Willy Tarreau	1a3770cbc7	BUG/MEDIUM: deinit: check fdtab before fdtab[fd].owner When running a pure config check (haproxy -c) we go through the deinit phase without having allocated fdtab, so we can't blindly dereference it. The issue was added by recent commit `ae7bc4a23` ("MEDIUM: deinit: close all receivers/listeners before scanning proxies"), no backport is needed.	2020-10-14 12:13:51 +02:00
Willy Tarreau	2f6f362756	CLEANUP: protocol: initialize all of the sockaddr when disconnecting In issue #894, Coverity suspects uninitialized values for a socket's address whose family is AF_UNSPEC but it doesn't know that the address is not used in this case. It's not on a critical path and working around it is trivial, let's fully declare the address. We're doing it for both TCP and UDP, because the same principle appears at two places.	2020-10-14 10:54:15 +02:00
Willy Tarreau	258b351704	BUG/MINOR: listener: detect and handle shared sockets stopped in other processes It may happen that during a temporary listener pause resulting from a SIGTTOU, one process gets one of its sockets disabled by another process and will not be able to recover from this situation by itself. For the protocols supporting this (TCPv4 and TCPv6 at the moment) this situation is detectable, so when this happens, let's put the listener into the PAUSED state so that it remains consistent with the real socket state. One nice effect is that just sending the SIGTTIN signal to the process is enough to recover the socket in this case. There is no need to backport this, this behavior has been there forever and the fix requires to reimplement the getsockopt() call there.	2020-10-13 18:15:33 +02:00
Willy Tarreau	85d2ba6b78	CLEANUP: unix: make use of sock_accept_conn() where relevant This allows to get rid of one getsockopt(SO_ACCEPTCONN) in the binding code.	2020-10-13 18:15:33 +02:00
Willy Tarreau	3e12de2cc6	CLEANUP: tcp: make use of sock_accept_conn() where relevant This allows to get rid of two getsockopt(SO_ACCEPTCONN).	2020-10-13 18:15:33 +02:00
Willy Tarreau	cc8b653483	MINOR: sockpair: implement the .rx_listening function For socket pairs we don't rely on a real listening socket but we need to have a properly connected UNIX stream socket. This is what the new sockpair_accept_conn() tries to report. Some corner cases like half shutdown will still not be detected but that should be sufficient for most cases we really care about.	2020-10-13 18:15:33 +02:00
Willy Tarreau	29185140db	MINOR: protocol: make proto_tcp & proto_uxst report listening sockets Now we introduce a new .rx_listening() function to report if a receiver is actually a listening socket. The reason for this is to help detect shared sockets that might have been broken by sibling processes.	2020-10-13 18:15:33 +02:00
Willy Tarreau	5ced3e8879	MINOR: sock: add sock_accept_conn() to test a listening socket At several places we need to check if a socket is still valid and still willing to accept connections. Instead of open-coding this, each time, let's add a new function for this.	2020-10-13 18:15:33 +02:00
Willy Tarreau	8b6fc3d10e	MINOR: proto-tcp: make use of connect(AF_UNSPEC) for the pause Currently the suspend/resume mechanism for listeners only works on Linux and we resort to a number of tricks involving shutdown+listen+shutdown to try to detect failures on other operating systems that do not support it. But on Linux connect(AF_UNSPEC) also works pretty well and is much cleaner. It still doesn't work on other operating systems but the error is easier to detect and appears safer. So let's switch to this.	2020-10-13 18:15:33 +02:00
Willy Tarreau	7c9f756dcc	MINOR: fd: report an error message when failing initial allocations When starting with a huge maxconn (say 1 billion), the only error seen is "No polling mechanism available". This doesn't help at all to resolve the problem. Let's add specific alerts for the failed mallocs. Now we can get this instead: [ALERT] 286/154439 (23408) : Not enough memory to allocate 2000000033 entries for fdtab! This may be backported as far as 2.0 as it helps debugging bad configurations.	2020-10-13 18:15:33 +02:00
Willy Tarreau	b1e600c9c5	BUG/MINOR: mux-h2: do not stop outgoing connections on stopping There are reports of a few "SC" in logs during reloads when H2 is used on the backend side. Christopher analysed this as being caused by the proxy disabled test in h2_process(). As the comment says, this was done for frontends only, and must absolutely not send a GOAWAY to the backend, as all it will result in is to make newly queued streams fail. The fix consists in simply testing the connection side before deciding to send the GOAWAY. This may be backported as far as 2.0, though for whatever reason it seems to manifest itself only since 2.2 (probably due to changes in the outgoing connection setup sequence).	2020-10-13 18:15:33 +02:00
Willy Tarreau	2bd0f8147b	BUG/MINOR: init: only keep rlim_fd_cur if max is unlimited On some operating systems, RLIM_INFINITY is set to -1 so that when the hard limit on the number of FDs is set to unlimited, taking the MAX of both values keeps rlim_fd_cur and everything works. But on other systems this value is defined as the highest positive integer. This is what was observed on a 32-bit AIX 5.1. The effect is that maxsock becomes 2^31-1 and that fdtab allocation fails. Note that a simple workaround consists in manually setting maxconn in the global section. Let's ignore unlimited as soon as we retrieve rlim_fd_max so that all systems behave consistently. This may be backported as far as 2.0, though it doesn't seem like it has annoyed anyone.	2020-10-13 15:36:08 +02:00
Frédéric Lécaille	3fc0fe05fd	MINOR: peers: heartbeat, collisions and handshake information for "show peers" command. This patch adds a new "coll" counter and the heartbeat timer values to the "show peers" command. It also adds the elapsed time since the last handshake in a new "last_hdshk" peer dump field.	2020-10-09 20:59:58 +02:00
Willy Tarreau	0a002df2c2	BUG/MINOR: proxy: respect the proper format string in sig_pause/sig_listen When factoring out the pause/resume error messages in commit `775e00158` ("MAJOR: signals: use protocol_pause_all() and protocol_resume_all()") I forgot that ha_warning() and send_log() take a format string and not just a const string. No backport is needed, this is 2.3-dev.	2020-10-09 19:26:27 +02:00
Willy Tarreau	ccf429960b	MEDIUM: config: remove the deprecated and dangerous global "debug" directive This one was scheduled for removal in 2.3 since 2.2-dev3 by commit `1b85785bc` ("MINOR: config: mark global.debug as deprecated"). Let's remove it now. It remains totally possible to use -d on the command line though.	2020-10-09 19:18:45 +02:00
Willy Tarreau	ab0a5192a8	MEDIUM: config: mark "grace" as deprecated This was introduced 15 years ago or so to delay the stopping of some services so that a monitoring device could detect its port being down before services were stopped. Since then, clean reloads were implemented and this doesn't cope well with reload at all, preventing the new process from seamlessly binding, and forcing processes to coexist with half-baked configurations. Now it has become a real problem because there's a significant code portion in the proxies that is solely dedicated to this obsolete feature, and dealing with its special cases eases the introduction of bugs in other places so it's about time that it goes. We could tentatively schedule its removal for 2.4 with a hard deadline for 2.5 in any case.	2020-10-09 19:07:01 +02:00
Willy Tarreau	e03204c8e1	MEDIUM: listeners: implement protocol level ->suspend/resume() calls Now we have ->suspend() and ->resume() for listeners at the protocol level. This means that it now becomes possible for a protocol to redefine its own way to suspend and resume. The default functions are provided for TCP, UDP and unix, and they are pass-through to the receiver equivalent as it used to be till now. Nothing was defined for sockpair since it does not need to suspend/resume during reloads, hence it will succeed.	2020-10-09 18:44:37 +02:00
Willy Tarreau	7b2febde1d	MINOR: listeners: split do_unbind_listener() in two The inner part now goes into the protocol and is used to decide how to unbind a given protocol's listener. The existing code which is able to also unbind the receiver was provided as a default function that we currently use everywhere. Some complex listeners like QUIC will use this to decide how to unbind without impacting existing connections, possibly by setting up other incoming paths for the traffic.	2020-10-09 18:44:37 +02:00
Willy Tarreau	f58b8db47b	MEDIUM: receivers: add an rx_unbind() method in the protocols This is used as a generic way to unbind a receiver at the end of do_unbind_listener(). This allows to considerably simplify that function since we can now let the protocol perform the cleanup. The generic code was moved to sock.c, along with the conditional rx_disable() call. Now the code also supports that the ->disable() function of the protocol which acts on the listener performs the close itself and adjusts the RX_F_BOUND flag accordingly.	2020-10-09 18:44:36 +02:00
Willy Tarreau	18c20d28d7	MINOR: listeners: move the LI_O_MWORKER flag to the receiver This listener flag indicates whether the receiver part of the listener is specific to the master or to the workers. In practice it's only used by the master's CLI right now. It's used to know whether or not the FD must be closed before forking the workers. For this reason it's way more of a receiver's property than a listener's property, so let's move it there under the name RX_F_MWORKER. The rest of the code remains unchanged.	2020-10-09 18:43:05 +02:00
Willy Tarreau	75c98d166e	CLEANUP: listeners: remove the do_close argument to unbind_listener() And also remove it from its callers. This subtle distinction was added as sort of a hack for the seamless reload feature but is not needed anymore since do_close became unused with the previous commit ("MEDIUM: listener: let do_unbind_listener() decide whether to close or not"). This also removes the unbind_listener_no_close() function.	2020-10-09 18:41:56 +02:00
Willy Tarreau	374e9af358	MEDIUM: listener: let do_unbind_listener() decide whether to close or not The listener contains all the information needed to decide to close on unbind or not. The rule is the following (when we're not stopping): - worker process unbinding from a worker's FD with socket transfer enabled => keep - master process unbinding from a master's inherited FD => keep - master process unbinding from a master's FD => close - master process unbinding from a worker's FD => close - worker process unbinding from a master's FD => close - worker process unbinding from a worker's FD => close Let's translate that into the function and stop using the do_close argument that is a bit obscure for callers. It was not yet removed to ease code testing.	2020-10-09 18:41:48 +02:00
Willy Tarreau	87acd4e848	BROKEN/MEDIUM: listeners: rework the unbind logic to make it idempotent BROKEN: the failure rate on reg-tests/seamless-reload/abns_socket.vtc has significantly increased for no obvious reason. It fails 99% of the time vs 10% before. do_unbind_listener() is not logical and is not even idempotent. It must not touch the fd if already -1, which also means not touch the receiver. In addition, when performing a partial stop on a socket (not closing), we know the socket remains in the listening state yet it's marked as LI_ASSIGNED, which is confusing as it doesn't translate its real state. With this change, we make sure that FDs marked for close end up in ASSIGNED state and that those which are really bound and on which a listen() was made (i.e. not pause) remain in LISTEN state. This is what is closest to reality. Ideally this function should become a default proto->unbind() one but it may still keep a bit too much state logic to become generalized to other protocols (e.g. QUIC).	2020-10-09 18:29:04 +02:00
Willy Tarreau	d6afb53bdc	MEDIUM: listeners: always close master vs worker listeners Right now in enable_listener(), we used to start all enabled listeners then kill from the workers those that were for the master. But this is incomplete. We must also close from the master the listeners that are solely for workers, and do it before we even start them. Otherwise we end up with a master responding to the worker CLI connections if the listener remains in listen mode to translate the socket's real state. It doesn't seem like it could have caused bugs in the past because we used to aggressively mark disabled listeners as LI_ASSIGNED despite the fact that they were still bound and listening. If this patch were ever seen as a candidate solution for any obscure bug, be careful in that it subtly relies on the fact that fd_delete() doesn't close inherited FDs anymore, otherwise that could break the master's ability to pass inherited FDs on reloads.	2020-10-09 18:29:04 +02:00
Willy Tarreau	95a3460739	MINOR: listener: add a few BUG_ON() statements to detect inconsistencies We must not have an fd==-1 when switching to certain states. This will later disappear but for now it helps detecting inconsistencies.	2020-10-09 18:29:04 +02:00
Willy Tarreau	e122dc5316	MEDIUM: udp: implement udp_suspend() and udp_resume() In Linux kernel's net/ipv4/udp.c there's a udp_disconnect() function which is called when connecting to AF_UNSPEC, and which unhashes a "connection". This property, which is also documented in connect(2) both in Linux and Open Group's man pages for datagrams, is interesting because it allows to reverse a connect() which is in fact a filter on the source. As such we can suspend a receiver by making it connect to itself, which will cause it not to receive any traffic anymore, letting a new one receive it all, then resume it by breaking this connection. This was tested to work well on Linux, other operating systems should also be tested. Before this, sending a SIGTTOU to a process having a UDP syslog forwarder would cause this error: [WARNING] 280/194249 (3268) : Paused frontend GLOBAL. [WARNING] 280/194249 (3268) : Some proxies refused to pause, performing soft stop now. [WARNING] 280/194249 (3268) : Proxy GLOBAL stopped (cumulated conns: FE: 0, BE: 0). [WARNING] 280/194249 (3268) : Proxy sylog-loadb stopped (cumulated conns: FE: 0, BE: 0). With this change, it now proceeds just like with TCP listeners: [WARNING] 280/195503 (3885) : Paused frontend GLOBAL. [WARNING] 280/195503 (3885) : Paused frontend sylog-loadb. And SIGTTIN also works: [WARNING] 280/195507 (3885) : Resumed frontend GLOBAL. [WARNING] 280/195507 (3885) : Resumed frontend sylog-loadb. On Linux this also works with TCP listeners (which can then be resumed using listen()) and established TCP sockets (which we currently kill using setsockopt(so_linger)), both not being portable on other OSes. UNIX sockets and ABNS sockets do not support it however (connect always fails). This needs to be further explored to see if other OSes might benefit from this to perform portable and reliable resets particularly on the backend side.	2020-10-09 18:29:04 +02:00
Willy Tarreau	626f3a7beb	MEDIUM: proxy: make soft_stop() stop most listeners using protocol_stop_now() One difficulty in soft-stopping is to make sure not to forget unlisted listeners. By first doing a pass using protocol_stop_now() we catch the vast majority of them. The few remaining ones are the ones belonging to a proxy having a grace period. For these ones, the proxy will arm its stop_time timer and emit a log message. Since neither UDP listeners nor peers use the grace period, we can already get rid of the special cases there since we know they will have been stopped by the protocols.	2020-10-09 18:29:04 +02:00
Willy Tarreau	02e8557e88	MINOR: protocol: add protocol_stop_now() to instant-stop listeners This will instantly stop all listeners except those which belong to a proxy configured with a grace time. This means that UDP listeners, and peers will also be stopped when called this way.	2020-10-09 18:29:04 +02:00
Willy Tarreau	acde152175	MEDIUM: proxy: centralize proxy status update and reporting There are multiple ways a proxy may switch to the disabled state, but now it's essentially once it loses its last listener. Instead of keeping duplicate code around and reporting the state change before actually seeing it, we now report it at the moment it's performed (from the last listener leaving) which allows to remove the message from all other places.	2020-10-09 18:29:04 +02:00
Willy Tarreau	a389c9e1e3	MEDIUM: proxy: add mode PR_MODE_PEERS to flag peers frontends For now we cannot easily distinguish a peers frontend from another one, which will be problematic to avoid reporting them when stopping their listeners. Let's add PR_MODE_PEERS for this. It's not supposed to cause any issue since all non-HTTP proxies are handled similarly now.	2020-10-09 18:28:21 +02:00
Willy Tarreau	322b9b94e9	MEDIUM: proxy: make stop_proxy() now use stop_listener() The function will stop the listeners using this method, which in turn will ping back once it finishes disabling the proxy.	2020-10-09 18:28:18 +02:00
Willy Tarreau	caa7df1296	MINOR: listeners: add a new stop_listener() function This function will be used to definitely stop a listener (e.g. during a soft_stop). This is actually tricky because it may be called for a proxy or for a protocol, both of which require locks and already hold some. The function takes booleans indicating which ones are already held, hoping this will be enough. It's not well defined whether proto->disable() and proto->rx_disable() are supposed to be called with any lock held, and they are used from do_unbind_listener() with all these locks. Some back annotations ought to be added on this point. The proxy's listeners count is updated, and the proxy is marked as disabled and woken up after the last one is gone. Note that a listener in listen state is already not attached anymore since it was disabled.	2020-10-09 18:27:48 +02:00
Willy Tarreau	455585e3cd	MINOR: listeners: count unstoppable jobs on creation, not deletion We have to count unstoppable jobs which correspond to worker sockpairs, in order to know when to count. However the way it's currently done is quite awkward because these are counted when stopping making the stop mechanism non-idempotent. This is definitely something we want to fix before stopping by protocol or our listeners count will quickly go wrong. Now they are counted when the listeners are created.	2020-10-09 18:25:14 +02:00
Willy Tarreau	b4c083f5bf	MINOR: listeners: split delete_listener() in two versions We'll need an already locked variant of this function so let's make __delete_listener() which will be called with the protocol lock held and the listener's lock held.	2020-10-09 11:27:30 +02:00
Willy Tarreau	4b51f42899	MEDIUM: listeners: now use the listener's ->enable/disable At each place we used to manipulate the FDs directly we can now call the listener protocol's enable/disable/rx_enable/rx_disable depending on whether the state changes on the listener or the receiver. One exception currently remains in listener_accept() which is a bit special and which should be split into 2 or 3 parts in the various protocol layers. The test of fd_updt in do_unbind_listener() that was added by commit `a51885621` ("BUG/MEDIUM: listeners: Don't call fd_stop_recv() if fd_updt is NULL.") could finally be removed since that part is correctly handled in the low-level disable() function. One disable() was added in resume_listener() before switching to LI_FULL because rx_resume() enables polling on the FD for the receiver while we want to disable it if the listener is full. There are different ways to clean this up in the future. One of them could be to consider that TCP receivers only act at the listener level. But in fact it does not translate reality. The reality is that only the receiver is paused and that the listener's state ought not be affected here. Ultimately the resume_listener() function should be split so that the part controlled by the protocols only acts on the receiver, and that the receiver itself notifies the upper listener about the change so that the listener protocol may decide to disable or enable polling. Conversely the listener should automatically update its receiver when they share the same state. Since there is no harm proceeding like this, let's keep this for now.	2020-10-09 11:27:30 +02:00
Willy Tarreau	5ddf1ce9c4	MINOR: protocol: add a new pair of enable/disable methods for listeners These methods will be used to enable/disable accepting new connections so that listeners do not play with FD directly anymore. Since all the currently supported protocols work on socket for now, these are identical to the rx_enable/rx_disable functions. However they were not defined in sock.c since it's likely that some will quickly start to differ. At the moment they're not used. We have to take care of fd_updt before calling fd_{want,stop}_recv() because it's allocated fairly late in the boot process and some such functions may be called very early (e.g. to stop a disabled frontend's listeners).	2020-10-09 11:27:30 +02:00
Willy Tarreau	686fa3db50	MINOR: protocol: add a new pair of rx_enable/rx_disable methods These methods will be used to enable/disable rx at the receiver level so that callers don't play with FDs directly anymore. All our protocols use the generic ones from sock.c at the moment. For now they're not used.	2020-10-09 11:27:30 +02:00
Willy Tarreau	e70c7977f2	MINOR: sock: provide a set of generic enable/disable functions These will be used on receivers, to enable or disable receiving on a listener, which most of the time just consists in enabling/disabling the file descriptor. We have to take care of the existence of fd_updt to know if we may or not call fd_{want,stop}_recv() since it's not permitted in very early boot.	2020-10-09 11:27:30 +02:00
Willy Tarreau	010fe151ce	MINOR: listener: use the protocol's ->rx_resume() method when available Instead of calling listen() for IPPROTO_TCP in resume_listener(), let's call the protocol's ->rx_resume() method when defined, which does the same. This removes another hard-dependency on the fd and underlying protocol from the generic functions.	2020-10-09 11:27:30 +02:00
Willy Tarreau	58e6b71bb0	MINOR: protocol: implement an ->rx_resume() method This one undoes ->rx_suspend(), it tries to restore an operational socket. It was only implemented for TCP since it's the only one we support right now.	2020-10-09 11:27:30 +02:00
Willy Tarreau	cb66ea60cf	MINOR: protocol: replace ->pause(listener) with ->rx_suspend(receiver) The ->pause method is inappropriate since it doesn't exactly "pause" a listener but rather temporarily disables it so that it's not visible at all to let another process take its place. The term "suspend" is more suitable, since the "pause" is actually what we'll need to apply to the FULL and LIMITED states which really need to make a pause in the accept process. And it goes well with the use of the "resume" function that will also need to be made per-protocol. Let's rename the function and make it act on the receiver since it's already what it essentially does, hence the prefix "_rx" to make it more explicit. The protocol struct was a bit reordered because it was becoming a real mess between the parts related to the listeners and those for the receivers.	2020-10-09 11:27:30 +02:00
Willy Tarreau	d7f331c8b8	MINOR: protocol: rename the ->listeners field to ->receivers Since the listeners were split into receiver+listener, this field ought to have been renamed because it's confusing. It really links receivers and not listeners, as most of the time it's used via rx.proto_list! The nb_listeners field was updated accordingly.	2020-10-09 11:27:30 +02:00
Willy Tarreau	dae0692717	CLEANUP: listeners: remove the now unused enable_all_listeners() It's not used anymore since previous commit. The good thing is that no more listener function now directly acts on a protocol.	2020-10-09 11:27:30 +02:00
Willy Tarreau	078e1c7102	CLEANUP: protocol: remove the ->enable_all method It's not used anymore, now the listeners are enabled from protocol_enable_all().	2020-10-09 11:27:30 +02:00
Willy Tarreau	5b95ae6b32	MINOR: protocol: directly call enable_listener() from protocol_enable_all() protocol_enable_all() calls proto->enable_all() for all protocols, which is always equal to enable_all_listeners() which in turn simply is a generic loop calling enable_listener() always returning ERR_NONE. Let's clean this madness by first calling enable_listener() directly from protocol_enable_all().	2020-10-09 11:27:30 +02:00
Willy Tarreau	7834a3f70f	MINOR: listeners: export enable_listener() we'll soon call it from outside.	2020-10-09 11:27:30 +02:00
Willy Tarreau	d008009958	CLEANUP: listeners: remove unused disable_listener and disable_all_listeners These ones have never been called, they were referenced by the protocol's disable_all for some protocols but there are no traces of their use, so in addition to not being sure the code works, it has never been tested. Let's remove a bit of complexity starting from there.	2020-10-09 11:27:30 +02:00
Willy Tarreau	fb4ead8e8a	CLEANUP: protocol: remove the ->disable_all method This one has never been used, is only referenced by proto_uxst and proto_sockpair, and it's not even certain it works at all. Let's get rid of it.	2020-10-09 11:27:30 +02:00
Willy Tarreau	e53608b2cd	MINOR: listeners: move fd_stop_recv() to the receiver's socket code fd_stop_recv() has nothing to do in the generic listener code, it's per protocol as some don't need it. For instance with abns@ it could even lead to fd_stop_recv(-1). And later with QUIC we don't want to touch the fd at all! It used to be that since commit `f2cb169487` delegating fd manipulation to their respective threads it wasn't possible to call it down there but it's not the case anymore, so let's perform the action in the protocol-specific code.	2020-10-09 11:27:30 +02:00
Willy Tarreau	fb76bd5ca6	BUG/MEDIUM: listeners: correctly report pause() errors By using the same "ret" variable in the "if" block to test the return value of pause(), the second one shadows the first one and when forcing the result to zero in case of an error, it doesn't do anything. The problem is that some listeners used to fail to pause in multi-process mode and this was not reported, but their failure was automatically resolved by the last process to pause. By properly checking for errors we might now possibly report a race once in a while so we may have to roll this back later if some users meet it. The test on ==0 is wrong too since technically speaking a total stop validates the need for a pause, but stops the listener so it's just the resume that won't work anymore. We could switch to stopped but it's an involuntary switch and the user will not know. Better then mark it as paused and let the resume continue to fail so that only the resume will eventually report an error (e.g. abns@). This must not be backported as there is a risk of side effect by fixing this bug, given that it hides other bugs itself.	2020-10-09 11:27:30 +02:00
Willy Tarreau	91c614dd0e	MEDIUM: proto_tcp: make the pause() more robust in multi-process In multi-process, the TCP pause is very brittle and we never noticed it because the error was lost in the upper layers. The problem is that shutdown() may fail if another process already did it, and will cause a process to fail to pause. What we do here in case of error is that we double-check the socket's state to verify if it's still accepting connections, and if not, we can conclude that another process already did the job in parallel. The difficulty here is that we're trying to eliminate false positives where some OSes will silently report a success on shutdown() while they don't shut the socket down, hence this dance of shutw/listen/shutr that only keeps the compatible ones. A new approach relying on connect(AF_UNSPEC) would probably provide better results.	2020-10-09 11:27:30 +02:00
Willy Tarreau	1accacbcc3	CLEANUP: proxy: remove the now unused pause_proxies() and resume_proxies() They're not used anymore, delete them before someone thinks about using them again!	2020-10-09 11:27:30 +02:00
Willy Tarreau	775e00158a	MAJOR: signals: use protocol_pause_all() and protocol_resume_all() When temporarily pausing the listeners with SIG_TTOU, we now pause all listeners via the protocols instead of the proxies. This has the benefits that listeners are paused regardless of whether or not they belong to a visible proxy. And for resuming via SIG_TTIN we do the same, which allows to report binding conflicts and address them, since the operation can be repeated on a per-listener basis instead of a per-proxy basis. While in appearance all cases were properly handled, it's impossible to completely rule out the possibility that something broken used to work by luck due to the scan ordering which is naturally different, hence the major tag.	2020-10-09 11:27:30 +02:00
Willy Tarreau	09819d1118	MINOR: protocol: introduce protocol_{pause,resume}_all() These two functions are used to pause and resume all listeners of all protocols. They use the standard listener functions for this so they're supposed to handle the situation gracefully regardless of the upper proxies' states, and they will report completion on proxies once the switch is performed. It might be nice to define a particular "failed" state for listeners that cannot resume and to count them on proxies in order to mention that they're definitely stuck. On the other hand, the current situation is retryable which is quite appreciable as well.	2020-10-09 11:27:30 +02:00
Willy Tarreau	58651b42fc	MEDIUM: listener/proxy: make the listeners notify about proxy pause/resume Till now, we used to call pause_proxy()/resume_proxy() to enable/disable processing on a proxy, which is used during soft reloads. But since we want to drive this process from the listeners themselves, we have to instead proceed the other way around so that when we enable/disable a listener, it checks if it changed anything for the proxy and notifies about updates at this level. The detection is made using li_ready=0 for pause(), and li_paused=0 for resume(). Note that we must not include any test for li_bound because this state is seen by processes which share the listener with another one and which must not act on it since the other process will do it. As such the socket behind the FD will automatically be paused and resume without its local state changing, but this is the limit of a multi-process system with shared listeners.	2020-10-09 11:27:30 +02:00
Willy Tarreau	5d7f9ce831	MINOR: listeners: check the current listener earlier state in resume_listener() It's quite confusing to have the test on LI_READY very low in the function as it should be made much earlier. Just like with previous commit, let's do it when entering. The additional states, however (limited, full) continue to go through the whole function.	2020-10-09 11:27:30 +02:00
Willy Tarreau	9b3a932777	MINOR: listeners: check the current listener state in pause_listener() It's better not to try to perform pause() actions on wrong states, so let's check this and make sure that all callers are now safe. This means that we must not try to pause a listener which is already paused (e.g. it could possibly fail if the pause operation isn't idempotent at the socket level), nor should we try it on earlier states.	2020-10-09 11:27:30 +02:00
Willy Tarreau	337c835d16	MEDIUM: proxy: merge zombify_proxy() with stop_proxy() The two functions don't need to be distinguished anymore since they have all the necessary info to act as needed on their listeners. Let's just pass via stop_proxy() and make it check for each listener which one to close or not.	2020-10-09 11:27:30 +02:00
Willy Tarreau	43ba3cf2b5	MEDIUM: proxy: remove start_proxies() Its sole remaining purpose was to display "proxy foo started", which has little benefit and pollutes output for those with plenty of proxies. Let's remove it now. The VTCs were updated to reflect this, because many of them had explicit counts of dropped lines to match this message. This is tagged as MEDIUM because some users may be surprised by the loss of this quite old message.	2020-10-09 11:27:30 +02:00
Willy Tarreau	c3914d4fff	MEDIUM: proxy: replace proxy->state with proxy->disabled The remaining proxy states were only used to distinguish an enabled proxy from a disabled one. Due to the initialization order, both PR_STNEW and PR_STREADY were equivalent after startup, and they would only differ from PR_STSTOPPED when the proxy is disabled or shutdown (which is effectively another way to disable it). Now we just have a "disabled" field which allows to distinguish them. It's becoming obvious that start_proxies() is only used to print a greeting message now, that we'd rather get rid of. Probably that zombify_proxy() and stop_proxy() should be merged once their differences move to the right place.	2020-10-09 11:27:30 +02:00
Willy Tarreau	1ad64acf6c	CLEANUP: peers: don't use the PR_ST* states to mark enabled/disabled The enabled/disabled config options were stored into a "state" field that is an integer but contained only PR_STNEW or PR_STSTOPPED, which is a bit confusing, and causes a dependency with proxies. This was renamed to "disabled" and is used as a boolean. The field was also moved to the end of the struct to stop creating a hole and fill another one.	2020-10-09 11:27:30 +02:00
Willy Tarreau	b50bf046e8	MINOR: startup: don't rely on PR_STNEW to check for listeners Instead of looking at listeners in proxies in PR_STNEW state, we'd rather check for listeners in those not in PR_STSTOPPED as it's only this state which indicates the proxy was disabled. And let's check the listeners count instead of testing the list's head.	2020-10-09 11:27:30 +02:00
Willy Tarreau	f18d968830	MEDIUM: proxy: remove state PR_STPAUSED This state was used to mention that a proxy was in PAUSED state, as opposed to the READY state. This was causing some trouble because if a listener failed to resume (e.g. because its port was temporarily in use during the resume), it was not possible to retry the operation later. Now by checking the number of READY or PAUSED listeners instead, we can accurately know if something went bad and try to fix it again later. The case of the temporary port conflict during resume now works well: $ socat readline /tmp/sock1 prompt > disable frontend testme3 > disable frontend testme3 All sockets are already disabled. > enable frontend testme3 Failed to resume frontend, check logs for precise cause (port conflict?). > enable frontend testme3 > enable frontend testme3 All sockets are already enabled.	2020-10-09 11:27:30 +02:00
Willy Tarreau	a17c91b37f	MEDIUM: proxy: remove the PR_STERROR state This state is only set when a pause() fails but isn't even set when a resume() fails. And we cannot recover from this state. Instead, let's just count remaining ready listeners to decide to emit an error or not. It's more accurate and will better support new attempts if needed.	2020-10-09 11:27:30 +02:00
Willy Tarreau	6b3bf733dd	MEDIUM: proxy: remove the unused PR_STFULL state Since v1.4 or so, it's almost not possible anymore to set this state. The only exception is by using the CLI to change a frontend's maxconn setting below its current usage. This case makes no sense, and for other cases it doesn't make sense either because "full" is a vague concept when only certain listeners are full and not all. Let's just remove this unused state and make it clear that it's not reported. The "ready" or "open" states will continue to be reported without being misleading as they will be opposed to "stop".	2020-10-09 11:27:30 +02:00
Willy Tarreau	efc0eec4c1	MINOR: proxy: maintain per-state counters of listeners The proxy state tries to be synthetic but that doesn't work well with many listeners, especially for transition phases or after a failed pause/resume. In order to address this, we'll instead rely on counters of listeners in a given state for the 3 major states (ready, paused, listen) and a total counter. We'll now be able to determine a proxy's state by comparing these counters only.	2020-10-09 11:27:30 +02:00
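The counter-based approach described in efc0eec4c1 can be sketched roughly as follows. This is a simplified illustration, not the HAProxy implementation: all `demo_*` names are hypothetical, the state enum is reduced to four values, and the field names `li_all` and `li_listen` are assumptions (only `li_ready` and `li_paused` are mentioned in the log). The key idea is that every state change goes through one wrapper, which keeps the per-state counters consistent so that the proxy's condition can be derived by comparing counters.

```c
/* Hypothetical, reduced set of listener states. */
enum demo_li_state { DEMO_LI_INIT, DEMO_LI_LISTEN, DEMO_LI_PAUSED, DEMO_LI_READY };

struct demo_proxy {
    int li_all;    /* total number of attached listeners */
    int li_listen; /* listeners in the LISTEN state */
    int li_paused; /* listeners in the PAUSED state */
    int li_ready;  /* listeners in the READY state */
};

struct demo_listener {
    enum demo_li_state state;
    struct demo_proxy *px;
};

/* Adjust the counter matching a state; INIT is deliberately not counted. */
static void demo_count(struct demo_proxy *px, enum demo_li_state st, int delta)
{
    switch (st) {
    case DEMO_LI_LISTEN: px->li_listen += delta; break;
    case DEMO_LI_PAUSED: px->li_paused += delta; break;
    case DEMO_LI_READY:  px->li_ready  += delta; break;
    default: break;
    }
}

/* Attach a fresh listener to a proxy in the INIT state. */
static void demo_attach(struct demo_listener *l, struct demo_proxy *px)
{
    l->state = DEMO_LI_INIT;
    l->px = px;
    px->li_all++;
}

/* Single entry point for state changes, so counters never drift. */
static void demo_set_state(struct demo_listener *l, enum demo_li_state st)
{
    demo_count(l->px, l->state, -1);
    l->state = st;
    demo_count(l->px, st, +1);
}

/* Pause is complete when no listener is ready and at least one is paused. */
static int demo_proxy_is_paused(const struct demo_proxy *px)
{
    return px->li_ready == 0 && px->li_paused > 0;
}

/* A resume is incomplete when some listeners are ready and others still paused. */
static int demo_proxy_resume_incomplete(const struct demo_proxy *px)
{
    return px->li_ready > 0 && px->li_paused > 0;
}
```

Because an incomplete resume is visible from the counters alone, the operation can simply be retried later, which matches the retryable behaviour the surrounding commits aim for.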
Willy Tarreau	a37b244509	MINOR: listeners: introduce listener_set_state() This function is used as a wrapper to set a listener's state everywhere. We'll use it later to maintain some counters in a consistent state when switching state so it's capital that all state changes go through it. No functional change was made beyond calling the wrapper.	2020-10-09 11:27:30 +02:00
Willy Tarreau	bec7ab0ad9	CLEANUP: proxy: remove the first_to_listen hack in zombify_proxy() This thing was needed for an optimization used in soft_stop() which doesn't exist anymore, so let's remove it as it's cryptic and hinders the listeners cleanup.	2020-10-09 11:27:29 +02:00
Willy Tarreau	987dbf5bab	MINOR: listeners: do not uselessly try to close zombie listeners in soft_stop() The loop doesn't match anymore since the non-started listeners are in LI_INIT and even if it had ever worked the benefit of closing zombies at this point looks void at best.	2020-10-09 11:27:29 +02:00
Willy Tarreau	c6dac6c7f5	MEDIUM: listeners: remove the now unused ZOMBIE state The zombie state is not used anymore by the listeners, because in the last two cases where it was tested it couldn't match as it was covered by the test on the process mask. Instead now the FD is either in the LISTEN state or the INIT state. This also avoids forcing the listener to be single-dimensional because actually belonging to another process isn't totally exclusive with the other states, which explains some of the difficulties requiring to check the proc_mask and the fd sometimes. So let's get rid of it now not to be tempted to reuse it. The doc on the listeners state was updated.	2020-10-09 11:27:29 +02:00
Willy Tarreau	ae7bc4a237	MEDIUM: deinit: close all receivers/listeners before scanning proxies Because of the zombie state, proxies have a skewed vision of the state of listeners, which explains why there are hacks switching the state from ZOMBIE to INIT in the proxy cleaning loop. This is particularly complicated and not needed, as all the information is now available in the protocol list and the fdtab. What we do here instead is to first close all active listeners or receivers by protocol and clean their protocol parts. Then we scan the fdtab to get rid of remaining ones that were necessarily in INIT state after a previous invocation of delete_listener(). From this point, we know the listeners are cleaned, so they can safely be freed by scanning the proxies.	2020-10-09 11:27:29 +02:00
Willy Tarreau	b6607bfaf0	MEDIUM: listeners: make unbind_listener() converge if needed The ZOMBIE state on listener is a real mess. Listeners passing through this state have lost their consistency with the proxy AND with the fdtab. Plus this state is not used for all foreign listeners, only for those belonging to a proxy that entirely runs on another process, otherwise it stays in INIT state, which makes the usefulness extremely questionable. But the real issue is that it's impossible to untangle the receivers from the proxy state as long as we have this because of deinit()... So what we do here is to start by making unbind_listener() support being called more than once. This will permit to call it again to really close the FD and finish the operations if it's called with an FD that's in a fake state (such as INIT but with a valid fd).	2020-10-09 11:27:29 +02:00
Willy Tarreau	02b092f006	MEDIUM: init: stop disabled proxies after initializing fdtab During the startup process we don't have any fdtab nor fd_updt for quite a long time, and as such some operations on the listeners are not permitted, such as fd_want_/fd_stop_ or fd_delete(). The latter is of particular concern because it's used when stopping a disabled frontend, and it's performed very early during check_config_validity() while there is no fdtab yet. The trick till now relies on the listener's state which is a bit brittle. There is absolutely no valid reason for stopping a proxy's listeners this early, we can postpone it after init_pollers() which will at least have allocated fdtab.	2020-10-09 11:27:29 +02:00
Willy Tarreau	cb89e32f31	MEDIUM: listeners: don't bounce listeners management between queues During 2.1 development, commit `f2cb16948` ("BUG/MAJOR: listener: fix thread safety in resume_listener()") was introduced to bounce the enabling/disabling of a listener's FD to one of its threads because the remains of fd_update_cache() were fundamentally incompatible with the need to call fd_want_recv() or fd_stop_recv() for another thread. However since then we've totally dropped such code and it's totally safe to use these functions on an FD that is solely used by another thread (this is even used by the FD migration code). The only remaining limitation concerning the wake up delay was addressed by previous commit "MEDIUM: fd: always wake up one thread when enabling a foreing FD". The current situation forces the FD management to remain in the pause_listener() and resume_listener() functions just so that it can bounce between threads, without having the ability to delegate it to the suitable protocol layer. So let's first remove this now unneeded workaround.	2020-10-09 11:27:29 +02:00
Willy Tarreau	f015887444	MEDIUM: fd: always wake up one thread when enabling a foreing FD Since 2.2 it's safe to enable/disable another thread's FD but the fd_wake calls will not immediately be considered because nothing wakes the other threads up. This will have an impact on listeners when deciding to resume them after they were paused, so at minima we want to wake up one of their threads, just like the scheduler does on task_kill(). This is what this patch does.	2020-10-09 11:27:29 +02:00
Christopher Faulet	b8d148a93f	BUG/MINOR: http-htx: Expect no body for 204/304 internal HTTP responses 204 and 304 HTTP responses must not contain a message body. These status codes are correctly handled when the responses are received from a server. But there is no specific processing for internal HTTP responses (errorfiles and http replies). Now, when errorfiles or http replies are parsed during the configuration parsing, an error is triggered if a 204/304 message contains a body. An extra check is also performed to ensure the body length matches the announced content-length. This patch should fix the issue #891. It must be backported as far as 2.0. For 2.1 and 2.0, only the http_str_to_htx() function must be fixed. http_parse_http_reply() function does not exist.	2020-10-09 10:02:09 +02:00
Christopher Faulet	5563392554	BUG/MINOR: http: Fix content-length of the default 500 error 96 bytes are announced in the C-L header for a message body of 97 bytes. This bug was introduced by the patch `46a030cdd` ("CLEANUP: assorted typo fixes in the code and comments"). This patch must be backported in all versions where the patch above is (the 2.2 for now).	2020-10-09 10:02:09 +02:00
Christopher Faulet	aade4edc1a	BUG/MEDIUM: mux-h2: Don't handle pending read0 too early on streams This patch is similar to the previous one on the fcgi. Same is true for the H2. But the bug is far harder to trigger because of the protocol cinematic. But it may explain strange aborts in some edge cases. A read0 received on the connection must not be handled too early by H2 streams. If the demux buffer is not empty, the pending read0 must not be considered. The H2 streams must not be passed in half-closed remote state in h2s_wake_one_stream() and the CS_FL_EOS flag must not be set on the associated conn-stream in h2_rcv_buf(). To sum up, it means, if there are still data pending in the demux buffer, no abort must be reported to the streams. To fix the issue, a dedicated function has been added, responsible for detecting pending read0 for a H2 connection. A read0 is reported only if the demux buffer is empty. This function is used instead of conn_xprt_read0_pending() at some places. Note that the HREM stream state should not be used to report aborts. It is performed on h2s_wake_one_stream() function and it is a legacy of the very first versions of the mux-h2. This patch should be backported as far as 2.0. In the 1.8, the code is too different to apply it like that. But it is probably useless because the mux-h2 can only be installed on the client side.	2020-10-09 10:02:09 +02:00
Christopher Faulet	6670e3e2bf	BUG/MEDIUM: mux-fcgi: Don't handle pending read0 too early on streams A read0 received on the connection must not be handled too early by FCGI streams. If the demux buffer is not empty, the pending read0 must not be considered. The FCGI streams must not be passed in half-closed remote state in fcgi_strm_wake_one_stream() and the CS_FL_EOS flag must not be set on the associated conn-stream in fcgi_rcv_buf(). To sum up, it means, if there are still data pending in the demux buffer, no abort must be reported to the streams. To fix the issue, a dedicated function has been added, responsible for detecting pending read0 for a FCGI connection. A read0 is reported only if the demux buffer is empty. This function is used instead of conn_xprt_read0_pending() at some places. This patch should fix the issue #886. It must be backported as far as 2.1.	2020-10-09 10:02:00 +02:00
Emeric Brun	b0c331f71f	BUG/MINOR: proxy/log: frontend/backend and log forward names must differ This patch disallow to use same name for a log forward section and a frontend/backend section.	2020-10-08 08:53:26 +02:00
Emeric Brun	cbb7bf7dd1	MEDIUM: log: syslog TCP support on log forward section. This patch re-introduce the "bind" statement on log forward sections to handle syslog TCP listeners as defined in rfc-6587. As complement it introduce "maxconn", "backlog" and "timeout client" statements to parameter those listeners.	2020-10-07 17:17:27 +02:00
Emeric Brun	6d75616951	MINOR: channel: new getword and getchar functions on channel. This patch adds two new functions to get a char or a word from a channel.	2020-10-07 17:17:27 +02:00
Emeric Brun	2897644ae5	MINOR: stats: inc req counter on listeners. This patch enables count of requests for listeners if listener's counters are enabled.	2020-10-07 17:17:27 +02:00
Emeric Brun	c47ba59d1e	BUG/MEDIUM: log: old processes with log foward section don't die on soft stop. Old processes didn't die if a log forward section was declared and a soft stop was requested. This patch fixes this issue and should be backported in branches including the log forward feature.	2020-10-07 17:17:27 +02:00
Emeric Brun	a39ecbdac1	BUG/MINOR: proxy: inc req counter on new syslog messages. Increase req counter instead of conn counter on new syslog messages. This should be backported on branches including the syslog forward feature.	2020-10-07 17:17:27 +02:00
Christopher Faulet	9589aa0fe5	CLEANUP: sock-unix: Remove an unreachable goto clause Coverity reported dead code in sock_unix_bind_receiver() function. A goto clause is unreachable because of the preceding if/else block. This patch should fix the issue #865. No backport needed.	2020-10-07 14:37:03 +02:00
Christopher Faulet	7b06d3adaa	MINOR: mux-h1: Don't wakeup the H1C when output buffer become available There is no reason to wake up the H1 connection when a new output buffer is retrieved after an allocation failure because only the H1 stream will fill it.	2020-10-07 14:07:29 +02:00
Christopher Faulet	e9da975aab	BUG/MINOR: mux-h1: Always set the session on frontend h1 stream The session is always defined for a frontend connection. When a new client connection is established, the session is set for the first H1 stream. But on keep-alived connections, it is not set for the followings H1 streams while it is possible. This patch is tagged as a bug because it fixes an inconsistency in the H1 streams creation. But it does not fixed a known bug. This patch must be backported as far as 2.0.	2020-10-07 14:07:29 +02:00
Christopher Faulet	69f2cb8df3	BUG/MINOR: mux-h1: Be sure to only set CO_RFL_READ_ONCE for the first read The condition to set CO_RFL_READ_ONCE flag is not really accurate. We must check the request state on frontend connection only and, in the opposite, the response state on backend connection only. Only the parsed side must be considered, not the opposite one. This patch must be backported to 2.2.	2020-10-07 14:07:29 +02:00
Christopher Faulet	58feb49ed2	CLEANUP: ssl: Release cached SSL sessions on deinit On deinit, when the server SSL ctx is released, we must take care to release the cached SSL sessions stored in the array <ssl_ctx.reused_sess>. There are global.nbthread entries in this array, each one may have a pointer on a cached session. This patch should fix the issue #802. No backport needed.	2020-10-07 14:07:29 +02:00
Tim Duesterhus	d7c6e6a71d	CLEANUP: cache: Fix leak of cconf->c.name during config check During the config check, the post parsing is not performed. Thus, cache filters are not fully initialized and their cache name are never released. To be able to release them, a flag is now set when a cache filter is fully initialized. On deinit, if the flag is not set, it means the cache name must be freed. The patch should fix #849. No backport needed. [Cf: Tim is the patch author, but I added the commit message]	2020-10-07 14:07:29 +02:00
Christopher Faulet	a10000305f	BUG/MINOR: proto_tcp: Report warning messages when listeners are bound When a TCP listener is bound, in the tcp_bind_listener() function, a warning message may be reported and should be displayed on verbose mode. But the warning message is actually lost if the socket is successfully bound because we don't fill the <errmsg> variable in this case. This patch should fix the issue #863. No backport is needed.	2020-10-07 14:07:16 +02:00
Frédéric Lécaille	e7e2b21d27	BUG/MINOR: peers: Inconsistency when dumping peer status codes. A peer connection status must be considered as valid only if there is an applet which has been instantiated for the connection to the peer. So, ->statuscode should be considered as the last known peer connection status from the last connection to this peer if any. To reflect this, "statuscode" field of peer dump is renamed to "last_statuscode". This patch also add "active"/"inactive" field after the peer location type ("remote" or "local") if an applet has been instantiated for this peer connection or not. Thank you to Emeric for having noticed this issue. Must be backported in >=1.9 version.	2020-10-07 07:27:01 +02:00
Amaury Denoyelle	27373f7f75	MINOR: stats: remove for loop declaration Remove variable declaration inside a for-loop. This was introduced by my patches serie of the implementation of dynamic stats. This is not supported by older gcc, notably on the freebsd environment of the ci.	2020-10-05 17:55:40 +02:00
Amaury Denoyelle	fbd0bc98fe	MINOR: dns/stats: integrate dns counters in stats Use the new stats module API to integrate the dns counters in the standard stats. This is done in order to avoid code duplication, keep the code related to cli out of dns and use the full possibility of the stats function, allowing to print dns stats in csv or json format.	2020-10-05 12:02:14 +02:00
Amaury Denoyelle	0b70a8a314	MINOR: stats: add config "stats show modules" By default, hide the extra statistics on the html page. Define a new flag STAT_SHMODULES which is activated if the config "stats show modules" is set.	2020-10-05 12:02:14 +02:00
Amaury Denoyelle	e3f576c29e	MINOR: stats: display extra proxy stats on the html page Integrate the additional proxy stats on the html stats page. For each module, a new column is displayed with the individual stats available as a tooltip.	2020-10-05 12:02:14 +02:00
Amaury Denoyelle	d3700a7fda	MINOR: stats: support clear counters for dynamic stats Add a boolean 'clearable' on stats module structure. If set, it forces all the counters to be reset on 'clear counters' cli command. If not, the counters are reset only when 'clear counters all' is used.	2020-10-05 12:02:14 +02:00
Amaury Denoyelle	ee63d4bd67	MEDIUM: stats: integrate static proxies stats in new stats This is executed on startup with the registered statistics module. The existing statistics have been merged in a list containing all statistics for each domain. This is useful to print all available statistics in a generic way. Allocate extra counters for all proxies/servers/listeners instances. These counters are allocated with the counters from the stats modules registered on startup.	2020-10-05 12:02:14 +02:00
Amaury Denoyelle	58d395e0d6	MEDIUM: stats: define an API to register stat modules A stat module can be registered to quickly add new statistics on haproxy. It must be attached to one of the available stats domain. The register must be done using INITCALL on STG_REGISTER. The stat module has a name which should be unique for each new module in a domain. It also contains a statistics list with their name/desc and a pointer to a function used to fill the stats from the module counters. The module also provides the initial counters values used on automatically allocated counters. The offset for these counters are stored in the module structure.	2020-10-05 12:02:14 +02:00
Amaury Denoyelle	50660a894d	MEDIUM: stats: add delimiter for static proxy stats on csv Use the character '-' to mark the end of static statistics on proxy domain. After this marker, the order of the fields is not guaranteed and should be parsed with care.	2020-10-05 12:02:14 +02:00
Amaury Denoyelle	72b16e5173	MINOR: stats: define additional flag px cap on domain This flag can be used to determine on what type of proxy object the statistics should be relevant. It will be useful when adding dynamic statistics. Currently, this flag is not used.	2020-10-05 12:02:14 +02:00
Amaury Denoyelle	072f97eddf	MINOR: stats: define the concept of domain for statistics The domain option will be used to have statistics attached to other objects than proxies/listeners/servers. At the moment, only the PROXY domain is available. Add an argument 'domain' on the 'show stats' cli command to specify the domain. Only 'domain proxy' is available now. If not specified, proxy will be considered the default domain. For HTML output, only proxy statistics will be displayed.	2020-10-05 12:02:14 +02:00
Christopher Faulet	f98d821b94	MINOR: hlua: Display debug messages on stderr only in debug mode Debug Messages emitted in lua using core.Debug() or core.log() are now only displayed on stderr if HAProxy is started in debug mode (-d parameter on the command line). There is no change for other message levels. This patch should fix the issue #879. It may be backported to all stable versions.	2020-10-05 11:11:36 +02:00
Amaury Denoyelle	98b81cb393	REORG: stats: extract proxies dump loop in a function Create a dedicated function to loop on proxies and dump them. This will be clearer when other object will be dump as well. This patch is needed to extend stat support to components other than proxies objects.	2020-10-05 10:54:35 +02:00
Amaury Denoyelle	f34017bb74	REORG: stats: extract proxy json dump Create a dedicated function to dump a proxy as a json content. This patch will be needed when other types of objects will be available for json dump. This patch is needed to extend stat support to components other than proxies objects.	2020-10-05 10:53:50 +02:00
Amaury Denoyelle	da5b6d1cd9	MINOR: stats: hide px/sv/li fields in applet struct Use an opaque pointer to store proxy instance. Regroup server/listener as a single opaque pointer. This has the benefit to render the structure more evolutive to support statistics on other types of objects in the future. This patch is needed to extend stat support for components other than proxies objects. The prometheus module has been adapted for these changes.	2020-10-05 10:48:58 +02:00
Amaury Denoyelle	97323c9ed4	MINOR: stats: add stats size as a parameter for csv/json dump Render the stats size parametric in csv/json dump functions. This is needed for the future patch which provides dynamic stats. For now the static value ST_F_TOTAL_FIELDS is provided. Remove unused parameter px on stats_dump_one_line. This patch is needed to extend stat support to components other than proxies objects.	2020-10-05 09:06:10 +02:00
Amaury Denoyelle	3ca927e68f	REORG: stats: export some functions Un-mark stats_dump_one_line and stats_putchk as static and export them in the header file. These functions will be reusable by other components to print their statistics. This patch is needed to extend stat support to components other than proxies objects.	2020-10-05 09:06:10 +02:00
Amaury Denoyelle	a53ce4cc01	BUG/MINOR: stats: fix validity of the json schema The json schema seems to be invalid when checking using the validator from https://www.jsonschemavalidator.net/. Correct it using the following specification : http://json-schema.org/draft/2019-09/json-schema-validation.html#rfc.section.9.1 The impact of the bug is not well known as I am not sure of how useful the json schema is for users. It is probably not used at all or else this bug would have been reported. This should be backported up to 1.8.	2020-10-05 09:06:06 +02:00
William Lallemand	51f784bcf9	CLEANUP: ssl: "bundle" is not an OpenSSL wording There is a confusion between the HAProxy bundle and OpenSSL. OpenSSL does not have "bundles" but multiple certificates in the same store. Fix a commentary in the crt-list code.	2020-10-02 18:11:47 +02:00
Christopher Faulet	f7177271f3	BUG/MINOR: tcpcheck: Set socks4 and send-proxy flags before the connect call Since the health-check refactoring in the 2.2, the checks through a socks4 proxy are broken. To fix this bug, CO_FL_SOCKS4 flag must be set on the connection before calling the connect() callback function because this flags is checked to use the right destination address. The same is done for the CO_FL_SEND_PROXY flag for a consistency purpose. A reg-test has been added to test the "check-via-socks4" directive. This patch must be backported to 2.2.	2020-10-02 17:14:34 +02:00
Christopher Faulet	2079a4ad36	MEDIUM: tcp-rules: Warn if a track-sc* content rule doesn't depend on content The warning is only emitted for HTTP frontend. Idea is to encourage the usage of "tcp-request session" rules to track counters that does not depend on the request content. The documentation has been updated accordingly. The warning is important because since the multiplexers were added in the processing chain, the HTTP parsing is performed at a lower level. Thus parsing errors are detected in the multiplexers, before the stream creation. In HTTP/2, the error is reported by the multiplexer itself and the stream is never created. This difference has a certain number of consequences, one of which is that HTTP request counting in stick tables only works for valid H2 request, and HTTP error tracking in stick tables never considers invalid H2 requests but only invalid H1 ones. And the aim is to do the same with the mux-h1. This change will not be done for the 2.3, but the 2.4. At the end, H1 and H2 parsing errors will be caught by the multiplexers, at the session level. Thus, tracking counters at the content level should be reserved for rules using a key based on the request content or those using ACLs based on the request content. To be clear, a warning will be emitted for the following rules : tcp-request content track-sc0 src tcp-request content track-sc0 src if ! { src 10.0.0.0/24 } tcp-request content track-sc0 src if { ssl_fc } But not for the following ones : tcp-request content track-sc0 req.hdr(host) tcp-request content track-sc0 src if { req.hdr(host) -m found }	2020-10-02 15:50:26 +02:00
Eric Salama	7cea6065ac	BUG/MINOR: Fix several leaks of 'log_tag' in init(). We use chunk_initstr() to store the program name as the default log-tag. If we use the log-tag directive in the config file, this chunk will be destroyed and replaced. chunk_initstr() sets the chunk size to 0 so we will free the chunk itself, but not its content. This happens for a global section and also for a proxy. We fix this by using chunk_initlen() instead of chunk_initstr(). We also check that the memory allocation was successful, otherwise we quit. This fixes github issue #850. It can be backported as far as 1.9, with minor adjustments to includes.	2020-10-02 15:50:26 +02:00
William Dauchy	1d0206e71f	MINOR: ssl: remove uneeded check in crtlist_parse_file this condition is never true as we either break or goto error, so those two lines could be removed in the current state of the code. this is fixing github issue #862 Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-10-02 15:43:01 +02:00
Tim Duesterhus	b9f6accc9e	MINOR: ssl: Add error if a crt-list might be truncated Similar to warning during the parsing of the regular configuration file that was added in `2fd5bdb439` this patch adds a warning to the parsing of a crt-list if the file does not end in a newline (and thus might have been truncated). The logic essentially just was copied over. It might be good to refactor this in the future, allowing easy re-use within all line-based config parsers. see https://github.com/haproxy/haproxy/issues/860#issuecomment-693422936 see `0354b658f0` This should be backported as a warning to 2.2.	2020-10-02 12:29:03 +02:00
Tim Duesterhus	6d07fae3c0	CLEANUP: ssl: Use structured format for error line report during crt-list parsing This reuses the known `parsing [%s:%d]:` from regular config file error reporting.	2020-10-02 12:29:03 +02:00
Willy Tarreau	fe2cc41151	BUILD: tools: fix minor build issue on isspace() Previous commit `fa41cb679` ("MINOR: tools: support for word expansion of environment in parse_line") introduced two new isspace() on a char and broke the build on systems using an array disguised in a macro instead of a function (like cygwin). Just use the usual cast.	2020-10-01 18:05:48 +02:00
Amaury Denoyelle	fa41cb6792	MINOR: tools: support for word expansion of environment in parse_line Allow the syntax "${...[*]}" to expand an environment variable containing several values separated by spaces as individual arguments. A new flag PARSE_OPT_WORD_EXPAND has been added to toggle this feature on parse_line invocation. In case of an invalid syntax, a new error PARSE_ERR_WRONG_EXPAND will be triggered. This feature has been asked on the github issue #165.	2020-10-01 17:24:14 +02:00
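The word expansion described in `fa41cb6792` can be pictured as splitting one space-separated value into individual arguments. The following is only a minimal sketch under that reading: `sk_split_words()` is a hypothetical helper, not HAProxy's parse_line(), and it ignores quoting and escaping entirely.

```c
#include <stddef.h>

/* Hypothetical helper for illustration only (not HAProxy code).
 * Splits <value> in place on spaces, stores at most <max> word pointers
 * into <args> and returns the number of words stored. */
size_t sk_split_words(char *value, char **args, size_t max)
{
    size_t n = 0;
    char *p = value;

    while (*p && n < max) {
        while (*p == ' ')           /* skip leading/separating spaces */
            p++;
        if (!*p)
            break;
        args[n++] = p;              /* start of a new word */
        while (*p && *p != ' ')
            p++;
        if (*p)
            *p++ = '\0';            /* terminate the word in place */
    }
    return n;
}
```

With a buffer holding `"  -a  -b value "`, this stores three words (`-a`, `-b`, `value`), which is the effect the "${...[*]}" syntax is described as having on a variable holding several values.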
Willy Tarreau	82cd5c13a5	OPTIM: backend: skip LB when we know the backend is full For some algos (roundrobin, static-rr, leastconn, first) we know that if there is any request queued in the backend, it's because a previous attempt failed at finding a suitable server after trying all of them. This alone is sufficient to decide that the next request will skip the LB algo and directly reach the backend's queue. Doing this alone avoids an O(N) lookup when load-balancing on a saturated farm of N servers, which starts to be very expensive for hundreds of servers, especially under the lbprm lock. This change alone has increased the request rate from 110k to 148k RPS for 200 saturated servers on 8 threads, and fwlc_reposition_srv() doesn't show up anymore in perf top. See github issue #880 for more context. It could have been the same for random, except that random is performed using a consistent hash and it only considers a small set of servers (2 by default), so it may result in queueing at the backend despite having some free slots on unknown servers. It's no big deal though since random() only performs two attempts by default. For hashing algorithms this is pointless since we don't queue at the backend, except when there's no hash key found, which is the least of our concerns here.	2020-09-29 17:18:37 +02:00
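The shortcut from `82cd5c13a5` can be sketched as follows. This is an illustration under simplifying assumptions, not the actual backend code: the `sk_*` types and the linear scan are hypothetical stand-ins for the real LB algorithms, and locking is left out.

```c
#include <stddef.h>

/* Hypothetical, simplified types for illustration only. */
struct sk_server {
    int cur_conns;
    int maxconn;
};

struct sk_backend {
    struct sk_server *srv;
    size_t nbsrv;
    int queued;                     /* requests waiting in the backend queue */
};

/* Returns the index of a server with a free slot, or -1 when the request
 * has been added to the backend queue. */
int sk_pick_server(struct sk_backend *be)
{
    size_t i;

    /* Something is already queued: an earlier attempt tried every server
     * and failed, so skip the O(N) scan and queue directly. */
    if (be->queued > 0) {
        be->queued++;
        return -1;
    }

    for (i = 0; i < be->nbsrv; i++) {
        if (be->srv[i].cur_conns < be->srv[i].maxconn) {
            be->srv[i].cur_conns++;
            return (int)i;
        }
    }

    be->queued++;                   /* every server is full */
    return -1;
}
```

The point of the early return is that a saturated farm pays a constant cost per request instead of walking all N servers each time.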
Willy Tarreau	b88ae18021	OPTIM: backend/random: never queue on the server, always on the backend If random() returns a server whose maxconn is reached or the queue is used, instead of adding the request to the server's queue, better add it to the backend queue so that it can be served by any server (hence the fastest one).	2020-09-29 17:18:11 +02:00
William Lallemand	20b0fed28c	BUG/MINOR: ssl/crt-list: exit on warning out of crtlist_parse_line() We should not exit on error out of the crtlist_parse_line() function. The cfgerr error must be checked with the ERR_CODE mask. Must be backported in 2.2.	2020-09-28 15:48:54 +02:00
Miroslav Zagorac	a6aca669b5	BUILD: trace: include tools.h If the TRACE option is used when compiling the haproxy source, the following error occurs on debian 9.13: src/calltrace.o: In function `make_line': .../src/calltrace.c:204: undefined reference to `rdtsc' src/calltrace.o: In function `calltrace': .../src/calltrace.c:277: undefined reference to `rdtsc' collect2: error: ld returned 1 exit status Makefile:866: recipe for target 'haproxy' failed	2020-09-25 17:54:48 +02:00
Willy Tarreau	82cd028d71	BUG/MINOR: listeners: properly close listener FDs The code dealing with zombie proxies in soft_stop() is bogus, it uses close() instead of fd_delete(), leaving a live entry in the fdtab with a dangling pointer to a free memory location. The FD might be reassigned for an outgoing connection for the time it takes the proxy to completely stop, or could be dumped on the CLI's "show fd" command. In addition, the listener's FD was not even reset, leaving doubts about whether or not it will happen again in deinit(). And in deinit(), the loop in charge of closing zombie FDs is particularly unsafe because it closes the fd then calls unbind_listener() then delete_listener() hoping none of them will touch it again. Since it requires some mental effort to figure what's done there, let's correctly reset the fd here as well and close it using fd_delete() to eliminate any remaining doubts. It's uncertain whether this should be backported. Zombie proxies are rare and the situations capable of triggering such issues are not trivial to set up. However it's easy to imagine how things could go wrong if backported too far. Better wait for any matching report if at all (this code has been there since 1.8 without anybody noticing).	2020-09-25 13:46:47 +02:00
Willy Tarreau	02e1975c29	BUG/MEDIUM: listeners: do not pause foreign listeners There's a nasty case with listeners that belong to foreign processes. If a proxy is defined this way: global nbproc 2 frontend f bind :1111 process 1 bind :2222 process 2 and if stats expose-fd listeners is set, the listeners' FDs will not be closed on the processes that don't use them. At this point it's not a big deal, except that they're shared between processes and that a "disable frontend f" issued on one process will pause all of them and cause the other process to see accept() fail, turning its own listener to state LI_LIMITED to try to leave it some time to recover. But it will never recover, even after an enable. The root cause of the issue is that the ZOMBIE state doesn't cover this situation since it's only for a proxy being entirely bound to a process. What we do here to address this is that we refrain from pausing a file descriptor that belongs to a foreign process in pause_listener(). This definitely solves the problem. A similar test is present in resume_listener() and is the reason why the FD doesn't recover upon the "enable" action by the way. This ought to be backported to 1.8 where seamless reload was integrated. The config above should be sufficient to validate that the fix works; after a pair of "disable/enable frontend" no process will handle the traffic to one of the ports anymore.	2020-09-25 13:46:47 +02:00
Willy Tarreau	57a374131c	MINOR: backend: add a new "path-only" option to "balance uri" Since we've fixed the way URIs are handled in 2.1, some users have started to experience inconsistencies in "balance uri" between requests received over H1 and the same ones received over H2. This is caused by the fact that H1 rarely uses absolute URIs while H2 always uses them. Similar issues were reported already around replace-uri etc, leading to "pathq" recently being introduced, so this isn't new. Here what this patch does is add a new option to "balance uri" to indicate that the hashing should only start at the path and not cover the authority. This makes H1 relative URIs and H2 absolute URI hashes equally again. Some extra options could be added to normalize URIs by always hashing the authority (or host) in front of them, which would make sure that both absolute and relative requests provide the same hash. This is left for later if needed.	2020-09-23 08:56:29 +02:00
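The effect of "path-only" in `57a374131c` can be illustrated by skipping the scheme and authority before hashing. The sketch below is an assumption-laden illustration, not the backend code: `sk_uri_path()` and `sk_hash()` are hypothetical names, the hash is a plain djb2 rather than whatever "balance uri" really uses, and the other "balance uri" options are ignored.

```c
#include <string.h>

/* Hypothetical helper for illustration only (not HAProxy code).
 * Returns a pointer to the path part of <uri>. Relative URIs (starting
 * with '/') are returned unchanged; for absolute ones the scheme and the
 * authority are skipped. */
const char *sk_uri_path(const char *uri)
{
    const char *p;

    if (*uri == '/')
        return uri;
    p = strstr(uri, "://");
    if (!p)
        return uri;
    p = strchr(p + 3, '/');
    return p ? p : "/";             /* "http://host" has an empty path */
}

/* djb2 string hash, used here only as a stand-in hash function. */
unsigned long sk_hash(const char *s)
{
    unsigned long h = 5381;

    while (*s)
        h = h * 33 + (unsigned char)*s++;
    return h;
}
```

Hashing `sk_uri_path("https://example.com/a/b")` and `sk_uri_path("/a/b")` then gives the same value, which is the H1/H2 consistency the option is meant to restore.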
Willy Tarreau	3d1119d225	MINOR: backend: make the "whole" option of balance uri take only one bit We'll want to add other boolean options on "balance uri", so let's make some room aside "whole" and make it take only one bit and not one int.	2020-09-23 08:05:47 +02:00
Amaury Denoyelle	36b536652f	BUG/MINOR: config: Fix memory leak on config parse listen This memory leak happens if there are two or more defaults sections. When the default proxy is reinitialized, the structure member containing the config filename must be freed. Fix github issue #851. Should be backported as far as 1.6.	2020-09-18 16:17:09 +02:00
Eric Salama	1aab911017	BUG/MINOR: Fix memory leaks cfg_parse_peers When memory allocation fails in cfg_parse_peers or when an error occurs while parsing a stick-table, the temporary table and its id must be freed. This fixes github issue #854. It should be backported as far as 2.0.	2020-09-18 12:06:08 +02:00
Christopher Faulet	d2414a23c4	BUG/MINOR: http-fetch: Don't set the sample type during the htx prefetch A subtle bug was introduced by the commit `a6d9879e6` ("BUG/MEDIUM: htx: smp_prefetch_htx() must always validate the direction"), for the "method" sample fetch only. The sample data type and the method id are always overwritten because smp_prefetch_htx() function is called later in the sample fetch evaluation. The bug is in the smp_prefetch_htx() function but it is only visible for the "method" sample fetch, for an unknown method. In fact, when smp_prefetch_htx() is called, the sample object is altered. The data type is set to SMP_T_BOOL and, on success, the data value is set to 1. Thus, if the caller has already set some infos into the sample object, they may be lost. AFAIK, there is no reason to do so. It is inherited from the legacy HTTP code and I honestly don't know why it was done this way. So, instead of fixing the "method" sample fetch to set useful info after the call to smp_prefetch_htx() function, I prefer to not alter the sample object in smp_prefetch_htx(). This patch must be backported as far as 2.0. On the 2.0, only the HTX part must be fixed.	2020-09-18 11:06:24 +02:00
Willy Tarreau	bba7a4dafd	BUG/MINOR: h2/trace: do not display "stream error" after a frame ACK When sending a frame ACK, the parser state is not equal to H2_CS_FRAME_H and we used to report it as an error, which is not true. In fact we should only indicate when we skip remaining data. This may be backported as far as 2.1.	2020-09-18 07:41:28 +02:00
Willy Tarreau	8520d87198	MINOR: h2/trace: also display the remaining frame length in traces It's often missing when debugging, even though it's often zero for control frames or after data are consumed.	2020-09-18 07:39:29 +02:00
Willy Tarreau	f2cda10b1d	BUILD: sock_inet: include errno.h I was careful to have it for sock_unix.c but missed it for sock_inet which broke with commit `36722d227` ("MINOR: sock_inet: report the errno string in binding errors") depending on the build options. No backport is needed.	2020-09-17 14:02:01 +02:00
Willy Tarreau	3cd58bf805	MINOR: sock_unix: report the errno string in binding errors Just like with previous patch, let's report UNIX socket binding errors in plain text. we can now see for example: [ALERT] 260/083531 (13365) : Starting frontend f: cannot switch final and temporary UNIX sockets (Operation not permitted) [/tmp/root.sock] [ALERT] 260/083640 (13375) : Starting frontend f: cannot change UNIX socket ownership (Operation not permitted) [/tmp/root.sock]	2020-09-17 08:35:38 +02:00
Willy Tarreau	36722d2274	MINOR: sock_inet: report the errno string in binding errors With the socket binding code cleanup it becomes easy to add more info to error messages. One missing thing used to be the error string, which is now added after the generic one, for example: [ALERT] 260/082852 (12974) : Starting frontend f: cannot bind socket (Permission denied) [0.0.0.0:4] [ALERT] 260/083053 (13292) : Starting frontend f: cannot bind socket (Address already in use) [0.0.0.0:4444] [ALERT] 260/083104 (13298) : Starting frontend f: cannot bind socket (Cannot assign requested address) [1.1.1.1:4444]	2020-09-17 08:32:17 +02:00
Willy Tarreau	eb8cfe6723	BUILD: sock_unix: add missing errno.h It builds fine when openssl is enabled, but fails otherwise. No backport is needed.	2020-09-16 22:15:40 +02:00
Willy Tarreau	af9609b4d1	MINOR: tools: drop listener detection hack from str2sa_range() We used to resort to a trick to detect whether the caller was a listener or an outgoing socket in order never to present an AF_CUST_UDP* socket to a log server nor a nameserver. This is no longer necessary, the socket type alone will be enough.	2020-09-16 22:08:08 +02:00
Willy Tarreau	2b5e0d8b6a	MEDIUM: proto_udp: replace last AF_CUST_UDP* with AF_INET* We don't need to cheat with the sock_domain anymore, we now always have the SOCK_DGRAM sock_type as a complementary selector. This patch restores the sock_domain to AF_INET* in the udp* protocols and removes all traces of the now unused AF_CUST_*.	2020-09-16 22:08:08 +02:00
Willy Tarreau	b2ffc99bbd	MEDIUM: tools: make str2sa_range() use protocol_lookup() By doing so we can remove the hard-coded mapping from AF_INET to AF_CUST_UDP but we still need to keep the test on the listeners as long as these dummy families remain present in the code.	2020-09-16 22:08:08 +02:00
Willy Tarreau	910c64da96	MEDIUM: protocol: store the socket and control type in the protocol array The protocol array used to be only indexed by socket family, which is very problematic with UDP (requiring an extra family) and with the forthcoming QUIC (also requiring an extra family), especially since that binds them to certain families, prevents them from supporting dgram UNIX sockets etc. In order to address this, we now start to register the protocols with more info, namely the socket type and the control type (either stream or dgram). This is sufficient for the protocols we have to deal with, but could also be extended further if multiple protocol variants were needed. But as is, it still fits nicely in an array, which is convenient for lookups that are instant.	2020-09-16 22:08:08 +02:00
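The indexing scheme described in `910c64da96` can be sketched as a small three-dimensional array keyed by address family, socket type and control type. Everything named `sk_*` or `SK_*` below is hypothetical and only illustrates the idea; the real protocol struct, array bounds and registration function are not shown in this log.

```c
#include <stddef.h>
#include <sys/socket.h>

/* Hypothetical names and sizes for illustration only. */
enum sk_ctrl_type { SK_CTRL_STREAM = 0, SK_CTRL_DGRAM = 1, SK_CTRL_TYPES };

struct sk_protocol {
    const char *name;
};

#define SK_MAX_FAM 64               /* families above this are rejected here */

/* [family][socket type: 0=stream, 1=dgram][control type] */
static const struct sk_protocol *sk_protos[SK_MAX_FAM][2][SK_CTRL_TYPES];

static int sk_sock_idx(int sock_type)
{
    return sock_type == SOCK_DGRAM;
}

/* Registers <p> for the given triple; returns 0 on success, -1 on error. */
int sk_register(const struct sk_protocol *p, int family, int sock_type,
                enum sk_ctrl_type ctrl)
{
    if (family < 0 || family >= SK_MAX_FAM)
        return -1;
    sk_protos[family][sk_sock_idx(sock_type)][ctrl] = p;
    return 0;
}

/* Constant-time lookup; returns NULL if nothing is registered there. */
const struct sk_protocol *sk_lookup(int family, int sock_type,
                                    enum sk_ctrl_type ctrl)
{
    if (family < 0 || family >= SK_MAX_FAM)
        return NULL;
    return sk_protos[family][sk_sock_idx(sock_type)][ctrl];
}
```

In such a layout TCP and UDP over AF_INET occupy distinct slots without needing an extra pseudo-family, and the (AF_INET, dgram socket, stream control) slot stays free for a protocol that offers streams on top of datagrams.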
Willy Tarreau	a54553f74f	MINOR: protocol: add the control layer type in the protocol struct This one will be needed to more accurately select a protocol. It may differ from the socket type for QUIC, which uses dgram at the socket layer and provides stream at the control layer. The upper level requests a control layer only so we need this field.	2020-09-16 22:08:08 +02:00
Willy Tarreau	65ec4e3ff7	MEDIUM: tools: make str2sa_range() check that the protocol has ->connect() Most callers of str2sa_range() need the protocol only to check that it provides a ->connect() method. It used to be used to verify that it's a stream protocol, but it might be a bit early to get rid of it. Let's keep the test for now but move it to str2sa_range() when the new flag PA_O_CONNECT is present. This way almost all call places could be cleaned from this. There's a strange test in the server address parsing code that rechecks the family from the socket which seems to be a duplicate of the previously removed tests. It will have to be rechecked.	2020-09-16 22:08:08 +02:00
Willy Tarreau	5fc9328aa2	MINOR: tools: make str2sa_range() directly return the protocol We'll need this so that it can return pointers to stacked protocol in the future (for QUIC). In addition this removes a lot of tests for protocol validity in the callers. Some of them were checked further apart, or after a call to str2listener() and they were simplified as well. There's still a trick, we can fail to return a protocol in case the caller accepts an fqdn for use later. This is what servers do and in this case it is valid to return no protocol. A typical example is: server foo localhost:1111	2020-09-16 22:08:08 +02:00
Willy Tarreau	9b3178df23	MINOR: listener: pass the chosen protocol to create_listeners() The function will need to use more than just a family, let's pass it the selected protocol. The caller will then be able to do all the fancy stuff required to pick the best protocol.	2020-09-16 22:08:08 +02:00
Willy Tarreau	5e1779abbf	MEDIUM: config: make str2listener() not accept datagram sockets anymore str2listener() was temporarily hacked to support datagram sockets for the log-forward listeners. This has an undesirable side effect that "bind udp@1.2.3.4:5555" was silently accepted as TCP for a bind line. We don't need this hack anymore since the only user (log-forward) now relies on str2receiver(). Now such an address will properly be rejected.	2020-09-16 22:08:08 +02:00
Willy Tarreau	26ff5dabc0	MINOR: log-forward: use str2receiver() to parse the dgram-bind address Thanks to this we don't need to specify "udp@" as it's implicitly a datagram type listener that is expected, so any AF_INET/AF_INET6 address will work.	2020-09-16 22:08:08 +02:00
Willy Tarreau	aa333123f2	MINOR: cfgparse: add str2receiver() to parse dgram receivers This is at least temporary, as the migration at once is way too difficult. For now it still creates listeners but only allows DGRAM sockets. This aims at easing the split between listeners and receivers.	2020-09-16 22:08:08 +02:00
Willy Tarreau	62a976cd44	MINOR: tools: remove the central test for "udp" in str2sa_range() Now we only rely on dgram type associated with AF_INET/AF_INET6 to infer UDP4/UDP6. We still keep the hint based on PA_O_SOCKET_FD to detect that the caller is a listener though. It's still far from optimal but UDP remains rooted into the protocols and needs to be taken out first.	2020-09-16 22:08:08 +02:00
Willy Tarreau	3baec249b1	MEDIUM: tools: make str2sa_range() only report AF_CUST_UDP on listeners For now only listeners can make use of AF_CUST_UDP and it requires hacks in the DNS and logsrv code to remap it to AF_INET. Make str2sa_range() smarter by detecting that it's called for a listener and only set these protocol families for listeners. This way we can get rid of the hacks.	2020-09-16 22:08:08 +02:00
Willy Tarreau	e835bd8f91	MINOR: tools: start to distinguish stream and dgram in str2sa_range() The parser now supports a socket type for the control layer and a possible other one for the transport layer. Usually they are the same except for protocols like QUIC which will provide a stream transport layer based on a datagram control layer. The default types are preset based on the caller's expectations, and may be refined using "stream+" and "dgram+" prefixes. For now they were not added to the documentation because other changes will probably happen around UDP as well. It is conceivable that "tcpv4@" or "udpv6@" will appear later as aliases for "stream+ipv4" or "dgram+ipv6".	2020-09-16 22:08:08 +02:00
Willy Tarreau	a215be282d	MEDIUM: tools: make str2sa_range() check for the sockpair's FD usability Just like for inherited sockets, we want to make sure that FDs that are mentioned in "sockpair@" are actually usable. Right now this test is performed by the callers, but not everywhere. Typically, the following config will fail if fd #5 is not bound: frontend bind sockpair@5 But this one will pass if fd #6 is not bound: backend server s1 sockpair@6 Now both will return an error in such a case: - 'bind' : cannot use file descriptor '5' : Bad file descriptor. - 'server s1' : cannot use file descriptor '6' : Bad file descriptor. As such the test in str2listener() is not needed anymore (and it was wrong by the way, as it used to test for the socket by overwriting the local address with a new address that's made of the FD encoded on 16 bits and happens to still be at the same place, but that strictly depends on whatever the kernel wants to put there).	2020-09-16 22:08:08 +02:00
Willy Tarreau	804f11fdf8	MINOR: config: do not test an inherited socket again Since the previous patch we know that a successfully bound fd@XXX socket is returned as its own protocol family from str2sa_range() and not as AF_CUST_EXISTING_FD anymore, so we don't need to check for that case in str2listener().	2020-09-16 22:08:08 +02:00
Willy Tarreau	6edc722093	MEDIUM: tools: make str2sa_range() resolve pre-bound listeners When str2sa_range() is invoked for a bind or log line, and it gets a file descriptor number, it will immediately resolve the socket's address (when it's a socket) so that the address family, address and port are correctly set. This will later allow to resolve some transport protocols that are attached to existing FDs. For raw FDs (e.g. logs) and for socket pairs, the FD number is still returned in the address, because we need the underlying address management to complete the bind/listen/connect/whatever needed. One immediate benefit is that passing a bad FD will now result in one of these errors: 'bind' : cannot use file descriptor '3' : Socket operation on non-socket. 'bind' : socket on file descriptor '3' is of the wrong type. Note that as of now, we never return a listening socket with a family of AF_CUST_EXISTING_FD. The only case where this family is seen is for a raw FD (e.g. logs).	2020-09-16 22:08:08 +02:00
Willy Tarreau	895992619d	MINOR: log: detect LOG_TARGET_FD from the fd and not from the syntax Now that we have the FD value reported we don't need to cheat and detect "fd@" in the address, we can safely rely on the FD value.	2020-09-16 22:08:08 +02:00
Willy Tarreau	a93e5c7fae	MINOR: tools: make str2sa_range() optionally return the fd If a file descriptor was passed, we can optionally return it. This will be useful for listening sockets which are both a pre-bound FD and a ready socket.	2020-09-16 22:08:08 +02:00
Willy Tarreau	909c23b086	MINOR: listener: remove the inherited arg to create_listener() This argument can now safely be determined from fd != -1, let's just drop it.	2020-09-16 22:08:08 +02:00
Willy Tarreau	328199348b	MINOR: tools: add several PA_O_* flags in str2sa_range() callers These flags indicate whether the call is made to fill a bind or a server line, or even just send/recv calls (like logs or dns). Some special cases are made for outgoing FDs (e.g. pipes for logs) or socket FDs (e.g external listeners), and there's a distinction between stream or dgram usage that's expected to significantly help str2sa_range() proceed appropriately with the input information. For now they are not used yet.	2020-09-16 22:08:08 +02:00
Willy Tarreau	8b0fa8f0ab	MEDIUM: config: remove all checks for missing/invalid ports/ranges Now that str2sa_range() checks for appropriate port specification, we don't need to implement adhoc test cases in every call place, if the result is valid, the conditions are met otherwise the error message is appropriately filled.	2020-09-16 22:08:08 +02:00
Willy Tarreau	7f96a8474c	MEDIUM: tools: make str2sa_range() validate callers' port specifications Now str2sa_range() will enforce the caller's port specification passed using the PA_O_PORT_* flags, and will return an error on failure. For optional ports, values 0-65535 will be enforced. For mandatory ports, values 1-65535 are enforced. In case of ranges, it is also verified that the upper bound is not lower than the lower bound, as this used to result in empty listeners. I couldn't find an easy way to test this using VTC since the purpose is to trigger parse errors, so instead a test file is provided as tests/ports.cfg with comments about what errors are expected for each line.	2020-09-16 22:08:08 +02:00
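The rules stated in `7f96a8474c` (0-65535 for optional ports, 1-65535 for mandatory ones, upper bound not below lower bound) can be sketched as a small check. The flag names below are hypothetical: the log only says that the real ones are prefixed with PA_O_PORT_, so the `SK_PORT_*` values and `sk_check_ports()` are assumptions made for illustration.

```c
#include <stdbool.h>

/* Hypothetical flags for illustration only; not the PA_O_PORT_* values. */
#define SK_PORT_OK    0x01          /* a port may be present */
#define SK_PORT_MAND  0x02          /* a port must be present */
#define SK_PORT_RANGE 0x04          /* a low-high range is accepted */

/* Returns true if the port specification matches the caller's
 * expectations. <has_port> tells whether a port was given at all; <low>
 * and <high> are the parsed bounds (equal for a single port). */
bool sk_check_ports(unsigned int flags, bool has_port, long low, long high)
{
    long min;

    if (!has_port)
        return !(flags & SK_PORT_MAND);
    if (!(flags & (SK_PORT_OK | SK_PORT_MAND)))
        return false;               /* caller accepts no port at all */
    if (low != high && !(flags & SK_PORT_RANGE))
        return false;               /* ranges not accepted here */
    if (high < low)
        return false;               /* would yield an empty listener set */

    min = (flags & SK_PORT_MAND) ? 1 : 0;
    return low >= min && high <= 65535;
}
```

For instance, a mandatory port of 0 is rejected while an optional port of 0 is accepted, and a reversed range such as 2000-1000 fails instead of silently producing no listener.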
Willy Tarreau	809587635e	MINOR: tools: add several PA_O_PORT_* flags in str2sa_range() callers These flags indicate what is expected regarding port specifications. Some callers accept none, some need fixed ports, some have it mandatory, some support ranges, and some take an offset. Each possibility is reflected by an option. For now they are not exploited, but the goal is to instrument str2sa_range() to properly parse that.	2020-09-16 22:08:07 +02:00
Willy Tarreau	cd3a5591f6	MINOR: tools: make str2sa_range() take more options than just resolve We currently have an argument to require that the address is resolved but we'll soon add more, so let's turn it into a bit field. The old "resolve" boolean is now PA_O_RESOLVE.	2020-09-16 22:08:07 +02:00
Willy Tarreau	5a7beed67b	CLEANUP: tools: make str2sa_range() less awful for fd@ and sockpair@ The code is built to match prefixes at one place and to parse the address as a second step, except for fd@ and sockpair@ where the test first passes via AF_UNSPEC that is changed again. This is ugly and confusing, so let's proceed like for the other ones.	2020-09-16 22:08:07 +02:00
Willy Tarreau	a5b325f92c	MINOR: protocol: add a real family for existing FDs At some places (log fd@XXX, bind fd@XXX) we support using an explicit file descriptor number, that is placed into the sockaddr for later use. The problem is that till now it was done with an AF_UNSPEC family, which is also used for other situations like missing info or rings (for logs). Let's create an "official" family AF_CUST_EXISTING_FD for this case so that we are certain the FD can be found in the address when it is set.	2020-09-16 22:08:07 +02:00
Willy Tarreau	1e984b73f0	CLEANUP: protocol: remove family-specific fields from struct protocol This removes the following fields from struct protocol that are now retrieved from the protocol family instead: .sock_family, .sock_addrlen, .l3_addrlen, .addrcmp, .bind, .get_src, .get_dst. This also removes the UDP-specific udp{,6}_get_{src,dst}() functions which were referenced but not used yet. Their goal was only to remap the original AF_INET* addresses to AF_CUST_UDP*. Note that .sock_domain is still there as it's used as a selector for the protocol struct to be used.	2020-09-16 22:08:07 +02:00
Willy Tarreau	f1f660978c	MINOR: protocol: retrieve the family-specific fields from the family We now take care of retrieving sock_family, l3_addrlen, bind(), addrcmp(), get_src() and get_dst() from the protocol family and not just the protocol itself. There are very few places, this was only seldom used. Interestingly, sock_inet.c used to rely on ->sock_family instead of ->sock_domain, and sock_unix.c used to hard-code PF_UNIX instead of using ->sock_domain. Also it appears obvious we have something wrong in the protocol selection algorithm because sock_domain is the one set to the custom protocols while it ought to be sock_family instead, which would avoid having to hard-code some conversions for UDP namely.	2020-09-16 22:08:07 +02:00
Willy Tarreau	b0254cb361	MINOR: protocol: add a new proto_fam structure for protocol families We need to specially handle protocol families which regroup common functions used for a given address family. These functions include bind(), addrcmp(), get_src() and get_dst() for now. Some fields are also added about the address family, socket domain (protocol family passed to the socket() syscall), and address length. These protocol families are referenced from the protocols but not yet used.	2020-09-16 22:08:07 +02:00
Willy Tarreau	ad33acf838	MEDIUM: protocol: do not call proto->bind() anymore from bind_listener() All protocol's listeners now only take care of themselves and not of the receiver anymore since that's already being done in proto_bind_all(). Now it finally becomes obvious that UDP doesn't need a listener, as the only thing it does is to set the listener's state to LI_LISTEN!	2020-09-16 22:08:07 +02:00
Willy Tarreau	fc974887ce	MEDIUM: protocol: explicitly start the receiver before the listener Now protocol_bind_all() starts the receivers before their respective listeners so that ultimately we won't need the listeners for non- connected protocols. We still have to resort to an ugly trick to set the I/O handler in case of syslog over UDP because for now it's still not set in the receiver, so we hard-code it.	2020-09-16 22:08:07 +02:00
Willy Tarreau	9eda7a6d62	MEDIUM: proto_sockpair: make use of sockpair_bind_receiver() Now we rely on the address family's receiver instead of binding everything ourselves.	2020-09-16 22:08:07 +02:00
Willy Tarreau	62292b28a3	MEDIUM: sockpair: implement sockpair_bind_receiver() Note that for now we don't have a sockpair.c file to host that unusual family, so the new function was placed directly into proto_sockpair.c. It's no big deal given that this family is currently not shared with multiple protocols. The function does almost nothing but setting up the receiver. This is normal as the socket the FDs are passed onto are supposed to have been already created somewhere else, and the only usable identifier for such a socket pair is the receiving FD itself. The function was assigned to sockpair's ->bind() and is not used yet.	2020-09-16 22:08:07 +02:00
Willy Tarreau	cd5e5eaf50	MEDIUM: uxst: make use of sock_unix_bind_receiver() This removes all the AF_UNIX-specific code from uxst_bind_listener() and now simply relies on sock_unix_bind_receiver() to do the same job. As mentioned in the previous commit, the only difference is that now an unlikely failure on listen() will not result in a roll back of the temporary socket names since they will have been renamed during the bind() operation (as expected). But such failures do not correspond to any normal case and mostly denote operating system issues so there's no functionality loss here.	2020-09-16 22:08:07 +02:00
Willy Tarreau	1e0a860099	MEDIUM: sock_unix: implement sock_unix_bind_receiver() This function performs all the bind-related stuff for UNIX sockets that was previously done in uxst_bind_listener(). There is a very tiny difference however, which is that previously, in the unlikely event where listen() would fail, it was still possible to roll back the binding and rename the backup to the original socket. Now we have to rename it before returning, hence it will be done before calling listen(). However, this doesn't cover any particular use case since listen() has no reason to fail there (and the rollback is not done for inherited sockets), that was just done that way as a generic error processing path. The code is not used yet and is referenced in the uxst proto's ->bind().	2020-09-16 22:08:07 +02:00
Willy Tarreau	2f7687d0e8	MEDIUM: udp: make use of sock_inet_bind_receiver() This removes all the AF_INET-specific code from udp_bind_listener() and now simply relies on sock_inet_bind_receiver() to do the same job. The function is now basically just a wrapper around sock_inet_bind_receiver().	2020-09-16 22:08:07 +02:00
Willy Tarreau	af9a7f5bb0	MEDIUM: tcp: make use of sock_inet_bind_receiver() This removes all the AF_INET-specific code from tcp_bind_listener() and now simply relies on sock_inet_bind_receiver() to do the same job. The function is now roughly cut in half and its error path significantly simplified.	2020-09-16 22:08:07 +02:00
Willy Tarreau	d69ce1ffbc	MEDIUM: sock_inet: implement sock_inet_bind_receiver() This function collects all the receiver-specific code from both tcp_bind_listener() and udp_bind_listener() in order to provide a more generic AF_INET/AF_INET6 socket binding function. For now the API is not very elegant because some info are still missing from the receiver while there's no ideal place to fill them except when calling ->listen() at the protocol level. It looks like some polishing code is needed in check_config_validity() or somewhere around this in order to finalize the receivers' setup. The main issue is that listeners and receivers are created before bind_conf options are parsed and that there's no finishing step to resolve some of them. The function currently sets up a receiver and subscribes it to the poller. In an ideal world we wouldn't subscribe it but let the caller do it after having finished to configure the L4 stuff. The problem is that the caller would then need to perform an fd_insert() call and to possibly set the exported flag on the FD while it's not its job. Maybe an improvement could be to have a separate sock_start_receiver() call in sock.c. For now the function is not used but it will soon be. It's already referenced as tcp and udp's ->bind().	2020-09-16 22:08:07 +02:00
Willy Tarreau	b3580b19c8	MINOR: protocol: rename the ->bind field to ->listen The function currently is doing both the bind() and the listen(), so let's call it ->listen so that the bind() operation can move to another place.	2020-09-16 22:08:07 +02:00
Willy Tarreau	c049c0d5ad	MINOR: sock: make sock_find_compatible_fd() only take a receiver We don't need to have a listener anymore to find an fd, a receiver with its settings properly set is enough now.	2020-09-16 22:08:07 +02:00
Willy Tarreau	3fd3bdc836	MINOR: receiver: move the FOREIGN and V6ONLY options from listener to settings The new RX_O_FOREIGN, RX_O_V6ONLY and RX_O_V4V6 options are now set into the rx_settings part during the parsing, so that we don't need to adjust them in each and every listener anymore. We have to keep both v4v6 and v6only due to the precedence from v6only over v4v6.	2020-09-16 22:08:07 +02:00
Willy Tarreau	43046fa4f4	MINOR: listener: move the INHERITED flag down to the receiver It's the receiver's FD that's inherited from the parent process, not the listener's so the flag must move to the receiver so that appropriate actions can be taken.	2020-09-16 22:08:07 +02:00
Willy Tarreau	0b9150155e	MINOR: receiver: add a receiver-specific flag to indicate the socket is bound In order to split the receiver from the listener, we'll need to know that a socket is already bound and ready to receive. We used to do that via the LI_O_ASSIGNED state but that's not sufficient anymore since the receiver might not belong to a listener anymore. The new RX_F_BOUND flag is used for this.	2020-09-16 22:08:07 +02:00
Willy Tarreau	818a92e87a	MINOR: listener: prefer to retrieve the socket's settings via the receiver Some socket settings used to be retrieved via the listener and the bind_conf. Now instead we use the receiver and its settings whenever appropriate. This will simplify the removal of the dependency on the listener.	2020-09-16 22:08:07 +02:00
Willy Tarreau	eef454224d	MINOR: receiver: link the receiver to its owner A receiver will have to pass a context to be installed into the fdtab for use by the handler. We need to set this into the receiver struct as the bind will happen longer after the configuration.	2020-09-16 22:08:07 +02:00
Willy Tarreau	0fce6bce34	MINOR: receiver: link the receiver to its settings Just like listeners keep a pointer to their bind_conf, receivers now also have a pointer to their rx_settings. All those belonging to a listener are automatically initialized with a pointer to the bind_conf's settings.	2020-09-16 22:08:07 +02:00
Willy Tarreau	4dfabfed13	MINOR: listener: make sock_find_compatible_fd() check the socket type sock_find_compatible_fd() can now access the protocol via the receiver hence it can access its socket type and know whether the receiver has dgram or stream sockets, so we don't need to hack around AF_CUST_UDP* anymore there.	2020-09-16 22:08:07 +02:00
Willy Tarreau	b743661f04	REORG: listener: move the listener's proto to the receiver The receiver is the one which depends on the protocol while the listener relies on the receiver. Let's move the protocol there. Since there's also a list element to get back to the listener from the proto list, this list element (proto_list) was moved as well. For now when scanning protos, we still see listeners which are linked by their rx.proto_list part.	2020-09-16 22:08:05 +02:00
Willy Tarreau	38ba647f9f	REORG: listener: move the receiving FD to struct receiver The listening socket is represented by its file descriptor, which is generic to all receivers and not just listeners, so it must move to the rx struct. It's worth noting that in order to extend receivers and listeners to other protocols such as QUIC, we'll need other handles than file descriptors here, and that either a union or a cast to uintptr_t will have to be used. This was not done yet and the field was preserved under the name "fd" to avoid adding confusion.	2020-09-16 22:08:03 +02:00
Willy Tarreau	371590661e	REORG: listener: move the listening address to a struct receiver The address will be specific to the receiver so let's move it there.	2020-09-16 22:08:01 +02:00
Willy Tarreau	be56c1038f	MINOR: listener: move the network namespace to the struct settings The netns is common to all listeners/receivers and is used to bind the listening socket so it must be in the receiver settings and not in the listener. This removes yet another set of unnecessary loops.	2020-09-16 20:13:13 +02:00
Willy Tarreau	7e307215e8	MINOR: listener: move the interface to the struct settings The interface is common to all listeners/receivers and is used to bind the listening socket so it must be in the receiver settings and not in the listener. This removes some unnecessary loops.	2020-09-16 20:13:13 +02:00
Willy Tarreau	e26993c098	MINOR: listener: move bind_proc and bind_thread to struct settings As mentioned previously, these two fields come under the settings struct since they'll be used to bind receivers as well.	2020-09-16 20:13:13 +02:00
Willy Tarreau	6e459d7f92	MINOR: listener: create a new struct "settings" in bind_conf There currently is a large inconsistency in how binding parameters are split between bind_conf and listeners. It happens that for historical reasons some parameters are available at the listener level but cannot be configured per-listener but only for a bind_conf, and thus, need to be replicated. In addition, some of the bind_conf parameters are in fact for the listening socket itself while others are for the instantiated sockets. A previous attempt at splitting listeners into receivers failed because the boundary between all these settings is not well defined. This patch introduces a level of listening socket settings in the bind_conf, that will be detachable later. Such settings that are solely for the listening socket are: - unix socket permissions (used only during binding) - interface (used for binding) - network namespace (used for binding) - process mask and thread mask (used during startup) The rest seems to be used only to initialize the resulting sockets, or to control the accept rate. For now, only the unix params (bind_conf->ux) were moved there.	2020-09-16 20:13:13 +02:00
Willy Tarreau	e42d87f3de	BUG/MINOR: dns: gracefully handle the "udp@" address format for nameservers Just like with previous commit, DNS nameservers are affected as well with addresses starting in "udp@", but here it's different, because due to another bug in the DNS parser, the address is rejected, indicating that it doesn't have a ->connect() method. Similarly, the DNS code believes it's working on top of TCP at this point and this used to work because of this. The same fix is applied to remap the protocol and the ->connect test was dropped. No backport is needed, as the ->connect() test will never strike in 2.2 or below.	2020-09-16 20:11:52 +02:00
Willy Tarreau	e1c4c80441	BUG/MINOR: log: gracefully handle the "udp@" address format for log servers Commit `3835c0dcb` ("MEDIUM: udp: adds minimal proto udp support for message listeners.") introduced a problematic side effect in log server address parser: if "udp@", "udp4@" or "udp6@" prefixes a log server's address, the address is passed as-is to the log server with a non-existing family and fails like this when trying to send: [ALERT] 259/195708 (3474) : socket() failed in logger #1: Address family not supported by protocol (errno=97) The problem is that till now there was no UDP family, so logs expect an AF_INET family to be passed for UDP there. This patch manually remaps AF_CUST_UDP4 and AF_CUST_UDP6 to their "tcp" equivalent that the log server parser expects. No backport is needed.	2020-09-16 20:11:52 +02:00
William Lallemand	70bf06e5f0	BUILD: fix build with openssl < 1.0.2 since bundle removal Bundle removal broke the build with openssl version < 1.0.2. Remove the #ifdef around SSL_SOCK_KEYTYPE_NAMES.	2020-09-16 18:10:00 +02:00
William Lallemand	e7eb1fec2f	CLEANUP: ssl: remove utility functions for bundle Remove the last utility functions for handling the multi-cert bundles and remove the multi-variable from the ckch structure. With this patch, the bundles are completely removed.	2020-09-16 16:28:26 +02:00
William Lallemand	5685ccf75e	CLEANUP: ssl/cli: remove test on 'multi' variable in CLI functions The multi variable is not useful anymore since the removal of the multi-certificates bundle support. It can be removed safely from the CLI functions and suppose that every ckch contains a single certificate.	2020-09-16 16:28:26 +02:00
William Lallemand	bd8e6eda59	CLEANUP: ssl: remove test on "multi" variable in ckch functions Since the removal of the multi-certificates bundle support, this variable is not useful anymore, we can remove all tests for this variable and suppose that every ckch contains a single certificate.	2020-09-16 16:28:26 +02:00
William Lallemand	dfa93be3b5	MEDIUM: ssl: emulate multi-cert bundles loading in standard loading Like the previous commit, this one emulates the bundling by loading each certificate separately and storing it in a separate SSL_CTX. This patch does it for the standard certificate loading, which means outside directories or crt-list. The multi-certificates bundle was the common way of offering multiple certificates of different types (ecdsa and rsa) for the same SSL_CTX. This was implemented with OpenSSL 1.0.2 before the client_hello callback was available. Now that all versions which do not support this callback are deprecated (< 1.1.0), we can safely remove the support for the bundle, which was inconvenient and made the code too complex.	2020-09-16 16:28:26 +02:00
William Lallemand	47da82111d	MEDIUM: ssl: emulates the multi-cert bundles in the crtlist The multi-certificates bundle was the common way of offering multiple certificates of different types (ecdsa and rsa) for the same SSL_CTX. This was implemented with OpenSSL 1.0.2 before the client_hello callback was available. Now that all versions which do not support this callback are deprecated (< 1.1.0), we can safely remove the support for the bundle, which was inconvenient and made the code too complex. This patch emulates the bundle loading by looking for the bundle files when the specified file in the configuration does not exist. It then creates new entries in the crtlist, so they will appear as new lines if they are dumped from the CLI.	2020-09-16 16:28:26 +02:00
William Lallemand	5622c45df4	MINOR: ssl: crtlist_entry_dup() duplicates a crtlist_entry Implement crtlist_entry_dup() which allocates and duplicates a crtlist_entry structure.	2020-09-16 16:28:26 +02:00
William Lallemand	82f2d2f1d0	MINOR: ssl: crtlist_dup_ssl_conf() duplicates a ssl_bind_conf Implement the crtlist_dup_ssl_conf() which allocates and duplicates a ssl_bind_conf structure.	2020-09-16 16:28:26 +02:00
William Lallemand	95fefa1c09	MEDIUM: ssl/cli: remove support for multi certificates bundle Remove the support for multi-certificates bundle in the CLI. There is nothing to replace here, it will use the standard codepath with the "bundle emulation" in the future.	2020-09-16 16:28:26 +02:00
William Lallemand	89d3b355ad	MEDIUM: ssl: remove bundle support in crt-list and directories The multi-cert certificates bundle is the former way, implemented with openssl 1.0.2, of doing multi-certificate (RSA, ECDSA and DSA) for the same SNI host. Remove this support temporarily so it is replaced by the loading of each certificate in a separate SSL_CTX.	2020-09-16 16:28:26 +02:00
Willy Tarreau	3b139e540a	BUG/MEDIUM: log-forward: always quit on parsing errors The err_code and goto were misplaced, causing a fatal parse error to be ignored when parsing a UDP listener's address. No backport is needed.	2020-09-16 16:25:29 +02:00
Willy Tarreau	76aaa7f5b7	MEDIUM: log-forward: use "dgram-bind" instead of "bind" for the listener The use of "bind" wasn't that wise but was temporary. The problem is that it will not allow to coexist with tcp. Let's explicitly call it "dgram-bind" so that datagram listeners are expected here, leaving some room for stream listeners later. This is the only change.	2020-09-16 15:07:22 +02:00
Willy Tarreau	f9feec2813	BUG/MINOR: log-forward: fail on unknown keywords The log-forward section silently ignores junk and unknown keywords, make it fail! No backport is needed.	2020-09-16 15:04:33 +02:00
William Lallemand	0354b658f0	BUG/MINOR: ssl/crt-list: crt-list could end without a \n Since the refactoring of the crt-list, the same function is used to parse a crt-list file and a crt-list line on the CLI. The assumption that a line on the CLI and a line in a file is finished by a \n was made. However that is potentially not the case with a file which does not finish by a \n. This patch fixes issue #860 and must be backported in 2.2.	2020-09-16 11:55:09 +02:00
Olivier Houchard	a459826056	BUG/MEDIUM: ssl: Don't call ssl_sock_io_cb() directly. In the SSL code, when we were waiting for the availability of the crypto engine, once it is ready and its fd's I/O handler is called, don't call ssl_sock_io_cb() directly, instead, call tasklet_wakeup() on the ssl_sock_ctx's tasklet. We were calling ssl_sock_io_cb() with NULL as a tasklet, which used to be fine, but it is no longer true since the fd takeover changes. We could just provide the tasklet, but let's just wake the tasklet, as is done for other FDs, for fairness. This should fix github issue #856. This should be backported into 2.2.	2020-09-15 22:16:02 +02:00
Willy Tarreau	9743f709d0	BUG/MINOR: server: report correct error message for invalid port on "socks4" The socks4 keyword parser was a bit too much copy-pasted, it only checks for a null port and reports "invalid range". Let's properly check for the 1-65535 range and report the correct error. It may be backported everywhere "socks4" is present (2.0).	2020-09-15 12:00:29 +02:00
William Lallemand	2d6fd0a90d	BUG/MINOR: ssl: verifyhost is case sensitive In bug #835, @arjenzorgdoc reported that the verifyhost option on the server line is case-sensitive, that shouldn't be the case. This patch fixes the issue by replacing memcmp by strncasecmp and strcmp by strcasecmp. The patch was suggested by @arjenzorgdoc. This must be backported in all versions supporting the verifyhost option.	2020-09-14 15:20:10 +02:00
Tim Duesterhus	e52b6e5456	CLEANUP: Do not use a fixed type for 'sizeof' in 'calloc' Changes performed using the following coccinelle patch: @@ type T; expression E; expression t; @@ ( t = calloc(E, sizeof(*t)) | - t = calloc(E, sizeof(T)) + t = calloc(E, sizeof(*t)) ) Looking through the commit history, grepping for coccinelle shows that the same replacement with a different patch was already performed in the past in commit `02779b6263`.	2020-09-12 20:31:25 +02:00
Tim Duesterhus	b53dd03dc0	BUG/MINOR: Fix type passed of sizeof() for calloc() newsrv->curr_idle_thr is of type `unsigned int`, not `int`. Fix this issue by simply passing the dereferenced pointer to sizeof, which is the preferred style anyway. This bug was introduced in commit `dc2f2753e9`. It first appeared in 2.2-dev5. The patch must be backported to 2.2+. It is notable that the `calloc` call was not introduced within the commit in question. The allocation was already happening before that commit and it already looked like it does after applying the patch. Apparently the argument for the `sizeof` managed to get broken during the rearrangement that happened in that commit: for (i = 0; i < global.nbthread; i++) - MT_LIST_INIT(&newsrv->idle_orphan_conns[i]); - newsrv->curr_idle_thr = calloc(global.nbthread, sizeof(*newsrv->curr_idle_thr)); + MT_LIST_INIT(&newsrv->safe_conns[i]); + + newsrv->curr_idle_thr = calloc(global.nbthread, sizeof(int)); Even more notable is that I previously fixed that exact same allocation in commit `017484c80f`. So apparently this single line managed to get broken twice in the same way for whatever reason there might be.	2020-09-12 20:31:25 +02:00
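The "dereferenced pointer to sizeof" style mentioned in the entry above can be sketched as follows. `alloc_idle_counters()` is a hypothetical stand-in for illustration only; it does not appear in HAProxy.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-in for allocating one counter per thread. */
unsigned int *alloc_idle_counters(size_t nbthread)
{
    unsigned int *counters;

    assert(nbthread > 0);
    /* sizeof(*counters) follows the pointer's element type, so the
     * element size stays correct if that type is changed later. */
    counters = calloc(nbthread, sizeof(*counters));
    return counters;  /* NULL on allocation failure */
}
```

The caller owns the returned array and releases it with free().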
Tim Duesterhus	3943e4fc3e	MINOR: sample: Add iif(<true>,<false>) converter iif() takes a boolean as input and returns one of the two argument strings depending on whether the boolean is true. This converter most likely is most useful to return the proper scheme depending on the value returned by the `ssl_fc` fetch, e.g. for use within the `x-forwarded-proto` request header. However it can also be useful for use within a template that is sent to the client using `http-request return` with a `lf-file`. It allows the administrator to implement a simple condition, without needing to prefill variables within the regular configuration using `http-request set-var(req.foo)`.	2020-09-11 16:59:27 +02:00
Christopher Faulet	6cfc851674	BUG/MEDIUM: pattern: Renew the pattern expression revision when it is pruned It must be done to expire patterns cached in the LRU cache. Otherwise it is possible to retrieve an already freed pattern, attached to a released pattern expression. When a specific pattern is deleted (->delete() callback), the pattern expression revision is already renewed. Thus it is not affected by this bug. Only prune action on the pattern expression is concerned. In addition, for a pattern expression, in ->prune() callbacks when the pattern list is released, a missing LIST_DEL() has been added. It is not a real issue because the list is reinitialized at the end and all elements are released and should never be reused. But it is less confusing this way. This bug may be triggered when a map is cleared from the cli socket. A workaround is to set the pattern cache size (tune.pattern.cache-size) to 0 to disable it. This patch should fix the issue #844. It must be backported to all supported versions.	2020-09-11 09:54:34 +02:00
Tim Duesterhus	fc85494c99	CLEANUP: haproxy: Free post_check_list in deinit() This allocation is technically always reachable and cannot leak, but so are a few others that are freed.	2020-09-11 07:54:39 +02:00
Tim Duesterhus	f0c25d210c	CLEANUP: haproxy: Free per_thread_*_list in deinit() This allocation is technically always reachable and cannot leak, but so are a few others that are freed.	2020-09-11 07:54:39 +02:00
Tim Duesterhus	53508d6564	CLEANUP: haproxy: Free post_proxy_check_list in deinit() This allocation is technically always reachable and cannot leak, but so are a few others that are freed.	2020-09-11 07:54:39 +02:00
Tim Duesterhus	9e0c2f34dc	CLEANUP: Free old_argv on deinit This allocation technically is always reachable and cannot leak, however other global variables such as `oldpids` are already being freed. This is in an attempt to get HAProxy to a state where there are zero live allocations after a clean exit.	2020-09-11 07:54:39 +02:00
Tim Duesterhus	00f00cf8fd	BUG/MINOR: haproxy: Free uri_auth->scope during deinit Given the following example configuration: listen http bind *:80 mode http stats scope . Running a configuration check with valgrind reports: ==16341== 26 (24 direct, 2 indirect) bytes in 1 blocks are definitely lost in loss record 3 of 13 ==16341== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==16341== by 0x571C2E: stats_add_scope (uri_auth.c:296) ==16341== by 0x46CE29: cfg_parse_listen (cfgparse-listen.c:1901) ==16341== by 0x45A112: readcfgfile (cfgparse.c:2078) ==16341== by 0x50A0F5: init (haproxy.c:1828) ==16341== by 0x418248: main (haproxy.c:3012) After this patch is applied the leak is gone as expected. This is a very minor leak that can only be observed if deinit() is called, shortly before the OS will free all memory of the process anyway. No backport needed.	2020-09-11 07:54:39 +02:00
Willy Tarreau	022e5e56ed	BUILD: traces: don't pass an empty argument for missing ones It initially looked appealing to be able to call traces with ",,," for unused arguments, but tcc doesn't like empty macro arguments, and quite frankly, adding a zero between the few remaining ones is no big deal. Let's do so now.	2020-09-10 09:37:52 +02:00
Willy Tarreau	f734ebfac4	BUILD: threads: better workaround for late loading of libgcc_s Commit `77b98220e` ("BUG/MINOR: threads: work around a libgcc_s issue with chrooting") tried to address an issue with libgcc_s being loaded too late. But it turns out that the symbol used there isn't present on armhf, thus it breaks the build. Given that the issue manifests itself during pthread_exit(), the safest and most portable way to test this is to call pthread_exit(). For this we create a dummy thread which exits, during the early boot. This results in the relevant library to be loaded if needed, making sure that a later call to pthread_exit() will still work. It was tested to work fine under linux on the following platforms: glibc: - armhf - aarch64 - x86_64 - sparc64 - ppc64le musl: - mipsel Just running the code under strace easily shows the call in the dummy thread, for example here on armhf: $ strace -fe trace=file ./haproxy -v 2>&1 | grep gcc_s [pid 23055] open("/lib/libgcc_s.so.1", O_RDONLY|O_CLOEXEC) = 3 The code was isolated so that it's easy to #ifdef it out if needed. This should be backported where the patch above is backported (likely 2.0).	2020-09-09 19:10:46 +02:00
Willy Tarreau	4313d5ae98	BUG/MEDIUM: mux-h1: always apply the timeout on half-closed connections The condition in h1_refresh_timeout() seems insufficient to properly take care of the half-closed timeout, because depending on the ordering of operations when performing the last send() to a client, the stream may or may not still be there and we may fail to shrink the client timeout on our last opportunity to do so. Here we want to make sure that the timeout is always reduced when the last chunk was sent and the shutdown completed, regardless of the presence of a stream or not. This is what this patch does. This should be backported as far as 2.0, and should fix the issue reported in #541.	2020-09-08 15:49:40 +02:00
Victor Kislov	ec00251c88	BUG/MINOR: auth: report valid crypto(3) support depending on build options Since 1.8 with commit `e8692b41e` ("CLEANUP: auth: use the build options list to report its support"), crypt(3) is always reported as being supported in "haproxy -vv" because no test on USE_LIBCRYPT is made anymore when producing the output. This reintroduces the distinction between with and without USE_LIBCRYPT in the output by indicating "yes" or "no". It may be backported as far as 1.8, though the code differs due to a number of include files cleanups.	2020-09-08 14:34:04 +02:00
Christopher Faulet	b0b7607a54	MINOR: server: Improve log message sent when server address is updated When the server address is set for the first time, the log message is a bit ugly because there is no old ip address to report. Thus in the log, we can see : PX/SRV changed its IP from to A.B.C.D by DNS additional record. Now, when this happens, "(none)" is reported : PX/SRV changed its IP from (none) to A.B.C.D by DNS additional record. This patch may be backported to 2.2.	2020-09-08 10:44:57 +02:00
Christopher Faulet	d6c6b5f43b	BUG/MEDIUM: dns: Be sure to renew IP address for already known servers When a SRV record for an already known server is processed, only the weight is updated, if not configured to be ignored. It is a problem if the IP address carried by the associated additional record changes, because the server IP address is never renewed. To fix this bug, if there is an additional record attached to a SRV record, we always try to set the IP address. If it is the same, no change is performed. This way, IP changes are always handled. This patch should fix the issue #841. It must be backported to 2.2.	2020-09-08 10:44:57 +02:00
Christopher Faulet	5a89175ac8	BUG/MEDIUM: dns: Don't store additional records in a linked-list A SRV record keeps a reference on the corresponding additional record, if any. But this additional record is also inserted in a separate linked-list into the dns response. The problems arise when obsolete additional records are released. The additional records list is purged but the SRV records always reference these objects, leading to an undefined behavior. Worse, this happens very quickly because additional records are never renewed. Thus, once received, an additional record will always expire. Now, the additional records are only associated to a SRV record or simply ignored. And the last version is always used. This patch helps to fix the issue #841. It must be backported to 2.2.	2020-09-08 10:44:39 +02:00
Christopher Faulet	e720c32b78	MINOR: http-fetch: Add pathq sample fetch The pathq sample fetch extract the relative URI of a request, i.e the path with the query-string, excluding the scheme and the authority, if any. It is pretty handy to always get a relative URI independently on the HTTP version. Indeed, while relative URIs are common in HTTP/1.1, in HTTP/2, most of time clients use absolute URIs. This patch may be backported to 2.2.	2020-09-04 11:41:47 +02:00
Christopher Faulet	312294f53d	MINOR: http-rules: Add set-pathq and replace-pathq actions These actions do the same as corresponding "-path" versions except the query-string is included to the manipulated request path. This means set-pathq action replaces the path and the query-string and replace-pathq action matches and replace the path including the query-string. This patch may be backported to 2.2.	2020-09-04 11:41:46 +02:00
Christopher Faulet	1fa0cc18e1	Revert "BUG/MINOR: http-rules: Replace path and query-string in "replace-path" action" This reverts commit `4b9c0d1fc0`. Actually, the "replace-path" action is ambiguous. "set-path" action preserves the query-string. The "path" sample fetch does not contain the query-string. But "replace-path" action is documented to handle the query-string. It is probably not the expected behavior. So instead of fixing the code, we will fix the documentation to make "replace-path" action consistent with other parts of the code. In addition actions and sample fetches to handle the path with the query-string will be added. If the commit above is ever backported, this one must be as well.	2020-09-02 17:29:00 +02:00
William Lallemand	398da62c38	BUG/MINOR: startup: haproxy -s cause 100% cpu It was reported in bug #837 that haproxy -s causes a 100% CPU. However this option does not exist and haproxy must exit with the usage message. The parser was not handling the case where -s is not followed by 't' or 'f' which are the only two valid cases. This bug was introduced by df6c5a ("BUG/MEDIUM: mworker: fix the copy of options in copy_argv()") which was backported as far as 1.8. This fix must be backported as far as 1.8.	2020-09-02 16:17:14 +02:00
Willy Tarreau	e91bff2134	MAJOR: init: start all listeners via protocols and not via proxies anymore Ever since the protocols were added in 1.3.13, listeners used to be started twice: - once by start_proxies(), which iterates over all proxies then all listeners ; - once by protocol_bind_all() which iterates over all protocols then all listeners ; It's a real mess because error reporting is not even consistent, and more importantly now that some protocols do not appear in regular proxies (peers, logs), there is no way to retry their binding should it fail on the last step. What this patch does is to make sure that listeners are exclusively started by protocols. The failure to start a listener now causes the emission of an error indicating the proxy's name (as it used to be the case per proxy), and retryable failures are silently ignored during all but last attempts. The start_proxies() function was kept solely for setting the proxy's state to READY and emitting the "Proxy started" message and log that some have likely got used to seeking in their logs.	2020-09-02 11:11:43 +02:00
Willy Tarreau	576a633868	CLEANUP: protocol: remove all ->bind_all() and ->unbind_all() functions These ones were not used anymore since the two previous patches, let's drop them.	2020-09-02 10:40:33 +02:00
Willy Tarreau	ca2126230a	MINOR: protocol: do not call proto->unbind_all() anymore Similarly to previous commit about ->bind_all(), we have the same construct for ->unbind_all() which ought not to be used either. Let's make protocol_unbind_all() iterate over all listeners and directly call unbind_listener() instead. It's worth noting that for uxst there was originally a specific ->unbind_all() function but the simplifications that came over the years have resulted in a locally reimplemented version of the same function: the test on (state > LI_ASSIGNED) there is equivalent to the one on (state >= LI_PAUSED) that is used in do_unbind_listener(), and it seems these have been equivalent since at least commit `dabf2e264` ("[MAJOR] added a new state to listeners")) (1.3.14).	2020-09-02 10:39:31 +02:00
Willy Tarreau	94320859f9	MINOR: protocol: do not call proto->bind_all() anymore All protocols only iterate over their own listeners list and start the listeners using a direct call to their ->bind() function. This code duplication doesn't make sense and prevents us from centralizing the startup error handling. Worse, it's not even symmetric because there's an unbind_all_listeners() function common to all protocols without any equivalent for binding. Let's start by directly calling each protocol's bind() function from protocol_bind_all().	2020-09-02 10:19:41 +02:00
Willy Tarreau	06a1806083	BUILD: thread: limit the libgcc_s workaround to glibc only Previous commit `77b98220e` ("BUG/MINOR: threads: work around a libgcc_s issue with chrooting") broke the build on cygwin. I didn't even know we supported threads on cygwin. But the point is that it's actually the glibc-based libpthread which requires libgcc_s, so in absence of other reports we should not apply the workaround on other libraries. This should be backported along with the aforementioned patch.	2020-09-02 09:53:47 +02:00
Willy Tarreau	77b98220e8	BUG/MINOR: threads: work around a libgcc_s issue with chrooting Sander Hoentjen reported another issue related to libgcc_s in issue #671. What happens is that when the old process quits, pthread_exit() calls something from libgcc_s.so after the process was chrooted, and this is the first call to that library, causing an attempt to load it. In a chroot, this fails, thus libthread aborts. The behavior widely differs between operating systems because some decided to use a static build for this library. In 2.2 this was resolved as a side effect of a workaround for the same issue with the backtrace() call, which is also in libgcc_s. This was in commit `0214b45` ("MINOR: debug: call backtrace() once upon startup"). But backtraces are not necessarily enabled, and we need something for older versions. By inspecting a significant number of libgcc_s on various gcc versions and platforms, it appears that a few functions have been present since gcc 3.0, one of which, _Unwind_Find_FDE() has no side effect (it only returns a pointer). What this patch does is that in the thread initialization code, if built with gcc >= 3.0, a call to this function is made in order to make sure that libgcc_s is loaded at start up time and that there will be no need to load it upon exit. An easy way to check which libs are loaded under Linux is : $ strace -e trace=openat ./haproxy -v With this patch applied, libgcc_s now appears during init. Sander confirmed that this patch was enough to put an end to the core dumps on exit in 2.0, so this patch should be backported there, and maybe as far as 1.8.	2020-09-02 08:13:44 +02:00
Willy Tarreau	17254939c5	CLEANUP: http: silence a cppcheck warning in get_http_auth() In issue #777, cppcheck wrongly assumes a useless null pointer check in the expression below while it's obvious that in a 3G/1G split on 32-bit, len can become positive if p is NULL: p = memchr(ctx.value.ptr, ' ', ctx.value.len); len = p - ctx.value.ptr; if (!p || len <= 0) return 0; In addition, on 64 bits you never know given that len is a 32-bit signed int thus the sign of the result in case of a null p will always be the opposite of the 32nd bit of ctx.value.ptr. Admittedly the test is ugly. Tim proposed this fix consisting in checking for p == ctx.value.ptr instead when checking for first character only, which Ilya confirmed is enough to shut cppcheck up. No backport is needed.	2020-09-02 07:18:01 +02:00
Christopher Faulet	bde2c4c621	MINOR: http-htx: Handle an optional reason when replacing the response status When calling the http_replace_res_status() function, an optional reason may now be set. It is ignored if it points to NULL and the original reason is preserved. Only the response status is replaced. Otherwise both the status and the reason are replaced. It simplifies the API and most of time, avoids an extra call to http_replace_res_reason().	2020-09-01 10:55:36 +02:00
Christopher Faulet	4b9c0d1fc0	BUG/MINOR: http-rules: Replace path and query-string in "replace-path" action The documentation stated the "replace-path" action replaces the path, including the query-string if any is present. But in the code, only the path is replaced. The query-string is preserved. So, now, instead of relying on the same action code than "set-uri" action (1), a new action code (4) is used for "replace-path" action. In http_req_replace_stline() function, when the action code is 4, we call http_replace_req_path() setting the last argument (with_qs) to 1. This way, the query-string is not skipped but included to the path to be replaced. This patch relies on the commit `b8ce505c6` ("MINOR: http-htx: Add an option to eval query-string when the path is replaced"). Both must be backported as far as 2.0. It should fix the issue #829.	2020-09-01 10:55:29 +02:00
Christopher Faulet	b8ce505c6f	MINOR: http-htx: Add an option to eval query-string when the path is replaced The http_replace_req_path() function now takes a third argument to evaluate the query-string as part of the path or to preserve it. If <with_qs> is set, the query-string is replaced with the path. Otherwise, only the path is replaced. This patch is mandatory to fix issue #829. The next commit depends on it. So be careful during backports.	2020-09-01 10:55:14 +02:00
Christopher Faulet	7d518454bb	BUG/MEDIUM: http-ana: Don't wait to send 1xx responses received from servers When an informational response (1xx) is received, we must be sure to send it ASAP. To do so, CF_SEND_DONTWAIT flag must be set on the response channel to instruct the stream-interface to not set the CO_SFL_MSG_MORE flag on the transport layer. Otherwise the response delivery may be delayed, because of the commit `8945bb6c0` ("BUG/MEDIUM: stream-int: fix loss of CO_SFL_MSG_MORE flag in forwarding"). Note that a previous patch (`cf6898cd` ["BUG/MINOR: http-ana: Don't wait to send 1xx responses generated by HAProxy"]) add this flag on 1xx responses generated by HAProxy but not on responses coming from servers. This patch must be backported to 2.2 and may be backported as far as 1.9, for HTX part only. But this part has changed in the 2.2, so it may be a bit tricky. Note it does not fix any known bug on 2.1 and below because the CO_SFL_MSG_MORE flag is ignored by the h1 mux.	2020-08-31 11:07:08 +02:00
Willy Tarreau	1c34b881c3	BUILD: sock_unix: fix build issue with isdigit() Commit `0d06df6` ("MINOR: sock: introduce sock_inet and sock_unix") made use of isdigit() on the UNIX socket path without casting the value to unsigned char, breaking the build on cygwin and possibly other platforms. No backport is needed.	2020-08-29 06:44:37 +02:00
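The cast mentioned in this commit message matters because isdigit() takes an int whose value must be representable as an unsigned char (or be EOF); passing a plain char that happens to be negative is undefined behaviour, and some platforms warn or fail to build. A minimal sketch, where the helper name count_leading_digits() is an assumption made for the example and not a HAProxy function:

```c
#include <ctype.h>

/* Hypothetical helper (not HAProxy code): counts the decimal digits found
 * at the beginning of the string s.
 */
static int count_leading_digits(const char *s)
{
	int n = 0;

	/* The cast to unsigned char keeps the argument in the range that
	 * isdigit() accepts, even when char is signed and the byte is >= 0x80.
	 * The loop stops on the terminating NUL since it is not a digit.
	 */
	while (isdigit((unsigned char)s[n]))
		n++;

	return n;
}
```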
Willy Tarreau	9dbb6c43ce	MINOR: sock: distinguish dgram from stream types when retrieving old sockets For now we still don't retrieve dgram sockets, but the code must be able to distinguish them before we switch to receivers. This adds a new flag to the xfer_sock_list indicating that a socket is of type SOCK_DGRAM. The way to set the flag for now is by looking at the dummy address family which equals AF_CUST_UDP{4,6} in this case (given that other dgram sockets are not yet supported).	2020-08-28 19:26:39 +02:00
Willy Tarreau	a2c17877b3	MINOR: sock: do not use LI_O_* in xfer_sock_list anymore We'll want to store more info there and some info that are not represented in listener options at the moment (such as dgram vs stream) so let's get rid of these and instead use a new set of options (SOCK_XFER_OPT_*).	2020-08-28 19:26:38 +02:00
Willy Tarreau	429617459d	REORG: sock: move get_old_sockets() from haproxy.c The new function was called sock_get_old_sockets() and was left as-is except a minimum amount of style lifting to make it more readable. It will never be awesome anyway since it's used very early in the boot sequence and needs to perform socket I/O without any external help.	2020-08-28 19:24:55 +02:00
Willy Tarreau	37bafdcbb1	MINOR: sock_inet: move the IPv4/v6 transparent mode code to sock_inet This code was highly redundant, existing for TCP clients, TCP servers and UDP servers. Let's move it to sock_inet where it belongs. The new functions are sock_inet4_make_foreign() and sock_inet6_make_foreign().	2020-08-28 18:51:36 +02:00
Willy Tarreau	2d34a710b1	MINOR: sock: implement sock_find_compatible_fd() This is essentially a merge from tcp_find_compatible_fd() and uxst_find_compatible_fd() that relies on a listener's address and compare function and still checks for other variations. For AF_INET6 it compares a few of the listener's bind options. A minor change for UNIX sockets is that transparent mode, interface and namespace used to be ignored when trying to pick a previous socket while now if they are changed, the socket will not be reused. This could be refined but it's still better this way as there is no more risk of using a differently bound socket by accident. Eventually we should not pass a listener there but a set of binding parameters (address, interface, namespace etc...) which ultimately will be grouped into a receiver. For now this still doesn't exist so let's stick to the listener to break dependencies in the rest of the code.	2020-08-28 18:51:36 +02:00
Willy Tarreau	a6473ede5c	MINOR: sock: add interface and namespace length to xfer_sock_list This will ease and speed up comparisons in FD lookups.	2020-08-28 18:51:36 +02:00
Willy Tarreau	063d47d136	REORG: listener: move xfer_sock_list to sock.{c,h}. This will be used for receivers as well thus it is not specific to listeners but to sockets.	2020-08-28 18:51:36 +02:00
Willy Tarreau	e5bdc51bb5	REORG: sock_inet: move default_tcp_maxseg from proto_tcp.c Let's determine it at boot time instead of doing it on first use. It also saves us from having to keep it thread local. It's been moved to the new sock_inet_prepare() function, and the variables were renamed to sock_inet_tcp_maxseg_default and sock_inet6_tcp_maxseg_default.	2020-08-28 18:51:36 +02:00
Willy Tarreau	d88e8c06ac	REORG: sock_inet: move v6only_default from proto_tcp.c to sock_inet.c The v6only_default variable is not specific to TCP but to AF_INET6, so let's move it to the right file. It's now immediately filled on startup during the PREPARE stage so that it doesn't have to be tested each time. The variable's name was changed to sock_inet6_v6only_default.	2020-08-28 18:51:36 +02:00
Willy Tarreau	25140cc573	REORG: inet: replace tcp_is_foreign() with sock_inet_is_foreign() The function now makes it clear that it's independent on the socket type and solely relies on the address family. Note that it supports both IPv4 and IPv6 as we don't seem to need it per-family.	2020-08-28 18:51:36 +02:00
Willy Tarreau	c5a94c936b	MINOR: sock_inet: implement sock_inet_get_dst() This one is common to the TCPv4 and UDPv4 code, it retrieves the destination address of a socket, taking care of the possibility that for an incoming connection the traffic was redirected. The TCP and UDP definitions were updated to rely on it and remove duplicated code.	2020-08-28 18:51:36 +02:00
Willy Tarreau	f172558b27	MINOR: tcp/udp/unix: make use of proto->addrcmp() to compare addresses The new addrcmp() protocol member points to the function to be used to compare two addresses of the same family. When picking an FD from a previous process, we can now use the address specific address comparison functions instead of having to rely on a local implementation. This will help move that code to a more central place.	2020-08-28 18:51:36 +02:00
Willy Tarreau	0d06df6448	MINOR: sock: introduce sock_inet and sock_unix These files will regroup everything specific to AF_INET, AF_INET6 and AF_UNIX socket definitions and address management. Some code there might be agnostic to the socket type and could later move to af_xxxx.c but for now we only support regular sockets so no need to go too far. The files are quite poor at this step, they only contain the address comparison function for each address family.	2020-08-28 18:51:36 +02:00
Willy Tarreau	18b7df7a2b	REORG: sock: start to move some generic socket code to sock.c The new file sock.c will contain generic code for standard sockets relying on file descriptors. We currently have way too much duplication between proto_uxst, proto_tcp, proto_sockpair and proto_udp. For now only get_src, get_dst and sock_create_server_socket were moved, and are used where appropriate.	2020-08-28 18:51:36 +02:00
Willy Tarreau	1318034317	REORG: unix: move UNIX bind/server keywords from proto_uxst.c to cfgparse-unix.c Let's finish the cleanup and get rid of all bind and server keywords parsers from proto_uxst.c. They're now moved to cfgparse-unix.c. Now proto_uxst.c is clean and only contains code related to binding and connecting.	2020-08-28 18:51:36 +02:00
Willy Tarreau	de70ca5dfd	REORG: tcp: move TCP bind/server keywords from proto_tcp.c to cfgparse-tcp.c Let's continue the cleanup and get rid of all bind and server keywords parsers from proto_tcp.c. They're now moved to cfgparse-tcp.c, just as was done for ssl before 2.2 release. Nothing has changed beyond this. Now proto_tcp.c is clean and only contains code related to binding and connecting.	2020-08-28 18:51:36 +02:00
Willy Tarreau	8987e7a8c9	REORG: tcp: move TCP sample fetches from proto_tcp.c to tcp_sample.c Let's continue the cleanup and get rid of all sample fetch functions from proto_tcp.c. They're now moved to tcp_sample.c, just as was done for ssl before 2.2 release. Nothing has changed beyond this.	2020-08-28 18:51:36 +02:00
Willy Tarreau	478331dd93	CLEANUP: tcp: stop exporting smp_fetch_src() This is totally ugly, smp_fetch_src() is exported only so that stick_table.c can (ab)use it in the {sc,src}_* sample fetch functions. It could be argued that the sample could have been reconstructed there in place, but we don't even need to duplicate the code. We'd rather simply retrieve the "src" fetch's function from where it's used at init time and be done with it.	2020-08-28 18:51:36 +02:00
Willy Tarreau	aeae66cf22	REORG: tcp: move TCP actions from proto_tcp.c to tcp_act.c The file proto_tcp.c has become a real mess because it still contains tons of definitions that have nothing to do with the TCP protocol setup. This commit moves the ruleset actions "set-src-port", "set-dst-port", "set-src", "set-dst", and "silent-drop" to a new file "tcp_act.c". Nothing has changed beyond this.	2020-08-28 18:51:36 +02:00
Willy Tarreau	febbce87ba	BUG/MINOR: reload: do not fail when no socket is sent get_old_sockets() mistakenly sets ret=0 instead of ret2=0 before leaving when the old process announces zero FD. So it will return an error instead of success. This must be particularly rare not to have a single socket to offer though! A few comments were added to make it more obvious what to expect in return. This must be backported to 1.8 since the bug has always been there.	2020-08-28 18:45:01 +02:00
Willy Tarreau	b5a1f9e495	MEDIUM: reload: pass all exportable FDs, not just listeners Now we don't limit ourselves to listeners found in proxies nor peers anymore, we're instead scanning all known FDs for those marked with ".exported=1". Just doing so has significantly simplified the code, and will later allow to yield while sending FDs if desired. When it comes to retrieving a possible namespace name or interface name, for now this is only performed on listeners since these are the only ones carrying such info. Once this moves somewhere else, we'll be able to also pass these info for UDP receivers for example, with only tiny changes.	2020-08-26 18:33:52 +02:00
Willy Tarreau	bb1caff70f	MINOR: fd: add a new "exported" flag and use it for all regular listeners This new flag will be used to mark FDs that must be passed to any future process across the CLI's "_getsocks" command. The scheme here is quite complex and full of special cases: - FDs inherited from parent processes are not exported this way, as they are supposed to instead be passed by the master process itself across reloads. However such FDs ought never to be paused otherwise this would disrupt the socket in the parent process as well; - FDs resulting from a "bind" performed over a socket pair, which are in fact one side of a socket pair passed inside another control socket pair must not be passed either. Since all of them are used the same way, for now it's enough never to put this "exported" flag to FDs bound by the socketpair code. - FDs belonging to temporary listeners (e.g. a passive FTP data port) must not be passed either. Fortunately we don't have such FDs yet. - the rest of the listeners for now are made of TCP, UNIX stream, ABNS sockets and are exportable, so they get the flag. - UDP listeners were wrongly created as listeners and are not suitable here. Their FDs should be passed but for now they are not since the client doesn't even distinguish the SO_TYPE of the retrieved sockets. In addition, it's important to keep in mind that: - inherited FDs may never be closed in master process but may be closed in worker processes if the service is shut down (useless since still bound, but technically possible) ; - inherited FDs may not be disabled ; - exported FDs may be disabled because the caller will perform the subsequent listen() on them. However that might not work for all OSes - exported FDs may be closed, it just means the service was shut down from the worker, and will be rebound in the new process. This implies that we have to disable exported on close(). 
=> as such, contrary to an apparently obvious equivalence, the "exported" status doesn't imply anything regarding the ability to close a listener's FD or not.	2020-08-26 18:33:52 +02:00
Willy Tarreau	63d8b6009b	CLEANUP: fd: remove fd_remove() and rename fd_dodelete() to fd_delete() This essentially undoes what we did in fd.c in 1.8 to support seamless reload. Since we don't need to remove an fd anymore we can turn fd_delete() to the simple function it used to be.	2020-08-26 18:33:52 +02:00
Willy Tarreau	67672459c7	MEDIUM: fd: replace usages of fd_remove() with fd_stop_both() We used to require fd_remove() to remove an FD from a poller when we still had the FD cache and it was not possible to directly act on the pollers. Nowadays we don't need this anymore as the pollers will automatically unregister disabled FDs. The fd_remove() hack is particularly problematic because it additionally hides the FD from the known FD list and could make one think it's closed. It's used at two places: - with the async SSL engine - with the listeners (when unbinding from an fd for another process) Let's just use fd_stop_both() instead, which will propagate down the stack to do the right thing, without removing the FD from the array of known ones. Now when dumping FDs using "show fd" on a process which still knows some of the other workers' FDs, the FD will properly be listed with a listener state equal to "ZOM" for "zombie". This guarantees that the FD is still known and will properly be passed using _getsocks().	2020-08-26 18:33:52 +02:00
William Lallemand	a78f3f0d79	BUG/MEDIUM: ssl: fix ssl_bind_conf double free w/ wildcards The fix `7df5c2d` ("BUG/MEDIUM: ssl: fix ssl_bind_conf double free") was not complete. The problem still occurs when using wildcards in certificate, during the deinit. This patch removes the free of the ssl_conf structure in ssl_sock_free_all_ctx() since it's already done in the crtlist deinit. It must be backported in 2.2.	2020-08-26 17:39:23 +02:00
Willy Tarreau	cf1f193624	MEDIUM: reload: stop passing listener options along with FDs During a reload operation, we used to send listener options associated with each passed file descriptor. These were passed as binary contents for the size of the "options" field in the struct listener. This means that any flag value change or field size change would be problematic, the former failing to properly grab certain options, the latter possibly causing permanent failures during this operation. Since these two previous commits: MINOR: reload: determine the foreing binding status from the socket BUG/MINOR: reload: detect the OS's v6only status before choosing an old socket we don't need this anymore as the values are determined from the file descriptor itself. Let's just turn the previous 32 bits to vestigial space, send them as zeroes and ignore them on receipt. The only possible side effect is if someone would want to roll back from a 2.3 to 2.2 or earlier, such options might be ignored during this reload. But other forthcoming changes might make this fail as well anyway so that's not a reason for keeping this behavior.	2020-08-26 11:04:33 +02:00
Willy Tarreau	bf3b06b03d	MINOR: reload: determine the foreing binding status from the socket Let's not look at the listener options passed by the original process and determine from the socket itself whether it is configured for transparent mode or not. This is cleaner and safer, and doesn't rely on flag values that could possibly change between versions.	2020-08-26 10:33:02 +02:00
Willy Tarreau	bca5a4e0a8	BUG/MINOR: reload: detect the OS's v6only status before choosing an old socket The v4v6 and v6only options are passed as data during the socket transfer between processes so that the new process can decide whether it wants to reuse a socket or not. But this actually misses one point: if no such option is set and the OS defaults are changed between the reloads, then the socket will still be inherited and will never be rebound using the new options. This can be seen by starting the following config: global stats socket /tmp/haproxy.sock level admin expose-fd listeners frontend testme bind :::1234 timeout client 2000ms Having a look at the OS settings, v6only is disabled: $ cat /proc/sys/net/ipv6/bindv6only 0 A first check shows it's indeed bound to v4 and v6: $ ss -an -6|grep 1234 tcp LISTEN 0 2035 :1234 :* Reloading the process doesn't change anything (which is expected). Now let's set bindv6only: $ echo 1 | sudo tee /proc/sys/net/ipv6/bindv6only 1 $ cat /proc/sys/net/ipv6/bindv6only 1 Reloading gives the same state: $ ss -an -6|grep 1234 tcp LISTEN 0 2035 :1234 :* However a restart properly shows a correct bind: $ ss -an -6|grep 1234 tcp LISTEN 0 2035 [::]:1234 [::]:* This one doesn't change once bindv6only is reset, for the same reason. This patch attacks this problem differently. Instead of passing the two options at once for each listening fd, it ignores the options and reads the socket's current state for the IPV6_V6ONLY flag and sets it only. Then before looking for a compatible FD, it checks the OS's defaults before deciding which of the v4v6 and v6only needs to be kept on the listener. And the selection is only made on this. First, it addresses this issue. Second, it also ensures that if such options are changed between reloads to identical states, the socket can still be inherited. For example adding v4v6 when bindv6only is not set will allow the socket to still be usable. 
Third, it avoids an undesired dependency on the LI_O_* bit values between processes across a reload (for these ones at least). It might make sense to backport this to some recent stable versions, but quite frankly the likelihood that anyone will ever notice it is extremely faint.	2020-08-26 10:32:51 +02:00
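Reading the IPV6_V6ONLY state back from a socket, as this commit message describes, can be done with getsockopt(). The following is only a sketch of that idea under POSIX assumptions, not the code from the patch; the helper name sock_is_v6only() is hypothetical.

```c
#include <sys/socket.h>
#include <netinet/in.h>

/* Hypothetical helper (not HAProxy code): returns 1 if the socket only
 * accepts IPv6 traffic, 0 if it also accepts IPv4-mapped traffic, and -1
 * if the option cannot be read (e.g. invalid FD or non-IPv6 socket).
 */
static int sock_is_v6only(int fd)
{
	int val = 0;
	socklen_t len = sizeof(val);

	/* Ask the kernel for the socket's current state instead of trusting
	 * option flags passed by another process.
	 */
	if (getsockopt(fd, IPPROTO_IPV6, IPV6_V6ONLY, &val, &len) < 0)
		return -1;

	return val != 0;
}
```

Querying the socket itself means the result reflects what the kernel actually applied at bind time, whatever the configuration or OS defaults say at reload time.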
Willy Tarreau	bbb284d675	MINOR: tcp: don't try to set/clear v6only on inherited sockets If a socket was already bound (inherited from a parent or retrieved from a previous process), there's no point trying to change its IPV6_V6ONLY state since it will fail. This is visible in strace as an EINVAL during a reload when passing FDs.	2020-08-26 10:26:42 +02:00
Shimi Gersner	adabbfe5a4	MINOR: ssl: Support SAN extension for certificate generation The use of Common Name is fading out in favor of the RFC recommended way of using SAN extensions. For example, Chrome from version 58 will only match server name against SAN. The following patch adds SAN extension by default to all generated certificates. The SAN extension will be of type DNS and based on the server name.	2020-08-25 16:36:06 +02:00
Shimi Gersner	5846c490ce	MEDIUM: ssl: Support certificate chaining for certificate generation haproxy supports generating SSL certificates based on SNI using a provided CA signing certificate. Because CA certificates may be signed by multiple CAs, in some scenarios, it is necessary for the server to attach the trust chain in addition to the generated certificate. The following patch adds the ability to serve the entire trust chain with the generated certificate. The chain is loaded from the provided `ca-sign-file` PEM file.	2020-08-25 16:36:06 +02:00
Willy Tarreau	6ce0232a78	BUILD: task: work around a bogus warning in gcc 4.7/4.8 at -O1 As reported in issue #816, when building task.o at -O1 with gcc 4.7 or 4.8, we get the following warning: CC src/task.o In file included from include/haproxy/proxy.h:31:0, from include/haproxy/cfgparse.h:27, from src/task.c:19: src/task.c: In function 'next_timer_expiry': include/haproxy/ticks.h:121:10: warning: 'key' may be used uninitialized in this function [-Wmaybe-uninitialized] src/task.c:349:2: note: 'key' was declared here It is wrong since the condition to use 'key' is exactly the same as the one used to set it. This warning disappears at -O2 and disappeared from gcc 5 and above. Let's just initialize 'key' there, it only adds 16 bytes of code and remains cheap enough for this function. This should be backported to 2.2.	2020-08-21 05:54:00 +02:00
Willy Tarreau	3005306a71	BUILD: tools: include auxv a bit later As reported in https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=24745, haproxy fails to build with TARGET=generic and without extra options due to auxv.h not being included, since the __GLIBC__ macro is not yet defined. Let's include it after other libc headers so that the __GLIBC__ definition is known. Thanks to David and Tim for the diag. This should be backported to 2.2.	2020-08-20 16:41:55 +02:00
zurikus	6d59993cb8	MINOR: stats: prevent favicon.ico requests for stats page The HAProxy stats page doesn't have a favicon.ico, but browsers always request it. This leads to errors during stats page requests: Aug 18 08:46:41 somehost.example.net haproxy[1521534]: X.X.X.X:61403 [18/Aug/2020:08:46:41.437] stats stats/ -1/-1/-1/-1/0 503 222 - - SC-- 2/2/0/0/0 0/0 "GET /favicon.ico HTTP/1.1" Aug 18 08:46:42 somehost.example.net haproxy[1521534]: X.X.X.X:61403 [18/Aug/2020:08:46:42.650] stats stats/ -1/-1/-1/-1/0 503 222 - - SC-- 2/2/0/0/0 0/0 "GET /favicon.ico HTTP/1.1" The provided patch disables favicon.ico requests for the HAProxy stats page.	2020-08-19 11:29:57 +02:00
Tim Duesterhus	ff4d86becd	MINOR: cache: Reject duplicate cache names Using a duplicate cache name most likely is the result of a misgenerated configuration. There is no good reason to allow this, as the duplicate caches can't be referred to. This commit resolves GitHub issue #820. It can be argued whether this is a fix for a bug or not. I'm erring on the side of caution and marking this as a "new feature". It can be considered for backporting to 2.2, but for other branches the risk of accidentally breaking some working (but non-ideal) configuration might be too large.	2020-08-18 22:51:24 +02:00
Tim Duesterhus	ea969f6f26	DOC: cache: Use '<name>' instead of '<id>' in error message When the cache name is left out in 'filter cache' the error message refers to a missing '<id>'. The name of the cache is called 'name' within the docs. Adjust the error message for consistency. The error message was introduced in `99a17a2d91`. This commit first appeared in 1.9, thus the patch must be backported to 1.9+.	2020-08-18 22:51:24 +02:00
Tim Duesterhus	f92afb732b	MEDIUM: cfgparse: Emit hard error on truncated lines As announced within the emitted log message this is going to become a hard error in 2.3. It's 2.3 time now, let's do this. see `2fd5bdb439`	2020-08-18 22:51:24 +02:00
William Lallemand	30f9e095f5	BUG/MEDIUM: ssl: crt-list negative filters don't work The negative filters which are supposed to exclude a SNI from a wildcard, never worked. Indeed the negative filters were skipped in the code. To fix the issue, this patch looks for negative filters that are on the same line as the wildcard that just matched. This patch should fix issue #818. It must be backported in 2.2. The problem also exists in versions > 1.8 but the infrastructure required to fix this was only introduced in 2.1. In older versions we should probably change the documentation to state that negative filters are useless.	2020-08-17 14:57:00 +02:00
Thierry Fournier	77016daabe	MINOR: hlua: Add error message relative to the Channel manipulation and HTTP mode When the developer tries to manipulate HAProxy channels in HTTP mode, an error is thrown without explanation. This patch adds an explanation.	2020-08-17 12:50:43 +02:00
William Lallemand	5b1d1f6e0f	CLEANUP: ssl: remove poorly readable nested ternary Replace a four level nested ternary expression by an if/else expression in ssl_sock_switchctx_cbk()	2020-08-14 15:47:48 +02:00
William Lallemand	94bd319b26	BUG/MEDIUM: ssl: does not look for all SNIs before chosing a certificate In bug #810, the SNI are not matched correctly, indeed when trying to match a certificate type in ssl_sock_switchctx_cbk() not all SNIs were looked up correctly. In the case you have in a crt-list: wildcard.subdomain.domain.tld.pem.rsa *.subdomain.domain.tld record.subdomain.domain.tld record.subdomain.domain.tld.pem.ecdsa record.subdomain.domain.tld another-record.subdomain.domain.tld If the client only supports RSA and requests "another-record.subdomain.domain.tld", HAProxy will find the single ECDSA certificate and won't try to look up for a wildcard RSA certificate. This patch fixes the code so we look for all single and wildcard before choosing the certificate type. This bug was introduced by commit `3777e3a` ("BUG/MINOR: ssl: certificate choice can be unexpected with openssl >= 1.1.1"). It must be backported as far as 1.8 once it is heavily tested.	2020-08-14 15:47:48 +02:00
David Carlier	7adf8f35df	OPTIM: regex: PCRE2 use JIT match when JIT optimisation occured. When a regex has been successfully compiled by the JIT pass, it is better to use the related match function, which thankfully has the same signature, for better performance. Signed-off-by: David Carlier <devnexen@gmail.com>	2020-08-14 07:53:40 +02:00
William Lallemand	935d8294d5	BUG/MEDIUM: ssl: never generates the chain from the verify store In bug #781 it was reported that HAProxy completes the certificate chain using the verify store in the case there is no chain. Indeed, according to OpenSSL documentation, when generating the chain, OpenSSL uses the chain store OR the verify store in the case there is no chain store. As a workaround, this patch always puts a NULL chain in the SSL_CTX so OpenSSL does not try to complete it. This must be backported in all branches, the code could be different, the important part is to ALWAYS set a chain, and use sk_X509_new_null() if the chain is NULL.	2020-08-12 20:10:50 +02:00
Willy Tarreau	a6d9879e69	BUG/MEDIUM: htx: smp_prefetch_htx() must always validate the direction It is possible to process a channel based on desynchronized info if a request fetch is called from a response and conversely. However, the code in smp_prefetch_htx() already makes sure the analysis has already started before trying to fetch from a buffer, so the problem effectively lies in response rules making use of request expressions only. Usually it's not a problem as extracted data are checked against the current HTTP state, except when it comes to the start line, which is usually accessed directly from sample fetch functions such as status, path, url, url32, query and so on. In this case, trying to access the request buffer from the response path will lead to unpredictable results. When building with DEBUG_STRICT, a process violating these rules will simply die after emitting: FATAL: bug condition "htx->first == -1" matched at src/http_htx.c:67 But when this is not enabled, it may or may not crash depending on what the pending request buffer data look like when trying to spot a start line there. This is typically what happens in issue #806. This patch adds a test in smp_prefetch_htx() so that it does not try to parse an HTX buffer in a channel belonging to the wrong direction. There's one special case on the "method" sample fetch since it can retrieve info even without a buffer, from the other direction, as long as the method is one of the well known ones. Three, we call smp_prefetch_htx() only if needed. This was reported in 2.0 and must be backported there (oldest stable version with HTX).	2020-08-12 15:15:05 +02:00
William Lallemand	e3a5f84e53	BUG/MINOR: ssl: double free w/ smp_fetch_ssl_x_chain_der() smp_fetch_ssl_x_chain_der() uses the SSL_get_peer_cert_chain() which does not increment the refcount of the chain, so it should not be free'd. The bug was introduced by `a598b50` ("MINOR: ssl: add ssl_{c,s}_chain_der fetch methods"). No backport needed.	2020-08-11 11:18:46 +02:00
Willy Tarreau	7b52485f1a	BUG/MINOR: stats: use strncmp() instead of memcmp() on health states The reports for health states are checked using memcmp() in order to only focus on the first word and possibly ignore trailing %d/%d etc. This makes gcc unhappy about a potential use of "" as the string, which never happens since the string is always set. This resulted in commit `c4e6460f6` ("MINOR: build: Disable -Wstringop-overflow.") to silence these messages. However some lengths are incorrect (though cannot cause trouble), and in the end strncmp() is just safer and cleaner. This can be backported to all stable branches as it will shut a warning with gcc 8 and above.	2020-08-11 10:26:36 +02:00
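The difference between the two calls can be illustrated with a prefix test on a status string. This is a sketch only: the helper name status_starts_with() and the sample strings are assumptions made for the example, not the code from the stats module.

```c
#include <string.h>

/* Hypothetical helper (not HAProxy code): returns non-zero if the status
 * string begins with the given word, ignoring anything that follows
 * (such as a trailing "%d/%d" counter).
 */
static int status_starts_with(const char *status, const char *word)
{
	/* strncmp() stops at the first NUL byte, so a status shorter than
	 * the word (even "") is never read past its end. memcmp() with the
	 * same length would read strlen(word) bytes from both buffers
	 * regardless of their contents.
	 */
	return strncmp(status, word, strlen(word)) == 0;
}
```

For instance, status_starts_with("UP 1/3", "UP") is true, while status_starts_with("", "DOWN") is false without any out-of-bounds read.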
William Lallemand	9a1d839f61	BUG/MINOR: ssl: ssl-skip-self-issued-ca requires >= 1.0.2 The previous fix for ssl-skip-self-issued-ca requires the use of SSL_CTX_build_cert_chain() which is only available starting from OpenSSL 1.0.2	2020-08-10 17:31:10 +02:00
William Lallemand	bf298afe2d	BUG/MEDIUM: ssl: fix the ssl-skip-self-issued-ca option In commit `f187ce6`, the ssl-skip-self-issued-ca option was accidentally made useless by reverting the SSL_CTX reworking. The previous attempt of making this feature was putting each certificate of the chain in the SSL_CTX with SSL_CTX_add_extra_chain_cert() and was skipping the Root CA. The problem here is that doing it this way instead of doing a SSL_CTX_set1_chain() breaks the support of multi-certificate bundles. The SSL_CTX_build_cert_chain() function allows one to remove the Root CA with the SSL_BUILD_CHAIN_FLAG_NO_ROOT flag. Use it instead of doing tricks with the CA. Should fix issue #804. Must be backported in 2.2.	2020-08-10 17:08:54 +02:00
William Dauchy	477757c66b	CLEANUP: fix all duplicated semicolons trivial commit, does not change the code behaviour Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-08-10 08:49:38 +02:00
William Dauchy	a598b500b4	MINOR: ssl: add ssl_{c,s}_chain_der fetch methods Following work from Arjen and Mathilde, it adds ssl_{c,s}_chain_der methods; it returns DER encoded certs from SSL_get_peer_cert_chain Also update existing vtc tests to add random intermediate certificates When getting the result through this header: http-response add-header x-ssl-chain-der %[ssl_c_chain_der,hex] One can parse it with any lib accepting ASN.1 DER data, such as in go: bin, err := encoding/hex.DecodeString(cert) certs_parsed, err := x509.ParseCertificates(bin) Cc: Arjen Nienhuis <arjen@zorgdoc.nl> Signed-off-by: Mathilde Gilles <m.gilles@criteo.com> Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-08-07 15:38:40 +02:00
William Dauchy	98c35045aa	CLEANUP: ssl: ssl_sock_crt2der semicolon and spaces trivial commit, does not change the code behaviour Signed-off-by: William Dauchy <w.dauchy@criteo.com>	2020-08-07 15:38:40 +02:00
William Lallemand	efc5a9d55b	BUG/MINOR: snapshots: leak of snapshots on deinit() Free the snapshots on deinit() when they were initialized in a proxy upon an error. This was introduced by `c55015e` ("MEDIUM: snapshots: dynamically allocate the snapshots"). Should be backported as far as 1.9.	2020-08-07 14:55:33 +02:00
Christopher Faulet	6ad7df423b	MINOR: arg: Use chunk_destroy() to release string arguments This way, all fields of the buffer structure are reset when a string argument (ARGT_STR) is released. It is also a good way to explicitly specify this kind of argument is a chunk. So .data and .size fields must be set. This patch may be backported to ease backports.	2020-08-07 14:27:54 +02:00
Christopher Faulet	fd2e906084	MINOR: lua: Add support for regex as fetches and converters arguments It means now regsub() converter is now exported to the lua. Map converters based on regex are not available because the map arguments are not supported.	2020-08-07 14:27:54 +02:00
Christopher Faulet	d25d926806	MINOR: lua: Add support for userlist as fetches and converters arguments It means now http_auth() and http_auth_group() sample fetches are now exported to the lua.	2020-08-07 14:27:54 +02:00
Christopher Faulet	05e2d77441	MEDIUM: lua: Don't filter exported fetches and converters Thanks to previous commits, it is now safe to use from lua the sample fetches and sample converters that convert arguments, especially the strings (ARGT_STR). So now, they are all exported to the lua. They were filtered on the validation functions. Only fetches without validation functions or with val_hdr or val_payload_lv functions were exported, and converters without validation functions. This patch depends on following commits : * `aec27ef44` "BUG/MINOR: lua: Duplicate lua strings in sample fetches/converters arg array" * `fdea1b631` "MINOR: hlua: Don't needlessly copy lua strings in trash during args validation" It must be backported as far as 2.1 because the date() and http_date() converters are no longer exported because of the filter on the validation function, since the commit `ae6f125c7` ("MINOR: sample: add us/ms support to date/http_date)".	2020-08-07 14:27:37 +02:00
Christopher Faulet	aec27ef443	BUG/MINOR: lua: Duplicate lua strings in sample fetches/converters arg array Strings in the argument array used by sample fetches and converters must be duplicated. This is mandatory because, during the arguments validations, these strings may be converted and released. It works this way during the configuration parsing and there is no reason to adapt this behavior during the runtime when a sample fetch or a sample converter is called from the lua. In fact, there is a reason to not change the behavior. It must remain simple for everyone to add new fetches or converters. Thus, lua strings are duplicated. It is only performed at the end of the hlua_lua2arg_check() function, if the argument is still an ARGT_STR. Of course, it requires a cleanup loop after the call or when an error is triggered. This patch depends on following commits: * `959171376` "BUG/MINOR: arg: Fix leaks during arguments validation for fetches/converters" * `fdea1b631` "MINOR: hlua: Don't needlessly copy lua strings in trash during args validation" It may be backported to all supported versions, most probably as far as 2.1 only.	2020-08-07 14:26:35 +02:00
Christopher Faulet	fdea1b6319	MINOR: hlua: Don't needlessly copy lua strings in trash during args validation Lua strings are NULL terminated. So in the hlua_lua2arg_check() function, used to check arguments against the sample fetches specification, there is no reason to copy these strings in a trash to add a terminating null byte. In addition, when the array of arguments is built from lua values, we must take care to count this terminating null byte in the size of the buffer where a string is stored. The same must be done when a sample is built from a lua value. This patch may be backported to ease backports.	2020-08-07 14:25:31 +02:00
Christopher Faulet	e663a6e326	BUG/MINOR: lua: Check argument type to convert it to IP mask in arg validation In hlua_lua2arg_check() function, before converting an argument to an IPv4 or IPv6 mask, we must be sure to have an integer or a string argument (ARGT_SINT or ARGT_STR). This patch must be backported to all supported versions.	2020-08-07 14:25:31 +02:00
Christopher Faulet	8e09ac8592	BUG/MINOR: lua: Check argument type to convert it to IPv4/IPv6 arg validation In hlua_lua2arg_check() function, before converting a string to an IP address, we must be sure to have a string argument (ARGT_STR). This patch must be backported to all supported versions.	2020-08-07 14:25:31 +02:00
Christopher Faulet	959171376f	BUG/MINOR: arg: Fix leaks during arguments validation for fetches/converters Some sample fetches or sample converters use validation functions for their arguments. In these functions, string arguments (ARGT_STR) may be converted to another type (for instance a regex, a variable or an integer). Because these strings are allocated when the argument list is built, they must be freed after a conversion. Most of the time, it is done. But not always. This patch fixes these minor memory leaks (only on a few strings, during the configuration parsing). This patch may be backported to all supported versions, most probably as far as 2.1 only. If this commit is backported, the previous one `73292e9e6` ("BUG/MINOR: lua: Duplicate map name to load it when a new Map object is created") must also be backported. Note that some validation functions do not exist on old versions. It should be easy to resolve conflicts.	2020-08-07 14:25:21 +02:00
Christopher Faulet	73292e9e66	BUG/MINOR: lua: Duplicate map name to load it when a new Map object is created When a new map is created, the sample_load_map() function is called. To do so, an argument array is created with the name as first argument. Because it is a lua string, owned by the lua, it must be duplicated. The sample_load_map() function will convert this argument to a map. In theory, after the conversion, it must release the original string. It is not performed for now and it is a bug that will be fixed in the next commit. This patch may be backported to all supported versions, most probably as far as 2.1 only. But it must be backported with the next commit "BUG/MINOR: arg: Fix leaks during arguments validation for fetches/converters".	2020-08-07 14:24:30 +02:00
Christopher Faulet	b45bf8eb70	BUG/MINOR: converters: Store the sink in an arg pointer for debug() converter The debug() converter uses a string to reference the sink where to send debug events. During the configuration parsing, this string is converted to a sink object but it is still stored as a string argument. It is a problem on deinit because string arguments are released. So the sink pointer will be released twice. To fix the bug, we keep a reference on the sink using an ARGT_PTR argument. This way, it will not be freed on the deinit. This patch depends on the commit `e02fc4d0d` ("MINOR: arg: Add an argument type to keep a reference on opaque data"). Both must be backported as far as 2.1.	2020-08-07 14:24:21 +02:00
Christopher Faulet	0eb967d122	BUG/MEDIUM: map/lua: Return an error if a map is loaded during runtime In sample_load_map() function, the global mode is now tested to be sure to be in the starting mode. If not, an error is returned. At first glance, this patch may seem useless because maps are loaded during the configuration parsing. But in fact, it is possible to load a map from the lua, using Map:new() method. And, there is nothing to forbid to call this method at runtime, during a script execution. It must never be done because it may perform a filesystem access for unknown maps or allocation for known ones. So at runtime, it means a blocking call or a memory leak. Note it is still possible to load a map from the lua, but in the global part of a script only. This part is executed during the configuration parsing. This patch must be backported in all stable versions.	2020-08-07 14:18:27 +02:00
William Lallemand	76b4a12591	BUG/MEDIUM: ssl: memory leak of ocsp data at SSL_CTX_free() This bug affects all versions of HAProxy since the OCSP data is not freed in the deinit(), but leaking on exit() is not really an issue. However, when doing dynamic update of certificates over the CLI, those data are not free'd upon the free of the SSL_CTX. 3 leaks are happening, the first leak is the one of the ocsp_arg structure which serves the purpose of containing the pointers in the case of a multi-certificate bundle. The second leak is the one of the ocsp struct. And the third leak is the one of the struct buffer in the ocsp_struct. The problem lies with SSL_CTX_set_tlsext_status_arg() which does not provide a way to free the argument upon an SSL_CTX_free(). This fix uses ex index functions instead of registering a tlsext_status_arg(). This is really convenient because it allows to register a free callback which will free the ex index content upon a SSL_CTX_free(). A refcount was also added to the ocsp_response structure since it is stored in a tree and can be reused in another SSL_CTX. Should fix part of the issue #746. This must be backported in 2.2 and 2.1.	2020-08-07 01:14:31 +02:00
William Lallemand	86e4d63316	BUG/MINOR: ssl: fix memory leak at OCSP loading Fix a memory leak when loading an OCSP file when the file was already loaded elsewhere in the configuration. Indeed, if the OCSP file already exists, a useless chunk_dup() will be done during the load. To fix it we revert "ocsp" to "iocsp" like it was done previously. This was introduced by commit `246c024` ("MINOR: ssl: load the ocsp in/from the ckch"). Should fix part of the issue #746. It must be backported in 2.1 and 2.2.	2020-08-07 01:14:31 +02:00
Baptiste Assmann	87138c3524	BUG/MAJOR: dns: disabled servers through SRV records never recover A regression was introduced by `13a9232ebc` when I added support for Additional section of the SRV responses. Basically, when a server is managed through SRV records additional section and it's disabled (because its associated Additional record has disappeared), it never leaves its MAINT state and so never comes back to production. This patch updates the "snr_update_srv_status()" function to clear the MAINT status when the server now has an IP address and also ensure this function is called when parsing Additional records (and associating them to new servers). This can cause severe outage for people using HAProxy + consul (or any other service registry) through DNS service discovery. This should fix issue #793. This should be backported to 2.2.	2020-08-05 21:48:23 +02:00
Baptiste Assmann	cde83033d0	CLEANUP: dns: typo in reported error message "record" instead of "recrd". This should be backported to 2.2.	2020-08-05 21:47:32 +02:00
Christopher Faulet	7a145d6823	BUG/MEDIUM: mux-h1: Refresh H1 connection timeout after a synchronous send The H1 multiplexer is able to perform synchronous send. When a large body is transferred, if nothing is received and if no error or shutdown occurs, it is possible to not go down at the H1 connection level to do I/O for a long time. When this happens, we must still take care to refresh the H1 connection timeout. Otherwise it is possible to hit the connection timeout during the transfer while it should not expire. This bug exists because only h1_process() refreshes the H1 connection timeout. To fix the bug, h1_snd_buf() must also refresh this timeout. To make things more readable, a dedicated function has been introduced and called to refresh the timeout. This bug exists on all HTX versions. But it is harder to hit it on 2.1 and below because when a H1 mux is initialized, we actively try to read data instead of subscribing for receiving. So there is at least one call to h1_process(). This patch should fix the issue #790. It must be backported as far as 2.0.	2020-08-05 14:29:06 +02:00
William Lallemand	a560c06af7	BUG/MEDIUM: ssl: check OCSP calloc in ssl_sock_load_ocsp() Check the return of the calloc in ssl_sock_load_ocsp() which could lead to a NULL dereference. This was introduced by commit `be2774d` ("MEDIUM: ssl: Added support for Multi-Cert OCSP Stapling"). Could be backported as far as 1.7.	2020-07-31 11:51:20 +02:00
Ilya Shipitsin	6b79f38a7a	CLEANUP: assorted typo fixes in the code and comments This is 12th iteration of typo fixes	2020-07-31 11:18:07 +02:00
Willy Tarreau	f5ea3a8c58	MINOR: mux-h1: do not try to receive on backend before sending a request There's no point trying to perform a recv() on a back connection if we have a stream before having sent a request, as it's expected to fail. It's likely that this may avoid some spurious subscribe() calls in some keep-alive cases (the close case was already addressed at the connection level by "MINOR: connection: avoid a useless recvfrom() on outgoing connections").	2020-07-31 09:30:12 +02:00
Willy Tarreau	2febb846a4	MINOR: mux-h1: do not even try to receive if the connection is not fully set up If the connection is still waiting for L4/L6, there's no point even trying to receive as it will fail, so better return zero in h1_recv_allowed().	2020-07-31 09:30:12 +02:00
Willy Tarreau	8dbd1a2e09	MINOR: connection: avoid a useless recvfrom() on outgoing connections When a connect() doesn't immediately succeed (i.e. most of the times), fd_cant_send() is called to enable polling. But given that we don't mark that we cannot receive either, we end up performing a failed recvfrom() immediately when the connect() is finally confirmed, as indicated in issue #253. This patch simply adds fd_cant_recv() as well so that we're only notified once the recv path is ready. The reason it was not there is purely historic, as in the past when there was the fd cache, doing it would have caused a pending recv request to be placed into the fd cache, hence a useless recvfrom() upon success (i.e. what happens now). Without this patch, forwarding 100k connections does this: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 17.51 0.704229 7 100000 100000 connect 16.75 0.673875 3 200000 sendto 16.24 0.653222 3 200036 close 10.82 0.435082 1 300000 100000 recvfrom 10.37 0.417266 1 300012 setsockopt 7.12 0.286511 1 199954 epoll_ctl 6.80 0.273447 2 100000 shutdown 5.34 0.214942 2 100005 socket 4.65 0.187137 1 105002 5002 accept4 3.35 0.134757 1 100004 fcntl 0.61 0.024585 4 5858 epoll_wait With the patch: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 18.04 0.697365 6 100000 100000 connect 17.40 0.672471 3 200000 sendto 17.03 0.658134 3 200036 close 10.57 0.408459 1 300012 setsockopt 7.69 0.297270 1 200000 recvfrom 7.32 0.282934 1 199922 epoll_ctl 7.09 0.274027 2 100000 shutdown 5.59 0.216041 2 100005 socket 4.87 0.188352 1 104697 4697 accept4 3.35 0.129641 1 100004 fcntl 0.65 0.024959 4 5337 1 epoll_wait Note the total disappearance of 1/3 of failed recvfrom() without adding any extra syscall anywhere else. 
The trace of an HTTP health check is now totally clean, with no useless syscall at all anymore: 09:14:21.959255 connect(9, {sa_family=AF_INET, sin_port=htons(8000), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) 09:14:21.959292 epoll_ctl(4, EPOLL_CTL_ADD, 9, {EPOLLIN\|EPOLLOUT\|EPOLLRDHUP, {u32=9, u64=9}}) = 0 09:14:21.959315 epoll_wait(4, [{EPOLLOUT, {u32=9, u64=9}}], 200, 1000) = 1 09:14:21.959376 sendto(9, "OPTIONS / HTTP/1.0\r\ncontent-leng"..., 41, MSG_DONTWAIT\|MSG_NOSIGNAL, NULL, 0) = 41 09:14:21.959436 epoll_wait(4, [{EPOLLOUT, {u32=9, u64=9}}], 200, 1000) = 1 09:14:21.959456 epoll_ctl(4, EPOLL_CTL_MOD, 9, {EPOLLIN\|EPOLLRDHUP, {u32=9, u64=9}}) = 0 09:14:21.959512 epoll_wait(4, [{EPOLLIN\|EPOLLRDHUP, {u32=9, u64=9}}], 200, 1000) = 1 09:14:21.959548 recvfrom(9, "HTTP/1.0 200\r\nContent-length: 0\r"..., 16320, 0, NULL, NULL) = 126 09:14:21.959570 close(9) = 0 With the edge-triggered poller, it gets even better: 09:29:15.776201 connect(9, {sa_family=AF_INET, sin_port=htons(8000), sin_addr=inet_addr("127.0.0.1")}, 16) = -1 EINPROGRESS (Operation now in progress) 09:29:15.776256 epoll_ctl(4, EPOLL_CTL_ADD, 9, {EPOLLIN\|EPOLLOUT\|EPOLLRDHUP\|EPOLLET, {u32=9, u64=9}}) = 0 09:29:15.776287 epoll_wait(4, [{EPOLLOUT, {u32=9, u64=9}}], 200, 1000) = 1 09:29:15.776320 sendto(9, "OPTIONS / HTTP/1.0\r\ncontent-leng"..., 41, MSG_DONTWAIT\|MSG_NOSIGNAL, NULL, 0) = 41 09:29:15.776374 epoll_wait(4, [{EPOLLIN\|EPOLLOUT\|EPOLLRDHUP, {u32=9, u64=9}}], 200, 1000) = 1 09:29:15.776406 recvfrom(9, "HTTP/1.0 200\r\nContent-length: 0\r"..., 16320, 0, NULL, NULL) = 126 09:29:15.776434 close(9) = 0 It could make sense to backport this patch to 2.2 and maybe 2.1 after it has been sufficiently checked for absence of side effects in 2.3-dev, as some people had reported an extra overhead like in issue #168.	2020-07-31 09:29:36 +02:00
Willy Tarreau	8e979fa74f	BUG/MEDIUM: tcp-checks: always attach the transport before installing the mux Similarly to the issue described in commit "BUG/MEDIUM: backend: always attach the transport before installing the mux", in tcpcheck_eval_connect() we can install a handshake transport layer underneath the mux and replace its subscriptions, causing a crash if the mux had already subscribed for whatever reason. A simple reproducer consists in adding fd_cant_recv() after fd_cant_send() in tcp_connect_server() and running it on this config, as discussed in issue listen foo bind :8181 mode http option httpchk server srv1 127.0.0.1:8888 send-proxy-v2 check inter 1000 The mux may only be installed after xprt_handshake is set up, so that it registers against it and not against raw_sock or ssl_sock. This needs to be backported to 2.2 which is the first version using muxes for checks.	2020-07-31 08:49:31 +02:00
Willy Tarreau	a3b17563e1	BUG/MEDIUM: backend: always attach the transport before installing the mux In connect_server(), we can enter in a stupid situation: - conn_install_mux_be() is called to install the mux. This one subscribes for receiving and quits ; - then we discover that a handshake is required on the connection (e.g. send-proxy), so xprt_add_hs() is called and subscribes as well. - we crash in conn_subscribe() which gets a different subscriber. And if BUG_ON is disabled, we'd likely lose one event. Note that it doesn't seem to happen by default, but definitely does if connect() rightfully performs fd_cant_recv(), so it's a matter of who does what and in what order. A simple reproducer consists in adding fd_cant_recv() after fd_cant_send() in tcp_connect_server() and running it on this config, as discussed in issue listen foo bind :8181 mode http server srv1 127.0.0.1:8888 send-proxy-v2 The root cause is that xprt_add_hs() installs an xprt layer underneath the mux without taking over its subscriptions. Ideally if we want to support this, we'd need to steal the connection's wait_events and replace them by new ones. But there doesn't seem to be any case where we're interested in doing this so better simply always install the transport layer before installing the mux, that's safer and simpler. This needs to be backported to 2.1 which is constructed the same way and thus suffers from the same issue, though the code is slightly different there.	2020-07-31 08:47:58 +02:00
Christopher Faulet	2361fd9487	BUG/MINOR: lua: Fix a possible null pointer deref on lua ctx This bug was introduced by the commit `8f587ea3` ("MEDIUM: lua: Set the analyse expiration date with smaller wake_time only"). At the end of hlua_action(), the lua context may be null if the alloc failed. No backport needed, this is 2.3-dev.	2020-07-30 10:40:59 +02:00
Christopher Faulet	e96993b1f2	MINOR: stream-int: Be sure to have a mux to do sends and receives In si_cs_send() and si_cs_recv(), we explicitly test the connection's mux is defined to proceed. For si_cs_recv(), it is probably a bit overkill. But opportunistic sends are possible from the moment the server connection is created. So it is safer to do this test. This patch may be backported as far as 1.9 if necessary.	2020-07-30 09:39:20 +02:00
Christopher Faulet	b4de420472	MINOR: connection: Preinstall the mux for non-ssl connect In the connect_server() function, there is an optim to install the mux as soon as possible. It is possible if we can determine the mux to use from the configuration only. For instance if the mux is explicitly specified or if no ALPN is set. This patch adds a new condition to preinstall the mux for non-ssl connection. In this case, by default, we always use the mux_pt for raw connections and the mux-h1 for HTTP ones. This patch is related to the issue #762. It may be backported to 2.2 (and possibly as far as 1.9 if necessary).	2020-07-30 09:31:09 +02:00
Christopher Faulet	3f5bcd0c96	BUG/MEDIUM: connection: Be sure to always install a mux for sync connect Sometimes, a server connection may be performed synchronously. Most of the time on TCP socket, it does not happen. It is easier to have sync connect with unix socket. When it happens, if we are not waiting for any handshake completion, we must be sure to have a mux installed before leaving the connect_server() function because an attempt to send may be done before the I/O connection handler has a chance to be executed to install the mux, if not already done. For now, it is not expected to perform a send with no mux installed, leading to a crash if it happens. This patch should fix the issue #762 and probably #779 too. It must be backported as far as 1.9.	2020-07-30 09:31:09 +02:00
Christopher Faulet	8f587ea347	MEDIUM: lua: Set the analyse expiration date with smaller wake_time only If a lua action yields for any reason and if the wake timeout is set, it only override the analyse expiration date if it is smaller. This way, a lower inspect-delay will be respected, if any.	2020-07-30 09:31:09 +02:00
Christopher Faulet	2747fbb7ac	MEDIUM: tcp-rules: Use a dedicated expiration date for tcp ruleset A dedicated expiration date is now used to apply the inspect-delay of the tcp-request or tcp-response rulesets. Before, the analyse expiration date was used but it may also be updated by the lua (at least). So a lua script may extend or reduce the inspect-delay by side effect. This is not expected. If it becomes necessary, a specific function will be added to do this. Because, for now, it is a bit confusing.	2020-07-30 09:31:09 +02:00
Christopher Faulet	54f3e183c8	BUG/MINOR: tcp-rules: Set the inspect-delay when a tcp-response action yields On a tcp-response content ruleset evaluation, the inspect-delay is engaged when rule's conditions are not validated but not when the rule's action yields. This patch must be backported to all supported versions.	2020-07-30 09:31:09 +02:00
Christopher Faulet	19dbf2d625	BUG/MINOR: tcp-rules: Preserve the right filter analyser on content eval abort When a tcp-request or a tcp-response content ruleset evaluation is aborted, the corresponding FLT_END analyser must be preserved, if any. But because of a typo error, on the tcp-response content ruleset evaluation, we try to preserve the request analyser instead of the response one. This patch must be backported to 2.2.	2020-07-30 09:31:09 +02:00
Christopher Faulet	99aaca99b5	MINOR: tcp-rules: Return an internal error if an action yields on a final eval On a final evaluation of a tcp-request or tcp-response content ruleset, it is forbidden for an action to yield. To quickly identify bugs an internal error is now returned if it happens and a warning log message is emitted.	2020-07-30 09:31:09 +02:00
Christopher Faulet	498c483009	BUG/MINOR: lua: Abort execution of actions that yield on a final evaluation A Lua action may yield. It may happen because the action returns explicitly act.YIELD or because the script itself yields. In the first case, we must abort the script execution if it is the final rule evaluation, i.e if the ACT_OPT_FINAL flag is set. The second case is already covered. This patch must be backported to 2.2.	2020-07-30 09:31:09 +02:00
Christopher Faulet	385101e538	BUG/MEDIUM: dns: Don't yield in do-resolve action on a final evaluation When an action is evaluated, flags are passed to know if it is the first call (ACT_OPT_FIRST) and if it must be the last one (ACT_OPT_FINAL). For the do-resolve DNS action, the ACT_OPT_FINAL flag must be handled because the action may yield. It must never yield when this flag is set. Otherwise, it may lead to a wakeup loop of the stream because the inspected-delay of a tcp-request content ruleset was reached without stopping the rules evaluation. This patch is related to the issue #222. It must be backported as far as 2.0.	2020-07-30 09:31:09 +02:00
Christopher Faulet	08ed98fd79	MEDIUM: lua: Add support for the Lua 5.4 On Lua 5.4, some API changes make HAProxy compilation fail. Among other things, the lua_resume() function has changed and now takes an extra argument in Lua 5.4 and the error LUA_ERRGCMM was removed. Thus the LUA_VERSION_NUM macro is now tested to know which lua version is used and adapt the code accordingly. Here are listed the incompatibilities with the previous Lua versions : http://www.lua.org/manual/5.4/manual.html#8 This patch comes from the HAproxy's fedora RPM, committed by Tom Callaway : https://src.fedoraproject.org/rpms/haproxy/blob/db970613/f/haproxy-2.2.0-lua-5.4.patch This patch should fix the issue #730. It must be backported to 2.2 and probably as far as 2.0.	2020-07-30 09:31:09 +02:00
Jerome Magnin	4002f8dc03	BUG/MAJOR: dns: don't treat Authority records as an error Support for DNS Service Discovery by means of SRV records was enhanced with commit `13a9232eb` ("MEDIUM: dns: use Additional records from SRV responses") to use the content of the answers Additional records when present. If there are Authority records before the Additional records we mistakenly treat that as an invalid response. To fix this, just ignore the Authority section if it exists and skip to the Additional records. As `13a9232eb` was introduced during 2.2-dev, it must be backported to 2.2. This is a fix for issue #778	2020-07-29 12:06:59 +02:00
Jerome Magnin	012261ab34	BUG/MAJOR: dns: fix null pointer dereference in snr_update_srv_status Since commit `13a9232eb` ("MEDIUM: dns: use Additional records from SRV responses"), a struct server can have a NULL dns_requester->resolution, when SRV records are used and DNS answers contain an Additional section. This is a problem when we call snr_update_srv_status() because it does not check that resolution is NULL, and dereferences it. This patch simply adds a test for resolution being NULL. When that happens, it means we are using SRV records with Additional records, and an entry was removed. This should fix issue #775. This should be backported to 2.2.	2020-07-29 12:05:55 +02:00
Christopher Faulet	471425f51d	BUG/MINOR: debug: Don't dump the lua stack if it is not initialized When the watchdog is fired because of the lua, the stack of the corresponding lua context is dumped. But we must be sure the lua context is fully initialized to do so. If we are blocked on the global lua lock, during the lua context initialization, the lua stack may be NULL. This patch should fix the issue #776. It must be backported as far as 2.0.	2020-07-27 09:37:18 +02:00
Jackie Tapia	749f74c622	DOC: Use gender neutral language This patch updates the documentation files and code comments to avoid the use of gender specific phrasing in favor of "they" or "it".	2020-07-26 22:35:43 +02:00
Baruch Siach	e1651b2970	BUILD: tools: fix build with static only toolchains uClibc toolchains built with no dynamic library support don't provide the dlfcn.h header. That leads to build failure: CC src/tools.o src/tools.c:15:10: fatal error: dlfcn.h: No such file or directory #include <dlfcn.h> ^~~~~~~~~ Enable dladdr on Linux platforms only when USE_DL is defined. This should be backported wherever `109201fc5` ("BUILD: tools: rely on __ELF__ not USE_DL to enable use of dladdr()") is backported (currently only 2.2 and 2.1).	2020-07-24 13:55:40 +02:00
Christopher Faulet	0f17a4444e	BUG/MINOR: mux-fcgi: Don't url-decode the QUERY_STRING parameter anymore In the CGI/1.1 specification, it is specified the QUERY_STRING must not be url-decoded. However, this parameter is sent decoded because it is extracted after the URI's path decoding. Now, the query-string is first extracted, then the script part of the path is url-decoded. This way, the QUERY_STRING parameter is no longer decoded. This patch should fix the issue #769. It must be backported as far as 2.1.	2020-07-23 15:44:39 +02:00
Willy Tarreau	f1ea47d896	BUG/MINOR: htx: add two missing HTX_FL_EOI and remove an unexpected one A workaround for some difficulties encountered to anticipate end of messages was addressed by commit `810df0614` ("MEDIUM: htx: Add a flag on a HTX message when no more data are expected"), but there were 3 issues in it (with minor impact): - the flag was mistakenly set before an EOH in Lua, which would only cause incomplete packets to be emitted for now but could cause truncated responses in the future. It's not needed to add it on the next EOM block as http_forward_proxy_resp() already does it. - one was still missing in hlua_applet_http_fct(), possibly causing delays on Lua services - one was missing in the Prometheus exporter. All this simply shows that this mechanism is still quite fragile and not trivial to use, especially in order to deal with the impossibility to append the EOM, so we'll need to improve the solution in the future and future backports should not be completely ruled out. This fix must be backported where the patch above is backported, typically 2.1 and later as it was required for a set of fixes.	2020-07-23 06:53:27 +02:00
Willy Tarreau	963f701f4f	CLEANUP: dns: remove 45 "return" statements from dns_validate_dns_response() The previous leak on do-resolve was particularly tricky to check due to the important code repetition in dns_validate_dns_response() which required careful examination of all return statements to check whether they needed a pool_free() or not. Let's clean all this up using a common leave point which releases the element itself. This also encourages to properly set the current response to null right after freeing or adding it so that it doesn't get added. 45 return and 22 pool_free() were replaced by one of each.	2020-07-22 17:09:20 +02:00
Christopher Faulet	2151cdd38c	BUG/MEDIUM: http-ana: Only set CF_EXPECT_MORE flag on data filtering This flag is set by HTTP analyzers to notify that more data are expected. It is used to know if the CO_SFL_MSG_MORE flag must be set on the connection when data are sent. Historically, it was set on chunked messages and on compressed responses. But in HTX, the chunked messages are parsed by the H1 multiplexer. So for this case, the infinite forwarding is enabled and the flag must no longer be set. For the compression, the test must be extended and be applied on all data filters. Thus it is also true for the request channel. So, now, CF_EXPECT_MORE flag is set on a request or a response channel if there is at least one data filter attached to the stream. In addition, the flag is removed when the HTTP message analysis is finished. This patch should partially fix the issue #756. It must be backported to 2.1.	2020-07-22 16:46:37 +02:00
Christopher Faulet	9e3dc8305b	BUG/MEDIUM: stream-int: Don't set MSG_MORE flag if no more data are expected In HTX, if the HTX_FL_EOI message is set on the message, we don't set the CO_SFL_MSG_MORE flag on the connection. This way, the send is not delayed if only the EOM is missing in the HTX message. This patch depends on the commit "MEDIUM: htx: Add a flag on a HTX message when no more data are expected". This patch should partially fix the issue #756. It must be backported to 2.1. For earlier versions, CO_SFL_MSG_MORE is ignored by HTX muxes.	2020-07-22 16:43:32 +02:00
Christopher Faulet	810df06145	MEDIUM: htx: Add a flag on a HTX message when no more data are expected The HTX_FL_EOI flag must now be set on a HTX message when no more data are expected. Most of time, it must be set before adding the EOM block. Thus, if there is no space for the EOM, there is still an information to know all data were received and pushed in the HTX message. There is only an exception for the HTTP replies (deny, return...). For these messages, the flag is set after all blocks are pushed in the message, including the EOM block, because, on error, we remove all inserted data.	2020-07-22 16:43:32 +02:00
Christopher Faulet	010ab35a91	BUG/MEDIUM: dns: Release answer items when a DNS resolution is freed When a DNS resolution is freed, the remaining items in .ar_list and .answer_list are also released. It must be done to avoid a memory leak. And it is the last chance to release these objects. I've honestly no idea if there is a better place to release them earlier. But at least, there is no more leak. This patch should solve the issue #222. It must be backported, at least, as far as 2.0, and probably, with caution, as far as 1.8 or 1.7.	2020-07-22 16:43:07 +02:00
Christopher Faulet	5098a08c2f	BUG/MAJOR: dns: Make the do-resolve action thread-safe The do-resolve HTTP action, performing a DNS resolution of a sample expression output, is not thread-safe at all. The resolver object used to do the resolution must be locked when the action is executed or when the stream is released because its curr or wait resolution lists and the requester list inside a resolution are updated. It is also important to not wake up a released stream (with a destroyed task). Of course, because of this bug, various kinds of crashes may be observed. This patch should fix the issue #236. It must be backported as far as 2.0.	2020-07-22 14:59:22 +02:00
Willy Tarreau	e5d79bccc0	MINOR: tasks/debug: add a few BUG_ON() to detect use of wrong timer queue This aims at catching calls to task_unlink_wq() performed by the wrong thread based on the shared status for the task, as well as calls to __task_queue() with the wrong timer queue being used based on the task's capabilities. This will at least help eliminate some hypotheses during debugging sessions when suspecting that a wrong thread has attempted to queue a task at the wrong place.	2020-07-22 14:42:52 +02:00
Willy Tarreau	783afbe93b	BUG/MAJOR: tasks: don't requeue global tasks into the local queue A bug was introduced by commit `77015abe0` ("MEDIUM: tasks: clean up the front side of the wait queue in wake_expired_tasks()"): front tasks that are not yet expired were incorrectly requeued into the local wait queue instead of the global one. Because of this, the same task could be found by the same thread on next invocation and be unlinked without locking, allowing another thread to requeue it in parallel, and conversely another thread could unlink it while the task was being walked over, causing all sorts of crashes and endless loops in wake_expired_tasks() and affiliates. This bug can easily be triggered by stressing the do_resolve action in multi-thread (after applying the fixes required to get do_resolve to work with threads). It certainly is the cause of issue #758. This must be backported to 2.2 only.	2020-07-22 14:12:45 +02:00
Emeric Brun	d3db3846c5	BUG/MEDIUM: resolve: fix init resolving for ring and peers section. Reported github issue #759 shows there is no name resolving on server lines for ring and peers sections. This patch introduces the resolving for those lines. This patch adds a boolean parameter to the parse_server function to specify whether we want the function to perform an initial name resolving using libc. This boolean is forced to true in case of peers or ring section. The boolean is kept to false in case of classic servers (from backend/listen). This patch should be backported in branches where peers sections support 'server' lines.	2020-07-21 17:59:20 +02:00
Willy Tarreau	75fd2ff83a	BUG/MEDIUM: arg: empty args list must be dropped Before commit `80b53ffb1` ("MEDIUM: arg: make make_arg_list() stop after its own arguments"), consumers of arguments would measure the length of the string between the first opening and closing parenthesis before calling make_arg_list(), and this latter one would detect an empty string early by len==0 and would not allocate an argument list. Since that commit, this has changed a bit because the argument parser is now the one in charge of delimiting the argument string, so the early test cannot be used anymore. But the argument list is still allocated, and despite the number of arguments being returned, consumers do not necessarily rely on it but instead they rely on the non-null arg_p pointer that used to be allocated only if at least one argument was present. But as it's now always allocated, the first argument always carries the first argument's type with an empty value, which confuses all functions that take a unique optional argument (such as uuid()). The proper long term solution would be to always use the returned argument count, but at least we can make sure the function always returns an empty argument list when fed with an empty set of parentheses, as it always used to do. This is what this patch does. This fix must be backported to 2.2 and fixes github issue #763. Thanks to Luke Seelenbinder for reporting the problem.	2020-07-21 15:53:54 +02:00
Willy Tarreau	9b9c174e60	BUILD: config: fix again bugs gcc warnings on calloc Since commit `ad37c7ab` ("BUILD: config: address build warning on raspbian+rpi4") gcc 7.3.0 complains again on x86_64 (while 8.2.0 does not) : src/cfgparse.c: In function 'check_config_validity': src/cfgparse.c:3593:26: warning: argument 1 range [18446744071562067968, 18446744073709551615] exceeds maximum object size 9223372036854775807 [-Walloc-size-larger-than=] newsrv->idle_conns = calloc(global.nbthread, sizeof(*newsrv->idle_conns)); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ This thing is completely bogus (actually the RPi one was the most wrong). Let's try to shut them both by using an unsigned short for the cast which is expected to satisfy everyone. It's worth noting that the exact same call a few lines above and below do not trigger this stupid warning. This should be backported to 2.2 since the fix above was put there already.	2020-07-17 15:04:53 +02:00
Willy Tarreau	273aea479d	BUG/MAJOR: tasks: make sure to always lock the shared wait queue if needed In run_tasks_from_task_list() we may free some tasks that have been killed. Before doing so we unlink them from the wait queue. But if such a task is in the global wait queue, the queue isn't locked so this can result in corrupting the global task list and causing loops or crashes. It's very likely one cause of issue #758. This must be backported to 2.2. For 2.1 there doesn't seem to be any case where a task could be freed this way while in the global queue, but it doesn't cost much to apply the same change (the code is in process_runnable_task there).	2020-07-17 14:37:51 +02:00
Willy Tarreau	ad37c7ab25	BUILD: config: address build warning on raspbian+rpi4 Issue #747 reports that building on raspbian for rpi4 triggers this warning: src/cfgparse.c: In function 'check_config_validity': src/cfgparse.c:3584:26: warning: argument 1 range [2147483648, 4294967295] exceeds maximum object size 2147483647 [-Walloc-size-larger-than=] newsrv->idle_conns = calloc((unsigned)global.nbthread, sizeof(*newsrv->idle_conns)); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ It's surprising because the declared type is size_t and the argument is unsigned (i.e. the same type on 32-bit) precisely to avoid cast issues, but gcc seems to be too smart at this one and to issue a warning over the valid range, implying that passing the originally required type would also warn. Given that these are the only casts in calloc and other ones don't complain, let's drop them. All 3 were added by commit `dc2f2753e` ("MEDIUM: servers: Split the connections into idle, safe, and available.") that went into 2.2, so this should be backported.	2020-07-17 14:18:36 +02:00
Christopher Faulet	f706a794d8	BUG/MEDIUM: channel: Be aware of SHUTW_NOW flag when output data are peeked The CF_SHUTW_NOW flag must be handled the same way as the CF_SHUTW flag in co_getblk_nc() and co_getline_nc() functions. It is especially important when we try to peek a line from outgoing data. In this case, an unfinished line is blocked and nothing is peeked if the CF_SHUTW_NOW flag is set. But the blocked data prevent the transition to CF_SHUTW. The above functions are only used by LUA cosockets. Because of this bug, we may experience wakeups in loop of the cosocket's io handler if we try to read a line on a closed socket with a pending unfinished line (no LF found at the end). This patch should fix issue #744. It must be backported to all supported versions.	2020-07-17 10:11:34 +02:00
Willy Tarreau	2d067f93fb	BUG/MEDIUM: server: fix possibly uninitialized state file on close Previous fix `dc6e8a9a7` ("BUG/MEDIUM: server: resolve state file handle leak on reload") traded a bug for another one, now we get this warning when building server.c, which is valid since f is not necessarily initialized (e.g. if no global state file is passed): src/server.c: In function 'apply_server_state': src/server.c:3272:3: warning: 'f' may be used uninitialized in this function [-Wmaybe-uninitialized] fclose(f); ^~~~~~~~~ Let's initialize it first. This whole code block should really be split, cleaned up and reorganized as it's possible that other similar bugs are hidden in it. This must be backported to the same branches the commit above is backported to (likely 2.2 and 2.1).	2020-07-16 06:44:04 +02:00
Ilya Shipitsin	dc6e8a9a7b	BUG/MEDIUM: server: resolve state file handle leak on reload During reload, the server state file is read and the file handle is not released. This was independently reported in #738 and #660. Partially resolves #660. This should be backported to 2.2 and 2.1.	2020-07-16 04:41:32 +02:00
Harris Kaufmann	b605a736b0	BUG/MEDIUM: fcgi-app: fix memory leak in fcgi_flt_http_headers When the loop is continued early, the memory for param_rule is not freed. This can leak memory per request, which will eventually consume all available memory on the server. This patch should fix the issue #750. It must be backported as far as 2.1.	2020-07-15 20:23:29 +02:00
Emeric Brun	45c457a629	MINOR: log: adds counters on received syslog messages. This patch adds a global counter of received syslog messages and this one is exported on CLI "show info" as "CumRecvLogs". This patch also updates internal conn counter and freq of the listener and the proxy for each received log message to prepare a further export on the "show stats".	2020-07-15 17:50:12 +02:00
Emeric Brun	12941c82d0	MEDIUM: log: adds log forwarding section. Log forwarding: It is possible to declare one or multiple log forwarding sections, haproxy will forward all received log messages to a list of log servers. log-forward <name> Creates a new log forwarder proxy identified as <name>. bind <addr> [param*] Used to configure a log udp listener to receive messages to forward. Only udp listeners are allowed, address must be prefixed using 'udp@', 'udp4@' or 'udp6@'. This supports all "bind" parameters found in 5.1 paragraph but most of them are irrelevant for udp/syslog case. log global log <address> [len <length>] [format <format>] [sample <ranges>:<smp_size>] <facility> [<level> [<minlevel>]] Used to configure target log servers. See more details on proxies documentation. If no format specified, haproxy tries to keep the incoming log format. Configured facility is ignored, except if incoming message does not present a facility but one is mandatory on the outgoing format. If there is no timestamp available in the input format, but the field exists in output format, haproxy will use the local date. Example: global log stderr format iso local7 ring myring description "My local buffer" format rfc5424 maxlen 1200 size 32764 timeout connect 5s timeout server 10s # syslog tcp server server mysyslogsrv 127.0.0.1:514 log-proto octet-count log-forward sylog-loadb bind udp4@127.0.0.1:1514 # all messages on stderr log global # all messages on local tcp syslog server log ring@myring local0 # load balance messages on 4 udp syslog servers log 127.0.0.1:10001 sample 1:4 local0 log 127.0.0.1:10002 sample 2:4 local0 log 127.0.0.1:10003 sample 3:4 local0 log 127.0.0.1:10004 sample 4:4 local0	2020-07-15 17:50:12 +02:00
Emeric Brun	54932b4408	MINOR: log: adds syslog udp message handler and parsing. This patch introduces a new fd handler used to parse syslog messages on udp. The parsing function returns level, facility and metadata that can be immediately reused to forward the message to a log server. This handler is enabled on udp listeners if the proxy is internally set to mode PR_MODE_SYSLOG	2020-07-15 17:50:12 +02:00
Emeric Brun	546488559a	MEDIUM: log/sink: re-work and merge of build message API. This patch merges build message code between sink and log and introduces a new API based on struct ist array to prepare message header with zero copy, targeting the log forwarding feature. Log formats 'iso' and 'timed' are now available on log lines. A new log format 'priority' is also added.	2020-07-15 17:50:12 +02:00
Emeric Brun	3835c0dcb5	MEDIUM: udp: adds minimal proto udp support for message listeners. This patch introduces proto_udp.c targeting a further support of log forwarding feature. This code was originally produced by Frederic Lecaille working on QUIC support and only minimal requirements for syslog support have been merged.	2020-07-15 17:50:12 +02:00
Emeric Brun	2f4cc28e0f	BUG/MEDIUM: log: issue mixing sampled to not sampled log servers. A boolean was mistakenly declared 'static THREAD_LOCAL', causing the probe of a log to a 'not sampled' log server to be conditioned by the last evaluated 'sampled log' server test on the same thread. This results in unpredictable drops of logs on 'not sampled' log servers as soon as a 'sampled' log server is declared. This patch removes the static THREAD_LOCAL attribute from this boolean, fixing the issue and allowing to mix 'sampled' and 'not sampled' servers. This fix should be backported in any branch which includes the log sampling feature.	2020-07-15 17:50:12 +02:00
Willy Tarreau	dc2ac81c41	BUG/MINOR: backend: fix potential null deref on srv_conn Commit `08016ab82` ("MEDIUM: connection: Add private connections synchronously in session server list") introduced a build warning about a potential null dereference which is actually true: in case a reuse fails and we fail to allocate a new connection, we could crash. The issue was already present earlier but the compiler couldn't detect it since it was guarded by an independent condition. This should be carefully backported to older versions (at least 2.2 and maybe 2.1), the change consists in only adding a test on srv_conn. The whole sequence of "if" blocks is ugly there and would deserve being cleaned up so that the !srv_conn condition is matched ASAP and the assignment is done later. This would remove complicated conditions.	2020-07-15 17:46:32 +02:00
Christopher Faulet	3b3096ede1	BUG/MINOR: mux-fcgi: Set flags on the right stream field for empty FCGI_STDOUT In fcgi_strm_handle_empty_stdout(), the FCGI_SF_ES_RCVD flag is set on "->state" stream field instead of "->flags". It is obviously wrong. This bug is not noticeable because the right state is set in the fcgi_process_demux() function a bit later. This patch must be backported as far as 2.1.	2020-07-15 16:04:51 +02:00
Christopher Faulet	6c99d3baea	BUG/MINOR: mux-fcgi: Set conn state to RECORD_P when skipping the record padding When the padding of a "stream" record (STDOUT or STDERR) is skipped, we must set the connection state to RECORD_P. It is especially important if the padding is not fully received. This patch must be backported as far as 2.1.	2020-07-15 15:55:55 +02:00
Christopher Faulet	7f85433a91	BUG/MINOR: mux-fcgi: Handle empty STDERR record As mentioned in the FastCGI specification, FCGI "streams" are series of non-empty stream records (length != 0), followed by an empty one. It is properly handled for FCGI_STDOUT records, but not for FCGI_STDERR ones. If an empty FCGI_STDERR record is received, the connection is blocked waiting for data which will never come. To fix the bug, when an empty FCGI_STDERR record is received, we drop it, eating the padding if any. This patch should fix the issue #743. It must be backported as far as 2.1.	2020-07-15 15:46:31 +02:00
Christopher Faulet	1bea865811	MINOR: backend: Add sample fetches to get the server's weight The following sample fetches have been added : * srv_iweight : returns the initial server's weight * srv_uweight : returns the user-visible server's weight * srv_weight : returns the current (or effective) server's weight The requested server must be passed as argument, optionally preceded by the backend name. For instance : srv_weight(back-http/www1)	2020-07-15 14:08:14 +02:00
Christopher Faulet	aaa70852d9	MINOR: raw_sock: Report the number of bytes emitted using the splicing In the continuity of the commit `7cf0e4517` ("MINOR: raw_sock: report global traffic statistics"), we are now able to report the global number of bytes emitted using the splicing. It can be retrieved in "show info" output on the CLI. Note this counter is always declared, regardless of the splicing support. This eases the integration with monitoring tools plugged on the CLI.	2020-07-15 14:08:14 +02:00
Christopher Faulet	23021ad7f1	BUG/MEDIUM: mux-h1: Continue to process request when switching in tunnel mode When input data are processed, if the request is switched in tunnel mode on a protocol upgrade, we must continue the processing. Otherwise, pending input data will only be processed on the next wakeup, that is, when new input data are received, or on a timeout expiration or shutdown. Worse, if the input buffer is full when it happens, only a timeout or a shutdown will unblock the situation. This patch should fix the issue #737. It must be backported as far as 1.9. The bug does not seem to affect the 2.0 and 1.9 because, on a protocol upgrade, the request is switched in tunnel mode when the response is sent to the client. But the bug is present, so the backport remains necessary.	2020-07-15 14:08:14 +02:00
Christopher Faulet	3d52f0f1f8	MINOR: server: Factorize code to deal with reuse of server idle connections The srv_use_idle_conn() function is now responsible to update the server counters and the connection flags when an idle connection is reused. The same function is called when a new connection is created. This simplifies a bit the connect_server() function.	2020-07-15 14:08:14 +02:00
Christopher Faulet	236c93b108	MINOR: connection: Set the conncetion target during its initialisation When a new connection is created, its target is always set just after. So the connection target may be set when it is created instead, during its initialisation to be precise. It is the purpose of this patch. Now, conn_new() function is called with the connection target as parameter. The target is then passed to conn_init(). It means the target must be passed when cs_new() is called. In this case, the target is only used when the conn-stream is created with no connection. This only happens for tcpchecks for now.	2020-07-15 14:08:14 +02:00
Christopher Faulet	fcc3d8a1c0	MINOR: connection: Use a dedicated function to look for a session's connection The session_get_conn() must now be used to look for an available connection matching a specific target for a given session. This simplifies a bit the connect_server() function.	2020-07-15 14:08:14 +02:00
Christopher Faulet	08016ab82d	MEDIUM: connection: Add private connections synchronously in session server list When a connection is marked as private, it is now added in the session server list. We don't wait for a stream to be detached from the mux to do so. When the connection is created, this happens after the mux creation. Otherwise, it is performed when the connection is marked as private. To allow that, when a connection is created, the session is systematically set as the connection owner. Thus, a backend connection always has an owner during its creation. And a private connection always has an owner until its death. Note that outside the detach() callback, if the call to session_add_conn() failed, the error is ignored. In this situation, we retry to add the connection into the session server list in the detach() callback. If this fails at this step, the multiplexer is destroyed and the connection is closed.	2020-07-15 14:08:14 +02:00
Christopher Faulet	21ddc74e8a	MINOR: connection: Add a wrapper to mark a connection as private To set a connection as private, the conn_set_private() function must now be called. It sets the CO_FL_PRIVATE flag, but it also removes the connection from the available connection list, if necessary. For now, it never happens because only HTTP/1 connections may be set as private after their creation. And these connections are never inserted in the available connection list.	2020-07-15 14:08:14 +02:00
Christopher Faulet	c64badd573	MINOR: connection: Set new connection as private on reuse never When a new connection is created, it may immediately be set as private if http-reuse never is configured for the backend. There is no reason to wait for the call to mux->detach() to do so.	2020-07-15 14:08:14 +02:00
Christopher Faulet	27bd6ff96d	MINOR: connection: Set the SNI on server connections before installing the mux If an expression is configured to set the SNI on a server connection, the connection is marked as private. To not needlessly add it in the available connection list when the mux is installed, the SNI is now set on the connection before installing the mux, just after the call to si_connect().	2020-07-15 14:08:14 +02:00
Christopher Faulet	29ae7ffed9	BUG/MEDIUM: mux-fcgi: Don't add private connections in available connection list When a stream is detached from a backend private connection, we must not insert it in the available connection list. In addition, we must be sure to remove it from this list. To ensure it is properly performed, this part has been slightly refactored to clearly split processing of private connections from the others. This patch should probably be backported to 2.2.	2020-07-15 14:08:14 +02:00
Christopher Faulet	c5579d18d1	BUG/MEDIUM: mux-h2: Don't add private connections in available connection list When a stream is detached from a backend private connection, we must not insert it in the available connection list. In addition, we must be sure to remove it from this list. To ensure it is properly performed, this part has been slightly refactored to clearly split processing of private connections from the others. This patch should probably be backported to 2.2.	2020-07-15 14:08:14 +02:00
Willy Tarreau	950954f5f7	MINOR: tasks: use MT_LIST_ADDQ() when killing tasks. A bug in task_kill() was fixed by commit `54d31170a` ("BUG/MAJOR: sched: make sure task_kill() always queues the task") which added a list initialization before adding an element. But in fact an unconditional addition would have done the same and been simpler than first initializing then checking the element was initialized. Let's use MT_LIST_ADDQ() there to add the task to kill into the shared queue and kill the dirty LIST_INIT().	2020-07-10 08:52:13 +02:00
Willy Tarreau	a9d7b76f6a	MINOR: connection: use MT_LIST_ADDQ() to add connections to idle lists When a connection is added to an idle list, it's already detached and cannot be seen by two threads at once, so there's no point using TRY_ADDQ, there will never be any conflict. Let's just use the cheaper ADDQ.	2020-07-10 08:52:13 +02:00
Willy Tarreau	8689127816	MINOR: buffer: use MT_LIST_ADDQ() for buffer_wait lists additions The TRY_ADDQ there was not needed since the wait list is exclusively owned by the caller. There's a preliminary test on MT_LIST_ADDED() that might have been eliminated by keeping MT_LIST_TRY_ADDQ() but it would have required two more expensive writes before testing so better keep the test the way it is.	2020-07-10 08:52:13 +02:00
Willy Tarreau	de4db17dee	MINOR: lists: rename some MT_LIST operations to clarify them Initially when mt_lists were added, their purpose was to be used with the scheduler, where anyone may concurrently add the same tasklet, so it sounded natural to implement a check in MT_LIST_ADD{,Q}. Later their usage was extended and MT_LIST_ADD{,Q} started to be used in situations where the element to be added was exclusively owned by the one performing the operation so a conflict was impossible. This became more obvious with the idle connections and the new macro was called MT_LIST_ADDQ_NOCHECK. But this remains confusing and at many places it's not expected that an MT_LIST_ADD could possibly fail, and worse, at some places we start by initializing it before adding (and the test is superfluous) so let's rename them to something more conventional to denote the presence of the check or not: MT_LIST_ADD{,Q} : unconditional operation, the caller owns the element, and doesn't care about the element's current state (exactly like LIST_ADD) MT_LIST_TRY_ADD{,Q}: only perform the operation if the element is not already added or in the process of being added. This means that the previously "safe" MT_LIST_ADD{,Q} are not "safe" anymore. This also means that in case of backport mistakes in the future causing this to be overlooked, the slower and safer functions will still be used by default. Note that the missing unchecked MT_LIST_ADD macro was added. The rest of the code will have to be reviewed so that a number of callers of MT_LIST_TRY_ADDQ are changed to MT_LIST_ADDQ to remove the unneeded test.	2020-07-10 08:50:41 +02:00
Willy Tarreau	5254321d14	BUILD: tcp: condition TCP keepalive settings to platforms providing them Previous commit `b24bc0d` ("MINOR: tcp: Support TCP keepalive parameters customization") broke non-Linux builds as TCP_KEEP{CNT,IDLE,INTVL} are not necessarily defined elsewhere. This patch adds the required #ifdefs to condition the visibility of the keywords, and adds a mention in the doc about their dependency on Linux.	2020-07-09 05:58:51 +02:00
MIZUTA Takeshi	b24bc0dfb6	MINOR: tcp: Support TCP keepalive parameters customization It is now possible to customize TCP keepalive parameters. These correspond to the socket options TCP_KEEPCNT, TCP_KEEPIDLE, TCP_KEEPINTVL and are valid for the defaults, listen, frontend and backend sections. This patch fixes GitHub issue #670.	2020-07-09 05:22:16 +02:00
Tim Duesterhus	d0c0ca2720	CLEANUP: Add static void hlua_deinit() Compiling HAProxy with USE_LUA=1 and running a configuration check within valgrind with a very simple configuration such as: listen foo bind :8080 Will report quite a few possible leaks afterwards: ==24048== LEAK SUMMARY: ==24048== definitely lost: 0 bytes in 0 blocks ==24048== indirectly lost: 0 bytes in 0 blocks ==24048== possibly lost: 95,513 bytes in 1,209 blocks ==24048== still reachable: 329,960 bytes in 71 blocks ==24048== suppressed: 0 bytes in 0 blocks Printing these possible leaks shows that all of them are caused by Lua. Luckily Lua makes it very easy to free all used memory, so let's do this on shutdown. After this patch is applied the output looks much better: ==24199== LEAK SUMMARY: ==24199== definitely lost: 0 bytes in 0 blocks ==24199== indirectly lost: 0 bytes in 0 blocks ==24199== possibly lost: 0 bytes in 0 blocks ==24199== still reachable: 329,960 bytes in 71 blocks ==24199== suppressed: 0 bytes in 0 blocks	2020-07-07 16:52:35 +02:00
Tim Duesterhus	bbdd5b8ca9	CLEANUP: Add static void vars_deinit() vars_deinit() frees all var_names during deinit().	2020-07-07 16:52:35 +02:00
Tim Duesterhus	34bef074c6	CLEANUP: haproxy: Free post_server_check_list in deinit() This allocation is technically always reachable and cannot leak, but so are a few others that are freed.	2020-07-07 16:52:35 +02:00
Tim Duesterhus	0837eb11cf	CLEANUP: haproxy: Free server_deinit_list in deinit() This allocation is technically always reachable and cannot leak, but so are a few others that are freed.	2020-07-07 16:52:35 +02:00
Tim Duesterhus	fdf904a297	CLEANUP: haproxy: Free post_deinit_list in deinit() This allocation is technically always reachable and cannot leak, but so are a few others that are freed.	2020-07-07 16:52:35 +02:00
Tim Duesterhus	17e363f751	CLEANUP: haproxy: Free proxy_deinit_list in deinit() This allocation is technically always reachable and cannot leak, but so are a few others that are freed.	2020-07-07 16:52:35 +02:00
Tim Duesterhus	041a626a8a	BUG/MINOR: sample: Free str.area in smp_check_const_meth Given the following example configuration: listen foo mode http bind *:8080 http-request set-var(txn.leak) meth(GET) server x example.com:80 Running a configuration check with valgrind reports: ==25992== 4 bytes in 1 blocks are definitely lost in loss record 1 of 344 ==25992== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==25992== by 0x4E239D: my_strndup (tools.c:2261) ==25992== by 0x581E20: make_arg_list (arg.c:253) ==25992== by 0x4DE91D: sample_parse_expr (sample.c:890) ==25992== by 0x58E304: parse_store (vars.c:772) ==25992== by 0x566A3F: parse_http_req_cond (http_rules.c:95) ==25992== by 0x4A4CE6: cfg_parse_listen (cfgparse-listen.c:1339) ==25992== by 0x494C59: readcfgfile (cfgparse.c:2049) ==25992== by 0x545145: init (haproxy.c:2029) ==25992== by 0x421E42: main (haproxy.c:3175) After this patch is applied the leak is gone as expected. This is a fairly minor leak, but it can add up for many uses of the `bool()` sample fetch. The bug most likely exists since the `bool()` sample fetch was introduced in commit `cc103299c7`. The fix may be backported to HAProxy 1.6+.	2020-07-07 16:52:35 +02:00
Tim Duesterhus	c7d8a86f2f	BUG/MINOR: sample: Free str.area in smp_check_const_bool Given the following example configuration: listen foo mode http bind *:8080 http-request set-var(txn.leak) bool(1) server x example.com:80 Running a configuration check with valgrind reports: ==24233== 2 bytes in 1 blocks are definitely lost in loss record 1 of 345 ==24233== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==24233== by 0x4E238D: my_strndup (tools.c:2261) ==24233== by 0x581E10: make_arg_list (arg.c:253) ==24233== by 0x4DE90D: sample_parse_expr (sample.c:890) ==24233== by 0x58E2F4: parse_store (vars.c:772) ==24233== by 0x566A2F: parse_http_req_cond (http_rules.c:95) ==24233== by 0x4A4CE6: cfg_parse_listen (cfgparse-listen.c:1339) ==24233== by 0x494C59: readcfgfile (cfgparse.c:2049) ==24233== by 0x545135: init (haproxy.c:2029) ==24233== by 0x421E42: main (haproxy.c:3175) After this patch is applied the leak is gone as expected. This is a fairly minor leak, but it can add up for many uses of the `bool()` sample fetch. The bug most likely exists since the `bool()` sample fetch was introduced in commit `cc103299c7`. The fix may be backported to HAProxy 1.6+.	2020-07-07 16:52:35 +02:00
Tim Duesterhus	826cf0729b	BUG/MINOR: haproxy: Free srule->expr during deinit Given the following example configuration: backend foo mode http use-server %[str(x)] if { always_true } server x example.com:80 Running a configuration check with valgrind reports: ==19376== 170 (40 direct, 130 indirect) bytes in 1 blocks are definitely lost in loss record 281 of 347 ==19376== at 0x4C2FB55: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==19376== by 0x5091AC: add_sample_to_logformat_list (log.c:511) ==19376== by 0x50A5A6: parse_logformat_string (log.c:671) ==19376== by 0x4957F2: check_config_validity (cfgparse.c:2588) ==19376== by 0x54442D: init (haproxy.c:2129) ==19376== by 0x421E42: main (haproxy.c:3169) After this patch is applied the leak is gone as expected. This is a very minor leak that can only be observed if deinit() is called, shortly before the OS will free all memory of the process anyway. No backport needed.	2020-07-07 16:52:35 +02:00
Tim Duesterhus	6fb74a1dc3	BUG/MINOR: haproxy: Free srule->file during deinit Given the following example configuration: backend foo mode http use-server x if { always_true } server x example.com:80 Running a configuration check with valgrind reports: ==18650== 14 bytes in 1 blocks are definitely lost in loss record 3 of 345 ==18650== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==18650== by 0x649E489: strdup (strdup.c:42) ==18650== by 0x4A5438: cfg_parse_listen (cfgparse-listen.c:1548) ==18650== by 0x494C59: readcfgfile (cfgparse.c:2049) ==18650== by 0x5450B5: init (haproxy.c:2029) ==18650== by 0x421E42: main (haproxy.c:3168) After this patch is applied the leak is gone as expected. This is a very minor leak that can only be observed if deinit() is called, shortly before the OS will free all memory of the process anyway. No backport needed.	2020-07-07 16:52:35 +02:00
Tim Duesterhus	18c63591f0	BUG/MINOR: haproxy: Free proxy->unique_id_header during deinit Given the following example configuration: frontend foo mode http bind *:8080 unique-id-header x Running a configuration check with valgrind reports: ==17621== 2 bytes in 1 blocks are definitely lost in loss record 1 of 341 ==17621== at 0x4C2DB8F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==17621== by 0x649E489: strdup (strdup.c:42) ==17621== by 0x4A87F1: cfg_parse_listen (cfgparse-listen.c:2747) ==17621== by 0x494C59: readcfgfile (cfgparse.c:2049) ==17621== by 0x545095: init (haproxy.c:2029) ==17621== by 0x421E42: main (haproxy.c:3167) After this patch is applied the leak is gone as expected. This is a very minor leak that can only be observed if deinit() is called, shortly before the OS will free all memory of the process anyway. No backport needed.	2020-07-07 16:52:35 +02:00

10520 Commits