haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-07 23:56:57 +02:00

Author	SHA1	Message	Date
Tim Duesterhus	2164800c1b	BUG/MINOR: log: Detect missing sampling ranges in config Consider a config like: global log 127.0.0.1:10001 sample :10 local0 No sampling ranges are given here, leading to NULL being passed as the first argument to qsort. This configuration does not make sense anyway, a log without ranges would never log. Thus output an error if no ranges are given. This bug was introduced in `d95ea2897e`. This fix must be backported to HAProxy 2.0.	2019-06-26 11:15:49 +02:00
Willy Tarreau	1bfd6020ce	MINOR: logs: use the new bitmap functions instead of fd_sets for encoding maps The fd_sets we've been using in the log encoding functions are not portable and were shown to break at least under Cygwin. This patch gets rid of them in favor of the new bitmap functions. It was verified with the config below that the log output was exactly the same before and after the change : defaults mode http option httplog log stdout local0 timeout client 1s timeout server 1s timeout connect 1s frontend foo bind :8001 capture request header chars len 255 backend bar option httpchk "GET" "/" "HTTP/1.0\r\nchars: \x01\x02\x03\x04\x05\x06\x07\x08\x09\x0b\x0c\x0e\x0f\x10\x11\x12\x13\x14\x15\x16\x17\x18\x19\x1a\x1b\x1c\x1d\x1e\x1f\x20\x21\x22\x23\x24\x25\x26\x27\x28\x29\x2a\x2b\x2c\x2d\x2e\x2f\x30\x31\x32\x33\x34\x35\x36\x37\x38\x39\x3a\x3b\x3c\x3d\x3e\x3f\x40\x41\x42\x43\x44\x45\x46\x47\x48\x49\x4a\x4b\x4c\x4d\x4e\x4f\x50\x51\x52\x53\x54\x55\x56\x57\x58\x59\x5a\x5b\x5c\x5d\x5e\x5f\x60\x61\x62\x63\x64\x65\x66\x67\x68\x69\x6a\x6b\x6c\x6d\x6e\x6f\x70\x71\x72\x73\x74\x75\x76\x77\x78\x79\x7a\x7b\x7c\x7d\x7e\x7f\x80\x81\x82\x83\x84\x85\x86\x87\x88\x89\x8a\x8b\x8c\x8d\x8e\x8f\x90\x91\x92\x93\x94\x95\x96\x97\x98\x99\x9a\x9b\x9c\x9d\x9e\x9f\xa0\xa1\xa2\xa3\xa4\xa5\xa6\xa7\xa8\xa9\xaa\xab\xac\xad\xae\xaf\xb0\xb1\xb2\xb3\xb4\xb5\xb6\xb7\xb8\xb9\xba\xbb\xbc\xbd\xbe\xbf\xc0\xc1\xc2\xc3\xc4\xc5\xc6\xc7\xc8\xc9\xca\xcb\xcc\xcd\xce\xcf\xd0\xd1\xd2\xd3\xd4\xd5\xd6\xd7\xd8\xd9\xda\xdb\xdc\xdd\xde\xdf\xe0\xe1\xe2\xe3\xe4\xe5\xe6\xe7\xe8\xe9\xea\xeb\xec\xed\xee\xef\xf0\xf1\xf2\xf3\xf4\xf5\xf6\xf7\xf8\xf9\xfa\xfb\xfc\xfd\xfe\xff" server foo 127.0.0.1:8001 check	2019-06-07 11:13:24 +02:00
Willy Tarreau	082b62828d	BUG/MEDIUM: init/threads: provide per-thread alloc/free function callbacks We currently have the ability to register functions to be called early on thread creation and at thread deinitialization. It turns out this is not sufficient because certain such functions may use resources that are being allocated by the other ones, thus creating a race condition depending only on the linking order. For example the mworker needs to register a file descriptor while the pollers will reallocate the fd_updt[] array. Similarly logs and trashes may be used by some init functions while it's unclear whether they have been deduplicated. The same issue happens on deinit, if the fd_updt[] or trash is released before some functions finish to use them, we'll get into trouble. This patch creates a couple of early and late callbacks for per-thread allocation/freeing of resources. A few init functions were moved there, and the fd init code was split between the two (since it used to both allocate and initialize at once). This way the init/deinit sequence is expected to be safe now. This patch should be backported to 1.9 as at least the trash/log issue seems to be present. The run_thread_poll_loop() code is a bit different there as the mworker is not a callback, but it will have no effect and it's enough to drop the mworker changes. This bug was reported by Ilya Shipitsin in github issue #104.	2019-05-22 14:59:08 +02:00
Fr�d�ric L�caille	90a10aeb65	BUG/MINOR: log: Wrong log format initialization. This patch fixes an issue introduced by `0bad840b` commit "MINOR: log: Extract some code to send syslog messages" which leaded to wrong log format variable initializations at least for "short" and "raw" format. This commit skipped the cases where even if passed to __do_send_log(), the syslog tag and syslog pid string must not be used to format the log message with "short" and "raw". This is done iniatilizing "tag_max" and "pid_max" variables (the lengths of the tag and pid strings) to 0, then updating to them to the length of the tag and pid strings passed as variables to __do_send_log() depending on the log format and in every cases using this length for the iovec variable used to send() the log. This bug is specific to 2.0.	2019-05-14 11:12:00 +02:00
Willy Tarreau	c125cef6da	CLEANUP: ssl: make inclusion of openssl headers safe It's always a pain to have to stuff lots of #ifdef USE_OPENSSL around ssl headers, it even results in some of them appearing in a random order and multiple times just to benefit form an existing ifdef block. Let's make these headers safe for inclusion when USE_OPENSSL is not defined, they now perform the test themselves and do nothing if USE_OPENSSL is not defined. This allows to remove no less than 8 such ifdef blocks and make include blocks more readable.	2019-05-10 09:58:43 +02:00
Willy Tarreau	55e2f5ad14	BUG/MINOR: logs/threads: properly split the log area upon startup If logs were emitted before creating the threads, then the dataptr pointer keeps a copy of the end of the log header. Then after the threads are created, the headers are reallocated for each thread. However the end pointer was not reset until the end of the first second, which may result in logs emitted by multiple threads during the first second to be mangled, or possibly in some cases to use a memory area that was reused for something else. The fix simply consists in reinitializing the end pointers immediately when the threads are created. This fix must be backported to 1.9 and 1.8.	2019-05-05 10:16:13 +02:00
Dragan Dosen	61302da0e7	BUG/MINOR: log: properly free memory on logformat parse error and deinit() This patch may be backported to all supported versions.	2019-05-02 10:05:07 +02:00
Fr�d�ric L�caille	d803e475e5	MINOR: log: Enable the log sampling and load-balancing feature. This patch implements the sampling and load-balancing of log servers configured with "sample" new keyword implemented by this commit: 'MINOR: log: Add "sample" new keyword to "log" lines'. As the list of ranges used to sample the log to balance is ordered, we only have to maintain ->curr_idx member of smp_info struct which is the index of the sample and check if it belongs or not to the current range to decide if we must send it to the log server or not.	2019-04-30 09:25:09 +02:00
Fr�d�ric L�caille	d95ea2897e	MINOR: log: Add "sample" new keyword to "log" lines. This patch implements the parsing of "sample" new optional keyword for "log" lines to be able to sample and balance the load of log messages between serveral log destinations declared by "log" lines. This keyword must be followed by a list of comma seperated ranges of indexes numbered from 1 to define the samples to be used to balance the load of logs to send. This "sample" keyword must be used on "log" lines obviously before the remaining optional ones without keyword. The list of ranges must be followed by a colon character to separate it from the log sampling size. With such following configuration declarations: log stderr local0 log 127.0.0.1:10001 sample 2-3,8-11:11 local0 log 127.0.0.2:10002 sample 5:5 local0 in addition to being sent to stderr, about the second "log" line, every 11 logs the logs #2 up to #3 would be sent to 127.0.0.1:10001, then #8 up tp #11 four logs would be sent to the same log server and so on periodically. Logs would be sent to 127.0.0.2:100002 every 5 logs. It is also possible to define the size of the sample with a value different of the maximum of the high limits of the ranges, for instance as follows: log 127.0.0.1:10001 sample 2-3,8-11:15 local0 as before the two logs #2 and #3 would be sent to 127.0.0.1:10001, then #8 up tp #11 logs, but in this case here, this would be done periodically every 15 messages. Also note that the ranges must not overlap each others. This is to ease the way the logs are periodically sent.	2019-04-30 09:25:09 +02:00
Fr�d�ric L�caille	0bad840b4d	MINOR: log: Extract some code to send syslog messages. This patch extracts the code of __send_log() responsible of sending a syslog message to a syslog destination represented as a logsrv struct to define __do_send_log() function. __send_log() calls __do_send_log() for each syslog destination of a proxy after having prepared some of its parameters.	2019-04-23 14:16:51 +02:00
Robin H. Johnson	543d4507ca	MINOR: skip get_gmtime where tm is unused For LOG_FMT_TS (%Ts), the tm variable is not used, so save some cycles on the call to get_gmtime. Backport: 1.9 1.8 Signed-off-by: Robin H. Johnson <rjohnson@digitalocean.com>	2019-04-11 14:58:32 +02:00
Radek Zajic	594c456d14	BUG/MINOR: log: properly format IPv6 address when LOG_OPT_HEXA modifier is used. In lf_ip(), when LOG_OPT_HEXA modifier is used, there is a code to format the IP address as a hexadecimal string. This code does not properly handle cases when the IP address is IPv6. In such case, the code only prints `00000000`. This patch adds support for IPv6. For legacy IPv4, the format remains unchanged. If IPv6 socket is used to accept IPv6 connection, the full IPv6 address is returned. For example, IPv6 localhost, ::1, is printed as 00000000000000000000000000000001. If IPv6 socket accepts IPv4 connection, the IPv4 address is mapped by the kernel into the IPv4-mapped-IPv6 address space (RFC4291, section 2.5.5.2) and is formatted as such. For example, 127.0.0.1 becomes ::ffff:127.0.0.1, which is printed as 00000000000000000000FFFF7F000001. This should be backported to 1.9.	2019-03-22 17:31:18 +01:00
Olivier Houchard	d2ee3e7227	MEDIUM: logs: Use the new _HA_ATOMIC_* macros. Use the new _HA_ATOMIC_* macros and add barriers where needed.	2019-03-11 17:02:38 +01:00
Olivier Houchard	7c49711d60	BUG/MEDIUM: logs: Only attempt to free startup_logs once. deinit_log_buffers() can be called once per thread, however startup_logs is common to all threads. So only attempt to free it once. This should be backported to 1.9 and 1.8.	2019-03-07 14:59:34 +01:00
Willy Tarreau	b1d7b700bb	BUG/MEDIUM: log: don't mark log FDs as non-blocking on terminals With the new ability to log to a terminal, it's convenient to be able to use "log stdout" in a config file, except that it now results in setting the terminal to non-blocking mode, breaking every utility relying on stdin afterwards. Since the only reason for logging to a terminal is to debug, do not set the FD to non-blocking mode when it's a terminal. This fix must be backported to 1.9.	2019-01-02 20:12:02 +01:00
Willy Tarreau	a648399c90	BUG/MINOR: logs: leave startup-logs global and not per-thread Commit `f8188c6` ("MEDIUM: threads/logs: Make logs thread-safe") made logs thread-local but it also made the copy of the startup-logs thread-local, meaning that when threads are configured, upon startup the list of startup logs appears to be empty. Let's just remove the THEAD_LOCAL directive there, as the check for the startup period is already present. This fix should be backported to 1.8.	2018-12-15 16:55:36 +01:00
Willy Tarreau	204e3f1fab	BUG/MINOR: log: fix logging to both FD and IP PiBa-NL reported an issue affecting logs when stdout is enabled at the same time as an IP address. It does not affect FD and UNIX, but does still affect multiple FDs. What happens is that the condition to detect that the initialization was not made relies on the FD being -1, and in this case the FD points to the unique FD used for AF_INET sockets, so the configured socket used for outgoing logs over UDP gets overwritten by the last configured FD. This is not appropriate, so instead we rely on the sin_port part of the IPv4-mapped address to store the initialization state for each FD. This part deserves being significantly revamped, as IPv6 is still not possible due to the way the FDs are managed, and inherited FDs are a bit hackish. Note that this patch relies on "MINOR: tools: preset the port of fd-based "sockets" to zero" in order to operate properly. No backport is needed.	2018-12-15 15:54:13 +01:00
Olivier Houchard	f90db445cd	BUG/MEDIUM: log: Don't call sample_fetch_as_type if we don't have a stream. In sess_build_logline(), don't attempt to call sample_fetch_as_type() if we don't have a stream. It used never to happen in the past before commit `09bb27c` ("MEDIUM: log: make sess_build_logline() support being called with no stream"). But now it can happen when sess_log() is called from the lower layers (i.e. the H2 mux got garbage when it was expecting a preface frame), and it reveals that some sample fetch functions and some converter fnuctions which rely on the stream don't test for its existence. For the sample fetch functions, a durable solution is easy and would consist in adapting sample_process() to verify the SMP_USE_* bits when the stream is not set. But for the converters we don't have this option as they don't declare whether or not they use a stream (which possibly is the real issue). Thus for now let's disable sample_fetch_as_type() if a stream does not exist, until it can be more accurately refined later.	2018-12-15 14:34:31 +01:00
Willy Tarreau	30925659ef	CLEANUP: h1: remove some occurrences of unneeded h1.h inclusions Several places where h1.h was included didn't need it at all since they in fact relied on the legacy HTTP definitions.	2018-12-11 17:15:13 +01:00
Willy Tarreau	b6b3df3ed3	MEDIUM: initcall: use initcalls for a few initialization functions signal_init(), init_log(), init_stream(), and init_task() all used to only preset some values and lists. This needs to be done very early to provide a reliable interface to all other users. The calls used to be explicit in haproxy.c:init(). Now they're placed in initcalls at the STG_PREPARE stage. The functions are not exported anymore.	2018-11-26 19:50:32 +01:00
Willy Tarreau	172f5ce948	MINOR: initcall: use initcalls for most post_{check,deinit} and per_thread* Most calls to hap_register_post_check(), hap_register_post_deinit(), hap_register_per_thread_init(), hap_register_per_thread_deinit() can be done using initcalls and will not require a constructor anymore. Let's create a set of simplified macros for this, called respectively REGISTER_POST_CHECK, REGISTER_POST_DEINIT, REGISTER_PER_THREAD_INIT, and REGISTER_PER_THREAD_DEINIT. Some files were not modified because they wouldn't benefit from this or because they conditionally register (e.g. the pollers).	2018-11-26 19:50:32 +01:00
Willy Tarreau	0108d90c6c	MEDIUM: init: convert all trivial registration calls to initcalls This switches explicit calls to various trivial registration methods for keywords, muxes or protocols from constructors to INITCALL1 at stage STG_REGISTER. All these calls have in common to consume a single pointer and return void. Doing this removes 26 constructors. The following calls were addressed : - acl_register_keywords - bind_register_keywords - cfg_register_keywords - cli_register_kw - flt_register_keywords - http_req_keywords_register - http_res_keywords_register - protocol_register - register_mux_proto - sample_register_convs - sample_register_fetches - srv_register_keywords - tcp_req_conn_keywords_register - tcp_req_cont_keywords_register - tcp_req_sess_keywords_register - tcp_res_cont_keywords_register - flt_register_keywords	2018-11-26 19:50:32 +01:00
William Lallemand	9c56a22b20	MINOR: log: introduce ha_notice() It's like ha_warning() or ha_alert() but with a NOTICE prefix.	2018-11-21 19:02:23 +01:00
Joseph Herlant	85b4059b82	CLEANUP: Fix typos in the log subsystem Fix some misspells in the code comments of the log subsystem.	2018-11-18 22:26:42 +01:00
Willy Tarreau	db398435aa	MINOR: stream-int: replace si_cant_put() with si_rx_room_{blk,rdy}() Remaining calls to si_cant_put() were all for lack of room and were turned to si_rx_room_blk(). A few places where SI_FL_RXBLK_ROOM was cleared by hand were converted to si_rx_room_rdy(). The now unused si_cant_put() function was removed.	2018-11-18 21:41:50 +01:00
William Lallemand	a8b2671cf6	BUG/MEDIUM: log: don't CLOEXEC the inherited FDs When configuring the logs with a FD and using the master worker, the FD was closed upon a reload because it was configured with CLOEXEC. It leads to using the wrong FD for the logs and to close them. Which is unfortunate since the master rely on the FD left opened during a reload. The fix is to stop doing a CLOEXEC when the FD is inherited. No backport needed.	2018-11-13 19:32:45 +01:00
Willy Tarreau	c1b0645dac	MEDIUM: log: add a new "raw" format This format is pretty similar to the previous "short" format except that it also removes the severity level. Thus only the raw message is sent. This is suitable for use in containers, where only the raw information is expected and where the severity is supposed to come from the file descriptor used.	2018-11-12 18:37:55 +01:00
Willy Tarreau	e8746a08b2	MEDIUM: log: support a new "short" format This format is meant to be used with local file descriptors. It emits messages only prefixed with a level, removing all the process name, system name, date and so on. It is similar to the printk() format used on Linux. It's suitable to be sent to a local logger compatible with systemd's output format. Note that the facility is still required but not used, hence it is suggested to use "daemon" to remind that it's a local logger. Example : log stdout format short daemon # send everything to stdout log stderr format short daemon notice # send important events to stderr	2018-11-12 18:37:55 +01:00
Willy Tarreau	5a32ecc6cf	MEDIUM: log: add support for logging to existing file descriptors In certain situations it would be desirable to log to an existing file descriptor, the most common case being a pipe between containers or processes. The main issue with pipes is that using write() on them will randomly truncate messages. But there is a trick. By using writev(), we can atomically deliver or drop a message, which perfectly fits the purpose. The only caveat is that large messages (4096 bytes on modern operating systems) may be interleaved with messages from other processes if using nbproc for example. In practice such messages are rare and most of the time when users need such type of logging, the load is low enough for a single process to be running so this is not really a problem. This logging method thus uses unbuffered writev() calls and is uses more CPU than if it used its own buffer with large writes at once, though this is not a problem for moderate loads. Logging to a file descriptor attached to a file also works with the side effect that the process is significantly slowed down during disk accesses and that it's not possible to rotate the file without restarting the process. For this reason this option is not offered as a configuration option, since it would confuse most users, but one could decide to redirect haproxy's output to a file during debugging sessions. Two aliases "stdout" and "stderr" are provided, but keep in mind that these are closed by default in daemon mode. When logging to a pipe or socket at a high enough rate, some logs will be dropped and the number of dropped messages is reported in "show info".	2018-11-12 18:37:55 +01:00
Willy Tarreau	13ef773722	MINOR: log: report the number of dropped logs in the stats It's easy to detect when logs on some paths are lost as sendmsg() will return EAGAIN. This is particularly true when sending to /dev/log, which often doesn't support a big logging capacity. Let's keep track of these and report the total number of dropped messages in "show info".	2018-11-12 18:37:55 +01:00
Willy Tarreau	251fe34ca2	MINOR: log: slightly improve error message syntax on log failure The error messages used to say something along "socket logger 2 failed" or "sendmsg logger 2 failed" which are confusing. Let's rephrase this "sendmsg() failed for logger 2".	2018-11-12 18:37:55 +01:00
Willy Tarreau	0cd3bd628a	MINOR: stream-int: rename si_applet_{want\|stop\|cant}_{get\|put} It doesn't make sense to limit this code to applets, as any stream interface can use it. Let's rename it by simply dropping the "applet_" part of the name. No other change was made except updating the comments.	2018-11-11 10:18:37 +01:00
Willy Tarreau	9fa267dada	MINOR: log: make sess_log() support sess=NULL At many places in muxes we'll have to add tests to check if the connection is front or back before deciding to log. Instead let's centralize this test in sess_log() to simply do nothing when sess=NULL.	2018-10-12 16:58:01 +02:00
Willy Tarreau	1aaf324227	BUILD: log: silent build warnings due to unchecked __objt_{server,applet} These ones are safe to use there since the same check is performed in the switch/case they're used it. Let's use the unguarded versions instead.	2018-09-20 11:42:15 +02:00
Willy Tarreau	e10cd48a83	REORG: http: move the log encoding tables to log.c There are 3 tables in proto_http which are used exclusively by logs : hdr_encode_map[], url_encode_map[] and http_encode_map[]. They indicate what characters are safe to be emitted in logs depending on the part of the message where they are placed. Let's move this to log.c, as well as its initialization. It's worth noting that the rfc5424 map was already initialized there.	2018-09-11 10:30:25 +02:00
Willy Tarreau	35b51c6e5b	REORG: http: move the HTTP semantics definitions to http.h/http.c It's a bit painful to have to deal with HTTP semantics for each protocol version (H1 and H2), and working on the version-agnostic code further emphasizes the problem. This patch creates http.h and http.c which are agnostic to the version in use, and which borrow a few parts from proto_http and from h1. For example the once thought h1-specific h1_char_classes array is in fact dictated by RFC7231 and is used to parse HTTP headers. A few changes were made to a few files which were including proto_http.h while they only needed http.h. Certain string definitions pre-dated the introduction of indirect strings (ist) so some were used to simplify the definition of the known HTTP methods. The current lookup code saves 2 kB of a heavily used table and is faster than the previous table based lookup (typ. 14 ns vs 16 before).	2018-09-11 10:30:25 +02:00
Willy Tarreau	5383935856	MINOR: log: provide a function to emit a log for a session The new function sess_log() only needs a session to emit a log. It will ignore the parts that depend on the stream. It is usable to emit a log to report early errors in muxes. These ones will typically mention "<BADREQ>" for the request and 0 for the HTTP status code.	2018-09-06 09:43:41 +02:00
Willy Tarreau	09bb27cdea	MEDIUM: log: make sess_build_logline() support being called with no stream Till now it was impossible to emit logs from the lower layers only because a stream was mandatory. From now on it will at least be possible to emit a log to report a bad request or some timings for example. When the stream is null, sess_build_logline() will use default values and will extract the timing information from the session just like stream_new() does, so the resulting log line is perfectly valid. The termination state will indicate a proxy error during the request phase since it is the only realistic use for such a call with no stream.	2018-09-06 09:43:06 +02:00
Willy Tarreau	5cacab63e1	MINOR: log: use zero as the request counter if there is no stream When s==NULL we don't have any assigned request counter. Ideally we should proceed exactly like when a stream is initialized and assign a unique value. For now we only place it into a local variable.	2018-09-05 20:01:23 +02:00
Willy Tarreau	b8bc52522c	MINOR: log: keep a copy of s->flags early to avoid a dereference By placing s->flags into a local variable we'll be able to force it new values when s is NULL.	2018-09-05 20:01:23 +02:00
Willy Tarreau	02fdf4f77b	MINOR: log: use NULL for the unique_id if there is no stream Now s->unique_id is used as NULL (not set) if s==NULL.	2018-09-05 20:01:23 +02:00
Willy Tarreau	abd71a5c2e	MINOR: log: don't check the stream-int's conn_retries if the stream is NULL Let's simply forget the conn_retries when there is no stream since we haven't tried to connect yet.	2018-09-05 20:01:23 +02:00
Willy Tarreau	e1809dfdaf	MINOR: log: be sure not to dereference a null stream for a target The supported targets are either a server or an applet, so both are NULL if the stream is NULL.	2018-09-05 20:01:23 +02:00
Willy Tarreau	d4f9166f4e	MINOR: log: do not dereference a null stream to access captures If the stream is null, let's simply not check captures. That's already done if there is no capture.	2018-09-05 20:01:23 +02:00
Willy Tarreau	2393c5b6a9	MINOR: log: keep a copy of the backend connection early in sess_build_logline() This way we can avoid dereferencing a possibly inexisting stream.	2018-09-05 20:01:23 +02:00
Willy Tarreau	26ffa8544d	CLEANUP: log: make the low_level lf_{ip,port,text,text_len} functions take consts These ones were abusively relying on variables making it hard to integrate with const arguments.	2018-09-05 20:01:23 +02:00
Willy Tarreau	372ac5abff	MINOR: log: don't unconditionally pick log info from s->logs We'll soon support s==NULL so let's use an intermediary variable for the logs structure. For now it only points to s->logs but will support a local variable as an alternative later.	2018-09-05 20:01:23 +02:00
Willy Tarreau	56a91dddc6	MINOR: log: make sess_build_logline() not dereference a NULL stream for txn If the stream is NULL, the txn is NULL as well. This condition is already handled everywhere else.	2018-09-05 20:01:23 +02:00
Willy Tarreau	a21c0e60d2	MINOR: log: make the backend fall back to the frontend when there's no stream This is already what happens before the backend is assigned, except that now we don't need to dereference a NULL stream to figure this.	2018-09-05 20:01:23 +02:00
Willy Tarreau	43c538eab6	MINOR: log: move the log code to sess_build_logline() to add extra arguments The current build_logline() can only be used with valid streams, which means it is not suitable for use from muxes. We start by moving it into another more generic function which takes the session as an argument, to avoid complexifying all the internal API for jsut a few use cases. This new function is not supposed to be called directly from outside so we'll be able to instrument it to support several calling conventions. For now the behaviour and conditions remain unchanged.	2018-09-05 20:01:23 +02:00
Patrick Hemmer	ffe5e8c638	MINOR: stream: rename {srv,prx}_queue_size to *_queue_pos The current name is misleading as it implies a queue size, but the value instead indicates a position in the queue. The value is only the queue size at the exact moment the element is enqueued. Soon we will gain the ability to insert anywhere into the queue, upon which clarity of the name is more important.	2018-08-10 15:04:14 +02:00
Willy Tarreau	83061a820e	MAJOR: chunks: replace struct chunk with struct buffer Now all the code used to manipulate chunks uses a struct buffer instead. The functions are still called "chunk*", and some of them will progressively move to the generic buffer handling code as they are cleaned up.	2018-07-19 16:23:43 +02:00
Willy Tarreau	843b7cbe9d	MEDIUM: chunks: make the chunk struct's fields match the buffer struct Chunks are only a subset of a buffer (a non-wrapping version with no head offset). Despite this we still carry a lot of duplicated code between buffers and chunks. Replacing chunks with buffers would significantly reduce the maintenance efforts. This first patch renames the chunk's fields to match the name and types used by struct buffers, with the goal of isolating the code changes from the declaration changes. Most of the changes were made with spatch using this coccinelle script : @rule_d1@ typedef chunk; struct chunk chunk; @@ - chunk.str + chunk.area @rule_d2@ typedef chunk; struct chunk chunk; @@ - chunk.len + chunk.data @rule_i1@ typedef chunk; struct chunk chunk; @@ - chunk->str + chunk->area @rule_i2@ typedef chunk; struct chunk chunk; @@ - chunk->len + chunk->data Some minor updates to 3 http functions had to be performed to take size_t ints instead of ints in order to match the unsigned length here.	2018-07-19 16:23:43 +02:00
Christopher Faulet	28ac099907	MINOR: log: Keep the ref when a log server is copied to avoid duplicate entries With "log global" line, the global list of loggers are copied into the proxy's struct. The list coming from the default section is also copied when a frontend or a backend section is parsed. So it is possible to have duplicate entries in the proxy's list. For instance, with this following config, all messages will be logged twice: global log 127.0.0.1 local0 debug daemon defaults mode http log global option httplog frontend front-http log global bind *:8888 default_backend back-http backend back-http server www 127.0.0.1:8000	2018-04-05 15:13:54 +02:00
Christopher Faulet	4b0b79dd56	MINOR: log: move 'log' keyword parsing in dedicated function Now, the function parse_logsrv should be used to parse a "log" line. This function will update the list of loggers passed in argument. It can release all log servers when "no log" line was parsed (by the caller) or it can parse "log global" or "log <address> ... " lines. It takes care of checking the caller context (global or not) to prohibit "log global" usage in the global section.	2018-04-05 15:13:54 +02:00
Willy Tarreau	c98aebcdb8	MINOR: log: stop emitting alerts when it's not possible to write on the socket This is a recurring pain when using certain unix domain sockets or when sending to temporarily unroutable addresses, if the process remains in the foreground, the console is full of error which it's impossible to do anything about. It's even worse when the process is remote, or when run from a serial console which will slow the whole process down. Let's send them only once now to warn about a possible config issue, and not pollute the system nor slow everything down.	2018-03-20 16:44:25 +01:00
Christopher Faulet	789691778f	BUG/MEDIUM: mworker: Set FD_CLOEXEC flag on log fd A log socket (UDP or UNIX) is opened by the master during its startup, when the first log message is sent. So, to prevent FD leaks, we must ensure we correctly close it during a reload. By setting FD_CLOEXEC bit on it, we are sure it will be automatically closed it during a reload. This patch must be backported in 1.8.	2017-12-19 14:03:30 +01:00
Willy Tarreau	bafbe01028	CLEANUP: pools: rename all pool functions and pointers to remove this "2" During the migration to the second version of the pools, the new functions and pool pointers were all called "pool_something2()" and "pool2_something". Now there's no more pool v1 code and it's a real pain to still have to deal with this. Let's clean this up now by removing the "2" everywhere, and by renaming the pool heads "pool_head_something".	2017-11-24 17:49:53 +01:00
Christopher Faulet	767a84bcc0	CLEANUP: log: Rename Alert/Warning in ha_alert/ha_warning	2017-11-24 17:19:12 +01:00
Olivier Houchard	9aaf778129	MAJOR: connection : Split struct connection into struct connection and struct conn_stream. All the references to connections in the data path from streams and stream_interfaces were changed to use conn_streams. Most functions named "something_conn" were renamed to "something_cs" for this. Sometimes the connection still is what matters (eg during a connection establishment) and were not always renamed. The change is significant and minimal at the same time, and was quite thoroughly tested now. As of this patch, all accesses to the connection from upper layers go through the pass-through mux.	2017-10-31 18:03:23 +01:00
Christopher Faulet	cd7879adc2	BUG/MEDIUM: threads: Run the poll loop on the main thread too There was a flaw in the way the threads was created. the main one was just used to create all the others and just wait to exit. Now, it is used to run a poll loop. So we only create nbthread-1 threads. This also fixes a bug about the compression filter when there is only 1 thread (nbthread == 1 or no threads support). The bug was in the way thread-local resources was initialized. per-thread init/deinit callbacks were never called for the main process. So, with nthread set to 1, some buffers remained uninitialized.	2017-10-31 13:58:33 +01:00
Christopher Faulet	ff8abcd31d	MEDIUM: threads/proxy: Add a lock per proxy and atomically update proxy vars Now, each proxy contains a lock that must be used when necessary to protect it. Moreover, all proxy's counters are now updated using atomic operations.	2017-10-31 13:58:30 +01:00
Christopher Faulet	f8188c69fa	MEDIUM: threads/logs: Make logs thread-safe log buffers and static variables used in log functions are now thread-local. So there is no need to lock anything to log messages. Moreover, per-thread init/deinit functions are now used to initialize these buffers.	2017-10-31 13:58:30 +01:00
Christopher Faulet	c1b730a41a	MINOR: cli: Add "show startup-logs" command This command will dump all startup_logs buffer containing all alerts and warnings emitted during HAProxy startup.	2017-10-31 11:36:13 +01:00
Christopher Faulet	d46963865e	MINOR: log: Save alerts and warnings emitted during HAProxy startup Because we can't always display the standard error messages when HAProxy is started, all alerts and warnings emitted during the startup will now be saved in a buffer. It can also be handy to store these messages just in case you missed something during the startup To implement this feature, Alert and Warning functions now relies on display_message. The difference is just on conditions to call this function and it remains unchanged. In display_message, if MODE_STARTING flag is set, we save the message.	2017-10-31 11:36:13 +01:00
Emmanuel Hocdet	01da571e21	MINOR: merge ssl_sock_get calls for log and ppv2 Merge ssl_sock_get_version and ssl_sock_get_proto_version. Change ssl_sock_get_cipher to be used in ppv2.	2017-10-27 19:32:36 +02:00
David Carlier	93e8b88f06	BUG/MINOR: log: fixing small memory leak in error code path. since we do not log the sample fetch when it is invalid, we can free the log data.	2017-09-21 17:44:31 +02:00
Christopher Faulet	0132d06f68	MINOR: logs: Use dedicated function to init/deinit log buffers Now, we use init_log_buffers and deinit_log_buffers to, respectively, initialize and deinitialize log buffers used for syslog messages. These functions have been introduced to be used by threads, to deal with thread-local log buffers.	2017-09-05 10:29:31 +02:00
Willy Tarreau	d02286d6c8	BUG/MINOR: log: pin the front connection when front ip/ports are logged Mathias Weiersmueller reported an interesting issue with logs which Lukas diagnosed as dating back from commit `9b061e332` (1.5-dev9). When front connection information (ip, port) are logged in TCP mode and the log is emitted at the end of the connection (eg: because %B or any log tag requiring LW_BYTES is set), the log is emitted after the connection is closed, so the address and ports cannot be retrieved anymore. It could be argued that we'd make a special case of these to immediatly retrieve the source and destination addresses from the connection, but it seems cleaner to simply pin the front connection, marking it "tracked" by adding the LW_XPRT flag to mention that we'll need some of these elements at the last moment. Only LW_FRTIP and LW_CLIP are affected. Note that after this change, LW_FRTIP could simply be removed as it's not used anywhere. Note that the problem doesn't happen when using %[src] or %[dst] since all sample expressions set LW_XPRT. This must be backported to 1.7, 1.6 and 1.5.	2017-06-23 11:34:57 +02:00
Jim Freeman	a2278c8bbb	CLEANUP: logs: typo: simgle => single Typo in error message. Backport to 1.7.	2017-04-18 14:52:07 +02:00
Willy Tarreau	a261e9b094	CLEANUP: connection: remove all direct references to raw_sock and ssl_sock Now we exclusively use xprt_get(XPRT_RAW) instead of &raw_sock or xprt_get(XPRT_SSL) for &ssl_sock. This removes a bunch of #ifdef and include spread over a number of location including backend, cfgparse, checks, cli, hlua, log, server and session.	2016-12-22 23:26:38 +01:00
Willy Tarreau	71a8c7c49e	MINOR: listener: move the transport layer pointer to the bind_conf A mistake was made when the socket layer was cut into proto and transport, the transport was attached to the listener while all listeners in a single "bind" line always have exactly the same transport. It doesn't seem obvious but this is the reason why there are so many #ifdefs USE_OPENSSL in cfgparse : a lot of operations have to be open-coded because cfgparse only manipulates bind_conf and we don't have the information of the transport layer here. Very little code makes use of the transport layer, mainly session setup and log. These places can afford an extra pointer indirection (the listener points to the bind_conf). This change is thus very small, it saves a little bit of memory (8B per listener) and makes the code more flexible.	2016-12-22 23:26:37 +01:00
Thierry FOURNIER / OZON.IO	8a4e4420fb	MEDIUM: log-format: Use standard HAProxy log system to report errors The function log format emit its own error message using Alert(). This patch replaces this behavior and uses the standard HAProxy error system (with memprintf). The benefits are: - cleaning the log system - the logformat can ignore the caller (actually the caller must set a flag designing the caller function). - Make the usage of the logformat function easy for future components.	2016-11-25 07:32:58 +01:00
Thierry FOURNIER / OZON.IO	4ed1c9585d	MINOR: http/conf: store the use_backend configuration file and line for logs The error log of the directive use_backend doesn't provide the file and line containing the declaration. This patch stores theses informations.	2016-11-25 07:15:09 +01:00
Thierry FOURNIER / OZON.IO	a2c38d7904	MEDIUM: log-format: strict parsing and enable fail Until now, the function parse_logformat_string() never fails. It send warnings when it parses bad format, and returns expression in best effort. This patch replaces warnings by alert and returns a fail code. Maybe the warning mode is designed for a compatibility with old configuration versions. If it is the case, now this compatibility is broken. [wt: no, the reason is that an alert must cause a startup failure, but this will be OK with next patch]	2016-11-24 18:54:26 +01:00
Thierry FOURNIER / OZON.IO	6fe0e1b977	CLEANUP: log-format: remove unused arguments The log-format function parse_logformat_string() takes file and line for building parsing logs. These two parameters are embedded in the struct proxy curproxy, which is the current parsing context. This patch removes these two unused arguments.	2016-11-24 18:54:26 +01:00
Thierry FOURNIER / OZON.IO	bca46f0d9d	CLEANUP: log-format: fix return code of function parse_logformat_var_args() This patch replace the successful return code from 0 to 1. The error code is replaced from 1 to 0. The return code of this function is actually unused, so this patch cannot modify the behaviour.	2016-11-24 18:54:26 +01:00
Thierry FOURNIER / OZON.IO	eca4d95317	CLEANUP: log-format: fix return code of the function parse_logformat_var() This patch replaces the successful return code from 0 to 1. The error code is replaced from -1 to 0. The return code of this function is actually unused, so this patch cannot modify the behaviour.	2016-11-24 18:54:25 +01:00
Thierry FOURNIER / OZON.IO	9cbfef2455	BUG/MINOR: log-format: uncatched memory allocation functions Some return code of memory allocation functions are not tested. This patch fix theses checks.	2016-11-24 18:54:25 +01:00
Christopher Faulet	f7e4e7e096	MAJOR: spoe: Add an experimental Stream Processing Offload Engine SPOE makes possible the communication with external components to retrieve some info using an in-house binary protocol, the Stream Processing Offload Protocol (SPOP). In the long term, its aim is to allow any kind of offloading on the streams. This first version, besides being experimental, won't do lot of things. The most important today is to validate the protocol design and lay the foundations of what will, one day, be a full offload engine for the stream processing. So, for now, the SPOE can offload the stream processing before "tcp-request content", "tcp-response content", "http-request" and "http-response" rules. And it only supports variables creation/suppression. But, in spite of these limited features, we can easily imagine to implement a SSO solution, an ip reputation service or an ip geolocation service. Internally, the SPOE is implemented as a filter. So, to use it, you must use following line in a proxy proxy section: frontend my-front ... filter spoe [engine <name>] config <file> ... It uses its own configuration file to keep the HAProxy configuration clean. It is also a easy way to disable it by commenting out the filter line. See "doc/SPOE.txt" for all details about the SPOE configuration.	2016-11-09 22:57:01 +01:00
Thierry FOURNIER / OZON.IO	4cac359a39	MEDIUM: log: Decompose %Tq in %Th %Ti %TR Tq is the time between the instant the connection is accepted and a complete valid request is received. This time includes the handshake (SSL / Proxy-Protocol), the idle when the browser does preconnect and the request reception. This patch decomposes %Tq in 3 measurements names %Th, %Ti, and %TR which returns respectively the handshake time, the idle time and the duration of valid request reception. It also adds %Ta which reports the request's active time, which is the total time without %Th nor %Ti. It replaces %Tt as the total time, reporting accurate measurements for HTTP persistent connections. %Th is avalaible for TCP and HTTP sessions, %Ti, %TR and %Ta are only avalaible for HTTP connections. In addition to this, we have new timestamps %tr, %trg and %trl, which log the date of start of receipt of the request, respectively in the default format, in GMT time and in local time (by analogy with %t, %T and %Tl). All of them are obviously only available for HTTP. These values are more relevant as they more accurately represent the request date without being skewed by a browser's preconnect nor a keep-alive idle time. The HTTP log format and the CLF log format have been modified to use %tr, %TR, and %Ta respectively instead of %t, %Tq and %Tt. This way the default log formats now produce the expected output for users who don't want to manually fiddle with the log-format directive. Example with the following log-format : log-format "%ci:%cp [%tr] %ft %b/%s h=%Th/i=%Ti/R=%TR/w=%Tw/c=%Tc/r=%Tr/a=%Ta/t=%Tt %ST %B %CC %CS %tsc %ac/%fc/%bc/%sc/%rc %sq/%bq %hr %hs %{+Q}r" The request was sent by hand using "openssl s_client -connect" : Aug 23 14:43:20 haproxy[25446]: 127.0.0.1:45636 [23/Aug/2016:14:43:20.221] test~ test/test h=6/i=2375/R=261/w=0/c=1/r=0/a=262/t=2643 200 145 - - ---- 1/1/0/0/0 0/0 "GET / HTTP/1.1" => 6 ms of SSL handshake, 2375 waiting before sending the first char (in fact the time to type the first line), 261 ms before the end of the request, no time spent in queue, 1 ms spend connecting to the server, immediate response, total active time for this request = 262ms. Total time from accept to close : 2643 ms. The timing now decomposes like this : first request 2nd request \|<-------------------------------->\|<-------------- ... t tr t tr ... ---\|----\|----\|----\|----\|----\|----\|----\|----\|-- : Th Ti TR Tw Tc Tr Td : Ti ... :<---- Tq ---->: : :<-------------- Tt -------------->: :<--------- Ta --------->:	2016-08-23 15:18:08 +02:00
Willy Tarreau	077edcba2e	BUILD: log: iovec requires to include sys/uio.h on OpenBSD The following commit merged into 1.6-dev6 broke the build on OpenBSD : `609ac2a` ("MEDIUM: log: replace sendto() with sendmsg() in __send_log()") Including sys/uio.h is enough to fix this. This fix needs to be backported to 1.6.	2016-08-10 19:32:06 +02:00
Dragan Dosen	db1b6f9ecb	BUG/MEDIUM: log: use function "escape_string" instead of "escape_chunk" In function lf_text_len(), we used escape_chunk() to escape special characters. There could be a problem if len is greater than the real src string length (zero-terminated), eg. when calling lf_text_len() from lf_text().	2016-07-26 15:25:32 +02:00
Willy Tarreau	27b639d37f	MINOR: log: add the %Td log-format specifier As suggested by Pavlos, it's too bad that we didn't have a %Td log format tag given that there are a few mentions of Td corresponding to the data transmission time already in the doc, so this is now done. Just like the other specifiers, we report -1 if the connection failed before reaching the data transmission state.	2016-05-17 18:04:30 +02:00
Nenad Merdanovic	54e439f0b4	BUG/MINOR: log: fix a typo that would cause %HP to log <BADREQ> Typo was introduced in `57bc891` ("BUG/MEDIUM: log: fix risk of segfault when logging HTTP fields in TCP mode") which inverted the condition in the test and caused <BADREQ> to be logged when using %HP. Signed-off-by: Nenad Merdanovic <nmerdan@anine.io>	2016-04-29 07:28:44 +02:00
Willy Tarreau	57bc8917c3	BUG/MEDIUM: log: fix risk of segfault when logging HTTP fields in TCP mode David Torgerson faced an issue when using HTTP fields in log-format in TCP sections. The txn is dereferenced while it's null, resulting in a crash of the process. Such configurations are invalid and a warning is emitted, but nevertheless the process must not crash. As found by Lukas Tribus, this is a side effect of the split between the stream and the HTTP transaction that happened in 1.6, making it possible to have txn==NULL there. The fix consists in checking that txn is valid before using it. Fortunately it's easy since almost all places already used to check for the existence of a field (eg: txn->uri). This patch should be backported to 1.6.	2016-04-25 17:15:58 +02:00
Vincent Bernat	02779b6263	CLEANUP: uniformize last argument of malloc/calloc Instead of repeating the type of the LHS argument (sizeof(struct ...)) in calls to malloc/calloc, we directly use the pointer name (sizeof(...)). The following Coccinelle patch was used: @@ type T; T x; @@ x = malloc( - sizeof(T) + sizeof(x) ) @@ type T; T x; @@ x = calloc(1, - sizeof(T) + sizeof(*x) ) When the LHS is not just a variable name, no change is made. Moreover, the following patch was used to ensure that "1" is consistently used as a first argument of calloc, not the last one: @@ @@ calloc( + 1, ... - ,1 )	2016-04-03 14:17:42 +02:00
Benoit GARNIER	e2e5bde3f2	BUG/MINOR: log: Don't use strftime() which can clobber timezone if chrooted The strftime() function can call tzset() internally on some platforms. When haproxy is chrooted, the /etc/localtime file is not found, and some implementations will clobber the content of the current timezone. The GMT offset is computed by diffing the times returned by gmtime_r() and localtime_r(). These variants are guaranteed to not call tzset() and were already used in haproxy while chrooted, so they should be safe. This patch must be backported to 1.6 and 1.5.	2016-03-17 05:30:03 +01:00
Benoit GARNIER	b413c2a759	BUG/MINOR: log: GMT offset not updated when entering/leaving DST GMT offset used in local time formats was computed at startup, but was not updated when DST status changed while running. For example these two RFC5424 syslog traces where emitted 5 seconds apart, just before and after DST changed: <14>1 2016-03-27T01:59:58+01:00 bunch-VirtualBox haproxy 2098 - - Connect ... <14>1 2016-03-27T03:00:03+01:00 bunch-VirtualBox haproxy 2098 - - Connect ... It looked like they were emitted more than 1 hour apart, unlike with the fix: <14>1 2016-03-27T01:59:58+01:00 bunch-VirtualBox haproxy 3381 - - Connect ... <14>1 2016-03-27T03:00:03+02:00 bunch-VirtualBox haproxy 3381 - - Connect ... This patch should be backported to 1.6 and partially to 1.5 (no fix needed in log.c).	2016-03-13 23:48:05 +01:00
Dragan Dosen	835b9212f6	MEDIUM: log: add a new log format flag "E" The +E mode escapes characters '"', '\' and ']' with '\' as prefix. It mostly makes sense to use it in the RFC5424 structured-data log formats. Example: log-format-sd %{+Q,+E}o\ [exampleSDID@1234\ header=%[capture.req.hdr(0)]]	2016-02-12 13:36:47 +01:00
Dragan Dosen	17def46e10	BUG/MEDIUM: logs: fix time zone offset format in RFC5424 The time zone offset format used in function update_log_hdr_rfc5424() was missing ":" as a separator.	2015-10-10 00:07:03 +02:00
Dragan Dosen	43885c728e	BUG/MEDIUM: logs: segfault writing to log from Lua Michael Ezzell reported a bug causing haproxy to segfault during startup when trying to send syslog message from Lua. The function __send_log() can be called with *p that is NULL and/or when the configuration is not fully parsed, as is the case with Lua. This patch fixes this problem by using individual vectors instead of the pre-generated strings log_htp and log_htp_rfc5424. Also, this patch fixes a problem causing haproxy to write the wrong pid in the logs -- the log_htp(_rfc5424) strings were generated at the haproxy start, but "pid" value would be changed after haproxy is started in daemon/systemd mode.	2015-10-02 00:57:45 +02:00
Dragan Dosen	5b78d9b437	MEDIUM: logs: pass the trailing "\n" as an iovec This patch passes the trailing "\n" as an iovec in the function __send_log(), so that we don't need to modify the original log message.	2015-09-28 18:31:09 +02:00
Dragan Dosen	c8cfa7b4f3	MEDIUM: logs: have global.log_send_hostname not contain the trailing space This patch unifies global.log_send_hostname addition in the log header processing.	2015-09-28 18:27:45 +02:00
Dragan Dosen	0b85ecee53	MEDIUM: logs: add a new RFC5424 log-format for the structured-data This patch adds a new RFC5424-specific log-format for the structured-data that is automatically send by __send_log() when the sender is in RFC5424 mode. A new statement "log-format-sd" should be used in order to set log-format for the structured-data part in RFC5424 formatted syslog messages. Example: log-format-sd [exampleSDID@1234\ bytes=\"%B\"\ status=\"%ST\"]	2015-09-28 14:01:27 +02:00
Dragan Dosen	1322d09a6f	MEDIUM: logs: add support for RFC5424 header format per logger The function __send_log() iterates over senders and passes the header as the first vector to sendmsg(), thus it can send a logger-specific header in each message. A new logger arguments "format rfc5424" should be used in order to enable RFC5424 header format. For example: log 10.2.3.4:1234 len 2048 format rfc5424 local2 info	2015-09-28 14:01:27 +02:00
Dragan Dosen	68d2e3a742	MEDIUM: logs: remove the hostname, tag and pid part from the logheader At the moment we have to call snprintf() for every log line just to rebuild a constant. Thanks to sendmsg(), we send the message in 3 parts: time-based header, proxy-specific hostname+log-tag+pid, session-specific message.	2015-09-28 14:01:27 +02:00
Dragan Dosen	59cee973cd	MEDIUM: log: use a separate buffer for the header and for the message Make sendmsg() use two vectors, one for the message header that is updated by update_log_hdr() and one for the message buffer.	2015-09-28 14:01:27 +02:00
Dragan Dosen	609ac2ab6c	MEDIUM: log: replace sendto() with sendmsg() in __send_log() This patch replaces sendto() with sendmsg() in __send_log() and makes use of an iovec to send the log message.	2015-09-28 14:01:27 +02:00
Thierry FOURNIER	136f9d34a9	MINOR: samples: rename union from "data" to "u" The union name "data" is a little bit heavy while we read the source code because we can read "data.data.sint". The rename from "data" to "u" makes the read easiest like "data.u.sint".	2015-08-20 17:13:46 +02:00

1 2 3 4 5 ...

308 Commits