haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-08 08:07:10 +02:00

Author	SHA1	Message	Date
Dragan Dosen	1e3b16f74f	MINOR: log-format: allow to preserve spacing in log format strings Now it's possible to preserve spacing everywhere except in "log-format", "log-format-sd" and "unique-id-format" directives, where spaces are delimiters and are merged. That may be useful when the response payload is specified as a log format string by "lf-file" or "lf-string", or even for headers or anything else. In order to merge spaces, a new option LOG_OPT_MERGE_SPACES is applied exclusively on options passed to function parse_logformat_string(). This patch fixes an issue #701 ("http-request return log-format file evaluation altering spacing of ASCII output/art").	2020-07-02 10:11:44 +02:00
Dragan Dosen	2866acfb23	BUG/MEDIUM: log-format: fix possible endless loop in parse_logformat_string() This patch adds a missing break to end the loop in case when '%[' is not properly closed with ']'. The issue has been introduced with commit `cd0d2ed` ("MEDIUM: log-format: make the LF parser aware of sample expressions' end").	2020-07-01 06:30:50 +02:00
Willy Tarreau	b2551057af	CLEANUP: include: tree-wide alphabetical sort of include files This patch fixes all the leftovers from the include cleanup campaign. There were not that many (~400 entries in ~150 files) but it was definitely worth doing it as it revealed a few duplicates.	2020-06-11 10:18:59 +02:00
Willy Tarreau	dfd3de8826	REORG: include: move stream.h to haproxy/stream{,-t}.h This one was not easy because it was embarking many includes with it, which other files would automatically find. At least global.h, arg.h and tools.h were identified. 93 total locations were identified, 8 additional includes had to be added. In the rare files where it was possible to finalize the sorting of includes by adjusting only one or two extra lines, it was done. But all files would need to be rechecked and cleaned up now. It was the last set of files in types/ and proto/ and these directories must not be reused anymore.	2020-06-11 10:18:58 +02:00
Willy Tarreau	aeed4a85d6	REORG: include: move log.h to haproxy/log{,-t}.h The current state of the logging is a real mess. The main problem is that almost all files include log.h just in order to have access to the alert/warning functions like ha_alert() etc, and don't care about logs. But log.h also deals with real logging as well as log-format and depends on stream.h and various other things. As such it forces a few heavy files like stream.h to be loaded early and to hide missing dependencies depending where it's loaded. Among the missing ones is syslog.h which was often automatically included resulting in no less than 3 users missing it. Among 76 users, only 5 could be removed, and probably 70 don't need the full set of dependencies. A good approach would consist in splitting that file in 3 parts: - one for error output ("errors" ?). - one for log_format processing - and one for actual logging.	2020-06-11 10:18:58 +02:00
Willy Tarreau	5e539c9b8d	REORG: include: move stream_interface.h to haproxy/stream_interface{,-t}.h Almost no changes, removed stdlib and added buf-t and connection-t to the types to avoid a warning.	2020-06-11 10:18:58 +02:00
Willy Tarreau	209108dbbd	REORG: include: move ssl_sock.h to haproxy/ssl_sock{,-t}.h Almost nothing changed, just moved a static inline at the end and moved an export from the types to the main file.	2020-06-11 10:18:58 +02:00
Willy Tarreau	83487a833c	REORG: include: move cli.h to haproxy/cli{,-t}.h Almost no change except moving the cli_kw struct definition after the defines. Almost all users had both types&proto included, which is not surprizing since this code is old and it used to be the norm a decade ago. These places were cleaned.	2020-06-11 10:18:58 +02:00
Willy Tarreau	3f0f82e7a9	REORG: move applet.h to haproxy/applet{,-t}.h The type file was slightly tidied. The cli-specific APPCTX_CLI_ST1_* flag definitions were moved to cli.h. The type file was adjusted to include buf-t.h and not the huge buf.h. A few call places were fixed because they did not need this include.	2020-06-11 10:18:58 +02:00
Willy Tarreau	f268ee8795	REORG: include: split global.h into haproxy/global{,-t}.h global.h was one of the messiest files, it has accumulated tons of implicit dependencies and declares many globals that make almost all other file include it. It managed to silence a dependency loop between server.h and proxy.h by being well placed to pre-define the required structs, forcing struct proxy and struct server to be forward-declared in a significant number of files. It was split in to, one which is the global struct definition and the few macros and flags, and the rest containing the functions prototypes. The UNIX_MAX_PATH definition was moved to compat.h.	2020-06-11 10:18:58 +02:00
Willy Tarreau	e6ce10be85	REORG: include: move sample.h to haproxy/sample{,-t}.h This one is particularly tricky to move because everyone uses it and it depends on a lot of other types. For example it cannot include arg-t.h and must absolutely only rely on forward declarations to avoid dependency loops between vars -> sample_data -> arg. In order to address this one, it would be nice to split the sample_data part out of sample.h.	2020-06-11 10:18:58 +02:00
Willy Tarreau	762d7a5117	REORG: include: move frontend.h to haproxy/frontend.h There was no type file for this one, it only contains frontend_accept().	2020-06-11 10:18:57 +02:00
Willy Tarreau	ba2f73d40e	REORG: include: move sink.h to haproxy/sink{,-t}.h The sink files could be moved with almost no change at since they didn't rely on anything fancy. ssize_t required sys/types.h and thread.h was needed for the locks.	2020-06-11 10:18:57 +02:00
Willy Tarreau	d2ad57c352	REORG: include: move ring to haproxy/ring{,-t}.h Some includes were wrong in the type definition but beyond this no change was needed.	2020-06-11 10:18:57 +02:00
Willy Tarreau	0f6ffd652e	REORG: include: move fd.h to haproxy/fd{,-t}.h A few includes were missing in each file. A definition of struct polled_mask was moved to fd-t.h. The MAX_POLLERS macro was moved to defaults.h Stdio used to be silently inherited from whatever path but it's needed for list_pollers() which takes a FILE* and which can thus not be forward-declared.	2020-06-11 10:18:57 +02:00
Willy Tarreau	48fbcae07c	REORG: tools: split common/standard.h into haproxy/tools{,-t}.h And also rename standard.c to tools.c. The original split between tools.h and standard.h dates from version 1.3-dev and was mostly an accident. This patch moves the files back to what they were expected to be, and takes care of not changing anything else. However this time tools.h was split between functions and types, because it contains a small number of commonly used macros and structures (e.g. name_desc) which in turn cause the massive list of includes of tools.h to conflict with the callers. They remain the ugliest files of the whole project and definitely need to be cleaned and split apart. A few types are defined there only for functions provided there, and some parts are even OS-specific and should move somewhere else, such as the symbol resolution code.	2020-06-11 10:18:57 +02:00
Willy Tarreau	cd72d8c981	REORG: include: split common/http.h into haproxy/http{,-t}.h So the enums and structs were placed into http-t.h and the functions into http.h. This revealed that several files were dependeng on http.h but not including it, as it was silently inherited via other files.	2020-06-11 10:18:57 +02:00
Willy Tarreau	92b4f1372e	REORG: include: move time.h from common/ to haproxy/ This one is included almost everywhere and used to rely on a few other .h that are not needed (unistd, stdlib, standard.h). It could possibly make sense to split it into multiple parts to distinguish operations performed on timers and the internal time accounting, but at this point it does not appear much important.	2020-06-11 10:18:56 +02:00
Willy Tarreau	d678805783	REORG: include: move version.h to haproxy/ Few files were affected. The release scripts was updated.	2020-06-11 10:18:56 +02:00
Willy Tarreau	4c7e4b7738	REORG: include: update all files to use haproxy/api.h or api-t.h if needed All files that were including one of the following include files have been updated to only include haproxy/api.h or haproxy/api-t.h once instead: - common/config.h - common/compat.h - common/compiler.h - common/defaults.h - common/initcall.h - common/tools.h The choice is simple: if the file only requires type definitions, it includes api-t.h, otherwise it includes the full api.h. In addition, in these files, explicit includes for inttypes.h and limits.h were dropped since these are now covered by api.h and api-t.h. No other change was performed, given that this patch is large and affects 201 files. At least one (tools.h) was already freestanding and didn't get the new one added.	2020-06-11 10:18:42 +02:00
Emeric Brun	fa9d780119	BUG/MEDIUM: logs: fix trailing zeros on log message. This patch removes all trailing LFs and Zeros from log messages. Previously only the last LF was removed. It's a regression from e8ea0ae6f6 "BUG/MINOR: logs: prevent double line returns in some events." This should fix github issue #654	2020-05-28 15:30:51 +02:00
Emeric Brun	99c453df9d	MEDIUM: ring: new section ring to declare custom ring buffers. It is possible to globally declare ring-buffers, to be used as target for log servers or traces. ring <ringname> Creates a new ring-buffer with name <ringname>. description <text> The descritpition is an optional description string of the ring. It will appear on CLI. By default, <name> is reused to fill this field. format <format> Format used to store events into the ring buffer. Arguments: <format> is the log format used when generating syslog messages. It may be one of the following : iso A message containing only the ISO date, followed by the text. The PID, process name and system name are omitted. This is designed to be used with a local log server. raw A message containing only the text. The level, PID, date, time, process name and system name are omitted. This is designed to be used in containers or during development, where the severity only depends on the file descriptor used (stdout/stderr). This is the default. rfc3164 The RFC3164 syslog message format. This is the default. (https://tools.ietf.org/html/rfc3164) rfc5424 The RFC5424 syslog message format. (https://tools.ietf.org/html/rfc5424) short A message containing only a level between angle brackets such as '<3>', followed by the text. The PID, date, time, process name and system name are omitted. This is designed to be used with a local log server. This format is compatible with what the systemd logger consumes. timed A message containing only a level between angle brackets such as '<3>', followed by ISO date and by the text. The PID, process name and system name are omitted. This is designed to be used with a local log server. maxlen <length> The maximum length of an event message stored into the ring, including formatted header. If an event message is longer than <length>, it will be truncated to this length. size <size> This is the optional size in bytes for the ring-buffer. Default value is set to BUFSIZE. Example: global log ring@myring local7 ring myring description "My local buffer" format rfc3164 maxlen 1200 Note: ring names are resolved during post configuration processing.	2020-05-26 08:03:15 +02:00
Christopher Faulet	3b967c1210	MINOR: http-htx/proxy: Add http-error directive using http return syntax The http-error directive can now be used instead of errorfile to define an error message in a proxy section (including default sections). This directive uses the same syntax that http return rules. The only real difference is the limitation on status code that may be specified. Only status codes supported by errorfile directives are supported for this new directive. Parsing of errorfile directive remains independent from http-error parsing. But functionally, it may be expressed in terms of http-errors : errorfile <status> <file> ==> http-errror status <status> errorfile <file>	2020-05-20 18:27:14 +02:00
Emeric Brun	e709e1e777	MEDIUM: logs: buffer targets now rely on new sink_write Before this path, they rely directly on ring_write bypassing a part of the sink API. Now the maxlen parameter of the log will apply only on the text message part (and not the header, for this you woud prefer to use the maxlen parameter on the sink/ring). sink_write prototype was also reviewed to return the number of Bytes written to be compliant with the other write functions.	2020-05-19 11:04:11 +02:00
Emeric Brun	bd163817ed	MEDIUM: sink: build header in sink_write for log formats This patch extends the sink_write prototype and code to handle the rfc5424 and rfc3164 header. It uses header building tools from log.c. Doing this some functions/vars have been externalized. facility and minlevel have been removed from the struct sink and passed to args at sink_write because they depends of the log and not of the sink (they remained unused by rest of the code until now).	2020-05-19 11:04:11 +02:00
Emeric Brun	9e8ea0ae6f	BUG/MINOR: logs: prevent double line returns in some events. Historically some messages used to already contain the trailing LF but not all, and __do_send_log adds a new one in needed cases. It also does trim a trailing LF in certain cases while computing the max message length, as a result of subtracting 1 to the available room in the destination buffer. But the way it's done is wrong since some messages still contain it. So the code was fixed to always trim the trailing LF from messages if present, and then only subtract 1 from the destination buffer room instead of the size.. Note: new sink API is not designed to receive a trailing LF on event messages This could be backported to relevant stable versions with particular care since the logic of the code changed a bit since 1.6 and there may be other locations that need to be adjusted.	2020-05-19 10:59:53 +02:00
Damien Claisse	57c8eb939d	MINOR: log: Add "Tu" timer It can be sometimes useful to measure total time of a request as seen from an end user, including TCP/TLS negotiation, server response time and transfer time. "Tt" currently provides something close to that, but it also takes client idle time into account, which is problematic for keep-alive requests as idle time can be very long. "Ta" is also not sufficient as it hides TCP/TLS negotiationtime. To improve that, introduce a "Tu" timer, without idle time and everything else. It roughly estimates time spent time spent from user point of view (without DNS resolution time), assuming network latency is the same in both directions.	2020-04-28 16:30:13 +02:00
Christopher Faulet	d2236cdcc4	MINOR: log: Don't systematically set LW_REQ when a sample expr is added When a log-format string is parsed, if a sample fetch is found, the flag LW_REQ is systematically added on the proxy. Unfortunately, this produce a warning during HAProxy start-up when a log-format string is used for a tcp-check send rule. Now this flag is only added if the parsed sample fetch depends on HTTP information.	2020-04-27 09:39:37 +02:00
Christopher Faulet	5f940703b3	MINOR: log: Don't depends on a stream to process samples in log-format string When a log-format string is evaluated, there is no reason to process sample fetches only when a stream is defined. Several sample fetches are available outside the stream scope. All others should handle calls without stream. This patch is mandatory to support log-format string in tcp-check rules.	2020-04-27 09:39:37 +02:00
Ilya Shipitsin	ae40dbc93c	CLEANUP: log: fix comment of parse_logformat_string() "fmt" is passed to parse_logformat_string, adjust comment accordingly	2020-04-21 10:52:25 +02:00
Ilya Shipitsin	856aabcda5	CLEANUP: assorted typo fixes in the code and comments This is 8th iteration of typo fixes	2020-04-17 09:37:36 +02:00
Willy Tarreau	bb86986253	MINOR: init: report the haproxy version and executable path once on errors If haproxy fails to start and emits an alert, then it can be useful to have it also emit the version and the path used to load it. Some users may be mistakenly launching the wrong binary due to a misconfigured PATH variable and this will save them some troubleshooting time when it reports that some keywords are not understood. What we do here is that we try to extract the binary name from the AUX vector on glibc, and we report this as a NOTICE tag before the very first alert is emitted.	2020-04-16 10:52:41 +02:00
Willy Tarreau	bebd212064	MINOR: init: report in "haproxy -c" whether there were warnings or not This helps quickly checking if the config produces any warning. For this we reuse the "warned" bit field to add a new WARN_ANY bit that is set by ha_warning(). The rest of the bit field was also cleaned from unused bits.	2020-04-15 16:42:00 +02:00
Tim Duesterhus	cf6e0c8a83	MEDIUM: proxy_protocol: Support sending unique IDs using PPv2 This patch adds the `unique-id` option to `proxy-v2-options`. If this option is set a unique ID will be generated based on the `unique-id-format` while sending the proxy protocol v2 header and stored as the unique id for the first stream of the connection. This feature is meant to be used in `tcp` mode. It works on HTTP mode, but might result in inconsistent unique IDs for the first request on a keep-alive connection, because the unique ID for the first stream is generated earlier than the others. Now that we can send unique IDs in `tcp` mode the `%ID` log variable is made available in TCP mode.	2020-03-13 17:26:43 +01:00
Tim Duesterhus	a17e66289c	MEDIUM: stream: Make the `unique_id` member of `struct stream` a `struct ist` The `unique_id` member of `struct stream` now is a `struct ist`.	2020-03-05 20:21:58 +01:00
Tim Duesterhus	2825b4b0ca	MINOR: stream: Use stream_generate_unique_id This patch replaces the ad-hoc generation of stream's `unique_id` values by calls to `stream_generate_unique_id`.	2020-03-05 07:23:00 +01:00
Willy Tarreau	908071171b	BUILD: general: always pass unsigned chars to is* functions The isalnum(), isalpha(), isdigit() etc functions from ctype.h are supposed to take an int in argument which must either reflect an unsigned char or EOF. In practice on some platforms they're implemented as macros referencing an array, and when passed a char, they either cause a warning "array subscript has type 'char'" when lucky, or cause random segfaults when unlucky. It's quite unconvenient by the way since none of them may return true for negative values. The recent introduction of cygwin to the list of regularly tested build platforms revealed a lot of breakage there due to the same issues again. So this patch addresses the problem all over the code at once. It adds unsigned char casts to every valid use case, and also drops the unneeded double cast to int that was sometimes added on top of it. It may be backported by dropping irrelevant changes if that helps better support uncommon platforms. It's unlikely to fix bugs on platforms which would already not emit any warning though.	2020-02-25 08:16:33 +01:00
Willy Tarreau	cd0d2ed6ee	MEDIUM: log-format: make the LF parser aware of sample expressions' end For a very long time it used to be impossible to pass a closing square bracket as a valid character in argument to a sample fetch function or to a converter because the LF parser used to stop on the first such character found and to pass what was between the first '[' and the first ']' to sample_parse_expr(). This patch addresses this by passing the whole string to sample_parse_expr() which is the only one authoritative to indicate the first character that does not belong to the expression. The LF parser then verifies it matches a ']' or fails. As a result it is finally possible to write rules such as the following, which is totally valid an unambigous : http-request redirect location %[url,regsub([.:/?-],!,g)] \|-----\| \| \| arg1 \| `---> arg3 `-----> arg2 \|-----------------\| converter \|---------------------\| sample expression \|------------------------\| log-format tag	2020-02-14 19:02:06 +01:00
Willy Tarreau	e3b57bf92f	MINOR: sample: make sample_parse_expr() able to return an end pointer When an end pointer is passed, instead of complaining that a comma is missing after a keyword, sample_parse_expr() will silently return the pointer to the current location into this return pointer so that the caller can continue its parsing. This will be used by more complex expressions which embed sample expressions, and may even permit to embed sample expressions into arguments of other expressions.	2020-02-14 19:02:06 +01:00
Willy Tarreau	51013e82d4	BUG/MINOR: log: fix minor resource leaks on logformat error path As reported by Ilya in issue #392, Coverity found that we're leaking allocated strings on error paths in parse_logformat(). Let's use a proper exit label for failures instead of seeding return 0 everywhere. This should be backported to all supported versions.	2019-12-11 12:05:39 +01:00
Willy Tarreau	869efd5eeb	BUG/MINOR: log: make "show startup-log" use a ring buffer instead The copy of the startup logs used to rely on a re-allocated memory area on the fly, that would attempt to be delivered at once over the CLI. But if it's too large (too many warnings) it will take time to start up, and may not even show up on the CLI as it doesn't fit in a buffer. The ring buffer infrastructure solves all this with no more code, let's switch to this instead. It simply requires a parsing function to attach the ring via ring_attach_cli() and all the rest is automatically handled. Initially this was imagined as a code cleanup, until a test with a config involving 100k backends and just one occurrence of "load-server-state-from-file global" in the defaults section took approx 20 minutes to parse due to the O(N^2) cost of concatenating the warnings resulting in ~1 TB of data to be copied, while it took only 0.57s with the ring. Ideally this patch should be backported to 2.0 and 1.9, though it relies on the ring infrastructure which will then also need to be backported. Configs able to trigger the bug are uncommon, so another workaround for older versions without backporting the rings would consist in simply limiting the size of the error message in print_message() to something always printable, which will only return the first errors.	2019-11-15 15:50:16 +01:00
Christopher Faulet	5c6fefc8eb	MINOR: log: Provide a function to emit a log for an application Application is a generic term here. It is a modules which handle its own log server list, with no dependency on a proxy. Such applications can now call the function app_log() to log messages, passing a log server list and a tag as parameters. Internally, the function __send_log() has been adapted accordingly.	2019-09-17 10:18:54 +02:00
Willy Tarreau	c046d167e4	MEDIUM: log: add support for logging to a ring buffer Now by prefixing a log server with "ring@<name>" it's possible to send the logs to a ring buffer. One nice thing is that it allows multiple sessions to consult the logs in real time in parallel over the CLI, and without requiring file system access. At the moment, ring0 is created as a default sink for tracing purposes and is available. No option is provided to create new rings though this is trivial to add to the global section.	2019-08-30 15:24:59 +02:00
Willy Tarreau	f3dc30f6de	MINOR: log: add a target type instead of hacking the address family Instead of detecting an AF_UNSPEC address family for a log server and to deduce a file descriptor, let's create a target type field and explicitly mention that the socket is of type FD.	2019-08-30 15:07:25 +02:00
Willy Tarreau	d52a7f8c8d	MEDIUM: log: use the new generic fd_write_frag_line() function When logging to a file descriptor, we'd rather use the unified fd_write_frag_line() which uses the FD's lock than perform the writev() ourselves and use a per-server lock, because if several loggers point to the same output (e.g. stdout) they are still not locked and their logs may interleave. The function above instead relies on the fd's lock so this is safer and will even protect against concurrent accesses from other areas (e.g traces). The function also deals with the FD's non-blocking mode so we do not have to keep specific code for this anymore in the logs.	2019-08-30 15:07:25 +02:00
Willy Tarreau	7e9776ad7b	MINOR: fd/log/sink: make the non-blocking initialization depend on the initialized bit Logs and sinks were resorting to dirty hacks to initialize an FD to non-blocking mode. Now we have a bit for this in the fd tab so we can do it on the fly on first use of the file descriptor. Previously it was set per log server by writing value 1 to the port, or during a sink initialization regardless of the usage of the fd.	2019-08-30 15:07:25 +02:00
Willy Tarreau	9fbcb7e2e9	BUG/MINOR: log: make sure writev() is not interrupted on a file output Since 1.9 we support sending logs to various non-blocking outputs like stdou/stderr or flies, by using writev() which guarantees that it only returns after having written everything or nothing. However the syscall may be interrupted while doing so, and this is visible when writing to a tty during debug sessions, as some logs occasionally appear interleaved if an xterm or SSH connection is not very fast. Performance here is not a critical concern, log correctness is. Let's simply take the logger's lock around the writev() call to prevent multiple senders from stepping onto each other's toes. This may be backported to 2.0 and 1.9.	2019-07-26 15:46:18 +02:00
Willy Tarreau	6c6365f455	MINOR: log: use conn->{src,dst} instead of conn->addr.{from,to} This is used to retrieve the addresses to be logged (client, frontend, backend, server). In all places the validity check was already performed.	2019-07-19 13:50:09 +02:00
Willy Tarreau	8fa9984a17	MINOR: log: use conn_get_{dst,src}() to retrieve the cli/frt/bck/srv/ addresses This also allows us to check that the operation succeeded without logging whatever remained in the memory area in case of failure.	2019-07-19 13:50:09 +02:00
Christopher Faulet	711ed6ae4a	MAJOR: http: Remove the HTTP legacy code First of all, all legacy HTTP analyzers and all functions exclusively used by them were removed. So the most of the functions in proto_http.{c,h} were removed. Only functions to deal with the HTTP transaction have been kept. Then, http_msg and hdr_idx modules were entirely removed. And finally the structure http_msg was lightened of all its useless information about the legacy HTTP. The structure hdr_ctx was also removed because unused now, just like unused states in the enum h1_state. Note that the memory pool "hdr_idx" was removed and "http_txn" is now smaller.	2019-07-19 09:24:12 +02:00

1 2 3 4 5 ...

308 Commits