6359 Commits

Author SHA1 Message Date
Willy Tarreau
7314be8e2c MINOR: h1: make h1_measure_trailers() take the byte count in argument
The principle is that it should not have to take this value from the
buffer itself anymore.
2018-07-19 16:23:40 +02:00
Willy Tarreau
188e230704 MINOR: buffer: convert most b_ptr() calls to c_ptr()
The latter uses the channel wherever a channel is known.
2018-07-19 16:23:40 +02:00
Willy Tarreau
e5f12ce7f2 MINOR: buffer: replace bi_del() and bo_del() with b_del()
Till now the callers had to know which one to call for specific use cases.
Let's fuse them now since a single one will remain after the API migration.
Given that bi_del() may only be used where o==0, just combine the two tests
by first removing output data then only input.
2018-07-19 16:23:40 +02:00
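
For illustration, a minimal C sketch of the fused deletion described above,
assuming the pre-migration counters ->o (output) and ->i (input) and leaving
out the head/pointer bookkeeping of the real b_del():

    #include <stddef.h>

    /* Stand-in for the pre-migration buffer: only the two counters that
     * matter here; the rest of the real struct buffer is omitted. */
    struct buf_counts {
        size_t o;   /* output (scheduled) bytes */
        size_t i;   /* input (pending) bytes    */
    };

    /* Fused deletion: consume output data first, then only input data. */
    static void del_bytes(struct buf_counts *b, size_t del)
    {
        size_t out = del < b->o ? del : b->o;

        b->o -= out;
        b->i -= del - out;
    }
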
Willy Tarreau
a1f78fb652 MINOR: buffer: replace bo_getblk_nc() with b_getblk_nc() which takes an offset
This will be important so that we can parse a buffer without touching it.
Now we indicate the offset from the buffer's head at which we plan to start
copying, and for how many bytes. This will be used by send functions to loop at the end
of the buffer without having to update the buffer's output byte count.
2018-07-19 16:23:40 +02:00
Willy Tarreau
90ed3836db MINOR: buffer: replace bo_getblk() with direction agnostic b_getblk()
This new function limits itself to the amount of data available in the
buffer and doesn't care about the direction anymore. It's only called
from co_getblk() which already checks that no more than the available
output bytes is requested.
2018-07-19 16:23:40 +02:00
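
As an illustration of the offset-based, direction-agnostic reads described in
the two commits above, here is a self-contained sketch of a wrapping block
copy; the struct and names are simplified stand-ins, not the real struct
buffer nor the real b_getblk()/b_getblk_nc():

    #include <stddef.h>
    #include <string.h>

    /* Simplified circular buffer: head is the index of the oldest byte,
     * data the number of bytes present. */
    struct ring {
        char   *area;
        size_t  size;
        size_t  head;
        size_t  data;
    };

    /* Copy up to <len> bytes starting <offset> bytes after the head,
     * limited to what is actually present, wrapping at the end of the
     * storage area. Returns the number of bytes copied. */
    static size_t ring_getblk(const struct ring *r, char *blk,
                              size_t len, size_t offset)
    {
        size_t pos, chunk;

        if (offset >= r->data)
            return 0;
        if (len > r->data - offset)
            len = r->data - offset;

        pos   = (r->head + offset) % r->size;
        chunk = r->size - pos;
        if (chunk > len)
            chunk = len;

        memcpy(blk, r->area + pos, chunk);
        memcpy(blk + chunk, r->area, len - chunk);
        return len;
    }
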
Willy Tarreau
e4d5a036ed MINOR: buffer: merge b{i,o}_contig_space()
These ones were merged into a single b_contig_space() that covers both
(the bo_ case was a simplified version of the other one). The function
doesn't use ->i nor ->o anymore.
2018-07-19 16:23:40 +02:00
Willy Tarreau
0e11d59af6 MINOR: buffer: remove bo_contig_data()
The two call places now make use of b_contig_data(0) and check by
themselves that the returned size is no larger than the scheduled
output data.
2018-07-19 16:23:40 +02:00
Willy Tarreau
8f9c72d301 MINOR: buffer: remove bi_end()
It was replaced by ci_tail() when the channel is known, or b_tail() in
other cases.
2018-07-19 16:23:40 +02:00
Willy Tarreau
41e38ac0ee MINOR: buffer: remove bo_end()
It was replaced by either b_tail() when the buffer has no input data, or
b_peek(b, b->o).
2018-07-19 16:23:40 +02:00
Willy Tarreau
89faf5d7c3 MINOR: buffer: remove bo_ptr()
It was replaced by co_head() when a channel was known, otherwise b_head().
2018-07-19 16:23:40 +02:00
Willy Tarreau
dda2e41881 MINOR: buffer: remove bi_ptr()
It's now been replaced by b_head() when b->o is null, ci_head() when
the channel is known, or b_peek(b, b->o) in other situations.
2018-07-19 16:23:40 +02:00
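
The replacements listed in the last few commits all reduce to addressing bytes
relative to the buffer's head. A tiny sketch of that addressing, with
illustrative parameter names rather than the real API:

    #include <stddef.h>

    /* Return a pointer to the byte located <ofs> bytes after the head of a
     * circular buffer, wrapping once at the end of the storage area
     * (<ofs> is assumed to be smaller than <size>). */
    static char *peek(char *area, size_t size, size_t head, size_t ofs)
    {
        size_t pos = head + ofs;

        if (pos >= size)
            pos -= size;
        return area + pos;
    }

    /* With this picture, the removed helpers map to:
     *   bo_ptr(b) ~ peek(area, size, head, 0)    (start of output data)
     *   bi_ptr(b) ~ peek(area, size, head, b->o) (start of input data)  */
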
Willy Tarreau
7194d3cc3b MINOR: buffer: split bi_contig_data() into ci_contig_data() and b_contig_data()
This function was sometimes used from a channel and sometimes from a buffer.
In both cases it requires knowledge of the size of the output data (to skip
them). Here the split ensures the channel can deal with this point, and that
other places not having output data can continue to work.
2018-07-19 16:23:40 +02:00
Willy Tarreau
aa7af7213d MINOR: buffer: replace calls to buffer_space_wraps() with b_space_wraps()
And remove the unused function.
2018-07-19 16:23:40 +02:00
Willy Tarreau
bcbd39370f MINOR: channel/buffer: replace b_{adv,rew} with c_{adv,rew}
These ones manipulate the output data count which will be specific to
the channel soon, so prepare the call points to use the channel only.
The b_* functions are now unused and were removed.
2018-07-19 16:23:40 +02:00
Willy Tarreau
c0a51c51b1 MINOR: buffer: remove buffer_slow_realign() and the swap_buffer allocation code
Since all call places can use the trash now, this is not needed anymore.
2018-07-19 16:23:40 +02:00
Willy Tarreau
0db4d10efc MINOR: h2: use b_slow_realign() with the trash as a swap buffer
H2 doesn't use the trash so it can make use of it as a swap area when
calling b_slow_realign(). This way we don't need buffer_slow_realign()
anymore.
2018-07-19 16:23:40 +02:00
Willy Tarreau
fd8d42f496 MEDIUM: channel: make channel_slow_realign() take a swap buffer
The few call places where it's used can use the trash as a swap buffer,
which is made for this exact purpose. This way we can rely on the
generic b_slow_realign() call.
2018-07-19 16:23:40 +02:00
Willy Tarreau
4cf1300e6a MINOR: channel/buffer: replace buffer_slow_realign() with channel_slow_realign() and b_slow_realign()
Where relevant, the channel version is used instead. The buffer version
was ported to be more generic and now takes a swap buffer and the output
byte count to know where to set the alignment point. The H2 mux still
uses buffer_slow_realign() with buf->o but it will change later.
2018-07-19 16:23:40 +02:00
Willy Tarreau
d5b343bf9e MINOR: channel/buffer: use c_realign_if_empty() instead of buffer_realign()
This patch removes buffer_realign() and replaces it with c_realign_if_empty()
instead.
2018-07-19 16:23:40 +02:00
Willy Tarreau
4d452384a3 MINOR: compression: pass the channel to http_compression_buffer_end()
This will be needed to access the output data count from the channel
after the buffer/channel changes.
2018-07-19 16:23:39 +02:00
Willy Tarreau
506a29ac6e MINOR: buffer: switch buffer sizes and offsets to size_t
Passing unsigned ints everywhere is painful, and will cause some headache
later when we'll want to integrate better with struct ist which already
uses size_t. Let's switch buffers to use size_t instead.
2018-07-19 16:23:39 +02:00
Willy Tarreau
42d55b9b6a BUG/MEDIUM: h2: make sure the last stream closes the connection after a timeout
If a timeout strikes on the connection side with some active streams,
there is a corner case which can sometimes cause the following sequence
to happen:

  - There are active streams but there are data in the mux buffer
    (eg: a client suddenly disconnected during a download with pending
    requests). The timeout is active.

  - The timeout strikes, h2_timeout_task() is called, kills the task and
    doesn't close the connection since there are streams left; the
    connection is marked in H2_CS_ERROR;

  - the streams are woken up and closed;

  - when the last stream closes, calling h2_detach(), it sees the
    tree list is empty, but there is no condition allowing the
    connection to be closed (mbuf->o > 0), thus it does nothing;

  - since the task is dead, there's no more hope to clear this
    situation later

For now we can take care of this by adding a test for the presence of
H2_CS_ERROR and !task, implying the timeout task has already triggered
and will not be able to handle this again.

Over the long term it seems like a more reliable test should be made,
so that it is possible to know whether or not someone is still able to
close this connection.

A big thanks to Janusz Dziemidowicz and Milan Petruzelka for providing
many details which helped in figuring out this bug.
2018-07-19 14:31:47 +02:00
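
To make the fix easier to follow, here is a toy model of the closing decision;
the field and function names are illustrative, not the actual mux code:

    #include <stdbool.h>
    #include <stddef.h>

    struct conn_state {
        bool   in_error;     /* connection flagged H2_CS_ERROR        */
        bool   task_alive;   /* the timeout task still exists         */
        int    nb_streams;   /* streams still attached                */
        size_t mbuf_out;     /* bytes still pending in the mux buffer */
    };

    /* Before the fix, a connection with no streams left was only released
     * once the mux buffer had drained; with a dead task and pending output
     * this never happened. The fix adds the (in_error && !task_alive)
     * escape hatch. */
    static bool must_release(const struct conn_state *c)
    {
        if (c->nb_streams)
            return false;
        if (c->mbuf_out == 0)
            return true;                        /* old condition */
        return c->in_error && !c->task_alive;   /* new condition */
    }
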
Willy Tarreau
00610960a1 BUG/MEDIUM: h2: never leave pending data in the output buffer on close
We currently don't process trailers on H2, but this has an impact: on
chunked HTTP/1 responses, we decide to emit the ES bit once we see the
0CRLF. From this point the stream switches to the CLOSED state, which
aborts processing of the remaining bytes. Thus the extra CRLF which ends
trailers is not processed and remains in the buffer. This prevents the
stream from being notified about end of transmission, which in turn keeps
the mux busy and prevents the connection from quitting.

The case of the trailers is not the root cause of this issue, though it
is what triggers it. The root cause is that upon error and/or close, once
we know we're not going to process any more data, we must absolutely flush
any remaining bytes from the output buffer, otherwise there is no way the
stream can quit. This is what this patch does.

It looks very likely related to the issues reported and debugged by
Janusz Dziemidowicz and Milan Petruzelka.

One way to reproduce it is to chain two proxies with the last one emitting
chunked data (typically using the stats page):

    global
        stats socket /tmp/sock1 mode 666 level admin
        stats timeout 1h
        tune.ssl.default-dh-param 1024
        tune.bufsize 16384

    defaults
        mode http
        timeout connect 4s
        timeout client 10s
        timeout server 20s

    listen px1
        bind :4443 ssl crt rsa+dh2048.pem npn h2 alpn h2
        server s1 127.0.0.1:4445

    listen px2
        bind :4444 ssl crt rsa+dh2048.pem npn h2 alpn h2
        bind :4445
        stats uri /

Then use curl to fetch the stats through px1:

    curl --http2 -k "https://127.0.0.1:4443/"

When curl is sent to the first one, "show sess" issued to the CLI will
show a remaining session during the client timeout. When curl is aimed at
port 4444 (px2), there is no such remaining session.

This fix needs to be backported to 1.8.
2018-07-19 11:09:12 +02:00
Willy Tarreau
c65edac804 MINOR: h2: add the mux and demux buffer lengths on "show fd"
It is convenient during debugging sessions to know if the mux and demux
buffers are empty/full/other. Let's report this on "show fd" output.
2018-07-19 10:54:43 +02:00
Willy Tarreau
f210191dcd BUG/MEDIUM: h2: don't accept new streams if conn_streams are still in excess
The stream bookkeeping done in H2 is used for protocol compliance only,
but it doesn't consider the number of conn_streams still attached to the
mux. This causes an issue when http-request set-nice rules are applied on
H2 requests processed on a saturated machine. Indeed, in this case, the
requests are accepted and assigned a default nice value of zero. When
they are processed, their nice value changes to a higher one (say 1024).
The response is sent through the H2 mux, which detects the end of stream
and decrements the protocol-level stream count (h2c->nb_streams). The
client may then send a new request. But the conn_stream is still attached
and will require a new call to process_stream() to finish, which is made
through the scheduler. Given that the machine is saturated, it is assumed
that many tasks are present in the scheduler. Thus the closing tasks holding
a higher nice value will pass after the new stream creations. If the client
is fast enough with a low latency link, it may add a lot of new stream
creations before the stream terminations have a chance to disappear due
to their high nice value, resulting in a huge amount of memory being used.

The solution consists in letting a mux always monitor its conn_streams and
refrain from creating new ones when it is full. Here the H2 mux checks the
nb_cs counter and sets a new blocked flag (H2_CF_DEM_TOOMANY) if the limit
was reached, so that the frame parser requests a pause in the new stream
creation, leaving some time for the pending conn_streams to vanish.

Several experiments were made using varying thresholds to see if
overbooking would provide any benefit here but it turned out not to be
the case, so the conn_stream limit remains set to the exact streams
limit. Interestingly various performance measurements showed that the
code tends to be slightly faster now than without the limit, probably
due to the smoother memory usage.

This commit requires previous patch ("MINOR: h2: keep a count of the number
of conn_streams attached to the mux"). It needs to be backported to 1.8.
2018-07-19 10:23:15 +02:00
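
A toy model of the admission rule described above (H2_CF_DEM_TOOMANY is the
flag named in the message; the struct and the rest of the names are
simplified stand-ins for the mux internals):

    #include <stdbool.h>

    struct mux_state {
        unsigned int nb_streams;   /* protocol-level streams (non-CLOSED)  */
        unsigned int nb_cs;        /* conn_streams still attached          */
        unsigned int max_streams;  /* advertised concurrency limit         */
        bool         dem_toomany;  /* demux blocked: too many conn_streams */
    };

    /* Called by the frame parser before creating a new stream: refuse and
     * raise the blocking flag when the attached conn_streams already reach
     * the limit, so stream creation pauses until some of them detach. */
    static bool may_create_stream(struct mux_state *m)
    {
        if (m->nb_cs >= m->max_streams) {
            m->dem_toomany = true;
            return false;
        }
        return true;
    }
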
Willy Tarreau
7ac60e836a MINOR: h2: keep a count of the number of conn_streams attached to the mux
The h2 mux only knows about the number of H2 streams which are not in a
CLOSED state. This is used for protocol compliance. But it doesn't hold
the number of really attached streams. It is a problem because depending
on scheduling, it is possible that more streams are attached to the mux
than the ones seen at the protocol level, due to some streams taking some
time to be detached. Let's add this count based on the conn_streams.

Note: this patch is part of a series of fixes which will have to be
backported to 1.8.
2018-07-19 09:06:37 +02:00
Willy Tarreau
17b4aa1adc BUG/MINOR: ssl: properly ref-count the tls_keys entries
Commit 200b0fa ("MEDIUM: Add support for updating TLS ticket keys via
socket") introduced support for updating TLS ticket keys from the CLI,
but missed a small corner case: if multiple bind lines reference the
same tls_keys file, the same reference is used (as expected), but during
the clean shutdown, it will lead to a double free when destroying the
bind_conf contexts since none of the lines knows if others still use
it. The impact is very low however, mostly a core and/or a message in
the system's log upon old process termination.

Let's introduce some basic refcounting to prevent this from happening,
so that only the last bind_conf frees it.

Thanks to Janusz Dziemidowicz and Thierry Fournier for both reporting
the same issue with an easy reproducer.

This fix needs to be backported to versions 1.6 to 1.8.
2018-07-18 08:59:50 +02:00
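
The refcounting amounts to the classic get/put pattern sketched below
(illustrative names, not the real struct tls_keys_ref):

    #include <stdlib.h>

    struct keys_ref {
        int   refcount;   /* number of bind_conf referencing this file */
        void *keys;       /* the shared ticket keys                    */
    };

    /* Each bind line referencing the same tls_keys file takes a reference. */
    static struct keys_ref *keys_ref_get(struct keys_ref *ref)
    {
        ref->refcount++;
        return ref;
    }

    /* Only the last bind_conf to drop its reference actually frees it,
     * avoiding the double free on clean shutdown. */
    static void keys_ref_put(struct keys_ref *ref)
    {
        if (--ref->refcount > 0)
            return;
        free(ref->keys);
        free(ref);
    }
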
Baptiste Assmann
8e2d9430c0 MINOR: dns: new DNS options to allow/prevent IP address duplication
By default, HAProxy's DNS resolution at runtime ensures that there is no
IP address duplication in a backend (for servers being resolved by the
same hostname).
There are a few cases where people want, on purpose, to disable this
feature.

This patch introduces a couple of new server side options for this purpose:
"resolve-opts allow-dup-ip" or "resolve-opts prevent-dup-ip".
2018-07-12 17:56:44 +02:00
Baptiste Assmann
84221b4e90 MINOR: dns: fix wrong score computation in dns_get_ip_from_response
dns_get_ip_from_response() is used to compare the caller's current IP to
the IP available in the records returned by the DNS server.
A scoring system is in place to get the best IP address available.
That said, in the current implementation, there are a couple of issues:
1. a comment does not match what the code does
2. the code does not match what the comment says (the score value is not
   incremented by 2)

This patch fixes both issues.

Backport status: 1.8
2018-07-12 17:56:34 +02:00
Baptiste Assmann
741e00a820 CLEANUP: dns: inaccurate comment about preferred IP score
The comment was about "prefered network ip version" while it's actually
"prefered ip version" in the code.
Fixed

Backport status: 1.7 and 1.8
  Be careful, this patch may not apply on 1.7, since the score was '4'
  for this item at that time.
2018-07-12 17:55:16 +02:00
Baptiste Assmann
e56fffd896 CLEANUP: dns: remove obsolete macro DNS_MAX_IP_REC
Since a8c6db8d2d97629b2734c1d2be0860b6b11e5709, this macro is not used
anymore and can be safely removed.

Backport status: 1.8
2018-07-12 17:55:05 +02:00
William Lallemand
bfd8eb5909 MINOR: startup: change session/process group settings
Change the way the process groups are set. Indeed setsid() was called
for every process, which caused the worker to have a different process
group than the master.

This patch behaves in a better way:

- In daemon mode only, each child does a setsid()
- In master worker + daemon mode, the setsid() is done in the master before
forking the children
- In any foreground mode, we don't do a setsid()

Could be backported in 1.8 but the master-worker mode is mostly used
with systemd, which relies on cgroups, so that won't affect many people.
2018-07-04 19:29:56 +02:00
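
The resulting policy can be summed up by the sketch below (the mode flags are
illustrative stand-ins for the real global options):

    #include <stdbool.h>
    #include <unistd.h>

    /* In plain daemon mode the call happens in each forked child; in
     * master-worker + daemon mode it happens once in the master before
     * the children are forked; foreground modes never call setsid(). */
    static void apply_session_policy(bool daemon_mode, bool master_worker,
                                     bool in_master)
    {
        if (!daemon_mode)
            return;               /* foreground: keep the caller's group */

        if (master_worker) {
            if (in_master)
                setsid();         /* children inherit the master's group */
        } else {
            setsid();             /* each daemonized child gets its own  */
        }
    }
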
Thierry FOURNIER
70d318ccb7 BUG/MEDIUM: lua: possible CLOSE-WAIT state with '\n' headers
The Lua parser doesn't take into account end-of-headers containing
only '\n'. It always expects '\r\n'. If a '\n' is processed, the Lua
parser considers it is missing 1 byte, and waits indefinitely for new data.

When the client reaches its timeout, it closes the connection.
This close is not detected and the connection stays in the CLOSE-WAIT
state.

I guess that this patch fixes only a visible part of the problem.
If the Lua HTTP parser waits for data, the server timeout or the
connection closed by the client may stop the applet.

How to reproduce the problem:

HAProxy conf:

   global
      lua-load bug38.lua
   frontend frt
      timeout client 2s
      timeout server 2s
      mode http
      bind *:8080
      http-request use-service lua.donothing

Lua conf

   core.register_service("donothing", "http", function(applet) end)

Client request:

   echo -ne 'GET / HTTP/1.1\n\n' | nc 127.0.0.1 8080

Look for CLOSE-WAIT in the connection with "netstat" or "ss". I
use this script:

   while sleep 1; do ss | grep CLOSE-WAIT; done

This patch must be backported in 1.6, 1.7 and 1.8

Workaround: enable the "hard-stop-after" directive, and perform
periodic reload.
2018-07-01 06:08:43 +02:00
Willy Tarreau
43e903553e MINOR: stick-tables: make stktable_release() do nothing on NULL
stktable_release() has been involved in two recent crashes by being
used without enough care. Just like any free() function this one is
often called on an exit path with a possibly unsafe argument. Given
that there is another case (smp_fetch_sc_trackers()) which theorically
could call it with an unchecked NULL, though it cannot happen since
the function doesn't support being called with src_* hence cannot make
use of tmpstkctr, let's rather move the check into the function itself
to make it safer for the long term.

This patch could be backported to 1.8 as a strengthening measure.
2018-06-27 06:33:20 +02:00
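
The strengthening boils down to a free()-style NULL guard; a sketch under the
assumption that the function takes the usual (table, entry) pair — the suffix
marks it as illustrative rather than the actual haproxy code:

    struct stktable;
    struct stksess;

    /* Tolerate a NULL entry on exit paths, like free(NULL); otherwise fall
     * through to the normal reference-dropping logic. */
    static void stktable_release_sketch(struct stktable *t, struct stksess *ts)
    {
        if (!ts)
            return;
        /* ... drop the reference taken on the entry ... */
        (void)t;
    }
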
Tim Duesterhus
65189c17c6 BUG/MAJOR: stick_table: Complete incomplete SEGV fix
This commit completes the incomplete segmentation fault fix
in commit ac1f3ed64b58bd178865c6f2cc8f6f306d9e1e15.

Likewise it must be backported to haproxy 1.8.
2018-06-26 20:29:36 +02:00
William Lallemand
091d827e09 BUG/BUILD: threads: unbreak build without threads
The build without threads was once again broken.

This issue was introduced in commit ba86c6c ("MINOR: threads: Be sure to
remove threads from all_threads_mask on exit").

This is exactly the same problem as last time it happened, because of
all_threads_mask not being defined with USE_THREAD=

This must be backported in 1.8
2018-06-26 14:15:12 +02:00
Thierry FOURNIER
ac1f3ed64b BUG/MAJOR: Stick-tables crash with segfault when the key is not in the stick-table
When a lookup is done on a key not present in the stick-table, the "st"
pointer is NULL and it is used to return the converter result, but it
is also passed untested to stktable_release().

This regression was introduced in 1.8.10 here:

   BUG/MEDIUM: stick-tables: Decrement ref_cnt in table_* converters
   commit d7bd88009d88dd413e01bc0baa90d6662a3d7718
   Author: Daniel Corbett <dcorbett@haproxy.com>
   Date:   Sun May 27 09:47:12 2018 -0400

Minimal conf for reproducing the problem:

   frontend test
      mode http
      stick-table type ip size 1m expire 1h store gpc0
      bind *:8080
      http-request redirect location /a if { src,in_table(test) }

The segfault is triggered using:

   curl -i http://127.0.0.1:8080/

This patch must be backported in 1.8
2018-06-26 13:51:46 +02:00
Christopher Faulet
ba86c6c25b MINOR: threads: Be sure to remove threads from all_threads_mask on exit
When HAProxy is started with several threads, each running thread holds a bit in
the bitfield all_threads_mask. This bitfield is used here and there to check
which threads are registered to take part in a specific processing. So when a
thread exits, it seems normal to remove it from all_threads_mask.

No direct impact could be identified with this right now but it would
be better to backport it to 1.8 as a preventive measure to avoid complex
situations like the one in the previous bug.
2018-06-22 14:55:15 +02:00
Christopher Faulet
d8fd2af882 BUG/MEDIUM: threads: Use the sync point to check active jobs and exit
When HAProxy is shutting down, it exits the polling loop when there are no jobs
anymore (jobs == 0). When there is no thread, it works pretty well, but when
HAProxy is started with several threads, a thread can decide to exit because the
jobs variable reached 0 while another one is processing a task (e.g. a
health-check). At this stage, the running thread could decide to request a
synchronization. But because at least one of them has already gone, the others
will wait infinitely in the sync point and the process will never die.

To fix the bug, when the first thread (and only this one) detects there are no
active jobs anymore, it requests a synchronization. And in the sync point, all
threads will check if the jobs variable reached 0 to exit the polling loop.

This patch must be backported in 1.8.
2018-06-22 10:16:26 +02:00
Dave Chiluk
8618a6a5e2 MINOR: Some spelling cleanup in the comments.
Signed-off-by: Dave Chiluk <chiluk+haproxy@indeed.com>
2018-06-21 20:43:52 +02:00
Olivier Houchard
d0e60d852a BUG/MEDIUM: fd: Don't modify the update_mask in fd_dodelete().
Only the pollers should remove bits from the update_mask. Removing them
means that if the fd is currently in the global update list, it will never be
removed. While it's mostly harmless in 1.9, in 1.8 only update_mask
is checked to know whether the fd is already in the list or not, so we can end
up trying to add an fd that is already in the list and corrupt it, which
means some fds may not be added to the poller.

This should be backported to 1.8.
2018-06-20 10:21:44 +02:00
Emmanuel Hocdet
3448c490ca BUG/MEDIUM: ssl: do not store pkinfo with SSL_set_ex_data
Bug from 96b7834e: pkinfo is stored on SSL_CTX ex_data and should
not also be stored on SSL ex_data without reservation.
Simply extract pkinfo from SSL_CTX in ssl_sock_get_pkey_algo.

No backport needed.
2018-06-18 13:34:09 +02:00
Thierry FOURNIER
28962c9941 BUG/MAJOR: ssl: OpenSSL context is stored in non-reserved memory slot
We never saw an unexplained crash with SSL, so I suppose that we are
lucky, or that slot 0 is always reserved. Anyway the usage of the macros
SSL_get_app_data() and SSL_set_app_data() seems wrong. This patch replaces
the deprecated functions SSL_get_app_data() and SSL_set_app_data()
with the new functions SSL_get_ex_data() and SSL_set_ex_data(), and
it reserves the slot in the SSL memory space.

For information, these are the two declarations which seem wrong or
incomplete in the OpenSSL ssl.h file. We can see the usage of
slot 0, which is hardcoded but never reserved.

   #define SSL_set_app_data(s,arg)     (SSL_set_ex_data(s,0,(char *)arg))
   #define SSL_get_app_data(s)      (SSL_get_ex_data(s,0))

This patch must be backported at least in 1.8, maybe in other versions.
2018-06-18 10:32:14 +02:00
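
The safe pattern described here is standard OpenSSL ex_data usage: reserve an
index once, then store and fetch through it. A minimal sketch (the index
variable and the stored pointer are illustrative):

    #include <openssl/ssl.h>

    static int my_ssl_data_index = -1;

    /* Reserve a dedicated slot once at startup instead of hijacking the
     * hardcoded app_data slot 0. */
    static void reserve_ssl_slot(void)
    {
        my_ssl_data_index = SSL_get_ex_new_index(0, NULL, NULL, NULL, NULL);
    }

    static void attach_ctx(SSL *ssl, void *ctx)
    {
        SSL_set_ex_data(ssl, my_ssl_data_index, ctx);   /* vs SSL_set_app_data() */
    }

    static void *fetch_ctx(SSL *ssl)
    {
        return SSL_get_ex_data(ssl, my_ssl_data_index); /* vs SSL_get_app_data() */
    }
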
Thierry FOURNIER
16ff050478 BUG/MAJOR: ssl: Random crash with cipherlist capture
The cipher list capture struct is stored in the SSL memory space,
but the slot is reserved in the SSL_CTX memory space. This causes
random crashes.

This patch should be backported to 1.8
2018-06-18 10:32:12 +02:00
Frédéric Lécaille
f874a83b57 BUG/MINOR: lua: Segfaults with wrong usage of types.
Patrick reported that this simple configuration made haproxy segfault:

    global
        lua-load /tmp/haproxy.lua

    frontend f1
        mode http
        bind :8000
        default_backend b1

        http-request lua.foo

    backend b1
        mode http
        server s1 127.0.0.1:8080

with this '/tmp/haproxy.lua' script:

    core.register_action("foo", { "http-req" }, function(txn)
        txn.sc:ipmask(txn.f:src(), 24, 112)
    end)

This is due to missing initialization of the array of arguments
passed to hlua_lua2arg_check() which makes it enter code with
corrupted arguments.

Thanks a lot to Patrick Hemmer for having reported this issue.

Must be backported to 1.8, 1.7 and 1.6.
2018-06-18 10:23:47 +02:00
Olivier Houchard
9db0fedb59 BUG/MINOR: tasklets: Just make sure we don't pass a tasklet to the handler.
We can't just set t to NULL if it's a tasklet, or we'd have a hard time
accessing t->process, so just make sure we pass NULL as the first parameter
of t->process if it's a tasklet.
This should be a non-issue at this point, as tasklets aren't used yet.
2018-06-14 18:57:26 +02:00
William Lallemand
579fb25b62 BUG/MAJOR: map: fix a segfault when using http-request set-map
The bug happens with an existing entry, when you try to overwrite the
value with wrong data, for example, a string when the type is INT.

The code path was not secure and tried to set *err and *merr while
err = merr = NULL when performing an http action.

Must be backported in 1.6, 1.7, 1.8.
2018-06-11 11:02:06 +02:00
William Lallemand
6e1796e85d BUG/MINOR: signals: ha_sigmask macro for multithreading
The behavior of sigprocmask in a multithreaded environment is
undefined.

The new macro ha_sigmask() calls either pthread_sigmask() or
sigprocmask(), depending on whether haproxy was built with thread
support or not.

This should be backported to 1.8.
2018-06-08 18:24:53 +02:00
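
A sketch of what such a macro looks like (the real definition lives in
haproxy's threading header; this is an approximation, not a copy):

    #include <signal.h>
    #ifdef USE_THREAD
    #include <pthread.h>
    #define ha_sigmask(how, set, oldset) pthread_sigmask((how), (set), (oldset))
    #else
    #define ha_sigmask(how, set, oldset) sigprocmask((how), (set), (oldset))
    #endif
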
William Lallemand
933642c6ef BUG/MINOR: don't ignore SIG{BUS,FPE,ILL,SEGV} during signal processing
We don't have any reason to block those signals.

If SIGBUS, SIGFPE, SIGILL, or SIGSEGV are generated while they are blocked, the
result is undefined, unless the signal was generated by kill(2), sigqueue(3), or
raise(3).

This should be backported to 1.8.
2018-06-08 18:22:43 +02:00
William Lallemand
1aab50bb4a BUG/MEDIUM: threads: handle signal queue only in thread 0
Signals were handled in all threads, which caused some signals to be lost
from time to time. To avoid a complicated lock system (threads+signals),
we prefer handling the signals in one thread, avoiding concurrent access.

The side effect of this bug was that some processes were not exiting from
time to time during a reload.

This patch must be backported in 1.8.
2018-06-08 18:22:31 +02:00
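
A toy illustration of the rule (names are illustrative; the point is simply
that a single thread owns the queue, so no lock is needed):

    /* Would run the handlers registered for the queued signals. */
    static void drain_signal_queue(void)
    {
    }

    static void poll_loop_iteration(int thread_id)
    {
        if (thread_id == 0)
            drain_signal_queue();
        /* ... then process tasks and poll for I/O as usual ... */
    }
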