When HAProxy is started with several threads, each running thread holds a bit in the bitfield all_threads_mask. This bitfield is used here and there to check which threads are registered to take part in a specific processing. So when a thread exits, it seems normal to remove it from all_threads_mask.
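As an illustration only (HAProxy's own atomic macros and variable names may differ slightly), the exit path boils down to atomically clearing the thread's bit:

    #include <stdatomic.h>

    /* hedged sketch: each thread owns one bit in a shared mask and
     * clears it atomically when it exits */
    static _Atomic unsigned long all_threads_mask;

    static void on_thread_exit(unsigned long tid_bit)
    {
        atomic_fetch_and(&all_threads_mask, ~tid_bit);
    }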
No direct impact could be identified with this right now, but it would be better to backport it to 1.8 as a preventive measure to avoid complex situations like the one in the previous bug.
When HAProxy is shutting down, it exits the polling loop when there are no jobs left (jobs == 0). Without threads, this works pretty well, but when HAProxy is started with several threads, a thread can decide to exit because the jobs variable reached 0 while another one is still processing a task (e.g. a health check). At this stage, the running thread could decide to request a synchronization. But because at least one thread has already gone, the others will wait indefinitely in the sync point and the process will never die.
To fix the bug, when the first thread (and only this one) detects that there are no active jobs anymore, it requests a synchronization. And in the sync point, all threads check whether the jobs variable reached 0 in order to exit the polling loop.
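A rough model of the resulting flow (the helper names below are illustrative, not HAProxy's):

    /* per-thread polling loop (sketch) */
    for (;;) {
        run_poll_once();
        if (tid == 0 && jobs == 0)
            threads_want_sync();     /* thread 0 asks for a rendezvous */
        if (threads_sync_pending()) {
            enter_sync_point();      /* all threads meet here */
            if (jobs == 0)
                break;               /* every thread leaves together */
            exit_sync_point();
        }
    }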
This patch must be backported in 1.8.
Only the pollers should remove bits from the update_mask. Removing a bit means that if the fd is currently in the global update list, it will never be removed. While this is mostly harmless in 1.9, in 1.8 only the update_mask is checked to know whether the fd is already in the list or not, so we can end up trying to add an fd that is already in the list and corrupt it, which means some fds may not be added to the poller.
This should be backported to 1.8.
Bug from 96b7834e: pkinfo is stored in SSL_CTX ex_data and should not also be stored in SSL ex_data without a reservation.
Simply extract pkinfo from SSL_CTX in ssl_sock_get_pkey_algo.
No backport needed.
We never saw an unexplained crash with SSL, so I suppose that we are lucky, or that slot 0 is always reserved. Anyway, the usage of the macros SSL_get_app_data() and SSL_set_app_data() seems wrong. This patch replaces the deprecated functions SSL_get_app_data() and SSL_set_app_data() with the functions SSL_get_ex_data() and SSL_set_ex_data(), and it reserves the slot in the SSL memory space.
For information, these are the two declarations which seem wrong or incomplete in the OpenSSL ssl.h file. We can see that slot 0 is hardcoded, but never reserved:
    #define SSL_set_app_data(s,arg)  (SSL_set_ex_data(s,0,(char *)arg))
    #define SSL_get_app_data(s)      (SSL_get_ex_data(s,0))
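For reference, a proper reservation looks like the following sketch (the index variable name is ours, the OpenSSL calls are real):

    #include <openssl/ssl.h>

    static int ssl_app_data_index = -1;

    static void ssl_reserve_app_data_slot(void)
    {
        /* reserve a private ex_data slot instead of hardcoding 0 */
        ssl_app_data_index = SSL_get_ex_new_index(0, NULL, NULL, NULL, NULL);
    }

    /* then use SSL_set_ex_data(ssl, ssl_app_data_index, ptr) and
     * SSL_get_ex_data(ssl, ssl_app_data_index) */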
This patch must be backported at least in 1.8, maybe in other versions.
The cipher list capture struct is stored in the SSL memory space, but the slot is reserved in the SSL_CTX memory space. This causes random crashes.
This patch should be backported to 1.8.
Patrick reported that this simple configuration made haproxy segfault:
    global
        lua-load /tmp/haproxy.lua

    frontend f1
        mode http
        bind :8000
        default_backend b1
        http-request lua.foo

    backend b1
        mode http
        server s1 127.0.0.1:8080
with this '/tmp/haproxy.lua' script:
    core.register_action("foo", { "http-req" }, function(txn)
        txn.sc:ipmask(txn.f:src(), 24, 112)
    end)
This is due to a missing initialization of the array of arguments passed to hlua_lua2arg_check(), which makes it enter code with corrupted arguments.
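The fix boils down to something like the following sketch (struct arg and ARGM_NBARGS are HAProxy names, reused here for illustration):

    struct arg args[ARGM_NBARGS + 1];

    /* zero the whole array so that hlua_lua2arg_check() never reads
     * uninitialized entries */
    memset(args, 0, sizeof(args));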
Thanks a lot to Patrick Hemmer for having reported this issue.
Must be backported to 1.8, 1.7 and 1.6.
We can't just set t to NULL if it's a tasklet, or we'd have a hard time accessing t->process, so just make sure we pass NULL as the first parameter of t->process if it's a tasklet.
This should be a non-issue at this point, as tasklets aren't used yet.
The bug happens with an existing entry, when you try to overwrite the value with wrong data, for example a string when the type is INT.
The code path was not safe and tried to set *err and *merr while err == merr == NULL when performing an http action.
Must be backported in 1.6, 1.7, 1.8.
The behavior of sigprocmask in a multithreaded environment is undefined.
The new macro ha_sigmask() calls either pthread_sigmask() or sigprocmask(), depending on whether haproxy was built with thread support or not.
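A plausible shape for the macro (the guard name may differ in the tree):

    #include <signal.h>
    #include <pthread.h>

    #ifdef USE_THREAD
    #define ha_sigmask(how, set, old) pthread_sigmask((how), (set), (old))
    #else
    #define ha_sigmask(how, set, old) sigprocmask((how), (set), (old))
    #endif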
This should be backported to 1.8.
We don't have any reason to block those signals.
If SIGBUS, SIGFPE, SIGILL, or SIGSEGV are generated while they are blocked, the
result is undefined, unless the signal was generated by kill(2), sigqueue(3), or
raise(3).
This should be backported to 1.8.
Signals were handled in all threads, which caused some signals to be lost from time to time. To avoid a complicated locking system (threads + signals), we prefer handling the signals in a single thread, avoiding concurrent access.
The side effect of this bug was that some processes were not exiting from time to time during a reload.
This patch must be backported in 1.8.
When an unrecoverable error is raised, the user receives poor information for troubleshooting. For example:

    [ALERT] 157/143755 (21212) : Lua function 'hello-world': runtime error: memory allocation error: block too big.

Unfortunately, the memory allocation error can be thrown by many functions, and we have no information to reach the original cause. This patch adds the list of functions called from the entry point down to the function in error, like this:

    [ALERT] 157/143755 (21212) : Lua function 'hello-world': runtime error: memory allocation error: block too big from [C] method 'req_get_headers', bug35.lua:2 global 'ee', bug35.lua:6 global 'ff', bug35.lua:10 C function line 9.
When checking if a socket we got from the parent is suitable for a listener, we just checked that the path matched sockname.tmp. However, this is unsuitable for abns sockets, for which we don't have to create a temporary file and rename it later.
To detect that, check that the first character of sun_path is 0 for both, and if so, that everything from &sun_path[1] onward is the same too.
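A sketch of the comparison (simplified: real abstract names are delimited by the address length rather than NUL-terminated):

    #include <string.h>
    #include <sys/un.h>

    static int same_abns_path(const struct sockaddr_un *a,
                              const struct sockaddr_un *b)
    {
        /* abstract sockets start with a NUL byte */
        if (a->sun_path[0] != 0 || b->sun_path[0] != 0)
            return 0;
        return memcmp(&a->sun_path[1], &b->sun_path[1],
                      sizeof(a->sun_path) - 1) == 0;
    }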
This should be backported to 1.8.
To make sure we don't inadvertently insert a task into the global runqueue while only the local runqueue is used without threads, make its definition and usage conditional on USE_THREAD.
Since applets are now part of the main scheduler, it's useful to report
their nice value and the number of calls to the applet handler, to see
where the CPU is spent.
The commit c4dcaff3 ("BUG/MEDIUM: spoe: Flags are not encoded in network order") introduced an incompatibility with older agents. So the major version of SPOP is increased to make the situation unambiguous. And because the protocol was buggy before the fix, support for version 1.0 is removed to be sure not to keep supporting buggy agents.
The agents in the contrib folder (spoa_example, modsecurity and mod_defender) are also updated to announce SPOP version 2.0.
So, to be clear, with this patch, connections to agents announcing SPOP version 1.0 will be rejected.
This patch must be backported in 1.8.
The buffer pointer is already updated. It is then updated a second time when it is given to the function ci_putblk().
This patch must be backported in 1.6, 1.7 and 1.8.
When we write data, we risk encountering a deadlock. The function stream_int_notify() cannot be called from the cosocket because the caller acquires a lock, and when the socket is closed, the cleanup function tries to acquire the same lock, so a deadlock occurs.
On the other hand, the function stream_int_update_applet() can't be used because it schedules the applet only if some activity in the buffers was detected, which is not always the case. We replace this function by appctx_wakeup(), which wakes up the applet unconditionally.
The last part of the fix is setting the right signals. The applet calls the stream_int_update() function if the output buffer is not empty, and asks to put data if some write signals are registered.
This patch must be backported in 1.6, 1.7 and 1.8. Note that it requires
patch "MINOR: task/notification: Is notifications registered" to be
applied.
Each time the send function yields, a notification must be registered. Without this notification, the task is never woken up when data arrives.
Today, the notification is registered only if the buffer is not available. Other cases, like the buffer being too small for all the data, are not handled.
This patch must be backported in 1.6, 1.7 and 1.8.
In some cases, when we are waiting for data and the socket timeout expires, we have a deadlock. The Lua socket locks the applet socket and calls for a notify. The notify immediately executes code and tries to acquire the same lock, so... deadlock.
stream_int_notify() can't be used because it wakes up the applet task only if the stream has changed. The changes are forced by Lua, but not reported on the stream.
stream_int_update_applet() can't be used because of the deadlock. So, I unconditionally wake up the applet. This wakeup is performed asynchronously, and will call stream_int_notify().
This patch must be backported in 1.6, 1.7 and 1.8.
The appctx pointer was taken from the wrong variables. This implies the wakeup of the wrong applet, and the sockets are no longer responsive.
This behavior is hidden by another inherited error which is fixed in the next patch.
This patch removes all wrong appctx assignments.
This patch must be backported in 1.6, 1.7 and 1.8.
This is required to let a message processing time out. Because, when that happens, there is no more context attached to the SPOE applet that sent the NOTIFY frame, so when the ACK is received, it is too late. The same situation arises when we receive the wrong ACK; it is invalid in sync mode. Otherwise, the SPOE applet remains in the state "WAITING_SYNC_ACK" until the idle timeout is reached. In such a case, the applet is seen as busy and is unusable. If this happens too often, more and more applets will be created because the others are blocked. If there is a maxconn on the SPOE backend, all processing will be drastically slowed down.
Returning an error in such cases, in sync mode, allows us to terminate the SPOE applet, because it means the agent is unresponsive or too slow.
Note this bug exists only if the sync mode is used.
This patch must be backported in 1.8.
This introduces a new directive for the `resolvers` section:
`parse-resolv-conf`. When present, it will attempt to add any
nameservers in `/etc/resolv.conf` to the list of nameservers
for the current `resolvers` section.
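A minimal configuration sketch (the section name is illustrative):

    resolvers mydns
        parse-resolv-conf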
[Mailing list thread][1].
[1]: https://www.mail-archive.com/haproxy@formilux.org/msg29600.html
We're taking tasks from the global runqueue based on the number of tasks the thread already has in its local runqueue, but now that we have a task list, we also have to take that into account.
When the task list was introduced, we bogusly lost the max_processed-- decrement; that means we would execute as many tasks as are present in the list, and we would never set active_tasks_mask, so the thread would go to sleep even if more tasks were waiting to be executed.
1.9-dev only, no backport is needed.
These ones concern the warnings detected during header addition/insertion. They are visible in the tooltip reporting the per-status-code stats. The frontend and backend contain a total of request+response warnings, while servers only have the response warnings.
This patch adds a warning if an http-(request|response) (add|set)-header rewrite fails to change the respective header in a request or response.
This usually happens when tune.maxrewrite is not sufficient to hold all
the headers that should be added.
When using table_* converters, ref_cnt was incremented and never decremented, causing entries to never expire.
The root cause appears to be that stktable_lookup_key() was called within all the sample_conv_table_* functions, incrementing ref_cnt without decrementing it after completion.
Added stktable_release() to the end of each sample_conv_table_*
function and reworked the end logic to ensure that ref_cnt is
always decremented after use.
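The resulting pattern looks roughly like this (a sketch using the HAProxy names mentioned above):

    struct stksess *ts = stktable_lookup_key(t, key);

    if (ts) {
        /* ... fill the sample from the entry ... */
        stktable_release(t, ts); /* drop the ref taken by the lookup */
    }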
This should be backported to 1.8.
There's no real reason to have a specific scheduler for applets anymore, so
nuke it and just use tasks. This comes with some benefits, the first one
being that applets cannot induce high latencies anymore since they share
nice values with other tasks. Later it will be possible to configure the
applets' nice value. The second benefit is that the applet scheduler was not very thread-friendly and had a big lock around it in anticipation of this change. Thus applet-intensive workloads should now scale much better with
threads.
Some more improvement is possible now: some applets also use a task to
handle timers and timeouts. These ones could now be simplified to use only
one task.
Introduce tasklets, lightweight tasks. They have no notion of priority,
they are just run as soon as possible, and will probably be used for I/O
later.
For the moment they're used to replace the temporary thread-local list
that was used in the scheduler. The first part of the struct is common
with tasks so that tasks can be cast to tasklets and queued in this list.
Once a task is in the tasklet list, it has its leaf_p set to 0x1 so that it cannot accidentally be confused as not being in the queue.
Pure tasklets are identifiable by their nice value of -32768 (which is
normally not possible).
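A rough sketch of the layout idea (not the exact HAProxy definition):

    /* the first fields are shared with struct task so that a tasklet
     * can be cast to a task when queued in the tasklet list */
    struct tasklet {
        unsigned short state;   /* common part with struct task */
        short nice;             /* -32768 identifies a pure tasklet */
        struct list list;       /* queuing into the tasklet list */
    };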
A lot of tasks are run on one thread only, so instead of having them all
in the global runqueue, create a per-thread runqueue which doesn't require
any locking, and add all tasks belonging to only one thread to the
corresponding runqueue.
The global runqueue is still used for non-local tasks, and is visited
by each thread when checking its own runqueue. The nice parameter is
thus used both in the global runqueue and in the local ones. The rare
tasks that are bound to multiple threads will have their nice value
used twice (once for the global queue, once for the thread-local one).
In preparation for thread-specific runqueues, change the task API so that the callback takes 3 arguments: the task itself, the context, and the state, which were previously retrieved from the task. This will allow these elements to change atomically in the scheduler while the application uses the copied value, and even to have NULL tasks later.
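A handler under the new API then looks like this sketch (my_handler and my_ctx are hypothetical names):

    static struct task *my_handler(struct task *t, void *context,
                                   unsigned short state)
    {
        struct my_ctx *ctx = context; /* use the copy, not t->context */

        /* ... act on `state` rather than re-reading t->state ... */
        return t; /* or NULL if the task was freed */
    }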
The limit on the amount of data read works only if all the data is in the input buffer. Otherwise (if the data arrives in chunks), the total amount of data is not taken into account: only the currently read data is compared to the expected amount.
This patch must be backported to all versions from 1.9 down to 1.6.
When creating a state file using "show servers state" an empty field is
created in the srv_addr column if the server is from the socket family
AF_UNIX. This leads to a warning on startup when using
"load-server-state-from-file". This patch defaults srv_addr to "-" if
the socket family is not covered.
This patch should be backported to 1.8.
When checks fail, the code tries to run a DNS resolution, in case the IP changed.
The old way of doing that was either to check, in case the last DNS resolution hadn't expired yet, whether there was an applicable IP, which should be useless because it was already done when the resolution was first performed, or to run a new resolution.
Both are a locking nightmare and lead to deadlocks, so instead just wake the resolvers task, which should do the trick.
This should be backported to 1.8.
Sets OpenSSL 1.1.1's SSL_OP_PRIORITIZE_CHACHA unconditionally, as per [1]:
    When SSL_OP_CIPHER_SERVER_PREFERENCE is set, temporarily reprioritize
    ChaCha20-Poly1305 ciphers to the top of the server cipher list if a
    ChaCha20-Poly1305 cipher is at the top of the client cipher list. This
    helps those clients (e.g. mobile) use ChaCha20-Poly1305 if that cipher
    is anywhere in the server cipher list; but still allows other clients to
    use AES and other ciphers. Requires SSL_OP_CIPHER_SERVER_PREFERENCE.
[1] https://www.openssl.org/docs/man1.1.1/man3/SSL_CTX_clear_options.html
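In practice this is a one-liner wherever the server-side context options are set, guarded for older OpenSSL versions (sketch):

    #ifdef SSL_OP_PRIORITIZE_CHACHA
        SSL_CTX_set_options(ctx, SSL_OP_PRIORITIZE_CHACHA);
    #endif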
RFC 7234 says:

    A cache MUST NOT store a response to any request, unless: [...] the
    Authorization header field (see Section 4.2 of [RFC7235]) does not
    appear in the request, if the cache is shared, unless the response
    explicitly allows it (see Section 3.2), [...]
In this patch we completely disable the cache upon receipt of an Authorization header in the request. In this case it is no longer possible to either use the cache or store into it.
Thanks to Adam Eijdenberg of Digital Transformation Agency for raising
this issue.
This patch must be backported to 1.8.
The function hlua_ctx_resume() now returns fewer text messages and more error codes. These error codes allow the caller to return an appropriate message to the user.
Since commit 36d1374 ("BUG/MINOR: lua: Fix SSL initialisation") in 1.6, the
Lua code always initializes an SSL server. It caused a small visible side
effect which is that by calling ssl_sock_prepare_srv_ctx(), it forces
global.ssl_used_backend to 1 and makes the initialization code believe that
there are some SSL servers in certain backends. This detection is used to
figure how to set the global maxconn value when only the memory usage is
limited. As such, even a configuration with no SSL at all will have a very
conservative maxconn.
The configuration below exhibits this:

    global
        ssl-server-verify none
        stats socket /tmp/sock1 mode 666 level admin
        tune.bufsize 16384

    listen px
        timeout client 5s
        timeout server 5s
        timeout connect 5s
        bind :4445
        #bind :4443 ssl crt rsa+dh2048.pem
        #server s1 127.0.0.1:8003 ssl
Starting it with "-m 200" to limit it to 200 MB of RAM reports 1500 for
Maxconn, the same when uncommenting the "server" line, and 1300 when
uncommenting the "bind" line, regardless of the "server" line's status.
In practice it doesn't make sense to consider that Lua's server template
counts for one regular SSL server, because even if used for SSL, it will
not take large connection counts, compared to a backend relaying traffic.
Thus the solution consists in resetting the ssl_used_backend to its
previous value after creating the server_ctx from the Lua code. With the
fix, the same config with the same parameters now shows:
- maxconn=5700 when neither side uses SSL
- maxconn=1500 when only one side uses SSL
- maxconn=1300 when both sides use SSL
This fix can be backported to versions 1.6 and beyond.
The announced accepted characters are "[a-zA-Z_-.]", but the real accepted alphabet is "[a-zA-Z0-9_.]": numbers are supported and "-" is not.
This patch should be backported to 1.8 and 1.7.
Function `hlua_socket_close` expected exactly one argument on the Lua stack.
But when `hlua_socket_close` was called from `hlua_socket_write_yield`,
Lua stack had 3 arguments. So `hlua_socket_close` threw the exception with
message "'close' needs 1 arguments".
Introduce a new helper function `hlua_socket_close_helper`, which removes the Lua stack argument count check and only checks that the first argument is a socket.
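The split looks roughly like this (a sketch reusing HAProxy's Lua glue names):

    /* does the work without checking the stack arity */
    static int hlua_socket_close_helper(lua_State *L)
    {
        struct hlua_socket *socket = MAY_LJMP(hlua_checksocket(L, 1));

        /* ... actually close the socket ... */
        return 0;
    }

    /* the Lua-facing function keeps the strict argument check */
    static int hlua_socket_close(lua_State *L)
    {
        MAY_LJMP(check_args(L, 1, "close"));
        return hlua_socket_close_helper(L);
    }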
This fix should be backported to 1.8, 1.7 and 1.6.
Commit 821bb9b ("MAJOR: threads/ssl: Make SSL part thread-safe") added
insufficient locking to the cert lookup and generation code : it uses
lru64_lookup(), which will automatically remove and add a list element
to the LRU list. It cannot be simply read-locked.
A long-term improvement should consist in using a lockless mechanism
in lru64_lookup() to safely move the list element at the head. For now
let's simply use a write lock during the lookup. The effect will be
minimal since it's used only in conjunction with automatically generated
certificates, which are much more expensive and rarely used.
This fix must be backported to 1.8.
Pawel Karoluk reported on Discourse[1] that HTTP/2 breaks url_param.
Christopher managed to track it down to the HTTP_MSGF_WAIT_CONN flag
which is set there to ensure the connection is validated before sending
the headers, as we may need to rewind the stream and hash again upon
redispatch. What happens is that in the forwarding code we refrain
from forwarding when this flag is set and the connection is not yet
established, and for this we go through the missing_data_or_waiting
path. This exit path was initially designed only to wait for data
from the client, so it rightfully checks whether or not the client
has already closed since in that case it must not wait for more data.
But it also has the side effect of aborting such a transfer if the
client has closed after the request, which is exactly what happens
in H2.
A study of the code reveals that this whole combined check should be revisited: while it used to be true that waiting had the same error conditions as missing data, it's not true anymore. Some other
corner cases were identified, such as the risk of reporting a server close instead of a client timeout when waiting for the client to read the last chunk of data if the shutr is already present, or the risk of failing a redispatch when a client uploads some data and closes before the connection is established. The compression seems to
be at risk of rare issues there if a write to a full buffer is not
yet possible but a shutr is already queued.
At the moment these risks are extremely unlikely but they do exist, and their impact is very minor since it mostly concerns an issue not being optimally handled, and the fixes risk causing more serious issues. Thus this patch only focuses on how the HTTP_MSGF_WAIT_CONN
is handled and leaves the rest untouched.
This patch needs to be backported to 1.8, and could be backported to
earlier versions to properly take care of HTTP/1 requests passing via
url_param which are closed immediately after the headers, though this
is unlikely as this behaviour is only exhibited by scripts.
[1] https://discourse.haproxy.org/t/haproxy-1-8-x-url-param-issue-in-http2/2482/13
In commit abbf607 ("MEDIUM: cli: Add payload support") some cli keywords without a usage message were added at the beginning of the keywords array.
cli_gen_usage_msg() uses kw->usage == NULL to stop generating the usage message for the current keywords array. With those keywords at the beginning, the whole array in cli.c was ignored in the usage message generation.
This patch now checks the keyword itself, allowing a keyword without a usage message anywhere in the array.