haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-09-04 05:21:21 +02:00

Author	SHA1	Message	Date
Olivier Houchard	7fc3be76c7	MINOR: servers: Free [idle\|safe\|priv]_conns on exit. Don't forget to free idle_conns, safe_conns and priv_conns on exit. This can be backported to 1.8.	2018-11-22 19:53:03 +01:00
Willy Tarreau	609aad9e73	REORG: time/activity: move activity measurements to activity.{c,h} At the moment the situation with activity measurement is quite tricky because the struct activity is defined in global.h and declared in haproxy.c, with operations made in time.h and relying on freq_ctr which are defined in freq_ctr.h which itself includes time.h. It's barely possible to touch any of these files without breaking all the circular dependency. Let's move all this stuff to activity.{c,h} and be done with it. The measurement of active and stolen time is now done in a dedicated function called just after tv_before_poll() instead of mixing the two, which used to be a lazy (but convenient) decision. No code was changed, stuff was just moved around.	2018-11-22 11:48:41 +01:00
William Lallemand	0564d41333	BUG/MEDIUM: mworker: unregister the signals of main() The signal_register_fct() does not remove the handlers assigned to a signal, but add a new handler to a list. We accidentality inherited the handlers of the main() function in the master process which is a problem because they act on the proxies. The side effect was to stop the MASTER proxy which handle the master CLI on a SIGUSR1, and to display some debug info when doing a SIGHUP and a SIGQUIT.	2018-11-22 11:42:51 +01:00
William Lallemand	220567ec34	MINOR: mworker: use ha_notice to announce a new worker Displays the PID and the relative PID when we fork a new worker with ha_notice().	2018-11-21 19:02:23 +01:00
William Lallemand	944e619b64	MEDIUM: mworker: wait mode use standard init code path The mworker waitpid mode (which is used when a reload failed to apply the new configuration) was still using a specific initialisation path. That's a problem since we use a polling loop in the master now, the master proxy is not initialized and the master CLI is not activated. This patch removes the initialisation code of the wait mode and introduce the MODE_MWORKER_WAIT in order to use the same init path as the MODE_MWORKER with some exceptions. It allows to use the master proxy and the master CLI during the waitpid mode.	2018-11-21 17:05:30 +01:00
William Lallemand	16dd1b3ead	MINOR: cli: show master information in 'show proc' Displays the master information in show proc.	2018-11-20 04:43:54 +01:00
William Lallemand	e368330128	MINOR: cli: displays uptime in `show proc` Displays the uptime of the workers in `show proc`	2018-11-20 04:43:54 +01:00
Joseph Herlant	07a0834635	MINOR: Fix an error message thrown when we run out of memory Fixes a typo in an error message that can be seen by the end user when the haproxy subsystem runs out of memory.	2018-11-18 22:23:15 +01:00
Joseph Herlant	0342090ed7	CLEANUP: Fix some typos in the haproxy subsystem Fix some typos in the code comments of the haproxy subsystem.	2018-11-18 22:23:15 +01:00
William Lallemand	a719926cf8	MEDIUM: jobs: support unstoppable jobs for soft stop This patch allows a process to properly quit when some jobs are still active, this feature is handled by the unstoppable_jobs variable, which must be atomically incremented. During each new iteration of run_poll_loop() the break condition of the loop is now (jobs - unstoppable_jobs) == 0. The unique usage of this at the moment is to handle the socketpair CLI of a the worker during the stopping of the process. During the soft stop, we could mark the CLI listener as an unstoppable job and still handle new connections till every other jobs are stopped.	2018-11-16 17:05:40 +01:00
William Lallemand	2e8fad9c30	MINOR: mworker: only close std{in,out,err} in daemon mode This allows to output messages when we are not in daemon mode which is useful to use log stdout in master worker mode.	2018-11-13 16:21:15 +01:00
David Carlier	42d9e5ae68	BUILD/MEDIUM: threads/affinity: DragonFly build fix DragonFlyBSD does not have a build on its own, it has always used the FreeBSD's. To be able to support the cpu affinity, it needs few more headers.	2018-11-12 19:16:00 +01:00
William Lallemand	e260e0df44	BUG/MEDIUM: cli: crash when trying to access a worker When using the CLI proxy of the master and trying to access a worker with the @ prefix, the worker just crash. The commit 7216032 ("MEDIUM: mworker: leave when the master die") reintroduced the old code of the pipe, which was not trying to access the pointers before. The owner of the FD was modified to a different value, this is a problem since we call listener_accept() in most cases now from the mworker_accept_wrapper() and it casts the owner variable to get the listener. This patch fix the issue by setting back the previous owner of the FD.	2018-11-08 14:48:06 +01:00
William Lallemand	a90cacfd70	BUG/MEDIUM: mworker: does not abort() in mworker_pipe_register() The process was aborting with nbthread > 1. The mworker_pipe_register() could be called several time in multithread mode, we don't want to abort() there.	2018-11-07 09:26:36 +01:00
William Lallemand	7216032e6f	MEDIUM: mworker: leave when the master die When the master die, the worker should exit too, this is achieved by checking if the FD of the socketpair/pipe was closed between the master and the worker. In the former architecture of the master-worker, there was only a pipe between the master and the workers, and it was easy to check an EOF on the pipe FD to exit() the worker. With the new architecture, we use a socketpair by process, and this socketpair is also used to accept new connections with the listener_accept() callback. This accept callback can't handle the EOF and the exit of the process, because it's very specific to the master worker. This is why we transformed the mworker_pipe_handler() function in a wrapper which check if there is an EOF and exit the process, and if not call listener_accept() to achieve the accept.	2018-11-06 18:30:57 +01:00
William Lallemand	5d05db8ce1	MINOR: mworker: displays a message when a worker is forked Displays the PID and the relative PID when we fork a new worker.	2018-11-06 18:30:50 +01:00
William Lallemand	91723745c6	MEDIUM: mworker: exit with the incriminated exit code The former behavior was to exit() the master process with the latest status code known, which was the one of the last process to exit. The problem is that the master process was not exiting with the status code which provoked the exit-on-failure.	2018-11-06 18:28:33 +01:00
William Lallemand	18e52a8834	MINOR: mworker: displays more information when leaving When a worker is leaving, we display the relative PID and the result of the strsignal() function if it was killed by a signal.	2018-11-06 18:28:33 +01:00
William Lallemand	550db6d188	MEDIUM: mworker: does not create the CLI proxy when no listener Does not create the CLI proxy if no -S argument was specified. It prevents a warning that says that the MASTER proxy does not have any bind option.	2018-11-06 18:28:33 +01:00
Willy Tarreau	2d372c2aa1	MINOR: stats: report the number of currently connected peers The active peers output indicates both the number of established peers connections and the number of peers connection attempts. The new counter "ConnectedPeers" also indicates the number of currently connected peers. This helps detect that some peers cannot be reached for example. It's worth mentioning that this value changes over time because unused peers are often disconnected and reconnected. Most of the time it should be equal to ActivePeers.	2018-11-05 17:15:21 +01:00
Willy Tarreau	199ad24661	MINOR: stats: report the number of active peers in "show info" Peers are the last type of activity which can maintain a job present, so it's important to report that such an entity is still active to explain why the job count may be higher than zero. Here by "ActivePeers" we report peers sessions, which include both established connections and outgoing connection attempts.	2018-11-05 17:15:21 +01:00
William Lallemand	309dc9adec	MEDIUM: mworker: stop the master proxy in the workers The master proxy which handles the CLI should not be used or shown in the stats of the workers. This proxy is now disabled after the fork.	2018-10-28 14:03:31 +01:00
William Lallemand	e736115d3a	MEDIUM: mworker: create CLI listeners from argv[] This patch introduces mworker_cli_proxy_new_listener() which allows the creation of new listeners for the CLI proxy. Using this function it is possible to create new listeners from the program arguments with -Sa <unix_socket>. It is allowed to create multiple listeners with several -Sa.	2018-10-28 13:51:39 +01:00
William Lallemand	8a02257d88	MEDIUM: mworker: proxy for the master CLI This patch implements a listen proxy within the master. It uses the sockpair of all the workers as servers. In the current state of the code, the proxy is only doing round robin on the CLI of the workers. A CLI mode will be needed to know to which CLI send the requests.	2018-10-28 13:51:39 +01:00
William Lallemand	1b66361f8d	MEDIUM: mworker: move proc_list gen before proxies startup We need to generate the process list before starting the proxies, because it will be used to create a proxy in the master	2018-10-28 13:51:38 +01:00
William Lallemand	7e1299bb3a	REORG: mworker: move struct mworker_proc to global.h Move the definition of the mworker_proc structure in types/global.h.	2018-10-28 13:51:38 +01:00
William Lallemand	ce83b4a5dd	MEDIUM: mworker: each worker socketpair is a CLI listener The init code of the mworker_proc structs has been moved before the init of the listeners. Each socketpair is now connected to a CLI within the workers, which allows the master to access their CLI. The inherited flag of the worker side socketpair is removed so the socket can be closed in the master.	2018-10-28 13:51:38 +01:00
William Lallemand	f1a62860c8	MINOR: mworker: number of reload in the life of a worker This patch adds a field in the mworker_proc structure which contains how much time the master reloaded during the life of a worker.	2018-10-28 13:51:38 +01:00
William Lallemand	dd319a5b1d	BUG/MEDIUM: mworker: don't poll on LI_O_INHERITED listeners The listeners with the LI_O_INHERITED flag were deleted but not unbound which is a problem since we have a polling in the master. This patch unbind every listeners which are not require for the master, but does not close the FD of those that have a LI_O_INHERITED flag.	2018-10-12 19:30:18 +02:00
Emeric Brun	c8c0ed91cb	BUG/MEDIUM: mworker: segfault receiving SIGUSR1 followed by SIGTERM. This bug appeared only if nbthread > 1. Handling the pipe with the master, multiple threads of the same worker could process the deinit(). In addition, deinit() was called while some other threads were still performing some tasks. This patch assign the handler of the pipe with master to only the first thread and removes the call to deinit() before exiting with an error. This patch should be backported in v1.8.	2018-10-11 16:29:38 +02:00
Dirkjan Bussink	1d323de5e1	CLEANUP: haproxy: Remove unused variable Looking at the code, this variable is no longer used and referenced nowhere. That means it can be safely removed.	2018-10-09 15:09:25 +02:00
Willy Tarreau	61c112aa5b	REORG: http: move HTTP rules parsing to http_rules.c These ones are mostly called from cfgparse.c for the parsing and do not depend on the HTTP representation. The functions's prototypes were moved to proto/http_rules.h, making this file work exactly like tcp_rules. Ideally we should stop calling these functions directly from cfgparse and register keywords, but there are a few cases where that wouldn't work (stats http-request) so it's probably not worth trying to go this far.	2018-10-02 18:28:05 +02:00
William Lallemand	cd5c944ea5	BUILD: fix build without thread Cyril Bont� reported that commit f9cc07c25b broke the build without thread. We don't need to initialise tid = 0 in mworker_loop, so we could completely remove it.	2018-09-12 13:59:00 +02:00
Willy Tarreau	04f1e2d202	REORG: http: move error codes production and processing to http.c These error codes and messages are agnostic to the version, even if they are represented as HTTP/1.0 messages. Ultimately they will have to be transformed into internal HTTP messages to be used everywhere. The HTTP/1.1 100 Continue message was turned to an IST and the local copy in the Lua code was removed.	2018-09-11 10:30:25 +02:00
William Lallemand	123f1f6441	MEDIUM: mworker: call per_thread deinit in mworker_reload() We need to clean the FDs registered manually in the poller to avoid FD leaking during a reload of the master. This patch call the per thread deinit function which close the thread waker pipe.	2018-09-11 10:23:24 +02:00
William Lallemand	e22f11ff47	MINOR: mworker: keep and clean the listeners Keep the listeners that should be used in the master process and clean them in the workers.	2018-09-11 10:23:24 +02:00
William Lallemand	bc19305e53	MEDIUM: mworker: replace the master pipe by socketpairs In order to communicate with the workers, the master pipe has been replaced by a socketpair() per worker. The goal is to use these sockets as stats sockets and be able to access them from the master. When reloading, the master serialize the information of the workers and put them in a environment variable. Once the master has been reexecuted it unserialize that information and it is capable of closing the FDs of the leaving children.	2018-09-11 10:21:58 +02:00
William Lallemand	f9cc07c25b	MEDIUM: mworker: master wait mode use its own initialization The master now use a poll loop, which should be initialized even in wait mode. We need to init some variables if we didn't success to load the configuration file.	2018-09-11 10:21:58 +02:00
William Lallemand	de0ff5ab20	MINOR: mworker: don't deinit the poller fd when in wait mode If haproxy failed to load its configuration, the process is reexecuted and it did not init the poller. So we must not try to deinit the poller before the exec().	2018-09-11 10:21:58 +02:00
William Lallemand	d3801c1c21	MEDIUM: startup: unify signal init between daemon and mworker mode The signals are now unblocked only once the configuration have been parsed.	2018-09-11 10:21:58 +02:00
William Lallemand	242aae96c7	MEDIUM: mworker: never block SIG{TERM,INT} during reload The master should be able to be killed even if the reload is not finished.	2018-09-11 10:21:58 +02:00
William Lallemand	ebf304f8dd	MEDIUM: mworker: block SIGCHLD until the master is ready With the new way of handling the signals in the master worker, we are are not staying in a waitpid() loop. Which means that we need to catch the SIGCHLD signals to call waitpid(). The problem is when the master is reloading, this signal is neither registered nor blocked so we lost all signals between the restart and the call to mworker_loop(). This patch blocks the SIGCHLD signals before the reloading and ensure it's not unblocked before the master registered the SIGCHLD handler.	2018-09-11 10:21:58 +02:00
William Lallemand	91c13b696a	MINOR: mworker: mworker_cleanlisteners() delete the listeners The mworker_cleanlisteners() function now remove the listeners, we don't need them in the master for now.	2018-09-11 10:21:58 +02:00
William Lallemand	3da9769ee4	BUG/MINOR: mworker: no need to stop peers for each proxy The mworker_cleanlisteners() was cleaning the peers in the proxy loop, which is useless since we need to stop the peers only once.	2018-09-11 10:21:58 +02:00
William Lallemand	b3f2be338b	MEDIUM: mworker: use the haproxy poll loop In order to reorganize the code of the master worker, the mworker_wait() function which was the main function was split. This function was handling a wait() loop, but it does not need it anymore since the code will use the poll loop of haproxy instead. The function was split in several functions: - mworker_catch_sigterm() which is a signal handler for SIGTERM ans SIGUSR1 that sends the signals to the workers - mworker_catch_sigchld() which is the code handling the leaving of a child - mworker_catch_sighup which basically call the mworker_restart() function - mworker_loop() which is the function calling the main poll loop in the master	2018-09-11 10:21:58 +02:00
William Lallemand	73e1dfcfdf	MEDIUM: mworker: remove register/unregister signal functions Remove the register and unregister signal functions specifics to the master worker, because that should be done with the generic ones.	2018-09-11 10:21:58 +02:00
Willy Tarreau	3ff577e165	MAJOR: server: make server state changes synchronous again Now we try to synchronously push updates as they come using the new rdv point, so that the call to the server update function from the main poll loop is not needed anymore. It further reduces the apparent latency in the health checks as the response time almost always appears as 0 ms, resulting in a slightly higher check rate of ~1960 conn/s. Despite this, the CPU consumption has slightly dropped again to ~32% for the same test. The only trick is that the checks code is built with a bit of recursivity because srv_update_status() calls server_recalc_eweight(), and the latter needs to signal srv_update_status() in case of updates. Thus we added an extra argument to this function to indicate whether or not it must propagate updates (no if it comes from srv_update_status).	2018-08-08 09:57:45 +02:00
Willy Tarreau	647c70b681	MINOR: threads: remove the previous synchronization point It's not needed anymore as it is fully covered by the new rendez-vous point. This also removes the pipe and its polling.	2018-08-08 09:57:45 +02:00
Willy Tarreau	85c459d7e8	MEDIUM: haproxy: don't use sync_poll_loop() anymore in the main loop This partially reverts commit d8fd2af ("BUG/MEDIUM: threads: Use the sync point to check active jobs and exit") which used to address an issue in the way the sync point used to check for present threads, which was later addressed by commit ddb6c16 ("BUG/MEDIUM: threads: Fix the exit condition of the thread barrier"). Thus there is no need anymore to use the sync point for exiting and we can completely remove this call in the main loop.	2018-08-08 09:56:32 +02:00
Willy Tarreau	3d3700f216	MEDIUM: checks: use the new rendez-vous point to spread check result The current sync point causes some important stress when a high number of threads is in use on a config with lots of checks, because it wakes up all threads every time a server state changes. A config like the following can easily saturate a 4-core machine reaching only 750 checks per second out of the ~2000 configured : global nbthread 4 defaults mode http timeout connect 5s timeout client 5s timeout server 5s frontend srv bind :8001 process 1/1 redirect location / if { method OPTIONS } { rand(100) ge 50 } stats uri / backend chk option httpchk server-template srv 1-100 127.0.0.1:8001 check rise 1 fall 1 inter 50 The reason is that the random on the fake server causes the responses to randomly match an HTTP check, and results in a lot of up/down events that are broadcasted to all threads. It's worth noting that the CPU usage already dropped by about 60% between 1.8 and 1.9 just due to the scheduler updates, but the sync point remains expensive. In addition, it's visible on the stats page that a lot of requests end up with an L7TOUT status in ~60ms. With smaller timeouts, it's even L4TOUT around 20-25ms. By not using THREAD_WANT_SYNC() anymore and only calling the server updates under thread_isolate(), we can avoid all these wakeups. The CPU usage on the same config drops to around 44% on the same machine, with all checks being delivered at ~1900 checks per second, and the stats page shows no more timeouts, even at 10 ms check interval. The difference is mainly caused by the fact that there's no more need to wait for a thread to wake up from poll() before starting to process check results.	2018-08-08 09:56:32 +02:00

1 2 3 4 5 ...

661 Commits