This patch is part of a series to reintroduce 'program' support in the new
master-worker architecture.
For the moment we keep the order of the program and worker forks the same as
before the refactoring, as we need to be sure that this won't introduce
regressions. So, programs are forked before the new worker process.
Before forking the programs we already need the deserialized processes list,
in order to find the programs launched before the reload and to stop them. The
processes list is saved in the HAPROXY_PROCESSES variable before the reload.
It must be deserialized before the first configuration read in discovery mode,
because the 'resetenv' keyword may be present in the global section.
So, let's move mworker_env_to_proc_list() from mworker_create_master_cli() to
main(). We need to call it only after a reload in master-worker mode, thus
HAPROXY_MWORKER_REEXEC and HAPROXY_PROCESSES must still be present in the
re-executed process environment before the first configuration read.
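For illustration, a minimal sketch of the intended call order in main()
(simplified, error handling omitted):

    /* before the first configuration read in discovery mode */
    if ((global.mode & MODE_MWORKER) && getenv("HAPROXY_MWORKER_REEXEC")) {
        /* rebuild the processes list from HAPROXY_PROCESSES while the
         * variable is still there, i.e. before a potential 'resetenv'
         * from the global section is applied
         */
        mworker_env_to_proc_list();
    }
    /* ... first configuration read in MODE_DISCOVERY ... */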
Let's encapsulate the logic to set the verbosity modes (MODE_DEBUG and
MODE_VERBOSE) in a separate function, set_verbosity(). This makes the code of
main() more readable and allows calling set_verbosity() for the master process
in recovery mode. So, in this mode, the verbosity settings in effect before
the master re-execution are re-applied to the master. set_verbosity() will be
extended in future commits to reduce the master's verbosity, so that it does
not dump the pollers list and filters if it was started with -V or -d.
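A minimal sketch of the helper's shape (illustrative only; the exact flag
handling in the tree is more involved):

    static void set_verbosity(int arg_mode)
    {
        /* propagate -d and -V from the command line to the runtime mode */
        if (arg_mode & MODE_DEBUG)
            global.mode |= MODE_DEBUG;
        if (arg_mode & MODE_VERBOSE)
            global.mode |= MODE_VERBOSE;
    }

Being a plain function, the master can simply call it again in recovery mode
to re-apply the verbosity it was started with.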
In this commit we add run_master_in_recovery_mode(), which groups all the
initialization steps that the master must perform in order to enter its
polling loop (run_master()) when it fails to parse its new configuration.
As exit_on_failure() is now adapted to the master recovery mode, let's
register it as an atexit handler when the master enters this mode, and let's
remove the atexit_flag variable for the master, because we no longer use it.
We also slightly refactor read_cfg_in_discovery_mode() here in order to call
run_master_in_recovery_mode() in the case described above. Warning messages
are mandatory before calling run_master_in_recovery_mode(), as they allow
haproxy to be stopped with an error if it was launched in zero-warning mode.
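A sketch of the resulting failure path in read_cfg_in_discovery_mode()
(simplified; the real error handling is richer):

    if (read_cfg() < 0) {
        if (master) {
            /* mandatory: lets zero-warning mode abort right here */
            ha_warning("config parsing failed, keeping the previous worker\n");
            run_master_in_recovery_mode(argc, argv);
            /* not reached: the master enters its polling loop */
        }
        exit(1);
    }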
So, in recovery mode the master does not launch any worker. It just performs
its necessary initialization routines and enters its polling loop to continue
monitoring the existing worker process.
Master recovery mode replaces the former wait-mode, with the difference that
the master in this case doesn't try to fork a new worker process. But it
still needs to enter its polling loop in order to monitor the previous
worker. The master performs some initialization steps for this and recreates
its master CLI. During these initialization steps, the master could
potentially fail again. As the master init steps currently rely on some
common routines (step_init_2() and step_init_3()), there is no way there to
signal to the user that the failure happened in the master and, moreover, in
its recovery mode. So, in such a case, exit_on_failure() is still useful to
print an appropriate alert, as we can register this function as an atexit
handler for the master.
Let's encapsulate here the code that loads and reads the configuration for
the first time in MODE_DISCOVERY. This makes the code of main() more readable
and provides the structure for adding the master initialization routines
needed to support master recovery mode.
Let's encapsulate the master's code (the steps it performs before entering
its polling loop and the deinitialization routines after it) in a separate
run_master() function. This makes the code of main() more readable. In the
future we plan to put more master-process-related code into run_master(), in
order to completely clean up step_init_2(), step_init_3() and step_init_4().
Let's encapsulate here another part of main(), after binding the listeners'
sockets and before calling the master's code in master-worker mode. This
block contains the code which applies the verbosity settings, checks limits
and updates the ready date. It will take some time to figure out which of
these parts are really needed for the master and which ones it could skip.
So, for the moment, let's put all of these in step_init_4() and call it for
all modes.
Let's encapsulate here the code which tries to bind the listeners for the
new process in a separate function. This will make the main() code more
readable. The master process, even if it has failed while reading its new
configuration, has to bind its master CLI sockets. This way we will be able
to call this function in master recovery mode.
The master CLI socket address and port for external connections (user,
monitoring tools) are for now provided only via the command line. So the
master, even after this failure, can and must reestablish its master CLI
connections.
Let's encapsulate here the code that calls sock_get_old_sockets() to obtain
the listeners' sockets from the previous process into a separate function.
This will make the code of main() more readable, and we can move this new
function elsewhere in the future if needed.
MODE_CHECK and MODE_CHECK_CONDITION are now applied very early, in
step_init_1() and step_init_2(), in order to check the configuration or to
check some condition provided via the command line. When these checks have
terminated, the main process exits. So there is no longer any need to verify
these modes at the moment when the current process has already done its basic
initialization routines and is asking for the listeners' sockets from the
previously started one.
The first part of main(), just after calling the former init() and before
trying to bind the listeners, also needs to be encapsulated as-is into a
separate step_init_3(). It contains important blocks which register signals,
apply memory and nofile limits, etc. The order of these blocks must also be
preserved (especially the signals part).
For the moment, step_init_3() must also be executed for all runtime modes.
This is the first commit in a series to add support for the 5th reload use
case, when the master process fails to read its new configuration. In this
case it just needs to perform its initialization steps and keep the existing
worker.
To add support for this last use case we need to split init() and main() into
shorter steps, in order to encapsulate the necessary initialization routines
into separate functions.
Let's first make progname a global variable for haproxy.c, as it will be
used in error messages in the initialization functions. Then let's split
init() into separate routines which set and apply modes, write the process
PID in a pidfile, etc.
The big part of the former init(), which called functions to allocate pools,
initialize proxies, calculate maxconn and perform some post checks, was just
encapsulated as-is into step_init_2(). It will take some time to figure out
exactly which parts of this initialization block are really necessary for the
master process and which ones it could skip. So, for the moment,
step_init_2() is called for all runtime modes.
Before the refactoring of master-worker mode, in all runtime modes, when the
new process had successfully parsed its configuration and bound to sockets,
it sent either SIGUSR1 or SIGTERM to the previous one in order to terminate
it.
Let's keep this logic as-is for the standalone mode. In addition, in
standalone mode we need to send the signal to the old process before calling
set_identity(), because the effective user or group may change in
set_identity(). So the order is important here.
In master-worker mode after the refactoring, the master terminates the
previous worker by itself upon receiving the "READY" status from the new one
in _send_status(). At this moment the master also sets the
HAPROXY_LOAD_SUCCESS env variable and checks whether there are other workers
to terminate which have exceeded max_reloads.
So, now in master-worker mode we terminate old workers only when the new one
has successfully completed all its initialization steps and has sent its
"READY" status to the master.
In the new master-worker architecture, when a worker process is forked and
successfully initialized, it needs to somehow communicate its "READY" state
to the master, in order to terminate the previous worker and any workers that
might have exceeded the max_reloads counter.
So, let's implement a new master CLI _send_status command for this. The new
worker can send its "READY" status string to the master when it's about to
enter its polling loop, and thus can start to receive data.
In _send_status(), in the master context, we update the status of the new
worker: the PROC_O_INIT flag is withdrawn.
When the TERM signal is sent to a worker, the worker terminates, and this
triggers the mworker_catch_sigchld() handler in the master. This handler
deletes the exiting process entry from the processes list.
In _send_status() we loop over the processes list twice: the first time, in
order to stop the workers that exceeded the max_reloads counter; the second
time, in order to stop the worker forked before the last reload. In the
corner case where max_reloads=1, we avoid sending SIGTERM twice to the same
worker by setting a sigterm_sent flag during the first loop.
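A sketch of the two passes (field and helper names are approximate;
list_for_each_entry() is the usual haproxy list iterator):

    struct mworker_proc *proc;
    /* first pass: stop workers which exceeded the max_reloads counter */
    list_for_each_entry(proc, &proc_list, list) {
        if ((proc->options & PROC_O_TYPE_WORKER) &&
            proc->reloads > max_reloads) {
            kill(proc->pid, SIGTERM);
            proc->sigterm_sent = 1; /* don't signal it again below */
        }
    }
    /* second pass: stop the worker forked before the last reload */
    list_for_each_entry(proc, &proc_list, list) {
        if ((proc->options & PROC_O_LEAVING) && !proc->sigterm_sent)
            kill(proc->pid, SIGTERM);
    }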
When the master performs a reexec, it should set the PROC_O_LEAVING flag for
an already existing worker. It means that the existing worker is marked as
the previous one and will be terminated after the reload.
In the previous implementation the master process needed to perform the
reexec twice (the first time to parse its configuration and the second time
to free unused resources). So the logic for setting PROC_O_LEAVING was based
on comparing the number of reloads performed by each process from the
processes list, except the master.
Now, as mentioned before, the reexec is performed only once. So, in this
case, we need to set the PROC_O_LEAVING flag when we deserialize the list. It
is done for all processes whose number of reloads is strictly positive.
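Schematically, while walking the freshly deserialized list (a sketch, not
the literal code):

    list_for_each_entry(proc, &proc_list, list) {
        /* any process that already survived a reload is a previous one */
        if (proc->reloads > 0)
            proc->options |= PROC_O_LEAVING;
    }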
Previously, reexec_on_failure() was called when the process failed after a
reload, while it was parsing its configuration or trying to apply it.
reexec_on_failure() called mworker_reexec() and the master process was
re-executed.
With the new architecture, in such cases there is no longer any need to
re-execute the master process again after its reload. It simply keeps the
previous worker, forked before the reload, and lets the new one exit with an
error. But we still need the code which increments the number of failed
reloads and notifies systemd with the new "Reload failed!" status. So, let's
reuse and adapt reexec_on_failure() for this, and let's rename it to
on_new_child_failure().
Basically, this is the continuation of the previous commits. So, here, after
the fork, the worker process closes the "master" end of the copied CLI
sockpair and binds its own end, ipc_fd[1], to the GLOBAL proxy listener.
mworker_cli_global_proxy_new_listener() guarantees that the GLOBAL proxy will
be created, if it wasn't the case before.
The master process at first allocates the MASTER proxy, creates the master
CLI listener (-S command line option) and the reload sockpair, and then
closes the "worker" end of the copied CLI sockpair and binds its own end,
ipc_fd[0], to the created MASTER proxy.
Usage of the new PROC_O_INIT state helps to reduce the test conditions
needed to find the newly forked worker.
Here, to distinguish the new worker from the previous one, let's add a new
process state, PROC_O_INIT, and let's set it when the memory is allocated for
the new worker in the processes list.
For the master process we always need to create a MASTER proxy, even if the
master CLI settings were not provided via the command line, because we now
bind a listener at ipc_fd[0] in the master process context. So the MASTER
proxy must already be allocated at this moment.
The main idea here is to create the inherited master CLI sockpair just
before the master-worker fork, and only then, after the fork, let each
process bind the needed listener to its own end of this sockpair.
This way the master and worker processes can close the unused "ends" of their
sockpair copies (ipc_fd[0] for the worker and ipc_fd[1] for the master).
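The intended sequence, as a minimal sketch (error handling omitted):

    int ipc_fd[2];
    socketpair(AF_UNIX, SOCK_STREAM, 0, ipc_fd);
    if (fork() == 0) {
        /* worker: the master's end is useless here */
        close(ipc_fd[0]);
        /* ... bind ipc_fd[1] to the GLOBAL proxy listener ... */
    }
    else {
        /* master: the worker's end is useless here */
        close(ipc_fd[1]);
        /* ... bind ipc_fd[0] to the MASTER proxy listener ... */
    }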
When this sockpair creation happens inside
mworker_cli_global_proxy_new_listener(), it is not possible for the master to
close ipc_fd[1], bound to the GLOBAL proxy listener, as this triggers a
BUG_ON(fd->owner) in fd_insert() in the master context: the master process
has already entered its polling loop, and the poller, in its turn, tries to
reuse the closed fd.
Let's rename mworker_cli_sockpair_new() to
mworker_cli_global_proxy_new_listener() to outline that this function creates
the GLOBAL proxy, allocates a listener with the "master-socket" bind conf and
attaches this listener to this GLOBAL proxy. The listener is bound to
ipc_fd[1] of the sockpair inherited in the master and in the worker (the
master CLI sockpair).
The *thread keyword parsers are sensitive to the position of the global
section: if these keywords are present there, the global section must be the
first section in the configuration. The *thread parsers' logic is based on
the non_global_section_parsed counter. So, we need to reset it explicitly
before the second configuration read, performed by the worker or in
standalone mode.
This commit is part of the series to add support for discovery mode in the
configuration parser and in the initialization sequence.
In order to support discovery mode, we need to read the configuration twice.
So, we need to split the stage where we load all configuration files from the
stage where we parse them. To do this, let's extract from read_cfg() the part
where we load the configuration files into a separate function, load_cfg().
This way we can call only the parsing part as many times as we need.
Before reading the configuration for the first time we set MODE_DISCOVERY.
After this read, the mode is immediately unset, as the real runtime mode has
already been set by the discovery keyword parsers.
The second read is performed once all primary runtime modes (daemon,
master-worker) are applied, because we should not read the configuration
twice in the master process.
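Schematically (a sketch, not the literal code):

    global.mode |= MODE_DISCOVERY;
    read_cfg();                     /* only discovery keywords are parsed */
    global.mode &= ~MODE_DISCOVERY; /* the real runtime mode is known now */
    /* ... daemonization and master-worker forks happen here ... */
    if (!master)
        read_cfg();                 /* full parsing: worker or standalone */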
MODE_MWORKER_WAIT became redundant with MODE_MWORKER after moving the
master-worker fork into init(). This change allows the master to no longer
perform a reexec just after forking in order to free additional memory.
As we set the 'master' variable after the fork in the master process, we can
now replace MODE_MWORKER_WAIT in some 'if' statements with a simple check of
this 'master' variable.
Let's also continue to get rid of the HAPROXY_MWORKER_WAIT_ONLY environment
variable, as it's no longer needed either.
In cfg_program_postparser(), which is used to check whether a cmdline is
defined to launch a program, we completely remove the mode check for now,
because the master process does not parse the configuration for the moment.
'program' section parsing will be reintroduced in the master in the next
commits.
As we no longer support MODE_MWORKER_WAIT for the master (it became
redundant with MODE_MWORKER after moving the master-worker fork into init()),
let's rename the exit_on_waitmode_failure() callback to just
exit_on_failure().
This is the first commit to prepare the removal of MODE_MWORKER_WAIT
support. It became redundant with MODE_MWORKER after moving the master-worker
fork into init().
The master process no longer performs a reexec to free additional memory
after forking, and no longer changes its mode to MODE_MWORKER_WAIT, in which
it used to enter its wait polling loop and handle signals. Now the master
enters this loop almost immediately after forking a worker, always staying in
MODE_MWORKER.
So, we can remove the mworker_reexec_waitmode() wrapper, which was used to
set the HAPROXY_MWORKER_WAIT_ONLY variable and to call mworker_reexec(). But
let's keep the logic of the reexec_on_failure() atexit callback for the
master for the moment, in case we need to support this case again in the
future.
Due to moving the master-worker fork into init(), we need to protect the
prepare_caps_from_permitted_set() call, which is executed after init(). This
call only makes sense for the worker, daemon and foreground mono-process
modes.
prepare_caps_from_permitted_set() allows reading Linux capabilities from the
haproxy binary and moving some of them into the process Effective set, if the
'setcap' keyword lists the needed capabilities in the global section.
There are two set_identity() calls, both under almost the same condition:
'if ((global.mode & (MODE_MWORKER|MODE_DAEMON...)...'
The first call serves to change the uid/gid and to set some needed Linux
capabilities, only for a process in foreground mode. The second comes after
the master-worker fork and allows doing the same in daemon and worker modes.
Due to moving the master-worker fork into init() in a previous commit, the
second set_identity() is no longer under the 'if'. So it is executed for all
modes except MODE_MWORKER. Now, in MODE_MWORKER, the process enters its wait
polling loop just after forking a worker, and it terminates almost
immediately if it exits this loop.
The worker, the daemon and a process in foreground mode will perform
set_identity() as before, but now it is called in one place in main().
global.last_checks must be verified just after the set_identity() call. As
stated in the comments, some configuration options may require full
privileges, or some Linux capabilities may need to be granted to the process.
set_identity(), via prepare_caps_for_setuid(), may put the configured
capabilities into the process Effective set and, hence, remove the respective
flag from global.last_checks.
There are two 'chroot' code blocks, both under almost the same condition:
'if ((global.mode & (MODE_MWORKER|MODE_DAEMON...)...'
The first block serves to perform the chroot only for a process in foreground
mode. The second comes after the master-worker fork and allows doing the
chroot in daemon and worker modes.
Due to moving the master-worker fork into init() in a previous commit, the
second 'chroot' code block is no longer under the 'if'. So it is executed for
all modes except MODE_MWORKER. Now, in MODE_MWORKER, the process enters its
wait polling loop just after forking a worker, and it terminates almost
immediately if it exits this loop.
The worker, the daemon and a process in foreground mode will perform the
chroot as before, but now it is done in one place in main().
mworker_create_master_cli() creates the MASTER proxy and allocates the
listeners attached to this proxy. It also creates the reload sockpair.
So, it's more appropriate to check that we are in MODE_MWORKER, if master CLI
settings were provided via the command line, just after the config parsing.
Only then, if the runtime mode and the command line settings are coherent, do
we try to perform the master-worker fork and to create the master CLI.
After moving the master-worker fork into init() and reintroducing it into a
switch-case (see the previous commit), it is more appropriate to set
nbthread=1 and nbtgroups=1 immediately in the 'case' for the parent process.
Before this fix, the startup logs ring was duplicated before the fork(), so
the master and the worker both had the original startup_logs ring and the
duplicated one. In the worker context we freed the original ring and used the
duplicated one. In the master context we did nothing, yet we still created a
duplicated copy again and again on each reload.
So, let's duplicate the startup logs ring only in the worker context. The
master continues to use the original ring initialized in init() before its
fork().
This refactoring allows simplifying the 'master-worker' logic. With this
change, the master process forks a worker very early, at the initialization
stage, which allows performing the configuration parsing only for the worker.
In reality, only the worker process needs to parse and apply the whole
configuration. The master process just polls the master CLI sockets, watches
the worker status, catches its termination state and handles the signals.
With this refactoring, there is no longer any need for the master to perform
a re-execution after reading the whole configuration file in order to free
additional memory. And there is no longer any need for the worker to register
atexit callbacks in order to free the memory when it fails to apply the new
configuration. In contrast, we now need to set the proc_self pointer to the
new worker's entry in the processes list just after the fork, in the worker
process context. proc_self is dereferenced in
mworker_sockpair_register_per_thread(), which is called when the worker
enters its polling loop.
Following patches will try to gather more 'worker'- and 'master'-specific
code in the dedicated cases of this new fork() switch, or in separate
functions.
Let's move the PID handling from the main() code into init(). It is more
appropriate to open the pidfile and to write the process PID just after the
daemonization fork. In daemon mono-process mode, we simply write the PID of
the process, which is already in the background. In 'master-worker' mode, we
keep the previous behaviour and write only the PID of the master process.
This allows removing redundant tests of the process execution mode, tests of
the pidfd value and the consequent writes to this pidfd. This patch prepares
the refactoring of the master-worker fork, which will be moved into the
init() function as well.
Let's move the daemonization fork into init(). We always need to perform
this fork before forking a worker process, in order to be able to launch the
master and then its worker in daemon, i.e. background, mode, if haproxy was
started with the '-D' option.
This refactoring is a preparation step, needed to then move the master-worker
fork into init() as well. This allows the master process not to read the
whole configuration file and not to perform a re-execution in order to free
additional memory once the worker is forked. In the new refactored design,
only the worker process will read and apply the new configuration, while the
master will arrive very quickly in its polling loop to wait for the worker's
termination and to handle signals. See more details in the following commits.
As the master process performs the execvp() syscall to handle the USR2 and
HUP signals in mworker_reexec(), let's add the O_CLOEXEC flag when we open
'/dev/null', in order to avoid an fd leak.
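The change itself boils down to one flag (a sketch):

    /* closed automatically across execvp() in mworker_reexec(), so the
     * fd can no longer leak into the re-executed master
     */
    int fd = open("/dev/null", O_RDWR | O_CLOEXEC);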
This is a preparation step to refactor the master-worker logic. See more
details in the next commits.
In proxies, stick-tables, servers, etc... at plenty of places we store
a file name and a line number. Some file names are the result of strdup()
(e.g. in proxies), others not (e.g. stick-tables) and leave dangling
pointers at the end of parsing. The risk of double-free is not null
either.
In order to stop this, let's first add a simple tool that allows to
register short strings inside a global list, these strings happening
to be file names. The strings are either duplicated and stored upon
failure to find them, or just added to this storage. Since file names
are not expected to disappear before the end of the process, for now
we don't even implement refcounting, and we free them all at the end.
There's already a drop_file_name() function to reset the pointer like
ha_free() used to do, and even if not strictly needed it's a good
habit to get used to doing it.
The strings are returned as const so that they're stored as-is in
structs, and that nasty free() calls are easily caught. The pointer
points to the char[] storage inside the node itself. This way later
if we want to implement refcounting, it will be trivial to just look
up a string and change its associated node's refcount. If needed,
comparisons can also be made on pointers.
For now they're not used yet and are released on deinit().
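For illustration, the intended usage pattern, assuming the registration
helper is named copy_file_name() (the counterpart of the drop_file_name()
mentioned above):

    /* store an interned copy instead of a private strdup() */
    px->conf.file = copy_file_name(file);
    /* on release: only resets the pointer, the registry owns the string */
    drop_file_name(&px->conf.file);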
This fixes 7b78e1571 ("MINOR: mworker: restore initial env before wait
mode").
In cases when haproxy starts without any configuration, for example
'haproxy -vv', the init_env array used to back up env variables is never
allocated. So we need to check in deinit(), when we free its memory, that
init_env is not a NULL ptr.
This patch is the follow-up of 1811d2a6ba (MINOR: tools: add helpers to
backup/clean/restore env).
In order to avoid unexpected behaviour in master-worker mode during a process
reload with a new configuration, when the old one contained '*env' keywords,
let's back up the initial environment before calling parse_cfg(), and let's
clean and restore it in the master process context, just before it enters its
wait polling loop.
This guarantees that new workers will have the new, updated environment, and
not the previous one inherited from the master, which does not read the
configuration when it's in wait-mode.
Define a new buffer pool reserved for allocating smaller memory areas. For
the moment, its usage will be restricted to QUIC, and as such it is declared
in the quic_stream module.
Add a new config option "tune.bufsize.small" to specify the size of the
allocated objects. A special check ensures that it is not greater than the
default bufsize, to avoid unexpected effects.
The QUIC MUX buffer allocation limit is now directly based on the underlying
congestion window size. The previous static limit based on conn-tx-buffers is
now unused. As such, this commit adds a warning to notify users that this
setting is now obsolete.
Secondly, update the max-window-size setting. It is now the main entry point
to limit both the maximum congestion window size and the number of QUIC MUX
buffers allocated for emission. Remove its special value '0', which was used
to automatically adjust it based on the now unused conn-tx-buffers.
Define a new global keyword, tune.quic.frontend.max-window-size. It allows
setting globally the maximum congestion window size for each QUIC frontend
connection.
The default value is 0. It is a special value which automatically derives the
size from the configured QUIC connection buffer limit. This is similar to the
previous "quic-cc-algo" behavior; that keyword can still be used to override
the maximum window size per bind line.
As readcfgfile() no longer opens configuration files and reads them with
fgets(), but only performs the parsing of the provided data, let's rename it
to parse_cfg(), by analogy with read_cfg() in haproxy.c.
Let's call the load_cfg_in_ram() helper for each configuration file to load
its content into an area in memory. Adapt the readcfgfile() parser function
accordingly. In order to limit the changes in its scope, we pass as an
argument a cfgfile structure, already filled in init_args() and in
load_cfg_in_ram() with the file metadata and content.
The parser function (readcfgfile()) now uses fgets_from_mem() instead of the
standard fgets() from the libc implementations.
The SPOE filter parses its own configuration file, pointed to by the
'config' keyword in the configuration already loaded in memory. So, let's
allocate and fill for this a supplementary cfgfile structure, which is not
referenced in the cfg_cfgfiles list. This structure, and the memory holding
the content of the SPOE filter configuration, are freed immediately in
parse_spoe_flt(), when readcfgfile() returns.
The HAProxy OpenTracing filter also uses its own configuration file. So,
let's follow the same logic as we do for the SPOE filter.
This commit prepares read_cfg() to call the load_cfg_in_mem() helper in
order to load the configuration files in memory.
Before, read_cfg() called the parser for all files from the cfg_cfgfiles list
and accumulated the parser's errors and memprintf's errors in a for_each
loop. memprintf's errors did not stop this loop and were accounted for just
after it.
Now, as we plan to load the configuration files in memory, we stop the loop
if memprintf() fails, and we show an appropriate error message with ha_alert.
Then the process terminates. So, in this case, not all accumulated
syntax-related errors will be shown before exiting, and we have to stop,
because we have run out of memory.
If we can't open the current file, or we fail to allocate memory to store
some configuration line, the previous behaviour is kept: the process emits an
appropriate alert message and exits.
If the parser returns some syntax-related error on the current file, the
previous behaviour is kept as well. We accumulate such errors for all parsed
files, and we check them just after the loop. All syntax-related errors for
all files are then shown, as before, in ha_alert messages, line by line,
during startup. Then the process exits with 1.
As the cfg_cfgfiles list now contains many pointers to memory areas with the
configuration files' content, and this content could be big, it's better to
free the list explicitly when parsing is finished. So, let's change
read_cfg() to return an integer value to its caller, init(), and let's
perform the free routine at the caller level, as the cfg_cfgfiles list was
initialized and initially filled at this level.
The list_append_word() helper was previously used only to chain
configuration file names in a list. As we now start to use the cfgfile
structure, which represents an entire file in memory along with its metadata,
let's adapt this helper to use this structure, and let's rename it to
list_append_cfgfile().
Adapt the functions which process configuration files and directories to use
the cfgfile structure and list_append_cfgfile() instead of a wordlist.
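The structure's shape, for illustration (field names approximate):

    struct cfgfile {
        struct list list; /* linked into the cfg_cfgfiles list */
        char *filename;
        char *content;    /* the whole file, loaded in memory */
        ssize_t size;
    };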
Let's check the global counter of "ha_warning" messages a second time if
zero-warning is set, and let's do this just before forking. At this moment we
are sure that we've already done all the init operations where we could emit
a "ha_warning", and we still have the stderr fd open.
Even with this second check, we could lose some late and rare warnings about
failing to drop supplementary groups and about re-enabling core dumps. Notes
about this are added to the 'zero-warning' keyword description.
This patch is done in order to prepare the move of the handlers which
compute and check process-related limits, such as maxconn, maxsock and
maxpipes.
So, these handlers are no longer static, due to the future move.
We also add the handler declarations in limits.h in this patch, in order to
keep the next patch, dedicated to the code move, free of any additional
modifications.
Such a split also ensures that this patch can be compiled separately from the
next one, where we move the handlers. This is important for git-bisect.
The code which gets, sets and checks the initial and current fd limits and
process-related limits (maxconn, maxsock, ulimit-n, fd-hard-limit) is spread
across different functions in haproxy.c and in fd.c. Let's group it together
in dedicated limits.c and limits.h files.
This patch is done in order to prepare the move of limits-related functions
from different places to the new 'limits' compilation unit. It helps to keep
the next patch, which will only do the move without any additional
modifications, clean.
Such a detailed split is needed in order to be sure not to accidentally break
the limits logic, and in order to be able to compile each commit separately
for git-bisect.
This commit fixes 41275a691 ("MEDIUM: init: set default for fd_hard_limit
via DEFAULT_MAXFD").
fd_hard_limit is implicitly taken into account via the 'ideal_maxconn' value
in all maxconn adjustments, when global.rlimit_memmax is set:
  MIN(global.maxconn capped by global.rlimit_memmax, ideal_maxconn)
It also caps the provided global.rlimit_nofile, if it couldn't be set as the
current process fd limit (see more details in the main() code).
So, let's set the default value for fd_hard_limit only when no other
haproxy-specific limit is provided, i.e. rlimit_memmax, maxconn,
rlimit_nofile. Otherwise we may break users' configs.
Please note that in master-worker mode the master does not need
DEFAULT_MAXFD (1048576) either, as we explicitly limit its maxconn to 100.
Must be backported to all stable versions down to v2.6.0, including v2.6.0,
like the commit above.
Let's provide a default value for fd_hard_limit, if it's not set in the
configuration. With this patch a specific default can also be set via the
compile-time variable DEFAULT_MAXFD. Hopefully this will be helpful for
haproxy package maintainers.
  make -j 8 TARGET=linux-glibc DEBUG=-DDEFAULT_MAXFD=50000
If haproxy is compiled without DEFAULT_MAXFD defined, the default is set to
1048576.
This is done to avoid the process being killed by its watchdog, when it is
started without any limitations in its configuration or on the command line
and the hard RLIMIT_NOFILE is extremely huge (~1000000000). In this case we
use compute_ideal_maxconn() to calculate maxconn and maxsock; maxsock defines
the size of the internal fdtab, which becomes very large as well. When the
process starts to simply loop over this fdtab (O(n)), this takes a lot of
time, so the watchdog does its job.
To avoid this, maxconn is now always reduced to some reasonable value, either
by an explicit global.fd-hard-limit from the configuration, or by its
default. The default may be changed at build time and then overridden by
global.fd-hard-limit at runtime. An explicit global.fd-hard-limit from the
configuration always takes precedence over DEFAULT_MAXFD, if set.
Must be backported to all stable versions down to v2.6.0, including v2.6.0.
The haproxy master process should not read its configuration a second time
after performing the reexec and switching to MODE_MWORKER_WAIT. So, to make
this part of the init() function more readable and to better distinguish the
point where configs have been read, let's encapsulate it in a separate
function.
Let's encapsulate the logic creating the 'reload' sockpair and the master
CLI listeners, used by the master CLI, into a separate function, as we need
this only in master-worker runtime mode. This makes the code of init() more
readable.
As the MODE_CHECK_CONDITION logic terminates the process anyway, no matter
whether the test of the provided condition was successful or not, let's
encapsulate it in a separate function. This makes the code of init() more
readable.
In MODE_CHECK_CONDITION we only parse the check_condition string, provided
by '-cc', and then we evaluate it. The haproxy process terminates at the end
of the {if..else} block anyway, whether the test has failed or passed. So it
is more appropriate to perform the MODE_CHECK_CONDITION test first, and then
do all the other process runtime mode verifications.
The 'show dev' command is very convenient for obtaining haproxy debugging
information while the process runs in a container. Let's extend its output
with the version and the cmdline. The cmdline is useful, as it shows the
absolute binary path and its arguments, because sometimes the person who is
debugging the failing container is not the same one who created and deployed
it.
argc and argv are stored in the exported global structure, because
feed_post_mortem() is added as a post-check function callback in the
post_check_list. So we can't simply change the signature of
feed_post_mortem() without breaking the other post-check callback APIs.
Parsers are not supposed to modify argv, so we can safely pass its pointer to
debug_parse_cli_show_dev(), without copying all the argument strings
somewhere on the heap or on the stack.
The previous commit introduced a new protection mechanism to forbid
communications with clients which use a privileged source port. By default,
this mechanism is disabled for every protocol.
This patch changes the default value and activates the protection mechanism
for the QUIC protocol. This is justified as such traffic is a probable sign
of a DNS/NTP amplification attack.
This is labelled as major as it can be a breaking change in some network
environments.
This commit introduces a new global setting named
harden.reject_privileged_ports.{tcp|quic}. When active, communications with
clients which use privileged source ports are forbidden. Such behavior is
considered suspicious as it can be a sign of spoofing or of a DNS/NTP
amplification attack.
The value is configured per transport protocol. For TCP and QUIC, distinct
code locations are impacted by this setting. The first one is in
sock_accept_conn(), which acts as a filter for all TCP-based communications
just after accept() returns a new connection. The second one is dedicated to
QUIC communication, in quic_recv(). In both cases, if a privileged source
port is used and the setting is enabled, the received message is silently
dropped.
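Both call sites boil down to the same test; as a sketch (get_host_port() is
the existing address helper):

    /* true when the client uses a privileged (< 1024) source port */
    static int src_port_is_privileged(const struct sockaddr_storage *src)
    {
        int port = get_host_port(src);
        return port > 0 && port < 1024;
    }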
By default, the protection is disabled for both protocols. This is to be
able to backport it without breaking changes on stable releases.
This should be backported as it is an interesting security feature, yet
relatively simple to implement.
This patch fixes commit eea152ee68 ("BUG/MINOR: signals/poller: ensure
wakeup from signals").
There is some probability that run_poll_loop() loops infinitely, if
TH_FL_SLEEPING is withdrawn from all threads in the second signal_queue_len
check, when a signal is received just after the first one.
In this particular case, the 'wake' variable, which is used to terminate the
thread's poll loop, is never reset to 0. So we never enter the "stopping"
part of run_poll_loop(), and the threads other than the one with id 0 (tid 0
handles signals) will continue to call _do_poll() eternally and will never
sleep, as their TH_FL_SLEEPING flag was unset.
This flag needs to be removed only for tid 0, as was done in the first
signal_queue_len check.
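A sketch of the fixed check in run_poll_loop() (simplified):

    if (signal_queue_len && tid == 0) {
        /* only the signal-handling thread clears its SLEEPING flag here;
         * clearing it on the other threads left their 'wake' at 1
         * forever, hence the infinite loop
         */
        _HA_ATOMIC_AND(&th_ctx->flags, ~TH_FL_SLEEPING);
        wake = 1;
    }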
This fixes issue #2537 "infinite loop when shutting down".
This fix must be backported to every stable version.
If the 'namespace' keyword is used in the backend server settings and/or in
the bind string, it means that the haproxy process will call setns() to
change its default namespace to the configured one and then create a socket
in this new namespace. The setns() syscall requires the CAP_SYS_ADMIN
capability in the process Effective set (see man 2 setns). Otherwise, the
process must be run as root.
To avoid running haproxy as root, let's add the cap_sys_admin capability in
the same way as we already added the support for some other network
capabilities.
As CAP_SYS_ADMIN belongs to the CAP_SYS_* capabilities type, let's add a
separate flag, LSTCHK_SYSADM, for it. This flag is set if the 'namespace'
keyword was found during configuration parsing. The flag may be unset only in
prepare_caps_for_setuid() or in prepare_caps_from_permitted_set(), which
inspect the process EUID/RUID and its Effective and Permitted capability
sets.
If the system doesn't support Linux capabilities, or 'cap_sys_admin' was not
set via 'setcap' but the 'namespace' keyword is present in the configuration,
we keep the previous strict behaviour: a process that has changed its uid to
a non-privileged user will terminate with an alert. This alert invites the
user to recheck the configuration.
In the case when haproxy starts and runs under a non-root user and
'cap_sys_admin' is not set, but the 'namespace' keyword is present, this
patch does not change the previous behaviour either: we still let the user
try the configuration, but we inform them via a warning that unexpected
things, like socket creation errors, may occur.
Since the introduction of the automatic seamless reload using the internal
socketpair, there has been no way of disabling the seamless reload.
Previously we just needed to remove -x from the startup command line, and
remove any "expose-fd" keyword from the stats socket lines.
This was introduced in 2be557f7c ("MEDIUM: mworker: seamless reload use the
internal sockpairs").
This patch copies /dev/null again and passes it to the next exec, so we never
try to get a socket from -x.
Must be backported as far as 2.6.
This commit is the first one of a series to implement the preloading of
haproxy counters via stats-file parsing.
This patch defines a basic apply_stats_file() function. It implements
line-by-line reading of a stats-file, without any parsing for the moment. It
is called automatically on process startup via init().
This commit is part of a series to align counter usage between
frontends/listeners on one side and backends/servers on the other.
The "stot" metric refers to the total number of sessions. On the backend
side, it is interpreted as a number of streams. Previously, this was
accounted for using the <cum_sess> be_counters field for servers, but
<cum_conn> instead for backend proxies.
Adjust this by using <cum_sess> for both proxies and servers. As such, the
<cum_conn> field can be removed from be_counters.
Note that several diagnostic messages which report total frontend and backend
connections were adjusted to use <cum_sess>. However, this is outdated and
misleading information, as it actually reports the stream count on the
backend side. These messages should be fixed in a separate commit.
This should be backported to all stable releases.
Limiting the total allocatable process memory (VSZ) by setting the RLIMIT_AS
limit is no longer effective at restricting memory consumption at run time.
We can see from the process memory map below that there are many holes within
the process VA space, which bump its VSZ to 1.5G. These holes exist for many
reasons, and can be explained first by the full randomization of the system
VA space, which is now usually enabled in Linux kernels by default. There are
always gaps around the process stack area to trap overflows. Holes before and
after shared libraries can be explained by the fact that, on many
architectures, libraries have a 'preferred' address to be loaded at; putting
them elsewhere requires relocation work, and probably some unshared pages.
The repetitive holes of 65380K most probably correspond to the header that
malloc has to allocate before the claimed memory block. This header is used
by malloc to link allocated chunks together and for its internal bookkeeping.
$ sudo pmap -x -p `pidof haproxy`
127136: ./haproxy -f /home/haproxy/haproxy/haproxy_h2.cfg
Address Kbytes RSS Dirty Mode Mapping
0000555555554000 388 64 0 r---- /home/haproxy/haproxy/haproxy
00005555555b5000 2608 1216 0 r-x-- /home/haproxy/haproxy/haproxy
0000555555841000 916 64 0 r---- /home/haproxy/haproxy/haproxy
0000555555926000 60 60 60 r---- /home/haproxy/haproxy/haproxy
0000555555935000 116 116 116 rw--- /home/haproxy/haproxy/haproxy
0000555555952000 7872 5236 5236 rw--- [ anon ]
00007fff98000000 156 36 36 rw--- [ anon ]
00007fff98027000 65380 0 0 ----- [ anon ]
00007fffa0000000 156 36 36 rw--- [ anon ]
00007fffa0027000 65380 0 0 ----- [ anon ]
00007fffa4000000 156 36 36 rw--- [ anon ]
00007fffa4027000 65380 0 0 ----- [ anon ]
00007fffa8000000 156 36 36 rw--- [ anon ]
00007fffa8027000 65380 0 0 ----- [ anon ]
00007fffac000000 156 36 36 rw--- [ anon ]
00007fffac027000 65380 0 0 ----- [ anon ]
00007fffb0000000 156 36 36 rw--- [ anon ]
00007fffb0027000 65380 0 0 ----- [ anon ]
...
00007ffff7fce000 4 4 0 r-x-- [ anon ]
00007ffff7fcf000 4 4 0 r---- /usr/lib/x86_64-linux-gnu/ld-2.31.so
00007ffff7fd0000 140 140 0 r-x-- /usr/lib/x86_64-linux-gnu/ld-2.31.so
...
00007ffff7ffe000 4 4 4 rw--- [ anon ]
00007ffffffde000 132 20 20 rw--- [ stack ]
ffffffffff600000 4 0 0 --x-- [ anon ]
---------------- ------- ------- -------
total kB 1499288 75504 72760
This inflated VSZ makes it impossible to start an haproxy process with a
200M memory limit, set at its initialization stage as RLIMIT_AS. In this case
we usually get this kind of cryptic output on stderr:
$ haproxy -m 200 -f haproxy_quic.cfg
(null)(null)(null)(null)(null)(null)
At the same time the process RSS (the memory really used) is only 75.5M.
So, to make the process memory accounting more realistic, let's base the
memory limit set by the -m option on the RSS measurement, and let's use
RLIMIT_DATA instead of RLIMIT_AS.
RLIMIT_AS was used before because earlier versions of haproxy always
allocated memory buffers for new connections, but data was not written there
immediately. So these buffers were not instantly counted in RSS, but were
always counted in VSZ. Now we allocate new buffers only when we are about to
write some data into them immediately, so using RLIMIT_DATA becomes more
appropriate.
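A minimal sketch of the new -m handling (error handling omitted;
setrlimit() comes from <sys/resource.h>):

    struct rlimit limit;
    limit.rlim_cur = limit.rlim_max = global.rlimit_memmax * 1048576ULL;
    /* caps the heap, which closely tracks RSS, instead of the whole
     * VA space as RLIMIT_AS did
     */
    setrlimit(RLIMIT_DATA, &limit);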
The CPU variable, when used, is almost always exclusively used with
"generic" to disable any CPU-specific optimizations, or "native" to
enable "-march=native". Other options are not used and are just making
CPU_CFLAGS more confusing.
This commit just drops all pre-configured variants and replaces them
with documentation about examples of supported options. CPU_CFLAGS is
preserved as it appears that it's mostly used as a proxy to inject the
distro's CFLAGS, and it's just empty by default.
The CPU variable is checked, and if set to anything but "generic", it
emits a warning about its deprecation and invites the user to read
INSTALL.
Users who would just set CPU_CFLAGS will be able to continue to do so,
those who were using CPU=native will have to pass CPU_CFLAGS=-march=native
and those who were passing one of the other options will find it in the
doc as well.
Note that this also removes the "CPU=" line from haproxy -vv, that most
users got used to seeing set to "generic" or occasionally "native"
anyway, thus that didn't provide any useful information.
Since the Linux capabilities support add-on (see commit bd84387beb
("MEDIUM: capabilities: enable support for Linux capabilities")), we can also
check the haproxy process Effective and Permitted capability sets when it
starts and runs as non-root.
This way, if the needed network capabilities are present only in the process
Permitted set, we can obtain this information with capget and put them into
the process Effective set via capset. To do this properly, let's introduce
prepare_caps_from_permitted_set().
First, it checks if the binary's Effective set has CAP_NET_ADMIN or
CAP_NET_RAW. If there is a match, LSTCHK_NETADM is removed from
global.last_checks to avoid a warning, because in the initialization sequence
some last configuration checks are based on the LSTCHK_NETADM flag, and the
haproxy process euid may stay unprivileged.
If neither CAP_NET_ADMIN nor CAP_NET_RAW is in the Effective set, the
Permitted set is checked, and only the capabilities given via the 'setcap'
keyword are promoted into the process Effective set. LSTCHK_NETADM is also
removed in this case, for the same reason. In order to be transparent, we
promote from the Permitted set only the capabilities given by the user via
the 'setcap' keyword. So, if the caplist doesn't include CAP_NET_ADMIN or
CAP_NET_RAW, LSTCHK_NETADM will not be unset, and a warning about missing
privileges will be emitted at initialization.
It needs to be called before protocol_bind_all(), to allow binding to
privileged ports under non-root; 'setcap cap_net_bind_service' must be set in
the global section in this case.
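A sketch of the promotion step, using the raw Linux syscalls that the tree
wraps (error handling omitted; capabilities 0..31 live in data[0]):

    struct __user_cap_header_struct hdr = {
        .version = _LINUX_CAPABILITY_VERSION_3,
        .pid = 0, /* the current process */
    };
    struct __user_cap_data_struct data[2];
    capget(&hdr, data);
    /* promote only what 'setcap' listed and the Permitted set allows */
    data[0].effective |= data[0].permitted & caplist;
    capset(&hdr, data);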
Linux capabilities support, and the ability to preserve capabilities for the
running process after switching to global.uid, was added recently by commit
bd84387beb ("MEDIUM: capabilities: enable support for Linux capabilities").
This new feature hasn't yet been taken into account by the last config
checks, which are performed at the initialization stage.
So, to update this, let's perform the check after the set_identity() call.
This way, the current EUID has already been changed to global.uid, and
prepare_caps_for_setuid() will unset the LSTCHK_NETADM flag only if the
capabilities given via the 'setcap' keyword in the configuration file were
preserved.
Otherwise, if the system doesn't support Linux capabilities or they were not
set via 'setcap', we keep the previous strict behaviour: the process will
terminate with an alert, in order to insist that the user either needs to
change the run UID (worst case: start and run as root), or needs to
set/recheck the capabilities listed as 'setcap' arguments.
In the case when haproxy starts and runs under a non-root user, this patch
doesn't change the previous behaviour: we still let the user try the
configuration, but we inform them via a warning that unexpected things may
occur.
Needs to be backported as far as v2.9, including v2.9.
PR_FL_CHECKED is set on a proxy once the proxy configuration has been fully
checked (including postparsing checks).
This information may be useful to functions that need to know whether some
config-related proxy properties are likely to change or not, due to parsing
or postparsing/check logic. Also, at runtime, except for some rare cases,
config-related proxy properties are not supposed to be changed.
As per the `sd_notify` manual:
> A field carrying the monotonic timestamp (as per CLOCK_MONOTONIC) formatted
> in decimal in μs, when the notification message was generated by the client.
> This is typically used in combination with "RELOADING=1", to allow the
> service manager to properly synchronize reload cycles. See systemd.service(5)
> for details, specifically "Type=notify-reload".
Thus this change allows users with a recent systemd to switch to
`Type=notify-reload`, should they desire to do so. Correct behavior was
verified with a Fedora 39 VM.
see systemd/systemd#25916
[wla: the service file should be updated this way:]
diff --git a/admin/systemd/haproxy.service.in b/admin/systemd/haproxy.service.in
index 22a53d8aab..8c6dadb5e5 100644
--- a/admin/systemd/haproxy.service.in
+++ b/admin/systemd/haproxy.service.in
@@ -8,12 +8,11 @@ EnvironmentFile=-/etc/default/haproxy
EnvironmentFile=-/etc/sysconfig/haproxy
Environment="CONFIG=/etc/haproxy/haproxy.cfg" "PIDFILE=/run/haproxy.pid" "EXTRAOPTS=-S /run/haproxy-master.sock"
ExecStart=@SBINDIR@/haproxy -Ws -f $CONFIG -p $PIDFILE $EXTRAOPTS
-ExecReload=@SBINDIR@/haproxy -Ws -f $CONFIG -c $EXTRAOPTS
-ExecReload=/bin/kill -USR2 $MAINPID
KillMode=mixed
Restart=always
SuccessExitStatus=143
-Type=notify
+Type=notify-reload
+ReloadSignal=SIGUSR2
# The following lines leverage SystemD's sandboxing options to provide
# defense in depth protection at the expense of restricting some flexibility
Signed-off-by: William Lallemand <wlallemand@haproxy.com>
Thanks to the previous commit, we can now build with USE_SYSTEMD=1 on any
system without requiring any parts of systemd. It just turns out that there
was one remaining include in haproxy.c that needed to be replaced with
haproxy/systemd.h to build correctly. That's what this commit does.
Now the rings have one wait queue per group. This should limit the
contention on systems such as EPYC CPUs where the performance drops
dramatically when using more than one CCX.
Tests were run with different numbers, and they showed that the value 6
outperforms all other ones at 12, 24, 48, 64 and 80 threads on an EPYC, a
Xeon and an Ampere CPU. The value 7 sometimes comes close, and anything
around these values degrades quickly. The value has been left tunable in the
global section.
This commit only introduces everything needed to set up the queue count
so that it's easier to adjust it in the forthcoming patches, but it was
initially added after the series, making it harder to compare.
It was also shown that trying to group the threads in queues by their
thread groups is counter-productive and that it was more efficient to
do that by applying a modulo on the thread number. As surprising as it
seems, it does have the benefit of well balancing any number of threads.
Similarly to "expose-exprimental-directives" option, there is no a global
option to expose some deprecated directives. Idea is to have a way to silent
warnings about deprecated directives when there is no alternative solution.
Of course, deprecated directives covered by this option are not listed and
may change. It is only a best effort to let users upgrade smoothly.
-dI allows enabling "insecure-fork-wanted" directly from the command line,
which is useful when you want to run ASAN with addr2line with a lot of
configuration files, without editing them.
Let's say that the largest packet number acknowledged by the peer is #10.
When inspecting the not-yet-acknowledged packets to detect whether they are
lost, a packet is deemed lost at least if the difference between this largest
packet number and its own packet number is greater than or equal to the
packet reordering threshold as defined by RFC 9002. The latter must not be
less than QUIC_LOSS_PACKET_THRESHOLD (3). With such a value, packets #7 and
older are detected as lost if not acknowledged, contrary to packets #8 or #9.
So, the packet loss detection is very sensitive to a network characteristic
where non-acknowledged packets are distant from each other in terms of packet
number differences.
Do not use this static value anymore for the packet reordering threshold
which is used as a criterion to detect packet loss. Instead, make it depend
on the difference between the number of the last transmitted packet and the
number of the oldest one among the packets which are still in flight, before
they are inspected to be deemed lost.
Add a new tune.quic.reorder-ratio setting to apply a ratio in percent to this
dynamic packet reordering threshold.
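Schematically, the dynamic threshold becomes (names approximate):

    /* span of the packet numbers currently in flight */
    uint64_t span = largest_sent_pn - oldest_in_flight_pn;
    uint64_t threshold = span * global.tune.quic_reorder_ratio / 100;
    if (threshold < QUIC_LOSS_PACKET_THRESHOLD)
        threshold = QUIC_LOSS_PACKET_THRESHOLD;
    /* a packet #pn is deemed lost if largest_acked_pn - pn >= threshold */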
Should be backported to 2.6.
Final diags were added in 2.4 by commit 5a6926dcf ("MINOR: diag: create
cfgdiag module"), but the check is called too late in the startup process:
when "-c" is passed, the call is not made, while that's its primary use case.
Let's just move the call earlier.
Note that currently the check in this function is limited to verifying
unicity of server cookies in a backend, so it can be backported as far
as 2.4, but there is little value in insisting if it doesn't backport
easily.
Released version 3.0-dev1 with the following main changes :
- MINOR: channel: Use dedicated functions to deal with STREAMER flags
- MEDIUM: applet: Handle channel's STREAMER flags on applets size
- MINOR: applets: Use channel's field to compute amount of data received
- MEDIUM: cache: Save body size of cached objects and track it on delivery
- MEDIUM: cache: Add support for endp-to-endp fast-forwarding
- MINOR: cache: Add global option to enable/disable zero-copy forwarding
- MINOR: pattern: Use reference name as filename to read patterns from a file
- MEDIUM: pattern: Add support for virtual and optional files for patterns
- DOC: config: Add section about name format for maps and ACLs
- DOC: management/lua: Update commands about map and acl
- MINOR: promex: Add support for specialized front/back/li/srv metric names
- MINOR: promex: Export active/backup metrics per-server
- BUG/MINOR: ssl: Double free of OCSP Certificate ID
- MINOR: ssl/cli: Add ha_(warning|alert) msgs to CLI ckch callback
- BUG/MINOR: ssl: Wrong OCSP CID after modifying an SSL certficate
- BUG/MINOR: lua: Wrong OCSP CID after modifying an SSL certficate (LUA)
- DOC: configuration: typo req.ssl_hello_type
- MINOR: hq-interop: add fastfwd support
- CLEANUP: mux_quic: rename ffwd function with prefix qmux_strm_
- MINOR: mux-quic: add traces for 0-copy/fast-forward
- BUG/MINOR: mworker/cli: fix set severity-output support
- CLEANUP: mworker/cli: add comments about pcli_find_and_exec_kw()
- BUG/MEDIUM: quic: Possible buffer overflow when building TLS records
- BUILD: ssl: update types in wolfssl cert selection callback
- MINOR: ssl: activate the certificate selection callback for WolfSSL
- CI: github: switch to wolfssl git-c4b77ad for new PR
- BUG/MEDIUM: map/acl: pat_ref_{set,delete}_by_id regressions
- BUG/MINOR: ext-check: cannot use without preserve-env
- CLEANUP: mux-quic: remove unused prototype
- MINOR: mux-quic: clean up qcs Rx buffer allocation API
- MINOR: mux-quic: clean up qcs Tx buffer allocation API
- CLEANUP: mux-quic: clean up app ops callback definitions
- MINOR: mux-quic: factorize QC_SF_UNKNOWN_PL_LENGTH set
- MINOR: h3: complete traces for sending
- MINOR: h3: adjust zero-copy sending related code
- MINOR: hq-interop: use zero-copy to transfer single HTX data block
- BUG/MEDIUM: quic: QUIC CID removed from tree without locking
- BUG/MEDIUM: stconn: Block zero-copy forwarding if EOS/ERROR on consumer side
- BUG/MEDIUM: mux-h1: Cound data from input buf during zero-copy forwarding
- BUG/MEDIUM: mux-h1: Explicitly skip request's C-L header if not set originally
- CLEANUP: mux-h1: Fix a trace message about C-L header addition
- BUG/MEDIUM: mux-h2: Report too large HEADERS frame only when rxbuf is empty
- BUG/MEDIUM: mux-quic: report early error on stream
- DOC: config: add arguments to sample fetch methods in the table
- DOC: config: also add arguments to the converters in the table
- BUG/MINOR: resolvers: default resolvers fails when network not configured
- SCRIPTS: mk-patch-list: produce a list of patches
- DEV: patchbot: add the AI-based bot to pre-select candidate patches to backport
- BUG/MEDIUM: mux-h2: Switch pending error to error if demux buffer is empty
- BUG/MEDIUM: mux-h2: Only Report H2C error on read error if demux buffer is empty
- BUG/MEDIUM: mux-h2: Don't report error on SE if error is only pending on H2C
- BUG/MEDIUM: mux-h2: Don't report error on SE for closed H2 streams
- DOC: config: Update documentation about local haproxy response
- DEV: patchbot: use checked buttons as reference instead of internal table
- DEV: patchbot: allow to show/hide backported patches
- MINOR: h3: remove quic_conn only reference
- BUG/MINOR: server: Use the configured address family for the initial resolution
- MINOR: mux-quic: remove qcc_shutdown() from qcc_release()
- MINOR: mux-quic: use qcc_release in case of init failure
- MINOR: mux-quic: adjust error code in init failure
- MINOR: h3: add traces for connection init stage
- BUG/MINOR: h3: properly handle alloc failure on finalize
- MINOR: h3: use INTERNAL_ERROR code for init failure
- BUG/MAJOR: stconn: Disable zero-copy forwarding if consumer is shut or in error
- MINOR: stats: store the parent proxy in stats ctx (http)
- BUG/MEDIUM: stats: unhandled switching rules with TCP frontend
- MEDIUM: proxy: set PR_O_HTTP_UPG on implicit upgrades
- MINOR: proxy: monitor-uri works with tcp->http upgrades
- OPTIM: server: eb lookup for server_find_by_name()
- OPTIM: server: ebtree lookups for findserver_unique_* functions
- MINOR: server/event_hdl: add server_inetaddr struct to facilitate event data usage
- MINOR: server/event_hdl: update _srv_event_hdl_prepare_inetaddr prototype
- BUG/MINOR: server/event_hdl: propagate map port info through inetaddr event
- MINOR: server: ensure connection cleanup on server addr changes
- CLEANUP: server/event_hdl: remove purge_conn hint in INETADDR event
- MEDIUM: server: merge srv_update_addr() and srv_update_addr_port() logic
- CLEANUP: server: remove unused server_parse_addr_change_request() function
- CLEANUP: resolvers: remove duplicate func prototype
- MINOR: resolvers: add unique numeric id to nameservers
- MEDIUM: server: make server_set_inetaddr() updater serializable
- MINOR: server/event_hdl: expose updater info through INETADDR event
- MINOR: server: add dns hint in server_inetaddr_updater struct
- MEDIUM: server/dns: clear RMAINT when addr resolves again
- BUG/MINOR: server/dns: use server_set_inetaddr() to unset srv addr from DNS
- BUG/MEDIUM: server/dns: perform svc_port updates atomically from SRV records
- MEDIUM: peers: use server as stream target
- CLEANUP: peers: remove unused sock_init_arg struct member
- CLEANUP: peers: remove unused "proto" and "xprt" struct members
- MINOR: peers: rely on srv->addr and remove peer->addr
- DOC: config: add context hint for server keywords
- MINOR: stktable: add table_process_entry helper function
- MINOR: stktable: use {show,set,clear} table with ptr
- MINOR: map: add map_*_key converters to provide the matching key
- DOC: fix typo for fastfwd QUIC option
- BUG/MINOR: mux-quic: always report error to SC on RESET_STREAM emission
- MEDIUM: mux-quic: add BUG_ON if sending on locally closed QCS
- BUG/MINOR: mux-quic: disable fast-fwd if connection on error
- BUG/MINOR: quic: Wrong keylog callback setting.
- BUG/MINOR: quic: Missing call to TLS message callbacks
- MINOR: h3: check connection error during sending
- BUG/MINOR: h3: close connection on header list too big
- BUG/MINOR: h3: close connection on sending alloc errors
- BUG/MINOR: h3: disable fast-forward on buffer alloc failure
- Revert "MINOR: mux-quic: Disable zero-copy forwarding for send by default"
- MINOR: stktable: stktable_data_ptr() cannot fail in table_process_entry()
- CLEANUP: assorted typo fixes in the code and comments
- CI: use semantic version compare for determining "latest" OpenSSL
- CLEANUP: server: remove ambiguous check in srv_update_addr_port()
- CLEANUP: resolvers: remove unused RSLV_UPD_OBSOLETE_IP flag
- CLEANUP: resolvers: remove some more unused RSLV_UDP flags
- MEDIUM: server: simplify snr_set_srv_down() to prevent confusions
- MINOR: backend: export get_server_*() functions
- MINOR: tcpcheck: export proxy_parse_tcpcheck()
- MEDIUM: udp: allow to retrieve the frontend destination address
- MINOR: global: export a way to list build options
- MINOR: debug: add features and build options to "show dev"
- BUG/MINOR: server: fix server_find_by_name() usage during parsing
- REGTESTS: check attach-srv out of order declaration
- CLEANUP: quic: Remaining useless code into server part
- BUILD: quic: Missing quic_ssl.h header protection
- BUG/MEDIUM: h3: fix incorrect snd_buf return value
- MINOR: h3: do not consider missing buf room as error on trailers
- BUG/MEDIUM: stconn: Forward shutdown on write timeout only if it is forwardable
- BUG/MEDIUM: stconn: Set fsb date if zero-copy forwarding is blocked during nego
- BUG/MEDIUM: spoe: Never create new spoe applet if there is no server up
- MINOR: mux-h2: support limiting the total number of H2 streams per connection
- CLEANUP: mux-h2: remove the printfs from previous commit on h2 streams limit.
- DEV: h2: add the ability to emit literals in mkhdr
- DEV: h2: add the preface as well in supported output types
- DEV: h2: support passing raw data for a frame
- IMPORT: ebtree: implement and use flsnz_long() to count bits
- IMPORT: ebtree: switch the sizes and offsets to size_t and ssize_t
- IMPORT: ebtree: rework the fls macros to better deal with arch-specific ones
- IMPORT: ebtree: make string_equal_bits turn back to unsigned char
- IMPORT: ebtree: use unsigned ints for flsnz()
- IMPORT: ebtree: make string_equal_bits() return an unsigned
The new function hap_get_next_build_opt() will iterate over the list of
build options. This will be used for debugging, so that the build options
can be retrieved from the CLI.
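A plausible usage sketch, assuming hap_get_next_build_opt() takes the
previously returned option (or NULL to start the iteration) and returns
NULL once the list is exhausted; the exact signature is an assumption:

    const char *opt = NULL;

    /* walk the registered build options and print each one */
    while ((opt = hap_get_next_build_opt(opt)) != NULL)
        printf("%s\n", opt);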
This reverts commit 18f2ccd244.
The issues previously found with QUIC fast-forward have been resolved (see
GitHub issue #2372). Re-enable it by default. If any issue arises, it can
be disabled using the global statement:

    tune.quic.zero-copy-fwd-send off
This can be backported to 2.9, but only after a sensible period of
observation.
There is at least one bug for now in this part and it is still unstable. Thus
it is better to disable it by default for now. It can be enabled by setting
tune.quic.zero-copy-fwd-send to 'on'.
The zero-copy fast-forwarding feature is quite new and a bit sensitive.
There is an option to disable it globally. However, not all protocols have
the same maturity. For instance, for the PT multiplexer, there is nothing
really new: zero-copy fast-forwarding is only another name for the kernel
splicing. However, for QUIC/H3, it is pretty new, not really optimized, and
it will evolve. And soon, support will be added for the cache applet.
In this context, it is useful to be able to enable or disable zero-copy
fast-forwarding per protocol and applet and, when applicable, on sends or
receives separately. So, instead of having one flag to disable it
globally, there is now a dedicated bitfield, global.tune.no_zero_copy_fwd.
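As a minimal sketch of the idea, the per-protocol flag names below are
illustrative, not the exact haproxy defines; only
global.tune.no_zero_copy_fwd comes from the text above:

    #define NO_ZERO_COPY_FWD           0x0001  /* disable it everywhere */
    #define NO_ZERO_COPY_FWD_PT        0x0002  /* disable for the PT mux */
    #define NO_ZERO_COPY_FWD_QUIC_SND  0x0004  /* disable QUIC/H3 sends */

    /* at a send point in the QUIC mux, for instance: both the global
     * bit and the QUIC-send bit must be clear to take the fast path
     */
    if (!(global.tune.no_zero_copy_fwd &
          (NO_ZERO_COPY_FWD | NO_ZERO_COPY_FWD_QUIC_SND))) {
        /* take the zero-copy fast-forwarding path */
    }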
Add an optional argument for "-dt". This argument is interpreted as a
list of several trace statements separated by commas. For each statement,
a specific trace source name can be specified, or none to act on all
sources. Using a double-colon separator, it is possible to add
specifications on the wanted level and verbosity.
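For illustration, assuming the syntax described above (an optional source
name, then level and verbosity after colon separators; the source and
level names here are examples):

    $ haproxy -f haproxy.cfg -dt                   # all sources, error level
    $ haproxy -f haproxy.cfg -dt h2                # only the h2 source
    $ haproxy -f haproxy.cfg -dt h2:developer,quic # h2 at developer level,
                                                   # plus quic with defaults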
Add the '-dt' haproxy process argument. This automatically activates all
trace sources on stderr with the error level. This can be useful to
troubleshoot issues such as protocol violations.
The mworker mode never had a proper 'hard-stop' (-st) for the reload;
this option was commonly used with the daemon mode, but it was never
implemented in mworker mode.
This patch fixes the problem by implementing a "hard-reload" command
over the master CLI. It does the same as the "reload" command, but
instead of waiting for the connections to stop in the previous process,
it immediately quits the previous process after binding.
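For example, over the master CLI (the socket path is hypothetical):

    $ echo "hard-reload" | socat stdio /var/run/haproxy-master.sock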
When debugging a core, it's difficult to match a given gdb thread number
against an internal thread. Let's just store the pthread ID and the stack
pointer in each tinfo. This could help in the future by allowing to just
glance over them and pick the right one depending on what info is found
first.
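A minimal sketch of the idea; the struct and field names are illustrative,
not the exact haproxy ones:

    #include <pthread.h>

    struct thread_info {
        /* ... existing per-thread fields ... */
        pthread_t pth_id;   /* ID returned by pthread_self() */
        void *stack_top;    /* an address inside this thread's stack */
    };

    /* called early from each thread's entry point */
    static void record_thread_ids(struct thread_info *ti)
    {
        int anchor; /* any local variable lies on this thread's stack;
                     * its address is only used as a marker of the stack
                     * area and is never dereferenced later
                     */
        ti->pth_id = pthread_self();
        ti->stack_top = &anchor;
    }

In a core, these fields can then be matched against the thread IDs and
stack addresses that gdb's "info threads" reports.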
On CONNECTION_CLOSE reception/emission, QUIC connections enter CLOSING
state. At this stage, only CONNECTION_CLOSE can be reemitted and all
other exchanges are stopped.
Previously, on haproxy process stopping, if all QUIC connections were in
CLOSING state, they were released before their closing timer expiration
to not block the process shutdown. However, since a recent commit, the
closing timer has been shortened to a more reasonable delay. It is now
considered viable to respect the connections' closing state even on process
shutdown. As such, the stopping-specific code in the QUIC connection idle
timer task was removed.
A specific function quic_handle_stopping() was implemented to notify
QUIC connections on shutdown from the main() function. It should have been
deleted along with that removal from the QUIC idle timer task. This patch
just does that.
This bug was preventing the GTUNE_SOCKET_TRANSFER option from being set
when haproxy is neither in daemon mode nor in mworker mode. So it
basically only impacts the foreground mode.
The fix moves the code outside the 'if (global.mode & (MODE_DAEMON |
MODE_MWORKER | MODE_MWORKER_WAIT))' condition.
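Illustrative shape of the fix (simplified; 'old_unixsocket' stands here
for whatever condition requests the socket transfer, e.g. the -x option,
and is an assumption):

    /* before: only reachable when daemonizing or in master-worker mode */
    if (global.mode & (MODE_DAEMON | MODE_MWORKER | MODE_MWORKER_WAIT)) {
        /* ... */
        if (old_unixsocket)
            global.tune.options |= GTUNE_SOCKET_TRANSFER;
        /* ... */
    }

    /* after: the option can also be set in foreground mode */
    if (old_unixsocket)
        global.tune.options |= GTUNE_SOCKET_TRANSFER;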
Bug was introduced with 7f80eb23 ("MEDIUM: proxy: zombify proxies only
when the expose-fd socket is bound").
Must be backported in every stable version.
There was still a totally outdated comment speaking about issues
affecting Solaris on 1.1.8pre4 (April 2002, 21 years old)! This
proves that comments in headers are never read, so let's take this
opportunity to also remove the outdated one recommending to read
the "updated" RFC7230.
Since 2.7 and the mcli_reload_bind_conf (56f73b21a5), upon a reload
failure because of a bind error, mcli_reload_bind_conf goes through
sock_unbind(). This is not supposed to do anything when a listener is
RX_F_INHERITED in the master, but unfortunately this is done too early
and provokes an exit of the master.
We already suspected in the past that setting the 'master' variable this
late could have a negative impact.
The fix sets the master variable earlier, before the bind.
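The guard this ordering matters for looks roughly like this (a simplified
sketch, not the exact haproxy code):

    void sock_unbind(struct receiver *rx)
    {
        /* an FD inherited from the previous process must not be
         * closed in the master
         */
        if (master && (rx->flags & RX_F_INHERITED))
            return;
        /* ... actually unbind/close the socket ... */
    }

If 'master' is still zero at that point because it is set too late, the
inherited listener gets unbound and the master exits.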
This must be backported at least to 2.7. It could be backported
earlier, but better to wait for feedback on the fix.
MODE_CHECK does not output "Configuration file is valid" by default
anymore. To display this message the -V option must be used with -c.
However, warnings and errors are still output by default if they
exist.
This allows cleaning up the output of the systemd unit file, which is
doing a -c.
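For example (the configuration path is hypothetical):

    $ haproxy -c -f /etc/haproxy/haproxy.cfg      # silent on success
    $ haproxy -c -V -f /etc/haproxy/haproxy.cfg
    Configuration file is valid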