haproxy

mirror of https://git.haproxy.org/git/haproxy.git/ synced 2025-08-10 00:57:02 +02:00

Author	SHA1	Message	Date
Willy Tarreau	19c8161b3d	MINOR: applet: add a new "owner" pointer in the appctx This pointer indicates what stream-interface the appctx belongs to, just like we have for the connections.	2015-04-23 17:56:16 +02:00
Willy Tarreau	7365dad40f	BUG/MEDIUM: stream-int: always reset si->ops when si->end is nullified It happened after changing the stream interface deinitialization sequence that we got random crashes with si_shutw() being called on NULL si->end. The reason was that si->ops was not reset after a call to si_release_endpoint() which is sometimes called directly. Thus we now move the resetting of si->ops just after any si->end assignment. It happens that si_detach() is now just the same as si_release_endpoint() and stream_int_unregister_handler(). Some cleanup will have to be performed there. It's not sure whether this problem can impact 1.5 since in 1.5 applets are part of the default embedded stream handler. The only way it could cause some trouble is if it's used with a connection, which doesn't seem possible at first glance.	2015-04-21 14:15:22 +02:00
Willy Tarreau	73b65acd46	MINOR: stream: pass the pointer to the origin explicitly to stream_new() We don't pass sess->origin anymore but the pointer to the previous step. Now it should be much easier to chain elements together once applets are moved out of streams. Indeed, the session is only used for configuration and not for the dynamic chaining anymore.	2015-04-08 18:26:29 +02:00
Thierry FOURNIER	3def393f8d	MINOR: lua: map system integration in Lua This patch cretes a new Map class that permits to do some lookup in HAProxy maps. This Map class is integration in the HAProxy update system, so we can modify the map throught the socket.	2015-04-07 15:56:21 +02:00
Willy Tarreau	02d863866d	MEDIUM: stream: return the stream upon accept() The function was called stream_accept_session(), let's rename it stream_new() and make it return the newly allocated pointer. It's more convenient for some callers who need it.	2015-04-06 11:37:34 +02:00
Willy Tarreau	c38f71cfcd	MINOR: session: introduce session_new() This one creates a new session and does the minimum initialization.	2015-04-06 11:37:33 +02:00
Willy Tarreau	a7513f5d00	MINOR: stream-int: make appctx_new() take the applet in argument Doing so simplifies the initialization of a new appctx. We don't need appctx_set_applet() anymore.	2015-04-06 11:37:32 +02:00
Willy Tarreau	9903f0e1a2	REORG: session: move the session parts out of stream.c This concerns everythins related to accepting a new session and expiring the embryonic session. There's still a hard-coded call to stream_accept_session() which could be set somewhere in the frontend, but for now it's not a problem.	2015-04-06 11:37:32 +02:00
Willy Tarreau	e73ef85a63	MAJOR: tcp: make tcp_exec_req_rules() only rely on the session It passes a NULL wherever a stream was needed (acl_exec_cond() and action_ptr mainly). It can still track the connection rate correctly and block based on ACLs.	2015-04-06 11:37:31 +02:00
Willy Tarreau	bb2ef12a60	MEDIUM: session: update the session's stick counters upon session_free() Whenever session_free() is called, any possible stick counter stored in the session will be synchronized.	2015-04-06 11:37:31 +02:00
Willy Tarreau	8b7f8688ee	MEDIUM: streams: support looking up stkctr in the session In order to support sessions tracking counters, we first ensure that there is no overlap between streams' stkctr and sessions', and we allow an automatic lookup into the session's counters when the stream doesn't have a counter or when the stream doesn't exist during an access via a sample fetch. The functions used to update the stream counters only update them and not the session counters however.	2015-04-06 11:37:31 +02:00
Willy Tarreau	7698c9080a	REORG: stktable: move the stkctr_* functions from stream to sticktable These ones are not stream-specific at all and will be needed outside of stream, so let's move them to stick_tables where struct stkctr is defined.	2015-04-06 11:37:30 +02:00
Willy Tarreau	11c3624c32	MINOR: session: implement session_free() and use it everywhere We want to call this one everywhere we have to kill a session so that future parts we move to the session can be released from there.	2015-04-06 11:37:30 +02:00
Willy Tarreau	d0d8da989b	MINOR: stream: provide a few helpers to retrieve frontend, listener and origin Expressions are quite long when using strm_sess(strm)->whatever, so let's provide a few helpers : strm_fe(), strm_li(), strm_orig().	2015-04-06 11:37:29 +02:00
Willy Tarreau	192252e2d8	MAJOR: sample: pass a pointer to the session to each sample fetch function Many such function need a session, and till now they used to dereference the stream. Once we remove the stream from the embryonic session, this will not be possible anymore. So as of now, sample fetch functions will be called with this : - sess = NULL, strm = NULL : never - sess = valid, strm = NULL : tcp-req connection - sess = valid, strm = valid, strm->txn = NULL : tcp-req content - sess = valid, strm = valid, strm->txn = valid : http-req / http-res	2015-04-06 11:37:25 +02:00
Willy Tarreau	987e3fb868	MEDIUM: http: remove the now useless http_txn from {req/res} rules The registerable http_req_rules / http_res_rules used to require a struct http_txn at the end. It's redundant with struct stream and propagates very deep into some parts (ie: it was the reason for lua requiring l7). Let's remove it now.	2015-04-06 11:35:53 +02:00
Willy Tarreau	15e91e1b36	MAJOR: sample: don't pass l7 anymore to sample fetch functions All of them can now retrieve the HTTP transaction if it exists from the stream and be sure to get NULL there when called with an embryonic session. The patch is a bit large because many locations were touched (all fetch functions had to have their prototype adjusted). The opportunity was taken to also uniformize the call names (the stream is now always "strm" instead of "l4") and to fix indent where it was broken. This way when we later introduce the session here there will be less confusion.	2015-04-06 11:35:53 +02:00
Willy Tarreau	eee5b51248	MAJOR: http: move http_txn out of struct stream Now this one is dynamically allocated. It means that 280 bytes of memory are saved per TCP stream, but more importantly that it will become possible to remove the l7 pointer from fetches and converters since it will be deduced from the stream and will support being null. A lot of care was taken because it's easy to forget a test somewhere, and the previous code used to always trust s->txn for being valid, but all places seem to have been visited. All HTTP fetch functions check the txn first so we shouldn't have any issue there even when called from TCP. When branching from a TCP frontend to an HTTP backend, the txn is properly allocated at the same time as the hdr_idx.	2015-04-06 11:35:52 +02:00
Willy Tarreau	b1ec8c4a59	MINOR: session: start to reintroduce struct session There is now a pointer to the session in the stream, which is NULL for now. The session pool is created as well. Some parts will move from the stream to the session now.	2015-04-06 11:23:57 +02:00
Willy Tarreau	e7dff02dd4	REORG/MEDIUM: stream: rename stream flags from SN_* to SF_* This is in order to keep things consistent.	2015-04-06 11:23:57 +02:00
Willy Tarreau	87b09668be	REORG/MAJOR: session: rename the "session" entity to "stream" With HTTP/2, we'll have to support multiplexed streams. A stream is in fact the largest part of what we currently call a session, it has buffers, logs, etc. In order to catch any error, this commit removes any reference to the struct session and tries to rename most "session" occurrences in function names to "stream" and "sess" to "strm" when that's related to a session. The files stream.{c,h} were added and session.{c,h} removed. The session will be reintroduced later and a few parts of the stream will progressively be moved overthere. It will more or less contain only what we need in an embryonic session. Sample fetch functions and converters will have to change a bit so that they'll use an L5 (session) instead of what's currently called "L4" which is in fact L6 for now. Once all changes are completed, we should see approximately this : L7 - http_txn L6 - stream L5 - session L4 - connection \| applet There will be at most one http_txn per stream, and a same session will possibly be referenced by multiple streams. A connection will point to a session and to a stream. The session will hold all the information we need to keep even when we don't yet have a stream. Some more cleanup is needed because some code was already far from being clean. The server queue management still refers to sessions at many places while comments talk about connections. This will have to be cleaned up once we have a server-side connection pool manager. Stream flags "SN_*" still need to be renamed, it doesn't seem like any of them will need to move to the session.	2015-04-06 11:23:56 +02:00
Willy Tarreau	9f640a1eab	CLEANUP: compression: statify all algo-specific functions There's no reason for exporting identity_* nor deflate_*, they're only used in the same file. Mark them static, it will make it easier to add other algorithms.	2015-03-28 15:46:00 +01:00
Willy Tarreau	15530d28a4	MEDIUM: compression: don't send leading zeroes with chunk size Till now we used to rely on a fixed maximum chunk size. Thanks to last commit we're now free to adjust the chunk's length before sending the data, so we don't have to use 6 digits all the time anymore, and if one wants buffers larger than 16 MB it is now possible.	2015-03-28 12:05:47 +01:00
Thierry FOURNIER	7fe75e0dab	MINOR: http: export function inet_set_tos() This is used by Lua.	2015-03-18 11:34:06 +01:00
Thierry FOURNIER	5531f87ace	MINOR: http: split http_transform_header() function in two parts. This function is a callback for HTTP actions. This function creates the replacement string from a build_logline() format and transform the header. This patch split this function in two part. With this modification, the header transformation and the replacement string are separed. We can now transform the header with another replacement string source than a build_logline() format.	2015-03-18 11:34:06 +01:00
Thierry FOURNIER	b77aece24a	MINOR: http: split the function http_action_set_req_line() in two parts The first part is the replacement engine. It take a replacement action number and a replacement string and process the action. The second part is the function which is called by the 'http-request action' to replace a request line part. This function makes the string used as replacement. This split permits to use the replacement engine in other parts of the code than the request action. The Lua use it for his own http action.	2015-03-18 11:34:06 +01:00
Willy Tarreau	91d9628a51	MINOR: peers: centralize configuration of the peers frontend This is in order to stop exporting the peer_accept() function.	2015-03-13 16:23:00 +01:00
Willy Tarreau	d85c48589a	REORG: connection: move conn_drain() to connection.c and rename it It's now called conn_sock_drain() to make it clear that it only reads at the sock layer and not at the data layer. The function was too big to remain inlined and it's used at a few places where size counts.	2015-03-13 00:42:48 +01:00
Willy Tarreau	f31fb07958	MEDIUM: connection: make conn_drain() perform more controls Currently si_idle_conn_null_cb() has to perform some low-level checks over the file descriptor and the connection configuration that should only belong to conn_drain(). Let's move these controls there. The function now automatically checks for errors and hangups on the file descriptor for example, and disables recv polling if there's no drain function at the control layer.	2015-03-13 00:32:20 +01:00
Willy Tarreau	ff3e648812	MINOR: connection: implement conn_sock_send() This function is an equivalent to send() which operates over a connection instead of a file descriptor. It checks that the control layer is ready and that it's allowed to send. If automatically enables polling if it cannot send. It simplifies the return checks by returning zero in all cases where it cannot send so that the caller only has to care about negative values indicating errors.	2015-03-13 00:04:49 +01:00
Willy Tarreau	729c69f6e5	MINOR: connection: perform the call to xprt->shutw() in conn_data_shutw() This will save callers from having to care about conn->xprt and xprt->shutw. Note that shutw() takes a second argument indicating whether it's a clean or a hard shutw. This is used by SSL which tries to close cleanly in most cases. Here we provide two versions, conn_data_shutw() which performs the clean close, and conn_data_shutw_hard() which does the unclean one.	2015-03-12 22:51:10 +01:00
Willy Tarreau	a02e8c9510	MINOR: connection: make conn_sock_shutw() actually perform the shutdown() call This function was not used yet and was only supposed to mark the connection as shutdown for write. Unfortunately at other places in stream_interface.c, we're seeing a bit of layering violations with attempts to perform the shutdown on the fd directly. Let's make this function call shutdown() itself so that the callers only have to care about the connection.	2015-03-12 22:42:29 +01:00
Willy Tarreau	78955f4c8b	MEDIUM: session: simplify receive buffer allocator to only use the channel Now that we can get the session from the channel, let's simplify the prototype of session_alloc_recv_buffer() to only require the channel. Both the caller and the function are now simplified.	2015-03-11 20:41:47 +01:00
Willy Tarreau	d5ccfa3fc5	MINOR: channel: add chn_sess() helper to retrieve session from channel Channels already have to know what session they below to. Add a helper to retrieve this pointer so that we'll use it later.	2015-03-11 20:41:47 +01:00
Willy Tarreau	81cd90069a	MEDIUM: channel: remove now unused ->prod and ->cons pointers Nothing uses them anymore.	2015-03-11 20:41:47 +01:00
Willy Tarreau	5decc05a9e	MAJOR: channel: only rely on the new CF_ISRESP flag to find the SI Now we exclusively use this flag to find what side a channel is and where the stream ints are. The ->prod and ->cons are not used anymore.	2015-03-11 20:41:47 +01:00
Willy Tarreau	73796535a9	REORG/MEDIUM: channel: only use chn_prod / chn_cons to find stream-interfaces The purpose of these two macros will be to pass via the session to find the relevant stream interfaces so that we don't need to store the ->cons nor ->prod pointers anymore. Currently they're only defined so that all references could be removed. Note that many places need a second pass of clean up so that we don't have any chn_prod(&s->req) anymore and only &s->si[0] instead, and conversely for the 3 other cases.	2015-03-11 20:41:47 +01:00
Willy Tarreau	50fe03be78	CLEANUP: stream-int: add si_opposite() to find the other stream interface At a few places we need to find one stream interface from the other one. Instead of passing via the channel, we simply use the session as an intermediary, which simply results in applying an offset to the pointer.	2015-03-11 20:41:47 +01:00
Willy Tarreau	4e4292b9af	CLEANUP: stream-int: add si_ib/si_ob to dereference the buffers This makes the code cleaner and is more intuitive to use.	2015-03-11 20:41:46 +01:00
Willy Tarreau	819d332dfd	MEDIUM: stream-int: remove any reference to the owner si->owner is not used anymore now, so let's remove any reference to it.	2015-03-11 20:41:46 +01:00
Willy Tarreau	07373b8660	MEDIUM: stream-int: use si_task() to retrieve the task from the stream int We go back to the session to get the owner. Here again it's very easy and is just a matter of relative offsets. Since the owner always exists and always points to the session's task, we can remove some unneeded tests.	2015-03-11 20:41:46 +01:00
Willy Tarreau	aefd79004c	MEDIUM: stream-int: make si_sess() use the stream int's side This one relies on the SI's side to find the pointer to the session. That the stream interface doesn't have to look at the task's context anymore.	2015-03-11 20:41:46 +01:00
Willy Tarreau	0b2fb7f9a3	MAJOR: stream-int: only rely on SI_FL_ISBACK to find the requested channel In order to plan removal of si->ib / si->ob, we now check the side of the stream interface and find the session, then the requested channel. In practice it's just an offset applied to the pointer based on the flag.	2015-03-11 20:41:46 +01:00
Willy Tarreau	a5f5d8dc69	MEDIUM: stream-int: add a flag indicating which side the SI is on This new flag "SI_FL_ISBACK" is set only on the back SI and is cleared on the front SI. That way it's possible only by looking at the SI to know what side it is.	2015-03-11 20:41:46 +01:00
Willy Tarreau	2bb4a96f8f	REORG/MEDIUM: stream-int: introduce si_ic/si_oc to access channels We'll soon remove direct references to the channels from the stream interface since everything belongs to the same session, so let's first not dereference si->ib / si->ob anymore and use macros instead.	2015-03-11 20:41:46 +01:00
Willy Tarreau	a27dc19eda	CLEANUP: remove now unused channel pool The channels are now part of the struct session. Their pool is not needed anymore.	2015-03-11 20:41:46 +01:00
Thierry FOURNIER	8fd1376014	MINOR: converters: add function to browse converters This patch adds a fucntion to browse each converter. This is used with Lua for using the converters with a wrapper.	2015-03-11 19:55:10 +01:00
Thierry FOURNIER	53e08ecc41	BUG/MEDIUM: lua: the Lua process is not waked up after sending data on requests side If we are writing in the request buffer, we are not waked up when the data are forwarded because it is useles. The request analyzers are waked up only when data is incoming. So, if the request buffer is full, we set the WAKE_ON_WRITE flag.	2015-03-09 17:47:52 +01:00
Thierry FOURNIER	ef6a2115fd	BUG/MEDIUM: lua: fix infinite loop about channel Before this patch, each yield in a Lua action set a flags to be waked up when some activity were detected on the response channel. This behavior causes loop in the analyzer process. This patch set the wake up on response buffer activity only if we really want to be waked up on this activity.	2015-03-09 17:47:52 +01:00
Thierry FOURNIER	4abd3ae184	MINOR: lua: adds "forced yield" flag This flag indicate that the current yield is returned by the Lua execution task control. If this flag is set, the current task may quit but will be set in the run queue to be re-executed immediatly. This patch modify the "hlua_yieldk()" function, it adds an argument that contain a field containing yield options.	2015-03-04 17:58:52 +01:00
Thierry FOURNIER	a097fdfb62	MINOR: lua: use bitfield and macro in place of integer and enum In the future, the lua execution must return scheduling informations. We want more than one flag, so I convert an integer used with an enum into an interer used as bitfield.	2015-03-04 17:58:52 +01:00
Thierry FOURNIER	a718b29b6d	MINOR: lua: remove some #define The #define compilation directives are centralized in the hlua include files. This permits to remove ome #ifdef from the haproxy main code.	2015-03-04 17:58:52 +01:00
Thierry FOURNIER	5c49aeb1b0	MINOR: remove unused declaration. This declaration is removed in the patch 'Lua initialisation on demand". commit id `05ac42455f`	2015-03-04 17:58:52 +01:00
Thierry FOURNIER	a4a0f3d7c8	MINOR: lua: post initialisation bindings This system permits to execute some lua function after than HAProxy complete his initialisation. These functions are executed between the end of the configuration parsing and check and the begin of the scheduler.	2015-02-28 23:12:34 +01:00
Thierry FOURNIER	380d0930bd	MINOR: lua: add runtime execution context The functions added permits to execute the LUA stack execution in HAProxy. It provides all the runtie environment and initialise the main LUA stack.	2015-02-28 23:12:33 +01:00
Thierry FOURNIER	6f1fd48ef1	MEDIUM: lua: lua integration in the build and init system. This is the first step of the lua integration. We add the useful files in the HAProxy project. These files contains the main includes, the Makefile options and empty initialisation function. Is is the LUA skeleton.	2015-02-28 23:12:33 +01:00
Thierry FOURNIER	ca16b03813	MINOR: channel: functions to get data from a buffer without copy We now have functions to retrieve one block and one line from either the input or the output part of a buffer. They return up to two (pointer,length) values in case the buffer wraps.	2015-02-28 23:12:33 +01:00
Thierry FOURNIER	cc87a11842	MEDIUM: tcp: add register keyword system. This patch introduces an action keyword registration system for TCP rulesets similar to what is available for HTTP rulesets. This sytem will be useful with lua.	2015-02-28 23:12:32 +01:00
Thierry FOURNIER	ac836baad1	MINOR: includes: fix a lot of missing or useless includes These modifications are done for resolving cross-dependent includes in the upcoming LUA code. <proto/channel.h> misses <types/channel.h>. <types/acl.h> doesn't use <types/session.h> because the session is already declared in the file as undefined pointer. appsession.c misses <unistd.h> to use "write()". Declare undefined pointer "struct session" for <types/proxy.h> and <types/queue.h>. These includes dont need the detail of this struct.	2015-02-28 23:12:32 +01:00
Thierry FOURNIER	49f45af9aa	MINOR: global: export many symbols. The functions "val_payload_lv" and "val_hdr" are useful with lua. The lua automatic binding for sample fetchs needs to compare check functions. The "arg_type_names" permit to display error messages.	2015-02-28 23:12:32 +01:00
Thierry FOURNIER	4d9a1d1a5c	MINOR: sample: add function for browsing samples. This function is useful with the incoming lua functions.	2015-02-28 23:12:32 +01:00
Thierry FOURNIER	f41a809dc9	MINOR: sample: add private argument to the struct sample_fetch The add of this private argument is to prepare the integration of the lua fetchs.	2015-02-28 23:12:31 +01:00
Willy Tarreau	b550d009ca	MEDIUM: protocol: use a family array to index the protocol handlers Instead of walking over a list, we now have a direct mapping between protocol families and their respective handlers. This will allow fast lookups.	2015-02-28 23:12:31 +01:00
Thierry FOURNIER	9cf7c4b9df	MAJOR: poll: only rely on wake_expired_tasks() to compute the wait delay Actually, HAProxy uses the function "process_runnable_tasks" and "wake_expired_tasks" to get the next task which can expires. If a task is added with "task_schedule" or other method during the execution of an other task, the expiration of this new task is not taken into account, and the execution of this task can be too late. Actualy, HAProxy seems to be no sensitive to this bug. This fix moves the call to process_runnable_tasks() before the timeout calculation and ensures that all wakeups are processed together. Only wake_expired_tasks() needs to return a timeout now.	2015-02-28 23:12:30 +01:00
Willy Tarreau	501260bf67	MEDIUM: task: always ensure that the run queue is consistent As found by Thierry Fournier, if a task manages to kill another one and if this other task is the next one in the run queue, we can do whatever including crashing, because the scheduler restarts from the saved next task. For now, there is no such concept of a task killing another one, but with Lua it will come. A solution consists in always performing the lookup of the first task in the scheduler's loop, but it's expensive and costs around 2% of the performance. Another solution consists in keeping a global next run queue node and ensuring that when this task gets removed, it updates this pointer to the next one. This allows to simplify the code a bit and in the end to slightly increase the performance (0.3-0.5%). The mechanism might still be usable if we later migrate to a multi-threaded scheduler.	2015-02-23 16:07:01 +01:00
Thierry FOURNIER	70fd7480f9	BUG/MINOR: ARG6 and ARG7 don't fit in a 32 bits word The patch "MEDIUM: args: increase arg type to 5 bits and limit arg count to 5" (`dbc79d0a`) increased the number of types supported, but forgot to remove the ARG6/ARG7 macros.	2015-02-20 14:34:16 +01:00
Simon Horman	64e3416662	MEDIUM: Allow suppression of email alerts by log level This patch adds a new option which allows configuration of the maximum log level of messages for which email alerts will be sent. The default is alert which is more restrictive than the current code which sends email alerts for all priorities. That behaviour may be configured using the new configuration option to set the maximum level to notice or greater. email-alert level notice Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-06 07:59:58 +01:00
Simon Horman	0ba0e4ac07	MEDIUM: Support sending email alerts Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-03 00:24:16 +01:00
Simon Horman	bfb5d33fe6	MEDIUM: Add free_check() helper Add free_check() helper to free the memory allocated by init_check(). Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-03 00:24:15 +01:00
Simon Horman	b1900d55df	MEDIUM: Refactor init_check and move to checks.c Refactor init_check so that an error string is returned rather than alerts being printed by it. Also init_check to checks.c and provide a prototype to allow it to be used from multiple C files. Signed-off-by: Simon Horman <horms@verge.net.au>	2015-02-03 00:24:15 +01:00
Willy Tarreau	3d241e78a1	MEDIUM: args: use #define to specify the number of bits used by arg types and counts This is in order to add new types. This patch does not change anything else. Two remaining (harmless) occurrences of a count of 8 instead of 7 were fixed by this patch : empty_arg_list[] and the for() loop counting args.	2015-01-22 14:24:53 +01:00
Willy Tarreau	319f745ba0	MINOR: channel: rename bi_erase() to channel_truncate() It applies to the channel and it doesn't erase outgoing data, only pending unread data, which is strictly equivalent to what recv() does with MSG_TRUNC, so that new name is more accurate and intuitive.	2015-01-14 20:32:59 +01:00
Willy Tarreau	b5051f8742	MINOR: channel: rename bi_avail() to channel_recv_max() This name more accurately reminds that it applies to a channel and not to a buffer, and that what is returned may be used as a max number of bytes to pass to recv().	2015-01-14 20:26:54 +01:00
Willy Tarreau	3f5096ddf2	MINOR: channel: rename buffer_max_len() to channel_recv_limit() Buffer_max_len() is ambiguous and misleading since it considers the channel. The new name more accurately designates the size limit for received data.	2015-01-14 20:21:43 +01:00
Willy Tarreau	a4178192b9	MINOR: channel: rename buffer_reserved() to channel_reserved() This applies to the channel, not the buffer, so let's fix this name. Warning, the function's name happens to be the same as the old one which was mistakenly used during 1.5.	2015-01-14 20:21:12 +01:00
Willy Tarreau	3889fffe92	MINOR: channel: rename channel_full() to !channel_may_recv() This function's name was poorly chosen and is confusing to the point of being suspiciously used at some places. The operations it does always consider the ability to forward pending input data before receiving new data. This is not obvious at all, especially at some places where it was used when consuming outgoing data to know if the buffer has any chance to ever get the missing data. The code needs to be re-audited with that in mind. Care must be taken with existing code since the polarity of the function was switched with the renaming.	2015-01-14 18:41:33 +01:00
Willy Tarreau	ba0902ede4	CLEANUP: channel: rename channel_reserved -> channel_is_rewritable channel_reserved is confusingly named. It is used to know whether or not the rewrite area is left intact for situations where we want to ensure we can use it before proceeding. Let's rename it to fix this confusion.	2015-01-14 18:41:33 +01:00
Willy Tarreau	9c06ee4ccf	BUG/MEDIUM: channel: don't schedule data in transit for leaving until connected Option http-send-name-header is still hurting. If a POST request has to be redispatched when this option is used, and the next server's name is larger than the initial one, and the POST body fills the buffer, it becomes impossible to rewrite the server's name in the buffer when redispatching. In 1.4, this is worse, the process may crash because of a negative size computation for the memmove(). The only solution to fix this is to refrain from eating the reserve before we're certain that we won't modify the buffer anymore. And the condition for that is that the connection is established. This patch introduces "channel_may_send()" which helps to detect whether it's safe to eat the reserve or not. This condition is used by channel_in_transit() introduced by recent patches. This patch series must be backported into 1.5, and a simpler version must be backported into 1.4 where fixing the bug is much easier since there were no channels by then. Note that in 1.4 the severity is major.	2015-01-14 16:08:45 +01:00
Willy Tarreau	27bb0e14a8	MEDIUM: channel: make bi_avail() use channel_in_transit() This ensures that we rely on a sane computation for the buffer size.	2015-01-14 15:57:24 +01:00
Willy Tarreau	fe57834955	MEDIUM: channel: make buffer_reserved() use channel_in_transit() This ensures that we rely on a sane computation for the buffer size.	2015-01-14 15:57:21 +01:00
Willy Tarreau	1a4484dec8	MINOR: channel: add channel_in_transit() This function returns the amount of bytes in transit in a channel's buffer, which is the amount of outgoing data plus the amount of incoming data bound to the forward limit.	2015-01-14 13:51:48 +01:00
Willy Tarreau	bb3f994f1a	BUG/MINOR: channel: compare to_forward with buf->i, not buf->size We know that all incoming data are going to be purged if to_forward is greater than them, not only if greater than the buffer size. This buf has no direct impact on this version, but it participates to some bugs affecting http-send-name-header since 1.4. This fix will have to be backported down to 1.4 albeit in a different form.	2015-01-14 13:50:24 +01:00
Willy Tarreau	0428a146c0	BUG/MEDIUM: channel: fix possible integer overflow on reserved size computation The buffer_max_len() function is subject to an integer overflow in this calculus : int ret = global.tune.maxrewrite - chn->to_forward - chn->buf->o; - chn->to_forward may be up to 2^31 - 1 - chn->buf->o may be up to chn->buf->size - global.tune.maxrewrite is by definition smaller than chn->buf->size Thus here we can subtract (2^31 + buf->o) (highly negative) from something slightly positive, and result in ret being larger than expected. Fortunately in 1.5 and 1.6, this is only used by bi_avail() which itself is used by applets which do not set high values for to_forward so this problem does not happen there. However in 1.4 the equivalent computation was used to limit the size of a read and can result in a read overflow when combined with the nasty http-send-name-header feature. This fix must be backported to 1.5 and 1.4.	2015-01-14 12:04:34 +01:00
Willy Tarreau	3c23a85550	CLEANUP: session: remove session_from_task() Since commit `3dd6a25` ("MINOR: stream-int: retrieve session pointer from stream-int"), we can get the session from the task, so let's get rid of this less obvious function.	2014-12-28 12:19:57 +01:00
Willy Tarreau	b034b2598d	MEDIUM: channel: implement a zero-copy buffer transfer bi_swpbuf() swaps the buffer passed in argument with the one attached to the channel, but only if this last one is empty. The idea is to avoid a copy when buffers can simply be swapped.	2014-12-24 23:47:33 +01:00
Willy Tarreau	a24adf0795	MAJOR: session: only wake up as many sessions as available buffers permit We've already experimented with three wake up algorithms when releasing buffers : the first naive one used to wake up far too many sessions, causing many of them not to get any buffer. The second approach which was still in use prior to this patch consisted in waking up either 1 or 2 sessions depending on the number of FDs we had released. And this was still inaccurate. The third one tried to cover the accuracy issues of the second and took into consideration the number of FDs the sessions would be willing to use, but most of the time we ended up waking up too many of them for nothing, or deadlocking by lack of buffers. This patch completely removes the need to allocate two buffers at once. Instead it splits allocations into critical and non-critical ones and implements a reserve in the pool for this. The deadlock situation happens when all buffers are be allocated for requests pending in a maxconn-limited server queue, because then there's no more way to allocate buffers for responses, and these responses are critical to release the servers's connection in order to release the pending requests. In fact maxconn on a server creates a dependence between sessions and particularly between oldest session's responses and latest session's requests. Thus, it is mandatory to get a free buffer for a response in order to release a server connection which will permit to release a request buffer. Since we definitely have non-symmetrical buffers, we need to implement this logic in the buffer allocation mechanism. What this commit does is implement a reserve of buffers which can only be allocated for responses and that will never be allocated for requests. This is made possible by the requester indicating how much margin it wants to leave after the allocation succeeds. Thus it is a cooperative allocation mechanism : the requester (process_session() in general) prefers not to get a buffer in order to respect other's need for response buffers. The session management code always knows if a buffer will be used for requests or responses, so that is not difficult : - either there's an applet on the initiator side and we really need the request buffer (since currently the applet is called in the context of the session) - or we have a connection and we really need the response buffer (in order to support building and sending an error message back) This reserve ensures that we don't take all allocatable buffers for requests waiting in a queue. The downside is that all the extra buffers are really allocated to ensure they can be allocated. But with small values it is not an issue. With this change, we don't observe any more deadlocks even when running with maxconn 1 on a server under severely constrained memory conditions. The code becomes a bit tricky, it relies on the scheduler's run queue to estimate how many sessions are already expected to run so that it doesn't wake up everyone with too few resources. A better solution would probably consist in having two queues, one for urgent requests and one for normal requests. A failed allocation for a session dealing with an error, a connection event, or the need for a response (or request when there's an applet on the left) would go to the urgent request queue, while other requests would go to the other queue. Urgent requests would be served from 1 entry in the pool, while the regular ones would be served only according to the reserve. Despite not yet having this, it works remarkably well. This mechanism is quite efficient, we don't perform too many wake up calls anymore. For 1 million sessions elapsed during massive memory contention, we observe about 4.5M calls to process_session() compared to 4.0M without memory constraints. Previously we used to observe up to 16M calls, which rougly means 12M failures. During a test run under high memory constraints (limit enforced to 27 MB instead of the 58 MB normally needed), performance used to drop by 53% prior to this patch. Now with this patch instead it increases by about 1.5%. The best effect of this change is that by limiting the memory usage to about 2/3 to 3/4 of what is needed by default, it's possible to increase performance by up to about 18% mainly due to the fact that pools are reused more often and remain hot in the CPU cache (observed on regular HTTP traffic with 20k objects, buffers.limit = maxconn/10, buffers.reserve = limit/2). Below is an example of scenario which used to cause a deadlock previously : - connection is received - two buffers are allocated in process_session() then released - one is allocated when receiving an HTTP request - the second buffer is allocated then released in process_session() for request parsing then connection establishment. - poll() says we can send, so the request buffer is sent and released - process session gets notified that the connection is now established and allocates two buffers then releases them - all other sessions do the same till one cannot get the request buffer without hitting the margin - and now the server responds. stream_interface allocates the response buffer and manages to get it since it's higher priority being for a response. - but process_session() cannot allocate the request buffer anymore => We could end up with all buffers used by responses so that none may be allocated for a request in process_session(). When the applet processing leaves the session context, the test will have to be changed so that we always allocate a response buffer regardless of the left side (eg: H2->H1 gateway). A final improvement would consists in being able to only retry the failed I/O operation without waking up a task, but to date all experiments to achieve this have proven not to be reliable enough.	2014-12-24 23:47:33 +01:00
Willy Tarreau	bf883e0aa7	MAJOR: session: implement a wait-queue for sessions who need a buffer When a session_alloc_buffers() fails to allocate one or two buffers, it subscribes the session to buffer_wq, and waits for another session to release buffers. It's then removed from the queue and woken up with TASK_WAKE_RES, and can attempt its allocation again. We decide to try to wake as many waiters as we release buffers so that if we release 2 and two waiters need only once, they both have their chance. We must never come to the situation where we don't wake enough tasks up. It's common to release buffers after the completion of an I/O callback, which can happen even if the I/O could not be performed due to half a failure on memory allocation. In this situation, we don't want to move out of the wait queue the session that was just added, otherwise it will never get any buffer. Thus, we only force ourselves out of the queue when freeing the session. Note: at the moment, since session_alloc_buffers() is not used, no task is subscribed to the wait queue.	2014-12-24 23:47:33 +01:00
Willy Tarreau	656859d478	MEDIUM: session: implement a basic atomic buffer allocator This patch introduces session_alloc_recv_buffer(), session_alloc_buffers() and session_release_buffers() whose purpose will be to allocate missing buffers and release unneeded ones around the process_session() and during I/O operations. I/O callbacks only need a single buffer for recv operations and none for send. However we still want to ensure that we don't pick the last buffer. That's what session_alloc_recv_buffer() is for. This allocator is atomic in that it always ensures we can get 2 buffers or fails. Here, if any of the buffers is not ready and cannot be allocated, the operation is cancelled. The purpose is to guarantee that we don't enter into the deadlock where all buffers are allocated by the same size of all sessions. A queue will have to be implemented for failed allocations. For now they're just reported as failures.	2014-12-24 23:47:32 +01:00
Willy Tarreau	4428a29e52	MEDIUM: channel: do not report full when buf_empty is present on a channel Till now we'd consider a buffer full even if it had size==0 due to pointing to buf.size. Now we change this : if buf_wanted is present, it means that we have already tried to allocate a buffer but failed. Thus the buffer must be considered full so that we stop trying to poll for reads on it. Otherwise if it's empty, it's buf_empty and we report !full since we may allocate it on the fly.	2014-12-24 23:47:32 +01:00
Willy Tarreau	2a4b54359b	MEDIUM: buffer: always assign a dummy empty buffer to channels Channels are now created with a valid pointer to a buffer before the buffer is allocated. This buffer is a global one called "buf_empty" and of size zero. Thus it prevents any activity from being performed on the buffer and still ensures that chn->buf may always be dereferenced. b_free() also resets the buffer to &buf_empty, and was split into b_drop() which does not reset the buffer.	2014-12-24 23:47:32 +01:00
Willy Tarreau	474cf54a97	MINOR: buffer: reset a buffer in b_reset() and not channel_init() We'll soon need to be able to switch buffers without touching the channel, so let's move buffer initialization out of channel_init(). We had the same in compressoin.c.	2014-12-24 23:47:31 +01:00
Willy Tarreau	3dd6a25323	MINOR: stream-int: retrieve session pointer from stream-int sess_from_si() does this via the owner (struct task). It works because all stream ints belong to a task nowadays.	2014-12-24 23:47:31 +01:00
Lukas Tribus	e4e30f7d52	BUILD: ssl: use OPENSSL_NO_OCSP to detect OCSP support Since commit `656c5fa7e8` ("BUILD: ssl: disable OCSP when using boringssl) the OCSP code is bypassed when OPENSSL_IS_BORINGSSL is defined. The correct thing to do here is to use OPENSSL_NO_OCSP instead, which is defined for this exact purpose in openssl/opensslfeatures.h. This makes haproxy forward compatible if boringssl ever introduces full OCSP support with the additional benefit that it links fine against a OCSP-disabled openssl. Signed-off-by: Lukas Tribus <luky-37@hotmail.com>	2014-12-09 20:49:22 +01:00
Thierry FOURNIER	315ec4217f	BUG/MEDIUM: pattern: don't load more than once a pattern list. A memory optimization can use the same pattern expression for many equal pattern list (same parse method, index method and index_smp method). The pattern expression is returned by "pattern_new_expr", but this function dont indicate if the returned pattern is already in use. So, the caller function reload the list of patterns in addition with the existing patterns. This behavior is not a problem with tree indexed pattern, but it grows the lists indexed patterns. This fix add a "reuse" flag in return of the function "pattern_new_expr". If the flag is set, I suppose that the patterns are already loaded. This fix must be backported into 1.5.	2014-11-24 15:40:16 +01:00
Willy Tarreau	5be2f35231	MAJOR: polling: centralize calls to I/O callbacks In order for HTTP/2 not to eat too much memory, we'll have to support on-the-fly buffer allocation, since most streams will have an empty request buffer at some point. Supporting allocation on the fly means being able to sleep inside I/O callbacks if a buffer is not available. Till now, the I/O callbacks were called from two locations : - when processing the cached events - when processing the polled events from the poller This change cleans up the design a bit further than what was started in 1.5. It now ensures that we never call any iocb from the poller itself and that instead, events learned by the poller are put into the cache. The benefit is important in terms of stability : we don't have to care anymore about the risk that new events are added into the poller while processing its events, and we're certain that updates are processed at a single location. To achieve this, we now modify all the fd_* functions so that instead of creating updates, they add/remove the fd to/from the cache depending on its state, and only create an update when the polling status reaches a state where it will have to change. Since the pollers make use of these functions to notify readiness (using fd_may_recv/fd_may_send), the cache is always up to date with the poller. Creating updates only when the polling status needs to change saves a significant amount of work for the pollers : a benchmark showed that on a typical TCP proxy test, the amount of updates per connection dropped from 11 to 1 on average. This also means that the update list is smaller and has more chances of not thrashing too many CPU cache lines. The first observed benefit is a net 2% performance gain on the connection rate. A second benefit is that when a connection is accepted, it's only when we're processing the cache, and the recv event is automatically added into the cache after the current one, resulting in this event to be processed immediately during the same loop. Previously we used to have a second run over the updates to detect if new events were added to catch them before waking up tasks. The next gain will be offered by the next steps on this subject consisting in implementing an I/O queue containing all cached events ordered by priority just like the run queue, and to be able to leave some events pending there as long as needed. That will allow us not to perform some FD processing if it's not the proper time for this (typically keep waiting for a buffer to be allocated if none is available for an recv()). And by only processing a small bunch of them, we'll allow priorities to take place even at the I/O level. As a result of this change, functions fd_alloc_or_release_cache_entry() and fd_process_polled_events() have disappeared, and the code dedicated to checking for new fd events after the callback during the poll() loop was removed as well. Despite the patch looking large, it's mostly a change of what function is falled upon fd_*() and almost nothing was added.	2014-11-21 20:37:32 +01:00
KOVACS Krisztian	b3e54fe387	MAJOR: namespace: add Linux network namespace support This patch makes it possible to create binds and servers in separate namespaces. This can be used to proxy between multiple completely independent virtual networks (with possibly overlapping IP addresses) and a non-namespace-aware proxy implementation that supports the proxy protocol (v2). The setup is something like this: net1 on VLAN 1 (namespace 1) -\ net2 on VLAN 2 (namespace 2) -- haproxy ==== proxy (namespace 0) net3 on VLAN 3 (namespace 3) -/ The proxy is configured to make server connections through haproxy and sending the expected source/target addresses to haproxy using the proxy protocol. The network namespace setup on the haproxy node is something like this: = 8< = $ cat setup.sh ip netns add 1 ip link add link eth1 type vlan id 1 ip link set eth1.1 netns 1 ip netns exec 1 ip addr add 192.168.91.2/24 dev eth1.1 ip netns exec 1 ip link set eth1.$id up ... = 8< = = 8< = $ cat haproxy.cfg frontend clients bind 127.0.0.1:50022 namespace 1 transparent default_backend scb backend server mode tcp server server1 192.168.122.4:2222 namespace 2 send-proxy-v2 = 8< = A bind line creates the listener in the specified namespace, and connections originating from that listener also have their network namespace set to that of the listener. A server line either forces the connection to be made in a specified namespace or may use the namespace from the client-side connection if that was set. For more documentation please read the documentation included in the patch itself. Signed-off-by: KOVACS Tamas <ktamas@balabit.com> Signed-off-by: Sarkozi Laszlo <laszlo.sarkozi@balabit.com> Signed-off-by: KOVACS Krisztian <hidden@balabit.com>	2014-11-21 07:51:57 +01:00
Willy Tarreau	eb11889f1e	MINOR: task: release the task pool when stopping When we're stopping, we're not going to create new tasks anymore, so let's release the task pool upon each task_free() in order to reduce memory fragmentation.	2014-11-13 16:57:19 +01:00
Willy Tarreau	4e21ff9244	BUG/MEDIUM: http: adjust close mode when switching to backend Commit `179085c` ("MEDIUM: http: move Connection header processing earlier") introduced a regression : the backend's HTTP mode is not considered anymore when setting the session's HTTP mode, because wait_for_request() is only called once, when the frontend receives the request (or when the frontend is in TCP mode, when the backend receives the request). The net effect is that in some situations when the frontend and the backend do not work in the same mode (eg: keep-alive vs close), the backend's mode is ignored. This patch moves all that processing to a dedicated function, which is called from the original place, as well as from session_set_backend() when switching from an HTTP frontend to an HTTP backend in different modes. This fix must be backported to 1.5.	2014-09-30 18:44:22 +02:00
Dave McCowan	328fb58d74	MEDIUM: connection: add new bit in Proxy Protocol V2 There are two sample commands to get information about the presence of a client certificate. ssl_fc_has_crt is true if there is a certificate present in the current connection ssl_c_used is true if there is a certificate present in the session. If a session has stopped and resumed, then ssl_c_used could be true, while ssl_fc_has_crt is false. In the client byte of the TLS TLV of Proxy Protocol V2, there is only one bit to indicate whether a certificate is present on the connection. The attached patch adds a second bit to indicate the presence for the session. This maintains backward compatibility. [wt: this should be backported to 1.5 to help maintain compatibility between versions]	2014-08-23 07:35:29 +02:00
Lukas Tribus	656c5fa7e8	BUILD: ssl: disable OCSP when using boringssl Google's boringssl doesn't currently support OCSP, so disable it if detected. OCSP support may be reintroduced as per: https://code.google.com/p/chromium/issues/detail?id=398677 In that case we can simply revert this commit. Signed-off-by: Lukas Tribus <luky-37@hotmail.com>	2014-08-18 14:33:48 +02:00

1 2 3 4 5 ...

898 Commits