prometheus

mirror of https://github.com/prometheus/prometheus.git synced 2025-12-15 22:41:02 +01:00

Author	SHA1	Message	Date
Yuchen Wang	5630a3906a	fix typo (#16480 ) Signed-off-by: Yuchen Wang <yuchen.wang@databricks.com>	2025-04-25 09:27:58 +02:00
Bryan Boreham	8487ed8145	Merge pull request #16440 from bboreham/faster-benchmark-loadwls [TESTS] TSDB: Faster WAL benchmarks	2025-04-22 15:59:03 +01:00
Bryan Boreham	a11772234d	Merge pull request #16333 from colega/fix-series-create-gc-race fix: race condition between series creation and garbage collection	2025-04-17 12:15:11 +01:00
machine424	a825d448da	feat(tsdb/(head\|agent)): dereference the pools at the end of the WL replay to not wait for an extra GC cycle until the built-in cleanup mechanism kicks in See https://github.com/prometheus/prometheus/pull/15778 Signed-off-by: machine424 <ayoubmrini424@gmail.com>	2025-04-17 13:06:08 +02:00
Bryan Boreham	1d4b1d76a5	[TESTS] More efficient label creation in BenchmarkLoadWLs Use the Builder abstraction instead of going via a map. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2025-04-16 18:02:47 +01:00
Bryan Boreham	848df13d3a	[TESTS] Faster WAL Benchmarks by reusing buffer Less garbage collection. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2025-04-16 17:58:09 +01:00
Alex Le	bce72b93d9	tsdb: Introduced new constructor for LeveledCompactor to take in metrics (#16408 ) * Introduced new constructor for LeveledCompactor to take in metrics Signed-off-by: Alex Le <leqiyue@amazon.com> * Added Metrics to LeveledCompactorOptions Signed-off-by: Alex Le <leqiyue@amazon.com> --------- Signed-off-by: Alex Le <leqiyue@amazon.com>	2025-04-11 09:17:45 +01:00
Alex Le	701d13abf9	Make sure LeveledCompactor respect context cancellation during the time opening blocks (#16407 ) Signed-off-by: Alex Le <leqiyue@amazon.com>	2025-04-08 09:04:23 +01:00
Oleg Zaytsev	f5f91a9ca4	defer a.unmarkCreatedSeriesAsPendingCommit() Signed-off-by: Oleg Zaytsev <mail@olegzaytsev.com>	2025-03-31 10:06:28 +02:00
Ben Kochie	a721daf981	Log WAL segment loading time (#16336 ) Improve readability of "WAL segment loaded" by logging the duration of each load. This helps make it easier to spot slow WAL file load times. Signed-off-by: SuperQ <superq@gmail.com>	2025-03-31 06:05:14 +02:00
Ryan Wu	7d73c1d3f8	refactor[discovery, tsdb]: simplify error handling and remove redundant checks (#16328 ) * refactor: simplify error handling and remove redundant checks Signed-off-by: Ryan Wu <rongjun0821@gmail.com> * Add the comment for return of reloading blocks failure Co-authored-by: Ayoub Mrini <ayoubmrini424@gmail.com> Signed-off-by: Ryan Wu <rongjun0821@gmail.com> * Add the comment for return of reloading blocks failure Signed-off-by: Ryan Wu <rongjun0821@gmail.com> --------- Signed-off-by: Ryan Wu <rongjun0821@gmail.com> Co-authored-by: Ayoub Mrini <ayoubmrini424@gmail.com>	2025-03-27 12:20:59 +01:00
Oleg Zaytsev	e4fe8d8684	Create memSeries with pendingCommit=true This fixes TestHead_RaceBetweenSeriesCreationAndGC. Signed-off-by: Oleg Zaytsev <mail@olegzaytsev.com>	2025-03-27 11:11:57 +01:00
Oleg Zaytsev	df33f1aace	Add TestHead_RaceBetweenSeriesCreationAndGC This test consistently fails missing ~10 series. If it doesn't fail on your machine, just increase totalSeries, that's how race conditions work. Signed-off-by: Oleg Zaytsev <mail@olegzaytsev.com>	2025-03-27 10:56:24 +01:00
Matthieu MOREL	5fa1146e21	chore: enable gci linter (#16245 ) Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com>	2025-03-22 15:46:13 +00:00
Patryk Prus	2f43a5a3ab	TSDB: don't process exemplars older than minValidTime during WAL replay Signed-off-by: Patryk Prus <p@trykpr.us>	2025-03-19 16:24:08 -04:00
Ganesh Vernekar	bc595263c1	Merge pull request #16231 from pr00se/multiref-improvements TSDB: Handle metadata/tombstones/exemplars for duplicate series during WAL replay	2025-03-19 16:15:50 -04:00
pudongair	308c8c48c1	chore: fix some comments (#16237 ) Signed-off-by: pudongair <744355276@qq.com>	2025-03-19 16:28:34 +01:00
Ziqi Zhao	f6903bcc22	Let HistogramAppender.appendable return CounterResetHeader instead of… (#16195 ) Let HistogramAppender.appendable return CounterResetHeader instead of boolean Signed-off-by: Ziqi Zhao <zhaoziqi9146@gmail.com> Signed-off-by: Björn Rabenstein <github@rabenste.in> --------- Signed-off-by: Ziqi Zhao <zhaoziqi9146@gmail.com> Signed-off-by: Björn Rabenstein <github@rabenste.in> Co-authored-by: Björn Rabenstein <github@rabenste.in>	2025-03-18 17:40:27 +01:00
Patryk Prus	e4e1b515bc	TSDB: Handle metadata/tombstones/exemplars for duplicate series during WAL replay Signed-off-by: Patryk Prus <p@trykpr.us>	2025-03-18 12:22:33 -04:00
Fiona Liao	37c2ebb5fd	Make out-of-order native histograms flag a no-op and always enable (#16207 ) * Remove experimental out-of-order native histogram flag This feature has been available in Prometheus since September 2024, and has no known issues. Therefore proposing to remove the flag entirely and always have it on. Note that there are still two settings that need to be configured (out-of-order time window > 0 and native histograms enabled) for this feature to work. Signed-off-by: Fiona Liao <fiona.liao@grafana.com> * Update CHANGELOG Signed-off-by: Fiona Liao <fiona.liao@grafana.com> * Keep feature flag with warning Signed-off-by: Fiona Liao <fiona.liao@grafana.com> * Update CHANGELOG Signed-off-by: Fiona Liao <fiona.liao@grafana.com> * Update tsdb/head_append.go Co-authored-by: George Krajcsovits <krajorama@users.noreply.github.com> Signed-off-by: Fiona Liao <fiona.y.liao@gmail.com> * Update CHANGELOG.md Co-authored-by: George Krajcsovits <krajorama@users.noreply.github.com> Signed-off-by: Fiona Liao <fiona.y.liao@gmail.com> * Update tsdb/head_append.go Co-authored-by: George Krajcsovits <krajorama@users.noreply.github.com> Signed-off-by: Fiona Liao <fiona.y.liao@gmail.com> * Additional cleanup of comments and test names Signed-off-by: Fiona Liao <fiona.liao@grafana.com> --------- Signed-off-by: Fiona Liao <fiona.liao@grafana.com> Signed-off-by: Fiona Liao <fiona.y.liao@gmail.com> Co-authored-by: George Krajcsovits <krajorama@users.noreply.github.com>	2025-03-18 10:59:02 +00:00
Patryk Prus	86eeaf1886	Skip writing series records uniformly across the benchmark, so we skip some OOO series as well Signed-off-by: Patryk Prus <p@trykpr.us>	2025-03-17 15:17:53 -04:00
Patryk Prus	2147538d1e	Add missing series refs to benchmark Signed-off-by: Patryk Prus <p@trykpr.us>	2025-03-17 15:17:53 -04:00
Patryk Prus	401dbacf2e	Add counters for unknown series references during WAL/WBL replay Signed-off-by: Patryk Prus <p@trykpr.us>	2025-03-17 15:17:53 -04:00
Patryk Prus	85fa39032e	TSDB: Track count of unknown series referenced during WAL replay Signed-off-by: Patryk Prus <p@trykpr.us>	2025-03-17 15:17:48 -04:00
Bryan Boreham	30d04792ca	[PERF] Remote-write: re-use memory to read WAL data (#16197 ) The `:=` causes new variables to be created, which means the outer slice stays at nil, and new memory is allocated every time round the loop. Extracted from https://github.com/prometheus/prometheus/pull/16182 Credit to @bwplotka. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2025-03-11 10:49:51 +00:00
Bartlomiej Plotka	7a7bc65237	Add util/compression package to consolidate snappy/zstd use in Prometheus. (#16156 ) # Conflicts: # tsdb/db_test.go Apply suggestions from code review tmp Addressed comments. Update util/compression/buffers.go Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> Co-authored-by: Arthur Silva Sens <arthursens2005@gmail.com>	2025-03-10 10:36:26 +00:00
Arve Knudsen	56929ffa42	Upgrade to Go v1.24 (#16180 ) * Upgrade to Go v1.24 * Upgrade golangci-lint --------- Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2025-03-07 11:28:26 +01:00
Patryk Prus	61aa82865d	TSDB: keep duplicate series records in checkpoints while their samples may still be present (#16060 ) Renames the head's deleted map to walExpiries, and creates entries for any duplicate series records encountered during WAL replay, with the expiry set to the highest current WAL segment number. Any subsequent WAL checkpoints will see the duplicate series entry in the walExpiries map, and keep the series record until the last WAL segment that could contain its samples is deleted. Other considerations: WBL: series records aren't written to the WBL, so there are no duplicates to deal with agent mode: has its own WAL replay logic that handles duplicate series records differently, and is outside the scope of this PR	2025-03-05 13:45:08 -05:00
Arve Knudsen	7cbf749096	Upgrade to github.com/oklog/ulid/v2 (#16168 ) Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2025-03-05 16:03:25 +01:00
Bryan Boreham	42d55505f9	Merge pull request #12659 from prymitive/memChunk Short-cut common memChunk operations	2025-02-25 11:33:56 +00:00
Matthieu MOREL	c7d4b53ec1	chore: enable unused-parameter from revive Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com>	2025-02-19 19:50:28 +01:00
Ayoub Mrini	e04913aea2	Merge pull request #15778 from machine424/reuse-pools feat(tsdb/(head\|agent)): reuse pools across segments to reduce garbage during WL replay	2025-02-17 12:48:17 +01:00
Bartlomiej Plotka	de23a9667c	prw2: Split PRW2.0 from metadata-wal-records feature (#16030 ) Rationales: * metadata-wal-records might be deprecated and replaced going forward: https://github.com/prometheus/prometheus/issues/15911 * PRW 2.0 works without metadata just fine (although it sends untyped metrics as expected). Signed-off-by: bwplotka <bwplotka@gmail.com>	2025-02-13 12:16:33 +00:00
machine424	d644324407	feat(tsdb/(head\|agent)): reuse pools across segments to avoid generating garbage during WL replay This is part of the "reduce WAL replay overhead/garbage" effort to help with https://github.com/prometheus/prometheus/issues/6934. Signed-off-by: machine424 <ayoubmrini424@gmail.com>	2025-02-10 22:40:24 +01:00
Matthieu MOREL	b472ce7010	chore: enable early-return from revive Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com>	2025-02-10 22:08:43 +01:00
Bryan Boreham	b74cebf6bf	Merge pull request #12920 from prymitive/compactLock Fix locks in db.reloadBlocks()	2025-02-10 17:35:09 +00:00
Dimitar Dimitrov	686dcc7b0d	headIndexReader: reduce debug logging (#15993 ) Around Mimir compactions we see logging in ShardedPostings do massive allocations and drive GC up to 50% of CPU. Signed-off-by: Dimitar Dimitrov <dimitar.dimitrov@grafana.com>	2025-02-07 15:46:55 +00:00
SuryaPrakash	cb3b17a14c	fix: os.MkdirTemp with t.TempDir (#15860 ) Signed-off-by: Surya Prakash <surya0prakash@proton.me>	2025-01-31 14:32:20 +00:00
Alan Protasio	9d1abbb9ed	Call PostCreation callback only after the new series is added to the mempotings (#15579 ) Signed-off-by: alanprot <alanprot@gmail.com>	2025-01-28 12:11:58 +01:00
Jan Fajerski	6823f58e59	Merge pull request #15732 from bboreham/benchmark-setup-append-periodically TSDB benchmarks: Commit periodically to speed up init	2025-01-28 11:35:04 +01:00
Bryan Boreham	6ba25ba93f	tsdb tests: avoid 'defer' till end of function 'defer' only runs at the end of the function, so explicitly close the querier after we finish with it. Also check it didn't error. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2025-01-27 19:59:43 +00:00
Bryan Boreham	2f615a200d	tsdb tests: restrict some 'defer' operations 'defer' only runs at the end of the function, so introduce some more functions / move the start, so that 'defer' can run at the end of the logical block. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2025-01-27 19:59:43 +00:00
Bryan Boreham	f4fbe47254	tsdb tests: avoid capture-by-reference in goroutines Only one version of the variable is captured; this is a source of race conditions. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2025-01-27 19:59:43 +00:00
piguagua	a82f2b8168	chore: fix function name and struct name in comment (#15827 ) Signed-off-by: piguagua <piguagua@aliyun.com>	2025-01-17 21:26:08 +01:00
Julius Volz	0d7db907a9	Merge pull request #15785 from crystalstall/main refactor: using slices.Contains to simplify the code	2025-01-13 10:31:41 +01:00
crystalstall	616914abe2	Signed-off-by: crystalstall <crystalruby@qq.com> refactor: using slices.Contains to simplify the code Signed-off-by: crystalstall <crystalruby@qq.com>	2025-01-11 00:41:51 +08:00
Lukasz Mierzwa	e3728122b2	Update comments for methods that require a lock Signed-off-by: Lukasz Mierzwa <l.mierzwa@gmail.com>	2025-01-09 17:20:10 +00:00
Lukasz Mierzwa	a1740cd2e7	Remove unnecessary locks Compact() is an uppercase function that deals with locks on its own, so we shouldn't have a lock around it. Signed-off-by: Lukasz Mierzwa <lukasz@cloudflare.com>	2025-01-09 17:06:05 +00:00
Łukasz Mierzwa	d106b3beb7	Wrap db.blocks read in a read lock We don't hold db.mtx lock when trying to read db.blocks here so we need a read lock around this loop. Signed-off-by: Łukasz Mierzwa <l.mierzwa@gmail.com>	2025-01-09 17:06:05 +00:00
Łukasz Mierzwa	92788d313a	Remove TestTombstoneCleanRetentionLimitsRace This test ensures that running db.reloadBlocks() and db.CleanTombstones() at the same time doesn't race. The problem is that CleanTombstones() is a public method while reloadBlocks() is internal. CleanTombstones() sets db.cmtx lock while reloadBlocks() is not protected by any locks at all, it expects the public method through which it was called to do it. So having a race between these two is not unexpected and we shouldn't really be testing this. db.cmtx ensures that no other function can be modifying the list of open blocks and so the scenario tested here cannot happen. If it would happen it would be only because some other method doesn't aquire db.ctmx lock, something this test cannot detect. Signed-off-by: Łukasz Mierzwa <l.mierzwa@gmail.com>	2025-01-09 17:06:03 +00:00

1 2 3 4 5 ...

1341 Commits