prometheus

mirror of https://github.com/prometheus/prometheus.git synced 2026-04-01 11:51:03 +02:00

Author	SHA1	Message	Date
Julien	16876bab95	Merge pull request #18200 from roidelapluie/roidelapluie/retention-validation Multiple fixes in retention configuration	2026-03-20 12:27:37 +01:00
Owen Williams	c4deef472e	Merge remote-tracking branch 'origin/main' into feature/start-time Signed-off-by: Owen Williams <owen.williams@grafana.com>	2026-03-02 14:47:44 -05:00
Julien Pivotto	3675a5e56c	tsdb: fix unit mismatch in retention duration on config reload conf.StorageConfig.TSDBConfig.Retention.Time is model.Duration which is type-aliased to time.Duration (nanoseconds), but RetentionDuration is int64 in milliseconds. The missing division by time.Millisecond caused the metric prometheus_tsdb_retention_limit_seconds to be reported 1e6 times too large after a config reload. Signed-off-by: Julien Pivotto <291750+roidelapluie@users.noreply.github.com>	2026-02-26 16:44:49 +01:00
Ganesh Vernekar	ccc3062521	Merge branch 'main' into codesome/merge-3.10 Signed-off-by: Ganesh Vernekar <ganeshvern@gmail.com>	2026-02-25 17:33:06 -08:00
bwplotka	8f3a6020d8	Merge branch 'main' into st-main-sync2	2026-02-25 13:54:25 +00:00
Julien	9d38077e50	Merge pull request #18080 from ldufr/ldufresne/retention-size-percentage Add percentage based retention	2026-02-24 15:50:36 +01:00
Laurent Dufresne	c76e78d0a4	Added test for percentage-based retention Signed-off-by: Laurent Dufresne <laurent.dufresne@grafana.com>	2026-02-24 15:28:45 +01:00
bwplotka	56c46af6a6	Merge branch 'main' into st-f-main	2026-02-23 10:00:39 +00:00
Ganesh Vernekar	ad00ed0609	Optimize TestDiskFillingUpAfterDisablingOOO to run on i386 Signed-off-by: Ganesh Vernekar <ganesh.vernekar@reddit.com>	2026-02-17 13:35:44 -08:00
bwplotka	5ac1080a60	refactor: sed enableStStorage/enableSTStorage Signed-off-by: bwplotka <bwplotka@gmail.com>	2026-02-17 11:11:46 +00:00
Arve Knudsen	b0718d5c93	tsdb: fix flaky TestBlockRanges by using explicit compaction Replace polling loops (for range 100 { time.Sleep }) with explicit db.Compact() calls after disabling background compaction, eliminating CI flakiness on slow machines. Also fix incorrect overlap assertions that were checking the wrong direction (LessOrEqual -> GreaterOrEqual). Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2026-02-15 11:44:23 +01:00
Owen Williams	b57f5b59b3	tsdb: ST-in-WAL: Counter implementation and benchmarks (#17671 ) Initial implementation of https://github.com/prometheus/prometheus/issues/17790. Only implements ST-per-sample for Counters. Tests and benchmarks updated. Note: This increases the size of the RefSample object for all users, whether st-per-sample is turned on or not. Signed-off-by: Owen Williams <owen.williams@grafana.com>	2026-02-12 13:17:50 -05:00
Bartlomiej Plotka	c8e7f4e2a6	tests: Unify TestDiskFillingUpAfterDisablingOOO and avoid hiding errors (#18017 ) * tests: Unify TestDiskFillingUpAfterDisablingOOO and avoid hiding errors Signed-off-by: bwplotka <bwplotka@gmail.com> * addressed comments Signed-off-by: bwplotka <bwplotka@gmail.com> --------- Signed-off-by: bwplotka <bwplotka@gmail.com>	2026-02-05 16:11:35 +00:00
Arve Knudsen	00a7faa2e3	tsdb: fix division by zero in stale series compaction (#17952 ) Guard the stale series ratio calculation by checking numSeries > 0 before computing the ratio. This prevents division by zero when the head has no series. Fixes #17949 Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2026-01-29 08:06:00 +01:00
Ganesh Vernekar	4f3de8da29	tsdb: Add unit tests for stale series compaction Signed-off-by: Ganesh Vernekar <ganesh.vernekar@reddit.com>	2026-01-23 18:07:34 -08:00
György Krajcsovits	28dca34f4f	auto update head sample use in tests find . -name "*.go" -type f -exec sed -E -i \ 's/([^[:alpha:]]sample\{)([^,{:]+,[^,]+,[^,]+,[^,]+\})/\10, \2/g' {} + I've omitted tsdb/ooo_head.go from the commit because I'm also adding todo there. Signed-off-by: György Krajcsovits <gyorgy.krajcsovits@grafana.com>	2026-01-14 13:15:13 +01:00
Ben Kochie	e14795bbf4	Remove copyright date from headers (#17785 ) Remove copyright dates from various files as part of [PROM-50]. [PROM-50]: https://github.com/prometheus/proposals/blob/main/proposals/0050-remove-copyright-dates.md Signed-off-by: SuperQ <superq@gmail.com>	2026-01-05 13:46:21 +01:00
NamanParlecha	c94101d023	TSDB: Option to configure TSDB Block Reload Interval (#16728 ) Add --storage.tsdb.block-reload-interval flag to configure TSDB block reload interval. --------- Signed-off-by: Naman-B-Parlecha <namanparlecha@gmail.com> Signed-off-by: NamanParlecha <namanparlecha@gmail.com> Co-authored-by: Arve Knudsen <arve.knudsen@gmail.com>	2025-12-15 09:31:17 +01:00
Bartlomiej Plotka	f6ca7145ca	refactor(tsdb): use one test newTestDB constructor (#17638 ) For tests only, we had various ways of opening DB. Reduced to one instead of: * Open * newTestDB * newTestDBOpts * openTestDB This so https://github.com/prometheus/prometheus/pull/17629 is smaller and bit easier. Also for test maintainability and consistency. Signed-off-by: bwplotka <bwplotka@gmail.com>	2025-12-03 07:55:48 +00:00
Ben Kochie	204249fcb5	Update golangci-lint (#17478 ) * Update golangci-lint to v2.6.0 * Fixup various linting issues. * Fixup deprecations. * Add exception for `labels.MetricName` deprecation. Signed-off-by: SuperQ <superq@gmail.com>	2025-11-05 13:47:34 +01:00
Fiona Liao	b004db49af	Reduce samples for TestRuntimeRetentionConfigChange (#17422 ) * Reduce samples for TestRuntimeRetentionConfigChange --------- Signed-off-by: Fiona Liao <fiona.liao@grafana.com>	2025-10-28 18:23:32 +01:00
Minh Nguyen	ad4b59c504	tsdb: Deprecate retention flags; add tsdb.retention runtime configuration (#17026 ) * Move storage from CL to config file Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> * Fix .md Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> * run make cli-documentation Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> * fix Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> * run make cli-documentation Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> * nit_fixed Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> * fix Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> * add test and update configuration.md Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> * fix lint Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com> --------- Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com>	2025-10-27 14:51:33 +00:00
beorn7	ad7d1aed99	Phase out native histogram feature flag The detailed plan for this is laid out in https://github.com/prometheus/prometheus/issues/16572 . This commit adds a global and local scrape config option `scrape_native_histograms`, which has to be set to true to ingest native histograms. To ease the transition, the feature flag is changed to simply set the default of `scrape_native_histograms` to true. Further implications: - The default scrape protocols now depend on the `scrape_native_histograms` setting. - Everywhere else, histograms are now "on by default". Documentation beyond the one for the feature flag and the scrape config are deliberately left out. See https://github.com/prometheus/prometheus/pull/17232 for that. Signed-off-by: beorn7 <beorn@grafana.com>	2025-10-15 14:50:52 +02:00
György Krajcsovits	30f941c57c	fix(wal): ignore invalid native histogram schemas on load Reduce the resolution of histograms as needed and ignore invalid schemas while emitting a warning log. Signed-off-by: György Krajcsovits <gyorgy.krajcsovits@grafana.com>	2025-09-24 11:41:25 +02:00
beorn7	7e82bdb75b	tsdb: Fix commit order for mixed-typed series Fixes https://github.com/prometheus/prometheus/issues/15177 The basic idea here is to divide the samples to be commited into (sub) batches whenever we detect that the same series receives a sample of a type different from the previous one. We then commit those batches one after another, and we log them to the WAL one after another, so that we hit both birds with the same stone. The cost of the stone is that we have to track the sample type of each series in a map. Given the amount of things we already track in the appender, I hope that it won't make a dent. Note that this even addresses the NHCB special case in the WAL. This does a few other things that I could not resist to pick up on the go: - It adds more zeropool.Pools and uses the existing ones more consistently. My understanding is that this was merely an oversight. Maybe the additional pool usage will compensate for the increased memory demand of the map. - Create the synthetic zero sample for histograms a bit more carefully. So far, we created a sample that always went into its own chunk. Now we create a sample that is compatible enough with the following sample to go into the same chunk. This changed the test results quite a bit. But IMHO it makes much more sense now. - Continuing past efforts, I changed more namings of `Samples` into `Floats` to keep things consistent and less confusing. (Histogram samples are also samples.) I still avoided changing names in other packages. - I added a few shortcuts `h := a.head`, saving many characters. TODOs: - Address @krajorama's TODOs about commit order and staleness handling. Signed-off-by: beorn7 <beorn@grafana.com>	2025-09-17 19:22:25 +02:00
beorn7	46cfc9fb99	tsdb: Extend TestDataNotAvailableAfterRollback This exposes the ommission of float histograms from the rollback. Signed-off-by: beorn7 <beorn@grafana.com>	2025-09-17 19:22:25 +02:00
beorn7	747c5ee2b1	Apply analyzer "modernize" to the whole codebase See https://pkg.go.dev/golang.org/x/tools/gopls/internal/analysis/modernize for details. This ran into a few issues (arguably bugs in the modernize tool), which I will fix in the next commit, so that we have transparency what was done automatically. Beyond those hiccups, I believe all the changes applied are legitimate. Even where there might be no tangible direct gain, I would argue it's still better to use the "modern" way to avoid micro discussions in tiny style PRs later. Signed-off-by: beorn7 <beorn@grafana.com>	2025-08-27 14:48:41 +02:00
Bryan Boreham	498f63e60b	Merge pull request #17029 from pr00se/wal-checkpoint-dropped-samples TSDB: use timestamps rather than WAL segment numbers to track how long deleted series should be retained in checkpoints	2025-08-20 11:15:10 +01:00
pipiland2612	82a4b12507	Add t.parallel() for ./tsdb Signed-off-by: pipiland2612 <nguyen.t.dang.minh@gmail.com>	2025-08-12 14:12:42 +02:00
Patryk Prus	0fea41ed53	Refactor keep function to work for both agent and non-agent implementations Signed-off-by: Patryk Prus <p@trykpr.us>	2025-08-08 14:12:47 -04:00
Patryk Prus	218558f543	Store mint rather than the last WAL segment in head.walExpiries during head GC Signed-off-by: Patryk Prus <p@trykpr.us>	2025-08-08 14:12:41 -04:00
Matthieu MOREL	cef219c31c	chore: enable unused-receiver rule from revive Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com>	2025-08-04 09:43:33 +00:00
socialsister	869c946370	chore: fix some minor issues in comments Signed-off-by: socialsister <seekseat@qq.com>	2025-07-16 11:24:42 +01:00
liangmulu	b1a7df2c0c	chore: fix some minor issues in comments Signed-off-by: liangmulu <liangmulu@outlook.com>	2025-07-09 18:05:41 +08:00
Ayoub Mrini	2edc3ed6c5	feat(tsdb): introduce --use-uncached-io feature flag and allow using it for chunks writing (#15365 ) Signed-off-by: machine424 <ayoubmrini424@gmail.com> Signed-off-by: Ayoub Mrini <ayoubmrini424@gmail.com>	2025-05-21 14:42:30 +02:00
Arve Knudsen	e7e3ab2824	Fix linting issues found by golangci-lint v2.0.2 (#16368 ) * Fix linting issues found by golangci-lint v2.0.2 --------- Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2025-05-03 19:05:13 +02:00
Bryan Boreham	a11772234d	Merge pull request #16333 from colega/fix-series-create-gc-race fix: race condition between series creation and garbage collection	2025-04-17 12:15:11 +01:00
Ryan Wu	7d73c1d3f8	refactor[discovery, tsdb]: simplify error handling and remove redundant checks (#16328 ) * refactor: simplify error handling and remove redundant checks Signed-off-by: Ryan Wu <rongjun0821@gmail.com> * Add the comment for return of reloading blocks failure Co-authored-by: Ayoub Mrini <ayoubmrini424@gmail.com> Signed-off-by: Ryan Wu <rongjun0821@gmail.com> * Add the comment for return of reloading blocks failure Signed-off-by: Ryan Wu <rongjun0821@gmail.com> --------- Signed-off-by: Ryan Wu <rongjun0821@gmail.com> Co-authored-by: Ayoub Mrini <ayoubmrini424@gmail.com>	2025-03-27 12:20:59 +01:00
Oleg Zaytsev	e4fe8d8684	Create memSeries with pendingCommit=true This fixes TestHead_RaceBetweenSeriesCreationAndGC. Signed-off-by: Oleg Zaytsev <mail@olegzaytsev.com>	2025-03-27 11:11:57 +01:00
Fiona Liao	37c2ebb5fd	Make out-of-order native histograms flag a no-op and always enable (#16207 ) * Remove experimental out-of-order native histogram flag This feature has been available in Prometheus since September 2024, and has no known issues. Therefore proposing to remove the flag entirely and always have it on. Note that there are still two settings that need to be configured (out-of-order time window > 0 and native histograms enabled) for this feature to work. Signed-off-by: Fiona Liao <fiona.liao@grafana.com> * Update CHANGELOG Signed-off-by: Fiona Liao <fiona.liao@grafana.com> * Keep feature flag with warning Signed-off-by: Fiona Liao <fiona.liao@grafana.com> * Update CHANGELOG Signed-off-by: Fiona Liao <fiona.liao@grafana.com> * Update tsdb/head_append.go Co-authored-by: George Krajcsovits <krajorama@users.noreply.github.com> Signed-off-by: Fiona Liao <fiona.y.liao@gmail.com> * Update CHANGELOG.md Co-authored-by: George Krajcsovits <krajorama@users.noreply.github.com> Signed-off-by: Fiona Liao <fiona.y.liao@gmail.com> * Update tsdb/head_append.go Co-authored-by: George Krajcsovits <krajorama@users.noreply.github.com> Signed-off-by: Fiona Liao <fiona.y.liao@gmail.com> * Additional cleanup of comments and test names Signed-off-by: Fiona Liao <fiona.liao@grafana.com> --------- Signed-off-by: Fiona Liao <fiona.liao@grafana.com> Signed-off-by: Fiona Liao <fiona.y.liao@gmail.com> Co-authored-by: George Krajcsovits <krajorama@users.noreply.github.com>	2025-03-18 10:59:02 +00:00
Bartlomiej Plotka	7a7bc65237	Add util/compression package to consolidate snappy/zstd use in Prometheus. (#16156 ) # Conflicts: # tsdb/db_test.go Apply suggestions from code review tmp Addressed comments. Update util/compression/buffers.go Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> Co-authored-by: Arthur Silva Sens <arthursens2005@gmail.com>	2025-03-10 10:36:26 +00:00
Patryk Prus	61aa82865d	TSDB: keep duplicate series records in checkpoints while their samples may still be present (#16060 ) Renames the head's deleted map to walExpiries, and creates entries for any duplicate series records encountered during WAL replay, with the expiry set to the highest current WAL segment number. Any subsequent WAL checkpoints will see the duplicate series entry in the walExpiries map, and keep the series record until the last WAL segment that could contain its samples is deleted. Other considerations: WBL: series records aren't written to the WBL, so there are no duplicates to deal with agent mode: has its own WAL replay logic that handles duplicate series records differently, and is outside the scope of this PR	2025-03-05 13:45:08 -05:00
Arve Knudsen	7cbf749096	Upgrade to github.com/oklog/ulid/v2 (#16168 ) Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2025-03-05 16:03:25 +01:00
Matthieu MOREL	c7d4b53ec1	chore: enable unused-parameter from revive Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com>	2025-02-19 19:50:28 +01:00
Bryan Boreham	b74cebf6bf	Merge pull request #12920 from prymitive/compactLock Fix locks in db.reloadBlocks()	2025-02-10 17:35:09 +00:00
Bryan Boreham	2f615a200d	tsdb tests: restrict some 'defer' operations 'defer' only runs at the end of the function, so introduce some more functions / move the start, so that 'defer' can run at the end of the logical block. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2025-01-27 19:59:43 +00:00
Łukasz Mierzwa	92788d313a	Remove TestTombstoneCleanRetentionLimitsRace This test ensures that running db.reloadBlocks() and db.CleanTombstones() at the same time doesn't race. The problem is that CleanTombstones() is a public method while reloadBlocks() is internal. CleanTombstones() sets db.cmtx lock while reloadBlocks() is not protected by any locks at all, it expects the public method through which it was called to do it. So having a race between these two is not unexpected and we shouldn't really be testing this. db.cmtx ensures that no other function can be modifying the list of open blocks and so the scenario tested here cannot happen. If it would happen it would be only because some other method doesn't aquire db.ctmx lock, something this test cannot detect. Signed-off-by: Łukasz Mierzwa <l.mierzwa@gmail.com>	2025-01-09 17:06:03 +00:00
György Krajcsovits	1e420ef373	Merge branch 'main' into cedwards/nhcb-wal-wbl # Conflicts: # tsdb/tsdbutil/histogram.go	2025-01-02 12:50:19 +01:00
Joel Beckmeyer	39f5a07236	fix TestOOOHeadChunkReader_Chunk on 32-bit Signed-off-by: Joel Beckmeyer <joel@beckmeyer.us>	2024-12-16 10:45:07 -05:00
Carrie Edwards	1933ccc9be	Fix test	2024-12-06 14:55:19 -08:00

1 2 3 4 5 ...

257 Commits