mirrors/omni - omni - gitea@git.xfx1.de

mirror of https://github.com/siderolabs/omni.git synced 2026-04-01 06:01:02 +02:00

Author	SHA1	Message	Date
Artem Chernyshev	44562c97eb	release(v1.6.1): prepare release This is the official v1.6.1 release. Signed-off-by: Artem Chernyshev <artem.chernyshev@talos-systems.com>	2026-03-18 19:50:04 +03:00
Oguz Kilcan	1b7fa208d4	fix: correct SQLite size metrics to include indexes and freelist Table size queries (`omni_sqlite_subsystem_size_bytes`) filtered by dbstat name, missing index sizes. Join with sqlite_master to attribute index pages to their parent table. DB size (`omni_sqlite_db_size_bytes`) used dbstat sum which excludes freelist pages. Use page_count * page_size to match actual file size. Add `omni_sqlite_db_freelist_size_bytes` metric to track wasted space. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com> (cherry picked from commit e85ab384c343ca26987ce65ca1f9f4a9ee7d2361)	2026-03-18 18:00:24 +03:00
Oguz Kilcan	cf7d752453	feat: enforce configurable machine registration limit Add `account.maxRegisteredMachines` config option to cap the number of registered machines. The provision handler atomically checks the limit under a mutex before creating new Link resources, returning ResourceExhausted when the cap is reached. Introduce a Notification resource type (ephemeral namespace) so controllers can surface warnings to users. `omnictl` displays all active notifications on every command invocation. Frontend part of showing notifications will be implemented in a different PR. MachineStatusMetricsController creates a warning notification when the registration limit is reached and tears it down when it's not. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-03-16 12:48:47 +01:00
Oguz Kilcan	72cb85a4ff	feat: add configurable bandwidth rate limiting for SideroLink tunnel Introduce token-bucket based bandwidth rate limiting for the SideroLink WireGuard tunnel, configurable via services.siderolink.bandwidthLimitMbps and services.siderolink.bandwidthLimitBurstBytes config fields (with corresponding CLI flag fallbacks). Rate limiting is applied in both directions: outbound via a wrapped conn.Bind and inbound via a TUN input packet filter. A shared limiter drops packets exceeding the budget, relying on TCP congestion control to throttle senders. Disabled by default (0 = unlimited). Also adds a Grafana service to docker-compose with pre-built Omni dashboards for local development observability. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-03-12 11:20:57 +01:00
Utku Ozdemir	1e9b733cb0	chore: bump deps, rekres Bump all dependencies. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-03-10 18:31:38 +01:00
Edward Sammut Alessi	433fe435db	chore: bump default talos version Bump default talos version to 1.12.5 Signed-off-by: Edward Sammut Alessi <edward.sammutalessi@siderolabs.com>	2026-03-10 15:25:18 +01:00
Edward Sammut Alessi	a566261b91	feat(frontend): allow specifying date range for audit logs Replace the audit log download button with a modal which includes a date range for filtering to keep file sizes down. Includes a warning if trying to download logs for longer than 7 days. Modal includes a bytes downloaded indicator to convey progress. Download is canceled if modal is closed. Signed-off-by: Edward Sammut Alessi <edward.sammutalessi@siderolabs.com>	2026-03-10 11:23:21 +01:00
Edward Sammut Alessi	beb7dba850	release(v1.6.0-beta.0): prepare release This is the official v1.6.0-beta.0 release. Signed-off-by: Edward Sammut Alessi <edward.sammutalessi@siderolabs.com>	2026-03-04 15:46:27 +01:00
Oguz Kilcan	e3df911d48	feat: enforce configurable limits on user and service account creation Add state validation that rejects identity creation when the configured maximum number of users or service accounts is reached. The gRPC resource and management servers now use the validated state so these limits are enforced for all creation paths (CLI, UI, API). Identity is created before the user resource so the validation fires before any side effects. Also adds create validation for join token name, e2e Playwright tests covering UI and AccountLimits integration test covering API and CLI for limit enforcement. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-02-26 13:47:52 +01:00
Oguz Kilcan	1abd7ce6e9	chore: bump default talos version Bump default talos version to 1.12.4 Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-02-19 14:28:32 +01:00
Pranav Patil	01a0b3e601	fix: add required SQLite storage path flag to compose.yaml Add the required flag `--sqlite-storage-path`. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-02-19 09:44:13 +01:00
Utku Ozdemir	8f5d64f86f	test: add embedded etcd smoke test to helm e2e Add a two-phase approach to the helm e2e test: first install Omni with embedded etcd and run a smoke test (omnictl get defaultjointoken), then uninstall and reinstall with external etcd for the full integration suite. Other changes: - Extract reusable extract_sa_key function - Split helm values into base + external etcd overlay to remove duplication - Move helm test values to hack/test/helm/templates/ and drop .envsubst suffix - Fix empty string arg bug in configure_registry_mirrors (remove dead else branch) Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-02-17 15:24:00 +01:00
Utku Ozdemir	ccc197b258	refactor: replace the old helm chart with the new one Since we don't want to support/maintain the old chart anymore, we simply replace it with the new chart. Added a validation which fails on upgrades from the old one to the new one. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-02-17 12:29:38 +01:00
Utku Ozdemir	52f249dbcc	feat: make more things configurable in the helm chart Add support for priorityClassName, terminationGracePeriodSeconds, dnsPolicy/dnsConfig, initContainers, extraContainers (sidecars), and custom labels on all services. Also, fix some unit tests and add additional unit tests. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-02-16 13:59:08 +01:00
Utku Ozdemir	fbf36740f2	test: add unit and e2e tests to the helm chart Add helm unit tests (via helm-unittest) covering services, ingresses, HTTPRoutes, secrets, PrometheusRules and ServiceAccounts. Add a helm-based e2e test workflow that deploys Omni on a Talos cluster with Traefik and etcd, runs integration tests including workload proxy, and verifies the full stack end-to-end. Add a configurable TestOptions struct to the workload proxy test to allow running with smaller scale in helm e2e. Signed-off-by: Kevin Tijssen <kevin.tijssen@siderolabs.com> Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-02-16 13:58:56 +01:00
Utku Ozdemir	0c2c5c1cc3	test: use envsubst in tests and do small improvements Now that we have envsubst in the build container, we can simplify our scripts a bit. Also do other cosmetic improvements in the test scripts. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-02-16 11:04:25 +01:00
Oguz Kilcan	bd86ff3127	chore: remove deprecated migration flags, config fields, and migration code The deprecated flags and config fields kept for the SQLite migration period (v1.4.0) have been removed along with all automatic migration code for BoltDB secondary storage, file-based audit logs, file-based discovery service snapshots, and circular buffer machine logs. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-02-16 10:38:53 +01:00
Utku Ozdemir	30d17dcf6d	chore: update Go to 1.26 in go.mod, rekres, fix linting issues Update Go in go.mod to keep it consistent with the value in the Makefile (the actual Go version the project is built with). It kicks in some new linters, causes linters to change behavior. Reformat and fix all those linting issues. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-02-13 10:58:59 +01:00
Utku Ozdemir	d1c869a9d8	chore: bump deps, rekres Bump all dependencies. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-02-12 20:43:45 +01:00
Utku Ozdemir	1e24fd222d	feat: implement helm chart v2 Helm Chart v2: - Add new Helm chart with comprehensive configuration via values.yaml - Support for both Kubernetes Ingress and Gateway API - Built-in validation for required fields and URL consistency - Prometheus metrics and ServiceMonitor support - Detailed documentation with examples for Traefik - Workload proxy setup guide Deploy directory reorganization: - Move Docker Compose files to `deploy/compose/` - Move existing Helm chart to `deploy/helm/omni/` - Add top-level `deploy/README.md` pointing to deployment options - Add deprecation warning to v1 Helm chart Documentation: - Add link to Helm chart in root README Co-authored-by: Kevin Tijssen <kevin.tijssen@siderolabs.com> Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-01-30 14:09:27 +01:00
Utku Ozdemir	056d5e4e18	fix: bind wireguard to configured address instead of all interfaces Having issues with wireguard connectivity from QEMU machines to Omni running on macOS revealed an issue: If macOS has multiple interfaces with IPs in the same subnet (for example connected both via ethernet and Wi-Fi), it could respond to WireGuard packets not from the interface they are received from, but from the other one, even when the wg endpoint was explicitly set to be a specific IP:PORT in Omni config. And this was breaking wg handshakes. The core issue seems to be the wireguard-go library not implementing sticky sockets (`IP_PKTINFO`) on macOS. While investigating, we found that the standard wireguard-go `StdNetBind` always binds to all interfaces (`0.0.0.0`), ignoring any specific host in the endpoint configuration. Add a custom bind implementation that respects the configured host. This fixes the macOS issue as a side benefit. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2026-01-29 12:15:32 +01:00
Oguz Kilcan	4978834232	test: fix failing workload proxy tests Fix failing workload proxy tests Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-01-28 13:32:27 +01:00
Edward Sammut Alessi	0f8a3d6c6f	test(e2e): add an e2e test for exposed services Add an E2E test which adds an nginx service through an inlineManifests config patch on the control plane, and check that it is accessible. Signed-off-by: Edward Sammut Alessi <edward.sammutalessi@siderolabs.com>	2026-01-23 18:57:19 +01:00
Artem Chernyshev	fd82327c26	release(v1.5.0-beta.0): prepare release This is the official v1.5.0-beta.0 release. Signed-off-by: Artem Chernyshev <artem.chernyshev@talos-systems.com>	2026-01-23 15:57:48 +03:00
Edward Sammut Alessi	d3ae77c0cc	chore: bump copyright to 2026 Bump copyright for conformance to 2026 Signed-off-by: Edward Sammut Alessi <edward.sammutalessi@siderolabs.com>	2026-01-21 15:30:49 +01:00
Oguz Kilcan	f56551abc3	chore: move some tests from e2e upgrades e2e test to misc upgrades test Move some tests from e2e-upgrades test to e2e-misc-upgrades to speed up the overall speed, because the test was taking too much time. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-01-17 14:50:36 +01:00
Oguz Kilcan	2d5e58cbac	chore: rekres and bump deps * rekres * bump deps * bump go to 1.25.6 * fix linter errors Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-01-16 11:15:02 +01:00
Edward Sammut Alessi	8f6d017077	chore: bump node to 24.13 Bump node to 24.13 to address CVEs https://nodejs.org/en/blog/vulnerability/december-2025-security-releases Signed-off-by: Edward Sammut Alessi <edward.sammutalessi@siderolabs.com>	2026-01-15 16:33:56 +01:00
Oguz Kilcan	85d099489f	chore: separate integration-tests Separate integration/e2e tests for qemu and talemu, so we can run them in parallel. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-01-14 15:45:59 +01:00
Oguz Kilcan	ef2d931aac	chore: rekres and bump deps * Rekres * Bump deps * Update default versions for talos and kubernetes Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2026-01-09 11:34:03 +01:00
Utku Ozdemir	9bf690ef2e	refactor: do SQLite migrations unconditionally, rework the config flags Remove the flags for turning on SQLite storage for: - Discovery service state - Audit logs - Machine logs Instead, migrate them unconditionally to SQLite on the next startup. Remove many flags which are no longer meaningful. Only keep the ones which are required for the migrations. Additionally: Make the `--sqlite-storage-path` (or its config counterpart `.storage.sqlite.path`) required with no default value, as a default value does not make sense for it in most of the cases. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-12-12 12:47:04 +01:00
Oguz Kilcan	7b3ffa2a56	release(v1.4.0-beta.0): prepare release This is the official v1.4.0-beta.0 release. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2025-12-10 16:20:29 +01:00
Oguz Kilcan	bc2a5a9986	chore: prepare omni with talos v1.12.0-beta.1 Prepare omni for upcoming talos version 1.12.0-beta.1. Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2025-12-06 16:55:35 +01:00
Utku Ozdemir	52360252e6	fix: do not clear schematic meta values for non-UKI machines META section updates are no-op for non-UKI machines, but still, the recent changes in the kernel args PR started clearing them (since now we compute schematic ID always), causing the schematic ID to be updated, which caused cluster machines to be upgraded and restarted. Remove the UKI check and keep meta values always as-is. Update the integration tests to: - Also include META values. - Make Omni upgrade test pick both UKI and non-UKI machines. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-11-20 12:41:18 +01:00
Utku Ozdemir	db97e09291	chore: bump Kubernetes version to 1.34.2 Updated the default Kubernetes version to 1.34.2 and adjusted related version constants in the integration script and Go files. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-11-14 16:57:00 +01:00
Utku Ozdemir	7468e6ea02	chore: rekres, make linters happy, bump Go, deps and Talos versions Bump Go to 1.25.4, default Talos version to 1.11.5. Bump all Go dependencies. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-11-10 23:51:22 +01:00
Andrey Smirnov	75a9f3ee9f	feat: use sqlite as secondary resource storage This pulls in https://github.com/cosi-project/state-sqlite/pull/2 Fixes https://github.com/siderolabs/omni/issues/1770 See https://github.com/siderolabs/omni/issues/1768 Sample migration logs: ``` 2025-11-05T11:18:47.340Z INFO omni/state_sqlite.go:122 migrated resources from BoltDB to SQLite {"namespace": "metrics", "type": "EtcdBackupOverallStatuses.omni.sidero.dev", "count": 1} 2025-11-05T11:18:47.340Z INFO omni/state_sqlite.go:122 migrated resources from BoltDB to SQLite {"namespace": "metrics", "type": "EtcdBackupStatuses.omni.sidero.dev", "count": 0} 2025-11-05T11:18:47.342Z INFO omni/state_sqlite.go:122 migrated resources from BoltDB to SQLite {"namespace": "metrics", "type": "MachineStatusLinks.omni.sidero.dev", "count": 2} 2025-11-05T11:18:47.342Z INFO omni/state_sqlite.go:67 removed old BoltDB database after migration {"path": "_out/secondary-storage/bolt.db"} ``` Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>	2025-11-05 15:40:24 +04:00
Utku Ozdemir	31d4213035	fix: remove non-machinery Talos import, fix changelog Changelog included the whole of Talos due to the unwanted dependency of `github.com/siderolabs/talos` brought in by a test. Remove that dependency, and re-generate the changelog for `v1.3.0-beta.0`. Also, bump the Talos machinery version and rekres, which also bumps Go version to `1.25.3`. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-10-30 19:51:09 +01:00
Utku Ozdemir	bb582359da	release(v1.3.0-beta.0): prepare release This is the official v1.3.0-beta.0 release. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-10-30 16:33:01 +01:00
Edward Sammut Alessi	9d3ae445d2	chore(frontend): update node to latest lts Update the node version used by the frontend to the latest LTS version 24.11.0 Signed-off-by: Edward Sammut Alessi <edward.sammutalessi@siderolabs.com>	2025-10-29 11:59:37 +01:00
Utku Ozdemir	15deddde56	feat: implement extra kernel args support (Re)implement the kernel args support functionality in the following way: - Only support UKI or UKI-like (>=1.12 with GrubUseUKICmdline) systems. - In `MachineStatusController`: - When we see a machine for the first time, do a one-time operation of extracting the extra kernel args from it and store them in the newly introduced `KernelArgs` resource. This resource is user-owned from that point on. - Mark the `MachineStatus` with an annotation as "its kernel args are initialized". - Start storing the raw schematic. - Take a one-time snapshot of the extensions on the machine and set them as "initial extensions". They might not be the "actual initial", i.e., the set of extensions when we actually saw the machine for the first time, but we do this on a best-effort basis. We need this, since now we cannot simply go back to the initial schematic ID when all extensions are removed - kernel args are also included in the schematic. - Start collecting the kernel cmdline from Talos machines as well. - Adapt the `SchematicConfiguration` controller to not revert to the initial schematic ID ever - it now always computes the needed schematic - when it wants to revert to the initial set of extensions, it uses the new field on the `MachineStatus`. - Introduce the resource `MachineUpgradeStatus` and its controller `MachineUpgradeStatusController`, which handles the maintenance mode upgrades when kernel args are updated. The controller is named this way, since our long-term plan is to centralize all upgrade calls to be done from this controller. Currently, it does not change Talos version or the set of extensions. It works only in maintenance mode, only for kernel args changes (when supported). - Introduce the resource `KernelArgsStatus` and its controller `KernelArgsStatusController`, which provides information about the kernel args updates. Its status is reliable in both maintenance and non-maintenance modes. - Build a UI to update these args (with @Unix4ever's help). Co-authored-by: Artem Chernyshev <artem.chernyshev@talos-systems.com> Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-10-28 14:44:48 +01:00
Utku Ozdemir	02425267fe	test: improve integration tests - Make sure the console output of QEMU is sent to `console=ttyS0` when non-UKI is used. - Use the new `cluster create` arg `--skip-injecting-extra-cmdline` to make sure `console=ttyS0` kernel arg is not duplicated. - Get rid of `SUDO_USER` var. - Add the missing `--omni.output-dir` flag to make sure the support bundles are collected to proper destinations. - Gather all artifacts to be collected under `TEST_OUTPUTS_DIR` for better organization in the test artifacts archive. - Quote some strings. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-10-28 09:53:03 +01:00
Artem Chernyshev	b5765d8d1c	test: use bridge IP for WireGuard in CI It was using local pod IP which was generating new schematic every time the test runs. Signed-off-by: Artem Chernyshev <artem.chernyshev@talos-systems.com>	2025-10-20 12:22:58 +03:00
Utku Ozdemir	d0c8b1666b	chore: bump Talos to 1.11.3, reorder CI workflow jobs Make unit tests and lint run before the integration tests. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-10-17 13:30:35 +02:00
Utku Ozdemir	049ab877e9	chore: revert 'feat: add support for updating kernel args' This reverts commit ae9d7cca4b3ef2c5923cc6476042a575d4158eee. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-10-14 18:40:58 +02:00
Oguz Kilcan	0d58ade7bf	feat: implement cluster import Allow importing existing talos cluster to Omni using `omnictl cluster import` Closes: #1315 Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2025-10-14 17:01:07 +02:00
Utku Ozdemir	c88503dcba	chore: bump default Talos version, deps, rekres, re-generate Bump everything to appropriate versions. Remove some unused imports. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-10-13 11:23:45 +02:00
Edward Sammut Alessi	ef6584f951	chore(frontend): update dependencies Update frontend dependencies Signed-off-by: Edward Sammut Alessi <edward.sammutalessi@siderolabs.com>	2025-10-08 21:25:17 +02:00
Oguz Kilcan	d27624abc6	chore: rekres and bump go to 1.25.2 Rekres, fix linter issues, bump go to 1.25.2 See groups.google.com/g/golang-nuts/c/Gxn25BP4MXk/m/3KrM-XBOBAAJ Signed-off-by: Oguz Kilcan <oguz.kilcan@siderolabs.com>	2025-10-08 13:22:55 +02:00
Utku Ozdemir	ae9d7cca4b	feat: add support for updating kernel args Allow updating kernel args, similar to the set of extensions. Signed-off-by: Utku Ozdemir <utku.ozdemir@siderolabs.com>	2025-10-08 12:08:53 +02:00
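
Commit 72cb85a4ff describes a shared token-bucket limiter that drops packets exceeding the budget, with 0 meaning unlimited. The following is a minimal Python sketch of that idea, not Omni's Go implementation. The class name `PacketLimiter`, the helper `mbps_to_bytes_per_second`, the decimal interpretation of "Mbps" (1 Mbit = 1,000,000 bits), and the injected `now` timestamp are all assumptions made for illustration.

```python
def mbps_to_bytes_per_second(mbps: float) -> float:
    """Convert megabits per second to bytes per second.

    Assumes decimal megabits (1 Mbit = 1_000_000 bits); this is an assumption.
    """
    return mbps * 1_000_000 / 8


class PacketLimiter:
    """Hypothetical token bucket shared by inbound and outbound traffic."""

    def __init__(self, limit_mbps: float, burst_bytes: int) -> None:
        self.limit_mbps = limit_mbps          # stands in for bandwidthLimitMbps
        self.burst_bytes = float(burst_bytes)  # stands in for bandwidthLimitBurstBytes
        self.tokens = self.burst_bytes         # the bucket starts full
        self.last = 0.0                        # timestamp of the previous refill

    def allow(self, packet_len: int, now: float) -> bool:
        """Return True to forward the packet, False to drop it."""
        if self.limit_mbps == 0:
            return True  # 0 = unlimited, as stated in the commit message

        # Refill in proportion to elapsed time, capped at the burst size.
        rate = mbps_to_bytes_per_second(self.limit_mbps)
        elapsed = max(0.0, now - self.last)
        self.last = now
        self.tokens = min(self.burst_bytes, self.tokens + elapsed * rate)

        if packet_len > self.tokens:
            return False  # over budget: drop and let TCP back off
        self.tokens -= packet_len
        return True
```

Passing the timestamp in explicitly keeps the sketch deterministic; a real limiter would read a monotonic clock and would need locking when both directions share one bucket.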

151 Commits
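
Commit 1b7fa208d4 switches the database size metric to page_count * page_size so that freelist pages are counted. A minimal Python sketch of that calculation follows, using the standard SQLite pragmas `page_size`, `page_count`, and `freelist_count`. The function name `sqlite_size_metrics` and the dictionary keys are hypothetical, and computing the freelist size as freelist_count * page_size is an assumption, since the commit message does not give the formula for that metric.

```python
import sqlite3


def sqlite_size_metrics(conn: sqlite3.Connection) -> dict:
    """Compute database and freelist sizes in bytes from SQLite pragmas."""
    page_size = conn.execute("PRAGMA page_size").fetchone()[0]
    page_count = conn.execute("PRAGMA page_count").fetchone()[0]
    freelist_count = conn.execute("PRAGMA freelist_count").fetchone()[0]
    return {
        # Every page in the database, including free (unused) pages.
        "db_size_bytes": page_count * page_size,
        # Assumed formula: pages on the freelist times the page size.
        "db_freelist_size_bytes": freelist_count * page_size,
    }
```

Called on an open connection, the function returns both values in one pass, so the two numbers describe the same database state when no writes happen between the pragma reads.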