963 Commits

Author SHA1 Message Date
Andrey Smirnov
4f11f021de
feat: implement etcd encryption config (kube-apiserver)
This replaces existing fixed field for etcd encryption with a completely
flexible configuration which exactly matches upstream kube-apiserver
configuration.

The default machine configuration generated still retains previous
defaults.

New configuration allows:

* rotating etcd encryption secrets
* implementing any encryption policies (e.g. encrypting configmaps).

Fixes #10899

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-05-04 19:35:23 +04:00
Andrey Smirnov
13ce018795
fix: re-enable kexec on arm64
The upstream kernel bug should be fixed now.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-05-01 20:42:11 +04:00
Mateusz Urbanek
dedb7a96c1
fix(talosctl): protect k8sNames map writes with mutex
Concurrent goroutines wrote to k8sNames without synchronization,
causing a data race during drain. The comment claiming "no mutex
needed" was wrong - each goroutine writes a different key, but
the Go map implementation is not safe for concurrent writes.

Fixes #13247

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-04-30 14:36:43 +02:00
Andrey Smirnov
837a9ed077
feat: move host DNS config into ResolverConfig
This deprecates more `.machine.features`, allows host DNS to be enabled
in maintenance mode.

Fixes #12438

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-04-27 20:11:01 +04:00
Mateusz Urbanek
96a8ecd1ee
feat: default to factory installer image
Defaults to installer image from factory.talos.dev. Default images now
use schematic hash naming (metal-installer/<hash>) instead of
registry-based naming.

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-04-27 15:46:13 +02:00
Andrey Smirnov
c2948cef23
feat: support auth for Image Factory in cluster create
Allows to authenticate to Image Factory (if Image Factory is configured
for auth), applies for HTTP downloads (e.g. ISO), and injects registry
auth into Talos as well.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-04-23 22:21:00 +04:00
Mateusz Urbanek
3db14309e0
fix(talosctl): ensure uncordon runs after reboot/upgrade errors
Use defer blocks and error joining to guarantee uncordon cleanup
runs regardless of reboot/upgrade success or failure. Prevents nodes
from staying cordoned when operations fail.

Also added gRPC keepalive params to prevent timeout issues during
long operations.

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-04-23 12:35:43 +02:00
Benoît Knecht
21f459aab5
fix(talosctl): always use default GRPC dial options
When run in "normal" mode, `talosctl` takes into account proxy
configuration, such as the `https_proxy` and `no_proxy` environment
variables; but when invoked with `--insecure`, those would be ignored,
which results in `talosctl` being unable to interact with nodes in
maintenance mode if they're only reachable through a proxy.

This commit adds the `WithDefaultGRPCDialOptions()` option to the
client created by `WithClientMaintenance()`, same as `WithClient()`.

Signed-off-by: Benoît Knecht <benoit.knecht@proton.ch>
2026-04-10 09:00:13 +02:00
Mateusz Urbanek
52b920032e
feat: add client-side Kubernetes node drain to reboot and upgrade commands
Add --drain and --drain-timeout flags to `talosctl reboot` (default off)
and `talosctl upgrade` (default on) that cordon and drain the Kubernetes
node before rebooting, then wait for Ready and uncordon after it comes
back. When --drain is enabled, --wait is forced to true.

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-04-08 18:29:19 +02:00
Andrey Smirnov
968ec1e0ca
refactor: propagate NAME properly, allow to set on build
Allow to set build NAME on build, propagate it down to more consumers.

Expose name in `Version` resource, and use that in the dashboard
next to Talos version.

Fix some places where `Name` was hardcoded.

Propagate Name down to UKI build.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-04-08 17:57:43 +04:00
Dharsan Baskar
34fb2cbe51
refactor: remove manual shell completion and replace with cobra completion
Fixes #13066

Signed-off-by: Dharsan Baskar <git@dharsanb.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-04-05 22:41:01 +04:00
Mateusz Urbanek
7b6ab0c1c3
feat: add flag to force fallback to legacy upgrade
Forcing fallback to legacy flow, might be wanted by some users.

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-04-03 07:41:48 +02:00
Andrey Smirnov
5e24d5265b
feat: add resource view to talosctl dashboard
Fixes #12933

There are many usecases for this:

* exploring resources and state of the system, learning available
  resources
* when a Talos machine is booted up in an environment without network
  access, learning all available network interfaces, all disks
  available, etc.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-04-01 19:20:25 +04:00
Mateusz Urbanek
10cdfa9099
fix: drop talosctl install
It does not make sense without SideroLink, and will not work,
as it requires Admin role.

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-04-01 12:35:42 +02:00
Noel Georgi
919d8c3655
chore: drop debug shell
Now that talos has native `talosctl debug` `WITH_DEBUG_SHELL` seems not needed.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2026-03-30 11:06:29 +02:00
Kevin Tijssen
786bf00abb
feat: add --platform=all support to image cache-create
Add support for caching all platforms in a multi-platform image index
by passing --platform=all to the images cache-create command.

When all is specified, the index manifest is fetched without platform
resolution, and each platform-specific image is downloaded individually.
Attestation manifests (unknown/unknown) are included.

Include the platform in the fetch log line so each pull is identifiable,
e.g. fetching image "..." (linux/amd64).

Signed-off-by: Kevin Tijssen <kevin.tijssen@siderolabs.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-03-25 08:59:17 +04:00
Andrey Smirnov
ad72c73006
test: improve maintenance API provision tests
Add a test that covers all maintenance APIs in general.

Add a test for transition from SideroLink.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-03-24 22:00:27 +04:00
Andrey Smirnov
a02d578faa
feat: add support for mirroring image signatures
Enable signature verification in image cache & airgapped tests.

Fixes #12892

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-03-09 20:07:51 +04:00
Laura Brehm
7f2eb48561
feat: add image verification endpoint
Add support for whole machine-wide image verification configuration.
Configuration is a set of rules applied top-down to the image reference,
each specifying a specific cosign-based identity or static public key
claim.

Talos provides a machined API to verify an image reference, resolving it
to the digest on the way as needed.

Talos itself hooks up in the image verification process, while
containerd CRI plugin accesses same API via the machined socket.

Signed-off-by: Laura Brehm <laurabrehm@hey.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-03-06 20:06:07 +04:00
Mateusz Urbanek
1e4cd20d23
feat: add talosctl install command and upgrade via LifecycleService
Add new `talosctl install` command using the LifecycleService.Install
streaming API with support for insecure (maintenance) mode and progress
reporting. Refactor `talosctl upgrade` to use the new
LifecycleService.Upgrade streaming API with automatic fallback to the
legacy MachineService.Upgrade path for older Talos versions.

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-03-06 15:40:54 +01:00
Orzelius
d417d68e0d
feat: bring in new ssa logic
drop the old cli-utils based manifest apply logic and replace it with the new fluxcd/pkg/ssa based implementation

Signed-off-by: Orzelius <33936483+Orzelius@users.noreply.github.com>
2026-03-02 19:37:31 +09:00
Jan Paul
c8800b41e5
fix: update path handling on talosctl cgroups
The call to filepath.join in current code causes breakage when using talosctl on windows due to wrong slash introduced into the embed path.

Signed-off-by: Jan Paul <paulj@nerakhon.cz>
2026-02-23 15:12:50 +01:00
Andrey Smirnov
8b1c974a2a
refactor: drop termui-widgets library
Use the library built for tview, so that we don't have to have two UI
libraries working in parallel in the same TUI.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-21 16:24:26 +04:00
Dmitrii Sharshakov
9758bd4fe0
feat: update Go to 1.26
Via tools/pkgs, also pulling in Clang-built Linux

Update go.mod dependencies

Fix linter errors with new golangci-lint, modernize, use new()

Signed-off-by: Dmitrii Sharshakov <dmitry.sharshakov@siderolabs.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-19 22:15:19 +01:00
Andrey Smirnov
f20445ad09
chore: improve logging of disk encryption handling
Pulls in KMS with logging, and adds more logging to Talos.

This allows to debug encryption problems better.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-19 19:06:42 +04:00
Orzelius
ce53ffa900
fix: disks flag parsing and handling in create qemu command
The disks flag Set method was appending new disk requests to the existing ones,
which caused duplicate disk entries when custom values for the disks flag were set.

Signed-off-by: Orzelius <33936483+Orzelius@users.noreply.github.com>
2026-02-16 08:24:40 +09:00
Andrey Smirnov
f118ee47ea
fix: read multi-doc machine config with newer talosctl
This affects reading multi-doc machine config via `talosctl` from Talos
machines v1.11 and below by `talosctl` of v1.12 and up.

The problem is that before v1.12 Talos returned machine config as
embedded document instead of the spec if the resource, which was not
valid YAML. It worked via hacks we used in our fork of yaml library.

Talos v1.12+ cleans that up by marshaling the config as a string, and
drops the forked library. The problem is that we can't still pass
multi-doc YAML via this path, so we have to resort to going into COSI
internals to retrieve the actual value as written by Talos API.

Note: there is no problem for Omni, as it goes via protobuf path which
hasn't been affected.

Fixes #12787

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-13 16:26:59 +04:00
Sébastien Masset
87615f5511
feat: implement network policies with Flannel CNI
Align flannel ClusterRole with upstream chart template (cf.
https://github.com/flannel-io/flannel/blob/master/chart/kube-flannel/templates/rbac.yaml)

Add boolean in cluster flannel CNI config to deploy extra resources to
handle network policies. Inspired by flannel Helm chart handling of
netpol.enabled value (cf. https://github.com/flannel-io/flannel/blob/master/Documentation/netpol.md)

Signed-off-by: Sébastien Masset <86793256+smasset-orange@users.noreply.github.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-10 15:47:49 +04:00
Andrey Smirnov
9690dbad02
chore: bump tools (including linter)
Re-generate, fix new linting issues.

Update containerd library to the latest 2.2.1 to address the new cgroups
package import (via tools update).

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-09 13:07:35 +04:00
Andrey Smirnov
d5ebcd7cae
fix: stop building talosctl debug on Windows
This command is very specific to terminal operations which don't exist
or might not work well enough on Windows.

Windows users will have better luck with WSL and Linux talosctl.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-06 13:12:23 +04:00
Laura Brehm
d43a01ccbd
feat: implement talosctl debug
This implements a way to run a debug container with a provided image on
the node.

The container runs with privileged profile, allowing to issue debugging
commands (e.g. using some advanced network tools) to troubleshoot a
machine.

Signed-off-by: Laura Brehm <laurabrehm@hey.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-04 21:26:09 +04:00
Noel Georgi
055add7aeb
release(v1.13.0-alpha.1): prepare release
This is the official v1.13.0-alpha.1 release.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2026-02-03 19:47:01 +05:30
Andrey Smirnov
8b245b8f26
feat: implement new image service APIs
These new APIs only support one2one proxying, so they don't have any
hacks, and look as regular gRPC APIs.

Old APIs are deprecated, but still supported.

Implement client-side multiplexing in `talosctl`, provide fallback to
old APIs for legacy Talos versions.

New APIs include removing an image, importing an image.

Extracted from #12392

Co-authored-by: Laura Brehm <laurabrehm@hey.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-02 15:55:56 +04:00
Andrey Smirnov
d90c775b84
chore: rename internal talosctl debug air-gapped
This command was always hidden, rename it to `debug-tool` to free up the
`talosctl debug` for #12932.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-02-02 14:25:01 +04:00
Andrey Smirnov
410d8cb572
fix: undo CRLF on Windows (talosctl edit)
Fixes #12664

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-01-26 19:45:39 +04:00
Andrei Kvapil
0bd48bbc6f
fix(talosctl): pass --k8s-endpoint flag to rotate-ca kubernetes rotation
The --k8s-endpoint flag was defined but never used in the rotate-ca
command. This fix passes the flag value through to the Kubernetes
client, allowing users to override the default Kubernetes API endpoint
during CA rotation.

Co-Authored-By: Claude <noreply@anthropic.com>
Signed-off-by: Andrei Kvapil <kvapss@gmail.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-01-26 18:28:49 +04:00
Andrey Smirnov
ddd6b186eb
refactor: generate GRUB images
Simplify the flow a bit by using live partition info,
avoid doing some calculations which are already done in the
partition code.

Remove some steps I believe we don't need to do.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-01-21 16:37:25 +04:00
Alexis La Goutte
634b71e2d0
docs: move talosctl pcap example to Example Block
it is before on Long but wrong display on docs website
(already use by image cache-create cmd)

Signed-off-by: Alexis La Goutte <alexis.lagoutte@gmail.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-01-14 14:35:56 +04:00
Pranav Patil
8184927316
feat: implement KubeSpan multi-document configuration
Migrate KubeSpan configuration to support multi-document format.
Add version-aware support for talosctl cluster create and gen config.
Uses multi-doc format for Talos 1.13+, legacy format for 1.12 and earlier.

Signed-off-by: Pranav Patil <pranavppatil767@gmail.com>
Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-01-13 16:08:11 +04:00
Andrey Smirnov
e4ef494dec
fix: drop the persist config flag from gen config
The `persist` value was locked to be true for a long time now, and Talos
doesn't support any other mode (machine config is persisted).

Drop the `gen config` flag and related generate options, as modern Talos
doesn't accept `persist: false`.

Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com>
2026-01-12 19:53:50 +04:00
Mateusz Urbanek
c3176adcf9
feat: add EnvironmentConfig document
Add new EnvironmentConfig document for configuring the Env vars.
Deprecate .Machine.Env

Closes #12439

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-01-12 15:10:20 +01:00
Orzelius
c839b38809
feat: expose more SSA options in the upgrade-k8s command
add the following flags to the upgrade-k8s command:
* `--force-conflicts`            overwrite the fields when applying even if the field manager differs
* `--inventory-policy` string    kubernetes SSA inventory policy (one of 'MustMatch', 'AdoptIfNoInventory' or 'AdoptAll') (default "AdoptIfNoInventory")
* `--no-prune`                   whether pruning of previously applied objects should happen after apply
* `--prune-timeout` int          how long to wait for resources to be pruned in secunds (set to zero to disable waiting for resources to be fully deleted) (default 180)
* `--reconcile-timeout` int      how long to wait for resources to be prfully reconciled in secunds (set to zero to disable waiting for resources to be fully reoondiled) (default 180)

Signed-off-by: Orzelius <33936483+Orzelius@users.noreply.github.com>
2026-01-12 21:17:43 +09:00
Mateusz Urbanek
080efcbda2
feat: add k8s-version parameter to k8s-bundle
Allow overriding K8s version in the  command.

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-01-07 10:26:39 +01:00
Noel Georgi
70e67787d6
feat: imager: populate filesystems with root owned files
Populate filesystems from source directories with root owned files.
This completes running imager fully rootless.

Fixes: #12498

Signed-off-by: Noel Georgi <git@frezbo.dev>
2026-01-05 21:13:42 +05:30
Mateusz Urbanek
7416dca593
fix: print talosctl images to release notes
After changing `talsoctl images k8s-bundle and talos-bundle`
we stopped printing some of the images to release notes.
This fixes that issue.

Signed-off-by: Mateusz Urbanek <mateusz.urbanek@siderolabs.com>
2026-01-05 15:01:57 +01:00
Noel Georgi
dc2009e477
chore: use context when creating filesystems
Pass in context when creating filesystems with `mkfs.*` commands.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2026-01-05 15:29:35 +05:30
Noel Georgi
226cd6bc1d
fix: do not allocate for the actual disk image file
Do not allocate space for the actual full disk image file, this defeats sparsing.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2025-12-26 15:03:15 +05:30
Noel Georgi
53f5bf8d2c
fix: overlay installers
Overlays installers assume the `/boot/EFI` path, so we generate assets into `/boot/EFI` then move that directory to the mountPrefix+/EFI.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2025-12-26 15:03:11 +05:30
Noel Georgi
77086694d1
fix: partition data population
`FileSystemTypeZeroes` and `FileSystemTypeNone` doesn't need data to be populated.

Signed-off-by: Noel Georgi <git@frezbo.dev>
2025-12-26 15:03:03 +05:30
Noel Georgi
4d5657b1a3
fix: drop SBC board code
Boards were deprecated in favor of overlays from Talos 1.7.

Now completely remove all board specific code.

Part of: #12492

Signed-off-by: Noel Georgi <git@frezbo.dev>
2025-12-26 14:45:17 +05:30