`config.Container` is a multi-doc container which implements both the
`Container` interface (encoding, validation, etc.) and the `Config`
interface (accessing parts of the config).
Refactor `generate` and `bundle` packages to support multi-doc, and
provide backwards compatibility.
Implement a first (mostly example) machine config document for
SideroLink API URL.
Many places don't properly support multi-doc yet (e.g. config patches).
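For illustration, a minimal sketch of what such a container could look like, assuming simplified `Container`/`Config` interfaces and a made-up `SideroLinkConfig` document type (not the actual machinery API):

```go
package config

// Document is a single machine configuration document inside a multi-doc
// container (hypothetical interface for illustration).
type Document interface {
	Kind() string
}

// Container groups encoding/validation concerns; Config exposes access to
// parts of the configuration. Both method sets are assumptions based on the
// description above.
type Container interface {
	Bytes() ([]byte, error)
	Validate() error
}

type Config interface {
	SideroLinkAPIURL() string
}

// multiDoc holds an ordered list of documents and implements both interfaces.
type multiDoc struct {
	documents []Document
}

func (c *multiDoc) Bytes() ([]byte, error) {
	// encoding elided in this sketch: each document would be marshaled and
	// the results joined with "---" separators
	return nil, nil
}

func (c *multiDoc) Validate() error {
	// validation elided: each document would be validated in turn
	return nil
}

// SideroLinkAPIURL scans the documents for the SideroLink config document.
func (c *multiDoc) SideroLinkAPIURL() string {
	for _, doc := range c.documents {
		if d, ok := doc.(*SideroLinkConfig); ok {
			return d.APIURL
		}
	}

	return ""
}

// SideroLinkConfig is an example machine config document (illustrative).
type SideroLinkConfig struct {
	APIURL string
}

func (c *SideroLinkConfig) Kind() string { return "SideroLinkConfig" }
```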
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
Fixes #7159
The change looks big, but it's actually pretty simple inside: the static
pods carry an annotation which tracks the version of the secrets, and a
change of that version forces the control plane pods to reload. At the same
time, `kube-apiserver` can reload certificate inputs from files
automatically, without a restart.
So the inputs were split: the dynamic inputs (for `kube-apiserver`) don't
need a reload, their version is not tracked in the static pod annotation,
and thus they don't cause a reload. The remaining non-dynamic resource still
causes a reload, but it no longer gets updated when e.g. node addresses
change.
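A rough sketch of the idea, with a hypothetical annotation key and made-up version fields (not the actual Talos controllers):

```go
package main

import "fmt"

// secretsVersions illustrates the split described above: the dynamic part
// (certificates kube-apiserver can re-read from files) is not reflected in
// the static pod annotation, so only staticVersion changes trigger a reload.
// Names are illustrative, not the actual Talos resources.
type secretsVersions struct {
	staticVersion  string // e.g. service account key, etcd client certs
	dynamicVersion string // e.g. serving cert regenerated on node address changes
}

// staticPodAnnotations builds the annotations for the static pod definition:
// only the non-dynamic secrets version is written into the annotation.
func staticPodAnnotations(v secretsVersions) map[string]string {
	return map[string]string{
		// hypothetical annotation key for illustration
		"talos.dev/secrets-version": v.staticVersion,
	}
}

func main() {
	before := staticPodAnnotations(secretsVersions{staticVersion: "5", dynamicVersion: "17"})
	after := staticPodAnnotations(secretsVersions{staticVersion: "5", dynamicVersion: "18"})

	// the annotation (and thus the static pod definition) is unchanged,
	// so the dynamic cert rotation does not force a kube-apiserver restart
	fmt.Println(before["talos.dev/secrets-version"] == after["talos.dev/secrets-version"]) // true
}
```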
There is more refactoring that could be done here, as the resource chain is
a bit of a mess, but I wanted to keep the number of changes minimal to keep
this backportable.
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
There's a cyclic dependency on the siderolink library, which imports talos
machinery back. We will fix that after we get talos pushed under a new
name.
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
This is the first step towards replacing all import paths to be based on
`siderolabs/` instead of `talos-systems/`.
All updates contain no functional changes, just refactorings to adapt to
the new path structure.
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
It wasn't used when building the endpoint to the local API server, so
Talos couldn't talk to the local API server when the port was changed from
the default one.
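A minimal sketch of the fix, with a hypothetical helper name; the point is only that the configured port has to make it into the local endpoint URL:

```go
package main

import (
	"fmt"
	"net"
	"net/url"
)

// localAPIServerEndpoint builds the URL used to reach the node-local
// kube-apiserver. Before the fix the configured port was effectively ignored
// and the default was always used.
func localAPIServerEndpoint(port int) *url.URL {
	return &url.URL{
		Scheme: "https",
		Host:   net.JoinHostPort("localhost", fmt.Sprintf("%d", port)),
	}
}

func main() {
	fmt.Println(localAPIServerEndpoint(8443)) // https://localhost:8443
}
```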
Fixes #5706
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
Talos historically relied on the Kubernetes `Endpoints` resource (which
lists `kube-apiserver` endpoints) to find the other control plane members
of the cluster and connect to their `etcd` nodes (when the node-local etcd
instance is not up, for example). This method works great, but it relies on
the Kubernetes endpoint being up. If the Kubernetes API is down for whatever
reason, or if the loadbalancer malfunctions, the endpoints are not available
and join/leave operations don't work.
This PR replaces the endpoints lookup with the `Endpoints` COSI resource,
which is filled in using two methods:
* from the discovery data (if discovery is enabled, which it is by default)
* from the Kubernetes `Endpoints` resource
If discovery is disabled (or not available), this change does almost
nothing: Kubernetes is still used to discover the control plane endpoints,
but as the data persists in memory, even if the Kubernetes control plane
endpoint goes down, the cached copy will be used to connect.
If discovery is enabled, Talos can join the etcd cluster immediately on
boot without waiting for Kubernetes to be up on the bootstrap node, which
means the initial bootstrap of a Talos cluster runs in parallel on all
control plane nodes; previously nodes were waiting for the first node to
finish bootstrapping far enough to fill in the endpoints data.
As `etcd` communication is protected with mutual TLS anyway, there's no
risk even if the discovery data is stale or poisoned: etcd operations would
fail on a TLS mismatch.
Most of the changes in this PR actually enable populating the Talos
`Endpoints` resource based on the Kubernetes `Endpoints` resource using
the watch API.
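A rough client-go sketch of that population path (the conversion into the COSI resource is elided, and the function is an assumption about the shape of the code, not the actual controller):

```go
package main

import (
	"context"
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

// watchAPIServerEndpoints watches the `kubernetes` Endpoints resource in the
// `default` namespace and yields the current set of control plane addresses.
func watchAPIServerEndpoints(ctx context.Context, client kubernetes.Interface, out chan<- []string) error {
	w, err := client.CoreV1().Endpoints("default").Watch(ctx, metav1.ListOptions{
		FieldSelector: "metadata.name=kubernetes",
	})
	if err != nil {
		return err
	}
	defer w.Stop()

	for ev := range w.ResultChan() {
		ep, ok := ev.Object.(*corev1.Endpoints)
		if !ok {
			continue
		}

		var addrs []string

		for _, subset := range ep.Subsets {
			for _, addr := range subset.Addresses {
				addrs = append(addrs, addr.IP)
			}
		}

		// in Talos these addresses would be written into the COSI Endpoints resource
		out <- addrs
	}

	return nil
}

func main() {
	config, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}

	client := kubernetes.NewForConfigOrDie(config)

	out := make(chan []string)

	go func() {
		for addrs := range out {
			fmt.Println("control plane endpoints:", addrs)
		}
	}()

	if err := watchAPIServerEndpoints(context.Background(), client, out); err != nil {
		panic(err)
	}
}
```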
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
Fixes #4418
Only one resource (one of the very first ones) was polymorphic: its
actual spec type depended on its ID. This was a bad idea, and it doesn't
work with protobuf specs (as a type <-> protobuf relationship can't be
established).
Refactor this by splitting into three separate resource types:
`OSRoot` (OS-level root secrets), `EtcdRoot` (for etcd),
`KubernetesRoot` (for Kubernetes).
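In code terms, the split replaces the ID-dependent spec with one fixed spec per resource type, roughly like this (field sets are illustrative, not the actual secret specs):

```go
package secrets

// Before, a single resource's spec type depended on its ID, which breaks the
// one-to-one type <-> protobuf mapping. After, each resource has exactly one
// concrete spec type.

// OSRootSpec holds OS-level root secrets (illustrative fields).
type OSRootSpec struct {
	CA    []byte
	Token string
}

// EtcdRootSpec holds etcd root secrets (illustrative fields).
type EtcdRootSpec struct {
	EtcdCA []byte
}

// KubernetesRootSpec holds Kubernetes root secrets (illustrative fields).
type KubernetesRootSpec struct {
	KubernetesCA      []byte
	ServiceAccountKey []byte
}
```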
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
With the last changes, `kube-apiserver` certificates are generated based
on the assigned `NodeAddresses`, machine configuration, etc. Whenever the
certificate is regenerated, `kube-apiserver` is reloaded to pick up the
new cert.
With Virtual IP enabled, the Virtual IP address is included in the
certificate from the beginning, as it is specified in the machine
configuration. But as the virtual IP moves between the nodes, this causes a
`NodeAddresses` update, which triggers the controller, generates new
certs and reloads `kube-apiserver` at a bad time (right after the VIP got
moved). Even though the generated cert is identical to the previous one,
the API server reload makes it unavailable for 30-90 seconds.
This change extracts `CertSANs` as a separate resource, so that its updates
are suppressed when the CertSANs sources change but the final list stays
the same, which in turn prevents the final certificate from being updated.
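A sketch of the suppression idea with hypothetical helper names: the SAN list is recomputed whenever its inputs change, but the `CertSANs` resource is only touched when the resulting list actually differs, so the downstream certificate controller does not fire.

```go
package main

import (
	"fmt"
	"sort"
)

// updateCertSANs recomputes the SAN list and reports whether it changed,
// which is the condition for touching the CertSANs resource (and thus for
// the certificate to be regenerated). Hypothetical helper for illustration.
func updateCertSANs(current, nodeAddresses, configSANs []string) ([]string, bool) {
	merged := append(append([]string{}, nodeAddresses...), configSANs...)
	sort.Strings(merged)

	if equal(current, merged) {
		return current, false // suppress the update: final list is unchanged
	}

	return merged, true
}

func equal(a, b []string) bool {
	if len(a) != len(b) {
		return false
	}

	for i := range a {
		if a[i] != b[i] {
			return false
		}
	}

	return true
}

func main() {
	current := []string{"10.5.0.2", "172.20.0.1", "kubernetes.default"}

	// The VIP moved to this node: NodeAddresses changed, but the final SAN
	// list did not, so no update is emitted and kube-apiserver keeps running.
	_, changed := updateCertSANs(current, []string{"172.20.0.1", "10.5.0.2"}, []string{"kubernetes.default"})
	fmt.Println(changed) // false
}
```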
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
Fixes #4139
This builds the local (for the node) `Affiliate` structure which
describes the node for cluster discovery. Depending on the configuration,
KubeSpan information might be included as well.
`NodeAddresses` were updated to hold CIDRs instead of plain IPs.
The `Affiliate` will be pushed to the registries, while `Affiliate`s for
other nodes will be fetched back from the registries.
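Roughly, the local `Affiliate` can be pictured like this (field names are approximations for illustration; the KubeSpan part is only filled in when KubeSpan is enabled):

```go
package cluster

import "net/netip"

// Affiliate describes this node for cluster discovery (illustrative fields).
type Affiliate struct {
	NodeID      string
	Hostname    string
	MachineType string

	// Addresses mirror NodeAddresses, which now hold CIDRs rather than plain IPs.
	Addresses []netip.Prefix

	// KubeSpan is nil when KubeSpan is disabled in the configuration.
	KubeSpan *KubeSpanAffiliate
}

// KubeSpanAffiliate carries the KubeSpan-specific discovery data.
type KubeSpanAffiliate struct {
	PublicKey           string
	Address             netip.Addr
	AdditionalAddresses []netip.Prefix
}
```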
Signed-off-by: Andrey Smirnov <andrey.smirnov@talos-systems.com>
This is a PR on the path towards removing `ApplyDynamicConfig`.
It fixes Kubernetes API server certificate generation to use dynamic
data to generate a cert with proper SANs for the node's IPs.
As part of that, apid certificate generation was refactored a bit (without
any functional changes).
Two unit tests were added for apid and Kubernetes certificate generation.
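A minimal standard-library sketch of the idea (Talos uses its own crypto helpers, so this only illustrates feeding the node's current IPs, rather than only static config values, into the certificate SANs):

```go
package main

import (
	"crypto/ecdsa"
	"crypto/elliptic"
	"crypto/rand"
	"crypto/x509"
	"crypto/x509/pkix"
	"encoding/pem"
	"fmt"
	"math/big"
	"net"
	"time"
)

// generateServingCert issues a self-signed certificate whose SANs are built
// from the node's current addresses plus configured DNS names.
func generateServingCert(ips []net.IP, dnsNames []string) ([]byte, error) {
	key, err := ecdsa.GenerateKey(elliptic.P256(), rand.Reader)
	if err != nil {
		return nil, err
	}

	template := x509.Certificate{
		SerialNumber: big.NewInt(1),
		Subject:      pkix.Name{CommonName: "kube-apiserver"},
		NotBefore:    time.Now(),
		NotAfter:     time.Now().Add(365 * 24 * time.Hour),
		IPAddresses:  ips,      // dynamic: the node's current addresses
		DNSNames:     dnsNames, // static: names from the machine configuration
		KeyUsage:     x509.KeyUsageDigitalSignature,
		ExtKeyUsage:  []x509.ExtKeyUsage{x509.ExtKeyUsageServerAuth},
	}

	der, err := x509.CreateCertificate(rand.Reader, &template, &template, &key.PublicKey, key)
	if err != nil {
		return nil, err
	}

	return pem.EncodeToMemory(&pem.Block{Type: "CERTIFICATE", Bytes: der}), nil
}

func main() {
	cert, err := generateServingCert(
		[]net.IP{net.ParseIP("10.5.0.2"), net.ParseIP("172.20.0.1")},
		[]string{"kubernetes", "kubernetes.default"},
	)
	if err != nil {
		panic(err)
	}

	fmt.Printf("%s\n", cert[:27]) // -----BEGIN CERTIFICATE-----
}
```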
Signed-off-by: Andrey Smirnov <smirnov.andrey@gmail.com>