566 Commits

Author SHA1 Message Date
paulfantom
ea109b463d
jsonnet/kube-prometheus/kube-state-metrics: fix indentation 2020-09-18 15:49:46 +02:00
paulfantom
bb4b7019a7
jsonnet: do not use relative jb import paths and move imports to the top of the file 2020-09-18 15:46:22 +02:00
paulfantom
e8cc596699
jsonnet/kube-prometheus: use prometheus-operator mixin 2020-09-18 10:36:57 +02:00
Matthias Loibl
e6111e58c4
Fix node-mixin import 2020-09-11 18:09:05 +02:00
Matthias Loibl
09293866ae
Use absolute jsonnet imports whenever possible
There are still some dependencies that we need to make work to fully
deactivate the legacyImports in the future. I'll start opening PRs
against those other repositories.
2020-09-11 18:06:43 +02:00
Matthias Loibl
2d3c10e3f7
Update dependencies and most importantly PromOp to v0.42.0 2020-09-11 13:14:22 +02:00
Damien Grisonnet
9e18cf2097 jsonnet: bump kube-rbac-proxy to v0.6.0
Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2020-09-10 16:15:35 +02:00
Scott Dodson
ddc853e1ac node-exporter: set maxUnavailable to 10%
This daemonset doesn't affect workload availability so allow its rollout to
be parallelized.
2020-09-01 15:08:45 -04:00
Paweł Krupa
88fe47b35e
Merge pull request #652 from PerfectMemory/fix-probe-with-ingress-650 2020-08-31 15:09:10 +02:00
Michał Łazowik
28b717a984 Support utf8 in external file alertmanager config
This could have been achieved either by switching to stringData, or doing
`std.base64(std.encodeUTF8($._config.alertmanager.config))` as per
google/jsonnet#575

I went with the former, because it's:
1. Easier to read existing config
2. Consistent with the way jsonnet object-based config is written just above
2020-08-28 01:57:50 +02:00
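The trade-off described in commit 28b717a984 above can be sketched outside jsonnet. The Python below is only an illustration, not the project's code: the function names, the Secret name `alertmanager-example`, and the key `alertmanager.yaml` are assumptions made for the example. It contrasts a Secret that carries base64 of the UTF-8 bytes under `data` with one that carries the plain text under `stringData`.

```python
import base64


def secret_via_data(config: str) -> dict:
    """Hypothetical helper: store the config under `data`.

    Values under `data` must be base64, so the text is first encoded
    to UTF-8 bytes and then base64-encoded.
    """
    encoded = base64.b64encode(config.encode("utf-8")).decode("ascii")
    return {
        "apiVersion": "v1",
        "kind": "Secret",
        "metadata": {"name": "alertmanager-example"},  # illustrative name
        "data": {"alertmanager.yaml": encoded},
    }


def secret_via_string_data(config: str) -> dict:
    """Hypothetical helper: store the config under `stringData`.

    The text stays human-readable in the manifest; the API server
    performs the encoding when the Secret is stored.
    """
    return {
        "apiVersion": "v1",
        "kind": "Secret",
        "metadata": {"name": "alertmanager-example"},  # illustrative name
        "stringData": {"alertmanager.yaml": config},
    }
```

Both forms carry the same bytes; the `stringData` form is the one the commit message calls easier to read.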
Julien Pervillé
4e458eeb2a Allow prometheus to get/list/watch extensions.ingresses 2020-08-20 10:58:26 +02:00
Julien Pervillé
d9d4eebd96 Add missing probeSelector: {} and probeNamespaceSelector: {} in manifests 2020-08-19 09:48:04 +02:00
Frederic Branczyk
8a8595b5f3
Merge pull request #628 from lilic/unpin-jb
Pin back to master and bump prometheus-operator to 0.41.1
2020-08-12 15:04:47 +02:00
KingJ
c62b9569d9 Thanos Sidecar Exposure and Scraping
Expose the Thanos Sidecar's HTTP metrics and gRPC StoreAPI interfaces via a dedicated service. ClusterIP is set to none to allow for full discovery of all endpoints behind the service via in-cluster DNS. This allows for a new ServiceMonitor to then scrape the metrics available on the Sidecar's HTTP port.

A new service has been used so as to separate out the metrics that Prometheus makes available, and the metrics the Thanos Sidecar makes available. The Thanos Mixins from thanos-io/thanos default to a job label of 'thanos-sidecar', and hence the service here has had this label applied.
2020-08-09 17:11:46 +01:00
Lili Cosic
5417071d15 jsonnet/prometheus-operator.libsonnet: Adjust alerts range 2020-08-06 10:56:11 +02:00
Lili Cosic
6c47a6e898 jsonnet/kube-prometheus/kube-state-metrics: Bump version 2020-08-06 10:04:25 +02:00
Lili Cosic
e254781636 jsonnet/kube-prometheus/jsonnetfile.json: Pin to master 2020-08-06 10:04:24 +02:00
paulfantom
87fcf52469
Adjust links after repository migration 2020-08-05 15:50:40 +02:00
Sergiusz Urbaniak
2326773ee1
jsonnet/kube-prometheus: pin dependencies 2020-07-31 10:18:24 +02:00
Frederic Branczyk
f0955e0540
Merge pull request #623 from brancz/add-kubelet-probes-metrics
Add scraping of endpoint for kubelet probe metrics
2020-07-29 12:57:28 +02:00
Frederic Branczyk
7c35752e3f
Add scraping of endpoint for kubelet probe metrics 2020-07-29 11:49:52 +02:00
Frederic Branczyk
b51b9b983f
prometheus-adapter: Collect metrics from Prometheus Adapter 2020-07-29 11:38:42 +02:00
Frederic Branczyk
6771c9bcc2
Merge pull request #616 from paulfantom/ciphers
Update default ciphers used by kube-rbac-proxy
2020-07-28 09:31:20 +02:00
paulfantom
8f85949438
jsonnet: update kube-rbac-proxy ciphers 2020-07-28 08:49:21 +02:00
tafkam
6dfbcf35f2 port https-metrics 2020-07-27 10:27:14 +02:00
tafkam
c1304caa28 update secure ports for other cluster 2020-07-25 18:30:07 +02:00
tafkam
4410a80e4e secure scheduler/controller metrics ports, kubeadm discovery services 2020-07-25 18:27:17 +02:00
Frederic Branczyk
40adbfae6c
Merge pull request #617 from paulfantom/node_filesystem_usage
Remove instance:node_filesystem_usage:sum
2020-07-23 21:25:55 +02:00
Simon Pasquier
fcf7a2fcbf jsonnet: update component versions 2020-07-23 17:06:48 +02:00
paulfantom
4e116aa7e2
jsonnet: remove incorrect instance:node_filesystem_usage:sum rule 2020-07-23 16:50:27 +02:00
Lili Cosic
5743540fbb prometheus-operator.libsonnet: Add List error alert and fix threshold to
Watch error alert
2020-07-15 10:24:45 +02:00
Lili Cosic
dfe9184c9b prometheus-operator.libsonnet: Add PrometheusOperatorWatchErrors alert 2020-07-13 17:35:36 +02:00
Lili Cosic
3865eacdb3 jsonnet/kube-prometheus: Bump default versions of prometheus and alertmanager 2020-07-09 11:48:22 +02:00
Abu Kashem
4d6e3d5c19
enable etcd latency metrics in kube-apiserver
kube-apiserver has a histogram etcd_request_duration_seconds that
measures latency between the kube-apiserver and etcd instance.
This metric is currently dropped by cluster-prometheus. Enable
this metric so we have visibility into etcd latency.

We ensured that this does not enable other unwanted metrics

count by(__name__) ({__name__=~"etcd_request.+"})

etcd_request_duration_seconds_bucket
etcd_request_duration_seconds_count
etcd_request_duration_seconds_sum
2020-07-03 09:49:56 -04:00
Simon Pasquier
bbd4e61fc1 Bump Grafana version to v6.7.4 2020-06-24 10:51:35 +02:00
Frederic Branczyk
1d41243b54
Merge pull request #579 from tommyjmquinn/master
Updated prometheus adapter deployment to use a multi arch image repo
2020-06-23 16:09:32 +02:00
Tom Quinn
e82acdb253 Updated prometheus adapter deployment to use a multi arch image repo 2020-06-22 13:57:41 +01:00
Kristoffer Dalby
f55a17718d Allow nodeExporter address to be configured 2020-06-21 09:11:16 +01:00
Kristoffer Dalby
6b4bc0bb26 Allow nodeExporter address to be configured 2020-06-21 08:28:48 +01:00
Frederic Branczyk
6f488250fd
Merge pull request #576 from simonpasquier/fix-alertmanager-config-inconsistent-alert
Fix AlertmanagerConfigInconsistent alert
2020-06-19 16:20:40 +02:00
Simon Pasquier
c3ea4675da Fix AlertmanagerConfigInconsistent alert
Previously the alert would fire when the number of Alertmanager pods
didn't match the number of replicas defined in the Alertmanager spec
even though all the running pods had the same configuration hash. This
type of issue is already covered by KubeStatefulSetUpdateNotRolledOut
(and possibly KubePodNotReady), having AlertmanagerConfigInconsistent
also active in this situation creates unnecessary noise.

With this change, the alert expression only returns when Alertmanager
pods have different configuration hash values irrespective of the number
of pod replicas. The message annotation has also been enhanced to report
the configuration hash for each pod.

Signed-off-by: Simon Pasquier <spasquie@redhat.com>
2020-06-19 14:30:55 +02:00
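The condition described in commit c3ea4675da above can be paraphrased in a few lines of Python. This is a sketch of the logic as the message states it, not the alert's actual PromQL expression; the function name and the pod-to-hash mapping are assumptions made for the example.

```python
from typing import Dict


def config_is_inconsistent(hash_by_pod: Dict[str, str]) -> bool:
    """Return True only when running pods report different config hashes.

    The number of pods is deliberately ignored: a missing replica is a
    rollout problem, not a configuration inconsistency.
    """
    return len(set(hash_by_pod.values())) > 1
```

With two of three expected replicas running and both reporting the same hash, this returns False, which matches the case the commit message says should no longer fire.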
Stavros Foteinopoulos
3cbc97d782
Update prometheus-adapter endpoint 2020-06-19 15:27:26 +03:00
Lili Cosic
53bb3431ad jsonnet/kube-prometheus/jsonnetfile.json: Bump prometheus-operator to
v0.40
2020-06-19 10:26:55 +02:00
Paul Gier
d1690d95f7 node_exporter: remove outdated comment and CLI arg
The ignored filesystem types now match the default, so the
comment and arg can be removed.
2020-05-12 17:14:05 -05:00
Paul Gier
69b6883033 node-exporter: ignore kubelet pod mounts
Ignore kubelet pod filesystem mounts of the form:
/var/lib/kubelet/pods/1b260ce7-e75d-44d4-8409-922d2bd0851f/volumes...
Metrics for these volumes are available via the kubelet_volume_stats*
metrics.
2020-05-12 17:12:36 -05:00
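As a rough illustration of the mount filtering described in commit 69b6883033 above, the Python below tests mount points against a pattern for kubelet pod directories. The regular expression and the function name are assumptions made for this sketch; the exact pattern passed to node-exporter is not shown in this log.

```python
import re

# Assumed pattern: anything below /var/lib/kubelet/pods/ is treated as a
# kubelet pod mount. The real node-exporter argument may differ.
KUBELET_POD_MOUNT = re.compile(r"^/var/lib/kubelet/pods/.+")


def is_ignored_mount(mount_point: str) -> bool:
    """Return True when the mount point lies under a kubelet pod directory."""
    return KUBELET_POD_MOUNT.match(mount_point) is not None
```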
Frederic Branczyk
f58d7b5695
Merge pull request #519 from pgier/dont-remove-preserve-unknown-fields
Revert "Remove field preserveUnknownFields from CRDs"
2020-05-11 16:16:22 +02:00
paulfantom
96ea25d5de
*: update jsonnet to use prometheus-operator v0.39 2020-05-11 11:59:46 +02:00
Frederic Branczyk
dab022fc62
Merge pull request #508 from johanneswuerbach/custom-metrics-b2
custom metrics v1beta2 api with k8s-prometheus-adapter v0.7.0
2020-05-07 10:12:42 +02:00
Paul Gier
4840cdcb66 Revert "Remove field preserveUnknownFields from CRDs"
This reverts commit cdaaf3d51c99a432726cd1ed20f5af01cbcd42d4.
2020-05-05 14:15:18 -05:00
Benjamin
7130905473 Update prometheus version to v2.17.2
Signed-off-by: Benjamin <benjamin@yunify.com>
2020-04-30 14:46:17 +08:00