dgrisonnet
e97eb0fbe9
[bot] [main] Automated version update
2021-08-02 13:37:08 +00:00
Maxime Brunet
b7fe018d29
eks: Revert back to awscni_total_ip_addresses-based alert
2021-07-31 11:37:12 -07:00
Paweł Krupa
b9c73c7b29
Merge pull request #1283 from prashbnair/node-veth
...
changing node exporter ignore list
2021-07-28 09:17:03 +02:00
Prashant Balachandran
09fdac739d
changing node exporter ignore list
2021-07-27 17:17:19 +05:30
Paweł Krupa
785789b776
Merge pull request #1257 from Luis-TT/kube-state-metrics-kubac-proxy-resources
2021-07-27 12:36:26 +02:00
lanmarti
ed48391831
Add resource requests and limits to prometheus-adapter container
2021-07-27 12:19:51 +02:00
Maxime Brunet
3a98a3478c
eks: Fix CNI metrics relabelings
...
Signed-off-by: Maxime Brunet <maxime.brunet@paytm.com>
2021-07-23 13:39:29 -07:00
Manuel Rüger
acd1eeba4c
node.libsonnet: Fix small typo
...
Signed-off-by: Manuel Rüger <manuel@rueg.eu>
2021-07-22 19:14:24 +02:00
paulfantom
cfe830f8f0
jsonnet/kube-prometheus: point to runbooks.prometheus-operator.dev
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
2021-07-22 17:30:57 +02:00
Luis Vidal Ernst
9c638162ae
Allow customizing of kubeRbacProxy in kube-state-metrics
2021-07-21 13:57:05 +02:00
Paweł Krupa
acea5efd85
Merge pull request #1268 from paulfantom/alerts-best-practices
...
Alerts best practices
2021-07-21 09:32:32 +02:00
Philip Gough
463ad065d3
jsonnet: Drop cAdvisor metrics with no (pod, namespace) labels while preserving ability to monitor system services resource usage
...
The following provides a description and cardinality estimation based on the tests in a local cluster:
container_blkio_device_usage_total - useful for containers, but not for system services (nodes*disks*services*operations*2)
container_fs_.* - add filesystem read/write data (nodes*disks*services*4)
container_file_descriptors - file descriptors limits and global numbers are exposed via (nodes*services)
container_threads_max - max number of threads in cgroup. Usually for system services it is not limited (nodes*services)
container_threads - used threads in cgroup. Usually not important for system services (nodes*services)
container_sockets - used sockets in cgroup. Usually not important for system services (nodes*services)
container_start_time_seconds - container start. Possibly not needed for system services (nodes*services)
container_last_seen - Not needed as system services are always running (nodes*services)
container_spec_.* - Everything related to cgroup specification and thus static data (nodes*services*5)
2021-07-20 12:50:02 +01:00
paulfantom
46eb1713a5
jsonnet: remove unused alert unit tests as those are moved to alertmanager repository
2021-07-20 11:14:38 +02:00
paulfantom
8c357c6bde
jsonnet: align alert annotations with best practices
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
2021-07-20 10:59:49 +02:00
paulfantom
1a3c610c61
[bot] Automated version update
2021-07-19 13:44:23 +00:00
Paweł Krupa
99ee030de3
Merge pull request #1259 from PaytmLabs/feature/eks/cni-relabel-instance
...
eks: Relabel instance with node name for CNI DaemonSet
2021-07-19 10:09:09 +02:00
Paweł Krupa
80bb15bedd
Merge pull request #1255 from yeya24/fix-dashboards-definition-length-check
2021-07-19 09:56:09 +02:00
Maxime Brunet
7394929c76
eks: Relabel instance with node name for CNI DaemonSet
2021-07-17 11:28:38 -07:00
Yury Gargay
9b08b941f8
Update kubernetes-mixin
...
From b710a868a9
2021-07-14 18:51:36 +02:00
ben.ye
43adca8df7
fmt again
...
Signed-off-by: ben.ye <ben.ye@bytedance.com>
2021-07-13 19:56:38 -07:00
ben.ye
90b2751f06
fmt code
...
Signed-off-by: ben.ye <ben.ye@bytedance.com>
2021-07-13 19:48:01 -07:00
ben.ye
dee7762ae3
create dashboardDefinitions if rawDashboards or folderDashboards are specified
...
Signed-off-by: ben.ye <ben.ye@bytedance.com>
2021-07-13 19:39:01 -07:00
Damien Grisonnet
97e77e9996
Merge pull request #1231 from dgrisonnet/fix-adapter-queries
...
Consolidate intervals used in prometheus-adapter CPU queries
2021-07-07 13:48:02 +02:00
Damien Grisonnet
b9563b9c2d
jsonnet: improve adapter queries readability
...
Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2021-07-05 15:29:45 +02:00
Damien Grisonnet
8812e45501
jsonnet: readjust prometheus-adapter intervals
...
Previously, prometheus-adapter configuration wasn't taking into account
the scrape interval of kubelet, node-exporter and windows-exporter
leading to getting non fresh results, and even negative results from the
CPU queries when the irate() function was extrapolating data.
To fix that, we want to set the interval used in the irate() function in
the CPU queries to 4x scrape interval in order to extrapolate data
between the last two scrapes. This will improve the freshness of the cpu
usage exposed and prevent incorrect extrapolations.
Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2021-07-05 15:28:25 +02:00
Sunil Thaha
0280f4ddf9
jsonnet: kube-prometheus adapt to changes to veth interfaces names
...
With OVN, the container veth network interface names that used to start
with `veth` has now changed to `<rand-hex>{15}@if<number>`(see Related
Links below).
This patch adapts to the new change introduced in ovn and ignores the network
interfaces that match `[a-z0-9]{15}@if\d+` in addition to those starting
with `veth`
Related Links:
- https://github.com/openshift/ovn-kubernetes/blob/master/go-controller/vendor/github.com/containernetworking/plugins/pkg/ip/link_linux.go#L107
- https://github.com/openshift/ovn-kubernetes/blob/master/go-controller/pkg/cni/helper_linux.go#L148
Signed-off-by: Sunil Thaha <sthaha@redhat.com>
2021-07-01 12:01:19 +10:00
Damien Grisonnet
2c5c20cfff
Merge pull request #1216 from fpetkovski/prometheus-adapter-cipher-suites
...
jsonnet: disable insecure cypher suites for prometheus-adapter
2021-06-23 21:19:24 +02:00
paulfantom
d0e21f34e5
[bot] Automated version update
2021-06-23 13:41:46 +00:00
fpetkovski
86b1207e1b
jsonnet: update alertmanager branch to main
...
Alertmanager changed its default branch to main.
This commit updates the alertmanager branch to track the new default.
Signed-off-by: fpetkovski <filip.petkovsky@gmail.com>
2021-06-23 14:25:49 +02:00
fpetkovski
0ff173efea
jsonnet: disable insecure cypher suites for prometheus-adapter
...
Running sslscan against the prometheus adapter secure port reports two
insecure SSL ciphers, ECDHE-RSA-DES-CBC3-SHA and DES-CBC3-SHA.
This commit removes those ciphers from the list.
Signed-off-by: fpetkovski <filip.petkovsky@gmail.com>
2021-06-22 14:17:09 +02:00
paulfantom
ffea8f498e
[bot] Automated version update
2021-06-18 13:50:44 +00:00
Matthias Loibl
4ea366eef7
Fix name for blackbox-exporter PodSecurityPolicy
2021-06-16 12:55:51 +02:00
paulfantom
d6201759b8
[bot] Automated version update
2021-06-14 13:50:57 +00:00
Adin Hodovic
88034c4c41
jsonnet/platforms: Import managed-cluster addon for the EKS platform
2021-06-14 01:07:18 +02:00
paulfantom
54f79428ce
[bot] Automated version update
2021-06-11 13:51:10 +00:00
Paweł Krupa
df197f6759
Merge pull request #1192 from prometheus-operator/automated-updates
2021-06-11 15:47:41 +02:00
paulfantom
5ea10d80a1
jsonnet: fix label selector for coredns ServiceMonitor
2021-06-11 10:56:54 +02:00
paulfantom
a2cf1acd95
[bot] Automated version update
2021-06-10 13:59:30 +00:00
ArthurSens
f643955034
Update alertmanager mixin
...
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-06-08 18:19:23 +00:00
paulfantom
feee269fdb
jsonnet: improve all-namespaces addon
...
Signed-off-by: paulfantom <pawel@krupa.net.pl>
2021-06-01 09:05:07 +02:00
Paweł Krupa
6d603cf7a9
Merge pull request #1142 from faruryo/fix/kubespray-alert
...
Fix scheduler and controller selectors for Kubespray
2021-05-31 23:14:02 +02:00
Paweł Krupa
dccf2ee085
Merge pull request #1135 from paulfantom/use-common
2021-05-31 23:12:53 +02:00
Paweł Krupa
93cc34f0f6
Merge pull request #1171 from anarcher/pr/grafana-env-1167
...
feat(grafana): add env parameter for grafana component
2021-05-31 23:11:34 +02:00
Ajit
d57542eae1
Fix for bug #1163 ( #1164 )
2021-05-31 23:08:59 +02:00
anarcher
8bcfb98a1d
feat(grafana): add env parameter for gradana component
2021-05-31 18:52:55 +09:00
Paweł Krupa
b279e38809
Merge pull request #1129 from onprem/feature-flags
2021-05-31 10:56:39 +02:00
paulportela
f7baf1599d
Fix adding private repository
...
`imageRepos` field was removed and the project no longer tries to compose image strings. Now the libraries use `$.values.common.images` to override default images.
2021-05-28 17:22:27 -07:00
Prem Saraswat
228f8ffdad
Add support for feature-flags in Prometheus
2021-05-27 23:21:30 +05:30
paulfantom
039d4a1e48
jsonnet: sort list of dropped metrics
2021-05-27 10:49:36 +02:00
paulfantom
2873857dc7
jsonnet: convert string of deprecated metrics into array
2021-05-27 10:46:58 +02:00