1. stack 开启 tls:

Background:
下载 stack chart 包,修改 values.yaml,执行 helm install 安装,没有问题。但再次修改后执行 helm upgrade,报错:

  1. client.go:205: [debug] error updating the resource "prometheus-kube-prometheus-alertmanager.rules":
  2. cannot patch "prometheus-kube-prometheus-alertmanager.rules" with kind PrometheusRule: Timeout: request did not complete within requested timeout 34s
  3. ...

Solution:
https://github.com/prometheus-community/helm-charts/issues/142

If you intend to deploy the stack behind a reverse proxy (with tls termination) and operate only within the Kubernetes cluster, then you should not have any tls enabled within the default values, but unfortunately for us (and some of you) the default values file has this value set -> prometheusOperator.tls.enabled=true

执行:

  1. 将 admissionWebhooks 设置为 false;
  2. 删除已有的 MutatingWebhookConfiguration , ValidatingWebhookConfiguration

2. alertmanager.retention

描述:

  1. ## Time duration Alertmanager shall retain data for. Default is '120h', and must match the regular expression
  2. ## [0-9]+(ms|s|m|h) (milliseconds seconds minutes hours).
  3. ##
  4. retention: 24h

解释:

3. 标记不同 prometheus server

4. thanos-query 启动参数配置 prome-server svc

描述:
待测试。。。