Skip to content

Commit

Permalink
feat(monitoring): refine unhealthy ring alert
Browse files Browse the repository at this point in the history
  • Loading branch information
Umaaz committed Oct 19, 2023
1 parent c9fdf69 commit 0687855
Show file tree
Hide file tree
Showing 7 changed files with 21 additions and 6 deletions.
3 changes: 3 additions & 0 deletions .idea/misc.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

2 changes: 1 addition & 1 deletion charts/deep-distributed/Chart.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ type: application
# This is the chart version. This version number should be incremented each time you make changes
# to the chart and its templates, including the app version.
# Versions are expected to follow Semantic Versioning (https://semver.org/)
version: 0.0.21
version: 0.0.22

# This is the version number of the application being deployed. This version number should be
# incremented each time you make changes to the application. Versions are not expected to
Expand Down
2 changes: 1 addition & 1 deletion charts/deep-distributed/templates/prometheusrule.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,6 @@ spec:
- name: deep.rules
rules:
{{- with .Values.monitoring.rules.rules }}
{{- tpl . $ | nindent 8 -}}
{{- toYaml . | nindent 8 -}}
{{- end}}
{{- end -}}
8 changes: 7 additions & 1 deletion charts/deep-distributed/values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -1017,10 +1017,16 @@ monitoring:
# The namespace to deploy the rules in
namespace: null
# The rules to add
rules: |
rules:
- alert: Unhealthy Ring Member
expr: |
deep_ring_members{state="Unhealthy"} > 0
labels:
severity: critical
annotations:
description: "{{ $labels.service }} has {{ $value }} unhealthy nodes in ring {{ $labels.name }}."
runbook_url: https://intergral.github.io/deep/runbooks/unhealthy_ring_node/
summary: One or more ring members are unhealthy


# Configuration for the gateway
Expand Down
2 changes: 1 addition & 1 deletion charts/deep/Chart.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ type: application
# This is the chart version. This version number should be incremented each time you make changes
# to the chart and its templates, including the app version.
# Versions are expected to follow Semantic Versioning (https://semver.org/)
version: 1.0.10
version: 1.0.11

# This is the version number of the application being deployed. This version number should be
# incremented each time you make changes to the application. Versions are not expected to
Expand Down
2 changes: 1 addition & 1 deletion charts/deep/templates/prometheusrule.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,6 @@ spec:
- name: deep.rules
rules:
{{- with .Values.monitoring.rules.rules }}
{{- tpl . $ | nindent 8 -}}
{{- toYaml . | nindent 8 -}}
{{- end}}
{{- end -}}
8 changes: 7 additions & 1 deletion charts/deep/values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -462,7 +462,13 @@ monitoring:
# The namespace to deploy the rules in
namespace: null
# The rules to add
rules: |
rules:
- alert: Unhealthy Ring Member
expr: |
deep_ring_members{state="Unhealthy"} > 0
labels:
severity: critical

Check failure on line 470 in charts/deep/values.yaml

View workflow job for this annotation

GitHub Actions / lint-test

470:29 [trailing-spaces] trailing spaces
annotations:
description: "{{ $labels.service }} has {{ $value }} unhealthy nodes in ring {{ $labels.name }}."
runbook_url: https://intergral.github.io/deep/runbooks/unhealthy_ring_node/
summary: One or more ring members are unhealthy

0 comments on commit 0687855

Please sign in to comment.