Commit b94683d1 authored by ale's avatar ale

Merge branch 'crashloop-alert' into 'master'

prometheus: account for one-minute timers when detecting crash loops

See merge request !100
parents c356d4f9 fee9c019
Pipeline #6210 failed with stage
in 3 minutes and 9 seconds
......@@ -10,7 +10,7 @@ groups:
summary: '{{ $labels.name }} has failed on {{ $labels.host }}'
description: 'The systemd unit {{ $labels.name }} has failed on {{ $labels.host }}.'
- alert: SystemdUnitCrashLooping
expr: instance:systemd_unit_restarts:delta10m > 10
expr: instance:systemd_unit_restarts:delta10m > 12
for: 30m
labels:
severity: page
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment