An SRE team wants to define an SLI for service availability. Which metric correctly represents the availability SLI?
Trap 1: Total requests that complete within 200 ms / Total requests
This is a latency SLI, not availability.
Trap 2: Total minutes the service is up / Total minutes in the window
This is a different measure (uptime), not request-based availability.
Trap 3: Number of requests that return a 5xx status code
This is an error count, not an availability ratio.
- A
Total requests that succeed / Total requests
Correct definition of availability SLI.
- B
Total requests that complete within 200 ms / Total requests
Why wrong: This is a latency SLI, not availability.
- C
Total minutes the service is up / Total minutes in the window
Why wrong: This is a different measure (uptime), not request-based availability.
- D
Number of requests that return a 5xx status code
Why wrong: This is an error count, not an availability ratio.