Monitor Service Level Objectives (SLOs)

The Burn Alerts feature provides notifications related to your SLO budget, which represents the maximum allocation of failures for your service. Burn Alerts notify you when issues impact your SLO budget, letting you react to the incidents that matter most to you, as defined in your SLO.

Use cases for Burn Alerts include, but are not limited to:

  • Proactive awareness when you are about to miss customer expectations (SLA) (example scenario)
  • Early detection of issues on your services as soon as they occur (example scenario)
  • Maintaining service quality and preventing service disruptions
  • Continuous improvement on your services
  • Informing strategic and tactical decisions, such as resource allocation, investment in infrastructure, and service development

Burn Alert Types 

When creating a Burn Alert, choose from the following Burn Alert types:

Exhaustion Time

  • Description: Notifies when your SLO is at risk of burning through its error budget within a specified number of hours. This allows for proactive steps before the SLO budget reaches zero.
  • Parameters: Exhaustion Time (hours)
  • Example Alert: Alert me when I am about to run out of budget in 24 hours.
  • Signal of the Alert: Alert when you are x hours away from violating your SLO.

Budget Rate

  • Description: Notifies when the SLO budget drops by a minimum specified percentage within a defined time window. This allows for the detection of budget burn issues and unexpected spikes in a timely manner.
  • Parameters: Time Window (hours), Budget Decrease (%)
  • Example Alert: Alert me when the SLO budget decreases by 10% in the last 2 hours.
  • Signal of the Alert: Alert when the SLO budget starts to rapidly burn or inconsistently burn.

Adding a Burn Alert 

To add a Burn Alert:

  1. In the SLO List view, select the SLO’s name to view its details.
  2. Select Configure Burn Alerts in the upper right corner, which will display the SLO’s existing Burn Alerts (if any) in list view.
  3. Select New Burn Alert in the upper right corner and a Create Burn Alert form appears.
  4. Select which Alert Type to configure: Exhaustion Time or Budget Rate. The display changes based on the chosen Alert Type.

For an Exhaustion Time burn alert:

SLO Create Burn Alert Exhaustion Time Form.
  • Description adds context, such as runbook links or alert summaries for the Burn Alert. When utilized, the Burn Alert description appears in the notification instead of the SLO description.

  • Notify configures the notification option for the Burn Alert.

  • Exhaustion Time (hours) sets how much time (in hours) must remain before your projected SLO budget hits zero for you to be notified.

    Tip
    While it is possible to express time periods of less than an hour (for example, 0.25 corresponds to 15 minutes), that is usually not enough time to act on the alert. Conversely, periods of more than a few days rarely merit notification, because they effectively act the same as the current SLO's time period.

Learn about Best Practices for Exhaustion Time burn alerts.

For a Budget Rate burn alert:

SLO Create Budget Rate Burn Alert Form.
  • Description adds context, such as runbook links or alert summaries for the Burn Alert. When utilized, the Burn Alert description appears in the notification instead of the SLO description.
  • Notify configures the notification option for the burn alert.
  • Time Window (hours) is the range of time over which the budget rate is determined. Minimum value is 1. Maximum value is the length of your SLO.
  • Budget Decrease (%) is the drop in budget percentage to be notified on. Minimum value is 0.0001. Maximum value is 100.

A Budget Burndown graph also appears, which projects potential alert frequencies based on input values.

Learn about Best Practices for Budget Rate burn alerts.

Notify Options 

Notify by email appears as the default notification method, which requires entering one or more email addresses. Enter multiple emails separated by commas.

Additional integration options, like Slack and PagerDuty, are populated from SLO and Trigger Recipients, as found under Team Settings > Integrations. Once configured, these additional options can be selected.

For example, a Budget Rate burn alert in Slack appears similar to:

SLO Burn Alert fired

Budget Burndown Graph 

When creating a Budget Rate burn alert, the Budget Burndown graph appears. Use the Budget Burndown graph to determine the Time Window and Budget Decrease values that work best.

The graph shows the Budget Burndown over the SLO’s time period. Change the values for Time Window and/or Budget Decrease to see different graph projections. The dashed line markers appear on the graph to represent when alerts would have been sent. The light orange range represents how long an alert would remain activated.

In the example below, the SLO’s time period is 7 days. A 4-hour Time Window and a 5% Budget Decrease would cause alerts to occur 6 times. Hovering over the marker for the second alert reveals its estimated notification date of 6:57am on October 27. You may decide that a 5% decrease alerts too often and that amount of burn over the 4-hour window is not serious enough to alert the team.

How Honeycomb evaluates Budget Rate burn alerts

With further experimentation, you may find that a 2-hour Time Window and an 8% Budget Decrease is perfect for your team. Entering these values shows a graph with the next estimated alert notification(s) based on these values.

How Honeycomb evaluates Budget Rate burn alerts
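
To make this kind of what-if exploration concrete, here is a minimal Python sketch of the style of replay the Budget Burndown graph projects. It is an illustration only, not Honeycomb's implementation; the function, the hourly sampling, and the budget values are all hypothetical.

```python
# Illustrative sketch (not Honeycomb's actual implementation) of the kind of
# replay the Budget Burndown graph projects: given budget samples, how often
# would a Budget Rate alert with this Time Window and Budget Decrease have
# activated? The hourly samples and values below are hypothetical.

def count_alert_activations(hourly_budget: list[float],
                            window_hours: int,
                            budget_decrease_pct: float) -> int:
    activations = 0
    currently_firing = False
    for i in range(window_hours, len(hourly_budget)):
        drop = hourly_budget[i - window_hours] - hourly_budget[i]
        firing = drop >= budget_decrease_pct
        if firing and not currently_firing:
            activations += 1             # a new alert activation
        currently_firing = firing        # resolves once the burn slows down
    return activations

budget = [                               # % of SLO budget remaining, hourly
    100, 99.5, 99, 98.5, 98, 97.5, 97, 96.5,   # slow, steady burn
    95, 93.5, 92, 90.5, 89,                    # a moderate burn
    88.5, 88, 87.5, 87,                        # back to a slow burn
    82, 77, 72,                                # a fast burn
    71.5, 71, 70.5, 70,                        # slow burn again
]

print(count_alert_activations(budget, window_hours=4, budget_decrease_pct=5.0))  # 2
print(count_alert_activations(budget, window_hours=2, budget_decrease_pct=8.0))  # 1
```

In this hypothetical series, the looser settings (4 hours, 5%) flag both the moderate burn and the fast burn, while the tighter settings (2 hours, 8%) flag only the fast one.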

Testing Burn Alert Notifications 

After creation, Burn Alert notification testing becomes available. Use this feature to test if Burn Alert notifications appear as expected before an alert situation occurs.

To test your Burn Alert notifications:

  1. Navigate to the individual SLO’s detailed view associated with the Burn Alert.
  2. Select Configure Burn Alerts in the upper right corner to display a list of existing Burn Alerts for the SLO.
  3. Select Test for the target Burn Alert. The test sends both a TRIGGERED and RESOLVED message via the configured notification option(s). Test messages are prefixed with BURN ALERT TEST.
SLO Burn Alert test

Viewing Burn Alerts 

Burn Alerts can be viewed in several locations:

  • When configured, the summary chart displays Burn Alerts in an SLO’s detailed view.
  • If activated, an SLO’s shortest Time Window for a Burn Alert also appears on an SLO list’s status column.
  • If any exist, select Configure Burn Alerts in the upper right corner of an SLO’s detailed view to display a list of existing Burn Alerts for the SLO.

How Burn Alerts Work 

Exhaustion Time Burn Alerts 

Honeycomb computes whether an Exhaustion Time burn alert may occur by extrapolating the current rate of budget burn. If, at that rate, the budget would reach zero percent (0%) within the number of hours specified in the alert, then Honeycomb sends a notification.

Honeycomb determines the extrapolation window by dividing the alert’s Exhaustion Time by 4. Honeycomb looks at the past data in the extrapolation window, and then extrapolates what may happen in the future over the specified number of Exhaustion Time hours.

An Exhaustion Time burn alert stays activated until the SLO budget is no longer projected to exhaust within the defined Exhaustion Time. (Honeycomb also applies a small buffer period to avoid fluctuating notification events.) Once resolved, Honeycomb sends a notification.

The example below shows how a 4-hour Exhaustion Time alert works. Honeycomb looks at the last hour of data, which is the extrapolation window, and then extrapolates what may happen in the next four hours, which is the Exhaustion Time value. Based on this data, the four-hour estimate dips below zero, so the system warns the user.

How Honeycomb extrapolates for exhaustion time alerts
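
The following is a minimal Python sketch of the extrapolation described above. It illustrates the logic only and is not Honeycomb's actual implementation; the function name and inputs are hypothetical.

```python
# Illustrative sketch (not Honeycomb's actual implementation) of the
# Exhaustion Time evaluation described above. Function name and inputs
# are hypothetical.

def exhaustion_time_alert(budget_then: float, budget_now: float,
                          exhaustion_hours: float) -> bool:
    """Fire when the budget is projected to hit 0% within `exhaustion_hours`,
    based on the burn observed over the extrapolation window
    (Exhaustion Time / 4)."""
    window_hours = exhaustion_hours / 4           # extrapolation window
    burned = budget_then - budget_now             # budget burned in that window
    if burned <= 0:
        return False                              # flat or recovering: no alert
    burn_rate = burned / window_hours             # % of budget burned per hour
    hours_until_zero = budget_now / burn_rate     # projected time to 0%
    return hours_until_zero <= exhaustion_hours

# A 4-hour alert looks at the last hour (4 / 4 = 1). If that hour burned 3%
# and only 10% of budget remains, zero is about 3.3 hours away, so it fires.
print(exhaustion_time_alert(budget_then=13.0, budget_now=10.0,
                            exhaustion_hours=4))   # True -> notify
```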

Budget Rate Burn Alerts 

Honeycomb computes whether a Budget Rate burn alert may occur by evaluating historical events in a given time window. A Budget Rate is determined by a drop in budget percentage over a time window. If budget decreases, at minimum, by the configured Budget Decrease value, then Honeycomb sends a notification.

This alert resolves when the consumed budget within the time window is less than the specified Budget Decrease value in the Budget Rate burn alert. (Honeycomb also applies a small buffer to avoid fluctuating notification events.) Once resolved, Honeycomb sends a notification.

The example below shows how Honeycomb evaluates a Budget Rate burn alert with a 4-hour Time Window and 30% Budget Decrease value. The shaded section shows the last four hours for this SLO. Within this range, Honeycomb evaluates the Budget at the start and end of this time window. In this example, the Budget starts at 78% and ends at 38%, a 40% overall decrease. Therefore, Honeycomb sends a notification because the Budget Decrease value is 30% and the SLO experienced a 40% overall decrease.

How Honeycomb evaluates Budget Rate alerts
Note

Honeycomb aims to evaluate SLO Burn Alerts every minute. If you configure a Budget Rate burn alert for a 10% Budget Decrease, then an alert notification occurs when the latest evaluation is greater than 10%. Whether evaluated as a 12% or 10.1% decrease, an alert occurs.

If being alerted for a decrease only 0.1% over the Budget Decrease value is too sensitive, increase the Budget Decrease value.
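
As a minimal sketch (not Honeycomb's actual implementation), the Budget Rate check described above reduces to comparing the budget at the start of the Time Window with the budget now; the function and inputs below are hypothetical.

```python
# Illustrative sketch (not Honeycomb's actual implementation) of the Budget
# Rate check described above: compare the budget at the start of the Time
# Window with the budget now. Function name and inputs are hypothetical.

def budget_rate_alert(budget_at_window_start: float, budget_now: float,
                      budget_decrease_pct: float) -> bool:
    """Fire when the budget dropped by at least the configured Budget
    Decrease over the Time Window (evaluated roughly every minute)."""
    decrease = budget_at_window_start - budget_now
    return decrease >= budget_decrease_pct

# The worked example above: 78% at the start of the 4-hour window, 38% now,
# against a 30% Budget Decrease value.
print(budget_rate_alert(78.0, 38.0, 30.0))    # True: 40% drop >= 30%

# The Note above: with a 10% value, a 10.1% drop alerts, as does a 12% drop.
print(budget_rate_alert(50.0, 39.9, 10.0))    # True: ~10.1% drop
```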

Best Practices 

We recommend that you follow certain best practices when creating alerts. Some of these are general guidelines, and some are specific to alert type.

General Guidelines 

Regardless of the alert type:

  • Iterate when creating Burn Alerts. Start by sending alerts to an internal recipient (either a team member’s email address or a private Slack channel) to monitor the frequency of alerts in your system. Use these Burn Alerts as a first step toward understanding how your service performs and what kinds of alerts are actionable and important to your team, and then iterate.
  • Start with the shape of the signal that you care about:
    • For slow SLO burn, you care about issues that occur over a prolonged time period.
    • For fast SLO burn, you care about significant spikes over a shorter time period.
  • Use alerts to refine any new SLOs that you create. For new SLOs, start with a Budget Rate alert, which will notify you when system conditions impact your budget, to learn:
    • If you are missing any criteria in your SLI.
    • If you can historically sustain your SLO.

Exhaustion Time Burn Alerts 

When choosing the length of time for a given Exhaustion Time burn alert, consider the context and goals of your organization. Ask questions to help frame the definition of some initial Exhaustion Time burn alerts. If you are X hours away from running out of budget:

  • Who would need to know?
  • Via what method?
  • What would they need to do?

For example, a 24-hour exhaustion time alert can be useful if service quality is slowly degrading and a Slack-based notification allows the team to remediate the issue before the budget reaches zero (0). Alternatively, a 4-hour exhaustion time alert may be more urgent and require a pager notification, such as from PagerDuty.

Tip
We recommend creating at least one Exhaustion Time burn alert where the Exhaustion Time is 0. This will notify you when your SLO budget is completely exhausted.

Budget Rate Burn Alerts 

When starting with a Budget Rate burn alert, consider whether you seek an alert for a smooth, slow burn or a fast, abrupt drop. Start with a less-sensitive alert and adjust as needed. Depending on the length of your SLO’s time period, try these values when creating Budget Rate burn alerts.

30 Day SLO Example 

Use the following example to create a series of Budget Rate burn alerts for your SLO. Each row represents an alert and its values.

Budget Decrease (%) Time Window Notification Type
2% 1 hour PagerDuty
5% 6 hours PagerDuty
10% 3 days Slack

7 Day SLO Example 

Use the following example to create a series of Budget Rate burn alerts for your SLO. Each row represents an alert and its values.

Budget Decrease (%) Time Window Notification Type
8.5% 1 hour PagerDuty
21.5% 6 hours PagerDuty
43.2% 3 days Slack
50% 3.5 days Slack
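
One way to reason about these example values is to ask how long the full budget would last if it kept burning at the rate each alert detects. The sketch below is an illustration only; this framing and function are not part of Honeycomb's product.

```python
# A sanity check on the ladders above: if budget kept burning at the rate
# each alert detects, how long would the full budget last? (This framing is
# an illustration, not part of Honeycomb's documentation or product.)

def hours_to_full_exhaustion(budget_decrease_pct: float,
                             window_hours: float) -> float:
    """Hours until 100% of the budget burns at the detected rate."""
    return window_hours * 100.0 / budget_decrease_pct

# 30-day SLO ladder
for pct, window in [(2, 1), (5, 6), (10, 72)]:
    print(f"{pct}% over {window}h -> budget gone in "
          f"{hours_to_full_exhaustion(pct, window):.0f}h")
# 2% over 1h   -> budget gone in 50h   (~2 days: page someone)
# 5% over 6h   -> budget gone in 120h  (~5 days: page someone)
# 10% over 72h -> budget gone in 720h  (30 days: Slack is enough)

# 7-day SLO ladder: 8.5% over 1 hour -> ~12h, 21.5% over 6 hours -> ~28h,
# 43.2% over 3 days -> ~7 days, 50% over 3.5 days -> 7 days.
```

At that pace, the paging alerts in the 30-day ladder correspond to a budget that would be gone within roughly two to five days, while the Slack-only alert corresponds to burning at about the SLO's own rate.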

Use the Time Window to Determine the Notification Method 

A long Time Window, such as 24 hours, is useful in detecting long, slow burns that use up your SLO budget faster than expected, but not fast enough to wake someone out of bed. A short Time Window, such as one hour, is useful in detecting very fast SLO budget burns that need to be addressed quickly.

Use the time window to determine the alert method. For example:

  • For a long, slow SLO budget decrease, send a Slack message, so the issue can be addressed during business hours.
  • For a short, fast SLO budget decrease, send a critical PagerDuty notification, so it can be acted on immediately.
Note

Although it may be counterintuitive, a Budget Rate alert with a long time window will also activate on a short, fast burn.

For example, suppose you have two Budget Rate burn alerts with these parameters:

  • Notify when the Budget Decrease exceeds 25% over a 24 hour period
  • Notify when the Budget Decrease exceeds 25% over a 1 hour period

If your environment encounters a large spike of errors and burns 25% of your SLO budget in the last hour, both the 24-hour Budget Rate burn alert and the 1-hour Budget Rate burn alert will fire, because both windows include the last hour in their calculation.
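
To make this concrete, here is a tiny sketch with hypothetical budget values showing why both window sizes see the same fast burn; it is an illustration, not Honeycomb's evaluation code.

```python
# Illustration with hypothetical budget values: a fast burn in the last hour
# is also contained in the last 24 hours, so both alerts see it.

def budget_rate_fires(budget_at_window_start: float, budget_now: float,
                      threshold_pct: float) -> bool:
    return (budget_at_window_start - budget_now) > threshold_pct

budget_24h_ago = 60.0   # % of budget remaining 24 hours ago
budget_1h_ago = 58.0    # only 2% burned between 24h ago and 1h ago
budget_now = 32.0       # an error spike burned 26% in the last hour

print(budget_rate_fires(budget_1h_ago, budget_now, 25))    # True: 1-hour alert fires
print(budget_rate_fires(budget_24h_ago, budget_now, 25))   # True: 24-hour alert fires
```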

You might ask: since both Burn Alerts activated, why do you need both? You need both if you want to control where each Burn Alert sends its notification.

Use Budget Decrease to Control Alert Frequency 

To control the frequency of your Burn Alert and calibrate its sensitivity:

  • Increase the Budget Decrease value if the alert is too noisy.
  • Decrease the Budget Decrease value if the alert is too quiet.

Example Uses for Burn Alerts 

You can use Burn Alerts in a variety of ways. Some examples include:

Track time until your SLO budget reaches 0% (Exhaustion Time burn alert)

Audience: Teams that use SLOs to decide what to prioritize and how to allocate resources in their organization.

Example scenario: Your SLO says that 99% of web requests should complete in less than 250 ms. Any request that takes longer than 250 ms is a failure and burns some of your SLO budget. You need to know how long you have until your SLO budget is exhausted if failures continue at the current rate.

Solution: Set up two Exhaustion Time burn alerts, one for each alert signal:

  • 24-hour alert: Alerts you when you are 24 hours away from exhausting your budget. Because you can deal with this burn during normal business hours, you set the alert to notify staff through Slack.
  • 4-hour alert: Notifies you when you are four hours away from exhausting your budget. Because you need to deal with a burn this fast immediately, even in the middle of the night, you set the alert to notify staff through PagerDuty.
Identify issues affecting your SLOs before your budget drains (Budget Rate burn alert)

Audience: Teams that want more granularity when identifying significant issues that impact their SLO budget. The team wants to know when unexpected spikes are occurring, even if issues are not pageable events, so they can investigate later.

Tip
When you identify issues earlier, you can proactively learn about unknowns affecting your service and find issues that influence your SLI calibration.

Example scenario: Your SLO says that 99% of web requests should complete in less than 250 ms. Any request that takes longer than 250 ms is a failure and burns some of your SLO budget. You review your SLO and notice that you burned through over 15% of your budget in half a day:

A SLO with a drop in budget

You investigate and determine that the issues that caused the budget burn are worth being notified about.

Solution: Create a Budget Rate burn alert to notify you when your budget decreases by 10% within a 6-hour time window. Because you can deal with this burn during normal business hours, you set the alert to notify staff through Slack, but because you may want to investigate the event later, you also create a ticket.

Identify inconsistent burn rates affecting your SLOs (Budget Rate burn alert)

Audience: Teams that have relatively stable services that burn budget at a consistent rate, so a sudden increase in burn rate would indicate an issue worth investigating.

Example scenario: Your SLO says that 99% of web requests should complete in less than 250 ms. Any request that takes longer than 250 ms is a failure and burns some of your SLO budget. Your services burn at a consistent rate:

A SLO with a consistent burn rate

You decide that you want to know about any changes to this consistent, steady burn rate, so you can investigate.

Solution: Create a Budget Rate burn alert to notify you when the SLO does not burn as expected. Because you can deal with this burn during normal business hours, you set the alert to notify staff through Slack, but because you may want to investigate the event later, you also create a ticket.

Track continuing or recurring budget burns (Budget Rate burn alert)

Audience: Teams that want to be sure that issues exhausting their SLO budget are resolved after receiving an Exhaustion Time burn alert. Because Exhaustion Time burn alerts will not alert again until after they resolve, a team may want to track whether a budget burn remains or reoccurs.

Example scenario: You receive an Exhaustion Time burn alert and discover an outage, which you solve. You want to make sure that your solution addressed the actual cause and that you resolved the problem.

Solution: Create a Budget Rate burn alert to use operationally alongside the Exhaustion Time burn alert; it will notify you if your SLO continues to burn at a high rate. Because you need to deal with any continual or recurring burn immediately, you set the alert to notify staff through PagerDuty.

Troubleshooting 

Because an incident can dramatically deplete your SLO budget, an Exhaustion Time burn alert may take a long time to resolve even after addressing the incident and deploying a fix. It takes time for this data to age out and recovery to occur.

This means that an Exhaustion Time burn alert will remain in a fired state after triggering. If your budget stabilizes, and then starts burning again without ever going back above zero percent (0%), you will not be alerted a second time. If this scenario is not ideal, you can do one of the following: