SLO Detail View


Note
This feature is available as part of the Honeycomb Enterprise and Pro plans.

Each SLO has a detailed view that shows information based on its parameters.

At the top, the SLO detailed view shows the SLO’s:

  • Name
  • Creator user name
  • Creation date
  • Description
  • Summary chart of existing Burn Alerts with set exhaustion time and status

Select Configure Burn Alerts to view existing Burn Alerts and create new Burn Alerts.

Also, an SLO detailed view displays information in the following areas:

  • Budget Burndown - shows the cumulative error against the set SLO budget.
  • Burn Rate - gives the ratio of actual failures in the SLO to expected failures in the given time window.
  • Historical SLO Compliance - shows often the SLI has succeeded for each day over the preceding SLO Time Period.
  • Heatmap - shows events that succeed the SLI (in blue-green) and that fail (in yellow) on a heatmap of duration.
  • BubbleUp - shows the dimensions where the events that pass the SLI (in blue) and those that fail it (yellow) are most different.

Each SLO has a detailed view that shows information based on its parameters.

At the top, the SLO detailed view shows the SLO’s:

  • Name
  • Creator user name
  • Creation date
  • Description
  • Summary chart of existing Burn Alerts with set exhaustion time and status

Also, an SLO detailed view displays information in the following areas:

  • Budget Burndown - shows the cumulative error against the set SLO budget.
  • Burn Rate - gives the ratio of actual failures in the SLO to expected failures in the given time window.
  • Historical SLO Compliance - shows often the SLI has succeeded for each day over the preceding SLO Time Period.
  • Heatmap - shows events that succeed the SLI (in blue-green) and that fail (in yellow) on a heatmap of duration.
  • Detect Anomalies (BubbleUp) - shows the dimensions where the events that pass the SLI (in blue) and those that fail it (yellow) are most different.
SLO summary view.

Budget Burndown 

Budget Burndown shows the cumulative error against the SLO budget and projects how much budget remains over a given period of time. The Budget Burndown starts at 100% and decreases as the budget reduces.

This value is computed as a rolling window, so every moment is based on the preceding time.

To receive alerts on this measurement under certain conditions, use Burn Alerts.

Burn Rate 

Burn Rate gives the ratio of actual failures in the SLO to expected failures in the given time window. The burn rate, located below the Budget Burndown chart, tells the current burn rate in the last X hour(s). The time window defaults to 4 hours, but you can input a different value to recalculate the current burn rate for your situation.

SLO Historical Burn Rate

Select the caret to expand the display and view a chart for Historical Burn Rate. Given the input of X hours in the time window, the Historical Burn Rate chart displays the burn rate over X hours for the SLO.

Tip

Use the current Burn Rate to understand the severity of recent errors in the SLO.

Use the Historical Burn Rate chart to understand how consistently the SLO burns and if specific points of time impacted the SLO budget.

It is expected and acceptable for an SLO to sometimes burn at a higher rate than budgeted, and to sometimes burn at a lower rate than budgeted. Over time, you will know if you meet your specific SLO budget; the burn rate, however, informs on your SLO budget’s recent behavior.

Interpreting a Burn Rate 

A burn rate of 1.0 means that the SLO budget is reducing as expected and as budgeted, given the parameters of the SLO. This means the SLO experienced the expected number of failed events.

A burn rate of less than 1.0 means that less failures occurred than what was budgeted in the SLO. If this burn rate is consistently below 1.0, this could be a signal of an SLO that you should revisit. Is the SLO underambitious? Is the SLO capturing the signal you care about?

A burn rate of greater than 1.0 means that more failures than expected occurred, given the budget of the SLO. The higher the multiplier, the more likely the SLO is impacted by the high count of failures. This may indicate that something is clearly wrong with your service and that you need to investigate.

For example, for an SLO with a 30 day period, the following multipliers correlate to the amount of time until the SLO budget reaches zero:

  • 1x = 30 days
  • 2x = 15 days
  • 4x = 7.5 days
  • 8x = 3.25 days
  • 16x = 39 hours
  • 32x = 19 hours and 30 minutes
  • 64x = 9 hours and 45 minutes
  • 128x = almost 5 hours

To receive alerts on this measurement under certain conditions, use Burn Alerts.

Historical SLO Compliance 

Historical SLO Compliance shows how often the SLI has succeeded for each day over the preceding SLO Time Period, such as 14 days.

Heatmap 

This Heatmap display shows events that succeed the SLI (in blue-green) and that fail (in yellow) on a heatmap of duration. The time axis runs over a much shorter period than the full SLO period, which allows you to focus on individual events and times that set off the SLO.

You can modify the heatmap using the following controls:

  1. Selected Column
  2. Time Range
  3. Linear or Log Scale

Selected Column controls what field is visualized in the heatmap. Modify which field by using the “Distribution of Events failing SLI by” dropdown above the heatmap. In the example above, duration_ms is the selected column and displays duration_ms over time and how frequently each duration - time period combination occurs. In many cases, you do not need to change the selected column, or field. If your SLI consists of multiple fields like duration, status codes, and other measures, you may want to explore seeing successful and failed events across different fields.

Time Range specifies the start and end time for data in your heatmap and BubbleUp display. Modify the time range using the time picker in the upper right corner above the heatmap. This time range appears as an orange range within the Budget Burndown and Historical Compliance charts. Adjust the time range if you seek more information on your SLO, such as a sudden drop in SLO Budget.

View the heatmap in log scale or linear scale. By default, SLO heatmaps display in log scale. Log scale displays a heatmap with a LOG() transformation applied to your data, which helps visualize duration times and large outliers. Linear scale displays a heatmap with a linear axis, which helps with less extreme outliers such as status codes. Select the gear icon above the heatmap to change this setting. This setting is locally saved to the browser and does not save to the SLO.

SLO Heatmap Timerange on a drop in budget.

BubbleUp 

The BubbleUp series at the bottom of the screen shows the dimensions where the events that pass the SLI (in blue) and those that fail it (yellow) are most different. This information can provide insight into the causes for current burndown activity. The BubbleUp looks at the same time period as the heatmap.

Reset Your Remaining SLO Budget 

Burn Alerts will only activate if a SLO budget remains above zero. If the error budget was depleted due to some issue and then you fixed the problem, it is worth resetting your error budget so Burn Alerts will start working again.

Warning

Resetting your budget will erase all errors that have happened in the current SLO time period, up to and including the current hour. For example, when resetting an SLO with a 30 day period, the SLO budget will be back to 100% for that 30 days.

This reset will affect both your Budget Burndown and Historical SLO Compliance graphs on the SLO detailed view, as well as the Current Percentage displayed in the SLO lists.

SLO Reset in charts

In the image above, the reset is noted by a gray dashed line on the SLO Budget Burndown and SLO Historical Compliance charts.

  • The SLO Budget Burndown chart starts at 100% compliance to the left of the dashed line because all errors before that reset and after the start of the compliance period were erased.
  • The Historical SLO Compliance chart increase over time in the time period before the reset and after the start of the compliance period when the SLO was reset.

Reset your budget back to 100% by:

  1. Go to the target SLO’s SLO detail view.
  2. Locate the Budget Burndown chart.
  3. Select the Reset Budget button in the Budget Burndown chart.
SLO Reset button

Ignore Past Budget Resets 

If you used Budget Reset to reset your SLO’s budget, then Ignore Budget Resets gives the ability to view the SLO Budget Burndown and SLO Historical Compliance charts without accounting for past resets. This setting does not permanently remove past resets and allows you to toggle between two perspectives.

To modify the charts’ view and ignore previous budget resets, select the Ignore Budget Resets checkbox above the SLO Historical Compliance chart.

SLO ignore budget reset

Tasks 

  • Configure Burn Alerts
  • Delete
  • Edit SLO
  • Use Ignore Budget Resets to modify charts’ view