Get instant insights into your system with Board templates.
Board templates are pre-configured Boards that come with ready-made queries and visualizations, providing valuable insights with minimal setup. Use a template as starting point to create a Board.
Templates are designed for specific use cases and built around industry best practices, ensuring effective configurations for tracking key metrics and visualizing data accurately.
Choose from a variety of templates to quickly gain insights across different areas of your system:
The Service Health Board template offers an overview of your services’ health. It provides insights into request volumes, identifies where the slowest requests are occurring, and more.
The Service Health Board template includes the following queries:
Query Name | Query Description | Required Fields |
---|---|---|
Trace Counts by Service | Shows total trace volume by service. |
|
Trace Counts by HTTP Status Code | Shows total trace volume by status code. |
|
Trace Duration Heatmap | Shows a heatmap of the duration for all traces. |
|
Duration Heatmap | Shows a heatmap of duration across all services. |
|
Duration by Service | Shows key duration percentiles by service. |
|
Duration by Route | Shows duration by route or endpoint. |
|
Duration by Name | Shows duration by function name. |
|
Errors by Service | Shows a count of errors grouped by service. |
|
Errors by Route | Shows a count of errors grouped by route or endpoint. |
|
The RUM Board template provides an overview of real user monitoring data from your frontend applications.
The RUM Board template includes the following queries:
Query Name | Query Description | Required Fields |
---|---|---|
Largest Contentful Paint (LCP) | Shows ratings based on the render time for the largest content on a page. |
|
Cumulative Layout Shift (CLS) | Shows ratings based on the stability of content layout on a page. |
|
Interaction to Next Paint (INP) | Shows ratings based on the responsiveness of a page. |
|
LCP P75 | Shows the 75th percentile for LCP. |
|
CLS P75 | Shows the 75th percentile for CLS. |
|
INP P75 | Shows the 75th percentile for INP. |
|
Total Events by Type | Shows event types ranked by occurrence. |
|
Largest Resource Requests | Shows the largest resource requests ranked by the average length of their response content. |
|
Top 5 Endpoints by Request Count | Shows the top 5 endpoints ranked by number of requests. |
|
Slowest Requests by Endpoint | Shows the slowest endpoints based on the 75th percentile of request durations. |
|
Top Landing Pages by Session Count | Shows the most visited landing pages ranked by session count. |
|
Pages With the Most Events | Shows pages with the highest number of events, highlighting the most active pages. |
|
For teams using Refinery to sample their data, the Refinery Board template provides an overview of sampling operations.
The Refinery Board template includes the following queries:
Query Name | Query Description | Required Fields |
---|---|---|
Stress Relief Status | Shows the current stress level on the Refinery cluster. |
|
Dropped From Stress | Shows how many traces are being dropped due to stress on the Refinery cluster. |
|
Stress Relief Log | Shows reasons why Refinery is going into stress relief. |
|
Cache Health | Shows metrics for cache health. |
|
Cache Ejections | Shows number of traces ejected from cache. |
|
Intercommunications | Shows total events from outside Refinery and events redirected from a peer. |
|
Receive Buffers | Shows receive buffer operations. |
|
Peer Send Buffers | Show metrics for the queue used to buffer spans to send to peer nodes. |
|
Upstream Send Buffers | Shows metrics for the queue used to buffer spans to send to Honeycomb. |
|
EMADynamicSampler Performance | Shows EMADynamicSampler sampling effectiveness. |
|
EMAThroughputSampler Performance | Shows EMAThroughputSampler sampling effectiveness. |
|
WindowedThroughput Performance | Shows WindowedThroughput sampling effectiveness. |
|
TotalThroughputSampler Performance | Shows TotalThroughputSampler sampling effectiveness. |
|
DynamicSampler Performance | Shows DynamicSampler sampling effectiveness. |
|
RulesBasedSampler Performance | Shows RulesBasedSampler sampling effectiveness. |
|
Trace Indicators | Shows total traces sent before completion and span received for a trace already sent. |
|
Sampling Decisions | Shows total traces accepted and sent or dropped. |
|
Refinery Send Event Error Logs | Shows errors when sending events to its peers or upstream to our API server. |
|
Refinery Handler Event Error Logs | Shows errors when receiving or parsing events being sent to a node. |
|
Refinery Events Exceeding Max Size | Shows errors when events are too large to be sent to Honeycomb. |
|
The Kubernetes Pod Metrics Board template includes queries that help you investigate pod performance and resource usage within Kubernetes clusters:
Query Name | Query Description | Required Fields |
---|---|---|
Pod CPU Usage | Shows the amount of CPU used by each pod in the cluster. CPU is reported as the average core usage measured in cpu units. One cpu, in Kubernetes, is equivalent to 1 vCPU/Core for cloud providers, and 1 hyper-thread on bare-metal Intel processors. |
|
Pod Memory Usage | Shows the amount of memory being used by each Kubernetes pod. |
|
Pod Uptime Smokestacks | As pod uptime ever-increases, this query uses the smokestack method, which applies a LOG10 to the Pod Uptime metric, and newly started or restarted pods appear more significantly than pods that have been running a long time, which move into a straight line eventually. |
|
Unhealthy Pods | Shows trouble that pods may be experiencing during their operating lifecycle. Many of these events are present during start-up and get resolved so the presence of a count isn’t necessarily bad. |
|
Pod CPU Utilization vs. Limit | When a CPU Limit is present in a pod configuration, this query shows how much CPU that each pod uses as a percentage against that limit. |
|
Pod CPU Utilization vs. Request | When a CPU Request is present in a pod configuration, this query shows how much CPU that each pod uses as a percentage against that request value. |
|
Pod Memory Utilization vs. Limit | When a Memory Limit is present in a pod configuration, this query shows how much memory that each pod uses as a percentage against that limit value. |
|
Pod Memory Utilization vs. Request | When a Memory Request is present in a pod configuration, this query shows how much memory that each pod uses as a percentage against that request value. |
|
Pod Network IO Rates | Displays Network IO RATE_MAX for Transmit and Receive network traffic (in bytes) as a stacked graph, and gives the overall network rate and the individual rate for each node. |
|
Pods With Low Filesystem Availability | Shows any pods where filesystem availability is below 5 GB. |
|
Pod Filesystem Usage | Shows the amount of filesystem usage per Kubernetes pod, displayed in a stack graph to show total filesystem usage of all pods. |
|
Pods Per Namespace | Shows the number of pods currently running in each Kubernetes namespace. |
|
Pods Per Node | Shows the number of pods currently running in each Kubernetes Node. |
|
Pod Network Errors | Shows network errors in receive and transmit, grouped by pod. |
|
Pods Per Deployment | Shows the number of pods currently deployed in different Kubernetes deployments. |
|
The Kubernetes Node Metrics Board template includes queries that help you investigate node performance and resource usage within Kubernetes clusters:
Query Name | Query Description | Required Fields |
---|---|---|
Node CPU Usage | Shows the amount of CPU used on each node in the cluster. CPU is reported as the average core usage measured in cpu units. One cpu, in Kubernetes, is equivalent to 1 vCPU/Core for cloud providers, and 1 hyper-thread on bare-metal Intel processors. |
|
Node Memory Utilization | Shows percent of memory used on each Kubernetes node. |
|
Node Network IO Rates | Displays Network IO RATE_MAX for Transmit and Receive network traffic as a stacked graph, and gives overall network rate and the individual rate for each node. |
|
Unhealthy Nodes | Shows errors that Kubernetes nodes are experiencing. |
|
Node Filesystem Utilization | Shows percent of filesystem used on each node. |
|
Node Uptime Smokestack | As node uptime ever-increases, this query uses the smokestack method, which applies a LOG10 to the Node Uptime metric, and newly started or restarted nodes appear more significantly than nodes that have been running a long time, which move into a straight line eventually. |
|
Node Network Errors | Shows network transmit and receive errors for each node. |
|
Pods and Containers per Node | Shows the number of pods and the number of containers per node as stacked graphs, and also shows total number of pods and containers across the environment. |
|
The Kubernetes Workload Health Board template includes queries that help you diagnose Kubernetes-related application issues:
Query Name | Query Description | Required Fields |
---|---|---|
Container Restarts | Shows the total number of restarts per pod, and the rate of restarts of pods where the restart count is greater than zero. |
|
Unhealthy Pods | Shows trouble that pods may be experiencing during their operating lifecycle. Many of these events are present during start-up and get resolved so the presence of a count isn’t necessarily bad. |
|
Pending Pods | Shows pods in a “Pending” state. |
|
Failed Pods | Shows pods in a “Failed” or “Unknown” state. |
|
Unhealthy Nodes | Shows errors that Kubernetes nodes are experiencing. |
|
Unhealthy Volumes | Shows volume creation and attachment failures. |
|
Unscheduled Daemonset Pods | Tracks cases where a pod in a daemonset is not currently running on every node in the cluster as it should be. |
k8s.namespace.name |
Stateful Set Pod Readiness | Tracks any stateful sets where pods are in an non-ready state that should be in a ready state. |
|
Deployment Pod Status | Shows Deployments where Pods have not fully deployed. Numbers greater than zero show pods in a deployment that are not yet “ready”. |
|
Job Failures | Tracks the number of failed pods in Kubernetes jobs. |
|
Active Cron Jobs | Tracks the number of active pods in each Kubernetes cron job. |
|
The OpenTelemetry Collector Operations Board template includes queries with key metrics emitted by the OpenTelemetry Collector during its operation:
Query Name | Query Description | Required Fields |
---|---|---|
Exporter Span Failures | Shows when errors happen during enqueueing or sending in exporters. |
|
Collector Uptime Smokestacks | Shows the uptime for different pods with a Log10 to make it clearer where restarts are happening. |
|
Exporter Metric Send Failures | Shows when errors happen during sending from exporters. |
|
Exporter Metrics Enqueue Failures | Shows when errors happen during enqueueing in exporters. |
|
Exporter Log Records Failures | Shows when errors happen during enqueueing or sending in exporters. |
|
The OpenTelemetry Java Metrics Board template includes queries that help you investigate application issues related to the Java Virtual Machine (JVM).
Metrics for Java applications are sourced from the JVM and reported by the OpenTelemetry Java Agent or Honeycomb OpenTelemetry Distribution for Java.
Query Name | Query Description | Required Fields |
---|---|---|
JVM Memory Usage (Young Generation) | Shows memory usage for Eden space on the JVM heap, which is where newly created objects are stored. When it fills, a minor Garbage Collection (GC) occurs, moving all “live” objects to the Survivor space. In addition to current memory usage, committed represents the guaranteed available memory, and limit represents maximum usable. |
|
JVM Memory Usage (Old Generation) | Shows memory usage for tenured Gen JVM heap space, which stores long-lived objects. When a Full or Major GC is performed, it is expensive and may pause app execution. Committed represents guaranteed available memory, and limit represents maximum usable memory. |
|
JVM Garbage Collection (GC) Activity | Shows JVM garbage collection activity. JVM GC actions occur periodically to reclaim memory but consume CPU cycles to do so. In the worst cases, a GC can cause the entire JVM to pause, making the application appear unresponsive. |
|
JVM CPU Utilization | Shows system CPU utilization and 1-minute load average, as captured by the JVM. |
|
JVM Buffer Memory Usage | Shows usage of buffer memory, which is provided by the OS and is outside the JVM’s heap memory allocation. Buffer memory is used by Java NIO to quickly write data to network or disk. |
|
JVM Non-Heap Memory Usage | Shows usage of JVM non-heap memory, which is allocated above and beyond the heap size you’ve configured. JVM non-heap memory is a section of memory in the JVM that stores class information (Metaspace), compiled code cache, thread stack, and so on. It cannot be garbage collected. |
|
The AWS Lambda Health Board template includes queries that monitor the health of AWS Lambda functions, including metrics for invocations, errors, throttles, and concurrency:
Query Name | Query Description | Required Fields |
---|---|---|
Duration & Execution by ID/Version | Tracks the execution time of Lambda functions, identified by their ID or version. Useful for analyzing the performance and efficiency of different versions or instances of a function over time. |
|
Lambda Invocations by Function | Shows the total number of times each Lambda function is invoked. It helps in tracking the frequency of usage of different functions, enabling a clear understanding of which functions are most or least used. |
|
Latency by Function/Metric | Shows the response time for each Lambda function, broken down by specific metrics. Useful for identifying functions that may be experiencing performance issues due to high latency. |
|
Function Error Count and Rate | Shows two key pieces of information: the total number of errors encountered by each Lambda function and the error rate, calculated as the ratio of errors to total invocations. Useful for pinpointing functions that are failing or experiencing issues. |
|
Lambda Throttles | Shows the instances where Lambda invocations are being throttled, such as when the number of function calls exceeds the concurrency limits. Tracking this helps in managing and optimizing the scalability settings for each function. |
|
Function Concurrency | Monitors the simultaneous execution count of each Lambda function, tracking how many instances of a function are running at the same time. |
|
The Activity Log Security Board template includes queries that track API Key activity:
Query Name | Query Description | Required Fields |
---|---|---|
API Key Added Permissions | Shows when permissions are added to an existing API key. |
|
API Key Activities by User | Displays the number of changes to API keys broken down by user. |
|
Authentication Type by User | Displays which type of authentication is used for each user. |
|
The Activity Log Leaderboard Board template includes queries that highlight advanced and frequent usage of Honeycomb by your team:
Query Name | Query Description | Required Fields |
---|---|---|
Queries by User | Shows which environments are being queried. |
|
Complex Queries by User | Shows which users frequently use Visualize, Where, and Having clauses. |
|
Top Query Visualizations | Shows the most commonly used visualizations. |
|
Top Tinkerers | Lists which users perform the most updates to SLOs, Triggers, and Calculated Fields. |
|
Queries by Dataset | Shows which datasets are being queried the most. |
|
Queries by Environment | Shows a count of run queries as grouped by environment. |
|
The Activity Log Trigger and SLO Activity Board template includes queries related to trigger and SLO activations and modifications:
Query Name | Query Description | Required Fields |
---|---|---|
Trigger State Changes | Shows instances when triggers have been triggered or resolved. |
|
Trigger Modifications | Shows creations, modifications, and deletions of triggers. |
|
Most Updated Triggers | Shows triggers that received the most changes recently. |
|
Top Updated SLOs by Update Type | Shows creations, modifications, and deletions of SLOs and the supporting SLI (Calculated Field). |
|
SLOs Created and Deleted | Shows creation and deletion of SLOs. |
|
SLI Expression Changes by SLO | Shows when SLIs (Calculated Fields) related to SLOs have been changed. |
|
To explore common issues when working with Board Templates, visit Common Issues with Visualization: Board Templates.