Get insights into your system instantly with Board templates. The out-of-the-box boards provide you with custom queries and visualizations on your data with minimal clicking.
To access Board Templates:
To create a board from a template:
When creating a board from a template, customize the template with your own fields at the point of previewing the board template:
Before completing your board template creation, use Copy Link to copy and share this customized board template with other users. Selecting your shared link directs users to the setup page of this template preview with the customized fields populated.
For teams using Refinery to sample their data, the Refinery board template gives an overview of sampling operations.
The Refinery board template includes the following queries:
Query Name | Query Description | Fields Required |
---|---|---|
Stress Relief Status | Current stress level of the Refinery cluster. |
|
Dropped From Stress | Total traces dropped due to stress on the Refinery cluster. |
|
Stress Relief Log | Reasons why Refinery is going into stress relief. |
|
Cache Health | Metrics for cache health. |
|
Cache Ejections | Number of traces ejected from cache. |
|
Intercommunications | Total events from outside Refinery and events redirected from a peer. |
|
Receive Buffers | Receive buffers operations. |
|
Peer Send Buffers | Metrics for the queue used to buffer spans to send to peer nodes. |
|
Upstream Send Buffers | Metrics for the queue used to buffer spans to send to Honeycomb. |
|
EMADynamicSampler Performance | EMADynamicSampler sampling effectiveness. |
|
EMAThroughputSampler Performance | EMAThroughputSampler sampling effectiveness. |
|
WindowedThroughput Performance | WindowedThroughput sampling effectiveness. |
|
TotalThroughputSampler Performance | TotalThroughputSampler sampling effectiveness. |
|
Dynamic Performance | Dynamic sampling effectiveness. |
|
RulesBasedSampler Performance | RulesBasedSampler sampling effectiveness. |
|
Trace Indicators | Total traces sent before completion and span received for a trace already sent. |
|
Sampling Decisions | Total traces accepted and sent or dropped. |
|
Refinery Send Event Error Logs | Errors when sending events to its peers or upstream to our API server. |
|
Refinery Handler Event Error Logs | Errors when receiving or parsing events being sent to a node. |
|
Refinery Events Exceeding Max Size | Errors when events are too large to be sent to Honeycomb. |
|
The Service Health board template gives an overview of the health of your services. It provides insights into request volumes, where the slowest requests are occurring, and more.
The Service Health board template includes the following queries:
Query Name | Query Description | Fields Required |
---|---|---|
Trace Counts by Service | View total trace volume by service. |
|
Trace Counts by HTTP Status Code | View total trace volume by status code. |
|
Trace Duration Heatmap | Heatmap of the duration for all traces in the environment. |
|
Duration Heatmap | View the heatmap for duration across all services. |
|
Duration by Service | View key duration percentiles by service. |
|
Duration by Route | View duration by route or endpoint. |
|
Duration by Name | View duration by function name. |
|
Errors by Service | View errors by service. |
|
Errors by Route | View errors by route or endpoint. |
|
The RUM board template gives an overview of real user monitoring information in your frontend applications.
The RUM board template includes the following queries:
Query Name | Query Description | Fields Required |
---|---|---|
Largest Contentful Paint (LCP) | Ratings based on the render time for the largest content on a page |
|
Cumulative Layout Shift (CLS) | Ratings based on the stability of content layout on a page |
|
Interaction to Next Paint (INP) | Ratings based on the responsiveness of a page |
|
Largest Contentful Paint P75 | The 75th percentile for LCP |
|
Cumulative Layout Shift P75 | The 75th percentile for CLS |
|
Interaction to Next Paint P75 | The 75th percentile for INP |
|
Total Events by Type | Event types ranked by occurrence |
|
Largest Resource Requests | The largest resource requests ranked by the average length of their response content |
|
Top 5 Endpoints by Request Count | Top 5 endpoints ranked by number of requests |
|
Slowest Requests by Endpoint | The slowest endpoints based on the 75th percentile of request durations |
|
Top Landing Pages by Session Count | The most visited landing pages ranked by session count |
|
Pages With the Most Events | Pages with the highest number of events, highlighting the most active pages |
|
The Activity Log Security board template includes queries that show API Key activity.
Query Name | Query Description | Fields Required |
---|---|---|
API Key Added Permissions | Shows when permissions are added to an existing API key. |
|
API Key Activities by User | Displays the number of changes to API keys broken down by user. |
|
Authentication Type by User | Displays which type of authentication is used for each user. |
|
The Activity Log Leaderboard board template includes queries that show advanced and frequent usage of Honeycomb by your team.
Query Name | Query Description | Fields Required |
---|---|---|
Queries by User | Shows which environments are being queried. |
|
Complex Queries by User | Shows which users frequently use Visualize, Where, and Having clauses. |
|
Top Query Visualizations | Shows the most commonly used visualizations. |
|
Top Tinkerers | Lists which users perform the most updates to SLOs, Triggers, and Calculated Fields. |
|
Queries by Dataset | Shows which datasets are being queried the most. |
|
Queries by Environment | Shows a count of run queries as grouped by environment. |
|
The Activity Log Trigger and SLO Activity board template includes queries related to trigger and SLO activations and modifications.
Query Name | Query Description | Fields Required |
---|---|---|
Trigger State Changes | Shows instances when triggers have been triggered or resolved. |
|
Trigger Modifications | Shows creations, modifications, and deletions of triggers. |
|
Most Updated Triggers | Shows triggers that received the most changes recently. |
|
Top Updated SLOs by Update Type | Shows creations, modifications, and deletions of SLOs and the supporting SLI (Calculated Field). |
|
SLOs Created and Deleted | Shows creation and deletion of SLOs. |
|
SLI Expression Changes by SLO | Shows when SLIs (Calculated Fields) related to SLOs have been changed. |
|
The Kubernetes Pod Metrics board template includes queries to help you investigate pod performance and resource usage within Kubernetes clusters.
Query Name | Query Description | Fields Required |
---|---|---|
Pod CPU Usage | The amount of CPU used by each pod in the cluster. CPU is reported as the average core usage measured in cpu units. One cpu, in Kubernetes, is equivalent to 1 vCPU/Core for cloud providers, and 1 hyper-thread on bare-metal Intel processors. |
|
Pod Memory Usage | The amount of memory being used by each Kubernetes pod. |
|
Pod Uptime Smokestacks | As pod uptime ever-increases, this query uses the smokestack method, which applies a LOG10 to the Pod Uptime metric, and newly started or restarted pods appear more significantly than pods that have been running a long time, which move into a straight line eventually. |
|
Unhealthy Pods | This query shows trouble that pods may be experiencing during their operating lifecycle. Many of these events are present during start-up and get resolved so the presence of a count isn’t necessarily bad. |
|
Pod CPU Utilization vs. Limit | When a CPU Limit is present in a pod configuration, this query shows how much CPU that each pod uses as a percentage against that limit. |
|
Pod CPU Utilization vs. Request | When a CPU Request is present in a pod configuration, this query shows how much CPU that each pod uses as a percentage against that request value. |
|
Pod Memory Utilization vs. Limit | When a Memory Limit is present in a pod configuration, this query shows how much memory that each pod uses as a percentage against that limit value. |
|
Pod Memory Utilization vs. Request | When a Memory Request is present in a pod configuration, this query shows how much memory that each pod uses as a percentage against that request value. |
|
Pod Network IO Rates | Displays Network IO RATE_MAX for Transmit and Receive network traffic (in bytes) as a stacked graph, and gives the overall network rate and the individual rate for each node. |
|
Pods With Low Filesystem Availability | Shows any pods where filesystem availability is below 5 GB. |
|
Pod Filesystem Usage | Shows the amount of filesystem usage per Kubernetes pod, displayed in a stack graph to show total filesystem usage of all pods. |
|
Pods Per Namespace | Shows the number of pods currently running in each Kubernetes namespace. |
|
Pods Per Node | Shows the number of pods currently running in each Kubernetes Node. |
|
Pod Network Errors | Shows network errors in receive and transmit, grouped by pod. |
|
Pods Per Deployment | The number of pods currently deployed in different Kubernetes deployments. |
|
The Kubernetes Node Metrics board template includes queries to help you investigate node performance and resource usage within Kubernetes clusters.
Query Name | Query Description | Fields Required |
---|---|---|
Node CPU Usage | The amount of CPU used on each node in the cluster. CPU is reported as the average core usage measured in cpu units. One cpu, in Kubernetes, is equivalent to 1 vCPU/Core for cloud providers, and 1 hyper-thread on bare-metal Intel processors. |
|
Node Memory Utilization | Shows percent of memory used on each Kubernetes node. |
|
Node Network IO Rates | Displays Network IO RATE_MAX for Transmit and Receive network traffic as a stacked graph, and gives overall network rate and the individual rate for each node. |
|
Unhealthy Nodes | This query shows errors that Kubernetes nodes are experiencing. |
|
Node Filesystem Utilization | Shows percent of filesystem used on each node. |
|
Node Uptime Smokestack | As node uptime ever-increases, this query uses the smokestack method, which applies a LOG10 to the Node Uptime metric, and newly started or restarted nodes appear more significantly than nodes that have been running a long time, which move into a straight line eventually. |
|
Node Network Errors | Shows network transmit and receive errors for each node. |
|
Pods and Containers per Node | Shows the number of pods and the number of containers per node as stacked graphs, and also shows total number of pods and containers across the environment. |
|
The Kubernetes Workload Health board template includes queries that help you investigate Kubernetes-related application problems.
Query Name | Query Description | Fields Required |
---|---|---|
Container Restarts | Shows the total number of restarts per pod, and the rate of restarts of pods where the restart count is greater than zero. |
|
Unhealthy Pods | This query shows trouble that pods may be experiencing during their operating lifecycle. Many of these events are present during start-up and get resolved so the presence of a count isn’t necessarily bad. |
|
Pending Pods | Find pods in a “Pending” state. |
|
Failed Pods | Find pods in a “Failed” or “Unknown” state. |
|
Unhealthy Nodes | This query shows errors that Kubernetes nodes are experiencing. |
|
Unhealthy Volumes | This query shows volume creation and attachment failures. |
|
Unscheduled Daemonset Pods | Track cases where a pod in a daemonset is not currently running on every node in the cluster as it should be. |
k8s.namespace.name |
Stateful Set Pod Readiness | Track any stateful sets where pods are in an non-ready state that should be in a ready state. |
|
Deployment Pod Status | Look for Deployments where Pods have not fully deployed. Numbers greater than zero show pods in a deployment that are not yet “ready”. |
|
Job Failures | Track the number of failed pods in Kubernetes jobs. |
|
Active Cron Jobs | Track the number of active pods in each Kubernetes cron job. |
|
The OpenTelemetry Collector Operations board template includes queries with useful metrics that are emitted by the OpenTelemetry Collector during its operation.
Query Name | Query Description | Fields Required |
---|---|---|
Exporter Span Failures | Shows when errors happen during enqueueing or sending in exporters. |
|
Collector Uptime Smokestacks | Shows the uptime for different pods with a Log10 to make it clearer where restarts are happening. |
|
Exporter Metric Send Failures | Shows when errors happen during sending from exporters. |
|
Exporter Metrics Enqueue Failures | Shows when errors happen during enqueueing in exporters. |
|
Exporter Log Records Failures | Shows when errors happen during enqueueing or sending in exporters. |
|
Learn more about the OpenTelemetry Collector.
The OpenTelemetry Java Metrics board template includes queries that help to investigate application problems related to Java Virtual Machine (JVM). Metrics for Java applications come from the JVM, and are reported by the OpenTelemetry Java Agent or Honeycomb OpenTelemetry Distribution for Java.
Query Name | Query Description | Fields Required |
---|---|---|
JVM Memory Usage (Young Generation) | Eden space on the JVM heap is where newly created objects are stored. When it fills, a minor Garbage Collection (GC) occurs, moving all “live” objects to the Survivor space. In addition to current memory usage, committed represents the guaranteed available memory, and limit represents maximum usable. |
|
JVM Memory Usage (Old Generation) | Tenured Gen JVM heap space stores long-lived objects. When a Full or Major GC is performed, it is expensive and may pause app execution. Committed represents guaranteed available memory, and limit represents maximum usable memory. |
|
JVM GC (Garbage Collection) Activity | JVM GC actions occur periodically to reclaim memory but consume CPU cycles to do so. In the worst cases, a GC can cause the entire JVM to pause, making the application appear unresponsive. |
|
JVM CPU Utilization | Shows system CPU utilization and 1-minute load average, as captured by the JVM. |
|
JVM Buffer Memory Usage | Buffer memory is provided by the OS and is outside the JVM’s heap memory allocation. It is used by Java NIO to quickly write data to network or disk. |
|
JVM Non-Heap Memory Usage | JVM non-heap memory is allocated above and beyond the heap size you’ve configured. It is a section of memory in the JVM that stores class information (Metaspace), compiled code cache, thread stack, and so on. It cannot be garbage collected. |
|
The AWS Lambda Health board template includes queries for observing AWS Lambda function health via invocations, errors, throttles, and concurrency.
Query Name | Query Description | Fields Required |
---|---|---|
Duration & Execution by ID/Version | Tracks the execution time of Lambda functions, identified by their ID or version. Useful for analyzing the performance and efficiency of different versions or instances of a function over time. |
|
Lambda Invocations by Function | Displays the total number of times each Lambda function is invoked. It helps in tracking the frequency of usage of different functions, enabling a clear understanding of which functions are most or least used. |
|
Latency by Function/Metric | Displays the response time for each Lambda function, broken down by specific metrics. Useful for identifying functions that may be experiencing performance issues due to high latency. |
|
Function Error Count and Rate | Displays two key pieces of information: the total number of errors encountered by each Lambda function and the error rate, calculated as the ratio of errors to total invocations. Useful for pinpointing functions that are failing or experiencing issues. |
|
Lambda Throttles | Displays the instances where Lambda invocations are being throttled, such as when the number of function calls exceeds the concurrency limits. Tracking this helps in managing and optimizing the scalability settings for each function. |
|
Function Concurrency | Monitors the simultaneous execution count of each Lambda function, tracking how many instances of a function are running at the same time. |
|
To explore common issues when working with Board Templates, visit Common Issues with Visualization: Board Templates.