Most voice agent dashboards fail before they're built.
The problem isn't missing data — it's the wrong data. Last month, we reviewed 23 voice agent dashboards across Hamming customers. The median dashboard tracked 47 metrics. But when we asked "which metrics actually drove decisions?" — the answer was 6.
This guide gives you those 6 metrics, the exact charts to visualize them, and a copy-paste executive report template. No fluff, no "it depends."
TL;DR: Build your voice agent dashboard using Hamming's 6-Metric Dashboard Framework:
- Tier 1 (Business): Task Success Rate, Containment Rate
- Tier 2 (Quality): Latency P50/P90, Conversation Coherence Score
- Tier 3 (Operations): Error Rate, Call Volume
Include 4 core charts: success funnel, latency histogram, error breakdown, trend lines. Report weekly with the 3-section executive template below.
Methodology Note: The 6-Metric Framework and benchmarks in this guide are derived from Hamming's analysis of voice agent dashboards across 50+ deployments (2024-2025). The "47 metrics → 6 that matter" finding comes from reviewing customer dashboards and interviewing engineering leads on which metrics drove actual decisions.
Last Updated: January 2026
Related Guides:
- Voice Agent Monitoring KPIs — 10 Critical Production Metrics
- How to Evaluate Voice Agents — VOICE Framework
- Anatomy of a Perfect Dashboard — Dashboard Design Principles
What is a Voice Agent Analytics Dashboard?
A voice agent analytics dashboard is a real-time monitoring interface that tracks the key performance indicators (KPIs) of AI-powered voice agents in production. Unlike traditional call center dashboards that focus on queue times and agent utilization, voice agent dashboards must capture AI-specific metrics: response latency, task completion, conversation coherence, and error rates.
Why you need a dedicated voice agent dashboard:
| Traditional Call Center Metrics | Voice Agent Metrics |
|---|---|
| Average Handle Time | Task Success Rate |
| Abandonment Rate | Containment Rate |
| Queue Wait Time | Response Latency P50/P90 |
| Agent Utilization | Conversation Coherence |
| Service Level | Error Rate by Type |
The 6 metrics in Hamming's framework span three tiers: Business Outcomes, Quality Indicators, and Operational Health.
Voice Agent Dashboard Metrics (Complete List)
This master table provides the 6 essential metrics every voice agent dashboard needs. Each metric includes its definition, calculation formula, threshold benchmarks, and remediation actions.
| Metric | Definition | Formula | Good Threshold | Bad Threshold | What to Do If Bad |
|---|---|---|---|---|---|
| Task Success Rate | % of calls where user goal was achieved | (Successful tasks / Total calls) × 100 | ≥85% | <70% | Review failed transcripts, retrain intent detection |
| Containment Rate | % of calls resolved without human handoff | (Contained calls / Total calls) × 100 | ≥70% | <50% | Add fallback intents, improve edge case handling |
| Latency (P50) | Median response time from user speech end to agent response start | 50th percentile of response times | <1.5s | >3s | Optimize TTS/LLM pipeline, check provider SLAs |
| Latency (P90/P99) | 90th/99th percentile response time (tail latency) | 90th/99th percentile of response times | <3s | >5s | Investigate outliers, add timeout handling |
| Conversation Coherence | % of responses contextually appropriate | (Coherent responses / Total responses) × 100 | ≥90% | <75% | Review context window, check prompt engineering |
| Error Rate | % of calls with system errors (ASR fail, timeout, crash) | (Error calls / Total calls) × 100 | <3% | >10% | Check ASR confidence, add retry logic |
| Call Volume | Total calls per time period | Count of calls | Baseline ±20% | Spike >2x or drop >50% | Investigate traffic source, check for incidents |
What Charts Should Be on a Voice Agent Dashboard?
The right chart type makes the difference between actionable insights and noise. Here's what to use for each metric:
| Metric | Recommended Chart Type | Why This Chart | Update Frequency |
|---|---|---|---|
| Task Success Rate | Funnel chart | Shows drop-off at each conversation stage | Daily |
| Containment Rate | Stacked bar | Visualizes contained vs escalated breakdown | Daily |
| Latency (P50/P90) | Histogram + trend line | Shows distribution and identifies outliers | Hourly |
| Conversation Coherence | Heatmap by intent | Highlights which intents have quality issues | Weekly |
| Error Rate | Pie chart + time series | Shows error type breakdown and trends | Hourly |
| Call Volume | Area chart | Shows traffic patterns and anomalies | Real-time |
Dashboard Layout Template
Use this layout as a starting point for your voice agent dashboard:
┌─────────────────────────────────────────────────────────────────┐
│ VOICE AGENT DASHBOARD — [Agent Name] — Last 7 Days │
├─────────────────────────────────────────────────────────────────┤
│ │
│ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ ┌─────────┐│
│ │ Success │ │ Containment │ │ Latency P50 │ │ Errors ││
│ │ 87% │ │ 82% │ │ 1.8s │ │ 2.1% ││
│ │ ▲ 3% │ │ ▼ 1% │ │ ▲ 0.2s │ │ ─ ││
│ └─────────────┘ └─────────────┘ └─────────────┘ └─────────┘│
│ │
│ ┌──────────────────────────────┐ ┌────────────────────────────┤
│ │ Task Success Funnel │ │ Latency Distribution │
│ │ ████████████████ 100% │ │ ▁▂▅██▇▃▁ │
│ │ ██████████████ 92% │ │ P50 P90 │
│ │ ████████████ 87% │ │ 1.8s 3.5s │
│ └──────────────────────────────┘ └────────────────────────────┤
│ │
│ ┌──────────────────────────────┐ ┌────────────────────────────┤
│ │ Call Volume (7 days) │ │ Error Breakdown │
│ │ ▂▄▆█▇▅▃ │ │ ● ASR Timeout 45% │
│ │ Mon → Sun │ │ ● LLM Error 30% │
│ │ Peak: Tue 2pm │ │ ● Hangup 25% │
│ └──────────────────────────────┘ └────────────────────────────┤
└─────────────────────────────────────────────────────────────────┘
Voice Agent KPI Thresholds: Good vs Bad Benchmarks
Setting the right thresholds prevents alert fatigue while catching real issues. These benchmarks are derived from Hamming's analysis of 50+ production deployments.
| Metric | Excellent | Good | Acceptable | Warning | Critical |
|---|---|---|---|---|---|
| Task Success Rate | ≥90% | ≥85% | ≥75% | <75% | <70% |
| Containment Rate | ≥80% | ≥70% | ≥60% | <60% | <50% |
| Latency P50 | <1s | <1.5s | <2s | >2s | >3s |
| Latency P90/P99 | <2s | <3s | <4s | >4s | >5s |
| Coherence | ≥95% | ≥90% | ≥85% | <85% | <75% |
| Error Rate | <1% | <3% | <5% | >5% | >10% |
How to set your own thresholds:
- Baseline first: Run for 2 weeks without alerts to establish your normal ranges
- Start loose: Set thresholds 20% wider than your baseline initially
- Tighten gradually: Reduce thresholds by 5% each week as you fix issues
- Use duration filters: Alert only if threshold is breached for 5+ minutes (prevents flapping)
Voice Agent Executive Report Template
Use this structure for weekly stakeholder updates. Copy and customize for your team.
Section 1: Performance Summary (2-3 sentences)
"This week, [Agent Name] handled [X] calls with [Y]% task success rate ([up/down] [Z]% from last week). Containment held at [A]%, and median latency was [B]ms."
Example:
"This week, the Appointment Scheduler handled 12,450 calls with 87% task success rate (up 3% from last week). Containment held at 82%, and median latency was 1.8s."
Section 2: Key Metrics Table
| Metric | This Week | Last Week | Trend | Status |
|---|---|---|---|---|
| Task Success Rate | 87% | 84% | ▲ 3% | 🟢 Good |
| Containment Rate | 82% | 83% | ▼ 1% | 🟡 Watch |
| Latency (P50) | 1.8s | 1.6s | ▲ 0.2s | 🟢 Good |
| Error Rate | 2.1% | 2.3% | ▼ 0.2% | 🟢 Good |
| Call Volume | 12,450 | 11,200 | ▲ 11% | 🟢 Normal |
Section 3: Actions & Recommendations
What's Working:
- [Specific improvement with data]
- [Second win]
Areas to Watch:
- [Metric trending wrong direction] — Action: [Specific next step]
- [Issue identified] — Action: [Owner] to investigate by [date]
Next Week Focus:
- [Priority 1]
- [Priority 2]
How to Build a Voice Agent Dashboard
Follow these steps to implement the 6-Metric Dashboard Framework:
Step 1: Instrument Your Voice Agent (Day 1)
Log these events from your voice agent:
{
"event": "call.ended",
"call_id": "call_abc123",
"timestamp": "2026-01-22T10:30:00Z",
"metrics": {
"task_success": true,
"contained": true,
"latency_p50_ms": 1800,
"latency_p90_ms": 3500,
"coherence_score": 0.92,
"error_occurred": false,
"error_type": null
}
}
Step 2: Choose Your Dashboard Tool (Day 1-2)
| Option | Setup Time | Best For |
|---|---|---|
| Hamming | 5 minutes | Production voice agents with native integrations |
| Grafana | 2-4 hours | Teams with existing Prometheus/InfluxDB |
| Datadog | 2-4 hours | Teams already using Datadog for APM |
| Custom | Days/weeks | Unique requirements with dedicated eng capacity |
Step 3: Configure Alerts (Day 2-3)
Start with these four high-impact alerts:
- Task Success <75% for 10 minutes → P1 (immediate investigation)
- Latency P90 >5s for 5 minutes → P1 (user experience degradation)
- Error Rate >5% for 5 minutes → P1 (system health issue)
- Call Volume drop >50% for 15 minutes → P2 (potential outage)
Step 4: Set Up Weekly Reports (Day 3)
Schedule automated reports every Monday morning with:
- Previous week's metrics vs. targets
- Week-over-week trends
- Top 3 action items for the week
Choosing a Voice Agent Dashboard Platform
| Capability | Hamming | Datadog/Grafana | Spreadsheets | Custom Build |
|---|---|---|---|---|
| Voice-native KPIs (latency P50/P90, coherence) | ✅ 50+ built-in | ⚠️ Manual setup | ❌ Manual | ⚠️ Build it yourself |
| Pre-built dashboard templates | ✅ Ready in 5 min | ⚠️ Generic templates | ❌ From scratch | ❌ From scratch |
| Executive report automation | ✅ Weekly emails | ❌ Manual | ❌ Manual | ⚠️ Build it yourself |
| Integration with Vapi/Retell/LiveKit | ✅ Native | ⚠️ Custom webhooks | ❌ None | ⚠️ Build it yourself |
| Drill-down to transcripts | ✅ One click | ❌ Not supported | ❌ Not supported | ⚠️ Build it yourself |
| Setup time | 5 minutes | 2-4 hours | 1-2 hours | Days/weeks |
| Best for | Production voice agents | General APM monitoring | <100 calls/day | Unique requirements |
When to Use Each
- Use Hamming when: You need voice-native dashboards with automated executive reporting and want setup in minutes, not days.
- Use Datadog/Grafana when: You already have these tools for other services and want a single pane of glass (but expect custom instrumentation work).
- Use Spreadsheets when: You're in early testing with <100 calls/day and don't need real-time monitoring.
- Build Custom when: You have unique requirements that no tool supports and dedicated engineering capacity.
How Teams Implement This with Hamming
Hamming provides the infrastructure to implement the 6-Metric Dashboard Framework without custom development:
- One-click dashboard setup: Pre-built templates with all 6 metrics configured out of the box
- Automatic threshold monitoring: Alerts when metrics cross good/bad boundaries (configurable per-metric)
- Drill-down to transcripts: Click any data point to see underlying calls and identify root causes
- Executive report generation: Weekly summaries auto-generated and emailed to stakeholders
- 50+ built-in assertions: Validate task success, containment, and coherence automatically
- Integration with major platforms: Native support for Vapi, Retell, LiveKit, Twilio
- Historical benchmarking: Compare against your own baselines and industry averages
- Custom metric support: Add domain-specific KPIs alongside the standard 6
Getting Started:
- Connect your voice agent (5 minutes)
- Select the "6-Metric Dashboard" template
- Configure thresholds for your use case
- Set up weekly executive report recipients
Worked Formula Examples
Task Success Rate Calculation
Formula: (Successful tasks / Total calls) × 100
Example:
- Total calls this week: 12,450
- Calls with successful task completion: 10,831
- Task Success Rate: (10,831 / 12,450) × 100 = 87.0%
Latency Percentile Interpretation
P50 vs P90:
- P50 = 1.8s: Half of all responses are faster than 1.8s
- P90 = 3.5s: 90% of responses are faster than 3.5s; 10% are slower
Action trigger: If P90 > 2× P50, investigate tail latency causes (usually ASR timeouts or LLM cold starts).
Containment Rate Calculation
Formula: (Calls resolved by AI / Total calls) × 100
Example:
- Total calls: 12,450
- Calls resolved without human transfer: 10,209
- Containment Rate: (10,209 / 12,450) × 100 = 82.0%
Voice Agent Dashboard Quick-Start Checklist
Use this checklist to implement your dashboard in one week:
Day 1: Instrumentation
- Add call.ended event logging with all 6 metrics
- Verify events are flowing to your data store
- Test with 100+ sample calls
Day 2: Dashboard Setup
- Choose dashboard tool (Hamming recommended for voice agents)
- Create 4 scorecards: Success, Containment, Latency, Errors
- Add funnel chart for task success
- Add histogram for latency distribution
Day 3: Alerting
- Configure 4 critical alerts (see Step 3 above)
- Set up notification channels (Slack, PagerDuty, email)
- Test alerts with synthetic threshold breaches
Day 4-5: Baseline & Tune
- Let dashboard run for 2 days to establish baseline
- Adjust thresholds based on observed ranges
- Remove false positive alerts
Day 6-7: Reporting
- Set up weekly executive report
- Configure recipient list
- Send first report to stakeholders

