Deepgram API
Provides real-time and batch speech-to-text transcription via REST and WebSocket, supporting 200+ languages with the Nova-3 model.
Score Breakdown
⚙ Agent Friendliness
🔒 Security
Project-scoped API keys are a good practice; HIPAA BAA available for healthcare use cases on qualifying plans
⚡ Reliability
Best When
You need low-latency, production-grade streaming transcription or batch processing of audio at scale with strong SDK support.
Avoid When
Your budget is near zero and latency tolerance is high enough to run open-source Whisper locally.
Use Cases
- • Transcribe live phone calls or voice sessions in real time for agent decision-making
- • Convert recorded meeting audio to searchable text for knowledge extraction pipelines
- • Power voice-command interfaces where an agent listens and acts on spoken instructions
- • Generate accurate captions or subtitles for audio/video content at scale
- • Extract structured data (names, dates, intents) from spoken customer interactions via post-transcription NLP
Not For
- • Applications that need on-device, fully offline speech recognition without any cloud dependency
- • Use cases requiring speaker diarization with more than ~10 speakers at very high accuracy
- • Scenarios where per-minute transcription cost must approach zero (self-hosted Whisper may be preferable)
Interface
Authentication
API key passed via Authorization: Token <key> header; keys are project-scoped and can be created per project in the console
Pricing
$200 credit requires no credit card; credit card needed when credits are exhausted to continue usage
Agent Metadata
Known Gotchas
- ⚠ WebSocket connections require sending a CloseStream message to flush final transcript; omitting it causes incomplete final transcripts
- ⚠ The is_final vs speech_final distinction in streaming results is critical: agents acting on interim results risk acting on incorrect partial text
- ⚠ Audio format must be explicitly declared (encoding, sample_rate, channels) or auto-detection may produce degraded accuracy
- ⚠ Free $200 credit expires; agents hitting production without a billing method will receive 402 errors with no grace period
- ⚠ The Nova-3 model ID may differ by use case (nova-3 vs nova-3-medical); using the wrong model tier affects both accuracy and cost
Alternatives
Full Evaluation Report
Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for Deepgram API.
Scores are editorial opinions as of 2026-03-06.