Voyage AI
Provides state-of-the-art embedding models and rerankers optimized for retrieval accuracy, used by Anthropic and trusted for RAG pipelines.
Score Breakdown
⚙ Agent Friendliness
🔒 Security
TLS enforced on all endpoints; no scope-based key permissions; API key rotation supported via dashboard; data not used for model training per ToS.
⚡ Reliability
Best When
You need maximum retrieval accuracy for RAG or semantic search and are willing to pay a premium over commodity embedding APIs.
Avoid When
Your workload is cost-sensitive at scale and OpenAI or Cohere embeddings already meet your accuracy bar.
Use Cases
- • Generating high-quality embeddings for RAG pipelines over large document corpora
- • Reranking retrieved candidates to improve answer relevance before sending to LLM
- • Semantic search over code repositories using voyage-code-3 model
- • Domain-specific embedding with finance or law optimized models
- • Building long-context retrieval pipelines with up to 32K token input support
Not For
- • Generating text completions or chat responses — embeddings only
- • Real-time streaming inference requiring sub-10ms latency
- • On-premise deployment where data cannot leave your infrastructure
Interface
Authentication
Single API key passed as Bearer token in Authorization header; keys are per-account with no scope granularity.
Pricing
Free tier credited at account creation; no ongoing free quota after initial allocation is consumed.
Agent Metadata
Known Gotchas
- ⚠ Input token limit per request is 128K tokens total across all texts in a batch — splitting large batches is required
- ⚠ Model names are versioned (e.g. voyage-3) but silent updates can shift embedding space, breaking cached index compatibility
- ⚠ Reranker and embedder are separate endpoints with separate pricing — easy to conflate in cost estimation
- ⚠ voyage-code-3 is recommended for mixed code+prose but not for pure natural language where voyage-3 outperforms it
- ⚠ The Python SDK truncates inputs silently by default — set truncation=False to surface length errors explicitly
Alternatives
Full Evaluation Report
Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for Voyage AI.
Scores are editorial opinions as of 2026-03-06.