Trieve Search API

Open-source infrastructure for building search and RAG pipelines combining dense vector search, sparse vector search (SPLADE), BM25 full-text, and cross-encoder reranking. Trieve provides a REST API for chunk management (ingest, update, delete) and search (semantic, fulltext, hybrid), plus recommendations and RAG with citations. Self-hostable with Trieve Cloud managed option. Strong focus on search quality via multi-stage retrieval.

Evaluated Mar 06, 2026 (0d ago) vv1

Homepage ↗ Repo ↗ AI & Machine Learning search rag hybrid-search vector fulltext recommendations open-source rust llm

⚙ Agent Friendliness

/ 100

Can an agent use this?

🔒 Security

/ 100

Is it safe for agents?

⚡ Reliability

/ 100

Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality

Documentation

Error Messages

Auth Simplicity

Rate Limits

🔒 Security

TLS Enforcement

100

Auth Strength

Scope Granularity

Dep. Hygiene

Secret Handling

Apache 2.0 open-source — auditable. HTTPS enforced. Dataset-scoped API keys. Self-hosted option for data control. Early-stage company — SOC2 status not confirmed; verify for enterprise use.

⚡ Reliability

Uptime/SLA

Version Stability

Breaking Changes

Error Recovery

Best When

You want best-in-class search quality for RAG via multi-stage hybrid retrieval (dense + sparse + reranking) with an open-source option and simple REST API.

Avoid When

You need enterprise managed service with guaranteed SLAs and compliance certifications — Trieve is early-stage for enterprise use cases.

Use Cases

• Build RAG pipelines for AI agents with Trieve's hybrid search combining dense embeddings + BM25 for better retrieval than pure vector search
• Implement multi-stage retrieval (search → rerank) for agent knowledge bases to maximize answer relevance
• Create recommendation systems for agents using Trieve's recommendation API based on similar chunks
• Build search-augmented agent workflows where agents search Trieve before generating answers, with citation tracking
• Index and search large document collections for agents using Trieve's chunking API with automatic embedding

Not For

• Simple semantic search without hybrid retrieval needs — Pinecone or Weaviate are simpler for basic vector search
• Enterprise deployments needing SOC2 and managed SLA — Trieve Cloud is early-stage; verify compliance status
• Non-English content — hybrid search quality varies by language; verify SPLADE and BM25 language support for your use case

Interface

REST API

Yes

GraphQL

gRPC

MCP Server

SDK

Yes

Webhooks

OpenAPI Spec ↗

Authentication

Methods: api_key

OAuth: No Scopes: Yes

API keys with read/write/admin scopes per dataset. Keys generated in Trieve dashboard or via API. Dataset-scoped keys for multi-tenant access control. API key in TR-API-Key header.

Pricing

Model: usage_based

Free tier: Yes

Requires CC: No

Generous free tier for prototyping. Self-hosting is free (Apache 2.0). Trieve Cloud pricing is competitive. LLM costs for RAG with generation are passed through at cost.

Agent Metadata

Pagination

page

Idempotent

Partial

Retry Guidance

Not documented

Known Gotchas

⚠ Hybrid search (dense + sparse + BM25) is more expensive per request than pure vector search — configure search type based on budget and quality needs
⚠ Chunk size affects retrieval quality — too large (>1000 tokens) reduces precision; too small (<50 tokens) loses context
⚠ SPLADE sparse vectors require separate embedding model — verify SPLADE support is enabled in your Trieve deployment
⚠ Cross-encoder reranking adds latency (~200ms) — use only when retrieval quality justifies the cost
⚠ Dataset must be created before inserting chunks — agents must manage dataset lifecycle
⚠ Embedding model is configured per dataset and cannot be changed after creation — choose embedding model carefully at setup
⚠ Self-hosted deployment requires running Qdrant, Qdrant BM25, Redis, and Trieve services — significant operational overhead

Alternatives

weaviate-api qdrant-api elasticsearch-api vespa-api

Full Evaluation Report

Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for Trieve Search API.

$99

API endpoint ↗ Agent guide ↗ Report inaccuracy

Scores are editorial opinions as of 2026-03-06.