Qdrant REST API

Qdrant is a high-performance vector database purpose-built for storing, indexing, and querying high-dimensional embeddings. Its REST API (and parallel gRPC API) supports collection management, vector upsert with arbitrary JSON payloads, approximate nearest neighbor (ANN) search with payload filtering, named vectors for multi-modal data, and sparse vectors for hybrid dense+sparse search. It is available self-hosted (Docker/Kubernetes) or as the managed Qdrant Cloud service.

Evaluated: Mar 06, 2026 (version: current)
Category: Databases
Tags: qdrant, vector-database, embeddings, similarity-search, rag, agent-memory, openapi, rust, grpc, rest, payload-filtering

⚙ Agent Friendliness: 62 / 100 (Can an agent use this?)
🔒 Security: 68 / 100 (Is it safe for agents?)
⚡ Reliability: 84 / 100 (Does it work consistently?)

Score Breakdown

⚙ Agent Friendliness

MCP Quality: 0
Documentation: 92
Error Messages: 88
Auth Simplicity: 78
Rate Limits: 72

🔒 Security

TLS Enforcement: 75
Auth Strength: 65
Scope Granularity: 45
Dep. Hygiene: 90
Secret Handling: 72

Self-hosted Qdrant ships with neither TLS nor authentication enabled; both must be configured explicitly, so misconfigured deployments are a real risk. Qdrant Cloud enforces TLS and API key auth. API key scopes are binary (read-only vs. read-write) with no collection-level granularity. The Rust implementation has a strong memory safety profile and minimal CVE history. For production self-hosted deployments, terminate TLS at the load balancer, set the QDRANT__SERVICE__API_KEY env var, and restrict access at the network level. Agents that only need search, not upsert/delete, should use read-only API keys.

⚡ Reliability

Uptime/SLA: 80
Version Stability: 85
Breaking Changes: 82
Error Recovery: 88

Best When

An agent needs semantic similarity search over embeddings with flexible payload filtering, especially in RAG pipelines or memory systems where results must be ranked by relevance AND filtered by metadata attributes simultaneously.

Avoid When

You only need exact-match retrieval, or your vector dimensionality and dataset size fit comfortably in an in-process library like FAISS without the operational overhead of a separate service.

Use Cases

  • RAG (Retrieval-Augmented Generation) memory: store document chunk embeddings and retrieve top-k by semantic similarity
  • Agent long-term memory: upsert conversation summaries as vectors and recall relevant past context at inference time
  • Semantic deduplication: find near-duplicate content by vector distance before inserting into a primary store
  • Hybrid search combining dense embedding similarity with BM25 sparse vectors for keyword relevance
  • Recommendation systems: query by example vector with payload filters (e.g., 'similar items in category=shoes, price<100')
  • Image and multi-modal search using named vector fields for different modality embeddings per point
  • Clustering and anomaly detection over agent-generated data using collection snapshots
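
The recommendation use case above pairs a query vector with payload conditions. As a minimal sketch, the request body for POST /collections/{name}/points/search could be assembled as below. The payload field names `category` and `price` come from the example in the list; the helper name `build_search_body` and the sample vector are hypothetical.

```python
import json

def build_search_body(query_vector, category, max_price, limit=10):
    """Build a JSON-serializable body for a filtered ANN search."""
    return {
        "vector": list(query_vector),
        "limit": limit,
        "with_payload": True,
        "filter": {
            # Every condition under "must" has to hold for a point to match.
            "must": [
                {"key": "category", "match": {"value": category}},
                {"key": "price", "range": {"lt": max_price}},
            ]
        },
    }

# Illustrative 3-dimensional vector; a real query must match the collection's size.
body = build_search_body([0.1, 0.2, 0.3], "shoes", 100, limit=5)
print(json.dumps(body, indent=2))
```

Building the filter from typed arguments rather than free-form text keeps the condition objects well-formed, which matters because a malformed filter can silently match nothing.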

Not For

  • General-purpose relational or document storage — Qdrant payloads are metadata, not a primary data store
  • Exact-match key-value lookups where a hash table is simpler and cheaper
  • ACID transactional workloads requiring rollback semantics
  • Real-time streaming ingestion at millions of vectors/second without careful batch sizing

Interface

REST API: Yes
GraphQL: No
gRPC: Yes
MCP Server: No
SDK: Yes
Webhooks: No

Authentication

Methods: api_key, none
OAuth: No
Scopes: No

Self-hosted Qdrant has no auth by default: the API is open on port 6333. API key auth must be explicitly enabled via config (the service.api_key field in config.yaml or the QDRANT__SERVICE__API_KEY env var). Qdrant Cloud enforces API key auth with per-cluster keys; a read-only key variant is available. There are no fine-grained permission scopes beyond read-only vs. read-write. For production self-hosted deployments, TLS, an API key, and network-level isolation are all required because the API key is the only credential layer.
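
On the client side, the key travels in an `api-key` request header. The sketch below only builds the request object and does not send it. The helper name `qdrant_request`, the client-side variable `QDRANT_API_KEY`, and the localhost URL are assumptions for illustration.

```python
import os
import urllib.request

def qdrant_request(base_url, path, api_key=None, data=None, method="GET"):
    """Build (but do not send) an HTTP request for a Qdrant REST endpoint."""
    headers = {"Content-Type": "application/json"}
    if api_key:
        # The header is omitted entirely when no key is configured.
        headers["api-key"] = api_key
    url = base_url.rstrip("/") + path
    return urllib.request.Request(url, data=data, headers=headers, method=method)

# QDRANT_API_KEY is a hypothetical client-side variable, not a Qdrant setting.
api_key = os.environ.get("QDRANT_API_KEY")
req = qdrant_request("http://localhost:6333", "/collections", api_key=api_key)
# urllib.request.urlopen(req) would send it; use an https:// URL once TLS is in place.
```

If a request built without a key succeeds against a self-hosted instance, that is a sign API key auth has not been enabled.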

Pricing

Model: open_source
Free tier: Yes
Requires CC: No

Self-hosted Docker is the fastest path to zero-cost evaluation. The Qdrant Cloud free cluster is useful for development without infrastructure management. Production HA requires at minimum a 3-node cluster for replication factor >= 2.

Agent Metadata

Pagination: cursor
Idempotent: Full
Retry Guidance: Documented
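
Cursor pagination applies to the scroll endpoint: each response carries a `next_page_offset` that is passed back as `offset` on the next call until it comes back null. A minimal sketch follows, with the HTTP transport injected as a hypothetical `post(path, body)` callable that returns the parsed JSON response. The in-memory `fake_post` only imitates the response shape for demonstration.

```python
def scroll_all(post, collection, page_size=100):
    """Yield every point in a collection by following the scroll cursor."""
    offset = None
    while True:
        body = {"limit": page_size, "with_payload": True}
        if offset is not None:
            body["offset"] = offset
        result = post(f"/collections/{collection}/points/scroll", body)["result"]
        yield from result["points"]
        offset = result.get("next_page_offset")
        if offset is None:  # no further pages
            break

def fake_post(path, body):
    """Stand-in transport over five points whose IDs equal their positions."""
    data = [{"id": i} for i in range(5)]
    start = body.get("offset") or 0
    end = start + body["limit"]
    next_offset = end if end < len(data) else None
    return {"result": {"points": data[start:end], "next_page_offset": next_offset}}

ids = [p["id"] for p in scroll_all(fake_post, "docs", page_size=2)]
print(ids)
```

Injecting the transport keeps the paging logic independent of any particular HTTP client and makes it straightforward to wrap with retries.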

Known Gotchas

  • Self-hosted Qdrant has NO authentication by default — any unauthenticated request succeeds; agents must verify API key is configured before treating the instance as secure
  • Vector dimensions must exactly match the collection's configured size at creation time; inserting a 1536-dim vector into a 768-dim collection returns a cryptic 'Wrong input: Vector inserting error' with no mention of dimensionality mismatch
  • HNSW index is built asynchronously after upsert — immediately searching after bulk upsert may return 0 results or incomplete results; agents should check collection 'indexed_vectors_count' before querying or use the 'wait=true' parameter on upsert
  • Payload filters use a different syntax from most query languages (match, range, geo_bounding_box objects) — agents generating filters from natural language frequently generate invalid filter objects that return 0 results instead of an error
  • Qdrant point IDs must be either unsigned 64-bit integers or UUIDs — string IDs like 'doc_abc' are not supported and cause a 400 error that is easy to encounter when porting from other vector stores
  • Scroll (full collection scan) vs. search (ANN query) are different endpoints with different pagination — agents that want to retrieve all points must use /collections/{name}/points/scroll, not /collections/{name}/points/search
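
Two of the gotchas above (point ID format and dimension mismatch) can be caught client-side before a request is sent. The sketch below maps arbitrary string IDs to deterministic UUIDs and checks vector length. The namespace value, the helper names, and the `source_id` payload field are hypothetical choices, not part of Qdrant.

```python
import uuid

# Hypothetical fixed namespace so the same external ID always maps to the same UUID.
ID_NAMESPACE = uuid.UUID("12345678-1234-5678-1234-567812345678")

def to_point_id(external_id):
    """Return a valid point ID: an unsigned 64-bit int as-is, else a UUIDv5 string."""
    is_int = isinstance(external_id, int) and not isinstance(external_id, bool)
    if is_int and 0 <= external_id < 2**64:
        return external_id
    return str(uuid.uuid5(ID_NAMESPACE, str(external_id)))

def make_point(external_id, vector, expected_dim, payload=None):
    """Build one point for an upsert body, failing fast on a dimension mismatch."""
    if len(vector) != expected_dim:
        raise ValueError(
            f"vector has {len(vector)} dims, collection expects {expected_dim}"
        )
    payload = dict(payload or {})
    payload["source_id"] = external_id  # keep the original ID for later lookups
    return {"id": to_point_id(external_id), "vector": list(vector), "payload": payload}

point = make_point("doc_abc", [0.0, 0.1, 0.2, 0.3], expected_dim=4)
upsert_body = {"points": [point]}  # body for PUT /collections/{name}/points?wait=true
```

Because the UUID is derived from the external ID, re-upserting the same document overwrites its point instead of creating a duplicate.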

Scores are editorial opinions as of 2026-03-06.