hnswlib

Fast approximate nearest neighbor (ANN) search library implementing HNSW (Hierarchical Navigable Small World) algorithm. hnswlib features: hnswlib.Index for in-memory vector index, add_items() for batch insertion, knn_query() for k-nearest-neighbor search, save_index()/load_index() for persistence, ef_construction and M parameters for accuracy/speed trade-off, ef (query-time) parameter, multi-threading for batch queries, mark_deleted() for soft deletion, and both L2 (Euclidean) and cosine space metrics. 10-100x faster than brute-force search at slight accuracy cost. Lightweight alternative to FAISS for CPU-only environments.

Evaluated Mar 06, 2026 (0d ago) v0.8.x

Homepage ↗ Repo ↗ AI & Machine Learning python hnswlib ann vector-search embeddings hnsw approximate-nearest-neighbor similarity-search

⚙ Agent Friendliness

/ 100

Can an agent use this?

🔒 Security

/ 100

Is it safe for agents?

⚡ Reliability

/ 100

Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality

Documentation

Error Messages

Auth Simplicity

Rate Limits

🔒 Security

TLS Enforcement

Auth Strength

Scope Granularity

Dep. Hygiene

Secret Handling

Local in-memory index — no network access, no data exfiltration risk. Index files saved to disk should be protected if containing sensitive embeddings. C++ core with Python bindings — supply chain risk is low (widely used library).

⚡ Reliability

Uptime/SLA

Version Stability

Breaking Changes

Error Recovery

Best When

CPU-based semantic search over millions of embeddings — hnswlib's HNSW algorithm provides near-exact recall (>95%) with <10ms query latency for 1M vectors, with a simpler API than FAISS for CPU-only deployments.

Avoid When

You need GPU acceleration (use FAISS), billion-scale search (use DiskANN), or exact results (use brute-force).

Use Cases

• Agent semantic search — import hnswlib; dim = 384; index = hnswlib.Index(space='cosine', dim=dim); index.init_index(max_elements=10000, ef_construction=200, M=16); index.add_items(embeddings, labels); labels, distances = index.knn_query(query_embedding, k=5) — fast cosine similarity search over document embeddings; agent retrieval pipeline searches 100K documents in <10ms
• Agent recommendation engine — index = hnswlib.Index(space='l2', dim=128); index.init_index(max_elements=1000000); index.set_ef(50); index.add_items(item_embeddings, item_ids); similar_ids, dists = index.knn_query(user_embedding, k=20) — product/content recommendations via embedding similarity; agent serves personalized recommendations from 1M+ item catalog
• Agent deduplication — index.add_items(doc_embeddings); for i, emb in enumerate(doc_embeddings): labels, dists = index.knn_query(emb, k=2); if dists[0][1] < 0.1: mark_as_duplicate(i, labels[0][1]) — find near-duplicate documents via cosine similarity threshold; agent data cleaning pipeline removes semantically duplicate content
• Agent index persistence — index.save_index('embeddings.bin'); loaded = hnswlib.Index(space='cosine', dim=dim); loaded.load_index('embeddings.bin', max_elements=10000) — save and restore index across agent restarts; agent persists vector index to disk between sessions without rebuilding
• Agent dynamic index updates — index.mark_deleted(label); index.add_items(new_embeddings, new_labels) — soft-delete and add items to live index; agent knowledge base updates embeddings for edited documents without full rebuild; deleted items excluded from future searches

Not For

• GPU-accelerated search — hnswlib is CPU-only; for GPU vector search use FAISS with GPU index or cuVS
• Disk-based indexes for 100M+ vectors — hnswlib is in-memory; for billion-scale search use DiskANN or FAISS with disk-based IVF
• Exact nearest neighbor — hnswlib is approximate; for exact results use brute-force FAISS FlatIndex or scipy.spatial.cKDTree for small datasets

Interface

REST API

GraphQL

gRPC

MCP Server

SDK

Yes

Webhooks

Authentication

Methods: none

OAuth: No Scopes: No

No auth — local in-memory index.

Pricing

Model: open_source

Free tier: Yes

Requires CC: No

hnswlib is Apache 2.0 licensed. Free for all use.

Agent Metadata

Pagination

none

Idempotent

Full

Retry Guidance

Not documented

Known Gotchas

⚠ max_elements must be set at init — index.init_index(max_elements=10000) reserves memory for exactly 10000 elements; adding more raises RuntimeError: Cannot insert element — out of available memory; agent code must estimate max size upfront or use resize_index(new_max) to grow; cannot shrink
⚠ Cosine space requires pre-normalized vectors — hnswlib.Index(space='cosine') computes inner product not true cosine; vectors must be L2-normalized: v = v / np.linalg.norm(v) before add_items(); unnormalized vectors with 'cosine' space give wrong similarity scores silently
⚠ ef must be set before each query batch — index.set_ef(50) sets query-time ef globally; ef controls recall/speed trade-off (higher=better recall, slower); default ef is often too low; agent code must call set_ef() with appropriate value before knn_query(); ef must be >= k
⚠ Labels must be uint integers — add_items(embeddings, labels) requires labels as numpy uint array; Python list of ints or negative integers raise errors; agent code using string IDs must maintain separate label→string mapping; store integer label as index, lookup string from list
⚠ No true deletion — mark_deleted(label) soft-deletes by excluding from results but element stays in memory; deleted count against max_elements capacity; agent with frequent updates must rebuild index periodically to reclaim space from deleted elements
⚠ Thread safety requires num_threads — index.set_num_threads(4) enables parallel add_items() and knn_query(); single-threaded by default; agent parallel indexing pipelines must call set_num_threads() matching CPU count; concurrent knn_query() calls without thread setting cause race conditions

Alternatives

faiss-python-api

Full Evaluation Report

Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for hnswlib.

$99

API endpoint ↗ Agent guide ↗ Report inaccuracy

Scores are editorial opinions as of 2026-03-06.