Letta (MemGPT)

Open-source agentic framework (formerly MemGPT) with a REST API server providing agents persistent, structured memory management — enabling agents to remember users, facts, and history across unlimited conversations.

Evaluated Mar 06, 2026 (0d ago) v0.6.x
Homepage ↗ Repo ↗ AI & Machine Learning memgpt persistent-memory agent-framework stateful-agents open-source self-hosted long-context rag
⚙ Agent Friendliness
62
/ 100
Can an agent use this?
🔒 Security
81
/ 100
Is it safe for agents?
⚡ Reliability
73
/ 100
Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality
--
Documentation
80
Error Messages
74
Auth Simplicity
95
Rate Limits
88

🔒 Security

TLS Enforcement
90
Auth Strength
82
Scope Granularity
70
Dep. Hygiene
78
Secret Handling
85

Self-hosted deployment provides full data control; default local dev config has no auth — must be hardened before network exposure

⚡ Reliability

Uptime/SLA
72
Version Stability
76
Breaking Changes
68
Error Recovery
75
AF Security Reliability

Best When

You need agents with rich, durable memory that persists across sessions and can self-manage what to remember — especially when data privacy requires self-hosting.

Avoid When

Your agents are stateless, session-scoped, or you need a fully managed cloud platform without operational burden.

Use Cases

  • Build personal assistant agents that accumulate and recall user preferences, past decisions, and long-term context across many sessions
  • Create customer-facing agents that maintain persistent relationship context (purchase history, preferences, prior issues) per user
  • Deploy research agents that continuously update a structured knowledge base as they discover new information
  • Implement agents with tiered memory (in-context, archival, recall) that intelligently manage what to keep active versus store
  • Run self-hosted conversational agents for sensitive data use cases where cloud memory storage is not acceptable

Not For

  • Single-turn stateless tasks — Letta's memory architecture adds unnecessary overhead for simple request-response agents
  • Teams needing a fully managed cloud platform with SLA guarantees — self-hosted requires infrastructure management
  • Workflows requiring real-time sub-200ms responses — memory retrieval and context management add latency

Interface

REST API
Yes
GraphQL
No
gRPC
No
MCP Server
No
SDK
Yes
Webhooks
No

Authentication

Methods: api_key none
OAuth: No Scopes: No

Self-hosted server can run with no auth (local dev) or with API key; Letta Cloud uses API key authentication

Pricing

Model: open_source
Free tier: Yes
Requires CC: No

Open-source Apache 2.0 for self-hosted; Letta Cloud is freemium with paid tiers for production use

Agent Metadata

Pagination
offset
Idempotent
Conditional
Retry Guidance
Not documented

Known Gotchas

  • Memory edit tools (core_memory_append, archival_memory_insert) are called autonomously by the LLM — agents can accumulate stale or contradictory memories without explicit cleanup logic
  • Archival memory search is semantic (vector similarity) — exact-match retrieval is unreliable; agents should not rely on it for structured key-value lookups
  • Self-hosted server requires PostgreSQL with pgvector extension for production — SQLite default is not suitable for concurrent multi-agent workloads
  • Agent persona and human persona strings have character limits — truncation without warning can cause subtle context degradation
  • Streaming responses are supported but memory writes happen post-stream — tools that depend on updated memory immediately after a streamed response may read stale state

Alternatives

Full Evaluation Report

Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for Letta (MemGPT).

$99

Scores are editorial opinions as of 2026-03-06.

5173
Packages Evaluated
26151
Need Evaluation
173
Need Re-evaluation
Community Powered