SlowAPI

Rate limiting extension for FastAPI and Starlette — port of Flask-Limiter using the limits library. SlowAPI provides @limiter.limit() decorator for endpoint-level rate limiting with storage backends (in-memory, Redis, Memcached). Limits expressed as strings: '100/day', '10/minute', '1/second'. Supports key functions for per-user or per-IP limiting (default: client IP). Integrates with FastAPI's dependency injection and exception handlers. Agent API limiting example: @app.get('/agent/run') @limiter.limit('10/minute') for per-IP agent API rate limiting.

Evaluated Mar 06, 2026 (0d ago) v0.1.x

Homepage ↗ Repo ↗ Developer Tools python fastapi starlette rate-limiting throttling api middleware redis

⚙ Agent Friendliness

/ 100

Can an agent use this?

🔒 Security

/ 100

Is it safe for agents?

⚡ Reliability

/ 100

Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality

Documentation

Error Messages

Auth Simplicity

Rate Limits

🔒 Security

TLS Enforcement

Auth Strength

Scope Granularity

Dep. Hygiene

Secret Handling

Rate limiting is a security control — prevents agent API DoS. IP-based limiting can be bypassed by IP rotation; combine with user-based limiting via JWT key_func. Redis connection should use TLS and auth for production agent rate limit storage.

⚡ Reliability

Uptime/SLA

Version Stability

Breaking Changes

Error Recovery

Best When

You have a FastAPI agent service that needs quick endpoint-level rate limiting with minimal setup — SlowAPI adds rate limiting in 5 lines of code with Redis backend for distributed limiting.

Avoid When

You're on AWS/GCP API Gateway (use built-in throttling), your rate limits require complex business logic, or you need rate limiting at the infrastructure level.

Use Cases

• Rate limit agent API endpoints per IP — @limiter.limit('10/minute') on /agent/run FastAPI endpoint prevents abuse of compute-intensive agent processing
• Per-user agent quota enforcement — custom key_func that extracts user_id from JWT token enables per-user agent API rate limiting via Redis-backed SlowAPI
• Tiered agent rate limits — different limits per endpoint: /agent/quick at '60/minute', /agent/deep-research at '5/hour' for resource-appropriate agent API rate limiting
• Distributed agent rate limiting with Redis — SlowAPI + Redis storage backend shares rate limit counters across all agent API instances for consistent cross-instance limiting
• Agent API burst protection — '100/day;10/hour;2/minute' compound limit notation prevents both burst and sustained agent API abuse

Not For

• Non-FastAPI/Starlette apps — SlowAPI is specifically for FastAPI/Starlette; use Flask-Limiter for Flask agent APIs, or API Gateway rate limiting for AWS/GCP-hosted agent services
• API Gateway-level rate limiting — cloud-hosted agent APIs should use API Gateway built-in rate limiting for better security and performance; SlowAPI is for application-level limits in self-hosted scenarios
• Complex rate limit logic — SlowAPI's string-based limit expressions are simple; for dynamic agent quotas from database or business rules, implement custom rate limiting middleware

Interface

REST API

GraphQL

gRPC

MCP Server

SDK

Yes

Webhooks

Authentication

Methods: none

OAuth: No Scopes: No

Rate limiting library — no auth. Key functions for rate limit identity can use JWT claims, API keys, or IP address from request.

Pricing

Model: open_source

Free tier: Yes

Requires CC: No

SlowAPI is MIT licensed. Free for all use.

Agent Metadata

Pagination

none

Idempotent

Full

Retry Guidance

Not documented

Known Gotchas

⚠ Exception handler must be manually registered — SlowAPI does not auto-register the 429 exception handler; app.add_exception_handler(RateLimitExceeded, _rate_limit_exceeded_handler) required; without it, RateLimitExceeded propagates as 500 Internal Server Error for agent callers
⚠ In-memory storage is per-process — default in-memory storage doesn't work correctly with multiple agent service workers (Uvicorn workers or Gunicorn); each worker has separate counter; must use Redis backend for multi-worker or multi-instance agent deployments
⚠ IP extraction behind proxy requires X-Forwarded-For — default key_func uses client IP from request.client.host; behind load balancer or reverse proxy, this is proxy IP; configure request.headers.get('X-Forwarded-For') in key_func for real client IP-based agent rate limiting
⚠ Decorator order matters with FastAPI — @limiter.limit() must be applied before @app.get()/@router.get(); wrong order causes decorator to not attach to FastAPI route; always put limiter decorator closest to function definition
⚠ SlowAPI maintenance is limited — SlowAPI is minimally maintained; consider fastapi-limiter as alternative for better async support; check GitHub for open issues before adopting for production agent APIs
⚠ Compound limits apply all thresholds — '10/minute;100/hour' applies BOTH limits independently; agent caller hitting 10/minute limit gets 429 even if hourly quota is not exceeded; explain compound limits in agent API documentation

Alternatives

fastapi-limiter-api starlette-rate-limit-api nginx-rate-limit-api

Full Evaluation Report

Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for SlowAPI.

$99

API endpoint ↗ Agent guide ↗ Report inaccuracy

Scores are editorial opinions as of 2026-03-06.