arXiv API
Free open-access API for querying and retrieving academic preprints across physics, math, CS, and other sciences — no auth required for reading.
Score Breakdown
⚙ Agent Friendliness
🔒 Security
No auth surface — read-only public data. TLS enforced on API endpoint.
⚡ Reliability
Best When
Building research-oriented agents that need free, no-auth access to the latest scientific preprints in CS, AI, physics, math, and related fields.
Avoid When
You need peer-reviewed publications with citation data, or papers from life sciences (use PubMed) or broader coverage (use Semantic Scholar).
Use Cases
- • Building research agents that automatically find relevant papers on a topic via full-text search
- • Monitoring new papers in a field using arXiv's daily submission feed for automated research digests
- • Extracting paper metadata (authors, abstracts, citations) for academic knowledge graphs
- • Powering AI research assistants that can retrieve and summarize recent scientific literature
- • Automated literature review pipelines for AI agents tracking developments in specific research areas
Not For
- • Full paper PDF download for bulk processing — use arXiv bulk data access instead of API
- • Citation graph or reference data — use Semantic Scholar or OpenAlex for citation networks
- • Non-preprint published journals — arXiv only has submitted preprints
Interface
Authentication
No authentication required. Completely open API with rate limiting only. User-Agent header encouraged for identification.
Pricing
Entirely free. Rate limit requested: 1 request per 3 seconds. Exceed limits and you may be temporarily blocked.
Agent Metadata
Known Gotchas
- ⚠ Response is Atom XML not JSON — parse with feedparser or xml.etree; no JSON option exists
- ⚠ max_results default is 10, maximum is 2000 per request — for comprehensive search need multiple paginated requests
- ⚠ Paper IDs change format over time (e.g., 2401.00001 vs older format) — use full arXiv ID with version number for stable links
- ⚠ Search quality is limited — full text search in title/abstract only; no semantic search; use Semantic Scholar for better retrieval
- ⚠ Courtesy rate limit of 1 req/3 seconds is unenforced but violations can result in IP blocks; add sleep() in agent loops
Alternatives
Full Evaluation Report
Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for arXiv API.
Scores are editorial opinions as of 2026-03-06.