Neptune.ai API

Neptune.ai is an ML metadata store and experiment-tracking platform. Its Python SDK and REST API let agents log, query, and compare experiment metrics, parameters, and artifacts across runs, and manage a model registry with version-promotion workflows.

Evaluated Mar 06, 2026
Category: AI & Machine Learning. Tags: mlops, experiment-tracking, model-registry, metadata-store, ml-observability, run-comparison
⚙ Agent Friendliness
57
/ 100
Can an agent use this?
🔒 Security
80
/ 100
Is it safe for agents?
⚡ Reliability
78
/ 100
Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality
--
Documentation
82
Error Messages
78
Auth Simplicity
80
Rate Limits
62

🔒 Security

TLS Enforcement
100
Auth Strength
78
Scope Granularity
62
Dep. Hygiene
80
Secret Handling
80

API tokens are account-scoped with no endpoint-level granularity, so a leaked token exposes every project the account can access. Neptune enforces TLS on all cloud endpoints, and on-premises deployment allows tighter network isolation. Service accounts on paid plans reduce exposure of individual user credentials.

⚡ Reliability

Uptime/SLA
78
Version Stability
80
Breaking Changes
78
Error Recovery
78

Best When

An agent needs flexible, structured metadata storage for ML runs with powerful cross-run querying and comparison, particularly in research environments with heterogeneous experiment structures.

Avoid When

You need real-time streaming metrics, model serving, or your team has no existing Neptune workspace with logged runs.

Use Cases

  • Querying experiment runs by project with custom filters on metrics and parameters to identify best-performing model configurations
  • Fetching model registry versions and their metadata (training dataset, metrics, artifacts) for model selection and deployment decisions
  • Logging structured evaluation results from agent-orchestrated ML pipelines using Neptune's flexible namespace-based metadata model
  • Comparing multiple runs side-by-side by fetching their metric histories and parameter sets for automated regression detection
  • Retrieving experiment artifacts (model files, dataset samples, evaluation plots) programmatically for audit and reproducibility pipelines
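The run-comparison use case above can be sketched in plain Python once metric values have been fetched from Neptune; the metric names, values, and tolerance below are illustrative assumptions, not part of Neptune's API:

```python
# Hedged sketch: flag a regression between a baseline run and a candidate run.
# The dicts stand in for metric values fetched from Neptune; keys are illustrative.

def detect_regression(baseline: dict, candidate: dict,
                      metric: str = "eval/accuracy",
                      tolerance: float = 0.01) -> bool:
    """Return True if the candidate's metric dropped more than `tolerance`."""
    return candidate[metric] < baseline[metric] - tolerance

baseline = {"eval/accuracy": 0.912}
candidate = {"eval/accuracy": 0.884}
print(detect_regression(baseline, candidate))  # True: drop of 0.028 > 0.01
```

In an agent pipeline, the two dicts would come from each run's fetched metric history rather than literals.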

Not For

  • Model inference serving — Neptune stores and tracks models but does not provide inference endpoints
  • Data pipeline orchestration — Neptune is a metadata layer, not a workflow scheduler or orchestrator
  • Teams not running iterative ML workflows — the metadata store model provides no benefit without repeated tracked experiments

Interface

REST API
Yes
GraphQL
No
gRPC
No
MCP Server
No
SDK
Yes
Webhooks
No

Authentication

Methods: api_key
OAuth: No
Scopes: No

Authentication uses a personal API token, passed via the NEPTUNE_API_TOKEN environment variable or explicitly in the neptune.init_run() call. The SDK handles auth transparently when the environment variable is set. Tokens are account-scoped with no fine-grained permissions. Service accounts are available on paid plans for CI/CD and agent use.

Pricing

Model: freemium
Free tier: Yes
Requires CC: No

The free tier is sufficient for individual agent integration development. Paid tiers unlock team collaboration features and higher storage quotas. On-premises deployment available for data residency requirements.

Agent Metadata

Pagination
cursor
Idempotent
Partial
Retry Guidance
Not documented
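Because retry guidance is undocumented, agents calling the API may want a conservative client-side backoff of their own. The wrapper below is a generic sketch with assumed defaults, not Neptune-provided guidance:

```python
import random
import time

def with_backoff(fn, *, attempts: int = 4, base_delay: float = 0.5):
    """Call fn(), retrying on any exception with jittered exponential backoff.

    Defaults (4 attempts, 0.5 s base) are assumptions, not documented limits.
    """
    for attempt in range(attempts):
        try:
            return fn()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            time.sleep(base_delay * (2 ** attempt) * random.uniform(0.5, 1.5))
```

Any fetch or logging call (e.g. a run-table query) can be passed in as a zero-argument closure.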

Known Gotchas

  • Neptune's namespace model is extremely flexible (any string path becomes a field) — this is powerful but requires agents to agree on and enforce a consistent logging schema, otherwise querying across runs becomes unreliable due to field naming variations.
  • The SDK logs asynchronously in a background thread by default — run.stop() or using the context manager is required to flush all pending metadata before the agent process exits, or logged data will be lost.
  • Fetching run data requires using the read_only=True mode of neptune.init_run() or the neptune.init_project() + project.fetch_runs_table() pattern — there is no simple GET endpoint for run metadata without the SDK.
  • Large artifact downloads go through Neptune's artifact storage and can be slow for multi-GB model checkpoints — agents should implement timeout handling and progress tracking for artifact fetch operations.
  • Project and workspace names are case-sensitive in the API — a common gotcha is mismatching case between the Neptune UI display name and the API identifier, causing 404 errors that look like authentication failures.
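The first gotcha above, enforcing a consistent logging schema over Neptune's free-form namespaces, can be handled with a pre-log check. The allowed prefixes below are purely illustrative; any team would substitute its own agreed schema:

```python
# Hedged sketch: reject metadata field paths outside an agreed namespace
# schema, so cross-run queries stay reliable. Prefixes are illustrative.
ALLOWED_PREFIXES = ("params/", "metrics/", "eval/", "artifacts/")

def validate_paths(metadata: dict) -> list[str]:
    """Return the field paths that violate the agreed schema."""
    return [path for path in metadata if not path.startswith(ALLOWED_PREFIXES)]

bad = validate_paths({"params/lr": 0.001, "learning_rate": 0.001})
print(bad)  # ['learning_rate'] — would fragment the schema across runs
```

Running this check before assigning fields to a run keeps drift out of the metadata store instead of discovering it at query time.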

Scores are editorial opinions as of 2026-03-06.