Comet ML API

Comet ML is an MLOps platform with a REST API and Python SDK for tracking ML experiments, managing model registries, and running LLM evaluation. Agents can use it to query experiment metrics, retrieve model versions, compare runs, and manage the production model lifecycle.

Evaluated: Mar 06, 2026
Category: AI & Machine Learning. Tags: mlops, experiment-tracking, model-registry, llm-evaluation, prompt-management, ml-observability
⚙ Agent Friendliness: 56/100 (Can an agent use this?)
🔒 Security: 80/100 (Is it safe for agents?)
⚡ Reliability: 78/100 (Does it work consistently?)

Score Breakdown

⚙ Agent Friendliness

MCP Quality: --
Documentation: 80
Error Messages: 76
Auth Simplicity: 80
Rate Limits: 60

🔒 Security

TLS Enforcement: 100
Auth Strength: 78
Scope Granularity: 62
Dep. Hygiene: 78
Secret Handling: 80

API keys are workspace-scoped with no endpoint-level granularity; a leaked key exposes every experiment and artifact in the workspace. Enterprise service accounts mitigate this. TLS is enforced on all cloud endpoints, but self-managed deployments must configure TLS independently.

⚡ Reliability

Uptime/SLA: 78
Version Stability: 80
Breaking Changes: 78
Error Recovery: 76

Best When

An agent needs to query ML experiment history, retrieve model artifacts from a registry, or log structured evaluation results for LLM or classical ML pipelines, and an existing Comet workspace with logged experiments is available.

Avoid When

You need model serving or data processing, or your team has no existing Comet project with logged experiments.

Use Cases

  • Querying experiment runs by project to retrieve metrics, hyperparameters, and system info for model comparison and selection
  • Fetching model registry versions and their associated artifact paths to retrieve the latest champion model for deployment
  • Logging evaluation results from agent-orchestrated LLM or ML evaluation pipelines with structured metrics and confusion matrices
  • Using Comet Opik (LLM evaluation product) to score and trace LLM calls, enabling agent-driven prompt optimization workflows
  • Retrieving experiment asset files (model checkpoints, plots, code snapshots) programmatically for audit or reproducibility workflows
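
As a sketch of the model-selection use case above, once an agent has fetched run metadata it can compare candidates locally. The record shape below (a dict with `experiment_key` and a `metrics` mapping) is illustrative only, not Comet's exact response format:

```python
# Pick the best run from a list of experiment records, where each record
# is a dict like {"experiment_key": ..., "metrics": {"val_accuracy": ...}}.
# This record shape is a stand-in; the real REST/SDK payload differs.

def select_champion(runs, metric="val_accuracy", higher_is_better=True):
    """Return the experiment_key of the run with the best value for `metric`.

    Runs missing the metric are skipped; returns None if no run reports it.
    """
    scored = [
        (run["metrics"][metric], run["experiment_key"])
        for run in runs
        if metric in run.get("metrics", {})
    ]
    if not scored:
        return None
    best = max(scored) if higher_is_better else min(scored)
    return best[1]
```

The comparison itself is plain data wrangling, so it works the same whether the runs came from the REST API or the SDK.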

Not For

  • Model inference serving — Comet tracks and stores models but provides no inference endpoint hosting
  • Data transformation or ETL — Comet is a metadata and artifact store, not a data processing platform
  • Teams not running iterative ML experiments — the platform's value depends on having repeated training runs to compare

Interface

REST API: Yes
GraphQL: No
gRPC: No
MCP Server: No
SDK: Yes
Webhooks: No

Authentication

Methods: api_key
OAuth: No
Scopes: No

Authentication uses a personal API key passed as a Bearer token in the Authorization header, or set via COMET_API_KEY environment variable. The Python SDK picks up this environment variable automatically. API keys are workspace-scoped with no fine-grained endpoint permissions. Service accounts are available on Enterprise plans.
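
A minimal sketch of building the authenticated header described above, with the same COMET_API_KEY fallback the SDK uses (the helper name is our own, not part of any Comet library):

```python
import os

def comet_headers(api_key=None):
    """Build the Authorization header for Comet REST calls.

    Falls back to the COMET_API_KEY environment variable, mirroring
    the Python SDK's behavior. Raises if no key is available, so a
    misconfigured agent fails fast instead of sending anonymous calls.
    """
    key = api_key or os.environ.get("COMET_API_KEY")
    if not key:
        raise RuntimeError("No Comet API key: pass one or set COMET_API_KEY")
    return {"Authorization": f"Bearer {key}"}
```

Because keys are workspace-scoped, treat the returned header as equivalent to full workspace access when deciding where to log or cache it.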

Pricing

Model: freemium
Free tier: Yes
Requires CC: No

The free tier covers most individual agent development use cases. Paid tiers add collaboration features, higher artifact storage, and enterprise security controls. LLM evaluation (Opik) has separate pricing.

Agent Metadata

Pagination: offset
Idempotent: Partial
Retry Guidance: Not documented
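
Given offset pagination and no documented retry guidance, an agent has to supply its own draining loop and backoff policy. A conservative sketch (the `fetch_page` callable and the backoff schedule are our own assumptions, not Comet defaults):

```python
import time

def fetch_all(fetch_page, page_size=100, max_retries=3):
    """Drain an offset-paginated endpoint.

    `fetch_page(offset, limit)` should return a list of items (empty when
    exhausted). The retry policy is a conservative default of our own,
    since the API documents none: exponential backoff on any exception.
    """
    items, offset = [], 0
    while True:
        for attempt in range(max_retries):
            try:
                page = fetch_page(offset, page_size)
                break
            except Exception:
                if attempt == max_retries - 1:
                    raise
                time.sleep(2 ** attempt)  # back off 1s, 2s, ... then retry
        if not page:
            return items
        items.extend(page)
        offset += len(page)
```

Since idempotency is only partial, keep retries on the read side; retried writes (e.g. metric logging) should be deduplicated by the caller.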

Known Gotchas

  • Experiment keys (experiment_key) are UUIDs generated at experiment creation — agents must persist these keys if they need to resume or reference experiments later, as there is no lookup by name without listing all experiments.
  • The REST API for querying experiment metrics uses a separate endpoint structure from the SDK — REST endpoints require projectName and workspaceName as query parameters that the SDK handles automatically but raw REST callers must provide explicitly.
  • Artifact downloads require generating a temporary presigned URL via the API before fetching the file — a two-step process that agents must implement, not a direct download link.
  • Experiment status does not automatically transition to 'stopped' if a process crashes — agents that crash mid-experiment may leave ghost experiments in 'running' state, polluting experiment lists and requiring manual cleanup.
  • The Opik LLM evaluation product uses a different API base URL and authentication flow from the core Comet ML experiment tracking API — agents integrating both must manage two separate client configurations.
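
The first gotcha above (UUID experiment keys with no lookup-by-name) means agents should persist keys at creation time. One way to do that is a small local name-to-key registry; the file path and JSON format here are our own convention, not anything Comet provides:

```python
import json
from pathlib import Path

# Local name -> experiment_key registry. Comet experiment keys are UUIDs
# with no lookup-by-name, so record them when the experiment is created.
REGISTRY = Path("comet_experiment_keys.json")

def remember_experiment(name, experiment_key, registry=REGISTRY):
    """Record the UUID key for a human-readable experiment name."""
    mapping = json.loads(registry.read_text()) if registry.exists() else {}
    mapping[name] = experiment_key
    registry.write_text(json.dumps(mapping, indent=2))

def recall_experiment(name, registry=REGISTRY):
    """Look up a previously stored key; None if unknown."""
    if not registry.exists():
        return None
    return json.loads(registry.read_text()).get(name)
```

Calling `remember_experiment("nightly-eval", key)` right after experiment creation lets a later agent run resume or reference the experiment without listing the whole workspace.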

Scores are editorial opinions as of 2026-03-06.
