Arize Phoenix

Open-source LLM observability platform built on OpenTelemetry and the OpenInference spec. It captures traces and spans, runs evaluations, visualizes embeddings, and performs cluster analysis to identify LLM failure modes.

Evaluated Mar 07, 2026 (version: current)

Category: AI & Machine Learning

Tags: llm observability tracing opentelemetry openinference evals embeddings open-source self-hosted

⚙ Agent Friendliness

/ 100

Can an agent use this?

🔒 Security

/ 100

Is it safe for agents?

⚡ Reliability

/ 100

Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality

Documentation

Error Messages

Auth Simplicity

Rate Limits

🔒 Security

TLS Enforcement

100

Auth Strength

Scope Granularity

Dep. Hygiene

Secret Handling

Self-hosted deployment provides full data residency. Authentication is disabled by default, so production deployments should place the UI behind a reverse proxy that enforces authentication.

⚡ Reliability

Uptime/SLA

Version Stability

Breaking Changes

Error Recovery

Best When

You want a self-hosted, open-source LLM tracing and evaluation environment with embedding visualization and OpenTelemetry compatibility for deep failure analysis.

Avoid When

You need a managed SaaS platform, enterprise support SLAs, or long-term trace retention without managing your own storage infrastructure.

Use Cases

• Instrument LLM frameworks supported by the OpenInference instrumentation libraries and visualize full trace waterfalls in the Phoenix UI
• Run embedding visualizations to cluster agent inputs and identify systematic failure patterns at scale
• Evaluate trace quality with built-in Phoenix evals (hallucination, relevance, toxicity) using LLM-as-judge
• Self-host Phoenix on-premise to keep all LLM traces within your security perimeter with no external data egress
• Export OpenTelemetry spans from Phoenix to Arize cloud for longer-term storage and enterprise dashboards
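
The instrumentation use case above can be sketched roughly as follows. This is a minimal sketch, not a definitive setup: it assumes the arize-phoenix-otel and openinference-instrumentation-openai packages are installed, that a Phoenix server is reachable on its default port 6006, and that OpenAI is the framework being traced. The helper names otlp_http_traces_endpoint and setup_tracing, and the project name "my-agent", are hypothetical.

```python
from urllib.parse import urljoin


def otlp_http_traces_endpoint(base_url: str) -> str:
    """Build the OTLP/HTTP traces URL for a Phoenix server base URL."""
    return urljoin(base_url.rstrip("/") + "/", "v1/traces")


def setup_tracing(base_url: str = "http://localhost:6006",
                  project_name: str = "my-agent"):
    """Register a tracer provider and instrument the OpenAI SDK.

    Imports are deferred so this module still loads when the optional
    tracing packages are not installed.
    """
    from phoenix.otel import register
    from openinference.instrumentation.openai import OpenAIInstrumentor

    tracer_provider = register(
        project_name=project_name,  # hypothetical project name
        endpoint=otlp_http_traces_endpoint(base_url),
    )
    # Apply the instrumentor before any OpenAI client is created,
    # so that later calls are captured as spans.
    OpenAIInstrumentor().instrument(tracer_provider=tracer_provider)
    return tracer_provider
```

Calling setup_tracing() once at process start, before the application imports or constructs its LLM clients, is the ordering this sketch assumes.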

Not For

• Teams that need a fully managed cloud service with uptime SLAs and zero infrastructure ownership
• Real-time business alerting and PagerDuty integrations based on LLM quality metrics
• Non-Python primary stacks where OpenInference instrumentation libraries are unavailable

Interface

REST API

Yes

GraphQL

gRPC

Yes

MCP Server

SDK

Yes

Webhooks

Authentication

Methods: api_key

OAuth: No

Scopes: No

Self-hosted Phoenix requires no auth by default; Arize cloud uses API key authentication.

Pricing

Model: open_source

Free tier: Yes

Requires CC: No

Phoenix is Apache 2.0 open source; Arize cloud is the commercial managed offering.

Agent Metadata

Pagination: cursor

Idempotent: Full

Retry Guidance: Not documented
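
Because pagination is cursor-based and retry guidance is not documented, an agent has to bring its own retry policy. The following is a minimal sketch under stated assumptions: fetch_page is a hypothetical callable that wraps one API request and returns a dict with "data" and "next_cursor" keys (an assumed response shape, not a confirmed one), and transient failures surface as ConnectionError. The backoff values are illustrative only.

```python
import time
from typing import Any, Callable, Dict, Iterator, Optional

Page = Dict[str, Any]


def iter_items(
    fetch_page: Callable[[Optional[str]], Page],
    max_retries: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[Any]:
    """Yield every item across pages, retrying transient failures."""
    cursor: Optional[str] = None
    while True:
        for attempt in range(max_retries + 1):
            try:
                page = fetch_page(cursor)
                break
            except ConnectionError:
                if attempt == max_retries:
                    raise  # retries exhausted: surface the error
                # Exponential backoff: base_delay, 2x, 4x, ...
                sleep(base_delay * 2 ** attempt)
        yield from page["data"]
        cursor = page.get("next_cursor")
        if cursor is None:
            return  # no further pages
```

Retrying the same cursor is safe only because reads are listed as fully idempotent; the sleep parameter is injectable so the loop can be exercised without real delays.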

Known Gotchas

⚠ OpenInference instrumentors must be imported and activated before the frameworks they wrap; import-order bugs cause silent trace gaps
⚠ Self-hosted Phoenix stores data in SQLite by default; high trace volume requires switching to PostgreSQL manually
⚠ Embedding visualizations require running a UMAP projection, which is CPU-intensive and blocks the UI for large datasets
⚠ Eval functions make LLM calls using your configured provider — parallel eval runs can exhaust rate limits unexpectedly
⚠ Phoenix notebook mode and server mode cannot run simultaneously; agents that spawn a local Phoenix server conflict with existing notebook sessions
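
For the eval rate-limit gotcha above, one common mitigation is to cap how many eval calls are in flight at once. The sketch below uses only the standard library and is independent of Phoenix itself: run_bounded is a hypothetical helper, worker stands in for whatever coroutine issues a single LLM-as-judge call, and the default limit of 4 is an arbitrary assumption to tune against your provider's quota.

```python
import asyncio
from typing import Awaitable, Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


async def run_bounded(
    items: Iterable[T],
    worker: Callable[[T], Awaitable[R]],
    max_concurrency: int = 4,
) -> List[R]:
    """Run worker over items with at most max_concurrency calls in flight."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def guarded(item: T) -> R:
        # Each task waits here until a slot is free.
        async with semaphore:
            return await worker(item)

    # gather preserves input order in its result list.
    return await asyncio.gather(*(guarded(item) for item in items))
```

A caller would pass its own coroutine, for example asyncio.run(run_bounded(rows, judge_one_row, max_concurrency=2)), where rows and judge_one_row are placeholders for the caller's data and eval function.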

Alternatives

opik-api trulens-api langsmith-api braintrust-ai-api

Scores are editorial opinions as of 2026-03-07.