youtube-mcp-server

Provides an MCP server over SSE that lets AI agents extract YouTube video metadata (via yt-dlp) and generate audio transcriptions using VAD (Silero VAD) and Whisper, with caching and configurable performance settings.

Evaluated Mar 30, 2026 (67d ago)

Repo ↗ Ai Ml ai-ml mcp transcription youtube speech-recognition whisper vad sse python

⚙ Agent Friendliness

/ 100

Can an agent use this?

🔒 Security

/ 100

Is it safe for agents?

⚡ Reliability

/ 100

Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality

Documentation

Error Messages

Auth Simplicity

Rate Limits

🔒 Security

TLS Enforcement

Auth Strength

Scope Granularity

Dep. Hygiene

Secret Handling

Local-only SSE is implied, but TLS is not documented. No auth or scopes are described, so access control likely depends entirely on network isolation. This tool downloads/uses external resources (YouTube via yt-dlp; model downloads via dependencies), which may expand your threat surface. No details are provided on secret handling, logging, or safe filesystem permissions for caching/transcripts.

⚡ Reliability

Uptime/SLA

Version Stability

Breaking Changes

Error Recovery

Best When

You want a locally run MCP tool for YouTube transcription/metadata that can be called programmatically by an agent, and you can install ffmpeg plus the required Python dependencies/models.

Avoid When

You need strict production-grade guarantees (documented SLAs, explicit retry/idempotency semantics, and well-specified error contracts) or you cannot run external binaries/models (ffmpeg, Whisper weights, Silero models).

Use Cases

• Agent-assisted research: fetch title/description/duration/tags/thumbnails for YouTube videos
• Meeting/interview analysis: transcribe long-form videos into timestamped segments
• Multilingual transcription and translation for content pipelines
• Automated content indexing: store transcripts + metadata in a local cache for retrieval

Not For

• Use where you need OAuth-protected access control or per-user authorization
• Highly sensitive environments that prohibit third-party external network calls to YouTube or model downloads
• Applications requiring strong, documented rate-limiting guarantees and usage quotas

Interface

REST API

GraphQL

gRPC

MCP Server

Yes ↗

SDK

Webhooks

Authentication

Methods: None documented for local SSE MCP server

OAuth: No Scopes: No

README shows local startup and MCP client configuration with only the URL; no authentication/authorization mechanism is described.

Pricing

Free tier: No

Requires CC: No

No SaaS pricing mentioned; the workload depends on the selected Whisper model and local compute resources.

Agent Metadata

Pagination

none

Idempotent

False

Retry Guidance

Not documented

Known Gotchas

⚠ Long-running operations: transcription can be slow for lengthy videos; agents may need timeouts/async handling.
⚠ External dependencies: requires ffmpeg plus model downloads (Whisper weights, Silero VAD).
⚠ Concurrency tuning: MAX_WORKERS may increase CPU/RAM usage and could trigger failures under constrained environments.
⚠ Rate limits: no rate-limit documentation is provided; YouTube/yt-dlp access may be subject to external throttling.
⚠ Caching behavior: transcript cache location is configurable but cache invalidation/versioning strategy is not described.

Alternatives

Run a REST service you control around yt-dlp + Whisper/pyannote/other VAD (self-hosted) Use existing transcription services/APIs and add a thin MCP wrapper Use generic YouTube transcript APIs (where available) instead of audio transcription

Full Evaluation Report

Comprehensive deep-dive: security analysis, reliability audit, agent experience review, cost modeling, competitive positioning, and improvement roadmap for youtube-mcp-server.

AI-powered analysis · PDF + markdown · Delivered within 30 minutes

$99

Package Brief

Quick verdict, integration guide, cost projections, gotchas with workarounds, and alternatives comparison.

Delivered within 10 minutes

Score Monitoring

Get alerted when this package's AF, security, or reliability scores change significantly. Stay ahead of regressions.

Continuous monitoring

$3/mo

API endpoint ↗ Agent guide ↗ Report inaccuracy

Scores are editorial opinions as of 2026-03-30.