mcp-server

An MCP (Model Context Protocol) server that runs user-provided code inside isolated, resource-limited sandbox environments using runc/OCI containers. It supports multiple programming languages via language-specific handlers and offers persistence for sandbox state via SQLite. It can expose MCP over stdio or HTTP/WebSocket and includes configurable security policies and audit logging.

Evaluated Apr 04, 2026 (61d ago)

Repo ↗ Infrastructure mcp sandbox containerization code-execution runc oci security multi-language

⚙ Agent Friendliness

/ 100

Can an agent use this?

🔒 Security

/ 100

Is it safe for agents?

⚡ Reliability

/ 100

Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality

Documentation

Error Messages

Auth Simplicity

Rate Limits

🔒 Security

TLS Enforcement

Auth Strength

Scope Granularity

Dep. Hygiene

Secret Handling

Security is primarily based on container/process isolation via runc/OCI, resource limits, configurable security policies, and audit logging (as claimed in the README). However, the provided content does not document TLS enforcement details, authentication/authorization, scope granularity, or how secrets are handled, so scores are conservative. For sandboxing, additional crucial controls (network egress restrictions, filesystem mounts, seccomp/capabilities, image patching cadence) are not described in the supplied text.

⚡ Reliability

Uptime/SLA

Version Stability

Breaking Changes

Error Recovery

Best When

When you need an agent-accessible, multi-language execution sandbox that isolates processes via OCI containers and you can deploy on Linux with container support.

Avoid When

When you cannot enforce/verify network, filesystem, and container escape protections beyond what is documented, or when you need first-class enterprise auth, documented rate limiting, and strong API error/retry semantics.

Use Cases

• Securely executing untrusted code snippets or scripts for analysis
• AI-assisted coding workflows where the model needs to run/verify code safely
• Polyglot “run code” tooling for LLM agents (Python/JS/Go/Rust/Java/C++/C#/Shell, etc.)
• Testing or CI-like execution of multi-language programs in a controlled environment

Not For

• Running code that must access the public internet or sensitive internal systems without strict network/file isolation guarantees
• High-trust environments where container sandboxing is unnecessary
• Use cases requiring strong enterprise auth/identity integrations (not evidenced here)
• Systems needing documented idempotent APIs/operations guarantees (not evidenced here)

Interface

REST API

Yes

GraphQL

gRPC

MCP Server

Yes

SDK

Webhooks

Authentication

OAuth: No Scopes: No

No authentication mechanism (API keys/OAuth/etc.) is described in the provided README content. For an HTTP/WebSocket MCP mode, authentication/authorization would be expected but is not documented here.

Pricing

Free tier: No

Requires CC: No

Agent Metadata

Pagination

none

Idempotent

False

Retry Guidance

Not documented

Known Gotchas

⚠ Sandboxed execution commonly has limits (CPU/memory/disk) that may cause failures; agents should be prepared to handle execution timeouts/OOM/disk-full conditions.
⚠ When using container-based runners, agents may need to explicitly provide all dependencies/source in the request; “import/package availability” may differ from the agent’s environment.
⚠ If MCP is exposed over HTTP/WebSocket, ensure transport/auth/network isolation is configured appropriately; the README does not document auth details.
⚠ State persistence via SQLite may require cleanup/retention policies to avoid unintended cross-run state or storage growth.

Alternatives

Other MCP sandbox/runner implementations (various community projects) Dedicated job runners with container isolation (e.g., Kubernetes Jobs with restricted PodSecurity) Serverless code execution platforms with sandboxing Workflow engines that execute code in CI containers (e.g., GitHub Actions runners)

Full Evaluation Report

Comprehensive deep-dive: security analysis, reliability audit, agent experience review, cost modeling, competitive positioning, and improvement roadmap for mcp-server.

AI-powered analysis · PDF + markdown · Delivered within 30 minutes

$99

Package Brief

Quick verdict, integration guide, cost projections, gotchas with workarounds, and alternatives comparison.

Delivered within 10 minutes

Score Monitoring

Get alerted when this package's AF, security, or reliability scores change significantly. Stay ahead of regressions.

Continuous monitoring

$3/mo

API endpoint ↗ Agent guide ↗ Report inaccuracy

Scores are editorial opinions as of 2026-04-04.