mcp-server

An MCP (Model Context Protocol) server that runs user-provided code inside isolated, resource-limited sandbox environments using runc/OCI containers. It supports multiple programming languages via language-specific handlers and offers persistence for sandbox state via SQLite. It can expose MCP over stdio or HTTP/WebSocket and includes configurable security policies and audit logging.

Evaluated Apr 04, 2026 (16d ago)
Repo ↗ Infrastructure mcp sandbox containerization code-execution runc oci security multi-language
⚙ Agent Friendliness
42
/ 100
Can an agent use this?
🔒 Security
35
/ 100
Is it safe for agents?
⚡ Reliability
21
/ 100
Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality
70
Documentation
55
Error Messages
0
Auth Simplicity
20
Rate Limits
10

🔒 Security

TLS Enforcement
60
Auth Strength
20
Scope Granularity
20
Dep. Hygiene
40
Secret Handling
40

Security is primarily based on container/process isolation via runc/OCI, resource limits, configurable security policies, and audit logging (as claimed in the README). However, the provided content does not document TLS enforcement details, authentication/authorization, scope granularity, or how secrets are handled, so scores are conservative. For sandboxing, additional crucial controls (network egress restrictions, filesystem mounts, seccomp/capabilities, image patching cadence) are not described in the supplied text.

⚡ Reliability

Uptime/SLA
0
Version Stability
30
Breaking Changes
20
Error Recovery
35
AF Security Reliability

Best When

When you need an agent-accessible, multi-language execution sandbox that isolates processes via OCI containers and you can deploy on Linux with container support.

Avoid When

When you cannot enforce/verify network, filesystem, and container escape protections beyond what is documented, or when you need first-class enterprise auth, documented rate limiting, and strong API error/retry semantics.

Use Cases

  • Securely executing untrusted code snippets or scripts for analysis
  • AI-assisted coding workflows where the model needs to run/verify code safely
  • Polyglot “run code” tooling for LLM agents (Python/JS/Go/Rust/Java/C++/C#/Shell, etc.)
  • Testing or CI-like execution of multi-language programs in a controlled environment

Not For

  • Running code that must access the public internet or sensitive internal systems without strict network/file isolation guarantees
  • High-trust environments where container sandboxing is unnecessary
  • Use cases requiring strong enterprise auth/identity integrations (not evidenced here)
  • Systems needing documented idempotent APIs/operations guarantees (not evidenced here)

Interface

REST API
Yes
GraphQL
No
gRPC
No
MCP Server
Yes
SDK
No
Webhooks
No

Authentication

OAuth: No Scopes: No

No authentication mechanism (API keys/OAuth/etc.) is described in the provided README content. For an HTTP/WebSocket MCP mode, authentication/authorization would be expected but is not documented here.

Pricing

Free tier: No
Requires CC: No

Agent Metadata

Pagination
none
Idempotent
False
Retry Guidance
Not documented

Known Gotchas

  • Sandboxed execution commonly has limits (CPU/memory/disk) that may cause failures; agents should be prepared to handle execution timeouts/OOM/disk-full conditions.
  • When using container-based runners, agents may need to explicitly provide all dependencies/source in the request; “import/package availability” may differ from the agent’s environment.
  • If MCP is exposed over HTTP/WebSocket, ensure transport/auth/network isolation is configured appropriately; the README does not document auth details.
  • State persistence via SQLite may require cleanup/retention policies to avoid unintended cross-run state or storage growth.

Alternatives

Full Evaluation Report

Comprehensive deep-dive: security analysis, reliability audit, agent experience review, cost modeling, competitive positioning, and improvement roadmap for mcp-server.

AI-powered analysis · PDF + markdown · Delivered within 30 minutes

$99

Package Brief

Quick verdict, integration guide, cost projections, gotchas with workarounds, and alternatives comparison.

Delivered within 10 minutes

$3

Score Monitoring

Get alerted when this package's AF, security, or reliability scores change significantly. Stay ahead of regressions.

Continuous monitoring

$3/mo

Scores are editorial opinions as of 2026-04-04.

8642
Packages Evaluated
17761
Need Evaluation
586
Need Re-evaluation
Community Powered