GrowthBook

Provides open-source feature flags and A/B testing with warehouse-native statistics so agents can manage experiments that compute significance directly from data warehouse query results rather than a proprietary event pipeline.

Evaluated Mar 06, 2026 (0d ago) vcurrent
Homepage ↗ Repo ↗ Developer Tools feature-flags open-source a-b-testing warehouse-native statistical-significance self-hosted
⚙ Agent Friendliness
58
/ 100
Can an agent use this?
🔒 Security
79
/ 100
Is it safe for agents?
⚡ Reliability
79
/ 100
Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality
--
Documentation
82
Error Messages
79
Auth Simplicity
75
Rate Limits
73

🔒 Security

TLS Enforcement
100
Auth Strength
75
Scope Granularity
65
Dep. Hygiene
82
Secret Handling
75

Open-source codebase is auditable. No token scoping is a significant security gap. Self-hosted deployments control their own data and can add auth layers. SaaS SOC2 certified.

⚡ Reliability

Uptime/SLA
80
Version Stability
80
Breaking Changes
78
Error Recovery
78
AF Security Reliability

Best When

An agent needs to run statistically rigorous A/B tests computed against an existing data warehouse (BigQuery, Snowflake, Redshift) without sending events to a proprietary vendor pipeline.

Avoid When

The team has no data warehouse and needs a fully managed real-time metrics pipeline with instant results, or requires enterprise SLA guarantees and dedicated support.

Use Cases

  • Create a feature flag via REST API linked to a data source metric to begin a warehouse-native A/B test without a separate analytics pipeline
  • Retrieve experiment results including statistical significance and confidence intervals to automate go/no-go decisions on a feature rollout
  • List all running experiments and their metric assignments to generate a weekly experimentation portfolio review
  • Update feature flag targeting rules to adjust rollout percentage when an agent detects a metric crossing a threshold
  • Sync feature flag definitions from GrowthBook API into an internal catalog service to maintain a single source of truth for flag inventory

Not For

  • Real-time error monitoring and crash reporting (use Sentry or Bugsnag instead)
  • Infrastructure and host-level metrics alerting (use Datadog or Prometheus)
  • Session replay and user behavior analytics (use LogRocket or FullStory)

Interface

REST API
Yes
GraphQL
No
gRPC
No
MCP Server
No
SDK
Yes
Webhooks
Yes

Authentication

Methods: api_key bearer_token
OAuth: No Scopes: No

Secret API keys per organization. Self-hosted instances can configure any auth method. No granular token scopes — keys are org-level admin. No OAuth2.

Pricing

Model: freemium
Free tier: Yes
Requires CC: No

Open-source self-hosted is fully free forever. SaaS pricing is per seat. Warehouse-native stats work on all tiers.

Agent Metadata

Pagination
cursor
Idempotent
Partial
Retry Guidance
Not documented

Known Gotchas

  • Statistical results are computed asynchronously by querying your data warehouse — the experiments/{id}/results endpoint returns stale data until a manual or scheduled analysis run completes, with no staleness indicator in the response
  • Self-hosted and SaaS API versions may diverge; an agent built against the SaaS API may encounter missing endpoints or different response shapes on a self-hosted instance running an older version
  • Feature flags and experiments are separate top-level resources linked by ID — creating an experiment does not automatically create a feature flag, and vice versa; agents must create both and link them explicitly
  • No granular token scopes — every API key has full org-admin access, so leaking a GrowthBook API key is equivalent to leaking org owner credentials
  • Warehouse data source credentials are stored in GrowthBook and used server-side for analysis queries — agents that rotate warehouse credentials must update GrowthBook data source config via API or analysis will silently fail

Alternatives

Full Evaluation Report

Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for GrowthBook.

$99

Scores are editorial opinions as of 2026-03-06.

5215
Packages Evaluated
26151
Need Evaluation
173
Need Re-evaluation
Community Powered