Airbyte CDK

Python SDK for building custom Airbyte connectors (sources and destinations). Provides base classes (AbstractSource, HttpStream, IncrementalMixin) that handle HTTP pagination, state management, schema inference, and retry logic. Custom connectors built with CDK run in Docker containers and integrate with Airbyte's orchestration layer. Used to connect custom APIs or proprietary data sources to Airbyte's connector catalog.

Evaluated Mar 07, 2026 (0d ago) v0.80+
Homepage ↗ Repo ↗ Other airbyte connector etl python data-integration source destination open-source
⚙ Agent Friendliness
58
/ 100
Can an agent use this?
🔒 Security
84
/ 100
Is it safe for agents?
⚡ Reliability
75
/ 100
Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality
--
Documentation
78
Error Messages
75
Auth Simplicity
80
Rate Limits
80

🔒 Security

TLS Enforcement
90
Auth Strength
82
Scope Granularity
80
Dep. Hygiene
80
Secret Handling
85

Credentials handled via Airbyte's secret management. Connectors run in isolated Docker containers. Source credentials never leave the connector's execution context.

⚡ Reliability

Uptime/SLA
80
Version Stability
72
Breaking Changes
68
Error Recovery
80
AF Security Reliability

Best When

You need to build a reusable connector for a data source that isn't in Airbyte's catalog and want to use Airbyte's orchestration, monitoring, and scheduling infrastructure.

Avoid When

You don't use Airbyte, need one-time data extracts, or need real-time streaming — CDK connectors are batch-sync focused.

Use Cases

  • Build custom Airbyte source connectors for internal APIs or proprietary data sources not in Airbyte's catalog
  • Create Airbyte destination connectors for custom data warehouses or streaming services
  • Implement incremental sync (cursor-based) for custom data sources to reduce API calls in recurring syncs
  • Publish custom connectors to Airbyte's open-source connector registry for community use
  • Build agent data ingestion pipelines where agent outputs need to flow into data warehouses via Airbyte

Not For

  • Teams without Airbyte infrastructure — CDK connectors only run within Airbyte's orchestration system
  • One-time data migrations — CDK is for recurring sync connectors; use dbt or custom scripts for one-time migrations
  • Real-time streaming — Airbyte is batch-oriented; use Kafka or Flink for real-time streaming

Interface

REST API
No
GraphQL
No
gRPC
No
MCP Server
No
SDK
Yes
Webhooks
No

Authentication

Methods: none
OAuth: No Scopes: No

SDK library. Source/destination auth is implemented within the connector using Airbyte's spec/config pattern.

Pricing

Model: open_source
Free tier: Yes
Requires CC: No

MIT license. Part of the Airbyte open source project.

Agent Metadata

Pagination
cursor
Idempotent
Full
Retry Guidance
Documented

Known Gotchas

  • CDK connectors run in Docker containers — local testing requires Docker and the Airbyte connector runner; development feedback loop is slower than pure Python testing
  • Schema inference vs. explicit schema definition — HttpStream can infer JSON schema but inferred schemas may change unexpectedly; explicit catalog specs are more reliable
  • Incremental sync requires implementing get_updated_state() — incorrect state management causes either duplicate data or missed records on subsequent syncs
  • HTTP rate limiting must be implemented in the connector — CDK's HttpStream handles retry but the rate limit strategy must match the source API's limits
  • Connector spec (config JSON schema) must match exactly what the source expects — mismatches cause authentication failures that are hard to debug without direct API access
  • CDK versions update frequently with deprecation cycles — pin CDK version and test upgrade paths; some CDK releases have breaking changes to base class APIs

Alternatives

Full Evaluation Report

Comprehensive deep-dive: security analysis, reliability audit, agent experience review, cost modeling, competitive positioning, and improvement roadmap for Airbyte CDK.

AI-powered analysis · PDF + markdown · Delivered within 30 minutes

$99

Package Brief

Quick verdict, integration guide, cost projections, gotchas with workarounds, and alternatives comparison.

Delivered within 10 minutes

$3

Score Monitoring

Get alerted when this package's AF, security, or reliability scores change significantly. Stay ahead of regressions.

Continuous monitoring

$3/mo

Scores are editorial opinions as of 2026-03-07.

6470
Packages Evaluated
26150
Need Evaluation
173
Need Re-evaluation
Community Powered