kafka-python
Python client for Apache Kafka — produce and consume Kafka messages with a pure-Python implementation. kafka-python features: KafkaProducer for publishing (send(), flush(), batch_size, linger_ms, compression_type), KafkaConsumer for subscribing (subscribe(), assign(), commit(), auto_offset_reset, group_id), KafkaAdminClient for topic management (create_topics, list_topics), message serialization (key_serializer/value_serializer), SSL/SASL authentication, consumer group rebalancing, offset management (seek, committed), and partition assignment. Pure-Python Kafka client with no librdkafka dependency. Alternative: confluent-kafka-python (wraps librdkafka; faster, but adds a C dependency).
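A minimal produce/consume round trip with the classes above — a sketch, not a reference: the broker address and topic name are placeholders, and constructing the clients requires a reachable broker, so those lines are shown commented:

```python
import json

# from kafka import KafkaProducer, KafkaConsumer  # pip install kafka-python

def encode(value):
    """value_serializer: dict -> bytes for the wire."""
    return json.dumps(value).encode("utf-8")

def decode(raw):
    """value_deserializer: bytes -> dict."""
    return json.loads(raw.decode("utf-8"))

# producer = KafkaProducer(bootstrap_servers="kafka:9092", value_serializer=encode)
# producer.send("agent-events", {"action": "completed", "task_id": "123"})
# producer.flush()  # send() is async; flush before exit
#
# consumer = KafkaConsumer("agent-events", bootstrap_servers="kafka:9092",
#                          group_id="agent-workers", value_deserializer=decode)
# for msg in consumer:
#     handle(msg.value)  # already a dict, thanks to the deserializer
```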
Score Breakdown
⚙ Agent Friendliness
🔒 Security
Use SSL for all production Kafka connections. Store SASL credentials in environment variables or secrets manager. Kafka ACLs control topic-level access — restrict agent consumers to only required topics. TLS certificate validation is on by default (ssl_check_hostname=True).
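A hedged sketch of a client configuration following these points; the broker host, certificate path, and SASL mechanism are illustrative assumptions, with credentials read from the environment rather than hardcoded:

```python
import os

# Placeholder values: adjust the listener port, CA path, and SASL mechanism
# to match your cluster.
secure_config = {
    "bootstrap_servers": "kafka.internal:9093",  # TLS listener (assumed)
    "security_protocol": "SASL_SSL",             # TLS transport + SASL auth
    "ssl_cafile": "/etc/kafka/ca.pem",           # CA bundle that signed the brokers
    "ssl_check_hostname": True,                  # the default, shown explicitly
    "sasl_mechanism": "SCRAM-SHA-512",
    "sasl_plain_username": os.environ.get("KAFKA_USERNAME"),
    "sasl_plain_password": os.environ.get("KAFKA_PASSWORD"),
}

# from kafka import KafkaConsumer
# consumer = KafkaConsumer("agent-events", **secure_config)
```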
⚡ Reliability
Best When
Python Kafka producer/consumer without a C library dependency — kafka-python's pure-Python implementation is simpler to install and debug than confluent-kafka for agent systems with moderate throughput requirements (<10K msg/sec).
Avoid When
You need high throughput (>10K msg/sec; use confluent-kafka), stream processing (use Faust or Bytewax), or real-time analytics.
Use Cases
- • Agent event publishing — import json; from kafka import KafkaProducer; producer = KafkaProducer(bootstrap_servers='kafka:9092', value_serializer=lambda v: json.dumps(v).encode()); producer.send('agent-events', {'action': 'completed', 'task_id': '123'}); producer.flush() — the agent publishes events to a Kafka topic; event-driven orchestration triggers downstream agent pipelines on task completion
- • Agent event consumption — import json; from kafka import KafkaConsumer; consumer = KafkaConsumer('tasks', bootstrap_servers='kafka:9092', group_id='agent-workers', auto_offset_reset='earliest', value_deserializer=lambda m: json.loads(m.decode())); for msg in consumer: process(msg.value); consumer.commit() — an agent worker pool consumes tasks from Kafka; horizontal scaling via consumer groups; each partition is processed by one agent
- • Agent at-least-once processing — producer = KafkaProducer(acks='all', retries=5, max_in_flight_requests_per_connection=1); consumer = KafkaConsumer(enable_auto_commit=False); msg = next(consumer); process(msg); consumer.commit() — manual commit after processing yields at-least-once delivery; pair it with idempotent handlers for an effectively-exactly-once pipeline
- • Agent topic management — from kafka.admin import KafkaAdminClient, NewTopic; admin = KafkaAdminClient(bootstrap_servers='kafka:9092'); admin.create_topics([NewTopic('agent-results', num_partitions=8, replication_factor=2)]) — agent provisioning creates Kafka topics; 8 partitions allow up to 8 parallel agent consumers
- • Agent batch processing — producer = KafkaProducer(batch_size=65536, linger_ms=100); futures = [producer.send('topic', v) for v in batch]; producer.flush(); [f.get(timeout=10) for f in futures] — batch produce with linger for throughput; agent data pipeline publishes 10K events/sec by batching with 100ms linger
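The producer use cases above all depend on every send() actually reaching the broker. A sketch of a shutdown-safe batch publish — publish_all is a hypothetical helper, not part of kafka-python's API:

```python
# Hypothetical helper: send a batch, flush even if a send raises,
# then surface per-message broker errors via the returned futures.
def publish_all(producer, topic, events, timeout=10):
    futures = []
    try:
        for event in events:
            futures.append(producer.send(topic, event))  # async: returns a future
    finally:
        producer.flush()  # drain pending batches even on exceptions
    # future.get() raises KafkaError if the broker rejected the message
    return [f.get(timeout=timeout) for f in futures]

# producer = KafkaProducer(bootstrap_servers="kafka:9092", value_serializer=...)
# metadata = publish_all(producer, "agent-events", batch)
```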
Not For
- • High-throughput production — confluent-kafka-python (librdkafka) is 3-10x faster; use confluent-kafka for >100K msg/sec production workloads
- • Kafka Streams / KSQL — kafka-python is a client library; for stream processing use Faust or Bytewax for Python
- • Real-time analytics — Kafka is messaging; for stream processing with aggregations use Apache Flink or Spark Streaming
Interface
Authentication
SSL/TLS for transport encryption. SASL/PLAIN, SASL/SCRAM, SASL/GSSAPI (Kerberos) for authentication. AWS MSK uses IAM via SASL/OAUTHBEARER.
Pricing
kafka-python client is Apache 2.0. Kafka broker hosting has separate costs.
Agent Metadata
Known Gotchas
- ⚠ flush() required before shutdown — KafkaProducer.send() is async; without producer.flush() or producer.close() pending messages are lost on process exit; agent code in containers or Lambda must call producer.flush() before exit; add try/finally to ensure flush even on exceptions
- ⚠ Consumer group rebalancing pauses consumption — consumer.subscribe(['topic']) triggers rebalance when new consumers join or leave; during rebalance (seconds to minutes), consumer receives no messages; agent consumer must handle rebalance gracefully via ConsumerRebalanceListener; set session_timeout_ms high enough for agent processing time
- ⚠ auto_offset_reset='latest' misses backlog — new consumer group with auto_offset_reset='latest' starts from current end, missing existing messages; agent code processing existing events must use auto_offset_reset='earliest'; 'latest' only for real-time-only agents that don't need historical events
- ⚠ Manual commit after processing, not before — consumer.commit() before msg = next(consumer) can lose messages if the process crashes; the at-least-once pattern is msg = consume(); process(msg); consumer.commit(); a crash before commit means the message is reprocessed, so design agent handlers to be idempotent
- ⚠ enable_auto_commit=True loses messages on failure — the default auto-commit periodically commits the latest fetched offset, which can include messages the agent has not finished processing; if the agent crashes after an auto-commit but before processing completes, those messages are marked consumed and never re-delivered; reliable agent processing must use enable_auto_commit=False with an explicit commit after processing
- ⚠ KafkaConsumer is not thread-safe — single consumer per thread; agent multi-threaded consumers need one KafkaConsumer per thread or use confluent-kafka's thread-safe API; sharing consumer across threads causes protocol errors and CommitFailedError
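Several of the gotchas above converge on one consumer shape: poll, process, then commit. A sketch with a hypothetical process_one_batch helper (the consumer is assumed to be constructed with enable_auto_commit=False):

```python
# Hypothetical helper implementing commit-after-processing (at-least-once).
def process_one_batch(consumer, handle, timeout_ms=1000):
    """Poll one batch, run the (idempotent) handler, commit only on success."""
    batch = consumer.poll(timeout_ms=timeout_ms)  # {TopicPartition: [records]}
    count = 0
    for records in batch.values():
        for record in records:
            handle(record.value)  # crash here => redelivery, not loss
            count += 1
    if count:
        consumer.commit()  # mark offsets only after processing succeeded
    return count

# consumer = KafkaConsumer("tasks", bootstrap_servers="kafka:9092",
#                          group_id="agent-workers", enable_auto_commit=False)
# while True:
#     process_one_batch(consumer, process)
```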
Alternatives
Full Evaluation Report
Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for kafka-python.
Scores are editorial opinions as of 2026-03-06.