Estuary
REAL-TIME ETL & CDC

Stream into Apache Iceberg with your free account

Continuously ingest and deliver both streaming and batch change data from 100s of sources using Estuary's custom no-code connectors.

  • <100ms Data pipelines
  • 100+ Connectors
  • 2-5x less than batch ELT
01. Select a source
02. Transform in-flight
03. Deliver to Apache Iceberg

Apache Iceberg connector details

The Apache Iceberg connector materializes Flow collections into Iceberg tables for large-scale analytics and lakehouse architectures. It orchestrates Spark jobs on AWS EMR Serverless (or another configured compute backend) to merge incoming changes, ensuring tables stay continuously up to date.

  • Materializes Flow collections into Apache Iceberg tables for open table formats
  • Orchestrates Spark jobs on AWS EMR Serverless to process updates
  • Supports hard deletes for accurate CDC replication or soft deletes for cost savings
  • Compatible with AWS Glue, S3 Tables, and Snowflake Open Catalogs via REST APIs
  • Configurable sync schedules to balance performance and cost
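The difference between hard and soft deletes can be illustrated with a minimal in-memory sketch. This is not the connector's implementation (which, per the description above, runs Spark jobs to merge changes); the `Change` type, the `merge_changes` function, and the `_deleted` marker column are hypothetical names chosen for illustration only.

```python
from dataclasses import dataclass
from typing import Any, Dict, Iterable, Optional


@dataclass
class Change:
    """One CDC event. Hypothetical shape, for illustration only."""
    op: str                               # "insert", "update", or "delete"
    key: str                              # primary key of the affected row
    row: Optional[Dict[str, Any]] = None  # new row contents (None for deletes)


def merge_changes(
    table: Dict[str, Dict[str, Any]],
    changes: Iterable[Change],
    hard_delete: bool = True,
) -> Dict[str, Dict[str, Any]]:
    """Apply CDC events to a keyed table, in order.

    hard_delete=True  removes deleted rows entirely.
    hard_delete=False keeps the row and sets a `_deleted` marker instead
    (the column name is an assumption made for this sketch).
    """
    for change in changes:
        if change.op in ("insert", "update"):
            # Upsert: the latest version of the row wins.
            table[change.key] = {**(change.row or {}), "_deleted": False}
        elif change.op == "delete":
            if hard_delete:
                table.pop(change.key, None)
            elif change.key in table:
                table[change.key] = {**table[change.key], "_deleted": True}
        else:
            raise ValueError(f"unknown operation: {change.op!r}")
    return table


events = [
    Change("insert", "1", {"name": "a"}),
    Change("insert", "2", {"name": "b"}),
    Change("update", "1", {"name": "c"}),
    Change("delete", "2"),
]
hard = merge_changes({}, events, hard_delete=True)
soft = merge_changes({}, events, hard_delete=False)
```

With hard deletes, row `2` disappears from the table, mirroring the source exactly. With soft deletes, row `2` stays in place with its marker set, so no row has to be physically removed during the merge.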

For more details about the Apache Iceberg connector, check out the documentation page.

How to connect your data source to Apache Iceberg in 3 easy steps

1

Connect your data source

Select from more than 100 supported databases and SaaS platforms including PostgreSQL, MySQL, SQL Server, MongoDB, and Kafka.

2

Prepare and transform your data

Apply transformations and schema mapping as data moves, whether you are streaming in real time or loading in batches.

3

Sync to Apache Iceberg

Continuously or periodically deliver data into your destination with support for change data capture and reliable delivery for accurate insights.

Learn more with some related videos

Dive deeper into Apache Iceberg with tutorials and walkthroughs from our YouTube channel.

Get Started Free

Trusted by data teams worldwide

All data connections are fully encrypted in transit and at rest. Estuary also supports private cloud and BYOC deployments for maximum security and compliance.

HIGH THROUGHPUT

Distributed event-driven architecture enables boundless scaling with exactly-once semantics.
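As a rough illustration of what exactly-once delivery means for a destination, the sketch below skips any event whose offset has already been applied, so a redelivered batch does not double-count. It is a generic checkpointing pattern, not a description of Estuary's internals; the `Sink` class and its fields are hypothetical, and a real system would have to commit the rows and the checkpoint atomically.

```python
from typing import Dict, Iterable, Tuple


class Sink:
    """Toy destination that tolerates redelivered events (hypothetical)."""

    def __init__(self) -> None:
        self.rows: Dict[str, int] = {}
        self.checkpoint = -1  # highest offset already applied

    def apply(self, batch: Iterable[Tuple[int, str, int]]) -> int:
        """Apply (offset, key, delta) events; return how many took effect."""
        applied = 0
        for offset, key, delta in batch:
            if offset <= self.checkpoint:
                continue  # already applied earlier, so skip the duplicate
            self.rows[key] = self.rows.get(key, 0) + delta
            self.checkpoint = offset
            applied += 1
        return applied


sink = Sink()
first = sink.apply([(0, "a", 1), (1, "a", 2)])
# The second batch redelivers offset 1 (for example, after a retry).
second = sink.apply([(1, "a", 2), (2, "b", 5)])
```

Here the redelivered event at offset 1 is ignored, so key `a` ends at 3 rather than 5.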

DURABLE REPLICATION

Cloud-storage-backed CDC with heartbeats ensures reliability, even if your destination is down.

REAL-TIME INGESTION

Capture and relay every insert, update, and delete in milliseconds.

Real-time, high throughput

Point a connector and replicate changes to Apache Iceberg in <100ms. Leverage high-availability, high-throughput Change Data Capture. Or choose from 100s of batch and real-time connectors to move and transform data using ELT and ETL.

  • Ensure your Apache Iceberg insights always reflect the latest data by connecting your databases to Apache Iceberg with change data capture.
  • Or connect critical SaaS apps to Apache Iceberg with real-time data pipelines.

See how you can integrate any source with Apache Iceberg:

Details

or choose from these popular data sources:

PostgreSQL
MySQL
SQL Server
MongoDB
Apache Kafka
BigQuery
Snowflake Data Cloud

Don't see a connector? Request one and our team will get back to you in 24 hours.

Pipelines as fast as Kafka, easy as managed ELT/ETL, cheaper than building it.

Feature Comparison

                    Estuary              Batch ELT/ETL        DIY Python     Kafka
Price               $$$-$$$$$-$$$$$-$$$$
Speed               <100ms               5min+                Varies         <100ms
Ease                Analysts can manage  Analysts can manage  Data Engineer  Senior Data Engineer
Scale
Maintenance Effort  Low                  Medium               High           High
Detailed Comparison

Deliver real-time and batch data from DBs, SaaS, APIs, and more

Popular sources/destinations you can sync your data with

Choose from more than 100 supported databases and SaaS applications. Click any source/destination below to open the integration guide and learn how to sync your data in real time or batches.