Estuary
Integration icon
FASTEST, MOST RELIABLE CDC AND ETL

Stream data from PostgreSQL to Databricks

Sync your PostgreSQL data with Databricks in minutes using Estuary Flow for real-time, no-code integration and seamless data pipelines.

  • No credit card required
  • 30-day free trial
PostgreSQL logo
Databricks logo
  • 100SOf connectors
  • 5500+Active users
  • <100MSEnd-to-end latency
  • 7+GB/SECSingle dataflow
PostgreSQL logo

PostgreSQL connector details

Built for real-time data integration, the Estuary PostgreSQL connector streams inserts, updates, and deletes from PostgreSQL databases using Change Data Capture (CDC) via logical replication. It reads directly from the write-ahead log (WAL) to deliver low-latency, exactly-once data movement into Flow collections. The connector supports self-hosted, RDS, Aurora, Cloud SQL, Azure Database for PostgreSQL, and Supabase, with secure connectivity options such as SSH tunneling and SSL.

  • Continuous CDC streaming through PostgreSQL logical replication
  • Works with managed and on-prem PostgreSQL instances
  • Supports backfill and read-only captures
  • Automatically manages replication slots and publications
  • Secure setup via SSH or SSL

For more details about the PostgreSQL connector, check out the documentation page.

Databricks logo

Databricks connector details

Estuary’s Databricks connector materializes Flow collections into tables in a Databricks SQL Warehouse. Data changes are first written to a Unity Catalog Volume and then applied transactionally to Databricks tables, ensuring reliable delivery of real-time updates. This connector supports both standard merges and delta updates for high-performance CDC workflows.

  • Real-time CDC data sync from Flow collections into Databricks tables
  • Choice of standard (merge) or delta updates to optimize performance and costs
  • Native integration with Databricks Unity Catalog for secure, managed storage
  • Flexible sync scheduling to balance freshness and warehouse costs

For more details about the Databricks connector, check out the documentation page.

How to integrate PostgreSQL with Databricks in 3 simple steps using Estuary Flow

1

Connect PostgreSQL as Your Real-Time Data Source

Set up a real-time source connector for PostgreSQL in minutes. Estuary captures change data (CDC), events, or snapshots — no custom pipelines, agents or manual configs needed.

2

Configure Databricks as Your Target

Choose Databricks as your target system. Estuary intelligently maps schemas, supports both batch and streaming loads, and adapts to schema changes automatically.

3

Deploy and Monitor Your End-to-End Data Pipeline

Launch your pipeline and monitor it from a single UI. Estuary Flow guarantees exactly-once delivery, handles backfills and replays, and scales with your data — without engineering overhead.

Try Estuary for Free

Estuary Flow in action

See how to build end-to-end pipelines using no-code connectors in minutes. Estuary Flow does the rest.

Why Estuary Flow is the best choice for data integration

Estuary Flow combines the most real-time, streaming change data capture (CDC), and batch connectors together into a unified modern data pipeline:

Real-time ETL with Estuary Flow: Seamlessly move data from source to destination for immediate analysis and actionable insights.

What customers are saying

  • Together AI avatar

    YuTong (Julia) Zhang

    Senior Software Engineer, Together AI
    Together AI avatar

    For AI systems like ours, freshness of data is everything. Estuary gives us sub-second latency without the complexity of maintaining streaming infrastructure ourselves. That reliability means our teams can focus on advancing AI models instead of pipelines.

  • Glossier avatar

    Brandon Besash

    Director, Business Intelligence, Glossier
    Glossier avatar

    Estuary enabled us to finally implement our ERP’s new data endpoint with all our inventory transactions, purchasing, and shipping data. We can now unlock data blocked by cost before, and sync times are much faster and are always being improved by the Estuary team.

    Read the Success Story
  • Xometry avatar

    Andrew Woelfel

    Senior Manager, Data Engineering and Analytics, Xometry
    Xometry avatar

    “Estuary has been a pleasure to work with and has significantly modernized our data infrastructure, delivering real-time and scalable processes that will significantly impact company-wide operations. Every data-driven organization should be looking at Estuary today.”

    Read the Success Story
  • Cosuno avatar

    Maximilian Seifert

    CTO, Cosuno
    Cosuno avatar

    Estuary just works. We’ve never had an incident, and it cut our data movement costs in half.

    Read the Success Story
  • Shippit avatar

    Keat Min Woo


    We didn’t want to be locked into a system where faster syncs meant higher bills. Estuary gives us real-time pipelines without pricing games or the burden of running Kafka ourselves.

    Read the Success Story
  • Livble avatar

    Uri Vinetz

    Director of Data, Livble
    Livble avatar

    We needed something self-serve, fast, and reliable, and Estuary delivered exactly that. It’s a huge unlock for our operations, reporting, and machine learning.

    Read the Success Story
  • Resend avatar

    Jonni Lundy

    COO, Resend
    Resend avatar

    Estuary Flow transformed how we operationalize our data for fraud, security, support, and beyond.. Instead of unreliable, expensive backfills, we have real-time visibility into platform activity. The proactive support and hands-on approach make all the difference.

    Read the Success Story
  • Recart avatar

    Istvan Kovacs

    CTO, Recart
    Recart avatar

    Estuary became our real-time data backbone without the cost or complexity of traditional solutions. We replaced a fragile, high-maintenance pipeline with a managed system that just works and scales.?

    Read the Success Story
  • Headset avatar

    Scott Vickers

    CTO, Headset
    Headset avatar

    Estuary has been a game-changer for Headset’s data infrastructure. Compared to our previous solutions, it has dramatically improved reliability while reducing our overall costs significantly.

    Read the Success Story
  • Revunit avatar

    Revunit


    Estuary is our preferred CDC solution for importing data from application databases into BigQuery for analytics. It offers a transparent pricing structure, timely support responses, and an intuitive CLI tool for bulk configuration tasks. In contrast, other market solutions often have ambiguous pricing and fewer options for precise data replication across environments. This makes choosing to use Estuary an obvious decision.

  • PDI. avatar

    PDI.


    Estuary Flow makes tough data transformation problems a piece of cake with its intuitive user interface and incredible breadth of features.

  • OneCommerce avatar

    OneCommerce


    Estuary Flow is the only SaaS tool that we found which can do a simple loop and calculate COGS from an array of objects nested in a property. We love to write transformations in typescript because it's in the same codebase and super easy to maintain and read. It's a true game changer.

  • Minima Global avatar

    Minima Global


    Getting #MINIMA real-time data replication out to the Postgres database was not fun until we found @EstuaryDev it is the best materialization.

  • Seattle Data Guy avatar

    Ben Rogojan

    Owner, Seattle Data Guy
    Seattle Data Guy avatar

    Estuary makes working with real-time data more cost effective and just as simple as working with batch data.

  • Pompato avatar

    Pompato


    This tool is 1000x times better than LogStash or Elastic Enterprise Data Ingestion Tool.

  • DeepSync avatar

    DeepSync


    Estuary Flow allows us to integrate low-latency CDC and connect to SaaS apps across our entire reporting stack and it’s the only solution that we’ve found that lets us do both.

  • Fenestra avatar

    Fenestra


    We needed a platform to help us optimize marketing campaigns with low-latency. Estuary provided an unparalleled solution to do that at terabyte scale.

  • Coalesce avatar

    Coalesce


    Estuary is the only system we’ve found that can seamlessly replicate large scale Firestore data for analytics. After months of research and trying everything, we can confidently say that Estuary is the only company that can help us get easy, accurate analytics on our data within Snowflake when replicating from Firestore data.

  • Flashpack avatar

    Flashpack


    We're a big fan of Estuary's real-time, no code model. It's magic that we're getting real time data without much effort and we don't have to spend time thinking about broken pipelines. We've also experienced fantastic support by Estuary.

    Increase productivity 4x

    With Flow companies increase productivity 4x and deliver new projects in days, not months. Spend much less time on troubleshooting, and much more on building new features faster. Flow decouples sources and destinations so you can add and change systems without impacting others, and share data across analytics, apps, and AI.

    Spend 2-5x less

    Estuary customers not only do 4x more. They also spend 2-5x less on ETL and ELT. Flow's unique ability to mix and match streaming and batch loading has also helped customers save as much as 40% on data warehouse compute costs.

    Data moved

    It's free up to 10 GB/month and 2 connector instances.

    GB

    Choose number of sources and destinations.

    Your price at Estuary

    Free
    2GB of data moved
    2 connector instances
    Try it Free

    Pricing comparisons

    Compared to Confluent, you save $1,201 / month!
    Compared to Fivetran, you save $1,479 / month!

    Success stories

    RELATED ARTICLES

    Frequently Asked Questions

      What is the difference between ETL, ELT, and CDC?

      ETL extracts data from sources, transforms it, and loads it into a destination. ELT loads raw data first and transforms it within the destination for flexibility. CDC captures real-time changes (inserts, updates, deletes) and syncs them to the destination.

      Estuary Flow supports real-time CDC, ETL, and ELT, allowing you to choose the best approach for your needs.

      Stream data from PostgreSQL with sub-100 ms latency. Just connect to Postgres and select the tables you want to capture. Estuary Flow immediately starts streaming the write-ahead log (WAL), and incremental micro-snapshots from each table. Data is simultaneously streamed exactly once and stored in parallel inside your own private cloud storage for reuse at any time.

      Move data from Postgres to any number of other destinations in parallel - Postgres to BigQuery, Postgres to Databricks, Postgres to Elasticsearch, Postgres to Redshift, Postgres to Snowflake, Postgres to SQL and NoSQL databases - and many more (see integrations).

      1. Set Up Capture: In Estuary Flow, go to Sources, click + NEW CAPTURE, and select the PostgreSQL connector.
      2. Enter Details: Add your PostgreSQL connection details and click SAVE AND PUBLISH.
      3. Materialize Data: Go to Destinations, choose your target system, link the PostgreSQL capture, and publish.

      Estuary offers competitive and transparent pricing, with a free tier that includes 2 connector instances and up to 10 GB of data transfer per month. Explore our pricing options to see which plan fits your data integration needs.

    Getting started with Estuary

    • Free account

      Getting started with Estuary is simple. Sign up for a free account.

      Sign up
    • Docs

      Make sure you read through the documentation, especially the get started section.

      Learn more
    • Community

      I highly recommend you also join the Slack community. It's the easiest way to get support while you're getting started.

      Join Slack Community
    • Estuary 101

      I highly recommend you also join the Slack community. It's the easiest way to get support while you're getting started.

      Watch

    QUESTIONS? FEEL FREE TO CONTACT US ANY TIME!

    Contact us

    DataOps made simple

    Add advanced capabilities like schema inference and evolution with a few clicks. Or automate your data pipeline and integrate into your existing DataOps using Flow's rich CLI.

    Schema evolution options

    One platform for all data movement

    Try Now