Estuary

What are Schema Inference, Write and Read Schemas?

USE CASES3/20/2023

Lesser known facts about schemas:
What is schema inference?
What are write and read schemas?
When to use both?

Try Estuary for free: https://www.estuary.dev/

Join our Slack channel with a community of developers:
https://estuary-dev.slack.com/

Flow documents and collections always have an associated schema that defines the structure, representation, and constraints of your documents. Collections must have one schema, but may have two distinct schemas: one for when documents are added to the collection, and one for when documents are read from that collection.

Schemas are a powerful tool for data quality. Flow verifies every document against its schema whenever it's read or written, which provides a strong guarantee that your collections hold only "clean" data, and that bugs and invalid documents are caught before they can impact downstream data products.

In most cases, Flow generates a functioning schema on your behalf during the discovery phase of capture. In advanced use cases, however, customizing your schema becomes more important.

Flow performs static inference of the collection schema to verify the existence and types of all keyed document locations, and will report an error if the location could not exist, or could exist with the wrong type.

#schema #estuaryflow #data #dataops #dataengineering #datapipeline

More videos

Estuary logo on light background

Seamless Data Integration, Unlimited Potential

Discover the simplest way to connect and move your data.
Get hands-on for free, or schedule a demo to see the possibilities for your team.