What is a schema?
A schema defines the shape of the data you want to extract from a document. Think of it as a contract:- What fields should exist
- What type of data each field contains
- What “correct” looks like
When you use schemas
You use a schema when:- You want consistent output across many documents
- You care about structure, not just raw text
- You plan to validate or improve results over time
What schemas are not
Schemas are not:- Rules engines
- Code
- OCR configurations
