Workflows - anyformat

A workflow is the set of instructions that tells anyformat what to do with your documents. You build it once, and it runs the same way on every document you give it. Think of it as an assembly line made of a few simple steps:

Read the document (Parse).
Pull out the fields you care about (Extract).
When you need to, sort documents by type (Classify), split a multi-document file apart (Split), or check the results against your own rules (Validate).

Each workflow targets one kind of document and one shape of output. Need a different output? Build a different workflow.

You don’t have to assemble these steps by hand. The fastest way to start is to describe what you want in plain language and let anyformat build a Parse → Extract workflow for you. To arrange the steps yourself, use Studio, the visual editor.

The five building blocks

Every workflow is built from some combination of five steps. Most workflows need only the first two.

Step	What it does	Example
Parse	Reads the document and turns it into clean text and tables anyformat can work with. Every workflow starts here.	Upload a bank statement PDF — Parse reads and prepares all the text and layout.
Extract	Pulls out the specific fields you ask for.	Get the invoice number, total, and date from each invoice.
Classify	Sorts documents into types you define, so each type can be handled differently.	Label each incoming file as “Invoice”, “Contract”, or “Statement”.
Split	Breaks one file that contains several documents into separate pieces.	A PDF with four invoices stapled together becomes four documents.
Validate (Beta)	Checks the extracted data against rules you write in plain language, and flags anything that fails.	Flag expired documents, or check that an IBAN looks valid.

Most users never go past Parse → Extract. Classify, Split, and Validate are there for when your documents are more complicated.
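To make the Validate example from the table concrete, here is a minimal Python sketch of what "check that an IBAN looks valid" could mean in practice. It applies the standard IBAN mod-97 checksum. This only illustrates the kind of check a rule describes; it is not anyformat's implementation, and the function name `iban_looks_valid` is made up for this example. In anyformat itself you would write the rule in plain language.

```python
def iban_looks_valid(iban: str) -> bool:
    """Return True if the string passes the standard IBAN mod-97 check.

    Illustrative only: a simplified structural check, not anyformat's
    Validate step. It does not verify country-specific lengths.
    """
    compact = iban.replace(" ", "").upper()
    # IBANs are 15 to 34 ASCII letters and digits.
    if not (15 <= len(compact) <= 34):
        return False
    if not (compact.isascii() and compact.isalnum()):
        return False
    # Move the country code and check digits to the end.
    rearranged = compact[4:] + compact[:4]
    # Replace each letter with a number: A=10, B=11, ..., Z=35.
    digits = "".join(str(int(ch, 36)) for ch in rearranged)
    # A valid IBAN leaves remainder 1 when divided by 97.
    return int(digits) % 97 == 1
```

A rule like this catches typos in the account number, but it cannot tell you whether the account actually exists.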

Three common shapes

These are the same building blocks arranged in different ways. You arrange them visually in Studio.

Read only

Just Parse, nothing else. You get clean text and tables back — handy when you want to feed anyformat’s output into your own tools (search, a custom AI, etc.) rather than pulling out specific fields.

[ Parse ]

Read, then pull out — the usual one

Parse into Extract. This is the default: “read this document and give me these fields.”

[ Parse ] → [ Extract ]

Sort first, then pull out

Classify decides what kind of document it is, then sends it to an Extract step tailored to that type. (Split works the same way for files that hold several documents.)

                          ┌──> [ Extract: invoice ]
[ Parse ] → [ Classify ] ─┤
                          └──> [ Extract: receipt ]
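The routing in the diagram above can be sketched in a few lines of Python. Everything here is a hypothetical stand-in written for illustration: `parse`, `classify`, `extract_invoice`, `extract_receipt`, and `run_workflow` are not anyformat functions, and the keyword and regex logic is far cruder than a real Parse, Classify, or Extract step. The point is only the shape: one classifier picks a label, and the label selects the extractor.

```python
import re
from typing import Callable, Dict, Optional

Fields = Dict[str, Optional[str]]


def parse(raw: str) -> str:
    """Stand-in for Parse: collapse whitespace into clean text."""
    return " ".join(raw.split())


def classify(text: str) -> str:
    """Stand-in for Classify: a simple keyword rule."""
    return "invoice" if "invoice" in text.lower() else "receipt"


def _find(pattern: str, text: str) -> Optional[str]:
    """Return the first captured group, or None if nothing matches."""
    match = re.search(pattern, text, flags=re.IGNORECASE)
    return match.group(1) if match else None


def extract_invoice(text: str) -> Fields:
    """Stand-in for an Extract step tailored to invoices."""
    return {
        "invoice_number": _find(r"invoice\s+(?:no\.?|number)\s*:?\s*(\S+)", text),
        "total": _find(r"total\s*:?\s*([\d.,]+)", text),
    }


def extract_receipt(text: str) -> Fields:
    """Stand-in for an Extract step tailored to receipts."""
    return {"total": _find(r"total\s*:?\s*([\d.,]+)", text)}


# One extractor per document type, mirroring the two branches in the diagram.
EXTRACTORS: Dict[str, Callable[[str], Fields]] = {
    "invoice": extract_invoice,
    "receipt": extract_receipt,
}


def run_workflow(raw: str) -> Dict[str, object]:
    """Parse, classify, then route to the extractor for that type."""
    text = parse(raw)
    doc_type = classify(text)
    return {"type": doc_type, "fields": EXTRACTORS[doc_type](text)}
```

Adding a third document type means adding one label to the classifier and one entry to the extractor table; the rest of the flow stays the same.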

See the recipes for end-to-end examples of each shape, or the Studio guide to build them yourself.

How to think about workflows

Most users follow the same lifecycle:

Create

Decide which steps you need and how they connect — or just describe what you want and let anyformat build it

Refine

Run a few sample documents through it, look at the results, and tighten your fields and instructions

Publish

Mark it ready for everyday use

Run at scale

Apply it to many documents — by uploading them, connecting cloud storage, or (for developers) calling the API

In the web platform you build workflows visually — either by describing what you want from the home screen, or by arranging the steps yourself in Studio.
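For the Refine step, it can help to compare what a workflow returned against values you already know are correct for a handful of sample documents. The sketch below assumes you have both as plain Python dictionaries, for example copied from the results view. The `field_accuracy` helper is hypothetical and not part of anyformat.

```python
from typing import Dict, List, Optional

Record = Dict[str, Optional[str]]


def field_accuracy(results: List[Record], expected: List[Record]) -> Dict[str, float]:
    """Share of sample documents in which each expected field matched exactly."""
    if len(results) != len(expected):
        raise ValueError("results and expected must cover the same documents")
    hits: Dict[str, int] = {}
    totals: Dict[str, int] = {}
    for got, want in zip(results, expected):
        for name, value in want.items():
            totals[name] = totals.get(name, 0) + 1
            # A missing field counts as a miss, the same as a wrong value.
            if got.get(name) == value:
                hits[name] = hits.get(name, 0) + 1
    return {name: hits.get(name, 0) / totals[name] for name in totals}
```

Fields with low scores are the ones whose names or instructions are worth tightening before you publish.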

What’s next?

Studio

The visual editor where you arrange the building blocks into a workflow

Runs & results

What happens when you run a workflow, and what comes back

Field types

The kinds of values a field can hold, and when to use each

Build your first workflow

A step-by-step walkthrough — start in the UI, no code needed
