Each block in the document is routed through a typed strategy (text / table / figure / dense-table-agent) instead of a single batched call — better tables and richer figure descriptions.Documentation Index
Fetch the complete documentation index at: https://docs.anyformat.ai/llms.txt
Use this file to discover all available pages before exploring further.
What is agentic parsing?
Standard parsing sends every block to one LLM call with one prompt. Agentic parsing first routes each block (using a fast LLM with the page image) to the right specialised strategy:| Block type | Strategy | Why |
|---|---|---|
| Plain text | text-bytes-first | Cheap, fast — bypasses LLM entirely when source bytes are extractable |
| Tables | dense-table-agent | Tool-calling agent that handles spans, merged cells, multi-page tables |
| Figures / charts | single-call vision | LLM with image input for structured figure descriptions |
End-to-end
Python package + class names are provisional.
pip install anyformat-sdk and from anyformat.sdk import Client work today, but both are expected to change before the official launch — pin the version you ship with.- curl
- TypeScript
- Python
Agentic mode takes longer than standard mode — typically 30–90s for a 3-page document — because each block hits its own LLM strategy. Use webhooks instead of polling for production workloads.
Sample response
Per-block vs. document confidence
The response carries two document-level rollups plus a per-blockdata-confidence attribute inside the markdown.
| Field | Use for |
|---|---|
parse.parse_confidence — char-weighted mean of per-block LLM logprobs (typical 80–99); null when no block had logprob-based confidence | Triage: “is this doc worth processing further?” |
parse.layout_confidence — char-weighted mean of YOLO layout-segmentation scores (typical 30–60); always present when blocks exist | Fallback when parse_confidence is null. Measures “is this region a table?”, not “is the parsed content accurate?” |
data-confidence per <section> | UI highlighting: dim or flag low-confidence regions inline |
Mode caveat: in agentic mode, per-block strategies don’t always populate parser logprobs (e.g. the fast
text-bytes-first strategy never calls an LLM). When logprobs are absent, parse_confidence is null and callers fall back to layout_confidence. For calibrated parser confidence comparable to extraction confidence (80–99 range), use mode="standard".Use with AI coding agents
Building an integration on top of anyformat? Install the anyformat Claude Code skill so Claude (or any compatible agent) knows the right endpoints, payloads, and gotchas out of the box. See Coding assistant for installation and example prompts.Next steps
Create workflow reference
Full reference for the typed-graph endpoint
Parse-only workflow
Standard parse-only cookbook with the same endpoint
Add extraction
Add an extract node after the parse step to pull structured fields
Webhooks
Skip polling — receive
extraction.completed events