Skip to main content
anyformat works with the same extracted data, but presents it in different representations, depending on what you want to do next.

CSV Output

Structured tabular data

CSV output represents extracted data as a table:
  • Each document becomes one or more rows
  • Each field becomes a column
  • Subtables are expanded in a consistent way

CSV is designed for

  • Spreadsheets
  • Imports
  • Data analysis
  • Sharing with non-technical users
Think of CSV as: “Structured data optimized for tables.”

Excel Output

Structured data with a spreadsheet interface

Excel output contains the same structured data as CSV, but packaged in a format that:
  • Is easier to open for business users
  • Preserves table structure
  • Supports multiple sheets (for subtables or metadata)

Excel is useful when

  • Manual review is part of the process
  • Data is shared across teams
  • You want minimal friction for non-technical stakeholders

JSON Output

Structured data for systems and automation

JSON output represents extracted data as a structured object. Conceptually:
  • Fields become keys
  • Nested structures are preserved
  • Data types remain explicit

JSON is useful when

  • Integrating with other systems
  • Building automation
  • Preserving hierarchical data (like subtables)
Think of JSON as: “Structured data optimized for machines.”

Markdown Output

Human-readable extraction view (review-only)

Markdown output is a human-readable representation of the extraction result. It is used to:
  • Inspect what was extracted
  • Verify that parsing worked correctly
  • Review results per document
Important to know:
  • Markdown is not a bulk export format
  • It’s generated per document
  • It exists to help you understand the extraction, not to move data elsewhere
Think of Markdown as: “A debugging and review view for humans.”

What’s next?