
February 2026

February 25, 2026

  • PaddleOCR Engine: Added PaddleOCR as a new OCR engine with an updated model for Latin languages and memory-efficient model loading
  • LLM Cost Tracking: Billing transactions now include per-extraction LLM token counts and costs for full cost transparency
  • Table Extraction Accuracy: Fixed cross-row bounding box contamination with Y-coordinate constraints and improved prompts for tables containing surrounding text
  • Extraction Pipeline Metrics: New Prometheus instrumentation and Grafana dashboards for real-time visibility into extraction pipeline performance and worker queues
  • Auto-scaling Workers: Extraction workers now auto-scale based on queue depth using KEDA and Prometheus metrics
  • Sample Workflows: New accounts are prepopulated with sample workflows and documents for a guided onboarding experience
  • Langfuse Tracing: Improved extraction tracing with workflow names, LLM generation spans, and decoupled OpenTelemetry integration
  • Bug Fix: Fixed a crash in smart lookup when the search corpus was empty
  • Bug Fix: Fixed N+1 queries on extraction callbacks and field list endpoints
  • Bug Fix: Fixed an issue where the frontend could not read the filenames of downloaded files
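
As a rough illustration of the Y-coordinate constraint mentioned in the Table Extraction Accuracy entry above, the sketch below keeps only the bounding boxes whose vertical center falls inside a given row band. The `Box` type, the `boxes_in_row` helper, and the center-based rule are assumptions made for this example; the changelog does not describe how the constraint is actually applied.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Hypothetical bounding box in page coordinates (y grows downward)."""
    text: str
    x0: float
    y0: float  # top edge
    x1: float
    y1: float  # bottom edge


def boxes_in_row(boxes, row_top, row_bottom):
    """Keep boxes whose vertical center lies in [row_top, row_bottom)."""
    kept = []
    for box in boxes:
        center_y = (box.y0 + box.y1) / 2
        if row_top <= center_y < row_bottom:
            kept.append(box)
    return kept


# Two cells from the first row and one cell from the row below it.
candidates = [
    Box("Widget", 10, 100, 60, 112),
    Box("4", 200, 101, 210, 113),
    Box("Gadget", 10, 120, 62, 132),
]
row = boxes_in_row(candidates, row_top=98, row_bottom=116)
```

Filtering on the vertical center rather than on full containment is one way to tolerate boxes that slightly overshoot a row border while still excluding boxes that belong to a neighboring row.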

February 21, 2026

  • Visual Grounding 2.0: Completely revamped evidence highlighting with block-level and element-level precision, making it easier to trace extracted values back to their exact location in the original document
  • Billing Dashboard: New usage dashboard with remaining credits, weekly summaries, per-workflow usage breakdown, and CSV export
  • Document Segmentation: Advanced document layout analysis now enabled in production for improved table and section detection
  • Bug Fix: Fixed an issue where bounding box highlights could remain stuck after navigating between fields
  • Bug Fix: Fixed calibration configuration for confidence scoring

February 14, 2026

  • Redesigned Home Page: New personalized home with recent workflows, at-a-glance file status indicators, and a streamlined workflow creation experience with templates
  • New Schema Builder: Redesigned workflow configuration with a two-step Define & Refine flow, AI-powered field suggestions, inline file previews, and drag-and-drop uploads
  • Faster Page Rotation: Upgraded the page orientation detection engine for roughly 2x faster processing per page
  • Improved Field Suggestions: More reliable AI field suggestions powered by an upgraded model
  • Smart Lookup Fix: Fixed an issue where smart lookup settings were not saved when publishing a workflow version
  • Bug Fix: Fixed sidebar overflow in the workflow studio

February 7, 2026

  • New Dashboard Design: Refreshed visual design across the platform with updated components, colors, and layout
  • Improved Visual Grounding: Evidence highlighting now uses an event-driven architecture, making interactions faster and more reliable across all views
  • Secure File Access: Files and organization logos are now served securely through the backend, improving security and simplifying local development
  • Workflow Name Suggestions: New AI-powered workflow name suggestions based on your description and uploaded files
  • File Status Counts: Workflow listings now show at-a-glance counts of files by processing status
  • Bug Fix: Fixed document element classification that could cause incorrect layout detection in certain documents

February 3, 2026

  • Smart Lookup: New feature to enrich extracted data with external reference files using AI-powered matching
  • Field Source Tracking: Fields now track their source (extraction vs. smart lookup) for better traceability
  • OCR Performance: Improved startup times for OCR services with memory snapshots
  • Bug Fix: Fixed field data processing to correctly handle manual field creation

January 2026

January 27, 2026

  • Usage-Based Billing: Introduced billing for data extraction operations with transparent per-page pricing
  • Improved Suggestions: Faster and more reliable field suggestions when creating workflows
  • Bug Fix: Resolved an issue where certain field value formats could cause upload errors

January 20, 2026

  • Faster OCR Processing: Improved document processing speed with optimized text recognition
  • Extraction Review: Administrators can now review extractions directly in the dashboard
  • Organization Credits: Added ability to manage organization credits for usage billing

January 13, 2026

  • Faster Document Processing: Significantly improved processing speed for large documents
  • Row Count Filters: Filter extraction results by the number of rows extracted
  • Page Rotation Control: Configure custom page rotation strategies per workflow

January 6, 2026

  • Results Filtering: Enhanced results download with additional filtering options
  • API Documentation: Expanded API reference documentation with more examples

December 2025

December 15, 2025

  • Improved Reliability: Enhanced system stability and faster deployments

December 1, 2025

  • Performance Improvements: Optimized results retrieval for workflows with many documents
  • Cost Tracking: Added detailed usage tracking for extraction operations

November 2025

November 24, 2025

  • Data Row Creation: Create new data rows directly from specific extractions

November 17, 2025

  • Table Extraction: Improved accuracy when extracting data from complex tables
  • User Status Endpoint: New /user/is-enabled endpoint to check account status
  • File Validation: File validator responses now include user information

November 10, 2025

  • Large Document Support: Better handling of documents with many pages using smart chunking
  • Validation Tracking: Track which values have been manually validated or edited
  • Improved Accuracy: Enhanced extraction accuracy with an updated default model
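
The changelog does not say how smart chunking works. One common approach, sketched here purely as an assumption, is to split a long document into fixed-size page windows that overlap by a page so that content spanning a boundary appears in both chunks. The `chunk_pages` function and its defaults are hypothetical.

```python
def chunk_pages(page_count, chunk_size=10, overlap=1):
    """Split page indices 0..page_count-1 into overlapping windows.

    This is an illustrative strategy, not the platform's actual algorithm.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    chunks = []
    start = 0
    step = chunk_size - overlap
    while start < page_count:
        end = min(start + chunk_size, page_count)
        chunks.append(range(start, end))
        if end == page_count:
            break  # the last window already reaches the final page
        start += step
    return chunks


chunks = chunk_pages(25, chunk_size=10, overlap=1)
```

For a 25-page document this yields three windows, with pages 9 and 18 (zero-based) each appearing in two adjacent chunks.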

November 3, 2025

  • Large Document Processing: Improved extraction quality for documents exceeding context limits
  • Confidence Scoring: When merging extractions, the highest confidence value is preserved
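
The merge rule in the Confidence Scoring entry can be sketched as follows. This is a minimal illustration that assumes each extraction is a mapping from field name to a value with a confidence score; the `FieldValue` type and the `merge_extractions` function are hypothetical names, not part of the product's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldValue:
    """Hypothetical extracted value with its confidence score."""
    value: str
    confidence: float  # assumed to lie in [0, 1]


def merge_extractions(extractions):
    """Merge extractions, keeping the highest-confidence value per field."""
    merged = {}
    for extraction in extractions:
        for name, candidate in extraction.items():
            current = merged.get(name)
            # Strict comparison: on a tie, the earlier extraction wins.
            if current is None or candidate.confidence > current.confidence:
                merged[name] = candidate
    return merged


chunk_a = {
    "invoice_number": FieldValue("INV-001", 0.62),
    "total": FieldValue("120.00", 0.91),
}
chunk_b = {"invoice_number": FieldValue("INV-0017", 0.88)}
merged = merge_extractions([chunk_a, chunk_b])
```

Here `merged["invoice_number"]` comes from `chunk_b` because its confidence is higher, while `total` is carried over from `chunk_a` unchanged.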

October 2025

October 27, 2025

  • Performance: General performance improvements across the platform

October 13, 2025

  • CSV Export: Fixed field ordering in CSV exports to match workflow configuration
  • Results Ordering: Extraction results now consistently respect field order
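
To illustrate what matching the workflow's field order means for a CSV export, the sketch below writes columns in a configured order regardless of the key order in each result row. The `export_csv` function and the sample field names are assumptions for this example; it relies only on Python's standard `csv` module.

```python
import csv
import io


def export_csv(field_order, rows):
    """Write rows as CSV with columns in the configured field order."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=field_order,   # column order comes from the configuration
        extrasaction="ignore",    # skip keys that are not configured fields
        restval="",               # leave missing fields empty
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


field_order = ["vendor", "invoice_number", "total"]
rows = [{"total": "120.00", "vendor": "Acme", "invoice_number": "INV-001"}]
csv_text = export_csv(field_order, rows)
```

Passing the configured order as `fieldnames` makes the output independent of how each row dictionary happens to be ordered.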

October 6, 2025

  • Nested Field Reordering: New endpoint to reorder fields within object-type fields
  • Verification URLs: Job results now include verification URLs for easy access
  • Bug Fix: Fixed handling of fields with similar names in nested structures

September 30, 2025

  • Improved Extraction Pipeline: Redesigned extraction workflow for better reliability