Skip to main content

What is CommodityAI?

CommodityAI is an AI-powered document processing platform that automatically extracts structured data from your unstructured documents. Whether you’re processing invoices, contracts, emails, or custom business documents, CommodityAI uses advanced AI classification and extraction to transform your documents into queryable, structured records.

Key Features

Automated Document Processing

Upload documents through email forwarding or direct file upload, and CommodityAI automatically:
  • Classifies document types using AI-powered classification
  • Splits multi-document files into logical sources
  • Extracts structured data based on your custom schemas
  • Provides confidence scores for each extracted field with alternative values

Custom Object Management

Define your own data schemas tailored to your business:
  • Create source record definitions for document-based data (invoices, contracts, etc.)
  • Build custom objects with primary keys, formulas, and field validations
  • Set up workflows that route documents to the right extraction pipeline
  • Configure triggers based on email properties, document classifications, or data conditions

Multi-Tenant Architecture

Enterprise-grade security and isolation:
  • Company-scoped data - All data is isolated by company ID
  • Role-based access control - Admin, member, and custom role permissions
  • Audit trails - Complete history of all data modifications
  • API key authentication - Secure programmatic access to your data

Integration Capabilities

Access your data programmatically:
  • REST API with cursor-based pagination for efficient data retrieval
  • Rate-limited access - 100 requests/min, 10k requests/day per key
  • Filtering and querying - Query by definition, timestamps, counters
  • Real-time access - Data available via API immediately after extraction

Who Uses CommodityAI?

Document-Heavy Businesses

Perfect for companies processing large volumes of documents:
  • Trading companies extracting contract terms, shipment details, and invoice data
  • Supply chain operations processing bills of lading, certificates, and shipping documents
  • Finance teams automating invoice and receipt processing
  • Procurement departments extracting purchase order and vendor information

Compliance and Audit Teams

Streamline compliance workflows:
  • Automated data extraction reduces manual data entry errors
  • Confidence scoring flags low-confidence extractions for human review
  • Audit trails provide complete history of document processing
  • Structured export enables compliance reporting and analysis

Data Analytics Teams

Convert unstructured documents to structured data for analysis:
  • Normalized data - All extracted data follows your defined schemas
  • API access - Pull data into your analytics tools and dashboards
  • Field-level provenance - Track where each value came from in source documents
  • Confidence metrics - Filter by extraction confidence for quality analysis

How It Works

1. Document Upload

Documents enter the system through multiple channels:
  • Email forwarding - Forward emails to your company-specific address
  • Direct upload - Upload files through the web interface
  • Supported formats - PDF, images, Excel, Word documents

2. AI Classification

Our AI engine automatically identifies document types:
  • Multi-class classification - Documents are categorized by type
  • Confidence levels - Each classification includes a confidence score (high/medium/low)
  • Document splitting - Multi-page PDFs are intelligently split into logical documents
  • Trigger matching - Classified documents route to the appropriate workflows

3. Data Extraction

Structured data extraction using your custom schemas:
  • LLM-powered extraction - Uses advanced language models (GPT-4, Gemini, Claude)
  • Schema validation - Extracted data conforms to your defined field types
  • Confidence scoring - Each field includes confidence metrics and alternatives
  • Human review workflows - Low-confidence extractions can route to review queues

4. Access and Integration

Your data becomes immediately accessible:
  • Web interface - View and edit records in the dashboard
  • REST API - Programmatic access for integrations
  • Export options - Download data or sync to external systems
  • Real-time availability - Data accessible within seconds of extraction

Getting Started

Ready to start processing your documents? Check out our Quick Start Guide or jump directly to the API Reference.