Developer-first document extraction

Document Extraction API for Receipts, Invoices, and PDFs

DocuParse API helps developers extract structured data from business documents. Upload files, process them through the API, and receive clean JSON for your application, dashboard, or automation workflow.

Start with 20 documents/month and full API access. No credit card required.

API-first

A simple API for document extraction

API-first, not spreadsheet-first. DocuParse API is designed for developers who need structured document data inside their product, dashboard, automation, or backend system.

R

REST API

Standard HTTP endpoint. Works with any language, framework, or HTTP client.

S

Structured JSON output

Named fields returned as a consistent JSON object — not raw text.

R

Reliable extraction

Handles difficult scans, unusual layouts, and non-standard document formats automatically.

D

Dashboard visibility

View every extracted document in the DocuParse dashboard with full result details.

A

API key access

Secure API key authentication. Generate and revoke keys from the dashboard.

S

Structured fields

Every extraction returns normalized, named fields so you can build reliable downstream workflows.

What you can extract

Structured fields returned for receipts, invoices, and business documents

Merchant / vendor
Company or store name
Total and subtotal
Charged and pre-tax amounts
Tax
Tax amount and rate
Date and due date
Transaction and payment dates
Invoice / receipt ID
Unique document identifiers
Currency
ISO 3-letter currency code
Payment method
Card, bank transfer, etc.
Line items
Individual products or services
Structured fields
Per-field field coverage
How it works

How it works

Four steps from document file to structured data in your application

STEP 01

Upload document

Send a PDF, JPG, or PNG file via a multipart POST request with your API key.

STEP 02

Extract structured fields

DocuParse processes the file and extracts named fields. Difficult documents are handled automatically.

STEP 03

Review in dashboard

View extraction results in the DocuParse dashboard or receive the JSON response directly.

STEP 04

Store JSON in your app

Use the structured JSON in your database, workflow, dashboard, or automation system.

Request

curl -X POST https://docuparseapi.com/api/v1/extract \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "file=@document.pdf"

Response

{
  "success": true,
  "document_type": "invoice",
  "merchant": "Example Corp Ltd.",
  "invoice_id": "INV-2026-0891",
  "date": "2026-05-10",
  "due_date": "2026-06-10",
  "subtotal": "1200.00",
  "tax": "120.00",
  "total": "1320.00",
  "currency": "USD",
  "line_items": [
    {
      "description": "API access — Professional plan",
      "quantity": 1,
      "amount": "1200.00"
    }
  ],
  "extraction_source": "rule"
}
Developer-first

Built for developers

Structured fields, not just OCR text. DocuParse API returns useful fields your application can actually use — including totals, dates, currencies, merchants, document IDs, and line items.

Built for real workflows. Use extracted JSON to power expense tools, bookkeeping workflows, invoice processing, internal dashboards, customer uploads, and SaaS document ingestion.

Consistent JSON schema
Every response follows the same structure regardless of document type.
Dashboard visibility
See every document and its extraction result inside the DocuParse dashboard.
API key authentication
Generate and manage API keys from the dashboard. Revoke anytime.

Frequently asked questions

What is a document extraction API?

A document extraction API accepts document files and returns structured data from them — named fields like merchant, total, date, currency, and line items — as a JSON response rather than raw text.

What document types does DocuParse API support?

DocuParse API is built for receipts, invoices, and business documents in PDF, JPG, PNG, and CSV formats.

How is this different from OCR?

OCR converts images to raw text. DocuParse API goes further: it identifies document type, locates relevant fields, extracts values, and returns them as a structured JSON object your application can use directly.

Can I see extraction results in a dashboard?

Yes. All extracted documents are visible in the DocuParse dashboard with their full extraction results, document type, and structured fields.

How does DocuParse handle difficult or low-quality documents?

DocuParse handles difficult scans, unusual layouts, and non-standard formats automatically. The extraction runs and returns the same named fields regardless of document quality.

How do I get started?

Sign up for a free account, generate an API key from the dashboard, and send your first document to the /api/v1/extract endpoint. The free tier includes 20 documents/month with full API access and no credit card required.

Start extracting documents today

Start with 20 documents/month and full API access. No credit card required.