Open-Source OCR Platform

Where Documents
Become Decisions

Convert PDFs, scans & images to structured data in seconds. AI-powered accuracy at R0.09 per 1,000 pages — 100x cheaper than commercial APIs.

R0.09 per 1,000 pages
< 3s avg processing
99.2% accuracy
OCR Forge document processing flow — upload a PDF or scan, process with AI, and retrieve structured JSON data
Quick Start
# Upload a document
curl -X POST https://api.ocrforge.co.za/documents \
  -H "X-API-Key: your_api_key" \
  -d '{
    "filename": "invoice.pdf",
    "mimeType": "application/pdf",
    "outputFormat": "json"
  }'

# Response
{
  "job_id": "doc_a1b2c3d4e5f6",
  "status": "awaiting_upload",
  "upload_url": "https://s3.af-south-1..."
}
Municipalities Accounting Firms Medical Aids Agriculture Co-ops Legal Practices SARS Compliance

Everything you need to digitise your documents

Enterprise-grade OCR without the enterprise price tag.

🧠

AI-Powered Accuracy

GLM-OCR vision model — ranked #1 on OmniDocBench. Handles tables, handwriting, mixed layouts, and low-quality scans.

Lightning Fast

Serverless architecture on AWS af-south-1. Documents processed in under 3 seconds with automatic scaling.

🔒

Zero-Knowledge Security

POPIA-compliant. Your documents are encrypted at rest and in transit. We never see your content — only metadata for debugging.

📊

Structured Output

Get results as JSON, CSV, searchable PDF, or map to custom templates.

🇿🇦

Made for SA

Hosted in Cape Town (af-south-1). Optimised for South African documents — SARS forms, municipal bills, medical claims.

🔌

Simple REST API

Two API calls: upload and retrieve. Webhook callbacks when processing completes. SDKs for Python, Node.js, PHP & .NET/C#.

How it works

From raw document to structured data in 3 simple steps.

Upload document icon
1

Upload

Send your PDF, image, or scan via API. Get a pre-signed URL for direct upload to S3 — fast and secure.

AI processing icon
2

Process

Our AI pipeline pre-processes, routes to the best OCR engine, extracts text, and structures the output automatically.

Retrieve structured data icon
3

Retrieve

Poll the status endpoint or set up a webhook. Get structured JSON, CSV, or searchable PDF — ready for your systems.

Simple, transparent pricing

Start free. Scale as you grow. All prices in ZAR.

Free
R0/month
For testing & personal projects
  • 50 pages / month
  • JSON & text output
  • Standard processing
  • Community support
Get Started
Business
R1,999/month
For growing teams
  • 25,000 pages / month
  • Custom templates
  • Priority support
  • SLA guarantee
Contact Sales
Enterprise
Custom
For large organisations
  • Unlimited pages
  • Self-hosted option
  • Dedicated GPU infra
  • Custom models
  • Dedicated support
Contact Sales

All plans include: POPIA compliance, encryption at rest & in transit, 30-day data retention. Pay-as-you-go also available at R0.09 per 1,000 pages.

Developer-first API

Two endpoints. That's it. Upload, then retrieve.

POST
/documents

Create a new OCR job. Returns a pre-signed upload URL and job ID.

GET
/documents/:id

Check job status. Returns extracted text and structured data when complete.

GET
/documents/:id/result

Download the processed result in your chosen format (JSON, CSV, PDF).

GET
/health

Health check. Returns platform status and version.

Full API documentation with examples, SDKs, and webhook setup.

API Documentation

Ready to forge your documents?

Sign up for early preview access. A member of our team will be in contact to get you started.

By signing up you agree to our terms of service. POPIA compliant.