AI case study — Document AI

A privacy-first extraction system for confidential Australian taxation documents — handwritten text, arbitrary layouts, and government monograms read by open-source vision models on private servers.

Vision-language modelsOpen-source LLMsPrivate deploymentOCR
5sper document — work that took four full-time reviewers
The challenge

What we walked into.

Taxation documents are confidential, so third-party LLMs (OpenAI, Claude, Grok) were off the table — everything had to run on open-source models deployed to private servers. The documents include free-form handwritten text with no consistent format, plus information that had to be retrieved from government monograms.

What we built

The solution.

01

A dual vision-language model pipeline running entirely on the client's private infrastructure.

02

Extraction accuracy above 98% using only a 2B-parameter model — small enough to run economically on-premises.

The results

What changed.

01

Previously four people worked full-time on review and data entry, processing about 480 documents a day combined. Each document now processes in under 5 seconds.

02

Throughput equivalent to 17,000+ documents per person per day — a 14,000%+ increase in processing speed.

03

Manual workload for this task eliminated entirely, freeing staff for higher-value work.

Next project

EpiWriting