NEWMindMap Digital has acquired Bluetide.co— deepening our data & agentic-AI stack.Read more →
Home · Customer Stories · Gulf Bank
BFSI · Middle East

DocuMage Cheque OCR at a Gulf Bank — 99.1% Field Accuracy on 10,000 Cheques per Day

Replacing a legacy template-based OCR engine with DocuMage's LLM-augmented IDP pipeline — and pushing 94% of cheques to straight-through processing.

99.1%
Field-level accuracy
12w
Delivery duration
Private Cloud
Deployment
4
Accelerators used
Private CloudGulf Bank — 99.1% Field-level accuracy
99.1%
Field-level OCR accuracy
10K
Cheques per day
94%
Straight-through rate
62%
Operations cost reduction
In this storyIDPOCRDocuMageBFSIPayments
01
The challenge

The challenge

The bank — a Tier-1 commercial lender headquartered in the Gulf, with operations across the GCC — was running a legacy cheque-truncation OCR engine licensed from a global vendor since 2014. The system was template-based: it relied on the cheque image conforming to a small set of pre-trained layouts, each with hard-coded field coordinates. When the GCC clearing system migrated to the new pan-regional cheque standard in 2024, the bank's straight-through rate collapsed from 78% to 41% overnight. Every non-matching cheque was being routed to a back-office team in Bahrain for manual data entry.

By the time we were engaged, the manual queue was averaging 5,800 cheques per day and the cost per processed cheque had increased 2.3x in eighteen months. The bank was paying the legacy vendor for an upgrade that was scheduled for delivery in eleven months. They wanted to know if there was a faster path.

Cheque processing is deceptively hard. Beyond reading the CMC-7 magnetic-ink line at the bottom of the cheque, the system has to read the date (in three or four different date formats depending on the issuing country), the courtesy amount (the numeric box), the legal amount (the long-hand written line — often in cursive Arabic or English), the payee name, the drawer signature, and any endorsements on the back. The legal-amount-versus-courtesy-amount mismatch is the single most common reason for human review in any cheque system on earth.

02
The approach

The approach

We deployed DocuMage (Dm), our flagship intelligent document processing platform, in a parallel-run configuration alongside the legacy OCR. For the first four weeks, every cheque that flowed through the existing system was also pushed through DocuMage, with no impact to production. The bank's operations team was given a side-by-side dashboard showing the legacy system's output, DocuMage's output, and the eventual human-confirmed truth.

By the end of week four, DocuMage's field-level accuracy on the bank's actual cheque traffic was 98.4%, versus the legacy system's 87.2%. We then ran a tuning sprint — fine-tuning the courtesy amount and legal amount extraction models on the bank's specific cheque corpus, including the Arabic cursive variations common in the GCC. By the end of the tuning sprint, field-level accuracy was 99.1% and end-to-end straight-through rate (where every field on the cheque passed validation without human touch) was 94%.

We also brought in two supporting accelerators. Signature Verifier (Sv) was deployed to perform on-cheque signature matching against the bank's customer signature card library, flagging signature anomalies for fraud review before payment release. Anomaly Detector (Ad) was wired into the post-OCR flow to flag patterns indicative of fraud — duplicate cheque numbers, sudden amount escalations on a single drawer account, or stylistic anomalies in the legal-amount handwriting.

Accelerators in this engagement

The pre-built building blocks

Rather than commission a ground-up build, the engagement leaned on MindMap's pre-built accelerator library — production-tested components that compress what would otherwise be a six-to-nine-month build into weeks.

Dm

DocuMage

Core IDP pipeline — OCR, ICR, LLM extraction, exception management

Sv

Signature Verifier

Signature-card matching for fraud control

Ad

Anomaly Detector

Behavioural fraud pattern detection

St

Straight-Through Processor

Confidence-routed STP for the clearing engine

03
The architecture

The architecture

DocuMage runs on a private cloud tenant inside the bank's existing Azure regional deployment, which the bank's regulator has approved for processing payment data. The platform is deployed as a Kubernetes-native stack on AKS, with autoscaling configured to handle the morning processing peak — cheque volumes typically triple between 09:00 and 11:00 local time.

The OCR pipeline is layered. The first pass uses Microsoft Azure Document Intelligence (Form Recognizer) for base text extraction and field localisation. The second pass uses our own fine-tuned ICR model — a vision transformer fine-tuned on roughly 1.4 million cheque images from comparable banking deployments — to handle the handwritten courtesy and legal amount fields. The third pass uses GPT-4o Vision in a constrained, schema-validated prompt to reconcile the courtesy and legal amounts and resolve any disagreement between the OCR and ICR passes. The output from all three passes is fed into a confidence aggregator that emits a single confidence score per field.

Cheques where every field exceeds a 96% confidence threshold are pushed straight to the clearing engine via the bank's existing ESB. Cheques with at least one field below threshold land in an exception queue, where they are presented to a human reviewer with the original image, the extracted fields, and the model's confidence scores. The reviewer's correction is fed back into a continuous-learning loop that retrains the field-extraction models on a weekly cadence.

Integration with the bank's existing Finacle core happens through the bank's ESB layer using a dedicated cheque-clearing connector we built. The connector enforces the bank's existing pre-clearing validation rules, including drawer-account status checks, stop-payment instruction lookups, and same-day clearing windows.

The outcomes

The numbers behind the story

99.1%
Field-level OCR accuracy
10K
Cheques per day
94%
Straight-through rate
62%
Operations cost reduction

At steady state, DocuMage processes between 9,500 and 11,200 cheques per day across the bank's footprint. Field-level OCR accuracy sits at 99.1% on a rolling thirty-day window. Straight-through processing rate is 94%, meaning only 6% of cheques require any human touch — and most of those touches are confirmation clicks rather than data re-entry.

The back-office team in Bahrain has been redeployed. Twelve FTEs who were full-time on cheque data entry are now working on fraud review, exception handling and customer-facing case management. Cost per processed cheque has fallen 62%.

An unexpected outcome: the fraud-detection module has caught 47 cases of cheque fraud in the first nine months that would have cleared under the legacy system. The bank's fraud-loss provision for retail cheque transactions has been reduced.

The legacy OCR vendor's upgrade — originally scheduled for delivery six months after our go-live — has been cancelled.

We were told an eleven-month upgrade to fix our cheque OCR. MindMap had us at 94% straight-through processing in twelve weeks, on a platform we now control. The morning processing peak is a non-event. The back-office team is doing work that actually matters.
Head of Payment Operations· Gulf Bank
04
Why MindMap was chosen

Why MindMap was chosen

The bank had been quoted an eleven-month timeline by the incumbent vendor for an upgrade that would, at best, restore the previous 78% straight-through rate. They were not confident the upgrade would solve the underlying template-rigidity problem — the new pan-regional cheque standard was unlikely to be the last format change.

MindMap proposed a deployment in twelve weeks with a contractual commitment to exceed 90% straight-through and 98% field accuracy. The parallel-run approach meant the bank could see the system's actual performance on their actual traffic before committing to cutover. The pre-built DocuMage platform — already deployed in three other GCC banks — meant the bank was not commissioning new software, just new tuning.

Critically, our approach was format-agnostic by design. The LLM extraction layer in DocuMage does not depend on hard-coded field coordinates; it understands what a cheque is and where its fields are likely to be, even on novel layouts. The bank now has a system that will adapt to the next cheque-standard change without a vendor upgrade cycle.

Want an outcome like this?

Start with a 2-week AI Readiness Sprint. We deliver a prioritised use-case backlog and business case grounded in what's actually buildable with our accelerator library.

Book a walkthrough →Explore BFSI
Talk to the product team