Arsalan Younus.
Back to Projects

Interpage Document Correlation

Automated cross-page entity validation for multi-page medical and insurance forms, catching inconsistencies before they reach downstream systems.

The Business Problem

Multi-page forms spread data across many pages; inconsistencies led to wrong decisions and audit issues. Manual cross-checks did not scale.

Medical and insurance workflows require a clear audit trail and automatic resolution of conflicting values across pages.

The Technical Solution

I built a correlation and validation system that cross-references entities across pages using a rule engine and entity resolution logic. Inconsistencies are flagged for human review with a clear audit trail.

The system runs after OCR and codification, integrating with the extraction pipeline.

The Scalability Factor

Deployed on AWS with Docker alongside the extraction pipeline. Rule-based processing is stateless and scales horizontally with document volume.

Business Impact

39% fewer downstream errors from conflicting or missing cross-page data; clear audit trail for review.

Critical for multi-page medical and insurance forms; running in production alongside extraction.

Built with

NLP
Entity Resolution
Python
Rule Engine
AWS
Docker