AI-Powered Policy Data Extraction
for a Leading Auto Insurer
170+
consumer health & pharma
brands
94,000+
employees across the
globe
80+ countries
with active operations
Business Case
The client wanted to flag high-risk patients before symptoms, but data gaps and label issues blocked accurate models
Source Mapping
Vitals, and EHR feeds used different formats, so teams struggled to trace the true source for each metric
Cohort Labelling
No shared rule to tag “positive” patients, which slowed model setup and review
Class Balance
Positive cases formed only a small slice of the cohort, tilting early model results
80% less manual
entry time
Model fills every field the moment a
file arrives
90% fewer key-in
errors
to onboard new event types, from
weeks to days
3x faster quoteturnaround
New agents issue bindable quotes in
under ten minutes
Our Solution
We inserted an AI extraction layer that drops clean data straight into core systems. Here are some of the features:
Event-driven ingest
An Azure function fires on each upload and routes files to processing
Custom Form Recognizer model
Captures names, addresses, vehicle details, VINs, and coverage in seconds
Python post-processing
Pandas, Regex, and NumPy tidy fields and build a structured JSON record
Secure API hand-off
JSON flows to policy-admin and quoting tools through azure API management
Serverless design
No hardware to maintain; scaling happens automatically