Skip links

AI-Powered Policy Data Extraction
for a Leading Auto Insurer 

170+
consumer health & pharma
brands

94,000+
employees across the
globe

80+ countries
with active operations

Business Case

The client wanted to flag high-risk patients before symptoms, but data gaps and label issues blocked accurate models

Source Mapping

Vitals, and EHR feeds used different formats, so teams struggled to trace the true source for each metric

Cohort Labelling

No shared rule to tag “positive” patients, which slowed model setup and review

Class Balance

Positive cases formed only a small slice of the cohort, tilting early model results

Impact Delivered

80% less manual
entry time

Model fills every field the moment a
file arrives

90% fewer key-in
errors

to onboard new event types, from
weeks to days​

3x faster quoteturnaround

New agents issue bindable quotes in
under ten minutes

Our Solution

We inserted an AI extraction layer that drops clean data straight into core systems. Here are some of the features:

Event-driven ingest

An Azure function fires on each upload and routes files to processing

Custom Form Recognizer model

Captures names, addresses, vehicle details, VINs, and coverage in seconds

Python post-processing

Pandas, Regex, and NumPy tidy fields and build a structured JSON record

Secure API hand-off

JSON flows to policy-admin and quoting tools through azure API management

Serverless design

No hardware to maintain; scaling happens automatically

Tech Stack

Azure Form Recognizer

Python

Azure Functions

NumPy

Pandas

Azure API Management