Introducing email data extraction

Updated on
July 7, 2025
5
min read
Contributors
No items found.
Author
Introducing email data extraction
Table of contents
Turn documents into structured data
Get started free
Share this post

Today’s businesses receive large volumes of email attachments containing document data that needs processing. Whether you're processing lease applications in PropTech, handling insurance claims, or managing loan documentation in FinTech, manually extracting data from email attachments creates workflow bottlenecks and introduces errors.

We're excited to announce a significant addition to Sensible's document automation platform: email-driven document extraction. This new capability allows you to automatically extract structured data from document attachments simply by forwarding emails to Sensible. Using LLM-based classification, Sensible intelligently identifies and extracts data from any number of attachments per email, optionally parses the email body itself for additional data, and delivers comprehensive extraction results with metadata and download links via webhook. This feature transforms common manual business processes around emailed documents into an automated data extraction pipeline.

Streamlined Email-to-Data Pipeline

Document processing workflows often begin with emails containing attachments that need to be classified, extracted, and routed to downstream systems. With Sensible's new email extraction capabilities, this entire process is automated through a simple forwarding mechanism.

Here's how it works:

  1. You receive emails containing document attachments
  2. You forward them to Sensible’s  dedicated email processor address using your email filters
  3. Automatic classification and extraction happens behind the scenes

You receive structured data via webhook or view it in the Sensible dashboard



For example, after some initial configuration, you can forward emails like the following sample to Sensible:


And get back extraction data for each attachment as JSON via the Sensible app or webhook. For example, here’s the extracted data for an attached PNG of a driver’s license:


Powerful Use Cases for Email-Driven Extraction

This new capability opens up numerous automation opportunities across industries:

PropTech: Streamlined Lease Application Processing

Property management companies can automate the lease application workflow. When prospective tenants email their applications with attachments like pay stubs, driver's licenses, and lease agreements, Sensible can automatically extract key information such as:

  • Applicant personal information from driver's licenses
  • Income verification from pay stubs
  • Lease terms and conditions from signed agreements
  • Contact details and application metadata from email bodies

This transforms a manual, time-intensive process into an automated pipeline that can process applications in seconds rather than hours. 

For this use case, Sensible provides out-of-the-box support  to get you started immediately with the following documents:

Insurance: Claims Document Processing

Insurance companies receive many claim submissions via email with supporting documentation. Sensible can automatically extract information from:

  • Medical reports and bills for health insurance claims
  • Repair estimates and photos for auto insurance claims
  • Property damage assessments for homeowner's claims

For this use case, Sensible provides out-of-the-box support  to get you started immediately with the following documents:

Financial Services: Loan Documentation

Banks and lending institutions can automate processing emailed loan applications. Sensible can handle:

  • Tax documents and W2 forms for income verification
  • Bank statements for financial history analysis
  • Identity verification documents for compliance

For this use case, Sensible provides out-of-the-box support  to get you started immediately with the following documents:


How It Works: Email Processors and Document Types

Behind the scenes, Sensible's email extraction uses a classification and routing system. Email processors act as AI-based routing systems that first classify attachments, then extract structured data:

  1. Classify attachments against multiple document types using LLMs. For example, Sensible can classify lease application attachments  against driver_license, pay_stubs, and leases.
  2. Route each attachment to the appropriate extraction configuration
  3. Extract structured data using document-specific SenseML queries
  4. Return JSON results for each attachment plus the email body

For example, when processing a lease application email with three attachments, Sensible might classify them as:

  • attachment-1.pdf → driver_license document type → Extract name, address, license number
  • attachment-2.pdf → pay_stubs document type → Extract employer, salary, pay period
  • attachment-3.pdf → leases document type → Extract rent amount, lease terms, property address

Once Sensible classifies an attachment as a specific document type, it applies the corresponding extraction configuration to pull out structured data. Here's an example of a driver's license extraction configuration. This configuration uses Sensible's LLM-based Query Group method:



{ /* Sensible uses JSON5 to support in-line comments */
  "fields": [
    {
      "method": {
        "id": "queryGroup",
        "queries": [
          {
            // Use natural language to describe the target data. This description is an LLM prompt
            "id": "license_number",
            "description": "driver's license number, DL, DLN",
            // Apply pattern matching to ensure data quality, i.e. the drivers license must be a sequence of uppercase letters, digits, hyphens, and spaces that's at least 7 characters long
            "type": {
              "id": "custom",
              "pattern": "[A-Z0-9\\- ]{7,}"
            }
          },
           {
            "id": "first_name_value",
            "description": "first name of driver. sometimes found next to a label that reads FN or first name"
          }
          // Additional fields for address, DOB, physical characteristics, etc.
        ]
      }
    }
    
  ]
}

This configuration demonstrates how Sensible combines the power of large language models with structured extraction rules. The Query Group method uses natural language descriptions to locate and extract specific data points, while type definitions ensure the extracted data meets quality standards.


Getting Started with Email Extraction

Setting up email extraction involves three main steps:

1. Configure Document Types

Define document types for each attachment format you expect to receive. Sensible provides out-of-the-box support for common documents,, and can assist you in creating custom document types for your specific needs.

2. Set Up Email Filtering

Determine filtering criteria for emails you want to process automatically. For example, you might filter by:

  • Sender domain (e.g., @mycompany.com)
  • Subject line keywords (e.g., "lease application", "claim submission")
  • Specific recipient addresses (e.g., applications@yourcompany.com)

3. Create Your Email Processor

Contact Sensible to create your email processor with:

  • Document types for attachment classification and extraction
  • Webhook URLs for receiving extracted data
  • Unique forwarding email address

Once configured, you'll receive a dedicated email address like 87237966-5965-4019-97f2-66436947ccbb.residential_lease_applications@app.sensible.so for forwarding emails.

Seamless Integration with Existing Workflows

Email extraction integrates naturally with existing business processes. The extracted data can be:

  • Pushed to your systems via webhooks for immediate processing
  • Stored in databases for record keeping and analysis
  • Routed to approval workflows for human review when needed
  • Integrated with CRMs to update customer records automatically

This means teams can continue using their existing email workflows while gaining the benefits of automated document processing.

Start Automating Your Email Document Processing

Email-driven document extraction represents a step forward in making document automation accessible and practical for business workflows. By leveraging email communication, Sensible transforms a manual bottleneck into an automated advantage.

Ready to streamline your document processing workflows? Book a demo to see email extraction in action, or check out our managed services for customized implementation support. You can also sign up for an account and explore our comprehensive documentation to start building your own email-driven extraction workflows.

Frances Elliott
Frances Elliott
Turn documents into structured data
Get started free
Share this post

Turn documents into structured data

Stop relying on manual data entry. With Sensible, claim back valuable time, your ops team will thank you, and you can deliver a superior user experience. It’s a win-win.