Extract loan packages to structured JSON
Loan packages bundle dozens of document types into a single PDF: applications, pay stubs, bank statements, tax returns, and disclosures. Sensible classifies, splits, and extracts each document for automated underwriting, compliance checks, and loan boarding.
Why loan packages demand document intelligence
Hundreds of pages. 20+ document types. No table of contents. Extraction starts with classification.
Loan packages arrive in no consistent order. A 200-page file mixes tax returns, pay stubs, W-2s, bank statements, and disclosures. Classification identifies each document type, then the matching extraction config takes over.
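As a rough illustration of the classify-then-split step, the sketch below groups consecutive pages that share a predicted label into document spans. It assumes page-level labels are already available as a list of strings. The function name `split_by_label` and the label values are hypothetical, and this is not Sensible's classifier or API.

```python
from itertools import groupby


def split_by_label(page_labels):
    """Group consecutive pages with the same label into spans.

    Returns a list of (label, first_page, last_page) tuples using
    zero-based, inclusive page indexes.
    """
    spans = []
    page = 0
    for label, run in groupby(page_labels):
        count = len(list(run))
        spans.append((label, page, page + count - 1))
        page += count
    return spans


# Hypothetical per-page labels for a seven-page package.
labels = [
    "tax_return", "tax_return",
    "pay_stub",
    "w2",
    "bank_statement", "bank_statement", "bank_statement",
]
spans = split_by_label(labels)
```

A label-only grouping like this merges two adjacent documents of the same type (for example, two consecutive pay stubs) into one span, so a real splitter also needs a signal for where a new document begins.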
A single loan package mixes clean digital PDFs with faxed copies, mobile photos of documents, and degraded scans. Some pages need layout-based extraction; others need LLM parsing to read through noise. Sensible's hybrid extraction and configurable OCR handle the full quality range without manual sorting.
Pay stubs, W-2s, bank statements, and disclosures each get their own extraction config, but every config outputs JSON validated against your target schema. When the borrower's name, income, and property address appear across ten document types, your pipeline can cross-check them programmatically because every extraction is structured the same way.
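A cross-check of this kind might look like the minimal sketch below. It assumes each extraction has already been loaded into a dict with a `document_type` and a flat `fields` mapping. That shape, the field names, and the `find_mismatches` helper are illustrative assumptions, not Sensible's output format.

```python
from collections import defaultdict


def normalize(value):
    """Lowercase and collapse whitespace so trivial differences don't count."""
    return " ".join(str(value).lower().split())


def find_mismatches(extractions, field_names):
    """Report fields whose normalized values disagree across documents.

    Returns {field: {normalized_value: [document_type, ...]}} for every
    field that has more than one distinct value.
    """
    mismatches = {}
    for field in field_names:
        seen = defaultdict(list)
        for doc in extractions:
            value = doc["fields"].get(field)
            if value is not None:
                seen[normalize(value)].append(doc["document_type"])
        if len(seen) > 1:
            mismatches[field] = dict(seen)
    return mismatches


# Hypothetical extractions from three documents in one package.
extractions = [
    {"document_type": "application",
     "fields": {"borrower_name": "Jane Q. Doe", "property_address": "12 Elm St"}},
    {"document_type": "pay_stub",
     "fields": {"borrower_name": "jane q. doe"}},
    {"document_type": "closing_disclosure",
     "fields": {"borrower_name": "Jane Doe", "property_address": "12 Elm St"}},
]
mismatches = find_mismatches(extractions, ["borrower_name", "property_address"])
```

Here only `borrower_name` is reported, because the closing disclosure omits the middle initial. Exact matching after normalization is deliberately strict; a production check would likely add fuzzy name matching and address standardization.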
Fields we extract
Fields span all document types in the package. The schema unifies borrower, loan, and compliance data.
Borrower name, SSN (masked), address, employer, income, assets, liabilities (from application and verification docs)
Loan amount, interest rate, loan type, term, amortization, LTV, DTI, property type, property address, appraised value
Closing costs, APR, TRID tolerance checks, conditions to close, title exceptions, insurance requirements
Business lending package including financials, guarantor docs, and collateral.
Small Business Administration loan documentation with SBA-specific forms.
Complete loan file with application, disclosures, verifications, and title documents.
Supported loan package types
Sensible handles conventional, FHA, VA, USDA, jumbo, and commercial loan packages. The classifier detects 20+ document types and routes each one to the right extraction configuration.
Conventional, FHA, VA, USDA, jumbo, HELOC, commercial, SBA, construction
Origination, underwriting, closing, post-closing QC, due diligence, securitization



Common Questions
Answers about document classification, portfolio processing, and multi-document extraction.
You can submit multiple loan package PDFs via the API and receive structured data for each. This supports portfolio due diligence, bulk onboarding, and audit workflows.
Sensible handles loan packages of any page count. Packages of 200+ pages are common in mortgage workflows. Processing time scales linearly with page count.
Sensible classifies each document within a loan package by type: tax returns, pay stubs, bank statements, W-2s, 1099s, VOEs, closing disclosures, appraisals, title reports, and more. You can add custom document types as needed.
Sensible splits a loan package PDF into its individual documents, such as pay stubs, tax returns, bank statements, and closing disclosures, by classifying the pages it contains. Each document is then extracted with the appropriate configuration.
Sensible sends extraction results to your webhook endpoint when processing completes. You can also poll the API for status.
Sensible flags low-confidence extractions for human review. You can configure review thresholds and workflows.
Sensible is SOC 2 Type II certified and HIPAA compliant. Data is encrypted in transit and at rest.
Document data is stored indefinitely by default. Custom retention policies are available and can be configured for same-day deletion if needed.
Sensible offers a 14-day free trial on the Growth plan. No credit card is required to start.
Sensible uses per-document pricing for predictable costs. No token-based billing or usage surprises. Volume discounts are available for higher throughput.
Sensible provides REST APIs and SDKs for Python and Node.js. Most integrations take a few hours. Webhooks, Zapier, and direct API calls are all supported.
Sensible processes PDFs (native or scanned), Microsoft Word (DOC, DOCX), spreadsheets (XLSX, XLS, CSV), single-page images (JPEG, PNG), multi-page images (TIFF), and email bodies with attachments.
Accuracy depends on document quality and configuration. Most production deployments achieve 95%+ accuracy with proper validation rules and confidence signals.
Processing speed depends on document size, page count, OCR requirements, and which extraction methods are used. Simple single-page documents process in seconds. Larger or more complex documents that use LLM-based extraction take longer.
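To make the human-review answer above concrete, here is a minimal sketch of threshold-based routing on the consuming side. It assumes each extracted field carries a numeric `confidence` between 0 and 1. That representation, the 0.8 default, and the `route_for_review` helper are assumptions for illustration; Sensible's actual confidence signals and review configuration may be expressed differently.

```python
def route_for_review(fields, threshold=0.8):
    """Split extracted fields into auto-accepted and needs-review groups.

    `fields` maps a field name to {"value": ..., "confidence": float}.
    Fields at or above `threshold` are accepted; the rest go to review.
    """
    accepted, needs_review = {}, {}
    for name, item in fields.items():
        target = accepted if item["confidence"] >= threshold else needs_review
        target[name] = item["value"]
    return accepted, needs_review


# Hypothetical extraction output with per-field confidence scores.
extracted = {
    "borrower_name": {"value": "Jane Q. Doe", "confidence": 0.97},
    "loan_amount": {"value": 425000, "confidence": 0.91},
    "appraised_value": {"value": 510000, "confidence": 0.55},
}
accepted, needs_review = route_for_review(extracted)
```

With these sample values, only `appraised_value` lands in the review queue. Raising the threshold sends more fields to a reviewer in exchange for fewer unreviewed errors.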
