Extract claims forms to structured JSON
Claims forms like CMS-1500, UB-04, and ADA pack hundreds of fields into dense gridded layouts for healthcare reimbursement. A single wrong character means a denied claim. Use Sensible to get structured claims data for adjudication, compliance, and analytics.
Why claims forms challenge extraction tools
Dense field grids, tiny print, and strict positional requirements make claims forms demanding.
Thirty-three numbered boxes on a single page. Sub-fields, checkboxes, multi-line entries crammed into tiny cells. Each box position is mapped precisely, with extraction anchored to the CMS-1500's physical layout.
ICD-10 codes, CPT codes, and NPI numbers are sequences where one wrong character changes the clinical meaning. Validation rules check format compliance on every code field, catching OCR misreads before they propagate.
CMS-1500 for professional claims. UB-04 for institutional. ADA for dental. Each form has its own box numbering and field semantics. Sensible detects the form type and applies the correct extraction configuration automatically.
Fields we extract
Box-level extraction maps to your claims adjudication schema. Add custom validation rules as needed.
Patient name, DOB, member ID, group number, provider name, NPI, tax ID, referring provider, facility name/NPI
ICD-10 codes (primary through quaternary), CPT/HCPCS codes, modifiers, place of service, dates of service, units, charges
Total charges, amount paid, balance due, prior authorization number, assignment of benefits, signature, employer information
American Dental Association standard claim form for dental services.
Institutional healthcare claim form used by hospitals and inpatient facilities.
Standard professional healthcare claim form used by physicians and outpatient facilities.
Supported claims form types
Pre-built configurations cover CMS-1500 and UB-04. New claims form types can be configured in hours. Hybrid extraction handles both the structured box fields and any free-text clinical sections.
CMS-1500 (professional), UB-04/CMS-1450 (institutional), ADA Dental Claim Form, pharmacy claims (NCPDP)
ACORD claims forms, state-specific first report of injury, workers comp forms, auto injury claims



Common Questions
Answers about CMS-1500 and UB-04 extraction, code validation, and form type detection.
Sensible captures date of service, place of service, CPT/HCPCS code, modifiers, diagnosis pointers, charges, units, and rendering provider NPI for each service line.
Sensible extracts ICD-10 diagnosis codes, diagnosis pointers, date of onset, and related cause indicators. For CMS-1500, all 12 diagnosis code fields are captured.
Yes. Sensible extracts billing provider name, NPI, tax ID, address, rendering provider, referring provider, and facility information from both CMS-1500 and UB-04 forms.
Yes. Sensible has pre-built configurations for CMS-1500 (professional claims) and UB-04 (institutional claims). Each form's unique field layout is mapped precisely.
Yes. Sensible sends extraction results to your webhook endpoint when processing completes. You can also poll the API for status.
Yes. Sensible flags extractions with low confidence for human review. You can configure review thresholds and workflows.
Sensible is SOC 2 Type II certified and HIPAA compliant. Data is encrypted in transit and at rest.
Document data is stored indefinitely by default. Custom retention policies are available and can be configured for same-day deletion if needed.
Yes. Sensible offers a 14-day free trial on the Growth plan. No credit card required to start.
Sensible uses per-document pricing for predictable costs. No token-based billing or usage surprises. Volume discounts are available for higher throughput.
Sensible provides REST APIs and SDKs for Python and Node.js. Most integrations take a few hours. Webhooks, Zapier, and direct API calls are all supported.
Sensible processes PDFs (native or scanned), Microsoft Word (DOC, DOCX), spreadsheets (XLSX, XLS, CSV), single-page images (JPEG, PNG), multi-page images (TIFF), and email bodies with attachments.
Accuracy depends on document quality and configuration. Most production deployments achieve 95%+ accuracy with proper validation rules and confidence signals.
Processing speed depends on document size, page count, OCR requirements, and which extraction methods are used. Simple single-page documents process in seconds. Larger or more complex documents that use LLM-based extraction take longer.
