motovaultpro

egullickson/motovaultpro

Fork 0

Commit Graph

Author	SHA1	Message	Date
Eric Gullickson	653c535165	chore: add PDF support to receipt OCR pipeline (refs #182 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 38s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details The receipt extractor only accepted image MIME types, rejecting PDFs at the OCR layer. Added application/pdf to supported types and PDF-to-image conversion (first page at 300 DPI) before OCR preprocessing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 21:22:40 -06:00
Eric Gullickson	90401dc1ba	feat: add maintenance receipt extraction pipeline with Gemini + regex (refs #150 ) - New MaintenanceReceiptExtractor: Gemini-primary extraction with regex cross-validation for dates, amounts, and odometer readings - New maintenance_receipt_validation.py: cross-validation patterns for structured field confidence adjustment - New POST /extract/maintenance-receipt endpoint reusing ReceiptExtractionResponse model - Per-field confidence scores (0.0-1.0) with Gemini base 0.85, boosted/reduced by regex agreement Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 21:14:13 -06:00

Author

SHA1

Message

Date

Eric Gullickson

653c535165

chore: add PDF support to receipt OCR pipeline (refs #182 )

Deploy to Staging / Build Images (pull_request) Successful in 38s

Details

Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s

Details

Deploy to Staging / Verify Staging (pull_request) Successful in 8s

Details

Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s

Details

Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped

Details

The receipt extractor only accepted image MIME types, rejecting PDFs at
the OCR layer. Added application/pdf to supported types and PDF-to-image
conversion (first page at 300 DPI) before OCR preprocessing.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-13 21:22:40 -06:00

Eric Gullickson

90401dc1ba

feat: add maintenance receipt extraction pipeline with Gemini + regex (refs #150 )

- New MaintenanceReceiptExtractor: Gemini-primary extraction with regex
  cross-validation for dates, amounts, and odometer readings
- New maintenance_receipt_validation.py: cross-validation patterns for
  structured field confidence adjustment
- New POST /extract/maintenance-receipt endpoint reusing
  ReceiptExtractionResponse model
- Per-field confidence scores (0.0-1.0) with Gemini base 0.85,
  boosted/reduced by regex agreement

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-12 21:14:13 -06:00

2 Commits