chore: add PDF support to receipt OCR pipeline (refs #182)
All checks were successful
Deploy to Staging / Build Images (pull_request) Successful in 38s
Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s
Deploy to Staging / Verify Staging (pull_request) Successful in 8s
Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s
Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped
The receipt extractor only accepted image MIME types, so PDFs were rejected at the OCR layer. Added application/pdf to the supported types and a PDF-to-image conversion step (first page at 300 DPI) before OCR preprocessing.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@@ -281,9 +281,9 @@ async def extract_maintenance_receipt(
 - Gemini semantic field extraction from OCR text
 - Regex cross-validation for dates, amounts, odometer

-Supports HEIC, JPEG, PNG formats.
+Supports HEIC, JPEG, PNG, and PDF formats.

-- **file**: Maintenance receipt image file (max 10MB)
+- **file**: Maintenance receipt image or PDF file (max 10MB)

 Returns:
 - **receiptType**: "maintenance"
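
For reviewers, the MIME gate and conversion step described in the commit message could look roughly like the sketch below. This is not the code from the commit: the names `SUPPORTED_MIME_TYPES`, `pdf_first_page_to_png`, and `prepare_for_ocr` are hypothetical, and the choice of `pdf2image` (which needs poppler installed) is an assumption, since the commit does not say which library does the rendering.

```python
import io

# Hypothetical allow-list; the commit only says application/pdf was added
# alongside the existing image types (HEIC, JPEG, PNG).
SUPPORTED_MIME_TYPES = frozenset(
    {"image/heic", "image/jpeg", "image/png", "application/pdf"}
)
PDF_RENDER_DPI = 300  # resolution stated in the commit message


def pdf_first_page_to_png(pdf_bytes: bytes, dpi: int = PDF_RENDER_DPI) -> bytes:
    """Render only the first PDF page to PNG bytes.

    Assumption: pdf2image is the renderer. It is imported lazily so that
    image-only callers do not need it installed.
    """
    from pdf2image import convert_from_bytes  # third-party, requires poppler

    pages = convert_from_bytes(pdf_bytes, dpi=dpi, first_page=1, last_page=1)
    if not pages:
        raise ValueError("PDF contains no renderable pages")
    buffer = io.BytesIO()
    pages[0].save(buffer, format="PNG")
    return buffer.getvalue()


def prepare_for_ocr(data: bytes, mime_type: str) -> bytes:
    """Reject unsupported types; convert PDFs to an image before OCR."""
    # Drop any parameters such as "; charset=binary" and normalize case.
    normalized = mime_type.split(";", 1)[0].strip().lower()
    if normalized not in SUPPORTED_MIME_TYPES:
        raise ValueError(f"Unsupported file type: {mime_type}")
    if normalized == "application/pdf":
        return pdf_first_page_to_png(data)
    # Image uploads pass through unchanged to the existing preprocessing.
    return data
```

Rendering only the first page keeps the OCR input the same shape as a single uploaded photo, at the cost of ignoring any later pages of a multi-page receipt.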