motovaultpro

Author	SHA1	Message	Date
Eric Gullickson	5e4515da7c	fix: use PyMuPDF instead of pdf2image for PDF-to-image conversion (refs #182) pdf2image requires poppler-utils, which is not installed in the OCR container. PyMuPDF is already in requirements.txt and can render PDF pages to PNG at 300 DPI natively without extra system dependencies. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 21:34:17 -06:00
Eric Gullickson	013fb0c67a	feat: migrate VIN/receipt extractors and OCR service to engine abstraction (refs #117) Replace direct pytesseract calls with the OcrEngine interface in vin_extractor.py, receipt_extractor.py, and ocr_service.py. PSM mode fallbacks replaced with engine-agnostic single-line/single-word configs. Dead _process_ocr_data removed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 10:56:27 -06:00
Eric Gullickson	852c9013b5	feat: add core OCR API integration (refs #65) OCR Service (Python/FastAPI): POST /extract for synchronous OCR extraction; POST /jobs and GET /jobs/{job_id} for async processing; image preprocessing (deskew, denoise) for accuracy; HEIC conversion via pillow-heif; Redis job queue for async processing. Backend (Fastify): POST /api/ocr/extract as an authenticated proxy to OCR; POST /api/ocr/jobs for async job submission; GET /api/ocr/jobs/:jobId for job polling; multipart file upload handling; JWT authentication required. File size limits: 10MB sync, 200MB async. Processing time target: <3 seconds for typical photos. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 16:02:11 -06:00

3 Commits
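
The file size limits in commit 852c9013b5 (10MB sync, 200MB async) suggest a simple routing rule on the client side. The following is only a minimal sketch of that idea, not code from the repository: the function name `choose_ocr_endpoint`, the use of 1 MB = 1024 * 1024 bytes, and the choice to send every file within the sync limit to the synchronous endpoint are all assumptions made for illustration. Only the two endpoint paths and the two limits come from the commit message.

```python
# Hypothetical helper, not taken from the motovaultpro codebase.
# Assumption: "MB" in the commit message means 1024 * 1024 bytes.
SYNC_LIMIT_BYTES = 10 * 1024 * 1024    # 10MB limit for POST /api/ocr/extract
ASYNC_LIMIT_BYTES = 200 * 1024 * 1024  # 200MB limit for POST /api/ocr/jobs


def choose_ocr_endpoint(size_bytes: int) -> str:
    """Pick a backend OCR endpoint path based on the upload size.

    Assumption: files within the sync limit go to the synchronous
    endpoint, larger files within the async limit go to the job queue.
    """
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    if size_bytes <= SYNC_LIMIT_BYTES:
        return "/api/ocr/extract"
    if size_bytes <= ASYNC_LIMIT_BYTES:
        return "/api/ocr/jobs"
    raise ValueError("file exceeds the 200MB async upload limit")
```

Under these assumptions a 2MB photo would be routed to the synchronous endpoint and a 50MB scan to the job queue; a real client might also prefer the async path for smaller files when it cannot wait for the response.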