motovaultpro

Author	SHA1	Message	Date
Eric Gullickson	3705e63fde	feat: add Gemini engine module and configuration (refs #133) Add standalone GeminiEngine class for maintenance schedule extraction from PDF owners manuals using Vertex AI Gemini 2.5 Flash with structured JSON output enforcement, 20MB size limit, and lazy initialization. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 10:00:47 -06:00
Eric Gullickson	4abd7d8d5b	feat: add Vision monthly cap, WIF auth, and cloud-primary hybrid engine (refs #127) - Add VISION_MONTHLY_LIMIT config setting (default 1000) - Update CloudEngine to use WIF credential config via ADC - Rewrite HybridEngine to support cloud-primary with Redis counter - Pass monthly_limit through engine factory Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 20:50:02 -06:00
Eric Gullickson	9a2b12c5dc	fix: No matches	2026-02-07 16:35:28 -06:00
Eric Gullickson	9d2d4e57b7	fix: PaddleOCR error	2026-02-07 16:12:07 -06:00
Eric Gullickson	dab4a3bdf3	fix: PaddleOCR error	2026-02-07 15:51:04 -06:00
Eric Gullickson	639ca117f1	fix: Update PaddleOCR API	2026-02-07 14:44:06 -06:00
Eric Gullickson	b9fe222f12	fix: Build errors and tesseract removal	2026-02-07 12:12:04 -06:00
Eric Gullickson	47c5676498	chore: update OCR tests and documentation (refs #121) Add engine abstraction tests and update docs to reflect PaddleOCR primary architecture with optional Google Vision cloud fallback. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 11:42:51 -06:00
Eric Gullickson	4ef942cb9d	feat: add optional Google Vision cloud fallback engine (refs #118) CloudEngine wraps Google Vision TEXT_DETECTION with lazy init. HybridEngine runs primary engine, falls back to cloud when confidence is below threshold. Disabled by default (OCR_FALLBACK_ENGINE=none). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 11:12:08 -06:00
Eric Gullickson	013fb0c67a	feat: migrate VIN/receipt extractors and OCR service to engine abstraction (refs #117) Replace direct pytesseract calls with OcrEngine interface in vin_extractor.py, receipt_extractor.py, and ocr_service.py. PSM mode fallbacks replaced with engine-agnostic single-line/single-word configs. Dead _process_ocr_data removed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 10:56:27 -06:00
Eric Gullickson	ebc633fb36	feat: add OCR engine abstraction layer (refs #116) Introduce pluggable OcrEngine ABC with PaddleOCR PP-OCRv4 as primary engine and Tesseract wrapper for backward compatibility. Engine factory reads OCR_PRIMARY_ENGINE config to instantiate the correct engine. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 10:47:40 -06:00
Eric Gullickson	e4336ce9da	fix: extract VIN from noisy OCR via sliding window + char deletion (refs #113) When OCR reads extra characters (e.g. sticker border as 'C', spurious 'Z' insertion), the raw text exceeds 17 chars and the old first-17 trim produced wrong VINs. New strategy tries all 17-char sliding windows and single/double character deletions, validating each via check digit. For 'CWVGGNPE2Z4NP069500', this finds the correct VIN 'WVGGNPE24NP069500' (valid check digit) instead of 'CWVGGNPE2Z4NP0695' (invalid). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 22:00:07 -06:00
Eric Gullickson	432b3bda36	fix: remove char whitelist incompatible with Tesseract LSTM (refs #113) tessedit_char_whitelist does not work with OEM 1 (LSTM engine) and causes empty/erratic output. This was the root cause of Tesseract returning empty text despite clear, well-preprocessed images. Character filtering is already handled post-OCR by the VIN validator's correct_ocr_errors() method (I->1, O->0, Q->0, etc). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 21:52:08 -06:00
Eric Gullickson	ae5221c759	fix: invert min-channel so Tesseract gets dark-on-light text (refs #113) The min-channel correctly extracts contrast (white text=255 vs green sticker bg=130), but Tesseract expects dark text on light background. Without inversion, the grayscale-only path returned empty text for every PSM mode because Tesseract couldn't see bright-on-dark text. Invert via bitwise_not: text becomes 0 (black), sticker bg becomes 125 (gray). Fixes all three OCR paths (adaptive, grayscale, Otsu). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 21:39:48 -06:00
Eric Gullickson	63c027a454	fix: always use min-channel and add grayscale-only OCR path (refs #113) Two fixes: 1. Always use min-channel for color images instead of gated comparison that was falling back to standard grayscale (which has only 23% contrast for white-on-green VIN stickers). 2. Add grayscale-only OCR path (CLAHE + denoise, no thresholding) between adaptive and Otsu attempts. Tesseract's LSTM engine is designed to handle grayscale input directly and often outperforms binarized input where thresholding creates artifacts. Pipeline order: adaptive threshold → grayscale-only → Otsu threshold Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 21:32:52 -06:00
Eric Gullickson	a07ec324fe	fix: use min-channel grayscale and morphological cleanup for VIN OCR (refs #113) Replace std-based channel selection (which incorrectly picked green for green-tinted VIN stickers) with per-pixel min(B,G,R). White text stays 255 in all channels while colored backgrounds drop to their weakest channel value, giving 2x contrast improvement. Add morphological opening after thresholding to remove noise speckles from car body surface that were confusing Tesseract's page segmentation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 21:23:43 -06:00
Eric Gullickson	0de34983bb	fix: use best-contrast color channel for VIN preprocessing (refs #113) White text on green VIN stickers has only ~12% contrast in standard grayscale conversion because the green channel dominates luminance. The new _best_contrast_channel method evaluates each RGB channel's standard deviation and selects the one with highest contrast, giving ~2x improvement for green-tinted VIN stickers. Falls back to standard grayscale for neutral-colored images. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 21:14:56 -06:00
Eric Gullickson	ff3858f750	fix: add debug image saving gated on LOG_LEVEL=debug (refs #113) Save original, adaptive, and Otsu preprocessed images to /tmp/vin-debug/{timestamp}/ when LOG_LEVEL is set to debug. No images saved at info level. Volume mount added for access. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 20:26:06 -06:00
Eric Gullickson	d5696320f1	fix: align VIN OCR logging with unified logging design (refs #113) Replace filesystem-based debug system (VIN_DEBUG_DIR) with standard logger.debug() calls that flow through Loki when LOG_LEVEL=DEBUG. Use .env.logging variable for OCR LOG_LEVEL. Increase image capture quality to 0.95 for better OCR accuracy. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 19:36:35 -06:00
Eric Gullickson	6a4c2137f7	fix: resolve VIN OCR scanning failures on all images (refs #113) Root cause: Tesseract fragments VINs into multiple words but candidate extraction required continuous 17-char sequences, rejecting all results. Changes: - Fix candidate extraction to concatenate adjacent OCR fragments - Disable Tesseract dictionaries (VINs are not dictionary words) - Set OEM 1 (LSTM engine) for better accuracy - Add PSM 11 (sparse text) and PSM 13 (raw line) fallback modes - Add Otsu's thresholding as alternative preprocessing pipeline - Upscale small images to meet Tesseract's 300 DPI requirement - Remove incorrect B->8 and S->5 transliterations (valid VIN chars) - Fix pre-existing test bug in check digit expected value Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:57:14 -06:00
Eric Gullickson	87ee498af7	chore: update docs	2026-02-05 21:49:35 -06:00
Eric Gullickson	3eb54211cb	feat: add owner's manual OCR pipeline (refs #71) Implement async PDF processing for owner's manuals with maintenance schedule extraction: - Add PDF preprocessor with PyMuPDF for text/scanned PDF handling - Add maintenance pattern matching (mileage, time, fluid specs) - Add service name mapping to maintenance subtypes - Add table detection and parsing for schedule tables - Add manual extractor orchestrating the complete pipeline - Add POST /extract/manual endpoint for async job submission - Add Redis job queue support for manual extraction jobs - Add progress tracking during processing Processing pipeline: 1. Analyze PDF structure (text layer vs scanned) 2. Find maintenance schedule sections 3. Extract text or OCR scanned pages at 300 DPI 4. Detect and parse maintenance tables 5. Normalize service names and extract intervals 6. Return structured maintenance schedules with confidence scores Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 21:30:20 -06:00
Eric Gullickson	6319d50fb1	feat: add receipt OCR pipeline (refs #69) Implement receipt-specific OCR extraction for fuel receipts: - Pattern matching modules for date, currency, and fuel data extraction - Receipt-optimized image preprocessing for thermal receipts - POST /extract/receipt endpoint with field extraction - Confidence scoring per extracted field - Cross-validation of fuel receipt data - Unit tests for all pattern matchers Extracted fields: merchantName, transactionDate, totalAmount, fuelQuantity, pricePerUnit, fuelGrade Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 20:43:30 -06:00
Eric Gullickson	54cbd49171	feat: add VIN photo OCR pipeline (refs #67) Implement VIN-specific OCR extraction with optimized preprocessing: - Add POST /extract/vin endpoint for VIN extraction - VIN preprocessor: CLAHE, deskew, denoise, adaptive threshold - VIN validator: check digit validation, OCR error correction (I->1, O->0) - VIN extractor: PSM modes 6/7/8, character whitelist, alternatives - Response includes confidence, bounding box, and alternatives - Unit tests for validator and preprocessor - Integration tests for VIN extraction endpoint Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 19:31:36 -06:00
Eric Gullickson	852c9013b5	feat: add core OCR API integration (refs #65) OCR Service (Python/FastAPI): - POST /extract for synchronous OCR extraction - POST /jobs and GET /jobs/{job_id} for async processing - Image preprocessing (deskew, denoise) for accuracy - HEIC conversion via pillow-heif - Redis job queue for async processing Backend (Fastify): - POST /api/ocr/extract - authenticated proxy to OCR - POST /api/ocr/jobs - async job submission - GET /api/ocr/jobs/:jobId - job polling - Multipart file upload handling - JWT authentication required File size limits: 10MB sync, 200MB async Processing time target: <3 seconds for typical photos Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 16:02:11 -06:00
Eric Gullickson	1ba491144b	feat: add OCR service container (refs #64) Add Python-based OCR service container (mvp-ocr) as the 6th service: - Python 3.11-slim with FastAPI/uvicorn - Tesseract OCR with English language pack - pillow-heif for HEIC image support - opencv-python-headless for image preprocessing - Health endpoint at /health - Unit tests for health, HEIC support, and Tesseract availability Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 13:06:16 -06:00
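
Commit e4336ce9da describes recovering a VIN from over-long OCR text by trying 17-char sliding windows plus single and double character deletions, validating each candidate by check digit. The sketch below illustrates that idea only; it is not the repository's code. The function names `is_valid_vin` and `recover_vin_candidates` are hypothetical, and the validation is assumed to be the standard North American VIN check-digit calculation (position 9, weighted sum mod 11). It returns every candidate that validates, because roughly one in eleven arbitrary candidates passes the check by chance and the log does not say how the real extractor ranks them.

```python
from itertools import combinations

# Standard VIN transliteration values (I, O and Q never appear in a VIN).
_VALUES = {str(d): d for d in range(10)}
_VALUES.update(zip("ABCDEFGH", range(1, 9)))
_VALUES.update(zip("JKLMN", range(1, 6)))
_VALUES.update({"P": 7, "R": 9})
_VALUES.update(zip("STUVWXYZ", range(2, 10)))

# Positional weights; position 9 (index 8) is the check digit itself.
_WEIGHTS = (8, 7, 6, 5, 4, 3, 2, 10, 0, 9, 8, 7, 6, 5, 4, 3, 2)


def is_valid_vin(vin: str) -> bool:
    """Return True if vin has 17 legal characters and a matching check digit."""
    if len(vin) != 17 or any(ch not in _VALUES for ch in vin):
        return False
    remainder = sum(_VALUES[ch] * w for ch, w in zip(vin, _WEIGHTS)) % 11
    expected = "X" if remainder == 10 else str(remainder)
    return vin[8] == expected


def recover_vin_candidates(raw: str, max_deletions: int = 2) -> list[str]:
    """Hypothetical helper: collect check-digit-valid VINs hidden in raw OCR text.

    For each number of extra characters (0, 1, 2), slide a window of
    17 + extra characters over the text and try every way of deleting
    `extra` characters from that window.
    """
    text = "".join(ch for ch in raw.upper() if ch.isalnum())
    found: list[str] = []
    for extra in range(max_deletions + 1):
        width = 17 + extra
        for start in range(len(text) - width + 1):
            window = text[start:start + width]
            for drop in combinations(range(width), extra):
                candidate = "".join(
                    ch for i, ch in enumerate(window) if i not in drop
                )
                if is_valid_vin(candidate) and candidate not in found:
                    found.append(candidate)
    return found
```

Called on the string from the commit message, `recover_vin_candidates("CWVGGNPE2Z4NP069500")` includes `'WVGGNPE24NP069500'` among its results (reached by deleting the leading `C` and the spurious `Z`), while `is_valid_vin("CWVGGNPE2Z4NP0695")` is false, matching the behaviour the commit reports.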

26 Commits
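
Commit 4abd7d8d5b mentions a cloud-primary HybridEngine with a monthly cap (VISION_MONTHLY_LIMIT, default 1000) tracked by a Redis counter. The log does not show how that is implemented, so the following is only a minimal sketch of one way such a cap could work. Everything here is an assumption made for illustration: the class names `CloudPrimaryHybrid`, `InMemoryCounter` and `OcrResult`, the `vision:YYYY-MM` key format, modelling engines as plain callables, and using an in-memory dictionary in place of Redis so the example runs without external services.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Callable, Dict, Optional


@dataclass
class OcrResult:
    text: str
    confidence: float
    engine: str


# Assumption: an engine is any callable from image bytes to an OcrResult.
Engine = Callable[[bytes], OcrResult]


@dataclass
class InMemoryCounter:
    """Hypothetical stand-in for a Redis counter with INCR-like semantics."""
    counts: Dict[str, int] = field(default_factory=dict)

    def incr(self, key: str) -> int:
        self.counts[key] = self.counts.get(key, 0) + 1
        return self.counts[key]


@dataclass
class CloudPrimaryHybrid:
    """Use the cloud engine until the monthly cap is hit, then go local."""
    cloud: Engine
    local: Engine
    counter: InMemoryCounter
    monthly_limit: int = 1000  # mirrors the default named in the commit

    def recognize(self, image: bytes, today: Optional[date] = None) -> OcrResult:
        today = today or date.today()
        # One counter key per calendar month, so the cap resets monthly.
        key = f"vision:{today:%Y-%m}"
        if self.counter.incr(key) <= self.monthly_limit:
            return self.cloud(image)
        return self.local(image)
```

With `monthly_limit=2`, three calls in the same month would go to the cloud engine twice and the local engine once, and the first call in the following month would use the cloud engine again. A production version backed by Redis would likely also expire old keys and handle cloud errors, neither of which is covered here.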