motovaultpro

Author	SHA1	Message	Date
Eric Gullickson	ab0d8463be	docs: update CLAUDE.md indexes and README for OCR expansion (refs #137 ) Add/update documentation across backend, Python OCR service, and frontend for receipt scanning, manual extraction, and Gemini integration. Create new CLAUDE.md files for engines/, fuel-logs/, documents/, and maintenance/ features. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 11:04:19 -06:00
Eric Gullickson	40df5e5b58	feat: add frontend manual extraction flow with review screen (refs #136 ) - Create useManualExtraction hook: submit PDF to OCR, poll job status, track progress - Create useCreateSchedulesFromExtraction hook: batch create maintenance schedules from extraction - Create MaintenanceScheduleReviewScreen: dialog with checkboxes, inline editing, batch create - Update DocumentForm: remove "(Coming soon)", trigger extraction after upload, show progress - Add 12 unit tests for review screen (rendering, selection, empty state, errors) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 10:48:46 -06:00
Eric Gullickson	a281cea9c5	feat: add backend OCR manual proxy endpoint (refs #135 ) Add POST /api/ocr/extract/manual endpoint that proxies to the Python OCR service's manual extraction pipeline. Includes Pro tier gating via document.scanMaintenanceSchedule, PDF-only validation, 200MB file size limit, and async 202 job response for polling via existing job status endpoint. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 10:37:18 -06:00
Eric Gullickson	57ed04d955	feat: rewrite ManualExtractor to use Gemini engine (refs #134 ) Replace traditional OCR pipeline (table_detector, table_parser, maintenance_patterns) with GeminiEngine for semantic PDF extraction. Map Gemini serviceName values to 27 maintenance subtypes via ServiceMapper fuzzy matching. Add 8 unit tests covering normal extraction, unusual names, empty response, and error handling. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 10:24:11 -06:00
Eric Gullickson	3705e63fde	feat: add Gemini engine module and configuration (refs #133 ) Add standalone GeminiEngine class for maintenance schedule extraction from PDF owners manuals using Vertex AI Gemini 2.5 Flash with structured JSON output enforcement, 20MB size limit, and lazy initialization. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 10:00:47 -06:00
Eric Gullickson	d8dec64538	feat: add station matching from receipt merchant name (refs #132 ) Add Google Places Text Search to match receipt merchant names (e.g. "Shell", "COSTCO #123") to real gas stations. Backend exposes POST /api/stations/match endpoint. Frontend calls it after OCR extraction and pre-fills locationData with matched station's placeId, name, and address. Users can clear the match in the review modal. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 09:45:13 -06:00
Eric Gullickson	bc91fbad79	feat: add tier gating for receipt scan in FuelLogForm (refs #131 ) Free tier users see locked button with upgrade prompt dialog. Pro+ users can capture receipts normally. Works on mobile and desktop. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 09:32:08 -06:00
Eric Gullickson	399313eb6d	feat: update useReceiptOcr to call /ocr/extract/receipt endpoint (refs #131 ) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 09:30:02 -06:00
Eric Gullickson	dfc3924540	feat: add fuelLog.receiptScan tier gating with pro minTier (refs #131 ) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 09:29:48 -06:00
Eric Gullickson	e0e578a627	feat: add receipt extraction proxy endpoint (refs #130 ) Add POST /api/ocr/extract/receipt endpoint that proxies to the Python OCR service's /extract/receipt for receipt-specific field extraction. - ReceiptExtractionResponse type with receiptType, extractedFields, rawText - OcrClient.extractReceipt() with optional receipt_type form field - OcrService.extractReceipt() with 10MB max, image-only validation - OcrController.extractReceipt() with file upload and error mapping - Route with auth middleware - 9 unit tests covering normal, edge, and error scenarios Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 09:26:57 -06:00
egullickson	e98b45eb3a	Merge pull request 'feat: Google Vision primary OCR with Auth0 WIF and monthly usage cap (#127 )' (#128 ) from issue-127-google-vision-primary-ocr into main All checks were successful Deploy to Staging / Build Images (push) Successful in 34s Details Deploy to Staging / Deploy to Staging (push) Successful in 51s Details Deploy to Staging / Verify Staging (push) Successful in 8s Details Deploy to Staging / Notify Staging Ready (push) Successful in 7s Details Deploy to Staging / Notify Staging Failure (push) Has been skipped Details Reviewed-on: #128	2026-02-11 01:46:20 +00:00
Eric Gullickson	91dc847f56	fix: use correct Auth0 US region domain in WIF token script (refs #127 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 34s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details Domain was motovaultpro.auth0.com (404) instead of motovaultpro.us.auth0.com. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 18:44:30 -06:00
Eric Gullickson	7bba28154d	fix: capture Auth0 error response in WIF token script (refs #127 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 35s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 9s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details The set -e + curl --fail-with-body inside $() caused the script to exit with code 22 and empty stderr, hiding the actual Auth0 error. Switch to writing the body to a temp file and checking HTTP status manually so the error response is visible in logs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 18:41:34 -06:00
Eric Gullickson	a416f76c21	fix: copy WIF config to deploy path in CI/CD workflows (refs #127 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 35s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details The google-wif-config.json was never synced to the deploy path, so the Docker bind mount created a directory artifact instead of a file. Vision client initialization failed on every request, silently falling back to PaddleOCR. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 18:34:41 -06:00
Eric Gullickson	e6dd7492a1	test: add monthly limit, counter, and cloud-primary engine tests (refs #127 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 8m46s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details - Update existing hybrid engine tests for new Redis counter behavior - Add cloud-primary path tests (under/at limit, fallback, errors) - Add Redis counter increment and TTL verification tests - Add Redis failure graceful handling test - Update cloud engine error message assertion for WIF config Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 20:56:51 -06:00
Eric Gullickson	f4a28d009f	feat: update all Docker Compose files for Vision primary with WIF auth (refs #127 ) - Switch OCR engine config to google_vision primary / paddleocr fallback - Mount Auth0 OCR secrets and WIF config into all OCR containers - Add WIF config to repo (not a secret, contains no credentials) - Remove obsolete google-vision-key.json.example Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 20:53:44 -06:00
Eric Gullickson	5e4848c4e2	feat: add Auth0 OCR secrets to injection script and CI/CD workflows (refs #127 ) - Add AUTH0_OCR_CLIENT_ID and AUTH0_OCR_CLIENT_SECRET to inject-secrets.sh - Add new secrets to staging and production workflow env blocks - Create .example files for new secret documentation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 20:52:29 -06:00
Eric Gullickson	9209739e75	feat: add Auth0 WIF token script and update Dockerfile (refs #127 ) - Create fetch-auth0-token.sh for Auth0 M2M -> GCP WIF token exchange - Add jq to Dockerfile system dependencies - Ensure script is executable in container image Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 20:51:30 -06:00
Eric Gullickson	4abd7d8d5b	feat: add Vision monthly cap, WIF auth, and cloud-primary hybrid engine (refs #127 ) - Add VISION_MONTHLY_LIMIT config setting (default 1000) - Update CloudEngine to use WIF credential config via ADC - Rewrite HybridEngine to support cloud-primary with Redis counter - Pass monthly_limit through engine factory Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 20:50:02 -06:00
Eric Gullickson	4412700e12	fix: use valid Redis log levels and add log level comments to all containers All checks were successful Deploy to Staging / Build Images (push) Successful in 33s Details Deploy to Staging / Deploy to Staging (push) Successful in 22s Details Deploy to Staging / Verify Staging (push) Successful in 8s Details Deploy to Staging / Notify Staging Ready (push) Successful in 8s Details Deploy to Staging / Notify Staging Failure (push) Has been skipped Details Redis only supports debug\|verbose\|notice\|warning -- not info or error. The command was using ${LOG_LEVEL:-info} which resolved to INFO in production (from workflow env), causing Redis to crash loop. Hardcode the correct Redis-native levels (debug for dev, warning for prod) and add available log level comments above every container's log setting. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 21:27:33 -06:00
Eric Gullickson	c6b99ab29a	fix: Postgres Fixes for Prod All checks were successful Deploy to Staging / Build Images (push) Successful in 1m34s Details Deploy to Staging / Deploy to Staging (push) Successful in 23s Details Deploy to Staging / Verify Staging (push) Successful in 2m36s Details Deploy to Staging / Notify Staging Ready (push) Successful in 8s Details Deploy to Staging / Notify Staging Failure (push) Has been skipped Details	2026-02-08 20:57:49 -06:00
egullickson	8248b1a732	Merge pull request 'feat: Improve VIN decode confidence reporting and make/model/trim editability (#125 )' (#126 ) from issue-125-improve-vin-confidence-editability into main All checks were successful Deploy to Staging / Build Images (push) Successful in 33s Details Deploy to Staging / Deploy to Staging (push) Successful in 51s Details Deploy to Staging / Verify Staging (push) Successful in 9s Details Deploy to Staging / Notify Staging Ready (push) Successful in 7s Details Deploy to Staging / Notify Staging Failure (push) Has been skipped Details Reviewed-on: #126	2026-02-09 01:40:14 +00:00
Eric Gullickson	e9020dbb2f	feat: improve VIN confidence reporting and editable review dropdowns (refs #125 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 4m37s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 9s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details VIN OCR confidence now reflects recognition accuracy only (not match quality). Review modal replaces read-only fields with editable cascade dropdowns pre-populated from NHTSA decode, with NHTSA reference hints for unmatched fields. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 19:24:27 -06:00
Eric Gullickson	e7471d5c27	fix: Python Image Pinning All checks were successful Deploy to Staging / Build Images (push) Successful in 8m28s Details Deploy to Staging / Deploy to Staging (push) Successful in 22s Details Deploy to Staging / Verify Staging (push) Successful in 8s Details Deploy to Staging / Notify Staging Ready (push) Successful in 8s Details Deploy to Staging / Notify Staging Failure (push) Has been skipped Details	2026-02-08 19:11:13 -06:00
Eric Gullickson	2c3e432fcf	fix: Build errors with python3.13 All checks were successful Deploy to Staging / Build Images (push) Successful in 8m50s Details Deploy to Staging / Deploy to Staging (push) Successful in 23s Details Deploy to Staging / Verify Staging (push) Successful in 8s Details Deploy to Staging / Notify Staging Ready (push) Successful in 7s Details Deploy to Staging / Notify Staging Failure (push) Has been skipped Details	2026-02-08 18:54:49 -06:00
egullickson	ee123a2ffd	Merge pull request 'feat: Improve VIN photo capture camera crop (#123 )' (#124 ) from issue-123-improve-vin-camera-crop into main Some checks failed Deploy to Staging / Deploy to Staging (push) Has been cancelled Details Deploy to Staging / Build Images (push) Has been cancelled Details Deploy to Staging / Verify Staging (push) Has been cancelled Details Deploy to Staging / Notify Staging Ready (push) Has been cancelled Details Deploy to Staging / Notify Staging Failure (push) Has been cancelled Details Reviewed-on: #124	2026-02-09 00:36:43 +00:00
Eric Gullickson	1ff1931864	fix: re-request camera stream on retake when tracks are inactive (refs #123 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 3m20s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 9s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details The retake button failed because the stream tracks could become inactive during the crop phase, but handleRetake never re-acquired the camera. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 20:26:37 -06:00
Eric Gullickson	efc55cd3db	feat: improve VIN camera crop overlay-to-crop alignment and touch targets (refs #123 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 3m20s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 9s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details Bridge guidance overlay position to crop tool initial coordinates so the crop box appears centered matching the viewfinder guide. Increase handle touch targets to 44px (32px on compact viewports) for mobile usability. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 20:05:40 -06:00
egullickson	dd77cb3836	Merge pull request 'feat: Improve OCR process - replace Tesseract with PaddleOCR (#115 )' (#122 ) from issue-115-improve-ocr-paddleocr into main All checks were successful Deploy to Staging / Build Images (push) Successful in 36s Details Deploy to Staging / Deploy to Staging (push) Successful in 51s Details Deploy to Staging / Verify Staging (push) Successful in 9s Details Deploy to Staging / Notify Staging Ready (push) Successful in 7s Details Deploy to Staging / Notify Staging Failure (push) Has been skipped Details Mirror Base Images / Mirror Base Images (push) Successful in 51s Details Reviewed-on: #122	2026-02-08 01:13:33 +00:00
Eric Gullickson	9a2b12c5dc	fix: No matches All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 37s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details	2026-02-07 16:35:28 -06:00
Eric Gullickson	3adbb10ff6	fix: OCR Timout still All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 3m23s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details	2026-02-07 16:26:10 -06:00
Eric Gullickson	fcffb0bb43	fix: PaddleOCR timeout All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 3m20s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s Details Deploy to Staging / Verify Staging (pull_request) Successful in 9s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details	2026-02-07 16:18:14 -06:00
Eric Gullickson	9d2d4e57b7	fix: PaddleOCR error All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 36s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 52s Details Deploy to Staging / Verify Staging (pull_request) Successful in 9s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details	2026-02-07 16:12:07 -06:00
Eric Gullickson	0499c902a8	fix: Crop box broken All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 3m22s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details	2026-02-07 16:00:23 -06:00
Eric Gullickson	dab4a3bdf3	fix: PaddleOCR error All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 3m46s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details	2026-02-07 15:51:04 -06:00
Eric Gullickson	639ca117f1	fix: Update PaddleOCR API All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 5m6s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details	2026-02-07 14:44:06 -06:00
Eric Gullickson	b9fe222f12	fix: Build errors and tesseract removal Some checks failed Deploy to Staging / Build Images (pull_request) Failing after 4m14s Details Deploy to Staging / Deploy to Staging (pull_request) Has been skipped Details Deploy to Staging / Verify Staging (pull_request) Has been skipped Details Deploy to Staging / Notify Staging Ready (pull_request) Has been skipped Details Deploy to Staging / Notify Staging Failure (pull_request) Successful in 8s Details	2026-02-07 12:12:04 -06:00
Eric Gullickson	cf114fad3c	fix: build errors for OpenCV Some checks failed Deploy to Staging / Build Images (pull_request) Failing after 3m16s Details Deploy to Staging / Deploy to Staging (pull_request) Has been skipped Details Deploy to Staging / Verify Staging (pull_request) Has been skipped Details Deploy to Staging / Notify Staging Ready (pull_request) Has been skipped Details Deploy to Staging / Notify Staging Failure (pull_request) Successful in 8s Details	2026-02-07 11:58:00 -06:00
Eric Gullickson	47c5676498	chore: update OCR tests and documentation (refs #121 ) Some checks failed Deploy to Staging / Build Images (pull_request) Failing after 7m4s Details Deploy to Staging / Deploy to Staging (pull_request) Has been skipped Details Deploy to Staging / Verify Staging (pull_request) Has been skipped Details Deploy to Staging / Notify Staging Ready (pull_request) Has been skipped Details Deploy to Staging / Notify Staging Failure (pull_request) Successful in 7s Details Add engine abstraction tests and update docs to reflect PaddleOCR primary architecture with optional Google Vision cloud fallback. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 11:42:51 -06:00
Eric Gullickson	1e96baca6f	fix: workflow contract	2026-02-07 11:32:36 -06:00
Eric Gullickson	3c1a090ae3	fix: resolve crop tool regression with stale ref and aspect ratio minSize (refs #120 ) Three bugs fixed in the draw-first crop tool introduced by PR #114: 1. Stale cropAreaRef: replaced useEffect-based ref sync with direct synchronous updates in handleMove and handleDrawStart. The useEffect ran after browser paint, so handleDragEnd read stale values (often {width:0, height:0}), preventing cropDrawn from being set. 2. Aspect ratio minSize: when aspectRatio=6 (VIN mode), height=width/6 required width>=60% to pass the height>=10% check. Now only checks width>=minSize when aspect ratio constrains height. 3. Bounds clamping: aspect-ratio-forced height could push crop area past 100% of container. Now clamps y position to keep within bounds. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 11:29:16 -06:00
Eric Gullickson	9b6417379b	chore: update Docker and compose files for PaddleOCR engine (refs #119 ) - Replace libtesseract-dev with libgomp1 (OpenMP for PaddlePaddle) - Pre-download PP-OCRv4 models during Docker build - Add OCR engine env vars to all compose files (base, staging, prod) - Add optional Google Vision secret mount (commented, enable on demand) - Create google-vision-key.json.example placeholder Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 11:17:44 -06:00
Eric Gullickson	4ef942cb9d	feat: add optional Google Vision cloud fallback engine (refs #118 ) CloudEngine wraps Google Vision TEXT_DETECTION with lazy init. HybridEngine runs primary engine, falls back to cloud when confidence is below threshold. Disabled by default (OCR_FALLBACK_ENGINE=none). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 11:12:08 -06:00
Eric Gullickson	013fb0c67a	feat: migrate VIN/receipt extractors and OCR service to engine abstraction (refs #117 ) Replace direct pytesseract calls with OcrEngine interface in vin_extractor.py, receipt_extractor.py, and ocr_service.py. PSM mode fallbacks replaced with engine-agnostic single-line/single-word configs. Dead _process_ocr_data removed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 10:56:27 -06:00
Eric Gullickson	ebc633fb36	feat: add OCR engine abstraction layer (refs #116 ) Introduce pluggable OcrEngine ABC with PaddleOCR PP-OCRv4 as primary engine and Tesseract wrapper for backward compatibility. Engine factory reads OCR_PRIMARY_ENGINE config to instantiate the correct engine. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 10:47:40 -06:00
egullickson	6b0c18a41c	Merge pull request 'fix: VIN OCR scanning fails with "No VIN Pattern found" on all images (#113 )' (#114 ) from issue-113-fix-vin-ocr-scanning into main All checks were successful Deploy to Staging / Build Images (push) Successful in 35s Details Deploy to Staging / Deploy to Staging (push) Successful in 21s Details Deploy to Staging / Verify Staging (push) Successful in 8s Details Deploy to Staging / Notify Staging Ready (push) Successful in 7s Details Deploy to Staging / Notify Staging Failure (push) Has been skipped Details Reviewed-on: #114	2026-02-07 15:47:35 +00:00
Eric Gullickson	75ce316aa5	chore: Change crop to remove locked aspect ratio All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 3m21s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 22s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details	2026-02-06 22:15:39 -06:00
Eric Gullickson	e4336ce9da	fix: extract VIN from noisy OCR via sliding window + char deletion (refs #113 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 37s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details When OCR reads extra characters (e.g. sticker border as 'C', spurious 'Z' insertion), the raw text exceeds 17 chars and the old first-17 trim produced wrong VINs. New strategy tries all 17-char sliding windows and single/double character deletions, validating each via check digit. For 'CWVGGNPE2Z4NP069500', this finds the correct VIN 'WVGGNPE24NP069500' (valid check digit) instead of 'CWVGGNPE2Z4NP0695' (invalid). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 22:00:07 -06:00
Eric Gullickson	432b3bda36	fix: remove char whitelist incompatible with Tesseract LSTM (refs #113 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 36s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details tessedit_char_whitelist does not work with OEM 1 (LSTM engine) and causes empty/erratic output. This was the root cause of Tesseract returning empty text despite clear, well-preprocessed images. Character filtering is already handled post-OCR by the VIN validator's correct_ocr_errors() method (I->1, O->0, Q->0, etc). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 21:52:08 -06:00
Eric Gullickson	ae5221c759	fix: invert min-channel so Tesseract gets dark-on-light text (refs #113 ) All checks were successful Deploy to Staging / Build Images (pull_request) Successful in 35s Details Deploy to Staging / Deploy to Staging (pull_request) Successful in 51s Details Deploy to Staging / Verify Staging (pull_request) Successful in 8s Details Deploy to Staging / Notify Staging Ready (pull_request) Successful in 7s Details Deploy to Staging / Notify Staging Failure (pull_request) Has been skipped Details The min-channel correctly extracts contrast (white text=255 vs green sticker bg=130), but Tesseract expects dark text on light background. Without inversion, the grayscale-only path returned empty text for every PSM mode because Tesseract couldn't see bright-on-dark text. Invert via bitwise_not: text becomes 0 (black), sticker bg becomes 125 (gray). Fixes all three OCR paths (adaptive, grayscale, Otsu). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 21:39:48 -06:00

1 2 3 4 5 ...

536 Commits