feat: Owner's Manual OCR Pipeline (#71) #79
Reference in New Issue
Block a user
Delete Branch "issue-71-manual-ocr-pipeline"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Files Changed
New Files
ocr/app/patterns/maintenance_patterns.py- Mileage, time, fluid spec patternsocr/app/patterns/service_mapping.py- Service name to subtype mappingocr/app/preprocessors/pdf_preprocessor.py- PDF text/image extractionocr/app/table_extraction/detector.py- Table detection in images/textocr/app/table_extraction/parser.py- Table content parsingocr/app/extractors/manual_extractor.py- Main extraction orchestratorocr/tests/test_maintenance_patterns.py- Pattern matching testsocr/tests/test_service_mapping.py- Service mapping testsocr/tests/test_table_parser.py- Table parsing testsModified Files
ocr/app/models/schemas.py- Manual extraction response modelsocr/app/routers/extract.py- POST /extract/manual endpointocr/app/services/job_queue.py- Manual job queue methodsocr/requirements.txt- Added PyMuPDF dependencyTest Plan
Closes #71