feat: rewrite ManualExtractor progress to spec-aligned 10/50/95/100 pattern (refs #143)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
@@ -280,11 +280,9 @@ async def extract_manual(
|
||||
the time required for large documents.
|
||||
|
||||
Pipeline:
|
||||
1. Analyze PDF structure (text layer vs scanned)
|
||||
2. Find maintenance schedule sections
|
||||
3. Extract text or perform OCR on scanned pages
|
||||
4. Detect and parse maintenance tables
|
||||
5. Extract service intervals and fluid specifications
|
||||
1. Send entire PDF to Gemini for semantic extraction
|
||||
2. Map extracted service names to system maintenance subtypes
|
||||
3. Return structured results with confidence scores
|
||||
|
||||
- **file**: Owner's manual PDF (max 200MB)
|
||||
- **vehicle_id**: Optional vehicle ID for context
|
||||
|
||||
Reference in New Issue
Block a user