Before updates to NHTSA

Eric Gullickson
2025-12-14 14:53:45 -06:00
parent 61e87bb9ad
commit 1fc69b7779
12 changed files with 1680503 additions and 1156458 deletions

@@ -264,6 +264,14 @@ Acceptance:
4) Flush Redis dropdown caches (if needed) and re-test dropdowns.
5) Run QA harness and capture summary output in a `stats.txt` (or similar).
## Status Update (completed)
- ETL rewritten to use makes-filter as the baseline (year/make/model + trims/engines); evidence is overlaid only to prune impossible year/trim combos and to enrich engines/transmissions.
- Engines/transmissions now deduped by display name; vehicle_options deduped on full key.
- Uniqueness constraints added to prevent duplicates on import.
- Import script made rerunnable (truncate + restart identity) and prints year range.
- QA script added and validated (duplicates=0, year range 2000–2026).
- Example issue (GMC Sierra 1500 AT4X 6.2L V8) now present via baseline engines for that trim/year and Automatic/Manual fallback when transmissions are absent.
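
The two dedup rules above (engines/transmissions by display name, `vehicle_options` on the full key) can be sketched roughly as follows. This is a minimal illustration, not the actual ETL code: the field names (`name`, `year`, `make`, `model`, `trim`, `engine`, `transmission`) and the whitespace/case normalization are assumptions.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical full key for a vehicle_options row; the real schema may differ.
OPTION_KEY: Tuple[str, ...] = ("year", "make", "model", "trim", "engine", "transmission")


def dedupe_by_display_name(rows: Iterable[dict]) -> List[dict]:
    """Keep the first row seen for each display name.

    Names are compared after trimming whitespace and case-folding
    (an assumption; the real ETL may compare names exactly).
    """
    seen: Dict[str, dict] = {}
    for row in rows:
        key = row["name"].strip().casefold()
        seen.setdefault(key, row)  # first occurrence wins
    return list(seen.values())


def dedupe_options(rows: Iterable[dict]) -> List[dict]:
    """Drop rows whose full key has already been seen, preserving order."""
    seen = set()
    unique: List[dict] = []
    for row in rows:
        key = tuple(row[field] for field in OPTION_KEY)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique
```

Deduplicating in the ETL keeps the import clean, while the uniqueness constraints in the database remain the backstop if a duplicate slips through.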
## Acceptance Criteria (End-to-End)
- Years available in dropdown are exactly those loaded (default 2000–2026).
- Makes for a year only include makes with models in that year.
@@ -272,4 +280,3 @@ Acceptance:
- Engines show detailed specs when available; otherwise show one of `Gas/Diesel/Electric/Hybrid`.
- Transmissions show derived options when available; otherwise show both `Manual` and `Automatic`.
- No duplicate dimension rows; no duplicate fact rows.
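
The engine and transmission fallback rules in the criteria above could look something like the sketch below. The function names and arguments are hypothetical and only illustrate the intended behavior; they are not taken from the project code.

```python
from typing import List, Optional, Sequence

FUEL_LABELS = ("Gas", "Diesel", "Electric", "Hybrid")
DEFAULT_TRANSMISSIONS = ("Manual", "Automatic")


def engine_options(detailed: Optional[Sequence[str]], fuel_label: str) -> List[str]:
    """Return detailed engine specs when available, else a single fuel label."""
    if detailed:
        return list(detailed)
    if fuel_label not in FUEL_LABELS:
        raise ValueError(f"unknown fuel label: {fuel_label!r}")
    return [fuel_label]


def transmission_options(derived: Optional[Sequence[str]]) -> List[str]:
    """Return derived transmissions when available, else both defaults."""
    if derived:
        return list(derived)
    return list(DEFAULT_TRANSMISSIONS)
```

For example, `transmission_options([])` yields both `Manual` and `Automatic`, which matches the fallback described for the GMC Sierra 1500 AT4X case when no transmissions are present.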