Devlog ·
Vision Classifier Worklog — March 29, 2026
~1 min read · 336 words
Date:
- 2026-03-29
Worktree:
/tmp/jc_reborn_ps1_debug
Branch:
worktree-ps1-debug-20260329
On this page
Goal
Build a local CPU-only visual classifier pipeline that:
- uses the canonical reference scenes as source of truth
- runs under low-memory constraints
- produces structured semantic artifacts
- generates human-reviewable HTML pages
- scales across all
63reference scenes
Constraints
- effectively no usable GPU
- about
8 GBRAM - avoid heavyweight runtime dependencies
- acceptable to trade speed for robustness
Decisions
The pipeline is reference-first, not training-first.
It uses:
- deterministic visual features
- lightweight per-frame feature vectors
- nearest-reference retrieval against the canonical bank
- semantic summaries derived from reference-bank matches
It deliberately does not depend on:
torchtransformerssklearn
Current runtime dependencies are:
PillowNumPy
Implemented
Primary code:
/tmp/jc_reborn_ps1_debug/scripts/vision_classifier.py/tmp/jc_reborn_ps1_debug/scripts/run-vision-reference-pipeline.sh/tmp/jc_reborn_ps1_debug/scripts/publish-vision-pipeline.py
Design doc:
/tmp/jc_reborn_ps1_debug/docs/ps1/research/LOCAL_VISION_CLASSIFIER_PLAN_2026-03-29.md
Completed Outputs
Reference bank:
/tmp/jc_reborn_ps1_debug/artifacts/vision-reference-bank-20260329/
Reference self-check, latest:
repo:/vision-artifacts/vision-reference-selfcheck-20260329-v4/
Published top-level entry:
repo:/vision-artifacts/vision-reference-pipeline-current/index.html
Artifact Types Now Available
For all 63 reference scenes:
- per-scene
review.html - per-scene
vision-analysis.json - bank-level
index.html - self-check
index.html - quality report
- confusion report
- family report
- inventory JSON
- inventory HTML
- inventory CSV
- strongest-scenes JSON
- weakest-scenes JSON
- top-level manifest JSON
Scale Reached
Reference scenes processed:
63
Reference frames indexed:
13,128
Key Improvements Made
- Built a full bank over all reference frames.
- Added per-run analysis against that bank.
- Added per-scene HTML review outputs.
- Added full self-check over all
63references. - Tightened label rules to reduce excessive
oceanclassification. - Added quality, confusion, and family-level reports.
- Added export-friendly inventory artifacts and top-level publishing.
Current Limits
The pipeline is operational, but semantic quality is still v1.
Known weak areas:
- similar
FISHINGscenes cross-match each other - several
STANDscenes remain hard to separate - some scene families still need stronger actor/sprite semantics
So:
- pipeline completeness: good
- artifact completeness: good
- semantic precision: improving, not final
Next Practical Step
Run this exact pipeline against PS1 result directories and produce:
- PS1
review.html - PS1
vision-analysis.json - PS1 quality summary against the reference bank
- scene ranking by semantic mismatch mode
That is the next step that will directly help fix PS1 scene bugs.