The 4 'ambiguous' citation items flagged for chair turned out to be dead orphan stubs: 0 inbound/outbound edges across all 5 citation mechanisms, 0 full_text, 0 halachot, 0 chunks/embeddings. A corpus-wide check found 15 such orphans total (incl. clean-looking ones). Per OpenCitations (keep an id-less entity only if it is CITED — these are cited by nothing), these are pure noise → deleted, not chair-judgment. - 15 orphan cited_only stubs deleted (cited_only 46 -> 31); backup in data/audit/fu2b-orphan-stub-cleanup-*.json. - 0 malformed / 0 orphans remain; all 31 remaining stubs are cited. - Combines with the 3 earlier mechanical normalizations. #70 fully done. - Known forward-edge (no current data, no task): '+' combined-citation handling in citation_extractor if it recurs in future extraction. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
211 KiB
211 KiB