Per chaim's directive — for decisions not requiring Dafna/chaim, decide after >=3 authoritative open sources. #71 DONE — resolved by #15's weight fix (measured: all multi-relevant docs now in top-10, the rank-15/16 weak queries fixed). Research (6 sources) said enable rerank; tested empirically → it HURT (nDCG@5 0.879 vs 0.960, MRR 0.867 vs 0.954) because recall is saturated and the cross-encoder demotes exact known-item matches. Measurement overrides theory: no rerank, no limit change. #42 CANCELLED — obviated by BM25 hybrid (already on; handles abbreviation tokens lexically); 0 abbrev queries in eval, recall ~0.99, no measured gap. #14 DEFERRED (reviewed) — no current blocker; YAGNI; trigger documented. #76 CANCELLED — upstream Paperclip bug (ee=companyId), not safely fixable our side; workaround + #78 documented. #70 — research-backed normalization (ECLI/Akoma Ntoso/ELI/OpenCitations + Christen). Applied 3 deterministic mechanical fixes to cited_only (whitespace + missing prefix-space); 0 malformed remain. 4 ambiguous items (2 garbled, 'ערר אדלר', 1 combined citation) flagged for chair — NOT auto-guessed, per the entity-resolution false-merge guardrail. #80 stays pending — human-gated (Dafna value-labeling + chaim cost). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
210 KiB
210 KiB