Diagnosis from the FU-5 eval: for broad legal questions, co-relevant precedents are retrieved but rank at positions 15-16 (recall is ~1.0 by rank 20), so the gap is in ranking, not retrieval. Tracked as a deliberate, harness-measured tuning task rather than an unmeasured change to the global limit, which would affect the UI, the writer agents, and token cost.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>