ezer-mishpati/legal-ai: AI Legal Decision Drafting System — MCP server, web upload, RAG search - legal-ai - Dafna Tamir Vault

ezer-mishpati/legal-ai

Go to file

Chaim dba2a131e0 feat(halacha): multi-judge approval panel + policy calibration (Trust-or-Escalate)

The chair cannot review every pending halacha. Three independent-lineage judges
(Opus via claude_session · DeepSeek · Gemini-2.5-flash — #1 on LegalBench) vote
on the COARSE axis we proved reliable across models (92%): "is this a genuine,
keepable rule?". Only an agreed verdict acts; every split escalates to the chair
(INV-G10). Buckets: clean→KEEP?; nli_unsupported→entailment re-adjudication;
extraction-defects→re-extraction.

halacha_panel_calibrate.py calibrates the voting policy on the gold-set's
is_holding (the coarse label) per Trust-or-Escalate (ICLR 2025): unanimous →
94.9% precision / 78% coverage; majority → 92.9% / 99%; ZERO false-drops in
both (the panel never rejects a good rule). Chosen policy (chair-approved):
clean→majority-2/3, nli→asymmetric (majority-reject, unanimous-approve),
defects→re-extraction. Reversible (--apply backs up review_status+flags first).

Sources: Panel-of-LLM-Evaluators (PoLL) · Trust-or-Escalate (ICLR 2025,
arXiv:2407.18370) · selective-prediction / learning-to-defer.

Invariants: upholds G10 (human gate — splits escalate, panel only collapses the
queue) and G9 (provenance — reviewer records the panel + policy). Read paths only
in calibrate; --apply writes review_status/quality_flags reversibly with backup.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

2026-06-07 21:11:30 +00:00

feat(digests): קורפוס יומונים כשכבת-גילוי (radar) — X12

2026-06-07 17:49:00 +00:00

ci: prune old build-NNN images and stale build cache after deploy

2026-06-06 17:31:43 +00:00

chore(tasks): mark style-acquisition T0-T15 + #85/#87/#88 done (initiative complete)

2026-06-06 21:03:27 +00:00

adapters/deepseek-paperclip-adapter

feat(curator): switch Hermes Curator to DeepSeek V4-Pro via deepseek_local adapter

2026-05-10 05:58:52 +00:00

chore(#70 ): delete 15 orphaned cited_only stubs + close #70

2026-06-03 09:38:30 +00:00

docs(X13): sync spec to route-by-format reality + Tier-0 limitation

2026-06-07 20:51:38 +00:00

fix(X13): route by נט-format availability; robust fetch error handling

2026-06-07 20:45:20 +00:00

feat(halacha): multi-judge approval panel + policy calibration (Trust-or-Escalate)

2026-06-07 21:11:30 +00:00

docs(lessons): קיפול ידני של 21 הערות יו"ר backlog לקבצי הידע

2026-06-06 13:08:21 +00:00

feat(graph): centrality + cluster analytics (corpus graph PR B)

2026-06-07 21:04:47 +00:00

feat(graph): centrality + cluster analytics (corpus graph PR B)

2026-06-07 21:04:47 +00:00

.dockerignore

fix(training): bundle reference content + use docker bridge gateway

2026-05-27 10:15:27 +00:00

.gitignore

docs+config: בידוד-סשנים נתמך-סביבה לעבודה מקבילה (worktree defaults)

2026-06-06 16:39:11 +00:00

.worktreeinclude

docs+config: בידוד-סשנים נתמך-סביבה לעבודה מקבילה (worktree defaults)

2026-06-06 16:39:11 +00:00

cases

Add case data, benchmark embeddings, and bug report

2026-04-09 17:20:40 +00:00

CLAUDE.md

docs: remove n8n from Nautilus services table

2026-06-06 18:58:47 +00:00

Dockerfile

feat(upload): accept legacy .doc, convert via LibreOffice in container

2026-06-03 13:47:47 +00:00

start.sh

Fix start.sh: redirect uvicorn output to Docker logs

2026-04-13 14:55:04 +00:00

Languages

Python 65.9%

TypeScript 32.1%

JavaScript 1%

Shell 0.7%

CSS 0.2%