ezer-mishpati/legal-ai: AI Legal Decision Drafting System — MCP server, web upload, RAG search - legal-ai - Dafna Tamir Vault

ezer-mishpati/legal-ai

Go to file

Chaim 808c2e4c46 feat(goldset): independent second-judge for rule_role (break AI-anchoring)

The gold-set's human role tags were made while seeing a claude AI recommendation,
so human↔AI agreement (~100%) is anchoring, not an independent accuracy signal.
This adds a third, genuinely independent judge — a DIFFERENT model (DeepSeek,
direct OpenAI-compatible API) classifies rule_role BLIND (never sees the human
tag nor the first AI's answer) — and reports an inter-rater agreement matrix.

Finding (100 tagged items): ai↔human 100% (anchored) vs deepseek↔human 50%
fine-grained — BUT 92% on the coarse axis (generalizable-rule vs application/
obiter). Conclusion: the fine sub-type (holding/interpretive/procedural) is an
inherently fuzzy boundary two capable models split differently; the coarse
"is this a real rule" axis is robust across models. Use the coarse axis as
ground truth; treat the sub-type as advisory, never as a gate.

Zero chair tagging, read-only on the gold-set. Key from ~/.hermes deepseek env.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

2026-06-07 20:12:58 +00:00

feat(digests): קורפוס יומונים כשכבת-גילוי (radar) — X12

2026-06-07 17:49:00 +00:00

ci: prune old build-NNN images and stale build cache after deploy

2026-06-06 17:31:43 +00:00

chore(tasks): mark style-acquisition T0-T15 + #85/#87/#88 done (initiative complete)

2026-06-06 21:03:27 +00:00

adapters/deepseek-paperclip-adapter

feat(curator): switch Hermes Curator to DeepSeek V4-Pro via deepseek_local adapter

2026-05-10 05:58:52 +00:00

chore(#70 ): delete 15 orphaned cited_only stubs + close #70

2026-06-03 09:38:30 +00:00

feat(X13): auto-trigger court fetch from digests + drain tool

2026-06-07 20:04:12 +00:00

feat(X13): auto-trigger court fetch from digests + drain tool

2026-06-07 20:04:12 +00:00

feat(goldset): independent second-judge for rule_role (break AI-anchoring)

2026-06-07 20:12:58 +00:00

docs(lessons): קיפול ידני של 21 הערות יו"ר backlog לקבצי הידע

2026-06-06 13:08:21 +00:00

Merge pull request 'feat(graph): in-app corpus citation graph (/graph) — Phase 1' (#113 ) from worktree-corpus-graph into main

2026-06-07 18:52:01 +00:00

fix(graph): stop corpus-graph labels overlapping

2026-06-07 20:07:27 +00:00

.dockerignore

fix(training): bundle reference content + use docker bridge gateway

2026-05-27 10:15:27 +00:00

.gitignore

docs+config: בידוד-סשנים נתמך-סביבה לעבודה מקבילה (worktree defaults)

2026-06-06 16:39:11 +00:00

.worktreeinclude

docs+config: בידוד-סשנים נתמך-סביבה לעבודה מקבילה (worktree defaults)

2026-06-06 16:39:11 +00:00

cases

Add case data, benchmark embeddings, and bug report

2026-04-09 17:20:40 +00:00

CLAUDE.md

docs: remove n8n from Nautilus services table

2026-06-06 18:58:47 +00:00

Dockerfile

feat(upload): accept legacy .doc, convert via LibreOffice in container

2026-06-03 13:47:47 +00:00

start.sh

Fix start.sh: redirect uvicorn output to Docker logs

2026-04-13 14:55:04 +00:00

Languages

Python 65.9%

TypeScript 32.1%

JavaScript 1%

Shell 0.7%

CSS 0.2%