legal-ai

Author	SHA1	Message	Date
chaim	8dc0a268fb	Merge pull request 'feat(graph): research-gap (ghost) nodes (corpus graph PR C)' (#131 ) from worktree-graph-gaps into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 42s Details	2026-06-07 21:22:19 +00:00
Chaim	9a126f7c36	feat(graph): research-gap (ghost) nodes (corpus graph PR C) Turns the graph into a gap-finder: the 247 unresolved internal citations (a corpus precedent cites a ruling NOT in the corpus) collapse to 230 distinct "gap" nodes — each sized by how many corpus precedents cite it, i.e. the most-wanted missing precedent. Backend (web/graph_api.py — read-only, G2): - "gap" added to VALID_NODE_TYPES (NOT default → off unless requested). - New _gap_nodes_and_edges(): gap:<normalized citation> nodes from precedent_internal_citations WHERE cited_case_law_id IS NULL, sized by global in-degree; cites edges only from precedents present in the view (dangling-edge invariant holds). Best-effort enrichment from missing_precedents via exact normalized-citation match → gap_status + missing_precedent_id. Validated: 230 gaps, top ע"א 3213/97 (cited 5×), 230/230 matched to missing_precedents. - GraphNode += gap_status, missing_precedent_id. Metrics correctly exclude gap edges (target not a precedent). No app.py change (gated via node_types). Frontend: - graph.ts: GraphNodeType += "gap"; node fields. - graph-filter-panel: toggle "חוסרי מחקר (פסיקה חסרה)" (off by default). - graph-canvas: gaps render as faint hollow dashed circles, never recoloured by color-by; sized by citation count. - graph-node-panel: gap branch — "מצוטטת ע״י N פסיקות" + status badge + link to /missing-precedents. web-ui build + lint pass. Invariants: G2 (SELECT-only), UI2 (model grows on explicit Pydantic). api:types post-deploy. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 21:21:53 +00:00
chaim	3c030dd7f5	Merge pull request 'feat(halacha): multi-judge approval panel + policy calibration (Trust-or-Escalate)' (#130 ) from worktree-halacha-panel into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 11s Details	2026-06-07 21:12:03 +00:00
Chaim	dba2a131e0	feat(halacha): multi-judge approval panel + policy calibration (Trust-or-Escalate) The chair cannot review every pending halacha. Three independent-lineage judges (Opus via claude_session · DeepSeek · Gemini-2.5-flash — #1 on LegalBench) vote on the COARSE axis we proved reliable across models (92%): "is this a genuine, keepable rule?". Only an agreed verdict acts; every split escalates to the chair (INV-G10). Buckets: clean→KEEP?; nli_unsupported→entailment re-adjudication; extraction-defects→re-extraction. halacha_panel_calibrate.py calibrates the voting policy on the gold-set's is_holding (the coarse label) per Trust-or-Escalate (ICLR 2025): unanimous → 94.9% precision / 78% coverage; majority → 92.9% / 99%; ZERO false-drops in both (the panel never rejects a good rule). Chosen policy (chair-approved): clean→majority-2/3, nli→asymmetric (majority-reject, unanimous-approve), defects→re-extraction. Reversible (--apply backs up review_status+flags first). Sources: Panel-of-LLM-Evaluators (PoLL) · Trust-or-Escalate (ICLR 2025, arXiv:2407.18370) · selective-prediction / learning-to-defer. Invariants: upholds G10 (human gate — splits escalate, panel only collapses the queue) and G9 (provenance — reviewer records the panel + policy). Read paths only in calibrate; --apply writes review_status/quality_flags reversibly with backup. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 21:11:30 +00:00
chaim	ecd9e46bb9	Merge pull request 'feat(graph): centrality + cluster analytics (corpus graph PR B)' (#129 ) from worktree-graph-analytics into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 49s Details	2026-06-07 21:05:13 +00:00
chaim	6cdf178ea4	Merge pull request 'docs: record Infisical as SoT for host-service secrets' (#128 ) from worktree-secret-infisical-note into main Some checks failed Build & Deploy / build-and-deploy (push) Has been cancelled Details	2026-06-07 21:05:09 +00:00
Chaim	2fbc0cd3c2	feat(graph): centrality + cluster analytics (corpus graph PR B) The Obsidian "Graph Analysis" equivalent — surfaces influence and structure beyond raw citation count. Backend (new web/graph_metrics.py — pure, dependency-free, no DB → G2): - PageRank (power-iteration), betweenness (Brandes), community (deterministic label-propagation + connected-components fallback), computed in-memory over the precedent citation subgraph that build_corpus_graph already fetched. Normalized 0–1; community ints dense-ranked by size (stable colours). - GraphNode += pagerank/betweenness/community (None unless metrics=true). - build_corpus_graph + /api/graph/corpus gain metrics=false (default path unchanged). Validated on the live corpus: 147 nodes in 13ms. Frontend: - graph.ts: GraphNode metrics fields + metrics param. - graph-canvas: color-by (type \| practice_area \| precedent_level \| community \| recency) and size-by (in-degree \| pagerank \| betweenness) via colorForNode / radiusForNode; exported palettes. - graph-view: colorBy/sizeBy controls; metrics requested only when needed; global metrics overlaid onto neighborhood nodes by id (a node's PageRank shouldn't change when focused); a ranking panel (Tabs: המשפיעות / גשרים, click → focus); dynamic legend per color-by. - graph-filter-panel: "צביעה לפי" + "גודל נקודה לפי" Selects. web-ui build + lint pass. Invariants: G2 (metrics pure, no DB writes), UI2 (model grows on explicit Pydantic). api:types post-deploy. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 21:04:47 +00:00
Chaim	360f49d8b4	docs: record Infisical SoT for host-service shared secrets COURT_FETCH_SHARED_SECRET + LEGAL_CHAT_SHARED_SECRET migrated to Infisical nautilus:/legal-ai (2026-06-07). Updated the pm2 config comments: the stale "migrate to Infisical once the MCP server is back" TODO is now done; local env files remain the runtime source, Infisical is the SoT/record. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 21:04:44 +00:00
chaim	24d80e6a2a	Merge pull request 'feat(digests): self-heal drain — auto-resume after quota/interruption (X12)' (#127 ) from worktree-digest-resume into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 12s Details	2026-06-07 21:00:18 +00:00
Chaim	3ae183009f	feat(digests): self-heal in drain_digests: auto-resume after quota/interruption The drain_digests cron is the resume mechanism (pending-based, idempotent, host-side, session-independent). Hardening: if enrich failed midway (claude quota exhausted), the row stayed 'completed' with empty fields → it was never processed again. Now, at startup, drain resets every 'completed' digest whose concept_tag and underlying_citation are both empty (= an extraction that never landed; a valid row always contains at least a citation) → pending for a re-run. This way every interruption/quota failure recovers automatically on the next cron run. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:59:49 +00:00
chaim	106ab53231	Merge pull request 'feat(graph): metadata filters + facets (corpus graph PR A)' (#126 ) from worktree-graph-metadata into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m1s Details	2026-06-07 20:52:36 +00:00
Chaim	8258f09228	feat(graph): metadata filters + facets (corpus graph PR A) Adds legal-metadata filtering and the payload to color by it (foundation for the color-by selector in the analytics PR). Backend (web/graph_api.py, web/app.py) — read-only, G2: - GraphNode += court, date (ISO) — precedents carry them for filter/color-by. - build_corpus_graph += server-side WHERE filters (G5): court, precedent_level, chair, district, year_from, year_to (EXTRACT(YEAR FROM date)). Neighborhood query also selects court/date. - New GET /api/graph/facets (response_model GraphFacets, UI2) → distinct courts/levels/chairs/districts so the UI doesn't hardcode Hebrew strings. Frontend: - graph.ts: GraphNode += court/date; GraphFilters += the six params; buildParams; useGraphFacets() hook. - graph-filter-panel: an "advanced" Accordion with court/precedent_level/chair/ district Selects (from facets) + year-from/year-to Selects. - graph-view: new controls wired into filters; facets fetched and passed down. Verified read-only against the live DB (precedent_level=עליון&year_from=2015 filters correctly; facets populated: 36 courts / 3 levels / 19 chairs / 4 districts). web-ui build + lint pass. Invariants: G2 (SELECT-only via db.get_pool), G5 (filters server-side), UI2 (explicit response_models). api:types to be regenerated post-deploy. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:52:13 +00:00
chaim	aa32766a8c	Merge pull request 'docs(X13): sync spec to route-by-format + Tier-0 limitation' (#125 ) from worktree-court-fetch-specsync into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 9s Details	2026-06-07 20:51:58 +00:00
Chaim	6882ccfcf1	docs(X13): sync spec to route-by-format reality + Tier-0 limitation The spec said "supreme → Tier-0"; reality (PR #124) routes by נט-format availability — נט המשפט (Tier-1) serves all courts incl. Supreme-with-נט-format, and only serial-only Supreme falls to the (still-unbuilt) Tier-0 → manual. Updated §0 source-distinction, §1 routing diagram, §5 risks (Tier-0 limitation + scheduled drain). Docs-only. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:51:38 +00:00
chaim	618f476a22	Merge pull request 'fix(X13): route by נט-format; robust error handling in fetch' (#124 ) from worktree-court-fetch-routing into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m43s Details	2026-06-07 20:46:01 +00:00
Chaim	69b34f1c3f	fix(X13): route by נט-format availability; robust fetch error handling Live drain surfaced three issues: 1. Tier-0 needed `h2` (httpx http2) — added to the court-fetch extra. 2. Supreme cases that carry a נט-format number (e.g. בר"מ 72182-06-25) were routed to the unvalidated Tier-0 and failed, even though נט המשפט serves Supreme cases too. classify() now parses the file-month-year triple for Supreme prefixes; the orchestrator routes by triple-availability: נט-format present → Tier-1 (validated, all courts) serial-only Supreme (עע"מ 5886/24) → Tier-0 neither → clear "no public route" failure Validated live: בר"מ 72182-06-25 fetched via Tier-1 (5-page PDF). 3. A non-`RuntimeError` fetch exception (the h2 import error) left jobs stuck in 'running'. The fetch block now catches any Exception → _record_failure (INV-CF2/CF3), so a job always reaches a terminal state. + test_supreme_with_net_format_triple. Suite 11/11. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:45:20 +00:00
chaim	796bfa890f	Merge pull request 'feat(digests): drain_digests.py — local enrichment drainer for daily cron (X12)' (#123 ) from worktree-digest-cron into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 13s Details	2026-06-07 20:41:07 +00:00
Chaim	c1abf2ec0e	feat(digests): scripts/drain_digests.py: local enrichment drainer for cron (X12) Drains the digest enrichment queue locally (claude_session is local-only): every 'pending' digest → enrich_digest (Sonnet + embedding + autolink). Parallel (3), idempotent, adds ~/.local/bin to PATH (for the claude CLI under cron). Intended for a daily cron after the n8n poll (flock to prevent overlap) + manual use after a backfill. SCRIPTS.md updated. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:40:45 +00:00
chaim	6468e151d9	Merge pull request 'refactor(digests): single source of truth — drop processed/ folder state (X12)' (#122 ) from worktree-digests-single-truth into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 28s Details	2026-06-07 20:33:43 +00:00
Chaim	fb40ec8565	refactor(digests): single source of truth: drop processed/ folder state (X12) The DB (`digests`) is the single source of truth for ingestion state. ingest_digests_batch.py moved files incoming→processed/, a folder-based state parallel to the DB (a minor G2 violation). - Removed the move to processed/ + import shutil + PROCESSED. The script relies on content_hash dedup (ingest_digest returns 'exists' for existing rows) → re-running is safe. - Folders (incoming/) = staging only, not state. - X12 INV-DIG2: documented the single source of truth + the fixed violation (processed/). - SCRIPTS.md updated. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:33:18 +00:00
chaim	bcd5fd5f8d	Merge pull request 'feat(X13): scheduled drain → fully autonomous digest→fetch→ingest loop' (#121 ) from worktree-court-fetch-schedule into main Some checks are pending Build & Deploy / build-and-deploy (push) Has started running Details	2026-06-07 20:32:32 +00:00
Chaim	f4f110f0d1	feat(X13): scheduled drain — fully-autonomous digest→fetch→ingest loop - scripts/drain_court_fetch.py: drives orchestrator.drain_pending (host-only; no-op when queue empty). Mirrors drain_halacha_queue.py. - scripts/legal-court-fetch-drain.config.cjs: pm2 cron (hourly :17, one-shot), COURT_FETCH_DRAIN_CRON override. - fix: orchestrator default service URL 127.0.0.1 → 10.0.1.1 (the service binds the docker0 gateway; the host can't reach it on loopback). Found live — the first drain failed "connection refused" until corrected. - SCRIPTS.md entries. Validated end-to-end in PRODUCTION on a real digest: עת"מ 43830-12-24 (החברה להגנת הטבע) fetched from נט המשפט → case_law (79 chunks, source_url), digest relinked (INV-DIG3 closed), halacha queued pending_review. job=done. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:31:53 +00:00
chaim	540d39b958	Merge pull request 'fix(extract): disable tools for digest LLM extraction (no error_max_turns)' (#120 ) from worktree-digest-notools into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m29s Details	2026-06-07 20:18:48 +00:00
Chaim	d3b5c563ce	fix(extract): disable tools for digest LLM extraction (no error_max_turns) Digest metadata extraction is pure text→JSON, but the claude CLI runs with tools available, and Sonnet sometimes emits stop_reason=tool_use → hits --max-turns 1 → error_max_turns → retry (slow). This wastes a lot of time during a bulk backfill. - claude_session.query/query_json: new `tools` parameter → passed as --tools. "" = disable all tools (no tool_use → no max-turns trip). None = CLI default. - digest_metadata_extractor.extract: passes tools="". Verified: extract on digest 5160 with Sonnet + tools="" → num_turns=1, valid JSON, no error_max_turns. claude_session remains local-only. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:18:29 +00:00
chaim	d9340f6c39	Merge pull request 'feat(goldset): independent second-judge for rule_role — break AI-anchoring' (#119 ) from worktree-goldset-independent-judge into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 11s Details	2026-06-07 20:13:25 +00:00
Chaim	808c2e4c46	feat(goldset): independent second-judge for rule_role (break AI-anchoring) The gold-set's human role tags were made while seeing a claude AI recommendation, so human↔AI agreement (~100%) is anchoring, not an independent accuracy signal. This adds a third, genuinely independent judge — a DIFFERENT model (DeepSeek, direct OpenAI-compatible API) classifies rule_role BLIND (never sees the human tag nor the first AI's answer) — and reports an inter-rater agreement matrix. Finding (100 tagged items): ai↔human 100% (anchored) vs deepseek↔human 50% fine-grained — BUT 92% on the coarse axis (generalizable-rule vs application/ obiter). Conclusion: the fine sub-type (holding/interpretive/procedural) is an inherently fuzzy boundary two capable models split differently; the coarse "is this a real rule" axis is robust across models. Use the coarse axis as ground truth; treat the sub-type as advisory, never as a gate. Zero chair tagging, read-only on the gold-set. Key from ~/.hermes deepseek env. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:12:58 +00:00
chaim	879bb6c074	Merge pull request 'fix(graph): stop corpus-graph labels overlapping' (#118 ) from worktree-corpus-graph-labels into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m1s Details	2026-06-07 20:07:57 +00:00
Chaim	f3e99a14ca	fix(graph): stop corpus-graph labels overlapping Labels piled on top of each other (esp. in the neighborhood view) for two reasons, both fixed in graph-canvas.tsx: 1. Font grew as you zoomed OUT (size was divided by sqrt(globalScale) and had a +6 floor), so at overview zoom labels became huge and collided. Now the label font is a ~constant SCREEN size (fontPx / globalScale). 2. Every node drew its label at once. Now labels are zoom-gated: at overview zoom only the active node, the 3 practice-area hubs, and the most-cited precedents (size>=4) are labeled; topic hubs appear at >=1.05 and the rest at >=1.5 — by which point there is pixel room between nodes. Also: a white halo (strokeText) behind each label for legibility over edges and nearby nodes, and stronger d3 forces (charge -220, link distance 60) so nodes spread out and labels have more room. web-ui build passes; /graph in the route table. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:07:27 +00:00
chaim	b9fa38f3db	Merge pull request 'feat(X13): automatic trigger from digests → case-law fetch + drain tool' (#117 ) from worktree-court-fetch-trigger into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m37s Details	2026-06-07 20:04:45 +00:00
Chaim	f56309da5a	feat(X13): auto-trigger court fetch from digests + drain tool Closes the loop: a digest pointing to a court verdict that is not in the corpus triggers an automatic fetch, and the digest is linked back after ingestion (INV-DIG3 + INV-CF2). - digest_library.try_autolink: on link failure, if the citation is classified as a court verdict (supreme/admin) → _enqueue_court_fetch creates court_fetch_jobs(pending); an appeals committee (skip) is not triggered. never-raises (does not break digest ingestion). - orchestrator.drain_pending(limit): drains pending/failed serially (cooldown, INV-CF4), fetch+ingest for each; on success links the digest to the ingested case_law. - MCP tool court_fetch_drain + registration in server.py. - X13 spec: updated (the INV-CF2 gap marked as fixed). Tested against the DB: עת"מ 46111-12-22 → job tier=admin pending digest-linked; ערר 1110/20 → not triggered. Local-only tool (ingest = claude CLI). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 20:04:12 +00:00
chaim	635dc98492	Merge pull request 'feat(digests): Sonnet for digest metadata extraction (X12)' (#116 ) from worktree-digest-sonnet into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m28s Details	2026-06-07 19:59:30 +00:00
Chaim	e6dc410d7d	feat(digests): use Sonnet for digest metadata extraction (X12) Digest metadata extraction (concept tag, halacha title, citation, tags from a one-page summary) is a simple, high-volume task; Sonnet is the speed/cost balance point, unlike halacha extraction, which pins Opus. - config.DIGEST_EXTRACT_MODEL (env-tunable, default claude-sonnet-4-6). - digest_metadata_extractor.extract(model=None) → default from the config; previously no model was specified → it ran on the CLI default (Opus 4.8). Verified: extract on digest 5163 with Sonnet returned a valid concept tag/title/citation/practice area/tags (~36s). claude_session remains local-only. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 19:58:48 +00:00
chaim	e82eeaad9f	Merge pull request 'fix(X13): hardening against browser memory leaks + reaper for task-master-mcp' (#115 ) from worktree-court-fetch-harden into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m24s Details	2026-06-07 19:44:26 +00:00
Chaim	e186183527	fix(X13): harden court-fetch against browser leaks + reaper for task-master-mcp leak Three layers of defense against memory leaks from orphaned browsers, + handling of the largest actual leak on the server (task-master-mcp). - camofox_client.py: - hard asyncio.wait_for around the entire fetch (COURT_FETCH_HARD_TIMEOUT_S=180 s): hang → cancellation → async-with tear-down → reap. - _reap_orphan_browsers(): kills orphaned camoufox-bin processes (ppid=1) before and after every fetch. Serial execution (INV-CF4) → any ppid=1 process is safely a leftover. - scripts/reap_orphan_procs.py: general reaper for task-master-mcp (~3GB of orphans) + camoufox-bin. Only ppid=1; pure /proc. --dry-run / --loop N. - scripts/legal-reaper.config.cjs: pm2 daemon (loop 180s, max_memory_restart 100M). - X13 spec + SCRIPTS.md: documented the defense layers. max_memory_restart on the service (1.5G) already provides a process-level safety net. Invariants: upholds INV-CF4 (politeness/serial), no contract change. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 19:43:53 +00:00
chaim	61b9d72bcf	Merge pull request 'feat(X13 Tier-1): calibrate נט המשפט fetch, Camoufox python, verified on עת"מ 46111-12-22 (34 pages)' (#114 ) from worktree-court-fetch-tier1 into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m26s Details	2026-06-07 19:33:01 +00:00
Chaim	781f24c643	feat(X13 Tier-1): calibrate נט המשפט fetch, Camoufox python, proven on 46111-12-22 Verified end-to-end: the 34-page verdict of עת"מ 46111-12-22 was downloaded fully autonomously, purely open-source, with no smart card and no CAPTCHA solving. Main calibration findings: - Search + navigation to the case involve no reCAPTCHA at all. reCAPTCHA exists only in the viewer and only on an explicit save/print, not on displaying the document. - The viewer serves pages as PNG via PageMethod GetImages (4/batch); they are pulled with fetch using the header X-Requested-With: XMLHttpRequest (mandatory: the F5 WAF blocks requests without it) → PDF assembly (Pillow). Changes: - camofox_client.py: full rewrite, Camoufox via the Python package (in-process, not a Node REST server). Calibrated path: home→btnExternalSearchCases→Bama fields→CaseDetails→פסקי דין→DecisionList→NGCSViewerPage→GetImages→PDF. - pm2 config: app Xvfb :99 + DISPLAY=:99 (Camoufox crashes headless without a virtual display). - pyproject: extra [court-fetch] = camoufox + faster-whisper (host-only; the container does not run a browser). Pillow is already in the base. - X13 spec + SCRIPTS.md: updated with the findings (image-API, Xvfb, verification). reCAPTCHA audio (Whisper) is kept as a fallback for the explicit-save path only; the main path does not need it. Invariants: upholds INV-CF1/CF4/CF6 (unchanged). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 19:32:13 +00:00
chaim	9315ba4dfe	Merge pull request 'feat(graph): in-app corpus citation graph (/graph) — Phase 1' (#113 ) from worktree-corpus-graph into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m44s Details	2026-06-07 18:52:01 +00:00
Chaim	c80e4ce8ff	feat(graph): in-app corpus citation graph (/graph) — Phase 1 Native, Obsidian-graph-view-like network of the precedent corpus, rendered in web-ui from a read-only projection of the live DB. Replaces the idea of exporting to an external Obsidian vault (which would be a parallel, drifting copy of the corpus — the exact root cause G2 forbids). The graph edges already existed in the data model; this only surfaces them: nodes = precedents (case_law) + synthesized topic/practice-area hubs; edges = cites (precedent_internal_citations) + same_chain (case_law_relations) + tagged/in_area (subject_tags / practice_area membership). Node size = incoming-citation count (index-backed GROUP BY on idx_pic_target). Click a node → local-graph neighborhood focus; panel deep-links to /precedents/[id]. Backend (read-only, SELECT only — G2): - web/graph_api.py — Pydantic models (CorpusGraph/GraphNode/GraphEdge, so OpenAPI emits real types — UI2) + SQL assembly over the shared db.get_pool(). - web/app.py — GET /api/graph/corpus, GET /api/graph/node/{id}/neighborhood, both with explicit response_model. practice_area validated against the closed enum (G5); both endpoints write nothing. Frontend: - react-force-graph-2d (canvas/d3-force), loaded via next/dynamic ssr:false. - /graph page + nav entry; graph.ts TanStack hooks; filter panel (practice_area / source / min-citations / search / node-type toggles), node detail panel, hover+selection neighborhood highlight. Explicit error handling (UI4). Not a retrieval path (03-retrieval): returns graph topology, never ranked search results. Halacha nodes + corroboration/equivalence edges are Phase 2, already gated behind the node_types param (no contract change needed). SQL validated read-only against the live DB (142 precedents, 85 resolved citations, JSONB tag expansion, ANY(uuid[]) edge + BFS queries). web-ui lint + build pass; /graph in the route table. 
Invariants: keeps G2 (single source of truth — live projection, no parallel store), G5 (corpus separation filtered server-side), UI2 (response models), UI4 (no swallowed UI errors). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 18:50:56 +00:00
chaim	f3740fef68	Merge pull request 'fix(halacha): split authority (derived) from rule_role — stop source-conflation (INV-DM7)' (#112 ) from worktree-halacha-authority-split into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m32s Details	2026-06-07 18:19:43 +00:00
Chaim	2e33cac043	fix(halacha): split authority (derived) from rule_role — stop source-conflation (INV-DM7) The extractor classified rule_type by SOURCE bindingness (higher-court→binding, committee→persuasive) instead of by rule KIND. The gold-set proved it: 'binding' appeared on 19/19 external rulings & 0 committees; 'persuasive' on 13/13 committees & 0 external — only 58% agreement with the human role tags. The two axes (authority vs rule role) were crammed into one enum. This splits them per INV-DM7: - authority (binding/persuasive) — DERIVED from case_law.precedent_level (עליון/מנהלי→binding, ועדת_ערר_מחוזית→persuasive), never stored, never LLM-guessed. New helper halacha_quality.derive_authority; surfaced read-only in list_halachot / goldset_list / search results. - rule_type — now the rule ROLE only: holding/interpretive/procedural/ application/obiter. Both extractor prompts unified to this vocabulary; _coerce_halacha no longer defaults rule_type from the source; legacy binding→holding / persuasive→interpretive fold for safety. UI: authority shown as a separate read-only badge (gold=מחייב / muted=משכנע) across the review queue, precedent detail, and gold-set; the gold-set role selector drops binding/persuasive and adds מהותי (holding). Migration: scripts/halacha_rule_role_backfill.py re-classifies the 276 pre-split binding/persuasive rows into a genuine role via local claude_session (run after deploy). Gold-set correct_type/ai_correct_type 'binding'→'holding' via SQL. Sources (≥3, per research-decision policy): OASIS LegalRuleML v1.0 (appliesAuthority/Strength as metadata orthogonal to rule logic) · SemEval-2023 Task 6 LegalEval (rhetorical roles by function, authority kept separate) · Bluebook signals (weight-of-authority is a separate dimension). Invariants: ESTABLISHES INV-DM7. Upholds G1 (normalize at source — extractor classifies role, system derives authority) and G2 (single source of truth — authority derived, not a parallel stored field). 
Tests: 211 pass + new derive_authority/coerce coverage. web-ui build + tsc clean. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 18:18:41 +00:00
chaim	acb8e2c206	Merge pull request 'feat(X13): automatic case-law fetch from נט המשפט → corpus (Tier 0 + scaffold)' (#110 ) from worktree-court-fetch into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m21s Details	2026-06-07 18:13:15 +00:00
Chaim	0990db7a3c	feat(X13): auto-fetch court verdicts from נט המשפט → corpus (Tier 0 + scaffold) Automatic case-law fetch subsystem: when a digest points to a court verdict, the court instance is classified, the verdict is downloaded from the appropriate public source, and it is ingested through the canonical ingestion pipeline. - spec-first: docs/spec/X13-court-fetch.md (INV-CF1..CF7) + index - classifier court_citation.py (supreme/admin/skip) + 10 tests (עת"מ 46111-12-22 → admin) - Tier 0: court_fetch_supreme.py, supremedecisions API (reverse-engineered), httpx + browser-headers (verified 200) + politeness - queue court_fetch_jobs (SCHEMA_V30) + DB helpers + court_fetch_orchestrator.py - Tier 1 scaffold: legal-court-fetch-service (aiohttp+Bearer, mirrors legal-chat-service) + camofox_client (Camoufox open-source) + recaptcha_audio (local Whisper) + pm2 - Tier 2 graceful fallback: manual + missing_precedent (INV-CF2/CF3: no silent drop) - MCP tools court_verdict_fetch / court_fetch_status; SCRIPTS.md Invariants: upholds G2 (single ingestion path, INV-CF1) · G3/G1 (idempotent + normalization, INV-CF5) · G4/§6 (no silent swallowing, INV-CF2) · G10 (human gate, INV-CF3) · G5 (source_type, INV-CF6) · G9 (provenance+audit, INV-CF7). INV-CF4 sources: RFC 9309 · Google crawler · OWASP OAT. Follow-ups (not yet verified live): live Tier-0 validation · installing camofox-browser+whisper · calibrating Tier-1 selectors · COURT_FETCH_SHARED_SECRET (Infisical+Coolify) · trigger from digest try_autolink (worktree-digests-radar). V30 may conflict with digests-radar. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 18:12:13 +00:00
chaim	692eea76f0	Merge pull request 'feat(digests): Phase 2 — API endpoints + /digests UI (X12)' (#111 ) from worktree-digests-ui into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m25s Details	2026-06-07 18:11:45 +00:00
Chaim	06281996ca	feat(digests): Phase 2: API endpoints + /digests UI (X12) User-facing surfaces for the digest corpus: FastAPI endpoints + a separate /digests UI page (for browsing, search, upload, and linking to the original ruling). The digest remains a secondary source pointing to the ruling: it is not cited in a decision (INV-DIG1) and does not extract halachot (INV-DIG2). Backend (container-safe + local split): - digest_library: split into create_pending_digest (CONTAINER-SAFE: stage + extract_text + create row 'pending', no LLM) ↔ enrich_digest/process_pending_digests (local: LLM+embed+autolink). ingest_digest combines both. - db.list_pending_digests; MCP digest_process_pending (tool+server), an alternative to the batch script for draining the queue. - web/app.py: 10 endpoints /api/digests/* (upload/list/search/queue-pending/get/patch/delete/link/relink/unlink). upload=INSERT-only pending (the LLM runs locally; claude_session is local-only). All return a dict in the precedent pattern. Frontend (Next 16, without api:types; hooks with hand-written types like precedent-library.ts): - lib/api/digests.ts: hooks (useDigests/useDigestSearch/useDigestPending/useUploadDigest/useLink/Relink/Unlink/Delete/Update). - A separate /digests page (not a tab in /precedents, to preserve the authoritative/secondary boundary, INV-DIG1): tabs digests/search + DigestCard (link-to-ruling badge) + DigestUploadDialog + pending badge. nav + header-context. Verified: full backend round-trip (create_pending→list_pending→process_pending→search→restore); web-ui compiles (webpack/tsc clean, route /digests generated). Note: the default build (turbopack) fails in the worktree because of a symlink to node_modules; in CI/Docker (real node_modules) it works; verified with --webpack. Invariants: upholds INV-DIG1/2 (upload does not extract halachot, UI shows "pointer, not cited"), INV-DIG3 (link/relink/queue). G4 (no swallowing: errors→toast/HTTP), G2 (separate path, not parallel). X6 (UI↔API contract: endpoints in the precedent pattern; hooks hand-written like the other domain modules). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 18:11:05 +00:00
chaim	955675eb1f	Merge pull request 'feat(digests): digest corpus as a discovery layer (radar), X12' (#109 ) from worktree-digests-radar into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m25s Details	2026-06-07 17:50:07 +00:00
Chaim	8171572cdd	feat(digests): digest corpus as a discovery layer (radar), X12 A new repository for the "כל יום" digests (עפר טויסטר) as a discovery layer above the case-law corpora: a secondary source that points to the original ruling, is ingested into a separate `digests` table, is searched semantically, and is linked to the original ruling in the case-law library, but is never cited in a decision and never extracts halachot. Phase 0 (spec): - docs/spec/X12-digests-radar.md: INV-DIG1 (pointer, not cited) / INV-DIG2 (separate ingestion path, not parallel; upholds G2) / INV-DIG3 (the link to the ruling is the bridge; a missing link = a visible gap). Index update 00/03/README. Phase 1 (MVP): - SCHEMA_V30: `digests` table (HNSW on embedding, not ivfflat, to avoid a recall cliff in a small/growing corpus) + GIN/FTS + partial UNIQUE for idempotency. - services/digest_metadata_extractor.py: LLM extraction (claude_session local-only, lazy import): concept tag, halacha title, citation, two distinct dates, tags. - services/digest_library.py: independent short path (INV-DIG2): extract→hash→LLM→single embedding→autolink. Does not use ingest.ingest_document. - tools/digests.py + registration of 7 tools in server.py (digest_upload/list/get/link/relink/delete + search_digests). - scripts/ingest_digests_batch.py: manual ingestion from data/digests/incoming. - legal-researcher.md: step 2ב.0 (radar scan before verification) + report section ט + 3 tools in the frontmatter. HEARTBEAT §8: routing digest→digest_upload. Verified end-to-end: 4 digests ingested (accurate metadata), semantic search ranks correctly ("היטל השבחה"→5160, "תמא 38"→5158), link/relink/autolink/revert + MCP wrapper. Invariants: adds INV-DIG1/2/3 (X12). Upholds G2 (separate bounded context, not a parallel path), G3 (idempotent upsert), G4 (no silent swallowing: a link gap is surfaced), G9 (traceability: the digest points to a traceable source). Touches G7 (RRF): deferred, semantic-only search in phase 1 (FTS index ready). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 17:49:00 +00:00
chaim	9eaabffba4	Merge pull request 'fix(goldset): single view-mode filter (can't get stuck hiding untagged)' (#108 ) from worktree-goldset-filter-fix into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 44s Details	2026-06-07 14:48:16 +00:00
Chaim	90f3c472b5	fix(goldset): single view-mode filter — can't get stuck hiding untagged The old independent toggles had a trap: clicking "אי-הסכמות AI" set a filter, and once all disagreements were resolved the toggle button disappeared (rendered only when count>0) while the filter stayed ON — so the list showed zero items and the untagged ones were unreachable. Replaced hideTagged + disagreeOnly with one mutually-exclusive segmented control: הכל / לא תויגו / תויגו / ⚠ אי-הסכמות, each with a live count and always visible. No stuck state; "לא תויגו" makes the remaining work obvious. Verified: tsc --noEmit 0. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 14:47:53 +00:00
chaim	638a542cf4	Merge pull request 'feat(goldset): AI second-opinion per item (QA aid)' (#107 ) from worktree-goldset-ai-recommendation into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m35s Details	2026-06-07 14:25:06 +00:00
Chaim	0e35060d3d	feat(goldset): AI second-opinion per item (QA aid) — compare vs human tag The chair wanted an independent recommendation beside each tag, to reconsider his own judgments. Adds a NON-ground-truth AI second-opinion: - schema: halacha_goldset.ai_is_holding / ai_correct_type / ai_rationale / ai_generated_at (additive). - db.goldset_set_ai_recommendation + goldset_list now returns the ai_* fields. - scripts/goldset_ai_recommend.py — local claude_session judges is_holding + type + a one-line rationale per item, INDEPENDENTLY (own legal rubric). Independent of the rule-based validators #81.8 measures → no circularity. Never auto-applied; QA aid only. - web-ui: each card shows "🤖 המלצת AI: הלכה/לא · type" + rationale and an agreement/disagreement chip vs the human tag (amber on disagree); a "⚠ אי-הסכמות AI (N)" filter to review only the conflicts. Methodology note kept explicit: the human stays the ground truth; the AI is a prompt to reconsider, not to copy. Verified: tsc --noEmit 0; generator stores recs and flags disagreements with existing human tags. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 14:24:35 +00:00


709 Commits