legal-ai

Author	SHA1	Message	Date
Chaim	af651d0135	feat(rag): Stage B — RAG improvements (HNSW + BM25 hybrid + MMR + dynamic boost) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m35s Details Five enhancements to the precedent retrieval stack: * #44 HNSW indexes for precedent_chunks + halachot (replacing IVFFlat lists=50). Build time ~3s combined. Better recall@10 with pgvector 0.8.2. * #45 Halacha sweep — 96 pending halachot at conf>=0.78 promoted to approved (1141 → 1237). Cluster at conf=0.78 spot-checked OK. Applied via psql only — env HALACHA_AUTO_APPROVE_THRESHOLD unchanged (0.80). * #43 MMR diversity — search_precedent_library_hybrid now caps at ``max_per_case_law=2`` (default). Prevents one precedent dominating top-10 when many of its chunks/halachot rank high. New helper ``_diversify_by_case_law`` in hybrid_search.py. * #46 Dynamic halacha boost — replaces the static ``score+=0.05`` with ``score+=confidence0.06``. Calibrated so avg-confidence (~0.85) stays at +0.05; high-conf halachot get a slight extra lift, low-conf ones get less. Behaviour preserved at the mean. #41 BM25/tsvector hybrid + RRF. Schema V12 adds STORED tsvector columns ``precedent_chunks.content_tsv`` and ``halachot.rule_tsv`` (using simple config — Postgres has no Hebrew stemmer) + GIN indexes. New ``db.search_precedent_library_lexical`` mirrors the semantic function with ts_rank_cd over plainto_tsquery. ``hybrid_search`` runs sem+lex in parallel and fuses via RRF before rerank. Toggle: env ``BM25_HYBRID_ENABLED`` (default true), graceful fallback to semantic-only on lexical failure. #40 (VOYAGE_RERANK_ENABLED) was already true in Coolify env; no change. #42 (Claude Haiku query expansion) deferred — latency + cost concerns warrant a separate plan; the bm25 lexical leg already recovers most of the exact-string recall #42 was meant to address. Closes TaskMaster #41, #43-#46. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-26 08:08:02 +00:00
Chaim	b197d2329c	fix(corpus): move citation guard to service level All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m31s Details Defense in depth — the MCP wrapper guard catches researcher uploads, but the HTTP API (/api/precedent-library/upload) bypasses the wrapper and calls services.precedent_library.ingest_precedent directly. The guard now also lives in the service, so HTTP uploads of ערר/בל"מ citations to the external corpus get rejected at the source. Companion to DB constraint case_law_external_arar_check (applied via psql) — three independent layers now enforce the same invariant. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-26 07:49:49 +00:00
Chaim	c6e368e4f7	feat(corpus): Stage A — corpus tagging fixes + prevention layer All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m8s Details מתקן את הבאג של תיוג שגוי לועדות ערר ומונע חזרתו: Code changes: * New MCP tool `internal_decision_upload` (chair_name+district required) — sole supported path for ingesting committee decisions; tags source_kind='internal_committee' automatically. * Citation guard in `precedent_library_upload` rejects citations starting with "ערר" or "בל\"מ" with a directive to use internal_decision_upload. * `practice_area.py` taxonomy unification: PRACTICE_AREAS now accepts both multi-tenant (appeals_committee/national_insurance/labor_law) and domain (rishuy_uvniya/betterment_levy/compensation_197) values. New helper `to_db_practice_area(multi_tenant, subtype) -> domain`. Agent docs: * legal-researcher (+5K): upload-tool decision flowchart, code samples per source_kind, district enum (ירושלים/מרכז/תל אביב/צפון/דרום/חיפה/ארצי) * legal-ceo, legal-analyst, legal-writer, legal-qa, HEARTBEAT — taxonomy awareness + source_kind-aware citation patterns + research_complete as valid status. * Fixed two pre-existing wrong practice_area values in examples (histael_hashbacha→betterment_levy, pitsuim_197→compensation_197). Closes TaskMaster #30(parts), #38(parts), #39 (root cause). DB-side backfill + CHECK constraints applied directly via psql: * 11 cases.practice_area corrected (1xxx→rishuy, 8xxx→betterment) * 6 case_law records reclassified external_upload→internal_committee with inferred district * 6 chair_name backfilled from full_text (5 שרית אריאלי + 1 דפנה תמיר) * 88 new halachot extracted for newly-uploaded precedents (אנטרים + ירושלים שקופה 1112/22 + אגא וכט) * CHECK constraints: cases.practice_area enum, case_law internal⇒district Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-26 07:40:18 +00:00
Chaim	8153bc9f03	fix(extractor): add regex fix for Hebrew law year gershayim corruption All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m36s Details תש[א-ת]+יי[א-ת] → תש[א-ת]+"[א-ת] (e.g. תשכייה → תשכ"ה) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-25 16:12:20 +00:00
Chaim	4892fb6e8f	fix(extractor): apply Hebrew quote fixer to direct PDF extraction path All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m40s Details Born-digital Hebrew PDFs from legal software often encode gershayim (״) as double-yod (יי), producing the same corruption patterns as OCR. The fixer was only called after Google Cloud Vision OCR — digitally created PDFs that passed quality checks received no correction. Changes: - Apply _fix_hebrew_quotes() in the direct extraction path - Add 'בליימ' → 'בל"מ' (בקשה להארכת מועד — systematic corruption in 1017-03-26) - Add 'תמייא' → 'תמ"א' (תכנית מתאר ארצית) - Update docstring to reflect the broader scope Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-25 15:59:39 +00:00
Chaim	b368bce690	fix: handle invalid date formats gracefully and add missing dialog descriptions All checks were successful Build & Deploy / build-and-deploy (push) Successful in 4m14s Details - Wrap date.fromisoformat() in try/except in case_update tool — prevents unhandled ValueError from surfacing as 500; FastAPI now catches it as 422 - Add DialogDescription (sr-only) to 5 dialogs missing aria-describedby: documents-panel preview + delete, drafts-panel delete + feedback, link-related-dialog Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-25 15:53:01 +00:00
Chaim	1496e520fd	feat(precedent-library): add district and chair_name to edit form All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m11s Details Fields existed in DB and Precedent type but were missing from: - PrecedentUpdateRequest (backend model) - update_case_law allowed set (db layer) - PrecedentPatch (frontend type) - precedent-edit-sheet form state, inputs, and patch payload Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-19 12:16:43 +00:00
Chaim	1da2a9a2cb	fix: exclude archived cases from stale-case-reminder All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details Archived cases have archived_at IS NOT NULL — they are not "stuck", they are done. The stale query was missing this filter. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 16:41:38 +00:00
Chaim	f3ecccd4f0	docs: add procedural patterns layer (interim decision template) Document new daphna-procedural-patterns.md cataloging the "appraiser clarification request" interim-decision pattern observed in 8174-24 — structure only, not phrasing (case is an outlier example). - daphna-decision-tree.md §0.5: gating question before main tree - legal-ceo.md voice docs table: register procedural patterns doc - legal-writer.md: mandatory consultation when pattern_tag is set, with explicit warning against copying 8174-24 wording Approved via interaction request_confirmation (CMPA-15) 2026-05-17. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-17 16:29:58 +00:00
Chaim	a2fc36d65f	fix: recognize extended chair-position placeholders as empty All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m35s Details The legal-analyst agent was generating a longer placeholder form [ימולא ע"י יו"ר הוועדה — עמדה/הנחיה לגבי סוגיה זו שתשמש את סוכן הכתיבה] which _is_placeholder() did not match (substring check fails because ] is further along in the longer form). Result: UI showed "✓ עמדה נקבעה" (green) for all 4 issues even though no chair direction had been entered. Fixes: 1. research_md.py: add regex fallback — any text starting with [ימולא is a placeholder 2. legal-analyst.md: template now emits the standard short placeholder only Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 12:59:13 +00:00
Chaim	653f441e99	docs: update agent audit report — mark all 12 issues resolved All checks were successful Build & Deploy / build-and-deploy (push) Successful in 7s Details - עדכון טבלת מצב: כל המודלים מסונכרנים (instructions = DB) - החלפת טבלת בעיות בטבלת סטטוס תיקונים עם commit references - הוסף טבלת שינויים נוספים מהסשן - הערה: Skills CMPA=6 עיצוב מכוון, verify מאשר "0 need sync" Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 12:57:54 +00:00
Chaim	c3ce0e7e1f	upgrade: upgrade opus-4-6 → opus-4-7 for all heavy-reasoning agents All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details DB: עדכון 8 סוכנים (CMP + CMPA) — CEO, מנתח, כותב, מגיה instructions: עדכון 4 קבצי הנחיות להתאמה ל-DB opus-4-7 מחליף opus-4-6 לכל הסוכנים שדורשים reasoning כבד. sonnet-4-6 נשאר ל-QA, חוקר, מייצא. deepseek-v4-pro נשאר לcurator. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 12:42:33 +00:00
Chaim	1608ea5ed0	fix: medium/low audit items — model drift, placeholders, corpus check, curator ownership All checks were successful Build & Deploy / build-and-deploy (push) Successful in 7s Details Model drift (instructions → match DB): - CEO: claude-sonnet-4-6 → claude-opus-4-6 (DB runs opus; CEO needs opus quality) - מנתח/כותב/מגיה: claude-opus-4-7 → claude-opus-4-6 (DB runs 4-6; no 4-7 in adapter) legal-proofreader.md: - {issue-id} placeholder → $PAPERCLIP_TASK_ID בשני המקומות (done + blocked) legal-researcher.md: - הוסף reference ל-HEARTBEAT.md בראש הקובץ legal-qa.md: - הבהרת שיטת בדיקת corpus_queries_logged: grep ידני בלבד, לא validate_decision CLAUDE.md (curator): - הוסף תהליך אישור הצעות curator: comment → חיים מאשר → commits ל-SKILL.md/lessons.md maxConcurrentRuns CEO: כבר 2 ב-DB — לא נדרש שינוי Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 12:35:49 +00:00
Chaim	35423eafc1	fix: high-priority agent audit items — CEO hardcoded IDs + researcher search_internal_decisions All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details CEO (legal-ceo.md): - הסרת company UUID ו-project UUID קשוחים בדוגמת יצירת issue - שימוש ב-$PAPERCLIP_COMPANY_ID לחברה - project_id נשלף דינמית מה-issue ההורה דרך $PAPERCLIP_TASK_ID researcher (legal-researcher.md): - הוסף mcp__legal-ai__search_internal_decisions לרשימת tools - הוסף סעיף 2ב.2א המסביר את ההבדל: search_decisions = דפנה בלבד; search_internal_decisions = כל ועדות הערר בכל המחוזות - הוראות מתי להשתמש + אזהרת היררכיה (ועדת ערר < מחוזי) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 12:29:47 +00:00
Chaim	a584dc3602	fix: legal-exporter — versioning, dynamic skill path, case status update All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details - טיוטה-V → טיוטה-v (lowercase) בכל המקומות (שלב 4 + כללים קריטיים) - hardcoded CMP UUID בנתיבי legal-docx SKILL → $PAPERCLIP_COMPANY_ID (תומך CMP + CMPA) - הוסף case_update לרשימת tools - הוסף שלב 4.5: עדכן סטטוס תיק ל-exported אחרי שמירת DOCX Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 12:14:24 +00:00
Chaim	d37d03f478	docs: add comprehensive agent audit 2026-05-17 All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details 7-agent parallel audit of all Paperclip agents (CEO, analyst, researcher, writer, QA, exporter, proofreader, curator). Found 12 issues including 3 critical: - Exporter: V vs v naming mismatch in DOCX versioning - Exporter: case.status not updated to exported after export - Researcher: section ז missing from case 8174-24 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 11:52:32 +00:00
Chaim	011555fb78	docs: update CLAUDE.md — webhook pipeline, scheduled jobs, paperclip_api.py All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details - Document emit_case_status_webhook flow and plugin integration - Document stale-case-reminder and weekly-feedback-analysis jobs - Fix paperclip_api.py vs paperclip_client.py (both exist, api.py is current) - Add warning: weekly-feedback-job CEO has no issueId Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 11:23:47 +00:00
Chaim	ea0532b7ba	fix: weekly-feedback-job handler writes to file only (no Paperclip issue) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m39s Details CEO wakes for weekly-feedback-job via agents.invoke without issueId, so $PAPERCLIP_TASK_ID is empty. Removed steps 4-5 (comment + close issue) from handler — now file-write only with stdout logging. Also commits pending docs and agent instructions from prior session. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 11:08:14 +00:00
Chaim	cddc7c8d24	fix: start-workflow wakeup failure now returns 502 instead of silent success All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m33s Details If pc_wake_ceo fails, the endpoint now raises HTTP 502 and skips the case_update to processing — preventing cases from silently getting stuck with no CEO running. Also adds `processing` to CEO routing table and updates case_list docstring with full status list. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 11:02:30 +00:00
Chaim	83b6ff51b7	feat: fix wizard step-skip bug + extend case edit with all fields + Paperclip title sync All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m38s Details - Fix keyboard navigation bug: React was reusing the submit button DOM element when transitioning "הבא" → "צור תיק", retaining focus and causing Enter to auto-submit step 3. Added key props to force element replacement. - CaseEditDialog now covers all wizard fields: appellants, respondents, property_address, permit_number (in addition to existing title, subject, hearing_date, expected_outcome, notes). - When case title changes, Paperclip project name is updated in background via new update_project_name() in paperclip_client.py. - Extended CaseUpdateRequest, case_update MCP tool, and caseUpdateSchema to carry the new fields end-to-end. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 10:55:45 +00:00
Chaim	8dc7a40fa2	fix: exclude exported cases from stale; add weekly-feedback-job handler to CEO All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details - /api/cases/stale: exclude 'exported' status — exported cases await Dafna's review intentionally, they are not stuck - legal-ceo.md: add routing for weekly-feedback-job reason + explicit handler (analyze feedback, update decision-lessons.md, close issue) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 10:35:39 +00:00
Chaim	a3468d5b2f	fix: use timezone-aware datetime in webhook timestamp All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m17s Details Replace deprecated datetime.utcnow() with datetime.now(timezone.utc) to avoid Python 3.12+ DeprecationWarning. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 10:15:52 +00:00
Chaim	5f43659b5a	fix: add defensive JSON parsing in check_instructions	2026-05-16 17:53:42 +00:00
Chaim	86734da210	feat: add --check-instructions, pre-flight validation, and mtime tracking to sync script - P3-T1: --check-instructions flag + check_instructions() prints a table of all agents' instructionsFilePath with status (✅ OK / ❌ MISSING / ⚠ NOT SET), size, mtime, and ⚠ DRIFT when file has changed since last sync - P3-T2: --apply now runs a pre-flight check on master agents and aborts if any instruction file is missing, before touching the DB or calling any API - P3-T3: get_claude_md_mtime() helper; --apply stamps claude_md_mtime and claude_md_last_synced into each mirror agent's metadata via the PATCH call - P3-T4: alias check-agents added to ~/.bashrc Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-16 17:51:34 +00:00
Chaim	82ded005a4	fix: add days>0 guard and limit param to stale/feedback endpoints All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details	2026-05-16 17:38:34 +00:00
Chaim	c7ed1110f8	feat: add /api/cases/stale and /api/chair-feedback/weekly-summary endpoints GET /api/cases/stale?days=N — returns cases not updated in N days (default 3) that are not in 'final' or 'new' status, with days_stale count. GET /api/chair-feedback/weekly-summary?days=N — returns chair feedback from the last N days (default 7) as a Hebrew bullet-list summary for CEO agent. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-16 17:36:12 +00:00
Chaim	015e553d06	fix: add debug log and null company_id comment to webhook scheduling All checks were successful Build & Deploy / build-and-deploy (push) Successful in 4m16s Details	2026-05-16 17:13:07 +00:00
Chaim	6bdf9786ac	feat: emit case-status webhook on status change in PUT /api/cases/:case	2026-05-16 17:10:30 +00:00
Chaim	d87f9c5a5f	fix: include case details in webhook failure warning log	2026-05-16 17:08:33 +00:00
Chaim	a0fab1f6de	feat: add emit_case_status_webhook helper	2026-05-16 17:06:37 +00:00
Chaim	d5043100a7	fix: json.loads JSONB overrides on GET — asyncpg has no codec registered All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details asyncpg returns JSONB columns as raw JSON strings when no type codec is configured (only pgvector is registered in _init_connection). The stored value is a correct JSONB array (jsonb_typeof=array confirmed), but asyncpg decodes it as str. Parse it explicitly in the GET handler so the frontend receives the correct Python list/dict. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 18:54:44 +00:00
Chaim	932cc7191c	fix: use ::text::jsonb to store methodology overrides correctly All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details asyncpg cannot encode a Python list as JSONB directly (expects str). Passing str with ::jsonb causes double-encoding (stored as JSONB string). Solution: json.dumps() the value → pass as text → PostgreSQL parses with ::text::jsonb cast, storing it as the correct JSONB array/object. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 18:38:05 +00:00
chaim	d983cfdd3b	Merge pull request 'fix: prevent JSONB double-encoding on methodology save' (#6 ) from fix/methodology-jsonb-double-encoding into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m39s Details	2026-05-10 18:34:03 +00:00
Chaim	50649baeed	fix: prevent JSONB double-encoding on methodology save Pass req.value directly to asyncpg instead of json.dumps(req.value). When a Python string was passed with ::jsonb, asyncpg encoded it as a JSONB string (not an array), causing the frontend spread operator to split it into individual characters — one textarea per character. Also fix typo in DISCUSSION_RULES default: "אסה" → "מאסה". Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 18:30:49 +00:00
Chaim	a9cd8aeb12	fix: prevent write_interim_draft context overflow (465K → ≤300K chars) Two bugs caused all 5 interim blocks to fail with "Claude CLI failed (exit 1): unknown error": 1. source_context was embedded BOTH inside the prompt template (via {source_context}) AND prepended again in write_block — doubling every block's context size (232K chars × 2 = 465K chars). 2. _build_source_context loaded all 9 case documents for every block regardless of relevance. Fixes: - Remove the duplicate source_context prepend in write_block; the template already contains it via {source_context} - Add per-block document filtering (_BLOCK_DOC_TYPES): block-he/zayin → empty, block-chet → protocol only, block-tet → appraisals only - Add 400K char guard before calling claude -p with a descriptive error (vs opaque "exit 1: unknown error") - Add prompt-size warning and size info in claude_session error messages Result: block-he 0 chars, block-zayin 0 chars, block-vav ~172K, block-chet ~45K, block-tet ~300K (all under 400K limit) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 10:49:47 +00:00
Chaim	10a63fb9e0	fix(precedents): separate court rulings from committee decisions correctly All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m37s Details - DB: add 'all_committees' virtual source_kind covering internal_committee + external_upload appeals_committee rows in one query - DB: stats now count all case_law rows (not just external_upload), fixing the precedents_total that excluded 44 internal-committee records - UI: courts table filters to source_type=court_ruling only; committees table uses the new all_committees query Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 09:59:30 +00:00
Chaim	f94201c577	feat(precedents): make citation link to detail page All checks were successful Build & Deploy / build-and-deploy (push) Successful in 34s Details Both CourtRow and CommitteeRow citation cells are now Next.js Links → /precedents/{id}, letting users navigate directly from the list. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 09:01:26 +00:00
Chaim	026457dac4	fix(precedent-edit): sync form from record without useEffect flash All checks were successful Build & Deploy / build-and-deploy (push) Successful in 36s Details Replace useEffect-based form hydration with React's approved derived-state pattern (setState-during-render). This eliminates the one-frame flash where the precedent_level Select showed "—" before useEffect fired, and fixes cases where the same record reference returned from TanStack cache caused useEffect to not re-run after save+invalidate. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 08:35:04 +00:00
chaim	75493ce233	Merge pull request 'feat: link related precedents across court instances (SCHEMA_V11)' (#4 ) from feat/related-precedents-v11 into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m41s Details Reviewed-on: #4	2026-05-10 07:54:37 +00:00
Chaim	3e14cd6798	feat: link related precedents across court instances (SCHEMA_V11) Add ability to mark case_law records as related (e.g. same appeal through ועדת ערר → מנהלי → עליון): - DB: case_law_relations join table (bidirectional, V11 migration) - DB CRUD: add/remove/get_case_law_relations - Service: get_precedent() now returns related_cases[] - MCP: precedent_link_cases + precedent_unlink_cases tools - REST: POST/DELETE /api/precedent-library/{id}/relations - UI: RelatedCasesSection on detail page with search dialog and unlink Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 07:52:29 +00:00
chaim	13a8d9e58f	Merge pull request 'feat(curator): switch Hermes Curator to DeepSeek V4-Pro via deepseek_local adapter' (#3 ) from feat/deepseek-curator-adapter into main All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m53s Details	2026-05-10 06:21:28 +00:00
Chaim	45341a0bc8	feat(curator): switch Hermes Curator to DeepSeek V4-Pro via deepseek_local adapter A/B test (2026-05-05) showed DeepSeek V4-Pro is 2-3x faster and ~20x cheaper than Sonnet for style/lexicon pattern analysis, with comparable quality. Adds adapters/deepseek-paperclip-adapter/ package, documents adapter requirements (env injection, run-id headers), updates CLAUDE.md with adapter integration notes, and records lessons from ערר 1200-25 (block order for 1xxx, "להלן מתוך" pattern, expanded factual background, bridge planning analysis, flat heading structure). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 05:58:52 +00:00
Chaim	d81c3c37ab	fix(precedent-edit): translate appeal_subtype enum values to Hebrew All checks were successful Build & Deploy / build-and-deploy (push) Successful in 34s Details The metadata extractor occasionally stuffs the practice_area enum (``betterment_levy``, ``rishuy_uvniya``, ``compensation_197``) into the free-text ``appeal_subtype`` column. The edit sheet then showed the raw English string in the "תת-סוג" input. When initialising the form, run the value through ``appealSubtypeLabel`` which maps known practice-area enum values to their Hebrew label and returns anything else unchanged. The user can then edit normally; on save the Hebrew sticks, so the next view is also clean.	2026-05-07 08:45:03 +00:00
Chaim	fff2d1c859	fix(precedent-library): per-record extraction must drain the queue too All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m36s Details reextract_metadata / reextract_halachot extract & apply but never cleared metadata_extraction_requested_at / halacha_extraction_requested_at — only the bulk worker (process_pending_extractions) did. Result: clicking "חלץ מטא-דאטה" on the edit sheet (or calling precedent_extract_metadata directly) left the row stuck in the queue forever, with the UI badge showing "ממתין לחילוץ" even after extraction succeeded. Mirror the worker's behaviour: on success ('completed' / 'no_changes' / 'no_halachot'), call db.clear_extraction_request to drain the queue. Coolify deploy required for the FastAPI container; local MCP server needs a process restart for the change to take effect (long-running).	2026-05-07 07:08:31 +00:00
Chaim	36b78ea404	fix(precedent-library): queue listing must include internal_committee too All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m36s Details Earlier commit `afcc481` opened request_metadata_extraction and request_halacha_extraction to all source kinds — but list_pending_extraction_requests still hard-filtered to external_upload. Result: stamping a queue request on an internal_committee row succeeded silently, but the worker (and the queue badge) never saw it. Even with the auto-wakeup added in `c7132ba` the CEO would wake, find 0 pending items, and exit. Drop the legacy filter so the queue listing matches the writer side. Coolify deploy required for the FastAPI container to pick this up.	2026-05-07 06:51:19 +00:00
Chaim	c7132ba0d2	feat(precedent-library): auto-trigger CEO wakeup on manual extract requests All checks were successful Build & Deploy / build-and-deploy (push) Successful in 9s Details The "חלץ מטא-דאטה" / "חלץ הלכות" buttons in the UI used to only stamp the queue (set metadata_extraction_requested_at / halacha_extraction_requested_at) and rely on a human running `mcp__legal-ai__precedent_process_pending` from local Claude Code to drain it. That left the user with an unintuitive two-step flow: click button → run local MCP tool. Meanwhile, the upload endpoint already does the right thing — after ingest succeeds it calls `pc_wake_for_precedent_extraction`, which creates a Paperclip issue, assigns it to the CEO, and wakes them to run `precedent_process_pending` automatically. Add the same wakeup call to the manual request-metadata / request-halachot endpoints. Now clicking the button is sufficient — the CEO picks it up and drains the queue without manual intervention. Best-effort: matches the upload flow's failure semantics. The queue stamp still happens even if the wakeup fails, so the user can fall back to the manual MCP tool when needed. The wakeup outcome is included in the response under `wakeup` for observability. Coolify deploy required for the FastAPI container to pick this up.	2026-05-07 06:48:51 +00:00
Chaim	171da84680	feat(precedent-library): add halacha-extract button to library list rows All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m8s Details When a precedent has not had successful halacha extraction yet, show a small wand icon between the edit and delete buttons. Clicking it queues the precedent for the local MCP worker (request-halachot endpoint). Visibility rule (`needsHalachaExtraction`): show when text extraction is complete AND halacha status is "pending without requested_at" (never tried) or "failed" (allow retry). Hide while processing, after completion, or when already queued — to avoid duplicate requests. Pairs with the metadata-extract button on the edit sheet.	2026-05-07 06:30:03 +00:00
Chaim	afcc4818a4	fix(precedent-library): allow re-extraction for internal_committee rows All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m13s Details The "חלץ מטא-דאטה" / "חלץ הלכות" buttons in the UI were returning 404 for any precedent with `source_kind != 'external_upload'`. The original restriction was meant to keep LLM extraction off internal-committee imports (their metadata supposedly came from the case file system), but the same precedent rows can still need re-extraction when ingest produces broken data — e.g. the corrupted `subject_tags` value `['[','"','ה','י',...]` that motivated this change (an early ingest stored a JSON literal into a TEXT[] column, which Postgres split into single chars). Two changes here: 1. db.request_metadata_extraction / request_halacha_extraction: drop the `AND source_kind='external_upload'` filter. The extractor already preserves user values (only fills empty fields), so this is safe. 2. precedent_metadata_extractor.extract_and_apply: detect the character-by-character corruption above and treat it as empty so the freshly-extracted tags actually replace the broken ones. Heuristic: 3+ elements where every element is at most 2 chars (legitimate tags are multi-character Hebrew words). Coolify deploy required for the FastAPI container to pick this up.	2026-05-06 19:44:13 +00:00
Chaim	bd4b0ca766	feat(mcp): case_get_final_text — fall back to PDF/DOC/RTF/TXT/MD All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m58s Details The Hermes Knowledge Curator's hermes-curator.md says it must be able to read both DOCX and PDF final decisions. The original implementation hardcoded the .docx extension only. Extend to try .docx → .pdf → .doc → .rtf → .txt → .md, returning the first match. extractor.extract_text already supports all six formats, so no extractor changes needed. If none found, the not_found response now includes the tried_extensions list so the caller knows what was attempted. Verified on case 1130-25 (.docx still picked first) and tested via `curator-cmp mcp test legal-ai`.	2026-05-05 19:18:57 +00:00
Chaim	7c9582ed04	feat(mcp): case_get_final_text — let agents read the signed final DOCX All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m36s Details The Knowledge Curator (Hermes) couldn't read סופי-{case}.docx because document_get_text only works on rows in the documents table — the final file is just a copy in the case's exports/ directory, not a tracked document. CMP-71 hit this and produced an unproductive interaction asking the user how to fix the access issue. Add a new MCP tool that: - Locates exports/סופי-{case_number}.docx via config.find_case_dir - Extracts text using the existing extractor service (python-docx based) - Returns JSON with status + text + page_count + truncation info - Optional max_chars cap for large decisions Smoke test on case 1130-25: 400-char preview returns proper Hebrew text beginning with "לפנינו ערר על החלטת הוועדה המקומית...". The local MCP server reloads on next Hermes spawn (stdio mode), so the tool is immediately available — no Coolify deploy needed. Curator's promptTemplate (DB-stored) updated to use the new tool as the primary path for reading the final. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 15:57:10 +00:00

... 3 4 5 6 7 ...

542 Commits