legal-ai

Author	SHA1	Message	Date
Chaim	d4496b96f1	fix(mcp): eliminate "No such tool available" race at agent wakeup All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m26s Details When Paperclip wakes the CEO and the model issues an mcp__legal-ai__* call within ~10s of session init, Claude Code sometimes returns "No such tool available" because the legal-ai MCP server hasn't finished bringing up its tool catalog yet. Observed twice today on CMPA precedent-extraction wakeups (sessions 9989fbaf and a9c61801); the agent fell back to bash + .venv/bin/python and finished the work, but the race needed fixing on the server side. Three changes that close the window: 1. Lazy schema init (services/db.py + server.py) `init_schema()` was awaited inside the FastMCP lifespan, blocking the `initialize`/`tools/list` handshake until ~10 CREATE TABLE IF NOT EXISTS statements ran. Under contention (two CEOs waking at once for different companies) this stretched. Now the lifespan returns immediately and `get_pool()` runs the schema migrations exactly once on first DB access, guarded by an asyncio.Lock. tools/list is answered in milliseconds regardless of DB state. 2. Lazy heavy imports - services/embeddings.py: voyageai (~450ms) loaded only inside _get_client() - services/extractor.py: google.cloud.vision (~550ms) loaded only inside _get_vision_client() and _ocr_with_google_vision() These two were being imported at module top from legal_mcp.tools.documents -> services.processor -> services.{ extractor,embeddings}, so the FastMCP server couldn't even start responding until both finished. Cold start dropped from 2.7s to 1.17s end-to-end (init + tools/list response). 3. Agent-side warmup + retry guidance (.claude/agents/legal-ceo.md) Even with a fast server, the model can still race on the very first call. The precedent-extraction section now tells the CEO to call workflow_status as a warmup probe and to retry after a short sleep if it sees "No such tool available", before falling back to the python bypass. Also expanded the precedent-tool whitelists on the sub-agents that delegate halacha/library work (commits `4a9a6b7` + `7ee90dc` added the tools to the MCP server but only the CEO got them in its allowed list). Added to: legal-researcher (full extraction set), legal-analyst (library_get/list + halacha review), legal-writer (library lookups + halacha_review), legal-qa (library_get + halacha_review), and the two that the CEO was already missing (halacha_review, halachot_pending). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 20:23:14 +00:00
Chaim	7ee90dce31	feat: external precedent library with auto halacha extraction All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m27s Details Adds a third corpus of legal authority distinct from style_corpus (Daphna's prior decisions for voice) and case_precedents (chair-attached quotes per case). The new corpus holds chair-uploaded court rulings and other appeals committee decisions, with binding rules (הלכות) extracted automatically and queued for chair approval. Pipeline (web/app.py + services/precedent_library.py): file → extract → chunk → Voyage embed → halacha_extractor → store + publish progress over the existing Redis SSE channel. Schema V7 (services/db.py): extends case_law with source_kind + extraction status fields under a CHECK constraint pinning practice_area to the three appeals committee domains (rishuy_uvniya, betterment_levy, compensation_197). New precedent_chunks (vector(1024)) and halachot tables (vector(1024) over rule_statement, IVFFlat indexes, gin on practice_areas/subject_tags). Halachot start as pending_review; only approved/published rows are visible to search_precedent_library. Agents: legal-writer, legal-researcher, legal-analyst, legal-ceo, legal-qa get search_precedent_library. legal-writer prompt explains the three-corpus distinction and CREAC use; legal-qa now verifies that every cited halacha resolves to an approved row in the corpus. UI: /precedents page with four tabs — library / semantic search / pending review (J/K nav, A/R/E shortcuts, badge count) / stats. Reuses the existing upload-sheet progress + SSE pattern. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 08:38:18 +00:00
Chaim	47127f1e85	agents: close-own-issue PATCH for every agent (kill the retry loop) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 9s Details The retry loop bug we fixed in legal-analyst yesterday existed in every single sub-agent skill. They all post a comment + wake the CEO + exit, leaving their own issue in `in_progress`. Paperclip's "in_progress with no live execution" watchdog then re-wakes them, repeating until something external transitions the issue. Watched it happen on CMPA-17 (researcher) today — 4 iterations + manual SIGTERM + manual PATCH. Same fix applied to all 5 remaining agents: • legal-researcher.md • legal-writer.md • legal-qa.md • legal-exporter.md • legal-proofreader.md (file was incomplete — also added the missing שלב 5: דיווח and wake-CEO sections to bring it to parity with the other agents) Each gets a "סגור את ה-issue של עצמך — חובה!" section with two PATCH templates: one for `done` after a successful run, one for `blocked` if checks fail or output is incomplete. The section sits before the wake-CEO block, with an explicit reference to the CMPA-17 incident so the rule has a concrete anchor. Result: every agent now has the same close-issue contract. No more zombie in_progress issues, no more 4× wakeup loops. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 17:35:44 +00:00
Chaim	c2fb4ca08e	Voice corpus: acceptance architecture + block-zayin + decision tree All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m8s Details Three new voice docs based on deep reading of 1033-25 (full-acceptance) and 7 representative cases for block-zayin (claims summary): - daphna-acceptance-architecture.md: 5 distinct templates for case acceptance (A: internal flaw + voiding; B: remand to committee; C: corrections in request; D: substantive 8xxx; E: appraiser remand). Fixes the wrong reference in architecture-by-outcome that treated full-acceptance as a variation of partial-acceptance. - daphna-block-zayin-claims.md: rules for claims summary block — order by procedural role, neutrality, sub-headings per party, anti-patterns (numbered lists, evaluation words, premature conclusion). - daphna-decision-tree.md: operational tool that unifies all 5 voice docs into a short analytical process. Starts with the decisive question: "what is the winning evidence?". Decision trees for architecture selection, opening mode, citation choice, length by weight. Updates legal-writer.md to read decision-tree first, then the 5 voice docs, plus block-zayin.md before block ז. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 16:41:25 +00:00
Chaim	8b816c8b61	Voice corpus deep read: precedent network + architecture-by-outcome All checks were successful Build & Deploy / build-and-deploy (push) Successful in 6s Details After reading all 23 1xxx decisions from style_corpus DB (in addition to the 10 training files and 1130-25/1194-25 deep reads), synthesized two new operational documents: docs/daphna-precedent-network.md - Maps each legal issue to the specific precedent Daphna cites - 9 threshold issues (standing, השפר, סעיף 152, קנייני, פגמי פרסום, פסילה, עבירות בנייה) with her preferred quotes for each - 8 substantive issues (תכנון נקודתי vs כולל, חיקוק תכנית, סטייה ניכרת, 62א, חניה, תמ"א 38, תכניות ישנות, שימוש חורג) - Lists ~30 external precedents she cites consistently + ~15 personal precedents (her own canon — 1110/20 בעלז, 1112/22 שקופה, 1181/22 אדלר, 1130-25, etc.) - Distinguishes precedents she cites vs. those she does NOT cite docs/daphna-architecture-by-outcome.md - 7 distinct block-yod architectures keyed to outcome type: 1. Pure rejection (short, 555-2000 words) 2. Rejection after complex analysis (2500-4500) 3. Threshold dismissal + merits "ועל מנת לא לצאת בחסר" (mode F) 4. Three or more distinct issues (sub-headings) 5. Partial acceptance (full funnel architecture) 6. Joined appeals 7. Remand follow-up - Decision tree for the agent (4 questions → architecture choice) - Internal proportions table (opening 5-10%, doctrine 15-25%, etc.) - Costs matrix with 6 scenarios Updated docs/daphna-voice-fingerprint.md with section 6 (additions from 23-file corpus read): 2 new opening modes (F: threshold+merits, G: remand follow-up), nuanced sub-heading rule, self-citation of full analytical blocks, 10 new "we" verbs, 11 traditional phrases with sources, expanded costs matrix, transparency about petition outcomes, warning that 1015-24 is dissent (not Daphna's voice). Updated .claude/agents/legal-writer.md to require reading all 4 voice docs before block-yod (the "voice quartet"), with explicit decision tree integration. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 07:26:45 +00:00
Chaim	bccc0a132f	Refine voice fingerprint with full 1xxx corpus (24 cases) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m3s Details After analyzing all 24 building_permit decisions in style_corpus DB (not just the 2 local files), refined two anti-patterns: 1. Sub-headings: actually permitted when block-yod handles 3+ distinct legal issues (e.g., 1079-24 had "הבקשות לפסילה" / "מעמד המבקשת וזכות עמידה" / "עותרים ציבוריים"). The earlier rule of "no sub-headings except academic cases" was too strict — based only on small local sample. 2. Paragraph numbering: discovered it's an evolutionary pattern, not a static rule. Pre-2025 decisions had sequential paragraph numbers (1, 2, 3 throughout); recent decisions (1126-25, 1128-25, 1130-25, 1194-25) abandoned it for narrative flow. The agent should NOT add paragraph numbers — the new style. The (1)...(2)...(3)... in-paragraph enumeration ban remains absolute — 0/33 final decisions used it. Distinction now made explicit: in-paragraph enumeration ≠ paragraph-level numbering (former always forbidden; latter is evolutionary). Updated: - docs/daphna-voice-fingerprint.md — corpus stats, refined anti-patterns - .claude/agents/legal-writer.md — checklist with new distinctions Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 06:45:55 +00:00
Chaim	deb8baab5d	Inject Daphna's voice into legal-writer + corpus fingerprint All checks were successful Build & Deploy / build-and-deploy (push) Successful in 7s Details Synthesized two voice documents from corpus reading: - docs/voice-1130-25.md: deep read of case 1130-25 block-yod (5000 words), extracting the 9-movement funnel architecture, 8 reasoning templates, 10 'we' verbs with their distinct functions, the 'akhen...ulam' pattern, pacing/silence principles, and the deliberative meta-narrative. - docs/daphna-voice-fingerprint.md: cross-corpus synthesis of 10 finals (1 planning + 9 appraisal levy). Identifies 10 invariants, 5 opening modes mapped to outcome certainty, mandatory ברמ 3644/13 preamble for shamai cases, copy-paste templates, and 7 anti-patterns to avoid. Updated .claude/agents/legal-writer.md: - Added voice docs as MUST-READ before block-yod (was missing the deep voice layer; only had surface style_guide patterns) - Replaced the ' (1)...(2)...(3)...' enumeration template with the 5 opening modes (the enumeration was a known anti-pattern Daphna always removes) - Added the 'we' verbs catalog with explicit functions - Made 'אכן...אולם' pattern mandatory for issues with substantial counter-arguments (was vaguely 'אמנם...אולם') - Added mandatory ברמ 3644/13 preamble for 8xxx shamai cases - Added self-citation triple-mode (refer/defer/distinguish) — Daphna's emerging practice of building personal jurisprudence - Added 8-item anti-patterns checklist for post-write review - Replaced block-yod-alef section with proper 4-paragraph closing template (process narrative → outcome → costs → date) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 19:38:17 +00:00
Chaim	726498126d	Add Track Changes architecture for draft revisions (CMP + CMPA) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m29s Details Fixes critical bug in 1033-25: user-uploaded עריכה-*.docx files were orphaned on disk while exports kept rebuilding from stale DB blocks. New architecture: - User-uploaded DOCX becomes the source of truth (cases.active_draft_path) - System edits via XML surgery with real Word <w:ins>/<w:del> revisions - User can Accept/Reject each change from within Word Components: - docx_reviser.py: XML surgery for Track Changes (15 tests) - docx_retrofit.py: retroactive bookmark injection with Hebrew marker detection + heading heuristic (9 tests) - docx_exporter.py: emits bookmarks around each of the 12 blocks - 3 new MCP tools: apply_user_edit, list_bookmarks, revise_draft - 4 new/updated endpoints: upload (auto-registers active draft), /exports/revise, /exports/bookmarks, /exports/{filename}/retrofit, /active-draft - DB migration: cases.active_draft_path column - UI: correct banner using real v-numbers, "מקור האמת" badge, detailed upload toast with bookmarks_added/missing_blocks - agents: legal-exporter (3 export modes), legal-ceo (stage G for revision handling), legal-writer (revision mode) Multi-tenancy: - Works for both CMP (1xxx cases) and CMPA (8xxx/9xxx cases) - New revise-draft skill added to both companies - deploy-track-changes.sh syncs skills CMP ↔ CMPA - retrofit_case.py: one-off retrofit of existing files Tests: 34 passing (15 reviser + 9 retrofit + 4 exporter bookmarks + 6 e2e) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-16 18:49:30 +00:00
Chaim	3da4d73498	Upgrade agents to Claude Opus 4.7 All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m28s Details - legal-analyst: opus 4.6 → opus 4.7 - legal-proofreader: opus 4.6 → opus 4.7 - legal-writer: sonnet 4.6 → opus 4.7 (complex block writing benefits from stronger model) - block_writer MODEL_MAP: updated opus ID to 4.7 Opus 4.7 brings: high-res images (2576px), better file-based memory, improved DOCX generation, and task budgets for agentic loops. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 16:10:56 +00:00
Chaim	45d52a74d2	Fix agent wakeup: /wake → /wakeup, remove broken DB fallback All checks were successful Build & Deploy / build-and-deploy (push) Successful in 7s Details The agents used /api/agents/{id}/wake (404) with a fallback of INSERT INTO agent_wakeup_requests. The DB insert creates only the wakeup record without a heartbeat_run, so the Paperclip dispatcher never processes it — agents get stuck in queued forever. Fix: - All agents: /wake → /wakeup (correct Paperclip API endpoint) - Remove all DB INSERT fallbacks, replace with warning - Document the rule in CLAUDE.md: always API, never DB insert - Save to memory for future conversations Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 14:18:57 +00:00
Chaim	089a8b3a08	Route user comments through CEO agent + add draft/attachment awareness All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m42s Details When a user comments on a Paperclip issue, the built-in automation wakes the assigned agent directly, bypassing the CEO. This meant user instructions (like "read the uploaded draft and route to the right agent") were ignored. Changes: - Plugin: add issue.comment.created event handler that wakes the CEO agent with the comment context (plugin-legal-ai, separate repo) - HEARTBEAT: add steps 2b (read recent user comments) and 2c (check attachments) before agents start working - CEO agent: add comment-routing section — read, check attachments, route - Writer agent: add step 0 — check for uploaded DOCX drafts before writing Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 13:47:43 +00:00
Chaim	e698419faf	Fix git not found error crashing document uploads in container All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m13s Details Install git in Docker image and wrap all subprocess git calls in try/except so a missing or failing git binary never kills an upload that already succeeded. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 12:38:40 +00:00
Chaim	3f6a130cf9	Make all agent instructions self-contained — no reliance on hope Every agent now has explicit instructions in its own definition file, not just in HEARTBEAT.md. An agent following only its own step-by-step instructions will do the right thing on any new case. All 6 non-CEO agents: explicit wakeup CEO block in completion step (curl API + psql fallback, with agent name customized) legal-ceo.md: issue template for analyst with 5 mandatory items (document mapping table, no-extract list, split large docs, wakeup CEO, blocked if failed) legal-writer.md: explicit Read of decision-methodology.md as step 1 (before case_get, not just "read before starting") legal-qa.md: methodology_compliance severity → critical (was warning — decisions without syllogisms/steel-man now blocked) legal-proofreader.md: added case_update tool + status='proofread' (was missing entirely — CEO couldn't know proofreading was done) legal-researcher.md: added case_update + mail notification (was missing — CEO couldn't know research was done) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 13:17:44 +00:00
Chaim	be9fa9e712	Add decision-writing methodology based on FJC, Garner, Posner sources "בית ספר להחלטות" Phase 2 — the system now has formal analytical methodology for building quasi-judicial decisions, separate from Dafna's writing style (SKILL.md) and content checklists. What was done: - Downloaded 5 authoritative sources (~341K words): FJC Judicial Writing Manual (1991+2020), Garner Legal Writing in Plain English, Posner How Judges Think, Scalia/Garner Making Your Case - Extracted principles from all sources into intermediate docs - Synthesized into docs/decision-methodology.md (3,400 words, 12 sections, 10 guiding principles) - Integrated methodology into block-yod prompt via {methodology_guidance} - Restructured legal-writer agent workflow to follow analytical stages - Made "answer all claims" flexible (bundle/skip via chair_directions) - Added methodology compliance check (#7) to legal-qa agent - Updated all knowledge files (CLAUDE.md, SKILL.md, lessons, corpus) Three-layer architecture: 1. Methodology (decision-methodology.md) — universal, how to think 2. Content checklists (lessons.py) — specific per appeal subtype 3. Style (SKILL.md) — Dafna's personal writing patterns Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 23:29:16 +00:00
Chaim	0c4886afe6	Wire legal-writer to chair directions from analysis-and-research.md Closes the loop so דפנה's positions (written inline in the UI and saved to analysis-and-research.md) automatically become binding direction for the legal-writer agent — no manual copy-paste, no bypass. Backend: - research_md.extract_chair_directions(path) returns a compact dict with status (missing/empty/partial/complete), filled_count, empty_count, and a reduced list of threshold_claims + issues each with {id, number, title, direction}. Designed to be directly usable as direction_doc by the writer. - New MCP tool: drafting.get_chair_directions(case_number) wraps the helper, resolves the case research file path via config.find_case_dir, returns formatted JSON. - Registered in server.py as mcp__legal-ai__get_chair_directions. legal-writer agent update: - Adds get_chair_directions to the tools list. - New mandatory "שלב 1ב" before any block writing: call get_chair_directions, branch on status. - missing → halt, report "legal-analyst לא רץ עדיין" - empty → halt, instruct Dafna to fill positions via the UI URL - partial → halt unless user confirms; write only filled sections - complete → proceed - New "שלב 1ג" constructs an internal direction_doc from the received chair rulings before writing block י. - Block י section expanded with 5 binding rules: 1. Open each discussion with Dafna's ruling as the thesis 2. Frame the reasoning in her style (use get_style_guide phrases) 3. Match her tone (decisive vs nuanced) 4. Must NOT contradict her position — if she disagreed with your own inclination, her position rules 5. Use legal_questions from the analysis file as the analytical structure (principle question first, concrete application second) - New bullet section for block יא: summarize each chair ruling briefly, state final outcome, close with the signed date formula. Verified all four status paths (missing/empty/partial/complete) via local test. Now Dafna's workflow is fully end-to-end: she reads the analyst report in the UI, fills "עמדת ועדת הערר" in each card, hits blur to auto-save, then triggers legal-writer — which picks up her positions as direction without any file shuffle. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 13:04:30 +00:00
Chaim	4df2040a40	Fix: save_block_content now writes draft file + writer must update status Two issues that caused QA agent to fail: 1. save_block_content saved to DB only — now also rebuilds drafts/decision.md 2. legal-writer.md now has explicit mandatory step: case_update(status="drafted") Without these, workflow_status reports has_draft=false and QA can't run. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 15:25:53 +00:00
Chaim	4574987a69	Add 3 new agents: legal-researcher, legal-writer, legal-qa Complete agent pipeline for decision writing: 1. legal-analyst (existing) — extract claims/responses/replies 2. legal-researcher (new) — analyze precedents, plans, protocols 3. legal-writer (new) — write decision blocks in Dafna's style 4. legal-qa (new) — validate before export (6 checks) All agents use claude_local adapter (Claude Code session, zero API cost). Each has YAML frontmatter with specific tools and detailed Hebrew instructions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 16:47:28 +00:00

17 Commits