legal-ai

Author	SHA1	Message	Date
Chaim	cf5f6fe274	feat(paperclip): close 11 integration gaps (#16-#28) Brings the legal-ai ↔ Paperclip integration in line with the official Paperclip skill. Net effect: HEARTBEAT.md -47% (370→195 lines), all 14 agents on uniform runtime_config + budget + instructionsBundleMode, and two cross-company helpers replacing manual SQL. Highlights: - HEARTBEAT.md refactor: project-specific only, delegates to the official paperclipai/paperclip skill (loaded per agent). Adds heartbeat-context fast-path (§1.7) and PAPERCLIP_WAKE_PAYLOAD_JSON shortcut (§1.5). - Issue Thread Interactions API: legal-ceo.md now uses ask_user_questions / request_confirmation / suggest_tasks instead of free-text comments — gives chair structured UI with idempotency keys. - pc.sh + paperclip_api.pc_request: every API call goes through helpers that inject Authorization + X-Paperclip-Run-Id (audit trail). - sync_agents_across_companies.py: master(CMP)→mirror(CMPA) sync via Paperclip API, idempotent, with --verify and --apply modes. - skills/new-company-setup: 11-step blueprint distilling all 11 gaps into a single onboarding runbook for the next company. - .taskmaster: 12 tasks covering each gap (one already closed: #29). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 17:25:45 +00:00
Chaim	d4496b96f1	fix(mcp): eliminate "No such tool available" race at agent wakeup All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m26s Details When Paperclip wakes the CEO and the model issues an mcp__legal-ai__* call within ~10s of session init, Claude Code sometimes returns "No such tool available" because the legal-ai MCP server hasn't finished bringing up its tool catalog yet. Observed twice today on CMPA precedent-extraction wakeups (sessions 9989fbaf and a9c61801); the agent fell back to bash + .venv/bin/python and finished the work, but the race needed fixing on the server side. Three changes that close the window: 1. Lazy schema init (services/db.py + server.py) `init_schema()` was awaited inside the FastMCP lifespan, blocking the `initialize`/`tools/list` handshake until ~10 CREATE TABLE IF NOT EXISTS statements ran. Under contention (two CEOs waking at once for different companies) this stretched. Now the lifespan returns immediately and `get_pool()` runs the schema migrations exactly once on first DB access, guarded by an asyncio.Lock. tools/list is answered in milliseconds regardless of DB state. 2. Lazy heavy imports - services/embeddings.py: voyageai (~450ms) loaded only inside _get_client() - services/extractor.py: google.cloud.vision (~550ms) loaded only inside _get_vision_client() and _ocr_with_google_vision() These two were being imported at module top from legal_mcp.tools.documents -> services.processor -> services.{ extractor,embeddings}, so the FastMCP server couldn't even start responding until both finished. Cold start dropped from 2.7s to 1.17s end-to-end (init + tools/list response). 3. Agent-side warmup + retry guidance (.claude/agents/legal-ceo.md) Even with a fast server, the model can still race on the very first call. The precedent-extraction section now tells the CEO to call workflow_status as a warmup probe and to retry after a short sleep if it sees "No such tool available", before falling back to the python bypass. Also expanded the precedent-tool whitelists on the sub-agents that delegate halacha/library work (commits `4a9a6b7` + `7ee90dc` added the tools to the MCP server but only the CEO got them in its allowed list). Added to: legal-researcher (full extraction set), legal-analyst (library_get/list + halacha review), legal-writer (library lookups + halacha_review), legal-qa (library_get + halacha_review), and the two that the CEO was already missing (halacha_review, halachot_pending). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 20:23:14 +00:00
Chaim	7ee90dce31	feat: external precedent library with auto halacha extraction All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m27s Details Adds a third corpus of legal authority distinct from style_corpus (Daphna's prior decisions for voice) and case_precedents (chair-attached quotes per case). The new corpus holds chair-uploaded court rulings and other appeals committee decisions, with binding rules (הלכות) extracted automatically and queued for chair approval. Pipeline (web/app.py + services/precedent_library.py): file → extract → chunk → Voyage embed → halacha_extractor → store + publish progress over the existing Redis SSE channel. Schema V7 (services/db.py): extends case_law with source_kind + extraction status fields under a CHECK constraint pinning practice_area to the three appeals committee domains (rishuy_uvniya, betterment_levy, compensation_197). New precedent_chunks (vector(1024)) and halachot tables (vector(1024) over rule_statement, IVFFlat indexes, gin on practice_areas/subject_tags). Halachot start as pending_review; only approved/published rows are visible to search_precedent_library. Agents: legal-writer, legal-researcher, legal-analyst, legal-ceo, legal-qa get search_precedent_library. legal-writer prompt explains the three-corpus distinction and CREAC use; legal-qa now verifies that every cited halacha resolves to an approved row in the corpus. UI: /precedents page with four tabs — library / semantic search / pending review (J/K nav, A/R/E shortcuts, badge count) / stats. Reuses the existing upload-sheet progress + SSE pattern. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 08:38:18 +00:00
Chaim	47127f1e85	agents: close-own-issue PATCH for every agent (kill the retry loop) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 9s Details The retry loop bug we fixed in legal-analyst yesterday existed in every single sub-agent skill. They all post a comment + wake the CEO + exit, leaving their own issue in `in_progress`. Paperclip's "in_progress with no live execution" watchdog then re-wakes them, repeating until something external transitions the issue. Watched it happen on CMPA-17 (researcher) today — 4 iterations + manual SIGTERM + manual PATCH. Same fix applied to all 5 remaining agents: • legal-researcher.md • legal-writer.md • legal-qa.md • legal-exporter.md • legal-proofreader.md (file was incomplete — also added the missing שלב 5: דיווח and wake-CEO sections to bring it to parity with the other agents) Each gets a "סגור את ה-issue של עצמך — חובה!" section with two PATCH templates: one for `done` after a successful run, one for `blocked` if checks fail or output is incomplete. The section sits before the wake-CEO block, with an explicit reference to the CMPA-17 incident so the rule has a concrete anchor. Result: every agent now has the same close-issue contract. No more zombie in_progress issues, no more 4× wakeup loops. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 17:35:44 +00:00
Chaim	6b5d6586dc	Agents: voice docs awareness for qa/researcher/analyst/ceo Until now only legal-writer referenced the voice corpus. Without these references the qa agent can't validate writer output, the researcher chooses precedents outside Daphna's canon, and the analyst's claims classification doesn't match block-zayin rules. - legal-qa: adds 8th check "voice_compliance" — block ז structure, block י voice (אכן/אולם, "אנחנו" verbs, no numbered lists), correct precedent from canon, acceptance template match. - legal-researcher: must check daphna-precedent-network.md before proposing any precedent; cross-reference with Daphna's own past decisions via search_decisions. - legal-analyst: reads block-zayin-claims.md — its output is the writer's input for block ז. - legal-ceo: lists all 6 voice docs and which agent reads each. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 17:14:44 +00:00
Chaim	28daff58be	Pre-existing agent updates + analysis DOCX export Updates accumulated from prior sessions: - HEARTBEAT: company-based filtering (CMP/CMPA) rules - legal-qa, legal-researcher: routine updates - analysis_docx_exporter: new service for analysis DOCX export - compose page: "הורד כ-DOCX" button for analysis - decision_template.docx: template for exporter Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-16 18:49:10 +00:00
Chaim	45d52a74d2	Fix agent wakeup: /wake → /wakeup, remove broken DB fallback All checks were successful Build & Deploy / build-and-deploy (push) Successful in 7s Details The agents used /api/agents/{id}/wake (404) with a fallback of INSERT INTO agent_wakeup_requests. The DB insert creates only the wakeup record without a heartbeat_run, so the Paperclip dispatcher never processes it — agents get stuck in queued forever. Fix: - All agents: /wake → /wakeup (correct Paperclip API endpoint) - Remove all DB INSERT fallbacks, replace with warning - Document the rule in CLAUDE.md: always API, never DB insert - Save to memory for future conversations Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 14:18:57 +00:00
Chaim	e698419faf	Fix git not found error crashing document uploads in container All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m13s Details Install git in Docker image and wrap all subprocess git calls in try/except so a missing or failing git binary never kills an upload that already succeeded. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 12:38:40 +00:00
Chaim	3f6a130cf9	Make all agent instructions self-contained — no reliance on hope Every agent now has explicit instructions in its own definition file, not just in HEARTBEAT.md. An agent following only its own step-by-step instructions will do the right thing on any new case. All 6 non-CEO agents: explicit wakeup CEO block in completion step (curl API + psql fallback, with agent name customized) legal-ceo.md: issue template for analyst with 5 mandatory items (document mapping table, no-extract list, split large docs, wakeup CEO, blocked if failed) legal-writer.md: explicit Read of decision-methodology.md as step 1 (before case_get, not just "read before starting") legal-qa.md: methodology_compliance severity → critical (was warning — decisions without syllogisms/steel-man now blocked) legal-proofreader.md: added case_update tool + status='proofread' (was missing entirely — CEO couldn't know proofreading was done) legal-researcher.md: added case_update + mail notification (was missing — CEO couldn't know research was done) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 13:17:44 +00:00
Chaim	b1ad67dc49	Fix 12 of 15 pipeline gaps found in 1130-25 test run Test run on case 1130-25 revealed critical gaps. This commit fixes: HEARTBEAT.md (#1, #11): - Agents MUST wake CEO after completing any task (wakeup request) - New "blocked" status option — agents cannot mark "done" if something failed - Fallback: direct DB insert if API wake doesn't work legal-analyst.md (#2): - New step 6: completeness checks BEFORE finishing - Verify all appeal/response documents extracted successfully - Verify all extracted documents produced claims - Verify classification is correct (no claims from committee) - If any check fails → status = "blocked", not "done" legal-ceo.md (#3, #6, #7, #12, #13, #14, #15): - Step A rewritten with 3 sub-checks: A1: extraction completeness (no missing documents) A2: negative checks (wrong classification, abnormal counts, missing parties) A3: methodology compliance (syllogisms, CREAC prep, steel-man, etc.) - Any failure blocks progress to step B legal-qa.md (#6 reinforcement): - New step 2b: negative checks on the written decision - Missing issues, bare quotes, empty formulas, mixed findings/conclusions Also: - Synced all agent files to /home/chaim/legal-ai/ (Paperclip reads from there) - Synced methodology + lessons + corpus docs - Fixed claim classification in DB: 20 committee/applicant claims → response (#5) Remaining gaps (3): - #4: Paperclip cache may need restart to pick up new definitions - #7: Matmon document retry (25K words, 0 claims extracted) - #9: 53 appellant claims may need synthesis (high but not blocking) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 12:28:38 +00:00
Chaim	be9fa9e712	Add decision-writing methodology based on FJC, Garner, Posner sources "בית ספר להחלטות" Phase 2 — the system now has formal analytical methodology for building quasi-judicial decisions, separate from Dafna's writing style (SKILL.md) and content checklists. What was done: - Downloaded 5 authoritative sources (~341K words): FJC Judicial Writing Manual (1991+2020), Garner Legal Writing in Plain English, Posner How Judges Think, Scalia/Garner Making Your Case - Extracted principles from all sources into intermediate docs - Synthesized into docs/decision-methodology.md (3,400 words, 12 sections, 10 guiding principles) - Integrated methodology into block-yod prompt via {methodology_guidance} - Restructured legal-writer agent workflow to follow analytical stages - Made "answer all claims" flexible (bundle/skip via chair_directions) - Added methodology compliance check (#7) to legal-qa agent - Updated all knowledge files (CLAUDE.md, SKILL.md, lessons, corpus) Three-layer architecture: 1. Methodology (decision-methodology.md) — universal, how to think 2. Content checklists (lessons.py) — specific per appeal subtype 3. Style (SKILL.md) — Dafna's personal writing patterns Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 23:29:16 +00:00
Chaim	4574987a69	Add 3 new agents: legal-researcher, legal-writer, legal-qa Complete agent pipeline for decision writing: 1. legal-analyst (existing) — extract claims/responses/replies 2. legal-researcher (new) — analyze precedents, plans, protocols 3. legal-writer (new) — write decision blocks in Dafna's style 4. legal-qa (new) — validate before export (6 checks) All agents use claude_local adapter (Claude Code session, zero API cost). Each has YAML frontmatter with specific tools and detailed Hebrew instructions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 16:47:28 +00:00

12 Commits