legal-ai

Author	SHA1	Message	Date
Chaim	36ca713dfa	Retrofit: tighten yod-bet pattern, add cover-block fallback All checks were successful Build & Deploy / build-and-deploy (push) Successful in 6s Details The "על כן" pattern for block-yod-bet was too greedy and matched mid-discussion transitional sentences (e.g. "על כן, במקום בו..."), which caused forward-scan to skip block-yod-alef ("סוף דבר") via the pointer advance. Tightened to require an operative subject (אנו / הערר / הוועדה / ועדת הערר) so terminal "על כן, אנו מחליטים" still matches but mid-block transitions don't. Added structural_fallback for cover blocks (alef/bet/gimel/dalet) — these are template metadata not present in user-edited DOCX bodies. Inject zero-content anchors so apply_user_edit can still target them later. The frontend toast distinguishes real content gaps from fallback anchors. Also expanded heading patterns based on training corpus inspection: - block-vav: על המקרקעין חלות / במצב התכנוני / התכניות החלות - block-zayin: טענות העוררת - block-chet: עיקר תגובת המשיב - block-tet: הדיון בוועדת הערר For case 1130-25, this raises detection from 6/12 to 11/12 blocks — only block-yod-bet remains missing (Daphna's edit ends at "סוף דבר" + numbered ruling, no terminal "ההחלטה" or "על כן אנו מחליטים" paragraph). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 06:57:41 +00:00
Chaim	c536ed0e63	Edit document doc_type and appraiser side from the case UI All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m26s Details Until now changing a document's doc_type required a manual SQL update. Adds an inline editor on the document badge so the chair can retag without leaving the case page, and threads an appraiser_side tag (committee / appellant / deciding) through the appraisal pipeline so betterment-levy cases — which usually have 2-3 appraisers — render conflicts with the deciding appraiser's view marked as governing. Backend - New appraiser_facts.appraiser_side column (V5.1) populated from documents.metadata.appraiser_side at extraction time. - extract_appraiser_facts now returns status='sides_missing' with the list of untagged appraisals instead of running with empty side labels — chair must tag every appraisal first via the UI. - Conflict detection orders entries committee → appellant → deciding so the deciding appraiser appears last; block-tet's prompt instructs the writer to phrase the deciding appraiser's view as the governing factual finding ("ואולם, השמאי המכריע קבע..."). - New PATCH /api/cases/{n}/documents/{doc_id} (Pydantic model with whitelist validation) and matching document_update MCP tool. Both merge appraiser_side into metadata JSONB instead of touching the schema. UI - New shared doc-types module exports the canonical 11 doc_type options plus the 3 appraiser-side options; both upload-sheet and the document badge now read from it instead of duplicating Hebrew labels. - New DocumentTypeEditor renders a Popover off the doc-type Badge with two Selects. The save button stays disabled while doc_type is appraisal but no side has been picked, mirroring the backend enforcement so the user finds out before triggering extraction. - usePatchDocument React-Query mutation invalidates the case detail on success so the badge updates without a manual refresh.	2026-04-19 06:26:51 +00:00
Chaim	c619c22a51	Add pre-ruling interim draft (טיוטת ביניים) for appeals committee All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m26s Details Lets the chair generate a partial decision DOCX before the discussion-and- ruling block is decided. Same template, skill and DOCX styling as the final decision (David, RTL, bookmarks) — only the block selection and order differ: רקע (ו) → תכניות+היתרים (ט) → טענות (ז) → הליכים (ח). The opening (ה), ruling (י), summary (יא), and signatures (יב) are omitted. - New appraiser_facts table + CRUD + conflict detection in db.py (V5 schema). Conflict = same plan/permit identifier reported differently by 2+ appraisers. - New appraiser_facts_extractor service: per-appraisal Claude extraction of plans + permits with raw quotes and page numbers. - block-tet prompt extended with a permits sub-section sourced from the extracted facts, plus an explicit instruction to flag inter-appraiser conflicts in neutral wording without resolving them (deferred to block-yod). - block-chet prompt extended with a post-hearing materials context sourced from documents.metadata.is_post_hearing. - docx_exporter.export_decision now accepts mode='interim' which reorders the blocks per the chair's mental model and writes טיוטת-ביניים-v{N}.docx (versioned independently of regular drafts). - 3 new MCP tools: extract_appraiser_facts, write_interim_draft, export_interim_draft. write_interim_draft auto-runs extraction if the appraiser_facts table is empty for the case.	2026-04-18 13:28:04 +00:00
Chaim	726498126d	Add Track Changes architecture for draft revisions (CMP + CMPA) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m29s Details Fixes critical bug in 1033-25: user-uploaded עריכה-*.docx files were orphaned on disk while exports kept rebuilding from stale DB blocks. New architecture: - User-uploaded DOCX becomes the source of truth (cases.active_draft_path) - System edits via XML surgery with real Word <w:ins>/<w:del> revisions - User can Accept/Reject each change from within Word Components: - docx_reviser.py: XML surgery for Track Changes (15 tests) - docx_retrofit.py: retroactive bookmark injection with Hebrew marker detection + heading heuristic (9 tests) - docx_exporter.py: emits bookmarks around each of the 12 blocks - 3 new MCP tools: apply_user_edit, list_bookmarks, revise_draft - 4 new/updated endpoints: upload (auto-registers active draft), /exports/revise, /exports/bookmarks, /exports/{filename}/retrofit, /active-draft - DB migration: cases.active_draft_path column - UI: correct banner using real v-numbers, "מקור האמת" badge, detailed upload toast with bookmarks_added/missing_blocks - agents: legal-exporter (3 export modes), legal-ceo (stage G for revision handling), legal-writer (revision mode) Multi-tenancy: - Works for both CMP (1xxx cases) and CMPA (8xxx/9xxx cases) - New revise-draft skill added to both companies - deploy-track-changes.sh syncs skills CMP ↔ CMPA - retrofit_case.py: one-off retrofit of existing files Tests: 34 passing (15 reviser + 9 retrofit + 4 exporter bookmarks + 6 e2e) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-16 18:49:30 +00:00
Chaim	28daff58be	Pre-existing agent updates + analysis DOCX export Updates accumulated from prior sessions: - HEARTBEAT: company-based filtering (CMP/CMPA) rules - legal-qa, legal-researcher: routine updates - analysis_docx_exporter: new service for analysis DOCX export - compose page: "הורד כ-DOCX" button for analysis - decision_template.docx: template for exporter Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-16 18:49:10 +00:00
Chaim	3da4d73498	Upgrade agents to Claude Opus 4.7 All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m28s Details - legal-analyst: opus 4.6 → opus 4.7 - legal-proofreader: opus 4.6 → opus 4.7 - legal-writer: sonnet 4.6 → opus 4.7 (complex block writing benefits from stronger model) - block_writer MODEL_MAP: updated opus ID to 4.7 Opus 4.7 brings: high-res images (2576px), better file-based memory, improved DOCX generation, and task budgets for agentic loops. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 16:10:56 +00:00
Chaim	5dd24729e2	Auto-strip Nevo preambles and separate style analysis per appeal subtype - Add strip_nevo_preamble() to extractor.py — auto-removes Nevo database headers (bibliography, legislation, mini-ratio) during training upload - Add appeal_subtype column to style_patterns table — patterns are now stored per subtype instead of globally mixed - Update clear_style_patterns() to support subtype-scoped deletion - Pass appeal_subtype through analyze_corpus → store → upsert pipeline Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 14:03:06 +00:00
Chaim	ba39707c70	Add CMPA (betterment levy) training support and update methodology Support ingestion of betterment levy (היטל השבחה) decisions into a separate training corpus (CMPA). Key changes: - Add .doc file extraction via LibreOffice conversion in extractor - Add practice_area/appeal_subtype columns to style_corpus table - Route training files to cmp/ or cmpa/ subdirs based on appeal subtype - Fix derive_subtype to handle ARAR-YY-NNNN format (was matching year digit) - Expose practice_area/appeal_subtype params in MCP upload_training tool - Add appeal_subtype filter to analyze_style for per-type style analysis - Update betterment levy methodology in lessons.py: checklist (from generic to corpus-based), opening/closing strategies, and discussion rules Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 14:00:35 +00:00
Chaim	684a4cfd3b	Fix 500 error on precedents API — add default=str to json.dumps All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m41s Details UUID and datetime objects from PostgreSQL RETURNING * were not serializable. All other tool files already used default=str. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 12:11:30 +00:00
Chaim	2e2d2d42b6	Prevent status regression in case_update All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m32s Details CEO agent was reverting case status from "processing" to "new" when updating metadata fields. Added ordered status list — case_update now silently ignores status changes that would move backwards. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 17:05:40 +00:00
Chaim	82ba4663ba	Fix case repo sync + auto-create Gitea repos + add sync indicator All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m30s Details - auto-sync-cases.sh: fix broken directory scan (was looking for status subdirs that don't exist), fix env var word-splitting bug, add safe.directory handling and error logging - cases.py: auto-create Gitea repo on case_create, fix documents/original → documents/originals naming mismatch - app.py: add GET /api/cases/{case_number}/git-status endpoint - web-ui: add SyncIndicator component in case header showing sync status (synced/pending/no remote) with last commit time - pyproject.toml: add httpx dependency - CLAUDE.md: update Paperclip wakeup API docs - settings page: switch tag input from Select to free-text with datalist Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 15:28:16 +00:00
Chaim	e698419faf	Fix git not found error crashing document uploads in container All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m13s Details Install git in Docker image and wrap all subprocess git calls in try/except so a missing or failing git binary never kills an upload that already succeeded. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 12:38:40 +00:00
Chaim	2faae002e7	Add settings page for tag-to-company mappings and auto-create Paperclip projects All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m22s Details When a case is created, a Paperclip project is now automatically created in the correct company based on the appeal_subtype tag. Tag-to-company mappings are managed via a new Settings page that pulls companies from Paperclip DB. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 06:24:23 +00:00
Chaim	bd974f7791	Fix practice_area/appeal_subtype regression in search and case creation All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m55s Details The merge of ui-rewrite removed these parameters from db.search_similar() and db.create_case() but left the callers passing them, causing TypeError on any corpus search. Restores the parameters and adds schema migration. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 19:37:38 +00:00
Chaim	2d0e987803	Add missing case_precedents CRUD functions to db module All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m14s Details Four functions were called by tools/precedents.py but never implemented in services/db.py: create_case_precedent, list_case_precedents, delete_case_precedent, search_precedent_library. This caused 500 errors on the /api/cases/{n}/precedents endpoint. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 18:44:50 +00:00
Chaim	2b431e75ab	Add document preview, delete, and fix scroll in documents panel Documents tab was limited to ~9 visible items due to fixed max-height without overflow-hidden. Now uses 70vh with proper overflow. Added click-to-preview (shows extracted text in dialog) and delete button with confirmation dialog + backend DELETE endpoint. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 17:45:01 +00:00
Chaim	94bc66d7c1	Bundle FastAPI backend into Next.js Docker container The Next.js app was proxying /api/* to the old Flask/FastAPI server at legal-ai.nautilus.marcusgroup.org. When that server went down, the Next.js app's API calls failed with 503. Now both services run in the same container: - FastAPI (uvicorn) on :8000 — the API backend - Next.js (node) on :3000 — proxies /api/* to localhost:8000 Changes: - Dockerfile: multi-stage build with Python 3.12 + Node.js - next.config.ts: default proxy target is now 127.0.0.1:8000 - start.sh: launches uvicorn in background + node in foreground - pyproject.toml: add fastapi + uvicorn as explicit deps Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 14:33:52 +00:00
Chaim	bffdfe3e9d	Merge ui-rewrite into main: methodology + pipeline fixes Major changes from ui-rewrite branch: - Decision-writing methodology (decision-methodology.md) based on FJC, Garner, Posner - 5 source books downloaded and processed (341K words) - Methodology integrated into block-yod prompt - All 8 Paperclip agents updated for methodology compliance - DB schema V4: claim handling, standard of review, precedent hierarchy - 15 pipeline gaps identified and fixed after test run on case 1130-25 - Negative checks layer added to CEO and QA agents - HEARTBEAT: wakeup CEO on completion + blocked status - Flexible claim handling (bundle/skip via chair_directions) Conflicts resolved: all 5 files use ui-rewrite version (the latest). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 12:42:32 +00:00
Chaim	6cf918ad79	Add DB schema V4: methodology alignment columns New columns for methodology-aware decision pipeline: claims table: - claim_handling (address/bundle/skip) — per-claim handling mode - bundle_group — group name for bundled claims - handling_reason — explanation for skip/bundle cases table: - standard_of_review — review standard (independent discretion / etc.) - subject_categories — JSONB array of topics in the appeal case_law table: - precedent_level — hierarchy (supreme/administrative/national/district) - is_binding — binding holding vs. obiter dictum - creac_role — how it serves reasoning (rule/explanation/analogy) decisions table: - issue_order — JSONB array of ordered issues with type - claim_handling — JSONB overrides from chair_directions Migration tested and applied successfully on production DB. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 23:47:11 +00:00
Chaim	be9fa9e712	Add decision-writing methodology based on FJC, Garner, Posner sources "בית ספר להחלטות" Phase 2 — the system now has formal analytical methodology for building quasi-judicial decisions, separate from Dafna's writing style (SKILL.md) and content checklists. What was done: - Downloaded 5 authoritative sources (~341K words): FJC Judicial Writing Manual (1991+2020), Garner Legal Writing in Plain English, Posner How Judges Think, Scalia/Garner Making Your Case - Extracted principles from all sources into intermediate docs - Synthesized into docs/decision-methodology.md (3,400 words, 12 sections, 10 guiding principles) - Integrated methodology into block-yod prompt via {methodology_guidance} - Restructured legal-writer agent workflow to follow analytical stages - Made "answer all claims" flexible (bundle/skip via chair_directions) - Added methodology compliance check (#7) to legal-qa agent - Updated all knowledge files (CLAUDE.md, SKILL.md, lessons, corpus) Three-layer architecture: 1. Methodology (decision-methodology.md) — universal, how to think 2. Content checklists (lessons.py) — specific per appeal subtype 3. Style (SKILL.md) — Dafna's personal writing patterns Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 23:29:16 +00:00
Chaim	50eaa887db	Add chair feedback system and content checklists for block-yod Backend changes cherry-picked from ui-rewrite branch to enable feedback API endpoints for the Next.js staging UI. - chair_feedback DB table + API endpoints (GET/POST/PATCH) - Content checklists by appeal subtype injected into block-yod prompt - MCP tools for recording and listing chair feedback - Corpus analysis documentation (24 decisions) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 21:05:53 +00:00
Chaim	0fef20e272	Add content checklists for block-yod and chair feedback system Addresses Dafna's observation that licensing decisions lack comprehensive planning discussion. Systematic corpus analysis of all 24 training decisions revealed the system learned writing style but not substantive content. Changes: - Corpus analysis of all 24 decisions (docs/corpus-analysis.md) - 5 content checklists by appeal subtype injected into block-yod prompt - chair_feedback DB table + API endpoints + MCP tools - Feedback management page in Next.js UI (/feedback) - Navigation updated with "הערות יו״ר" link Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 20:58:28 +00:00
Chaim	e2088a4f60	Add case_precedents: attached legal support for the compose phase New self-contained table + MCP tools + FastAPI endpoints for letting the chair attach external case-law quotes (quote + citation מראה מקום, optional chair note, optional archived PDF) to either a specific threshold_claim / issue or the case as a whole. Data model - case_precedents (SCHEMA_V5_SQL) — case_id, section_id NULL/ "threshold_N"/"issue_N", quote, citation (free-text), chair_note, pdf_document_id FK to documents, denormalized practice_area for cross-case library filtering. - Deliberately NOT linked to the existing case_law table — that one has UNIQUE(case_number) which would force parsing the free-text citation into a structured key. A backfill pass into case_law is a later follow-up once the UI stabilizes. - db.py gains 4 helpers: create_case_precedent, list_case_precedents, delete_case_precedent, search_precedent_library. The last uses DISTINCT ON (citation) for the cross-case typeahead so each precedent appears once even if reused across many cases. MCP tools (legal_mcp/tools/precedents.py) - precedent_attach, precedent_list, precedent_remove, precedent_search_library — registered in server.py. FastAPI (web/app.py) - POST /api/cases/{n}/precedents — create, with PrecedentCreateRequest - POST /api/cases/{n}/precedents/upload-pdf — one-shot PDF upload to a dedicated documents/precedents/ subdirectory, creates a documents row with doc_type="precedent_archive" and no text extraction (archive only) - GET /api/cases/{n}/precedents — list - DELETE /api/precedents/{id} — uses path param since precedent_id is a UUID (slash-safe, unlike case numbers) - GET /api/precedents/search?q=...&practice_area=... — library typeahead Block-writer integration into _build_precedents_context is a deferred follow-up — Phase 1 surfaces the feature in the compose UI only. Plan: ~/.claude/plans/woolly-cooking-graham.md Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 19:16:48 +00:00
Chaim	8989ad9a9b	Add case_delete: MCP tool + DELETE endpoint + DB helper Wires a new case-deletion path across the three layers that needed it: - db.delete_case(case_id) — single SQL DELETE; documents, chunks, and qa_results cascade via existing schema FKs, audit_log nullifies. - cases_tools.case_delete(case_number, remove_files=False) — MCP tool wrapper. File tree on disk is kept by default (audit trail); pass remove_files=True for a hard delete. - DELETE /api/cases?case_number=... — FastAPI endpoint taking the case number as a QUERY param rather than a path segment. Case numbers like "1000/0426" can't be passed through a path parameter because FastAPI routing decodes %2F before matching, so a query param is the only shape that works for historical data. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 16:47:50 +00:00
Chaim	26d09d648f	Practice area separation: multi-tenant axis across DB, RAG, and UI Adds two orthogonal columns — practice_area (top-level legal domain: appeals_committee / national_insurance / labor_law) and appeal_subtype (building_permit / betterment_levy / compensation_197) — denormalized into cases, documents, document_chunks, decisions, and style_corpus so vector searches can filter without JOINs. Why: the system handles two unrelated sub-domains under the same appeals committee (1xxx building permits and 8xxx/9xxx betterment/197), with different rules and writing style. Without a separation axis, search_similar() and the block-writer's precedent lookup were free to surface betterment-levy paragraphs while drafting a building-permit decision — a real risk of cross-domain contamination. The same axis also lets future domains (national insurance, labor law) coexist without separate schemas. Schema (V4 migration in db.py): - ALTER ... ADD COLUMN IF NOT EXISTS on all five tables + composite indexes (practice_area first). - Idempotent backfill: case_number ~ '^1' → building_permit, '^8' → betterment_levy, '^9' → compensation_197; propagated to documents, chunks, and decisions via case_id; training-corpus rows (case_id NULL) default to appeals_committee. Code: - New services/practice_area.py with derive_subtype, validate, and is_override + enum constants. - db.create_case / create_document / store_chunks / create_decision inherit practice_area from the parent case (or take an explicit override for the case_id=None training corpus). - db.search_similar and search_similar_paragraphs accept practice_area + appeal_subtype filters using the denormalized columns. - tools/search.py auto-resolves the filter from case_number when given. - block_writer._build_precedents_context now passes the active case's practice_area to search_similar_paragraphs — closes the contamination hole for the discussion-block precedent fetch. - tools/cases.case_create auto-derives subtype from case_number; an explicit override that disagrees writes a case_subtype_override entry to audit_log so we can spot bad classifications later. - tools/documents.document_upload_training tags new training material with practice_area + subtype end-to-end (corpus, document, chunks). UI (web/static/index.html + web/app.py): - New-case wizard gets a practice_area dropdown (others disabled until national_insurance / labor_law arrive) and an appeal_subtype dropdown with JS auto-fill from the case-number prefix; manual edits stick. - Case header shows a blue badge with practice_area · subtype. - CaseCreateRequest plumbs both fields through to cases_tools.case_create. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 16:36:48 +00:00
Chaim	0c4886afe6	Wire legal-writer to chair directions from analysis-and-research.md Closes the loop so דפנה's positions (written inline in the UI and saved to analysis-and-research.md) automatically become binding direction for the legal-writer agent — no manual copy-paste, no bypass. Backend: - research_md.extract_chair_directions(path) returns a compact dict with status (missing/empty/partial/complete), filled_count, empty_count, and a reduced list of threshold_claims + issues each with {id, number, title, direction}. Designed to be directly usable as direction_doc by the writer. - New MCP tool: drafting.get_chair_directions(case_number) wraps the helper, resolves the case research file path via config.find_case_dir, returns formatted JSON. - Registered in server.py as mcp__legal-ai__get_chair_directions. legal-writer agent update: - Adds get_chair_directions to the tools list. - New mandatory "שלב 1ב" before any block writing: call get_chair_directions, branch on status. - missing → halt, report "legal-analyst לא רץ עדיין" - empty → halt, instruct Dafna to fill positions via the UI URL - partial → halt unless user confirms; write only filled sections - complete → proceed - New "שלב 1ג" constructs an internal direction_doc from the received chair rulings before writing block י. - Block י section expanded with 5 binding rules: 1. Open each discussion with Dafna's ruling as the thesis 2. Frame the reasoning in her style (use get_style_guide phrases) 3. Match her tone (decisive vs nuanced) 4. Must NOT contradict her position — if she disagreed with your own inclination, her position rules 5. Use legal_questions from the analysis file as the analytical structure (principle question first, concrete application second) - New bullet section for block יא: summarize each chair ruling briefly, state final outcome, close with the signed date formula. Verified all four status paths (missing/empty/partial/complete) via local test. Now Dafna's workflow is fully end-to-end: she reads the analyst report in the UI, fills "עמדת ועדת הערר" in each card, hits blur to auto-save, then triggers legal-writer — which picks up her positions as direction without any file shuffle. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 13:04:30 +00:00
Chaim	753fe0d57d	Research analysis cards with inline chair-position editor New feature on case view: the analysis-and-research.md produced by the legal-analyst agent is now rendered as structured cards in the UI, with inline editing of "עמדת ועדת הערר" that writes directly back to the markdown file (atomic rename). Backend (research_md.py): - parse(Path) → dict with header, prose sections, threshold_claims[], issues[], conclusions, other_sections - Tolerant field extractor handles both block ("LABEL:\ncontent") and inline ("LABEL: content") variants - Detects [ימולא ע"י יו"ר הוועדה] placeholder → empty chair_position - update_chair_position(path, section_id, text) locates the exact subsection by ordinal, replaces or appends the chair field, writes atomically via temp file + os.replace - Section IDs: threshold_N / issue_N (1-based) Endpoints: - GET /api/cases/{n}/research/analysis — returns parsed JSON or 404 - PATCH /api/cases/{n}/research/analysis/chair-position — {section_id, position} Frontend (#page-case): - New card "ניתוח משפטי ומחקר" below local-files card - Prose sections as justified text panels (background + gold border) - Threshold claims and issues as collapsible <details> items with gold right-border on open, numbered pills - Each item shows all extracted fields with label above content - Chair position editor: gold-wash background, 📝 icon label, textarea with placeholder prompt - onblur → PATCH with save indicator: ⏳ שומר → ✓ נשמר HH:MM → fade - Status pill next to each item title: "ממתין לעמדה" / "✓ עמדה נקבעה" - First threshold claim opens by default, rest closed - Card hidden entirely when no analysis file exists (404) Tested against real file: case 1033-25 with 3 threshold claims and 6 issues, all chair positions correctly empty, update writes only the targeted section, atomic rewrite preserves all other content. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 12:47:36 +00:00
Chaim	3e0221ccec	Management UI: corpus delete, process panel, activity feed, diagnostics - DELETE /api/training/corpus/{id} + delete button on training page, with confirmation dialog and recompute hint - /api/system/tasks + floating process panel (bottom-left) showing active background tasks with live 3s polling - /api/system/recent-activity derives a feed from cases, style_corpus, and last style_patterns run; sidebar on home page renders with relative timestamps - /api/system/diagnostics + /#/diagnostics page showing DB health, row counts per table, active tasks, stuck documents (>10 min), failed extractions - Cosmetic: signature phrase headline now prefers clean phrases over bracket-heavy templates for display Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 12:04:13 +00:00
Chaim	32f18de049	Add training corpus UI with Nevo proofreading pipeline - New proofreader service strips Nevo editorial additions (front matter, postamble, page headers, watermarks, inline codes) from DOCX/PDF/MD - PDF pages use Google Vision OCR for clean Hebrew RTL extraction - New training page at #/training with drag-and-drop upload, automatic metadata extraction (decision number, date, categories), reviewable preview, and style pattern report grouped by type - API endpoints: /api/training/{analyze,upload,corpus,patterns, analyze-style,analyze-style/status} - Fix claude_session.query to pipe prompt via stdin, avoiding ARG_MAX overflow when analyzing 900K+ char corpus - CLI scripts for batch proofreading and corpus upload Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 11:04:58 +00:00
Chaim	3f759d3610	Improve document processing pipeline and agent workflows - Add delete_document_chunks for reprocessing, save extracted text to disk - Expand case directory structure (original/extracted/proofread/backup) - Update classifier patterns (תגובה, הודעת עמדה) - Fix proofreader agent paths for new directory layout - Update HEARTBEAT to notify on every task completion - Improve bidi_table with LRE/PDF directional embedding - Add Paperclip project verification and auto-close setup issue - Add auto-sync-cases.sh for Gitea synchronization Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 16:45:49 +00:00
Chaim	22e819363e	Flatten cases directory structure and unify paths - Remove cases/new\|in-progress\|completed subdivision (status managed in DB) - Rename documents/original → documents/originals (consistent plural) - Move exports from global data/exports/ into cases/{num}/exports/ - Add documents/research/ for case law and analysis files - Update all agents, scripts, config, web API endpoints, and DB paths Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 14:33:27 +00:00
Chaim	6aaca14e31	Replace Claude Vision OCR with Google Cloud Vision Benchmark results on Hebrew legal docs (case 1130-25): - Google Vision: 1s/page, $0.001/page, high accuracy - Claude Opus Vision: 90s/page, $0.05/page, poor accuracy - PyMuPDF broken OCR layers now detected via quality check Changes: - extractor.py: Google Vision OCR with Hebrew language hint (300 DPI) - extractor.py: text quality detection (word length, words-per-line, Hebrew ratio) - extractor.py: Hebrew abbreviation quote fixer (15 known patterns) - config.py: add GOOGLE_CLOUD_VISION_API_KEY, remove ANTHROPIC_API_KEY - pyproject.toml: add google-cloud-vision, remove anthropic Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 20:17:58 +00:00
Chaim	bc72a83a71	Switch embedding model from voyage-3-large to voyage-law-2 Benchmark on case 1130-25 (4 Hebrew legal docs, 8 queries) showed: - voyage-law-2: avg top-1 score 0.5839 (+27% over voyage-3-large) - voyage-4-large: avg top-1 score 0.4119 (worse than current) - voyage-3-large: avg top-1 score 0.4589 (baseline) voyage-law-2 costs ~4.6x more per run but delivers significantly better retrieval quality for Hebrew legal text. Model is now configurable via VOYAGE_MODEL env var. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 19:05:58 +00:00
Chaim	5a8d5cac0a	Add exports panel: versioned drafts, download, upload revisions, mark final Export DOCX now saves to data/exports/{case_number}/ with auto-versioning (טיוטה-v1, v2...). The case view UI shows all drafts with download buttons, allows uploading revised versions (עריכה-v1...), and marking a version as final (copies to training corpus for style learning). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 12:10:02 +00:00
Chaim	4df2040a40	Fix: save_block_content now writes draft file + writer must update status Two issues that caused QA agent to fail: 1. save_block_content saved to DB only — now also rebuilds drafts/decision.md 2. legal-writer.md now has explicit mandatory step: case_update(status="drafted") Without these, workflow_status reports has_draft=false and QA can't run. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 15:25:53 +00:00
Chaim	96ea54dc6e	Add claim_type field: distinguish claims vs responses vs replies Legal documents have 3 types of assertions: - claim: from appeal documents (כתב ערר) - response: from original responses (כתב תשובה) - reply: from supplementary responses (תגובה, השלמת טיעון) DB: added claim_type column to claims table Extractor: _infer_claim_type() auto-detects from doc_type + title Updated existing 113 records: 29 claims, 28 responses, 56 replies Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:35:16 +00:00
Chaim	328436f56d	Remove stale classifier import from processor.py (was deleted) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 14:45:19 +00:00
Chaim	911c797eb2	Reorganize: skills/ directory + move memory to docs/ skill-legal-decision/ → skills/decision/ skill-legal-assistant/ → skills/assistant/ skill-legal-docx/ → skills/docx/ memory/*.md → docs/ Also removed: TASKS.md (use TaskMaster), classifier.py (replaced by local_classifier.py) Updated all references in CLAUDE.md, scripts, PRDs, docs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 14:27:07 +00:00
Chaim	bacb330a2a	Replace all Anthropic API calls with Claude Code session (claude -p) New module claude_session.py provides query() and query_json() that run prompts via `claude -p` CLI — uses the claude.ai session, zero API cost. Converted 6 services: - claims_extractor.py: extract_claims_with_ai - brainstorm.py: brainstorm_directions - block_writer.py: write_block (was streaming+thinking, now simple) - qa_validator.py: claims_coverage check - style_analyzer.py: 3 API calls (single pass, multi pass, synthesis) - learning_loop.py: extract_lessons Only extractor.py still uses Anthropic API (for PDF OCR with Vision). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 14:14:08 +00:00
Chaim	52ee3419d3	Add local rule-based classifier with Claude Code headless fallback Replaces API-based classifier with: 1. Filename pattern matching (covers 95%+ of legal docs) 2. Content keyword matching for ambiguous filenames 3. Claude Code headless (claude -p) fallback for edge cases No Anthropic API calls needed for classification. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 13:14:13 +00:00
Chaim	9e7492e761	Make classification and reference extraction non-fatal in document pipeline Text extraction, chunking and embedding proceed even if Claude API classification or reference extraction fails (e.g. API quota exceeded). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 13:00:34 +00:00
Chaim	5fc52ce530	Switch to cases/{new,in-progress,completed}/ directory structure Replace single CASES_DIR with find_case_dir() that searches across all status directories. New cases created in cases/new/{number}/. Config: CASES_BASE, CASES_NEW, CASES_IN_PROGRESS, CASES_COMPLETED Docker: added -v /home/chaim/legal-ai/cases:/cases volume mount Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 10:45:47 +00:00
Chaim	081c7fb17a	Replace Haiku with Sonnet in classifier for better accuracy classify_document and identify_parties both used Haiku, which produced parsing failures and 0% confidence on Beit HaKerem documents. Sonnet handles Hebrew legal documents more reliably. No more Haiku usage in the entire codebase. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 07:47:12 +00:00
Chaim	586f1db402	QA claims check: Haiku→Sonnet + filter appellant claims only Two fixes for claims_coverage false negatives (55% → expected ~85%+): 1. Model upgrade: Haiku → Sonnet for semantic matching. Haiku missed obvious matches (e.g., paragraph about "כריתת עצים" not matching claim about tree cutting). Sonnet understands context better. 2. Filter: only check appellant/respondent claims, not committee or permit_applicant claims. Committee claims are defensive positions ("the application complies with the plan") — they don't need to be "addressed" in the discussion section. 3. Send full discussion text (was truncated to 12K chars). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 07:37:23 +00:00
Chaim	9d0a73a1dc	Add context-only mode: Claude Code writes blocks, no API needed New architecture: MCP provides context, Claude Code writes. New functions: - get_block_context(case_id, block_id) → returns full context package (prompt, source docs, claims, direction, precedents, style guide) WITHOUT calling Anthropic API - save_block_content(case_id, block_id, content) → saves block to DB New MCP tools: get_block_context, save_block_content The old write_block (API-based) still works as fallback. The new flow uses Claude Code's own model (Opus 4.6, 1M context) which has no separate API billing. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 16:18:25 +00:00
Chaim	7033d2d3ee	Embed full style guide in block prompts for Dafna's voice _build_style_context rewritten from 10-line summary to comprehensive style guide including: - Tone rules per appeal type (warm for licensing, cold for levy) - 15 mandatory expressions ("כידוע", "ברי כי", "אין בידנו לקבל") - Discussion structure rules (continuous prose, conclusion first) - Per-party phrasing templates (appellants, committee, permit applicants) - DB patterns grouped by type (phrases, transitions, openings, closings) This addresses the main quality gap: style rated 2/5 because the output was "dry and overly formal" vs Dafna's "direct and clear" voice. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 16:12:09 +00:00
Chaim	e725f9ecd7	Fix claims parsing: truncated JSON recovery + chunking + compact output config.py parse_llm_json: Added truncated JSON recovery. When Claude's output is cut mid-JSON (common with long claim lists), the parser now: - Finds the last complete JSON item (closing "}") - Closes the array/object brackets - Returns partial but valid results instead of None Tested: recovers 2/3 items from truncated array, all cases pass. claims_extractor.py: - Prompt asks for compact output (150 words max per claim, group similar) - Explicitly requests "no markdown, no explanations, JSON only" - Long documents split into chunks at paragraph boundaries - Each chunk processed separately, results merged - max_tokens already at 8192 This fixes the recurring "0 claims" bug for committee responses and permit applicant responses where the JSON was getting truncated. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 16:04:34 +00:00
Chaim	7d1dc73112	Fix max_tokens to 16K for Opus (API limit is 32K, need room for thinking) block-yod max_tokens reduced from 32K to 16K — the API returned "max_tokens: 32768 > 32000" error. With thinking enabled, the actual limit for output is lower. 16K is sufficient for discussion blocks. Also: extractor.py now supports .md files (was missing, blocked Beit HaKerem upload). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 16:00:49 +00:00
Chaim	e24e24dac5	Maximize context and output per Anthropic best practices Per official Anthropic documentation (April 2026): Output tokens increased to match model capabilities: - block-yod (discussion): 8K → 32K (Opus supports 128K) - block-zayin (claims): 4K → 16K - block-vav (background): 4K → 16K - claims_extractor: 4K → 8K (fixes truncated JSON) - qa_validator: 4K → 8K Source documents sent in full (not truncated): - Was: 3000 chars per doc, 15K total - Now: full document text, no truncation - Reduces hallucinations: "extract word-for-word quotes first" Prompt structure follows long-context tips: - Source documents placed FIRST (top of prompt) - Instructions and query placed LAST - "Queries at the end improve quality by up to 30%" Extended thinking uses adaptive mode for Opus 4.6. Streaming enabled for all requests > 21K tokens. Unified JSON parsing via parse_llm_json() helper in config.py. Applied to: classifier, claims_extractor, brainstorm, qa_validator, learning_loop (5 files). Also: extractor.py now supports .md files. Sources: - https://docs.anthropic.com/en/docs/build-with-claude/extended-thinking - https://docs.anthropic.com/en/docs/build-with-claude/prompt-engineering/long-context-tips - https://docs.anthropic.com/en/docs/minimizing-hallucinations - https://docs.anthropic.com/en/docs/about-claude/models/overview Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 14:17:43 +00:00
Chaim	bed9d5c7e9	Improve block-zayin: synthesize claims by topic + fix markdown JSON parsing block_writer: Rewrote block-zayin prompt to require synthesis by topic instead of listing each claim separately. Now produces 3 organized sections (appellants 8, committee 6, permit applicants 3+) instead of 40 scattered paragraphs. Target: 800-1500 words. claims_extractor: Fix markdown code block stripping (same bug as qa_validator had). Enables parsing claims from Claude responses wrapped in ```json blocks. Tested on Hecht: block-zayin from 40 paragraphs/1049 words to 17 organized paragraphs/1039 words. Structure now matches Dafna's original (3 parties, grouped by topic). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 12:54:42 +00:00

1 2

59 Commits