legal-ai

Author	SHA1	Message	Date
Chaim	cbc7a1e336	feat(precedents): formal citation per Israeli citation rules + copy/edit UI All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m25s Details Until now, "case_number" was the only stored identifier for a precedent. But a citation per the Israeli unified citation rules is a different beast — it has bold parties, an unbold prefix (court abbrev + panel/ district parenthetical + case number), and an unbold trailing reporter (נבו / פ"ד...). Without storing it as a first-class field we couldn't hand the chair a one-click "copy as citation" experience for pasting into decisions. Changes: - Schema V19: case_law.citation_formatted TEXT (Markdown — parties wrapped in … so the copy helper can render <strong> for Word/Docs paste and keep plain-text fallback meaningful). - Metadata extractor: composes citation_formatted from the document text per the unified citation rules, with worked examples for ע"א / עת"מ / ערר / בל"מ in the prompt. Refuses to store half-formed strings. - PATCH /api/precedent-library/{id} accepts citation_formatted so the chair can correct LLM mistakes. - /precedents/[id]: dedicated "מראה מקום" block with bold rendering, a copy-to-clipboard button (text/html + text/plain so Word keeps the bolds), and an inline edit textarea. - /precedents list rows: link displays the formatted citation when available, with a small inline copy button — falls back to the bare case_number for older rows. Backfill of existing rows happens by re-stamping the extraction queue once V19 has rolled out and the new field is reachable. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 07:14:34 +00:00
Chaim	a02a4e3a64	feat(precedents): minimum-effort upload - file+citation, rest auto-extracted All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m35s Details The missing-precedents drawer + general precedent upload both required the user to type chair_name, district, practice_area, court, date etc. upfront, even though those fields can be (and already are, post-upload) extracted from the document text by the LLM. The metadata-extraction wakeup also only fired for the /precedent-library/upload path, leaving missing-precedents committee uploads stuck with whatever stub the user typed. Changes: - Extractor learns chair_name + district, overwrites the new PLACEHOLDER_PENDING_EXTRACTION sentinel for internal_committee rows (the DB CHECK forces non-empty; we stamp the placeholder at insert). - missing_precedent_upload no longer 400s on missing chair/district; it infers district from the citation when possible, falls back to the placeholder, and always fires pc_wake_for_precedent_extraction so the LLM can fill in the rest. - Both upload sheets default to file (+ citation) only; every other field is tucked into a closed <details> labeled "אופציונלי — דריסה ידנית של שדות שיחולצו אוטומטית" ("Optional: manual override of fields that will be extracted automatically"). Required validators on chair/district/practice_area dropped; the LLM fills them. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 14:43:25 +00:00
Chaim	b01722b1b4	feat: emit missing_precedent + export_complete webhooks to plugin All checks were successful Build & Deploy / build-and-deploy (push) Successful in 9s Details Adds two webhook emitters in paperclip_api.py that the plugin's onWebhook handler now routes by ``eventType``: * ``emit_missing_precedent_webhook(...)`` — fires from POST /api/missing-precedents on first insert (non-duplicate). The plugin surfaces an askUserQuestions interaction on the linked issue so Daphna can choose upload / irrelevant / defer without needing to open the legal-ai UI. * ``emit_export_complete_webhook(...)`` — fires from POST /api/cases/{n}/export-docx after a successful export. The plugin attaches a "final-decision" markdown document with a download link to the linked Paperclip issue. Both are fire-and-forget BackgroundTasks — failures are logged but never block the originating request. Company resolution follows the same 1xxx→licensing / 8-9xxx→betterment rule used by emit_case_status_webhook. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-26 13:29:04 +00:00
Chaim	2aee398b4a	feat: Stage C - RAG advanced (#33, #47, #48, #49, #50, #51) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m35s Details Six independent sub-tasks dispatched in parallel; aggregated here. ## #33 - Hide case_name column library-list-panel.tsx: `<TableHead>` + `<TableCell>` for "שם" ("name") get `className="hidden"` in both Court and Committee row variants. DB column preserved for future use. ## #47 - Periodic audit script New scripts/audit_corpus_integrity.py - 3 SQL checks (external+ערר prefix, internal missing chair/district, cases.practice_area enum) + CEO wakeup on violations + cron `0 7 * * *`. First run: 0 issues. ## #48 - Parent-doc retrieval (gated, default off) Schema V17: precedent_chunks.parent_chunk_id + chunk_role ('child'|'parent'). New chunker.chunk_document_hierarchical() - section-aware parents (~1500 tokens) containing ~5 overlapping children (~300 tokens each). New db.store_precedent_chunks_hierarchical two-pass writer. When the flag is on, the search SQL (semantic + lexical) LEFT-JOINs the parent, swaps in its content, and dedupes by parent_chunk_id. Toggle: PARENT_DOC_RETRIEVAL_ENABLED + PARENT_DOC_{CHILD,PARENT}_SIZE_TOKENS. Backfill ~3min and ~$0.20 - deferred to follow-up. ## #49 - Multimodal backfill New scripts/backfill_multimodal_precedents.py with token-matching case_number ↔ source files (PDF + DOCX via PyMuPDF). Ran in container: 26 precedents embedded, 503 pages, $0.21, 0 errors. precedent_image_embeddings grew 3 → 29 rows. 44 remaining are style_corpus-migrated rows (no source file on disk) - will catch up when re-uploaded. ## #50 - Closed-loop feedback + nDCG Schema V18: search_logs + search_relevance_feedback. New telemetry.py with fire-and-forget log_search_bg (p50 = 0.002ms, negligible overhead) + auto-infer_relevance_from_citations (reads case drafts → marks score=3 when cited precedent appears in past search top-K). Hooks added to 5 search paths. scripts/compute_ndcg.py for aggregation. 
Two admin API endpoints (GET /api/admin/rag-metrics + POST .../infer). Dashboard UI deferred - API is enough for now. ## #51 - Halacha quality monitoring New scripts/monitor_halacha_quality.py - baseline avg confidence (trusted=0.849, all=0.833, pending=0.694) with rolling window drift detection. Default 5% threshold. Exits non-zero on alert for cron integration. Recommended: `0 8 * * 1` (weekly, Mon 8am). ## Bonus: 230 unlinked citations → missing_precedents Bulk-imported 230 distinct unlinked citations from precedent_internal_citations to missing_precedents.status='open', party='committee', with notes listing source citers. Top candidate: ע"א 3213/97 (cited 5x). Total open missing_precedents now 237. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-26 11:26:52 +00:00
Chaim	3a05e30c8d	fix(appraiser-facts): route extraction through analyst wakeup (was silent 0) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m38s Details The "חלץ עובדות שמאיות" ("Extract appraisal facts") UI button hit POST /api/cases/{n}/extract-appraiser-facts, which called appraiser_facts_extractor inline. The extractor shells out to the local `claude` CLI, which is absent in the Coolify container, so every doc errored, the per-doc try/except swallowed the error, and the response was "completed, 0 facts". Refactored the endpoint to wake the legal-analyst of the correct company via Paperclip (same pattern as wake_curator_for_final) and to surface extraction_failed instead of "completed" when every doc errored.	2026-05-26 11:02:55 +00:00
Chaim	d32452f95c	fix(api): include proceeding_type in /api/cases list response All checks were successful Build & Deploy / build-and-deploy (push) Successful in 9s Details The cases-table reads from the list endpoint, not /details, so without proceeding_type in the row payload the בל"מ badge can't render for cases that flipped the field manually (only the legacy appeal_subtype LIKE 'extension_request_%' path was firing). Added the field to both detail=false and detail=true branches. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 10:01:24 +00:00
Chaim	d359ab9884	feat(proceeding-type): explicit ערר/בל"מ field for cases + corpus All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m40s Details Same case_number can exist as both a regular appeal (ערר) and an extension-of-time request (בל"מ), and we were inferring the difference from appeal_subtype prefixes — fragile, and case-number lookups weren't disambiguated. Now stored as a first-class field on both case_law (corpus) and cases (live cases), with partial unique indexes on (case_number, proceeding_type). - SCHEMA_V15: column + CHECK constraints + backfill from appeal_subtype LIKE 'extension_request_%' + partial unique indexes replace the old global UNIQUE(case_number). - derive_proceeding_type() centralizes the inference rule (extension_request_* → בל"מ; subject regex fallback; default ערר). - Metadata extractor prompt asks Claude to populate the new field explicitly; apply_to_record writes it for internal_committee rows. - internal_decision_upload, case_create, case_update accept an optional proceeding_type; FastAPI request models expose it. - Wizard + edit dialog get a sided Select; case header renders the resolved label (ערר / בל"מ). - Uploaded the 2 staged בל"מ decisions on betterment levy: 8126/24 (סופר נוח, 13 chunks), 8047/23 (הרנון, 48 chunks). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 09:17:33 +00:00
Chaim	f3cc9ca9d4	feat: Stage A finalizers + #35/#36/#37 — critical-gap closure Some checks failed Build & Deploy / build-and-deploy (push) Has been cancelled Details Four parallel sub-agents closed the remaining critical gaps from the 26/05 Stage A/B sprint. Each block independently tested; aggregated here. ## #30/#31 finalizers (sub-agent A) * Auto-derive practice_area in case_create from case_number prefix (1xxx→rishuy_uvniya, 8xxx→betterment_levy, 9xxx→compensation_197); default for CaseCreateRequest is now "" (the DB constraint catches any stray "appeals_committee"). * practice_area.py: derive_subtype now handles axis-B domain values (rishuy_uvniya/betterment_levy/compensation_197) without parsing the case number; new helper derive_domain_practice_area(). * Halacha re-extraction verified unnecessary — all 6 reclassified records already had is_binding=false and approved halachot. * Regression tests: 6 cases in tests/test_corpus_constraints.py covering practice_area enum, internal-committee chair/district, external-upload arar prefix, MCP guard. * UI: district input → Select dropdown (7 districts) in precedent-edit-sheet.tsx, preserving legacy free-text values. ## #37 בל"מ subtypes (sub-agent B) * 3 new appeal_subtypes: extension_request_{building_permit, betterment_levy,compensation}. APPEALS_COMMITTEE_SUBTYPES extended, SUBTYPES_BY_AREA mappings added. * New helpers: is_blam_subject(), is_blam_subtype(), derive_subtype_with_blam(case_number, subject, practice_area). case_create now uses it to auto-detect "בקשה להארכת מועד" subjects. * 3 methodology templates under docs/methodology/extension-request-.md. paperclip_client.py mapping updated for the 3 new subtypes (extension_request_building_permit→CMP, the other two→CMPA). * Frontend: bilingual "בל"מ" badge + filter dropdown on cases list + detail header; appeal-type-bars collapseBlam() merges בל"מ into its parent domain for aggregate bars. * Wizard auto-detects בל"מ from subject during case creation. 
* 3 Berlinger cases (1017/1018/1019-03-26) migrated to appeal_subtype=extension_request_building_permit via psql. ## #35 missing_precedents feature (sub-agent C) * Schema V13: missing_precedents table (citation, case_id, party, legal_topic, status, linked_case_law_id, claim_quote, ...) + FK constraints + 3 indexes. Applied via psql + idempotent migration. * 6 db.py service functions, 3 MCP tools, 6 FastAPI endpoints (POST/GET/PATCH/DELETE/upload — upload routes by citation prefix to ingest_internal_decision or ingest_precedent). * Next.js page /missing-precedents with 5 status tabs + filters + sidebar badge counter + detail drawer with metadata edit + smart upload form that switches fields per committee/court. * Bootstrap: 7 rows imported from the JSON file (3 citations × cases, all status=closed with linked_case_law_id). * legal-researcher.md: new §2ב.5 with missing_precedent_create usage + dedup semantics + tool grant. ## #36 legal_arguments aggregation (sub-agent D) * Schema V14: legal_arguments + legal_argument_propositions M:M. Applied via psql. * New service argument_aggregator.py with two functions — aggregate_claims_to_arguments() (Claude CLI / claude_session) and get_legal_arguments(). Graceful llm_unavailable handling when CLI is missing (containers). * 2 MCP tools + 2 API endpoints (POST .../aggregate-arguments as BackgroundTask, GET .../legal-arguments). * Frontend: shadcn Accordion + new legal-arguments-panel.tsx with hierarchical (party → priority badge → arguments) display, "טיעונים" tab on the case page, "חשב/חשב מחדש" buttons. * scripts/backfill_legal_arguments.py + SCRIPTS.md entry — dry-run found 8 candidate cases including 1017/1018/1019. ## Open follow-ups (intentionally deferred) * npm run api:types in web-ui (CLAUDE.md flow) — recommended before the next UI commit; not required for backend deployment. * Run backfill_legal_arguments.py --apply once the container picks up the new aggregator service. 
* webhook on missing-precedents upload-close to Paperclip (optional). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-26 08:34:40 +00:00
Chaim	b368bce690	fix: handle invalid date formats gracefully and add missing dialog descriptions All checks were successful Build & Deploy / build-and-deploy (push) Successful in 4m14s Details - Wrap date.fromisoformat() in try/except in case_update tool — prevents unhandled ValueError from surfacing as 500; FastAPI now catches it as 422 - Add DialogDescription (sr-only) to 5 dialogs missing aria-describedby: documents-panel preview + delete, drafts-panel delete + feedback, link-related-dialog Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-25 15:53:01 +00:00
Chaim	1496e520fd	feat(precedent-library): add district and chair_name to edit form All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m11s Details Fields existed in DB and Precedent type but were missing from: - PrecedentUpdateRequest (backend model) - update_case_law allowed set (db layer) - PrecedentPatch (frontend type) - precedent-edit-sheet form state, inputs, and patch payload Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-19 12:16:43 +00:00
Chaim	1da2a9a2cb	fix: exclude archived cases from stale-case-reminder All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details Archived cases have archived_at IS NOT NULL — they are not "stuck", they are done. The stale query was missing this filter. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 16:41:38 +00:00
Chaim	cddc7c8d24	fix: start-workflow wakeup failure now returns 502 instead of silent success All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m33s Details If pc_wake_ceo fails, the endpoint now raises HTTP 502 and skips the case_update to processing — preventing cases from silently getting stuck with no CEO running. Also adds `processing` to CEO routing table and updates case_list docstring with full status list. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 11:02:30 +00:00
Chaim	83b6ff51b7	feat: fix wizard step-skip bug + extend case edit with all fields + Paperclip title sync All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m38s Details - Fix keyboard navigation bug: React was reusing the submit button DOM element when transitioning "הבא" ("Next") → "צור תיק" ("Create case"), retaining focus and causing Enter to auto-submit step 3. Added key props to force element replacement. - CaseEditDialog now covers all wizard fields: appellants, respondents, property_address, permit_number (in addition to existing title, subject, hearing_date, expected_outcome, notes). - When case title changes, Paperclip project name is updated in background via new update_project_name() in paperclip_client.py. - Extended CaseUpdateRequest, case_update MCP tool, and caseUpdateSchema to carry the new fields end-to-end. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 10:55:45 +00:00
Chaim	8dc7a40fa2	fix: exclude exported cases from stale; add weekly-feedback-job handler to CEO All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details - /api/cases/stale: exclude 'exported' status — exported cases await Dafna's review intentionally, they are not stuck - legal-ceo.md: add routing for weekly-feedback-job reason + explicit handler (analyze feedback, update decision-lessons.md, close issue) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-17 10:35:39 +00:00
Chaim	82ded005a4	fix: add days>0 guard and limit param to stale/feedback endpoints All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details	2026-05-16 17:38:34 +00:00
Chaim	c7ed1110f8	feat: add /api/cases/stale and /api/chair-feedback/weekly-summary endpoints GET /api/cases/stale?days=N — returns cases not updated in N days (default 3) that are not in 'final' or 'new' status, with days_stale count. GET /api/chair-feedback/weekly-summary?days=N — returns chair feedback from the last N days (default 7) as a Hebrew bullet-list summary for CEO agent. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-16 17:36:12 +00:00
Chaim	015e553d06	fix: add debug log and null company_id comment to webhook scheduling All checks were successful Build & Deploy / build-and-deploy (push) Successful in 4m16s Details	2026-05-16 17:13:07 +00:00
Chaim	6bdf9786ac	feat: emit case-status webhook on status change in PUT /api/cases/:case	2026-05-16 17:10:30 +00:00
Chaim	d5043100a7	fix: json.loads JSONB overrides on GET — asyncpg has no codec registered All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details asyncpg returns JSONB columns as raw JSON strings when no type codec is configured (only pgvector is registered in _init_connection). The stored value is a correct JSONB array (jsonb_typeof=array confirmed), but asyncpg decodes it as str. Parse it explicitly in the GET handler so the frontend receives the correct Python list/dict. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 18:54:44 +00:00
Chaim	932cc7191c	fix: use ::text::jsonb to store methodology overrides correctly All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details asyncpg cannot encode a Python list as JSONB directly (expects str). Passing str with ::jsonb causes double-encoding (stored as JSONB string). Solution: json.dumps() the value → pass as text → PostgreSQL parses with ::text::jsonb cast, storing it as the correct JSONB array/object. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 18:38:05 +00:00
Chaim	50649baeed	fix: prevent JSONB double-encoding on methodology save Pass req.value directly to asyncpg instead of json.dumps(req.value). When a Python string was passed with ::jsonb, asyncpg encoded it as a JSONB string (not an array), causing the frontend spread operator to split it into individual characters — one textarea per character. Also fix typo in DISCUSSION_RULES default: "אסה" → "מאסה". Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 18:30:49 +00:00
Chaim	3e14cd6798	feat: link related precedents across court instances (SCHEMA_V11) Add ability to mark case_law records as related (e.g. same appeal through ועדת ערר → מנהלי → עליון): - DB: case_law_relations join table (bidirectional, V11 migration) - DB CRUD: add/remove/get_case_law_relations - Service: get_precedent() now returns related_cases[] - MCP: precedent_link_cases + precedent_unlink_cases tools - REST: POST/DELETE /api/precedent-library/{id}/relations - UI: RelatedCasesSection on detail page with search dialog and unlink Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 07:52:29 +00:00
Chaim	c7132ba0d2	feat(precedent-library): auto-trigger CEO wakeup on manual extract requests All checks were successful Build & Deploy / build-and-deploy (push) Successful in 9s Details The "חלץ מטא-דאטה" ("Extract metadata") / "חלץ הלכות" ("Extract halachot") buttons in the UI used to only stamp the queue (set metadata_extraction_requested_at / halacha_extraction_requested_at) and rely on a human running `mcp__legal-ai__precedent_process_pending` from local Claude Code to drain it. That left the user with an unintuitive two-step flow: click button → run local MCP tool. Meanwhile, the upload endpoint already does the right thing: after ingest succeeds it calls `pc_wake_for_precedent_extraction`, which creates a Paperclip issue, assigns it to the CEO, and wakes them to run `precedent_process_pending` automatically. Add the same wakeup call to the manual request-metadata / request-halachot endpoints. Now clicking the button is sufficient: the CEO picks it up and drains the queue without manual intervention. Best-effort: matches the upload flow's failure semantics. The queue stamp still happens even if the wakeup fails, so the user can fall back to the manual MCP tool when needed. The wakeup outcome is included in the response under `wakeup` for observability. Coolify deploy required for the FastAPI container to pick this up.	2026-05-07 06:48:51 +00:00
Chaim	3be676e062	fix(api_mark_final): remove ingest_final_version call from container All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details ingest_final_version uses claude_session internally, which requires the Claude CLI binary (not present in the legal-ai FastAPI container). The call always failed with "Claude CLI not found" — caught by try/except but noisy. Replace with a static skipped status + comment pointing to the architectural rule. Run ingest_final_version manually via Claude Code / MCP from the local host when populating case_law is desired. The curator wakeup hook remains and works correctly. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 14:52:38 +00:00
Chaim	799b950961	feat(curator): trigger Knowledge Curator from api_mark_final, drop CEO F2 All checks were successful Build & Deploy / build-and-deploy (push) Successful in 8s Details The previous F2 stage in legal-ceo.md fired after the first DOCX export, which was too early, since the user often iterates with עריכה-* ("edit-*") uploads after the first export. The true "this is Dafna's chosen final" signal is the "סמן כסופי" ("Mark as final") button in the UI, which calls api_mark_final. This commit moves the curator wakeup from the CEO's instructions to a direct hook in api_mark_final: - web/paperclip_client.py: add CURATOR_AGENTS dict (CMP + CMPA UUIDs) and wake_curator_for_final() helper. Looks up main case issue, creates a child issue assigned to the curator, tags plugin_state for case visibility, and triggers wakeup via Paperclip API. - web/app.py: api_mark_final now calls workflow_tools.ingest_final_version (so case_law table finally gets populated for search_decisions) and pc_wake_curator_for_final. Both are best-effort; failure does not block marking final. - legal-ceo.md: remove F2 stage, leave only the agents-table reference noting the curator runs from api_mark_final. - hermes-curator.md: update activation description to reflect the new flow. Result: curator runs only when Chaim deliberately clicks "סמן כסופי", on the actual final file, with no risk of analyzing a draft that will later change. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 14:47:03 +00:00
Chaim	c0f67ab841	feat(precedents): split library into court rulings + appeals committee tables All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m34s Details - /api/precedent-library now accepts source_kind param (default external_upload) - list_external_case_law returns chair_name/district fields - LibraryListPanel renders two separate tables with appropriate columns - internal_decisions migration: added queue_halachot param to defer extraction - Fixed practice_area mapping from style_corpus (appeals_committee → proper enum) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-04 18:49:32 +00:00
Chaim	92a2763b86	feat: add internal committee decisions corpus (source_kind='internal_committee') All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m31s Details Three-layer separation: style learning (style_corpus), appeals-committee decisions (internal_committee), and court rulings (external_upload). - SCHEMA_V10: chair_name + district columns on case_law and cases, partial indexes - create_internal_committee_decision() DB upsert function - search_precedent_library_semantic() now accepts source_kind/district/chair_name params - search_precedent_library_hybrid() passes through new params - services/internal_decisions.py: ingest_internal_decision, migrate_from_style_corpus, migrate_from_external_corpus (identifies rows via source_type='appeals_committee') - search_internal_decisions() MCP tool (server.py + tools/search.py) - internal_decision_migrate() MCP admin tool - Web endpoints: POST /api/internal-decisions/upload, POST /api/internal-decisions/migrate, GET /api/internal-decisions - ingest_final_version auto-ingests finalized decisions into internal corpus - SKILL.md updated: agents now search internal + external in parallel, present separately Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 18:33:39 +00:00
Chaim	69e153b3db	fix(settings/agents): exclude noise from drift detection All checks were successful Build & Deploy / build-and-deploy (push) Successful in 32s Details Two false positives surfaced after the Agents tab went live: 1. status (running/idle/paused) is runtime state, not config — drops in and out as agents pick up issues. Removed from _DRIFT_FIELDS. 2. desiredSkills compared raw, but local/* and company/* skills carry per-company hashes/scopes by design (sync_agents_across_companies.py filters local skills with a warning). Comparing them flags every master+mirror pair that has any local skill on master. Now compares only paperclipai/* skills (vendor-shipped, must match). UI shows an inline note explaining the filter. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 17:39:17 +00:00
Chaim	6f713042b5	feat(settings): add Agents tab — read-only Paperclip agent config view Task #29: surfaces all 14 agents (7 roles × 2 companies) in /settings as master+mirror pairs with drift detection. Replaces ad-hoc psql + script inspection with a single dashboard. Backend: GET /api/admin/paperclip-agents — fetches via Paperclip API (not direct DB), groups by name, computes drift across model/effort/ timeoutSec/maxTurnsPerRun/skills/runtime_config.heartbeat/budget/status. Frontend: new AgentsTab card-per-pair with side-by-side compare, drift highlighting, expandable details (skills list + instructions path). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 17:23:48 +00:00
Chaim	d0994704cf	feat(agents): mirror Paperclip interactions in case page All checks were successful Build & Deploy / build-and-deploy (push) Successful in 47s Details Surface issue_thread_interactions (ask_user_questions / request_confirmation / suggest_tasks) directly inside legal-ai's case detail feed so the user can answer agent prompts without switching to Paperclip's UI. Backend (FastAPI): - paperclip_client.py: 4 new helpers — get_issue_interactions (DB), respond_to_interaction / accept_interaction / reject_interaction (REST). - app.py: extends GET /api/cases/{case_number}/agents to include `interactions`, and adds POST /api/cases/{case_number}/agents/interaction-response routing to /respond, /accept, /reject in Paperclip. - paperclip_client.py: also pulls existing httpx calls onto the centralized pc_request helper (paperclip_api.py) for consistent auth + run-id headers. Frontend (web-ui, Next.js 16 + TanStack Query): - agents.ts: Interaction / InteractionPayload / InteractionStatus types, useSubmitInteraction mutation hook (invalidates the activity query). - agent-activity-feed.tsx: InteractionCard renders radio (single) / checkbox (multi) for ask_user_questions, accept/reject + reason for request_confirmation, task selection for suggest_tasks. Resolved interactions show a read-only summary. Cards are interleaved with comments by created_at, so the feed reads chronologically. Paperclip auto-wakes the issue assignee on a successful response (queueResolvedInteractionContinuationWakeup) — no explicit wakeup needed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 16:40:45 +00:00
Chaim	e90faa9ba4	feat(settings): add Blocks tab — 12-block decision schema reference All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m35s Details Read-only display of BLOCK_CONFIG from block_writer.py with CREAC role and JWM functional-purpose annotations per block (sourced from docs/block-schema.md). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-04 07:58:04 +00:00
Chaim	d1e12619d4	refactor(settings): pivot to Coolify env API as source of truth Investigation showed legal-ai container has no INFISICAL_TOKEN and there is no /legal-ai folder in Infisical — all env vars are stored in Coolify and injected into os.environ at container start. - Replace _read_infisical_values with _read_coolify_envs - New: _coolify_authoritative_value picks among Coolify duplicates - PATCH writes via Coolify API (upsert by key) - Drift = Coolify-stored vs container-runtime (common: Coolify edited without redeploy) - Response field renamed: infisical_value → coolify_value - New 'has_duplicates' flag per row when Coolify has multiple entries Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-04 07:50:02 +00:00
Chaim	394b971856	feat(settings): add MCP registrations endpoint + Coolify volume runbook Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-04 06:38:47 +00:00
Chaim	272e49b6b0	feat(settings): add MCP tools introspection endpoint	2026-05-04 06:34:19 +00:00
Chaim	69bdf7b30a	fix(settings): harden PATCH/redeploy per code review - Add infisicalsdk dependency - Narrow update→create fallback to NotFound errors only (no silent swallow) - Truncate Coolify error response text to 200 chars - Add 60s cooldown to redeploy endpoint - Move httpx to top-level import	2026-05-04 06:33:01 +00:00
Chaim	2fe73fcce1	feat(settings): add PATCH env + Coolify redeploy endpoints Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-04 06:26:00 +00:00
Chaim	c30c987ec2	fix(settings): suppress false drift when Infisical unreachable - Add infisical_available flag to _build_env_var_row - Stabilize error code (no exception text in API response) - Document raw-comparison safety inline	2026-05-04 06:24:26 +00:00
Chaim	562eae010a	feat(settings): add GET /api/settings/mcp/env endpoint Adds four helper functions (_infisical_client, _infisical_ctx, _read_infisical_values, _build_env_var_row) and the /api/settings/mcp/env endpoint that compares Infisical vs container env vars and reports drift. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-04 06:19:04 +00:00
Chaim	81ccf3a888	feat(retrieval): track page_number on text chunks for multimodal hybrid boost All checks were successful Build & Deploy / build-and-deploy (push) Successful in 6m33s Details The legacy chunker did not track which PDF page each chunk came from. Stored chunks had page_number=NULL, which blocked the multimodal hybrid retriever's text+image boost — it joins (chunk, image) on (document_id, page_number) and the join could never fire. This change: - extractor.extract_text now returns (text, page_count, page_offsets); page_offsets[i] is the start char offset of page (i+1) in the joined text. None for non-PDFs. - chunker.chunk_document accepts an optional page_offsets and tags each chunk with the page that contains its first character (uses the existing chunker logic; pages assigned post-hoc by content search to keep the diff minimal). - processor.process_document and precedent_library.ingest_precedent forward page_offsets through the chunker. New uploads now carry accurate page_number on every chunk. - Other extract_text callers (tools/documents, tools/workflow, web/app.py) updated to unpack the third element (ignored). - scripts/backfill_chunk_pages.py: per-case retrofit. Re-extracts each PDF (re-OCRs via Google Vision if needed, ~$0.0015/page), computes page_offsets, and updates page_number on every chunk by content search. Idempotent; --force re-runs on already-tagged docs. Forward-only would leave the 419 image embeddings backfilled on cases 8174-24 + 8137-24 unable to boost their corresponding text chunks. The retrofit script closes that gap (cost ~$0.60). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 19:49:41 +00:00
Chaim	f722fa45bd	feat(search): add header global search (Phase A) — cases + precedents + docs All checks were successful Build & Deploy / build-and-deploy (push) Successful in 41s Details Adds an always-visible debounced search input in the AppShell header that fans out to three independent sources in parallel and renders per-source result groups with their own loading/empty/error states: - /api/search/cases (NEW): SQL ILIKE on case_number, address, parties, title, subject. Returns small projections, no embeddings needed. - /api/precedent-library/search (existing): semantic over case-law halachot + passages. - /api/search (existing): semantic over case documents + past decisions. Cmd/Ctrl+K focuses the input; Esc and click-outside close the panel. This is Phase A of the header redesign — the bar layout itself is unchanged; row grouping + dynamic context follow in Phase B. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 18:05:51 +00:00
Chaim	923903217c	feat(precedents): auto-trigger Claude extraction via Paperclip wakeup All checks were successful Build & Deploy / build-and-deploy (push) Successful in 7s Details When a precedent is uploaded to the library, the FastAPI container now fires a Paperclip wakeup so Claude (running locally as the CEO agent) picks up the new row and runs `precedent_process_pending` for both metadata and halacha extraction. The user no longer has to remember to trigger it manually. Mechanics: - New `wake_for_precedent_extraction()` in paperclip_client.py creates (or reuses) a per-company "ספריית פסיקה — תור חילוץ" project, opens a fresh issue assigned to the company CEO with the case_law_id + citation in the description, and pings the Board API wakeup endpoint with `triggerDetail=precedent_library_upload`. - ingest_precedent's _run() in app.py captures the returned case_law_id and best-effort calls the wake function (failures are logged, not surfaced — the upload itself stays clean). - legal-ceo.md adds the precedent_process_pending tool family and a new "חילוץ פסיקה אוטומטי" section that tells the CEO to short-circuit past the heartbeat scan when woken with this trigger. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 16:49:25 +00:00
Chaim	4a9a6b7970	feat(precedents): UI button queues extraction for local MCP worker All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m27s Details The chair wanted a one-click "extract metadata" button on the edit sheet. The constraint stays the same — claude_session needs the local CLI which the container doesn't have, so the button can't run the extractor itself. Compromise: button stamps a queue marker; the local MCP server drains the queue on demand. DB (V8): two nullable timestamps on case_law, metadata_extraction_requested_at and halacha_extraction_requested_at, with partial indexes for cheap "find pending" scans. API: POST /api/precedent-library/{id}/request-metadata → stamp the row POST /api/precedent-library/{id}/request-halachot → same for halacha GET /api/precedent-library/queue/pending?kind=... → read-only view UI: Sparkles button in the edit sheet header. Click → toast tells the chair what to run from Claude Code. The button never triggers the extractor directly from the container. MCP tool: precedent_process_pending(kind, limit) — runs from Claude Code with the local CLI, picks up everything stamped, calls the extractor for each, clears the timestamp on success. Failures keep the timestamp so the next invocation retries them. Architectural rule (claude_session local-only) is preserved end-to-end and called out in the new endpoint comment + tool docstring. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 12:32:25 +00:00
Chaim	2cfdf35191	refactor(precedents): keep all LLM calls on the local-MCP path All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m28s Details Architectural correction: every claude_session caller in this project runs through the local MCP server (~/.claude.json points at /home/chaim/legal-ai/mcp-server/.venv/bin/python). The Coolify container has no `claude` CLI and no claude.ai session, so any LLM call originating from web/ FastAPI fails with "Claude CLI not found" — which is exactly what we hit on 403-17. The earlier Anthropic SDK fallback would have made it work, but at direct API cost. The chair's preference is to stay on the claude.ai session for everything. So: - claude_session.py: removed the SDK fallback, restored CLI-only. The error message now points the next person at the architectural rule in the module docstring instead of papering over it. - precedent_library.py:ingest_precedent (called from FastAPI on upload) now does only the non-LLM half: extract → chunk → embed → store. Sets halacha_extraction_status='pending' for the chair to act on. - reextract_halachot / reextract_metadata kept, but lazy-import their extractors so the FastAPI path can't accidentally pull them in. They are reachable only via the MCP tools precedent_extract_halachot / precedent_extract_metadata, which run locally with CLI. - Removed POST /api/precedent-library/{id}/extract-halachot and /extract-metadata — they were dead ends from the container. - Dropped the `anthropic` Python dep that the SDK fallback required. - UI: removed the "refresh halachot" and "sparkles metadata" buttons that called those endpoints. Edit sheet now points the chair at the MCP tool names instead. 
Halacha and metadata extraction for an uploaded precedent now happen when the chair (via Claude Code) runs: mcp__legal-ai__precedent_extract_metadata <case_law_id> mcp__legal-ai__precedent_extract_halachot <case_law_id> Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 11:06:08 +00:00
Chaim	73a79ea7e8	feat(precedents): metadata auto-fill, edit sheet, persuasive extraction All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m28s Details Three improvements to the precedent library based on usage feedback: 1. Auto-fill metadata at upload time. New service precedent_metadata_extractor reads the ruling's full_text and suggests case_name (short), summary, headnote, key_quote, subject_tags, appeal_subtype. The merge policy fills only empty fields, preserving everything the chair typed in the upload form. Wired into the ingest pipeline; also exposed as a re-run endpoint POST /api/precedent-library/{id}/extract-metadata for existing records. 2. Edit sheet in the UI. Pencil icon on each library row opens a pre-populated form covering every field. A Sparkles button on the sheet runs the metadata extractor on demand and refreshes the form. The case_number is read-only because halachot are FK'd to it; renaming requires delete + re-upload. 3. Halacha extractor branches on is_binding. Sources marked binding (Supreme/Administrative) keep the strict halacha prompt. Non-binding sources (other appeals committees, district courts on planning matters) get a different prompt that extracts applications, interpretive principles, and persuasive conclusions — labeled with new rule_types 'application' and 'persuasive'. The fallback also widens chunk selection: if the chunker labeled nothing as legal_analysis/ruling/conclusion, we now run on all chunks rather than returning zero halachot for a usable ruling. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 10:19:35 +00:00
Chaim	7ee90dce31	feat: external precedent library with auto halacha extraction All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m27s Details Adds a third corpus of legal authority distinct from style_corpus (Daphna's prior decisions for voice) and case_precedents (chair-attached quotes per case). The new corpus holds chair-uploaded court rulings and other appeals committee decisions, with binding rules (הלכות) extracted automatically and queued for chair approval. Pipeline (web/app.py + services/precedent_library.py): file → extract → chunk → Voyage embed → halacha_extractor → store + publish progress over the existing Redis SSE channel. Schema V7 (services/db.py): extends case_law with source_kind + extraction status fields under a CHECK constraint pinning practice_area to the three appeals committee domains (rishuy_uvniya, betterment_levy, compensation_197). New precedent_chunks (vector(1024)) and halachot tables (vector(1024) over rule_statement, IVFFlat indexes, gin on practice_areas/subject_tags). Halachot start as pending_review; only approved/published rows are visible to search_precedent_library. Agents: legal-writer, legal-researcher, legal-analyst, legal-ceo, legal-qa get search_precedent_library. legal-writer prompt explains the three-corpus distinction and CREAC use; legal-qa now verifies that every cited halacha resolves to an approved row in the corpus. UI: /precedents page with four tabs — library / semantic search / pending review (J/K nav, A/R/E shortcuts, badge count) / stats. Reuses the existing upload-sheet progress + SSE pattern. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 08:38:18 +00:00
Chaim	e849285806	home: split cases table by appeal type + add appeal-type chart All checks were successful Build & Deploy / build-and-deploy (push) Successful in 32s Details Backend (cases listing) - /api/cases: also return updated_at, created_at, practice_area, appeal_subtype, subject. The detail-mode response was previously dropping these even though db.list_cases reads them, leaving the UI's "תחום" and "עודכן" columns blank. Frontend - Split the home table into two: רישוי (1xxx) and היטל השבחה ופיצויים (8xxx + 9xxx), bucketing on appeal_subtype with a case-number-prefix fallback. The "תחום" column is now redundant and removed. - New AppealTypeBars chart in the right rail next to the existing status donut. - Donut: switch to a vertical layout (donut on top, legend below in a 3-col grid) so labels like "חדש / בעיבוד" no longer wrap inside the 320px sidebar; counts now align in a tabular column. - CasesTable accepts emptyText/searchPlaceholder so each split table has its own copy. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 15:44:41 +00:00
Chaim	f7249b7807	admin/skills: fail loud on DB error + read skills dir from env All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m8s Details - Raise HTTPException(503) when Paperclip DB is unreachable instead of silently falling through to disk-only mode and returning []. - Honor PAPERCLIP_SKILLS_DIR env var (falls back to ~/.paperclip/...). In the Coolify container the host's skills dir is bind-mounted at /paperclip-skills; without this, Path.home() resolved to /root/ and the disk inventory was always empty. Both bugs together silently turned a Paperclip DB outage into "no skills installed" on the /skills page. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 15:24:39 +00:00
Chaim	f256eddbb1	git_sync: full case-dir backup to Gitea (sweep + explicit commits) All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m25s Details The case repo is the user's backup, so anything in the dir must end up on Gitea. Two layers: 1. Periodic sweep (every 30s) — git_sync.sweep_loop runs as a FastAPI background task. It scans every case dir, runs git status --porcelain on each, and commit_and_push's any dirty changes with an auto-built Hebrew message ("אוטו: טיוטות (2) · מסמכים"). Catches files written outside the API path: agent research artefacts, manual edits, etc. 2. Explicit commits at known write paths — DOCX export, interim draft, apply_user_edit, revise_draft, mark-final, analysis DOCX export. These give immediate feedback with descriptive messages instead of waiting up to 30s for the sweep. safe.directory injection added to _git_env so sweep + explicit commits work even when the running uid differs from the case-dir owner (host runs vs. uniform-root container). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 18:27:36 +00:00
Chaim	fa70944ed4	case-create: surface Gitea repo result + UI retry button All checks were successful Build & Deploy / build-and-deploy (push) Successful in 1m29s Details The auto-creation in case_create had two failure modes that combined to make repos silently missing: a stale GITEA_TOKEN returning 401, and the outer try/except in case_create that swallowed every exception with a bare pass. Result: cases like 8174-24 ended up with a local git repo and Paperclip project but no Gitea repo, with no signal anywhere. _setup_gitea_remote now returns {ok, url, error} and never raises; the result is attached to the case JSON and the FastAPI endpoint logs a warning when ok=false. The UI gets a "צור ריפו ב-Gitea" button on the case header that appears only when the repo or remote is missing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 18:12:05 +00:00
Chaim	9bdfb05350	Upload progress: Redis-backed store + flushed SSE + client fallback All checks were successful Build & Deploy / build-and-deploy (push) Successful in 3m24s Details The previous in-memory _progress dict + polling SSE handler had a 30s silent tail after completion. HTTP/2 framing in the proxy chain (Traefik) buffered the small chunks until the stream closed, so when a transient blip caused EventSource to reconnect, the server returned 404 and the UI stuck on the "מתחיל…" placeholder forever. Reproduced live: 445 bytes withheld 31s. Changes: • web/progress_store.py — ProgressStore wraps Redis with TTL (5m), atomic GETDEL, dict-like API. Best-effort: Redis errors are logged and swallowed so observability outages don't break uploads. • web/app.py — _progress is now Redis-backed; every set/get/active/pop is awaited. SSE handler emits a heartbeat each tick (forces HTTP/2 flush), drops the 30s post-completion sleep, and returns a terminal {"status":"unknown"} payload instead of 404 when the task is gone — so EventSource closes cleanly instead of reconnect-looping. New _SSE_HEADERS set X-Accel-Buffering: no. • web-ui useProgress(taskId, caseNumber) — 10s fallback that invalidates the case detail if no SSE message arrived; treats "unknown" as terminal and triggers a refetch from the source of truth. • upload-sheet wires caseNumber through and renders "unknown" as completed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 12:53:23 +00:00
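
The SSE behaviour described in commit 9bdfb05350 (a heartbeat on every tick, no sleep after completion, and a terminal {"status":"unknown"} payload in place of a 404) could look roughly like the sketch below. This is an illustration under assumptions, not the code in web/app.py: `progress_stream`, `sse_event`, `get_progress`, the `tick`/`max_ticks` parameters, and the "completed"/"failed" status names are all hypothetical; only the "unknown" status and the heartbeat idea come from the commit message.

```python
import asyncio
import json
from typing import AsyncIterator, Awaitable, Callable, Optional

# Hypothetical status names; only "unknown" is taken from the commit message.
TERMINAL_STATUSES = {"completed", "failed", "unknown"}


def sse_event(payload: dict) -> str:
    """Serialize one payload as a Server-Sent Events data frame."""
    return f"data: {json.dumps(payload, ensure_ascii=False)}\n\n"


async def progress_stream(
    task_id: str,
    get_progress: Callable[[str], Awaitable[Optional[dict]]],
    tick: float = 1.0,
    max_ticks: Optional[int] = None,
) -> AsyncIterator[str]:
    """Yield SSE frames for a task until it reaches a terminal state.

    get_progress stands in for the Redis-backed store lookup; it returns
    None when the task key is gone (expired or already popped).
    """
    ticks = 0
    while max_ticks is None or ticks < max_ticks:
        state = await get_progress(task_id)
        if state is None:
            # Task is gone: send a terminal payload so EventSource closes
            # cleanly instead of reconnecting into a 404.
            yield sse_event({"status": "unknown"})
            return
        yield sse_event(state)
        if state.get("status") in TERMINAL_STATUSES:
            # No post-completion sleep: end the stream immediately.
            return
        # An SSE comment line is ignored by clients but still produces
        # bytes on the wire, which nudges buffering proxies to flush.
        yield ": heartbeat\n\n"
        await asyncio.sleep(tick)
        ticks += 1
```

In a FastAPI app a generator like this would typically be wrapped in a streaming response carrying the no-buffering headers the commit mentions; on the client, "unknown" is treated as terminal and triggers a refetch of the case detail.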

99 Commits