legal-ai

Author	SHA1	Message	Date
Chaim	e6778d26e5	docs(#126): document Hermes as a runtime + inert self-learning; clean up persona → curator. Follow-up to #123. Finding: we run Nous's Hermes CLI as the runtime for deepseek_local (harness only), and self-learning is switched on but inert (state.db = transcripts only, no memories/user_profile/skills; dormant since 5-6.2026). - new doc: docs/research/hermes-runtime-and-self-learning-state.md (forensic investigation + playbook for enabling full Hermes in the future + governance gates INV-LRN1/LRN5/G12) - cross-link from #123 feasibility - clean up the "Hermes" persona in hermes-curator.md (Hermes = runtime CLI, identity = knowledge curator) Host changes (not in the repo, documented): self-learning disabled in curator-{cmp,cmpa}/config.yaml + persona in SOUL.md → knowledge curator (.bak backup). HERMES_HOME/HERMES_CLI retained (runtime). Invariants: INV-LRN1/LRN5 (alignment: ungated self-learning turned off), G12 (Hermes = runtime behind a Port, not a parallel platform), G2. Doc + config, no code change. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-11 17:54:16 +00:00
Chaim	a4e006ab50	feat(agents): deepseek_local loads its prompt from a file, unifying the source of truth for the Hermes prompt (G2). All system agents load their system prompt from a file under .claude/agents/ via instructionsFilePath (claude_local + gemini_local), except Hermes/curator on deepseek_local, which supported only an inline promptTemplate in the DB: a parallel prompt path (a G2 violation), not versioned in git, and the two sources (DB ↔ hermes-curator.md) had already diverged in content. What changed: - adapters/deepseek-paperclip-adapter: buildPrompt reads instructionsFilePath if set (resolveTemplate; precedence file > promptTemplate > DEFAULT). The file goes through renderTemplate, so {{wakeReason}}/{{#taskId}}/… keep working. Fails loudly if the file is configured but unreadable, with no silent fallback (engineering rule §6, feedback_silent_swallow). - hermes-curator.md changes from a documentation file into the actual source of truth: a current-from-both merge of the operational runbook from the DB (PIPELINE-WAKE/X16 + §A/§B + interactions) with the anti-hallucination gate (INV-AH) and spec reading (INV-AG1), which existed only in the md and never reached the Hermes runtime. The manual ingest_final_version/lessons step was dropped: the pipeline (X16) already runs it durably, so a manual run was a duplicate. Remaining operational steps (not in git): update the 2 deepseek_local records in the Paperclip DB (instructionsFilePath=.../hermes-curator.md, clear promptTemplate) + git pull in the main tree + pm2 restart paperclip + sync-agents. Invariants: satisfies G2 (removes the parallel prompt path), G12/X15 (platform contact only in the declared shell, the adapter), INV-AH + INV-AG1 (finally reach Hermes), engineering rule §6 (fail loudly). No runtime behavior change other than adding the AH gate (explicit intent, chair approval). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-11 12:16:33 +00:00
Chaim	d156bcfaf1	feat(agents): shared source-grounded anti-hallucination gate (INV-AH) for all agents. Extracts the hallucination-prevention discipline into one canonical block (docs/anti-hallucination-gate.md) and applies it uniformly to all agents, instead of each agent reinventing it ad hoc (G2: no parallel paths). 5 techniques, each anchored in a professional source: - AH-1 source grounding (zero citation from memory): Stanford RegLab/Magesh JELS 2025 (legal RAG tools hallucinate 17-33%) - AH-2 quote-or-retract + AH-3 abstention: Anthropic Reduce-hallucinations - AH-4 certainty tagging: NIST AI RMF GenAI Profile + RAGAS - AH-5 Chain-of-Verification: Dhuliawala et al. arXiv:2309.11495 DRY distribution: a reference in HEARTBEAT.md (read by all Paperclip agents) + a uniform line in the 'read before acting' block of all 8 agents, with a per-role implementation note (writer = read-only, qa = enforcer, proofreader = do not correct toward a legal term, exporter = zero new substance). Additionally: legal-ceo.md gains knowledge of the 'devil's advocate (Gemini)' with a purely on-demand policy: not in the pipeline, invoked only at Chaim's/Daphna's request, output = leads for the chair (not for the writer; human-in-the-loop). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-09 17:05:33 +00:00
Chaim	0d995483ce	feat(style-acq T4+T5): draft↔final reconciliation ledger + automatic distillation via the curator. Closes the learning loop (INV-LRN4): every decision is closed against its final, and every final is analyzed against its draft. Feeds the tables that T15 already reads from. T5, reconciliation ledger: - SCHEMA_V26: draft_final_pairs table (draft snapshot + final + diff + analysis + status). - db: create/update/list_draft_final_pairs. - mark-final (app.py): captures a snapshot of the draft (decision_blocks) at the moment of sign-off, before it can be overwritten, and opens a ledger row (status=final_received). T4, automatic distillation: - learning_loop.process_final_version: uses the snapshot (not blocks that may have changed), classifies style_method↔substance, stores a proposal on the pair (status=analyzed). Removed the auto-upsert of style_patterns, eliminating the bug that overrode the chair gate and contaminated style with substance (INV-LRN1 + INV-LRN5). - LESSONS_PROMPT: explicit style_method↔substance separation + abstract lessons only. - curator wake + hermes-curator.md: runs ingest_final_version first; proposes only style_method patterns not yet documented; substance → precedent track. INV-LRN1 (chair gate, no auto-commit) · INV-LRN4 (ground-truth contrast) · INV-LRN5 (purity). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 17:20:57 +00:00
Chaim	a02a606f34	feat(agents): wire spec into agents, INV-AG1 read-before-act gate (FU-8b/GAP-23). Wires the system spec into the Paperclip agents so that every agent must read 00-constitution first, then the domain spec relevant to its role (per the X4 §2 table), before any substantive work. - HEARTBEAT.md: top section "Spec reading: first the constitution (00), then the domain spec" before §0–§8, with a role→spec table for the 8 agents. - 8 agent files (ceo/proofreader/researcher/analyst/writer/qa/exporter/hermes): a "Read before acting (INV-AG1)" section at the top of the body. - X4-agents.md: INV-AG1 "enforcement" field → "wired (procedural)"; §5 → "done". Enforcement is deliberately procedural: this is a project-operational invariant, and no code gate forces the read. Prerequisite for the process agents (sub-project 5). gap-audit kept as a snapshot (as with FU-8a). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-31 16:02:04 +00:00
Chaim	bb0cd7c6a2	feat(training): Style Studio: upload, rich corpus, lessons, curator portrait, chat. Six-phase upgrade of /training from a read-only dashboard into a full Style Studio for managing Daphna's style corpus. - Upload Sheet on /training: file → proofread preview → commit (no more CLI-only `upload-training` skill). - Rich corpus metadata: GET /api/training/corpus returns summary, outcome, key_principles, page_count, parties (regex), legal_citation, lessons_count. PATCH endpoint for chair edits. CorpusDetailDrawer with 4 tabs (details/content/lessons/patterns) replaces the bare table row. - LLM metadata enrichment: style_metadata_extractor + MCP tools (style_corpus_enrich, style_corpus_pending_enrichment) fill summary/outcome/key_principles via claude_session (free, host-side). - Per-decision lessons: new decision_lessons table + 4 REST endpoints + LessonsTab in drawer; hermes-curator now auto-posts findings as decision_lessons(source=curator). - Curator Portrait tab: prompt rendered with link to Gitea, recent curator findings, style_analyzer training prompts, propose-change form that writes proposals to data/curator-proposals/ for manual chair review (no auto-mutation of the agent file). - Style chat tab: SSE-streamed conversations with the style agent. New host-side pm2 service (legal-chat-service, port 8770) wraps claude CLI with stream-json + --resume continuation; FastAPI proxies via host.docker.internal. Zero API cost: uses chaim's claude.ai subscription. chat_conversations + chat_messages persist history. Architecture: keeps the existing rule that claude_session only runs on the host (not the container). The new legal-chat-service is the canonical bridge between the container and the local CLI for the chat feature; everything else (upload, metadata, lessons) stays within the container's existing capabilities. Audit script (scripts/audit_training_corpus.py) included for verifying which corpus rows still need enrichment. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 10:06:22 +00:00
Chaim	45341a0bc8	feat(curator): switch Hermes Curator to DeepSeek V4-Pro via deepseek_local adapter. A/B test (2026-05-05) showed DeepSeek V4-Pro is 2-3x faster and ~20x cheaper than Sonnet for style/lexicon pattern analysis, with comparable quality. Adds adapters/deepseek-paperclip-adapter/ package, documents adapter requirements (env injection, run-id headers), updates CLAUDE.md with adapter integration notes, and records lessons from appeal (ערר) 1200-25 (block order for 1xxx, the "להלן מתוך" ("the following is from") pattern, expanded factual background, bridge planning analysis, flat heading structure). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-10 05:58:52 +00:00
Chaim	ea29778197	docs(hermes-curator): document interaction-driven conversation support. The curator's promptTemplate (stored in DB) now teaches Hermes how to post issue_thread_interactions instead of free-text comments. Three patterns supported, curator picks per context: - ask_user_questions for filtering findings (multi-select) - request_confirmation for accept/reject of a single proposal - suggest_tasks for proposing follow-up issues Verified end-to-end on CMP-71: curator hit a real obstacle (couldn't read the final DOCX from its container) and chose request_confirmation on its own to ask the user how to proceed, which is exactly the conversational behavior we want. Paperclip auto-wakes the curator with $PAPERCLIP_APPROVAL_ID when the user responds. The new prompt has a §B branch that handles the second wake (read response → act → close). The UI side was already built in `d099470` (mirror Paperclip interactions in case page); now Hermes-side agents produce interactions too, not just claude_local agents. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 15:24:57 +00:00
Chaim	799b950961	feat(curator): trigger Knowledge Curator from api_mark_final, drop CEO F2. The previous F2 stage in legal-ceo.md fired after the first DOCX export, which was too early, since the user often iterates with עריכה-* (edit-*) uploads after the first export. The true "this is Daphna's chosen final" signal is the "סמן כסופי" ("Mark as final") button in the UI, which calls api_mark_final. This commit moves the curator wakeup from CEO's instructions to a direct hook in api_mark_final: - web/paperclip_client.py: add CURATOR_AGENTS dict (CMP + CMPA UUIDs) and wake_curator_for_final() helper. Looks up main case issue, creates a child issue assigned to the curator, tags plugin_state for case visibility, and triggers wakeup via Paperclip API. - web/app.py: api_mark_final now calls workflow_tools.ingest_final_version (so case_law table finally gets populated for search_decisions) and pc_wake_curator_for_final. Both are best-effort: failure does not block marking final. - legal-ceo.md: remove F2 stage, leave only the agents-table reference noting the curator runs from api_mark_final. - hermes-curator.md: update activation description to reflect the new flow. Result: curator runs only when chaim deliberately clicks "סמן כסופי" ("Mark as final"), on the actual final file, with no risk of analyzing a draft that will later change. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 14:47:03 +00:00
Chaim	77e5996497	feat(agents): wire Hermes Knowledge Curator to CEO post-export (CMP + CMPA). Adds new sub-agent "מנהל ידע" ("knowledge manager"; hermes_local adapter) that runs after each successful export to analyze the final decision and suggest updates to skills/decision/SKILL.md and lessons. Read-only on case data, write only on a single comment per run. - legal-ceo.md: new stage F2 after F (export). Looks up curator by name in current company, creates async sub-issue, no waiting. Falls back to silent skip if no curator configured. - legal-ceo.md: agents table updated with both curator UUIDs (CMP + CMPA). - hermes-curator.md: role instructions documenting CMP/CMPA split and what the curator does/does not do. Stage 1 POC. End-to-end validated on CMP-68 (case 1130-25) with two substantive findings on style patterns. CMPA agent created with separate ~/.hermes/profiles/curator-cmpa profile (own MEMORY.md focused on betterment levy / compensation). Known gaps to follow up: curator does not auto-close its issue, does not auto-persist findings to MEMORY.md, comment attribution falls back to chaim's user (install-key); these are tracked separately and do not block validation. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 13:33:23 +00:00

10 Commits