Improve document processing pipeline and agent workflows
- Add delete_document_chunks for reprocessing, save extracted text to disk
- Expand case directory structure (original/extracted/proofread/backup)
- Update classifier patterns (תגובה, הודעת עמדה)
- Fix proofreader agent paths for new directory layout
- Update HEARTBEAT to notify on every task completion
- Improve bidi_table with LRE/PDF directional embedding
- Add Paperclip project verification and auto-close setup issue
- Add auto-sync-cases.sh for Gitea synchronization

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@@ -19,7 +19,7 @@ logger = logging.getLogger(__name__)
 _FILENAME_RULES: list[tuple[str, str, float]] = [
     # (regex pattern on filename, doc_type, confidence)
     (r"כתב.ערר|כתב-ערר", "appeal", 1.0),
-    (r"תשובה|תשובת|תגובת|השלמת.טיעון|בקשה.להשלמת", "response", 1.0),
+    (r"תשובה|תשובת|תגובה|תגובת|השלמת.טיעון|בקשה.להשלמת|הודעת.עמדה", "response", 1.0),
     (r"פרוטוקול", "protocol", 1.0),
     (r"החלטת?.ביניים|החלטה.לתיקון", "decision", 0.95),
     (r"הוראות.תכנית|תכנית", "plan", 1.0),
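
The hunk shows only the rule table, not the code that consumes it. As a minimal sketch, assuming the rules are tried in order with `re.search` and the first match wins, the updated table could be applied as follows. `classify_filename` is a hypothetical helper name for illustration, not a function taken from this commit.

```python
import re
from typing import Optional

# Rule table as it reads after this commit:
# (regex pattern on filename, doc_type, confidence)
_FILENAME_RULES: list[tuple[str, str, float]] = [
    (r"כתב.ערר|כתב-ערר", "appeal", 1.0),
    (r"תשובה|תשובת|תגובה|תגובת|השלמת.טיעון|בקשה.להשלמת|הודעת.עמדה", "response", 1.0),
    (r"פרוטוקול", "protocol", 1.0),
    (r"החלטת?.ביניים|החלטה.לתיקון", "decision", 0.95),
    (r"הוראות.תכנית|תכנית", "plan", 1.0),
]


def classify_filename(filename: str) -> Optional[tuple[str, float]]:
    """Hypothetical helper: return (doc_type, confidence) of the first matching rule.

    Assumption: rules are evaluated top to bottom and matching is a plain
    substring-style regex search on the filename.
    """
    for pattern, doc_type, confidence in _FILENAME_RULES:
        if re.search(pattern, filename):
            return doc_type, confidence
    return None  # no rule matched; the caller would need another classifier


if __name__ == "__main__":
    for name in ("תגובה לערר.pdf", "הודעת עמדה.pdf", "notes.txt"):
        print(name, classify_filename(name))
```

Under these assumptions, a filename such as `תגובה לערר.pdf` matches no alternative in the previous `response` pattern (which had `תגובת` but not `תגובה`) and is classified as `response` only with the updated one.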