Commit Graph

  • 3ccbdb8394 fix: multi-worker translation status and OCR fallback handling [main] egg 2025-12-14 16:36:36 +08:00
  • 1c37585be2 fix: resume translation polling when returning to task detail page egg 2025-12-14 16:14:52 +08:00
  • f01591e9eb fix: update browser tab title from 'frontend' to 'Tool_OCR' egg 2025-12-14 16:02:59 +08:00
  • ee49751c38 fix: add UTC timezone indicator to all datetime serialization egg 2025-12-14 15:48:17 +08:00
  • 7233e9cb7b fix: logging, warnings, and soft-delete consistency egg 2025-12-14 15:40:31 +08:00
  • f46402f6c9 refactor: simplify deployment - unified env and startup script egg 2025-12-14 15:16:26 +08:00
  • e255039419 chore: remove AI dev files from repo and clean up env config egg 2025-12-14 15:08:33 +08:00
  • 86a6633000 feat: consolidate env config and add deployment files egg 2025-12-14 15:02:16 +08:00
  • 858d93155f chore: archive completed proposals egg 2025-12-14 12:50:04 +08:00
  • bbd68a2162 feat: enable audit logging for authentication and task operations egg 2025-12-14 12:46:20 +08:00
  • 73112db055 feat: add storage cleanup mechanism with soft delete and auto scheduler egg 2025-12-14 12:41:01 +08:00
  • 81a0a3ab0f feat: complete i18n support for all frontend pages and components egg 2025-12-14 11:56:18 +08:00
  • 3876477bda feat: add multilingual font support for translated PDFs egg 2025-12-12 19:18:58 +08:00
  • efa7e4175c feat: optimize task file generation and add visualization download egg 2025-12-12 19:11:50 +08:00
  • 65abd51d60 feat: add translation billing stats and remove Export/Settings pages egg 2025-12-12 17:38:12 +08:00
  • d20751d56b feat: add batch processing for multiple file uploads egg 2025-12-12 17:05:16 +08:00
  • d5bc311757 feat: simplify login page UX and add i18n English support egg 2025-12-12 12:49:48 +08:00
  • 1f18010040 fix: OCR Track reflow PDF and translation with image text filtering egg 2025-12-12 11:02:35 +08:00
  • 24253ac15e feat: unify Direct Track PDF rendering and simplify export options egg 2025-12-12 07:50:43 +08:00
  • 53bfa88773 docs: archive simplify-frontend-ocr-config proposal egg 2025-12-11 17:17:07 +08:00
  • 63ffa8f0e3 docs: archive enable-doc-orientation-detection proposal egg 2025-12-11 17:15:05 +08:00
  • cfe65158a3 feat: enable document orientation detection for scanned PDFs egg 2025-12-11 17:13:46 +08:00
  • 57070af307 docs: mark remove-unused-code tasks as completed egg 2025-12-11 12:03:41 +08:00
  • 5d962ca97c refactor: remove unused code and migrate legacy API egg 2025-12-11 12:03:09 +08:00
  • 940a406dce chore: backup before code cleanup egg 2025-12-11 11:55:39 +08:00
  • eff9b0bcd5 feat: refactor dual-track architecture (Phase 1-5) egg 2025-12-07 07:18:27 +08:00
  • 8265be1741 test egg 2025-12-04 18:00:37 +08:00
  • 9437387ef1 fix: add IoU text coverage check and page boundary validation egg 2025-12-04 07:48:38 +08:00
  • 1c3c37bce0 test: add covering images to preprocessing test output egg 2025-12-04 07:43:19 +08:00
  • d6387adbd1 feat: add black/white covering image detection egg 2025-12-04 07:42:55 +08:00
  • 3903bcf77d fix: tighten covering detection thresholds to avoid false positives egg 2025-12-04 07:36:07 +08:00
  • bc66f72352 feat: extend covering detection to include black/redaction rectangles egg 2025-12-04 07:34:35 +08:00
  • 63b474f93a test: add preprocessing pipeline test script egg 2025-12-03 16:51:12 +08:00
  • 6a65c7617d feat: add PDF preprocessing pipeline for Direct track egg 2025-12-03 16:11:00 +08:00
  • 1b5c7f39a8 fix: improve PDF layout generation for Direct track egg 2025-12-03 14:55:00 +08:00
  • 08adf3d01d feat: add translated PDF format selection (layout/reflow) egg 2025-12-03 10:10:28 +08:00
  • 0dcea4a7e7 fix: use task.files relationship to get source file path egg 2025-12-02 18:12:22 +08:00
  • bed473cd30 fix: properly stop child processes and orphaned services egg 2025-12-02 18:01:24 +08:00
  • 7916c75768 fix: allow extra environment variables in pydantic-settings egg 2025-12-02 17:53:34 +08:00
  • c006905b6f refactor: centralize DIFY settings in config.py and cleanup env files egg 2025-12-02 17:50:47 +08:00
  • d7f7166a2d feat: unify environment scripts with start.sh egg 2025-12-02 12:48:52 +08:00
  • a07aad96b3 feat: add translated PDF export with layout preservation egg 2025-12-02 12:33:31 +08:00
  • 8d9b69ba93 feat: add document translation via DIFY AI API egg 2025-12-02 11:57:02 +08:00
  • 87dc97d951 fix: improve Office document processing with Direct track egg 2025-11-30 16:22:04 +08:00
  • 6806fff1d5 chore: archive extract-table-cell-boxes proposal egg 2025-11-30 14:22:29 +08:00
  • 6252be6c6f fix: correct scale factor calculation for rotated documents egg 2025-11-30 13:42:48 +08:00
  • f27b4d9710 fix: correct orientation detection to use OCR pixel coordinates egg 2025-11-30 13:37:03 +08:00
  • c65e4f98d4 fix: detect and handle rotated document content in PDF generation egg 2025-11-30 13:27:01 +08:00
  • 95ae1f1bdb feat: add table detection options and scan artifact removal egg 2025-11-30 13:21:50 +08:00
  • f5a2c8a750 feat: extract cell_box_list from table_res_list egg 2025-11-28 12:41:18 +08:00
  • 5ddccbf5a2 docs: update tasks.md with Phase 1-3 completion status egg 2025-11-28 12:20:59 +08:00
  • 715805b3b8 feat: implement table cell boxes extraction with SLANeXt egg 2025-11-28 12:20:32 +08:00
  • 801ee9c4b6 feat: create extract-table-cell-boxes proposal and archive old proposal egg 2025-11-28 12:15:06 +08:00
  • dda9621e17 feat: enhance layout preprocessing and unify image scaling proposal egg 2025-11-28 09:23:19 +08:00
  • 86bbea6fbf fix: improve OCR track table rendering with Paragraph wrapping egg 2025-11-28 09:22:07 +08:00
  • 2861f54838 fix: prevent preview infinite loop and add document type filtering egg 2025-11-27 17:31:05 +08:00
  • 894d18b432 feat: add real-time preprocessing preview with side-by-side comparison egg 2025-11-27 17:25:52 +08:00
  • 5982fff71c feat: add contrast/sharpen strength controls, disable binarization egg 2025-11-27 17:18:44 +08:00
  • f6d2957592 fix: pass preprocessing parameters from start_task to OCR service egg 2025-11-27 16:13:32 +08:00
  • 19cb80460f docs: update add-layout-preprocessing tasks with completion status egg 2025-11-27 15:22:52 +08:00
  • 01d56f84cd feat: add preprocessing UI components and integration egg 2025-11-27 15:21:58 +08:00
  • ea0dd7456c feat: implement layout preprocessing backend egg 2025-11-27 15:17:20 +08:00
  • 06a5973f2e proposal: add hybrid control mode with auto-detection and preview egg 2025-11-27 14:31:09 +08:00
  • c12ea0b9f6 proposal: add-layout-preprocessing for improved table detection egg 2025-11-27 14:24:23 +08:00
  • 5448a047ff chore: archive upgrade-ppstructure-models proposal egg 2025-11-27 14:22:33 +08:00
  • 6235280c45 feat: upgrade PP-StructureV3 models to latest versions egg 2025-11-27 14:21:24 +08:00
  • 59206a6ab8 feat: simplify layout model selection and archive proposals egg 2025-11-27 13:27:00 +08:00
  • c65df754cf wip: add TableData.from_dict() for OCR track table parsing (incomplete) egg 2025-11-26 19:16:51 +08:00
  • 6e050eb540 fix: OCR track table data format and image cropping egg 2025-11-26 18:48:15 +08:00
  • a227311b2d chore: archive enhance-memory-management proposal (75/80 tasks) egg 2025-11-26 16:10:45 +08:00
  • fa9b542b06 fix: improve OCR track multi-line text rendering and HTML table detection egg 2025-11-26 16:09:31 +08:00
  • 19bd5fd609 fix: enable text selection in Direct track PDF output egg 2025-11-26 14:49:40 +08:00
  • 5c561f4203 fix: handle LOGO element type in Direct track PDF generation egg 2025-11-26 12:32:55 +08:00
  • 79cffe6da0 fix: resolve Direct track PDF regression issues egg 2025-11-26 12:29:46 +08:00
  • 1afdb822c3 feat: implement hybrid image extraction and memory management egg 2025-11-26 10:56:22 +08:00
  • ba8ddf2b68 feat: create OpenSpec proposal for enhanced memory management egg 2025-11-25 15:21:32 +08:00
  • 2d0932face chore: remove .claude/settings.local.json from git tracking egg 2025-11-25 15:11:17 +08:00
  • a13cf27b52 chore: add demo_docs/ and .claude/settings.local.json to gitignore egg 2025-11-25 15:10:55 +08:00
  • 0999898358 fix: improve multi-page PDF dimension handling and coordinate transformation egg 2025-11-25 15:09:39 +08:00
  • 2312b4cd66 feat: add frontend-adjustable PP-StructureV3 parameters with comprehensive testing egg 2025-11-25 14:39:19 +08:00
  • a659e7ae00 fix: improve PP-StructureV3 structure preservation for complex diagrams egg 2025-11-25 08:53:37 +08:00
  • 4325d024a7 chore: cleanup test files and archive pdf-layout-restoration proposal egg 2025-11-24 19:43:05 +08:00
  • 3358d97624 fix: resolve Direct track PDF table rendering overlap with canvas scaling egg 2025-11-24 19:39:12 +08:00
  • 108784a270 fix: resolve table/image overlap and missing images in Direct track PDF generation egg 2025-11-24 16:31:28 +08:00
  • 8333182879 fix: correct Y-axis positioning and implement span-based rendering egg 2025-11-24 14:57:27 +08:00
  • 6d4df26223 feat: add multi-column layout support for PDF extraction and generation egg 2025-11-24 14:25:53 +08:00
  • 75c194fe2a feat: implement Task 7 span-level rendering for inline styling egg 2025-11-24 11:44:05 +08:00
  • b1de7616e4 fix: implement actual list item spacing with Y offset adjustment egg 2025-11-24 11:35:58 +08:00
  • 1ac8e82f47 feat: complete Task 6 list formatting with fallback detection and spacing egg 2025-11-24 11:17:28 +08:00
  • 1ec186f680 fix: properly implement list formatting with sequential numbering and grouping egg 2025-11-24 09:59:00 +08:00
  • ad879d48e5 feat: implement Phase 3 list formatting for Direct track egg 2025-11-24 09:54:15 +08:00
  • e1e97c54cf fix: correct Phase 3 implementation and remove invalid OCR track alignment egg 2025-11-24 08:58:55 +08:00
  • 8ba61f51b3 feat: add OCR track alignment support and spacing_after analysis egg 2025-11-24 08:35:01 +08:00
  • 93bd9f5fee refine: add OCR track line break support and spacing_after handling egg 2025-11-24 08:12:32 +08:00
  • 77fe4ccb8b feat: implement Phase 3 enhanced text rendering with alignment and formatting egg 2025-11-24 08:05:48 +08:00
  • 09cf9149ce feat: implement proper track-specific PDF rendering egg 2025-11-24 07:53:17 +08:00
  • 3fc32bcdd7 feat: implement Phase 2 - Basic Style Preservation egg 2025-11-24 07:44:24 +08:00
  • 9621d6a242 fix: handle None image_path safely to prevent AttributeError egg 2025-11-24 07:36:14 +08:00
  • 2911ee16ea fix: properly complete task 2.1 - remove fake table image dependency egg 2025-11-24 07:31:43 +08:00
  • 0aff468c51 feat: implement Phase 1 of PDF layout restoration egg 2025-11-24 07:16:31 +08:00