OCR/tasks.md at 3876477bdacda16785ff311098237fcc79d88f23

egg cd3cbea49d chore: project cleanup and prepare for dual-track processing refactor

- Removed all test files and directories
- Deleted outdated documentation (will be rewritten)
- Cleaned up temporary files, logs, and uploads
- Archived 5 completed OpenSpec proposals
- Created new dual-track-document-processing proposal with complete OpenSpec structure
  - Dual-track architecture: OCR track (PaddleOCR) + Direct track (PyMuPDF)
  - UnifiedDocument model for consistent output
  - Support for structure-preserving translation
- Updated .gitignore to prevent future test/temp files

This is a major cleanup preparing for the complete refactoring of the document processing pipeline.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

5.8 KiB

Raw Blame History

Implementation Tasks

1. Backend - Fix Image Extraction and Saving (PREREQUISITE) ✅

2. Backend - Environment Setup ✅

3. Backend - PDF Generation Service ✅

4. Backend - PDF Download Endpoint Fix ✅

5. Backend - Integrate PDF Generation into OCR Flow (REQUIRED) ✅

6. Frontend - Install Dependencies ✅

7. Frontend - Create PDF Viewer Component ✅

8. Frontend - Results Page Integration ✅

9. Frontend - Task Detail Page Integration ✅

10. Testing ⚠️ (待實際 OCR 任務測試)

基本驗證 (已完成) ✅

功能測試 (需實際 OCR 任務)

5.8 KiB Raw Blame History Unescape Escape

Implementation Tasks

1. Backend - Fix Image Extraction and Saving (PREREQUISITE) ✅

2. Backend - Environment Setup ✅

3. Backend - PDF Generation Service ✅

4. Backend - PDF Download Endpoint Fix ✅

5. Backend - Integrate PDF Generation into OCR Flow (REQUIRED) ✅

6. Frontend - Install Dependencies ✅

7. Frontend - Create PDF Viewer Component ✅

8. Frontend - Results Page Integration ✅

9. Frontend - Task Detail Page Integration ✅

10. Testing ⚠️ (待實際 OCR 任務測試)

基本驗證 (已完成) ✅

功能測試 (需實際 OCR 任務)

5.8 KiB

Raw Blame History