OCR/tasks.md at 63ffa8f0e3a07f07059fd0c85b610016487245e8

egg cfe65158a3 feat: enable document orientation detection for scanned PDFs

- Enable PP-StructureV3's use_doc_orientation_classify feature
- Detect rotation angle from doc_preprocessor_res.angle
- Swap page dimensions (width <-> height) for 90°/270° rotations
- Output PDF now correctly displays landscape-scanned content

Also includes:
- Archive completed openspec proposals
- Add simplify-frontend-ocr-config proposal (pending)
- Code cleanup and frontend simplification

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

2.2 KiB

Raw Blame History

1. Algorithm Changes (gap_filling_service.py)

1.1 IoA Implementation

1.2 Dynamic Threshold Strategy

1.3 Boundary Shrinking

2. OCR Data Source Changes

2.1 Extract overall_ocr_res from PP-StructureV3

2.2 Update Processing Orchestrator

3. Configuration Updates

3.1 Add Settings (config.py)

4. Testing

4.1 Unit Tests

4.2 Integration Tests

5. Documentation

2.2 KiB Raw Blame History

1. Algorithm Changes (gap_filling_service.py)

1.1 IoA Implementation

1.2 Dynamic Threshold Strategy

1.3 Boundary Shrinking

2. OCR Data Source Changes

2.1 Extract overall_ocr_res from PP-StructureV3

2.2 Update Processing Orchestrator

3. Configuration Updates

3.1 Add Settings (config.py)

4. Testing

4.1 Unit Tests

4.2 Integration Tests

5. Documentation

2.2 KiB

Raw Blame History