OCR/tasks.md at 9437387ef1eb40319bf32b2dd92dc41d9d8e20df

egg 59206a6ab8 feat: simplify layout model selection and archive proposals

Changes:
- Replace PP-Structure 7-slider parameter UI with simple 3-option layout model selector
- Add layout model mapping: chinese (PP-DocLayout-S), default (PubLayNet), cdla
- Add LayoutModelSelector component and zh-TW translations
- Fix "default" model behavior with sentinel value for PubLayNet
- Add gap filling service for OCR track coverage improvement
- Add PP-Structure debug utilities
- Archive completed/incomplete proposals:
  - add-ocr-track-gap-filling (complete)
  - fix-ocr-track-table-rendering (incomplete)
  - simplify-ppstructure-model-selection (22/25 tasks)
- Add new layout model tests, archive old PP-Structure param tests
- Update OpenSpec ocr-processing spec with layout model requirements

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

Implementation Tasks

Phase 1: Core Fix - Table Content Conversion

1.1 Add TableData.from_dict() class method

1.2 Fix _json_to_document_element for TABLE elements

1.3 Verify TableData.to_html() generates correct HTML
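
As a rough illustration of tasks 1.1 and 1.3, the sketch below shows one way a TableData.from_dict() class method and a to_html() check could fit together. The field names (rows, cols, cells, row_span, col_span) and the TableCell helper are assumptions made for this example only; the project's actual TableData model may differ.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from html import escape
from typing import Any


@dataclass
class TableCell:
    # Hypothetical cell model; the real project schema is not shown in this file.
    row: int
    col: int
    content: str = ""
    row_span: int = 1
    col_span: int = 1


@dataclass
class TableData:
    # Hypothetical table model, assumed for illustration only.
    rows: int
    cols: int
    cells: list[TableCell] = field(default_factory=list)

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> "TableData":
        """Rebuild a TableData from a plain dict, e.g. one loaded from JSON."""
        cells = [
            TableCell(
                row=int(c["row"]),
                col=int(c["col"]),
                content=str(c.get("content", "")),
                row_span=int(c.get("row_span", 1)),
                col_span=int(c.get("col_span", 1)),
            )
            for c in data.get("cells", [])
        ]
        return cls(
            rows=int(data.get("rows", 0)),
            cols=int(data.get("cols", 0)),
            cells=cells,
        )

    def to_html(self) -> str:
        """Render cells as a minimal HTML table, ordered by row then column."""
        by_row: dict[int, list[TableCell]] = {}
        for cell in self.cells:
            by_row.setdefault(cell.row, []).append(cell)

        parts = ["<table>"]
        for r in sorted(by_row):
            parts.append("<tr>")
            for cell in sorted(by_row[r], key=lambda c: c.col):
                attrs = ""
                if cell.row_span > 1:
                    attrs += f' rowspan="{cell.row_span}"'
                if cell.col_span > 1:
                    attrs += f' colspan="{cell.col_span}"'
                # Escape cell text so OCR output cannot break the markup.
                parts.append(f"<td{attrs}>{escape(cell.content)}</td>")
            parts.append("</tr>")
        parts.append("</table>")
        return "".join(parts)
```

A conversion step such as _json_to_document_element could then call TableData.from_dict() on the table payload instead of passing the raw dict through, assuming the payload follows the shape sketched here.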

Phase 2: OCR Track Rendering Consistency

2.1 Review convert_unified_document_to_ocr_data

2.2 Review draw_table_region

Phase 3: Testing and Verification

3.1 Test OCR Track

3.2 Test Direct Track (Regression)

3.3 Test Hybrid Mode

Phase 4: Code Quality

4.1 Add logging

4.2 Error handling
