OCR/tasks.md at 3fc32bcdd716a02bef84d0d856a7c3bcb089fbc7

egg 3fc32bcdd7 feat: implement Phase 2 - Basic Style Preservation

Implement style application system and track-specific rendering for
PDF generation, enabling proper formatting preservation for Direct track.

**Font System** (Task 3.1):
- Added FONT_MAPPING with 20 common fonts → PDF standard fonts
- Implemented _map_font() with case-insensitive and partial matching
- Fallback to Helvetica for unknown fonts

**Style Application** (Task 3.2):
- Implemented _apply_text_style() to apply StyleInfo to canvas
- Supports both StyleInfo objects and dict formats
- Handles font family, size, color, and flags (bold/italic)
- Applies compound font variants (BoldOblique, BoldItalic)
- Graceful error handling with fallback to defaults

**Color Parsing** (Task 3.3):
- Implemented _parse_color() for multiple formats
- Supports hex colors (#RRGGBB, #RGB)
- Supports RGB tuples/lists (0-255 and 0-1 ranges)
- Automatic normalization to ReportLab's 0-1 range

**Track Detection** (Task 4.1):
- Added current_processing_track instance variable
- Detect processing_track from UnifiedDocument.metadata
- Support both object attribute and dict access
- Auto-reset after PDF generation

**Track-Specific Rendering** (Task 4.2, 4.3):
- Preserve StyleInfo in convert_unified_document_to_ocr_data
- Apply styles in draw_text_region for Direct track
- Simplified rendering for OCR track (unchanged behavior)
- Track detection: is_direct_track check

**Implementation Details**:
- Lines 97-125: Font mapping and style flag constants
- Lines 161-201: _parse_color() method
- Lines 203-236: _map_font() method
- Lines 238-326: _apply_text_style() method
- Lines 530-538: Track detection in generate_from_unified_document
- Lines 431-433: Style preservation in conversion
- Lines 1022-1037: Track-specific styling in draw_text_region

**Status**:
- Phase 2 Task 3: ✅ Completed (3.1, 3.2, 3.3)
- Phase 2 Task 4: ✅ Completed (4.1, 4.2, 4.3)
- Testing pending: 4.4 (requires backend)

Direct track PDFs will now preserve fonts, colors, and text styling
while maintaining backward compatibility with OCR track rendering.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

7.0 KiB

Raw Blame History

Implementation Tasks: PDF Layout Restoration

Phase 1: Critical Fixes (P0 - Immediate)

1. Fix Image Handling

2. Fix Table Rendering

Phase 2: Basic Style Preservation (P1 - Week 1)

3. Implement Style Application System

4. Track-Specific Rendering

Phase 3: Advanced Layout (P2 - Week 2)

5. Enhanced Text Rendering

6. List Formatting

7. Span-Level Rendering (Advanced)

Phase 4: Testing and Optimization (P2 - Week 3)

8. Comprehensive Testing

9. Performance Optimization

10. Documentation and Deployment

Success Criteria

Must Have (Phase 1)

Should Have (Phase 2)

Nice to Have (Phase 3-4)

Timeline

7.0 KiB Raw Blame History

Implementation Tasks: PDF Layout Restoration

Phase 1: Critical Fixes (P0 - Immediate)

1. Fix Image Handling

2. Fix Table Rendering

Phase 2: Basic Style Preservation (P1 - Week 1)

3. Implement Style Application System

4. Track-Specific Rendering

Phase 3: Advanced Layout (P2 - Week 2)

5. Enhanced Text Rendering

6. List Formatting

7. Span-Level Rendering (Advanced)

Phase 4: Testing and Optimization (P2 - Week 3)

8. Comprehensive Testing

9. Performance Optimization

10. Documentation and Deployment

Success Criteria

Must Have (Phase 1)

Should Have (Phase 2)

Nice to Have (Phase 3-4)

Timeline

7.0 KiB

Raw Blame History