OCR/tasks.md at 09cf9149ce7aaf1406198d7c680583acdb3507a8

egg 09cf9149ce feat: implement proper track-specific PDF rendering

Implement independent Direct and OCR track rendering methods with
complete separation of concerns and proper line break handling.

**Architecture Changes**:
- Created _generate_direct_track_pdf() for rich formatting
- Created _generate_ocr_track_pdf() for backward compatible rendering
- Modified generate_from_unified_document() to route by track type
- No more shared rendering path that loses information

**Direct Track Features** (_generate_direct_track_pdf):
- Processes UnifiedDocument directly (no legacy conversion)
- Preserves all StyleInfo without information loss
- Handles line breaks (\n) in text content
- Layer-based rendering: images → tables → text
- Three specialized helper methods:
  - _draw_text_element_direct(): Multi-line text with styling
  - _draw_table_element_direct(): Direct bbox table rendering
  - _draw_image_element_direct(): Image positioning from bbox

**OCR Track Features** (_generate_ocr_track_pdf):
- Uses legacy OCR data conversion pipeline
- Routes to existing _generate_pdf_from_data()
- Maintains full backward compatibility
- Simplified rendering for OCR-detected layout

**Line Break Handling** (Direct Track):
- Split text on '\n' into multiple lines
- Calculate line height as font_size * 1.2
- Render each line with proper vertical spacing
- Font scaling per line if width exceeds bbox

**Implementation Details**:
Lines 535-569: Track detection and routing
Lines 571-670: _generate_direct_track_pdf() main method
Lines 672-717: _generate_ocr_track_pdf() main method
Lines 1497-1575: _draw_text_element_direct() with line breaks
Lines 1577-1656: _draw_table_element_direct()
Lines 1658-1714: _draw_image_element_direct()

**Corrected Task Status**:
- Task 4.2: NOW properly implements separate Direct track pipeline
- Task 4.3: NOW properly implements separate OCR track pipeline
- Both with distinct rendering logic as designed

**Breaking vs Previous Commit**:
Previous commit (3fc32bc) only added conditional styling in shared
draw_text_region(). This commit creates true track-specific pipelines
as per design.md requirements.

Direct track PDFs will now:
✅ Process without legacy conversion (no info loss)
✅ Render multi-line text properly (split on \n)
✅ Apply StyleInfo per element
✅ Use precise bbox positioning
✅ Render images and tables directly

OCR track PDFs will:
✅ Use existing proven pipeline
✅ Maintain backward compatibility
✅ No changes to current behavior

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

7.3 KiB

Raw Blame History

Implementation Tasks: PDF Layout Restoration

Phase 1: Critical Fixes (P0 - Immediate)

1. Fix Image Handling

2. Fix Table Rendering

Phase 2: Basic Style Preservation (P1 - Week 1)

3. Implement Style Application System

4. Track-Specific Rendering

Phase 3: Advanced Layout (P2 - Week 2)

5. Enhanced Text Rendering

6. List Formatting

7. Span-Level Rendering (Advanced)

Phase 4: Testing and Optimization (P2 - Week 3)

8. Comprehensive Testing

9. Performance Optimization

10. Documentation and Deployment

Success Criteria

Must Have (Phase 1)

Should Have (Phase 2)

Nice to Have (Phase 3-4)

Timeline

7.3 KiB Raw Blame History

Implementation Tasks: PDF Layout Restoration

Phase 1: Critical Fixes (P0 - Immediate)

1. Fix Image Handling

2. Fix Table Rendering

Phase 2: Basic Style Preservation (P1 - Week 1)

3. Implement Style Application System

4. Track-Specific Rendering

Phase 3: Advanced Layout (P2 - Week 2)

5. Enhanced Text Rendering

6. List Formatting

7. Span-Level Rendering (Advanced)

Phase 4: Testing and Optimization (P2 - Week 3)

8. Comprehensive Testing

9. Performance Optimization

10. Documentation and Deployment

Success Criteria

Must Have (Phase 1)

Should Have (Phase 2)

Nice to Have (Phase 3-4)

Timeline

7.3 KiB

Raw Blame History