feat: add unified JSON export with standardized schema

- Create JSON Schema definition for UnifiedDocument format
- Implement UnifiedDocumentExporter service with multiple export formats
- Include comprehensive processing metadata and statistics
- Update OCR service to use new exporter for dual-track outputs
- Support JSON, Markdown, Text, and legacy format exports

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
egg
2025-11-19 08:36:24 +08:00
parent 5bcf3dfd42
commit ab89a40e8d
5 changed files with 999 additions and 21 deletions

View File

@@ -59,10 +59,10 @@
- [x] 4.1.2 Route to appropriate processing engine
- [x] 4.1.3 Return UnifiedDocument from both tracks
- [x] 4.1.4 Maintain backward compatibility
- [ ] 4.2 Create unified JSON export
- [ ] 4.2.1 Define standardized JSON schema
- [ ] 4.2.2 Include processing metadata
- [ ] 4.2.3 Support both track outputs
- [x] 4.2 Create unified JSON export
- [x] 4.2.1 Define standardized JSON schema
- [x] 4.2.2 Include processing metadata
- [x] 4.2.3 Support both track outputs
- [ ] 4.3 Update PDF generator for UnifiedDocument
- [ ] 4.3.1 Adapt PDF generation to use UnifiedDocument
- [ ] 4.3.2 Preserve layout from both tracks