OCR/openspec/changes/archive/2025-12-10-add-ocr-processing-presets/test-notes.md
egg 940a406dce chore: backup before code cleanup
Backup commit before executing remove-unused-code proposal.
This includes all pending changes and new features.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-11 11:55:39 +08:00


# Test Notes: Add OCR Processing Presets
Status: The scenarios below have not been executed manually in this environment because Paddle models and a GPU are not available here. Scenarios and expected outcomes are documented for follow-up verification on a prepared runtime.

| Scenario | Input | Preset / Config | Expected | Status |
| --- | --- | --- | --- | --- |
| Datasheet, conservative parsing | `demo_docs/edit3.pdf` | `ocr_preset=datasheet` (conservative, wireless off) | Tables detected without over-segmentation; layout intact | Pending (run on target runtime) |
| Table-heavy | `demo_docs/edit2.pdf` or a financial-report sample | `ocr_preset=table_heavy` (full, wireless on) | All tables detected; merged cells preserved, with no obvious missed detections | Pending (run on target runtime) |
| Plain text | `demo_docs/scan.pdf` | `ocr_preset=text_heavy` (table disabled, charts/formula off) | Only text blocks are output; no table or chart elements | Pending (run on target runtime) |

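
Once a prepared runtime is available, the "Expected" column above could be turned into automated checks. The following is a minimal sketch, not the project's actual test code. It assumes the result JSON contains an `elements` list whose items carry a `type` field with values such as `text`, `table`, and `chart`. That shape, the `check_result` helper, and the `expected_tables` parameter are all hypothetical and should be adapted to the real output format.

```python
from collections import Counter

# Element types each preset must / must not produce, taken from the
# "Expected" column. The type names ("text", "table", "chart") are assumed.
EXPECTATIONS = {
    "datasheet": {"require": {"table"}, "forbid": set()},
    "table_heavy": {"require": {"table"}, "forbid": set()},
    "text_heavy": {"require": {"text"}, "forbid": {"table", "chart"}},
}


def check_result(preset, result, expected_tables=None):
    """Return a list of problems; an empty list means the checks passed."""
    # Count elements by type; assumes result["elements"] is a list of dicts.
    counts = Counter(el.get("type") for el in result.get("elements", []))
    rules = EXPECTATIONS[preset]
    problems = []
    for kind in sorted(rules["require"]):
        if counts[kind] == 0:
            problems.append(f"missing '{kind}' elements")
    for kind in sorted(rules["forbid"]):
        if counts[kind] > 0:
            problems.append(f"unexpected '{kind}' elements: {counts[kind]}")
    # Optional: compare against a table count established by manual inspection.
    if expected_tables is not None and counts["table"] != expected_tables:
        hint = "over-segmented" if counts["table"] > expected_tables else "missed"
        problems.append(
            f"table count {counts['table']} != {expected_tables} ({hint})"
        )
    return problems
```

Passing `expected_tables` only makes sense after someone has counted the tables in the input PDF by hand. Without it, the sketch checks element types only and cannot detect over-segmentation.
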
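Suggested validation steps:
1) Select the corresponding preset in the frontend and start processing, or submit `ocr_preset`/`ocr_config` through the API.
2) Confirm that the result JSON/Markdown matches the expected behavior (table count, element types, whether tables are over-segmented).
3) If adjustment is needed, switch to `custom` and override `table_parsing_mode`, `enable_wireless_table`, and `layout_threshold`, then retry.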
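
For the API path in steps 1 and 3, a request body might be assembled as sketched below. This is an illustration under stated assumptions: the payload shape (a top-level `ocr_preset` with an optional nested `ocr_config`), the `build_payload` helper, and the example threshold value are hypothetical. Only the preset names and the three override keys come from these notes. The endpoint and transport are deliberately left out because they are not documented here.

```python
import json

# Preset names from the scenario table, plus the "custom" mode from step 3.
KNOWN_PRESETS = {"datasheet", "table_heavy", "text_heavy", "custom"}

# Override keys named in step 3; their accepted values are not documented here.
OVERRIDE_KEYS = {"table_parsing_mode", "enable_wireless_table", "layout_threshold"}


def build_payload(preset, ocr_config=None):
    """Build a request body dict (assumed shape) for a processing request."""
    if preset not in KNOWN_PRESETS:
        raise ValueError(f"unknown preset: {preset}")
    payload = {"ocr_preset": preset}
    if ocr_config:
        # Reject keys that the notes do not mention, to catch typos early.
        unknown = sorted(set(ocr_config) - OVERRIDE_KEYS)
        if unknown:
            raise ValueError(f"unknown ocr_config keys: {unknown}")
        payload["ocr_config"] = dict(ocr_config)
    return payload


if __name__ == "__main__":
    # Illustrative values only; 0.5 is not a recommended threshold.
    body = build_payload(
        "custom",
        {"enable_wireless_table": False, "layout_threshold": 0.5},
    )
    print(json.dumps(body, indent=2))
```

Validating the keys on the client side is a design choice of the sketch. The real API may accept more `ocr_config` fields than the three listed in step 3.
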