OCR/openspec/changes/archive/2025-12-10-add-ocr-processing-presets/test-notes.md

Test Notes: Add OCR Processing Presets

Status: Manual execution has not been run in this environment because the Paddle models and a GPU are not available here. The scenarios and expected outcomes below are documented for follow-up verification on a prepared runtime.

| Scenario | Input | Preset / Config | Expected | Status |
| --- | --- | --- | --- | --- |
| Datasheet, conservative parsing | demo_docs/edit3.pdf | ocr_preset=datasheet (conservative, wireless off) | Tables detected without over-segmentation; layout intact | Pending (run on target runtime) |
| Table-heavy | demo_docs/edit2.pdf or a financial report sample | ocr_preset=table_heavy (full, wireless on) | All tables detected; merged cells preserved with no obvious missed detections | Pending (run on target runtime) |
| Plain text | demo_docs/scan.pdf | ocr_preset=text_heavy (table disabled, charts/formula off) | Only text blocks are output; no table/chart elements | Pending (run on target runtime) |
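
The expected outcome for the text_heavy scenario can be checked mechanically once a result is available. The following is a minimal sketch, assuming the result JSON holds a top-level `elements` list whose items carry a `type` field such as `text`, `table`, or `chart`. That schema and both function names are hypothetical and should be adapted to the actual output format.

```python
from collections import Counter


def count_element_types(result: dict) -> Counter:
    """Count elements by type in a result dict (hypothetical schema)."""
    # Assumption: result looks like {"elements": [{"type": "text"}, ...]}.
    return Counter(el.get("type", "unknown") for el in result.get("elements", []))


def check_text_heavy(result: dict) -> list[str]:
    """Return a list of problems for the text_heavy expectation.

    An empty list means the result contains text blocks only.
    """
    counts = count_element_types(result)
    problems = []
    # text_heavy should produce no table or chart elements.
    for forbidden in ("table", "chart"):
        if counts[forbidden] > 0:
            problems.append(f"unexpected {forbidden} elements: {counts[forbidden]}")
    # A run with no text blocks at all is also suspicious.
    if counts["text"] == 0:
        problems.append("no text blocks found")
    return problems
```

Counting by type also gives the table count needed for the datasheet and table_heavy scenarios, where the number is compared against a manual count of the tables in the input PDF.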

Suggested validation steps:

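  1. Select the corresponding preset in the frontend and start processing, or submit ocr_preset/ocr_config through the API.
  2. Confirm that the resulting JSON/Markdown matches the expected behavior (table count, element types, whether tables are over-segmented).
  3. If adjustment is needed, switch to custom and override table_parsing_mode, enable_wireless_table, and layout_threshold, then retry.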