chore: backup before code cleanup

Backup commit before executing remove-unused-code proposal.
This includes all pending changes and new features.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
egg
2025-12-11 11:55:39 +08:00
parent eff9b0bcd5
commit 940a406dce
58 changed files with 8226 additions and 175 deletions

View File

@@ -0,0 +1,14 @@
# Test Notes Add OCR Processing Presets
Status: Manual execution not run in this environment (Paddle models/GPU not available here). Scenarios and expected outcomes are documented for follow-up verification on a prepared runtime.
| Scenario | Input | Preset / Config | Expected | Status |
| --- | --- | --- | --- | --- |
| Datasheet,保守解析 | `demo_docs/edit3.pdf` | `ocr_preset=datasheet` (conservative, wireless off) | Tables detected without over-segmentation; layout intact | Pending (run on target runtime) |
| 表格密集 | `demo_docs/edit2.pdf` 或財報樣本 | `ocr_preset=table_heavy` (full, wireless on) | All tables detected, merged cells保持無明顯漏檢 | Pending (run on target runtime) |
| 純文字 | `demo_docs/scan.pdf` | `ocr_preset=text_heavy` (table disabled, charts/formula off) | 只輸出文字區塊;無表格/圖表元素 | Pending (run on target runtime) |
Suggested validation steps:
1) 透過前端選擇對應預設並啟動處理;或以 API 送出 `ocr_preset`/`ocr_config`
2) 確認結果 JSON/Markdown 與預期行為一致(表格數量、元素類型、是否過度拆分)。
3) 若需要調整,切換至 `custom` 並覆寫 `table_parsing_mode``enable_wireless_table``layout_threshold`,再重試。