feat: implement GPU acceleration support for OCR processing

實作 GPU 加速支援，自動偵測並啟用 CUDA GPU 加速 OCR 處理主要變更： 1. 環境設置增強 (setup_dev_env.sh) - 新增 GPU 和 CUDA 版本偵測功能 - 自動安裝對應的 PaddlePaddle GPU/CPU 版本 - CUDA 11.2+ 安裝 GPU 版本，否則安裝 CPU 版本 - 安裝後驗證 GPU 可用性並顯示設備資訊 2. 配置更新 - .env.local: 加入 GPU 配置選項 * FORCE_CPU_MODE: 強制 CPU 模式選項 * GPU_MEMORY_FRACTION: GPU 記憶體使用比例 * GPU_DEVICE_ID: GPU 裝置 ID - backend/app/core/config.py: 加入 GPU 配置欄位 3. OCR 服務 GPU 整合 (backend/app/services/ocr_service.py) - 新增 _detect_and_configure_gpu() 方法自動偵測 GPU - 新增 get_gpu_status() 方法回報 GPU 狀態和記憶體使用 - 修改 get_ocr_engine() 支援 GPU 參數和錯誤降級 - 修改 get_structure_engine() 支援 GPU 參數和錯誤降級 - 自動 GPU/CPU 切換，GPU 失敗時自動降級到 CPU 4. 健康檢查與監控 (backend/app/main.py) - /health endpoint 加入 GPU 狀態資訊 - 回報 GPU 可用性、裝置名稱、記憶體使用等資訊 5. 文檔更新 (README.md) - Features: 加入 GPU 加速功能說明 - Prerequisites: 加入 GPU 硬體要求（可選） - Quick Start: 更新自動化設置說明包含 GPU 偵測 - Configuration: 加入 GPU 配置選項和說明 - Notes: 加入 GPU 支援注意事項技術特性： - 自動偵測 NVIDIA GPU 和 CUDA 版本 - 支援 CUDA 11.2-12.x - GPU 初始化失敗時優雅降級到 CPU - GPU 記憶體分配控制防止 OOM - 即時 GPU 狀態監控和報告 - 完全向後相容 CPU-only 環境預期效能： - GPU 系統: 3-10x OCR 處理速度提升 - CPU 系統: 無影響，維持現有效能 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-14 07:42:13 +08:00
parent 6452797abe
commit 7536f43513
6 changed files with 361 additions and 32 deletions
--- a/backend/app/main.py
+++ b/backend/app/main.py
@@ -83,13 +83,51 @@ app.add_middleware(
 # Health check endpoint
@app.get("/health")
 async def health_check():
-    """Health check endpoint"""
-    return {
+    """Health check endpoint with GPU status"""
+    from app.services.ocr_service import OCRService
+
+    response = {
        "status": "healthy",
        "service": "Tool_OCR",
        "version": "0.1.0",
    }

+    # Add GPU status information
+    try:
+        # Create temporary OCRService instance to get GPU status
+        # In production, this should be a singleton service
+        ocr_service = OCRService()
+        gpu_status = ocr_service.get_gpu_status()
+
+        response["gpu"] = {
+            "available": gpu_status.get("gpu_available", False),
+            "enabled": gpu_status.get("gpu_enabled", False),
+            "device_name": gpu_status.get("device_name", "N/A"),
+            "device_count": gpu_status.get("device_count", 0),
+            "compute_capability": gpu_status.get("compute_capability", "N/A"),
+        }
+
+        # Add memory info if available
+        if gpu_status.get("memory_total_mb"):
+            response["gpu"]["memory"] = {
+                "total_mb": round(gpu_status.get("memory_total_mb", 0), 2),
+                "allocated_mb": round(gpu_status.get("memory_allocated_mb", 0), 2),
+                "utilization_percent": round(gpu_status.get("memory_utilization", 0), 2),
+            }
+
+        # Add reason if GPU is not available
+        if not gpu_status.get("gpu_available") and gpu_status.get("reason"):
+            response["gpu"]["reason"] = gpu_status.get("reason")
+
+    except Exception as e:
+        logger.warning(f"Failed to get GPU status: {e}")
+        response["gpu"] = {
+            "available": False,
+            "error": str(e),
+        }
+
+    return response
+

 # Root endpoint
@app.get("/")