id: TASK-12
title: Optimize Docker image size - use CPU-only torch
status: Done
assignee:
created_date: 2026-02-15 17:17
updated_date: 2026-02-15 17:29
labels:
dependencies:
priority: low

Description

The voice-command Docker image is ~3.5GB because it installs the full torch build with CUDA/NVIDIA libs. Netcup has no GPU, so switch to the CPU-only torch wheel (pip install torch --index-url https://download.pytorch.org/whl/cpu) to cut ~2GB. Also consider whether pyannote.audio can use ONNX Runtime instead of torch for inference. The current memory limit is 4G.
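
One way to confirm that the CPU-only wheel actually landed in the image is a small check run inside the container. The sketch below is not part of the task's implementation. It assumes a Linux image, where wheels from the CPU index carry a "+cpu" local version tag and the default PyPI build pulls in nvidia-* runtime packages. The helper names (is_cpu_only_version, gpu_distributions) are hypothetical.

```python
from importlib import metadata


def is_cpu_only_version(version: str) -> bool:
    """Assumption: Linux CPU wheels are tagged like '2.5.1+cpu'."""
    return version.endswith("+cpu")


def gpu_distributions(names):
    """Return the package names that look like NVIDIA/CUDA runtime wheels."""
    return sorted(n for n in names if n.lower().startswith("nvidia-"))


def installed_distribution_names():
    """List the names of all distributions installed in this environment."""
    names = []
    for dist in metadata.distributions():
        name = dist.metadata["Name"]
        if name:  # skip distributions with broken or missing metadata
            names.append(name)
    return names


if __name__ == "__main__":
    try:
        torch_version = metadata.version("torch")
    except metadata.PackageNotFoundError:
        torch_version = None

    if torch_version is None:
        print("torch is not installed")
    else:
        print("torch version:", torch_version)
        print("CPU-only build:", is_cpu_only_version(torch_version))

    leftovers = gpu_distributions(installed_distribution_names())
    print("NVIDIA packages still installed:", leftovers or "none")
```

If the script reports leftover nvidia-* packages, another dependency is probably pulling in the default torch build, and the install order or index settings in the Dockerfile are worth checking.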

Implementation Notes

CPU-only torch optimization deployed to Netcup. Image size: 4.19GB (still large due to pyannote deps, but CUDA libs removed). Health check passes, WebSocket streaming verified working.