diff --git a/backlog/tasks/task-014 - Implement-WebGPU-based-local-image-generation-to-reduce-RunPod-costs.md b/backlog/tasks/task-014 - Implement-WebGPU-based-local-image-generation-to-reduce-RunPod-costs.md new file mode 100644 index 0000000..3be4bda --- /dev/null +++ b/backlog/tasks/task-014 - Implement-WebGPU-based-local-image-generation-to-reduce-RunPod-costs.md @@ -0,0 +1,56 @@ +--- +id: task-014 +title: Implement WebGPU-based local image generation to reduce RunPod costs +status: To Do +assignee: [] +created_date: '2025-12-04 11:46' +labels: + - performance + - cost-optimization + - webgpu + - ai + - image-generation +dependencies: [] +priority: high +--- + +## Description + + +Integrate WebGPU-powered browser-based image generation (SD-Turbo) to reduce RunPod API costs and eliminate cold start delays. This creates a hybrid pipeline where quick drafts/iterations run locally in the browser (FREE, ~1-3 seconds), while high-quality final renders still use RunPod SDXL. + +**Problem:** +- Current image generation always hits RunPod (~$0.02/image + 10-30s cold starts) +- No instant feedback loop for creative iteration +- 100% of compute costs are cloud-based + +**Solution:** +- Add WebGPU capability detection +- Integrate SD-Turbo for instant browser-based previews +- Smart routing: drafts → browser, final renders → RunPod +- Potential 70% reduction in RunPod image generation costs + +**Cost Impact (projected):** +- 1,000 images/mo: $20 → $6 (save $14/mo) +- 5,000 images/mo: $100 → $30 (save $70/mo) +- 10,000 images/mo: $200 → $60 (save $140/mo) + +**Browser Support:** +- Chrome/Edge: Full WebGPU (v113+) +- Firefox: Windows (July 2025) +- Safari: v26 beta +- Fallback: WASM backend for unsupported browsers + + +## Acceptance Criteria + +- [ ] #1 WebGPU capability detection added to clientConfig.ts +- [ ] #2 SD-Turbo model loads and runs in browser via WebGPU +- [ ] #3 ImageGenShapeUtil has Quick Preview vs High Quality toggle +- [ ] #4 Smart routing in aiOrchestrator routes drafts to browser +- [ ] #5 Fallback to WASM for browsers without WebGPU +- [ ] #6 User can generate preview images with zero cold start +- [ ] #7 RunPod only called for High Quality final renders +- [ ] #8 Model download progress indicator shown to user +- [ ] #9 Works offline after initial model download +