--- id: task-4 title: 'Phase 3: AI Integration Shapes' status: To Do assignee: [] created_date: '2026-01-02 15:54' labels: - migration - shapes - ai dependencies: [] priority: medium --- ## Description Port AI-powered shapes using existing MCP servers and APIs: 1. **folk-image-gen** - Image generation (fal.ai Flux) - Prompt input, image history thread - Loading states, error handling - Uses: mcp__fal-ai__fal_generate_image 2. **folk-video-gen** - Video generation (WAN 2.1) - Image-to-video, text-to-video - Duration control, queue polling - Uses: mcp__fal-ai__fal_generate_video 3. **folk-prompt** - LLM prompt executor - Agent binding, multiple personalities - Output streaming - Uses: mcp__gemini__gemini_generate or direct Anthropic API 4. **folk-transcription** - Audio transcription (Whisper) - Real-time transcription, pause/resume - Speaker diarization - Uses: Web Speech API fallback + Whisper API Simplifications: - Use MCP tools directly instead of custom API clients - Simplify loading states to CSS classes - Remove complex React hooks, use async/await patterns ## Acceptance Criteria - [ ] #1 folk-image-gen with fal.ai integration - [ ] #2 folk-video-gen with video generation - [ ] #3 folk-prompt with LLM streaming - [ ] #4 folk-transcription with Whisper