Transcription Setup Guide
Why the Start Button Doesn't Work
If the transcription Start button does not respond, the most likely cause is that the OpenAI API key is not configured. In that case the button is disabled and shows the tooltip "OpenAI API key not configured - Please set your API key in settings".
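The rule described above can be sketched as a small pure function. This is only an illustration, not the application's actual code: the `StartButtonState` shape and the `getStartButtonState` name are hypothetical, and only the tooltip text comes from this guide.

```typescript
// Hypothetical shape for the Start button's UI state (not taken from the app).
interface StartButtonState {
  disabled: boolean;
  tooltip: string | null;
}

// Tooltip text as quoted in this guide.
const MISSING_KEY_TOOLTIP =
  "OpenAI API key not configured - Please set your API key in settings";

// Treats a missing, empty, or whitespace-only key as "not configured".
function getStartButtonState(
  apiKey: string | null | undefined
): StartButtonState {
  const configured = typeof apiKey === "string" && apiKey.trim().length > 0;
  return configured
    ? { disabled: false, tooltip: null }
    : { disabled: true, tooltip: MISSING_KEY_TOOLTIP };
}
```

Deriving the button state from the key alone keeps the check easy to test outside the browser.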
How to Fix It
Step 1: Get an OpenAI API Key
- Go to the OpenAI API Keys page
- Sign in to your OpenAI account
- Click "Create new secret key"
- Copy the API key (it starts with sk-)
Step 2: Configure the API Key in Canvas
- In your Canvas application, look for the Settings button (usually a gear icon)
- Open the settings dialog
- Find the OpenAI API Key field
- Paste your API key
- Save the settings
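A pasted key can pick up stray whitespace or be cut short. The sketch below shows one way to sanity-check it before saving. The `checkPastedKey` function and `KeyCheck` type are hypothetical and not part of the application; the only fact taken from this guide is that keys start with `sk-`.

```typescript
// Result of a basic format check on a pasted key (hypothetical type).
type KeyCheck = { ok: true; key: string } | { ok: false; reason: string };

// Checks only the format of the key. It does not contact OpenAI,
// so a key that passes here can still be rejected by the API.
function checkPastedKey(raw: string): KeyCheck {
  const key = raw.trim(); // drop leading/trailing spaces and line breaks
  if (key.length === 0) {
    return { ok: false, reason: "The API key field is empty." };
  }
  if (/\s/.test(key)) {
    return { ok: false, reason: "The API key contains whitespace." };
  }
  if (!key.startsWith("sk-")) {
    return { ok: false, reason: 'The API key should start with "sk-".' };
  }
  return { ok: true, key };
}
```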
Step 3: Test the Transcription
- Create a transcription shape on the canvas
- Click the "Start" button
- Allow microphone access when prompted
- Start speaking; the transcription should appear in real time
Debugging Information
The application now includes debug logging to help identify issues:
- Console Logs: Check the browser console for messages starting with 🔧 OpenAI Config Debug:
- Visual Indicators: The transcription window will show "(API Key Required)" if not configured
- Button State: The start button will be disabled and grayed out if the API key is missing
Troubleshooting
Button Still Disabled After Adding API Key
- Refresh the page to reload the configuration
- Check the browser console for any error messages
- Verify the API key is correctly saved in settings
Microphone Permission Issues
- Make sure you've granted microphone access to the browser
- Check that your microphone is working in other applications
- Try refreshing the page and granting permission again
No Audio Being Recorded
- Check the browser console for audio-related error messages
- Verify your microphone is not being used by another application
- Try using a different browser if issues persist
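The microphone problems in the two sections above usually appear in the console as a rejected `getUserMedia` call with a named error. The sketch below maps the standard error names to the fixes listed above. The `explainMicrophoneError` function is hypothetical, and how the application itself reports these errors is not described in this guide.

```typescript
// Maps the `name` of an error thrown by getUserMedia to a troubleshooting hint.
// The error names are the standard ones; the messages are illustrative.
function explainMicrophoneError(errorName: string): string {
  switch (errorName) {
    case "NotAllowedError":
      return "Microphone access was denied. Allow it in the browser's site settings, then refresh the page.";
    case "NotFoundError":
      return "No microphone was found. Check that one is connected and works in other applications.";
    case "NotReadableError":
      return "The microphone could not be read. Another application may be using it.";
    default:
      return `Unexpected microphone error (${errorName}). Check the browser console for details.`;
  }
}
```

In the browser this could be wired up as `navigator.mediaDevices.getUserMedia({ audio: true }).catch((err) => console.warn(explainMicrophoneError(err.name)))`.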
Technical Details
The transcription system:
- Uses the device microphone directly (not Daily room audio)
- Records audio in WebM format
- Sends audio chunks to OpenAI's Whisper API
- Updates the transcription shape in real time
- Requires a valid OpenAI API key to function
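As a rough sketch of the upload step, the function below builds a request for OpenAI's audio transcription endpoint from one WebM chunk. It is a sketch under stated assumptions, not the application's implementation: the `buildTranscriptionRequest` name, the `chunk.webm` file name, and the choice to send each chunk as its own request are all assumptions. The endpoint URL, the `whisper-1` model name, and the Bearer authorization header are from OpenAI's public API.

```typescript
const TRANSCRIPTION_URL = "https://api.openai.com/v1/audio/transcriptions";

// Hypothetical container for the arguments that would be passed to fetch().
interface TranscriptionRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: FormData };
}

// Builds (but does not send) a multipart request for one audio chunk.
function buildTranscriptionRequest(
  apiKey: string,
  audio: Blob
): TranscriptionRequest {
  const body = new FormData();
  body.append("file", audio, "chunk.webm"); // file name is an assumption
  body.append("model", "whisper-1");
  return {
    url: TRANSCRIPTION_URL,
    init: {
      method: "POST",
      // No Content-Type header here: fetch adds the multipart boundary itself.
      headers: { Authorization: `Bearer ${apiKey}` },
      body,
    },
  };
}
```

The request could then be sent with `const res = await fetch(req.url, req.init)`; with the default response format, the transcribed text is in the `text` field of the JSON response.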