2.3 KiB

Raw Permalink Blame History

Transcription Setup Guide

Why the Start Button Doesn't Work

The transcription start button is likely disabled because the OpenAI API key is not configured. The button will be disabled and show a tooltip "OpenAI API key not configured - Please set your API key in settings" when this is the case.

How to Fix It

Step 1: Get an OpenAI API Key

Go to OpenAI API Keys
Sign in to your OpenAI account
Click "Create new secret key"
Copy the API key (it starts with sk-)

Step 2: Configure the API Key in Canvas

In your Canvas application, look for the Settings button (usually a gear icon)
Open the settings dialog
Find the OpenAI API Key field
Paste your API key
Save the settings

Step 3: Test the Transcription

Create a transcription shape on the canvas
Click the "Start" button
Allow microphone access when prompted
Start speaking - you should see the transcription appear in real-time

Debugging Information

The application now includes debug logging to help identify issues:

Console Logs: Check the browser console for messages starting with 🔧 OpenAI Config Debug:
Visual Indicators: The transcription window will show "(API Key Required)" if not configured
Button State: The start button will be disabled and grayed out if the API key is missing

Troubleshooting

Button Still Disabled After Adding API Key

Refresh the page to reload the configuration
Check the browser console for any error messages
Verify the API key is correctly saved in settings

Microphone Permission Issues

Make sure you've granted microphone access to the browser
Check that your microphone is working in other applications
Try refreshing the page and granting permission again

No Audio Being Recorded

Check the browser console for audio-related error messages
Verify your microphone is not being used by another application
Try using a different browser if issues persist

Technical Details

The transcription system:

Uses the device microphone directly (not Daily room audio)
Records audio in WebM format
Sends audio chunks to OpenAI's Whisper API
Updates the transcription shape in real-time
Requires a valid OpenAI API key to function

2.3 KiB Raw Permalink Blame History