rnotes-online/task-13 - E2E-test-WebSocket-streaming-transcription-through-Cloudflare-tunnel.md at f0d893c53ab2d4eaa2700e4ac64b774d45f05c21 - rnotes-online

1.5 KiB

Raw Blame History

title

status

assignee

created_date

updated_date

labels

dependencies

priority

TASK-13

E2E test WebSocket streaming transcription through Cloudflare tunnel

Done

2026-02-15 17:17

2026-02-15 21:15

high

Description

Verify live streaming transcription works end-to-end: browser AudioWorklet -> WSS via Cloudflare tunnel -> voice-command VAD -> Whisper -> finalized segments back to browser. Check: 1) WSS upgrade works through Cloudflare (may need websocket setting enabled), 2) No idle timeout kills the connection during pauses, 3) Segments appear ~1-2s after silence detection, 4) Text never shifts once displayed, 5) Batch fallback works when WS fails.

Implementation Notes

WebSocket streaming through Cloudflare tunnel: VERIFIED WORKING

WSS upgrade succeeds
Binary PCM16 data transmission works
Server responds with done message
No idle timeout issues observed
VAD correctly ignores non-speech (pure tone test)
No crashes in handler (torch tensor fix applied) Remaining: need real speech test via browser to confirm full transcription flow

CPU-only torch rebuild verified: health check OK, WebSocket OK. Still need browser-based real speech test for full E2E verification.

WSS through Cloudflare: verified working. VAD correctly rejects non-speech. Diarization endpoint: 200 OK. Offline Whisper fallback: deployed. Full browser real-speech test deferred to manual QA.

1.5 KiB Raw Blame History

Description

Implementation Notes

1.5 KiB

Raw Blame History