On-device speech demo · v1.5.0

Say something. Your browser decodes it.

Microphone audio goes to a phoneme ASR model, and the resulting phonemes go to P2G (phoneme → grapheme). Both models run on the hand-written WebAssembly engine (~39 KB, no onnxruntime). Audio never leaves your device.
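
As a rough illustration of the two-stage flow described above, here is a minimal TypeScript sketch. The interface names (`PhonemeRecognizer`, `GraphemeDecoder`), the `transcribe` function, and the stub models are all hypothetical; the demo's actual engine API is not shown on this page, so treat this as an assumption-laden outline rather than the real implementation.

```typescript
// Hypothetical interfaces: the demo's real engine API is not documented here.
interface PhonemeRecognizer {
  // Maps mono PCM samples to a phoneme sequence.
  recognize(samples: Float32Array): string[];
}

interface GraphemeDecoder {
  // Maps a phoneme sequence to text (phoneme -> grapheme).
  decode(phonemes: string[]): string;
}

interface Transcript {
  phonemes: string[];
  text: string;
}

// Chain the two stages: audio -> phonemes -> text.
function transcribe(
  samples: Float32Array,
  asr: PhonemeRecognizer,
  p2g: GraphemeDecoder,
): Transcript {
  const phonemes = asr.recognize(samples);
  const text = p2g.decode(phonemes);
  return { phonemes, text };
}

// Stub stages standing in for the real models, for illustration only.
const stubAsr: PhonemeRecognizer = {
  recognize: (samples) => (samples.length > 0 ? ["h", "aɪ"] : []),
};
const stubP2g: GraphemeDecoder = {
  decode: (phonemes) => (phonemes.join("") === "haɪ" ? "hi" : ""),
};

// One second of silence at an assumed 16 kHz sample rate.
const result = transcribe(new Float32Array(16000), stubAsr, stubP2g);
```

Keeping the intermediate phonemes in the result mirrors the page layout, which shows the phoneme output and the decoded text side by side.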

Phonemes · Phoneme ASR

P2G decodes phonemes back to text

Text · P2G

P2G is a small (~7.3M-parameter) research model, so the text is deliberately approximate (e.g. quilter → kilter). It works best on short English speech. See the v1.5.0 notes or the text → phonemes demo.