Modalities

This page summarizes what is currently shipped in hama and what remains on the roadmap.

Available

Text → IPA (G2P): Available in Python, Node/Bun, and the browser. Returns IPA plus per-phoneme alignment metadata.
Audio → Phoneme (ASR): Available in Python, Node/Bun, and the browser. Accepts waveform input and returns collapsed phoneme output from asr_waveform_fp16.onnx.

Coming soon

IPA → Text
Audio → Text
Text → Embeddings

Runtime coverage

Runtime	G2P	Phoneme ASR
Browser	Yes	Yes
Node/Bun	Yes	Yes
Python	Yes	Yes