Artificial Intelligence #speech tokenization#semantic distillation
LM-SPT Uses Semantic Distillation to Improve Speech Tokenization for Language Models
A new speech tokenization method called LM-SPT uses semantic speech-resynthesis distillation to better align discrete speech tokens with language models. The approach outperforms previous semantic-enhanced tokenizers on automatic speech recognition and text-to-speech tasks without sacrificing reconstruction fidelity.
Jun 17, 2026 2 sources