Artificial Intelligence #artificial intelligence#phoneme recognition
ArtNet: JEPA-Like Articulatory Framework Achieves 20.56% Error Reduction in Zero-Shot Phoneme Recognition
Researchers propose ArtNet, a JEPA-like framework for zero-shot cross-lingual phoneme recognition. By integrating an articulatory predictor with a variational information bottleneck, ArtNet suppresses language-specific variations. Experiments on seven unseen languages show a 20.56% relative reduction in phoneme error rate and 7.01% in phoneme feature error rate.
Jun 16, 2026 1 source