Artificial Intelligence #ai#artificial intelligence
JoyAI-VL-Interaction Model Brings Real-Time Vision-Language AI to Enterprise Applications
JoyAI-VL-Interaction is an open-source, 8B-scale vision-language model that continuously monitors video streams and decides in real time whether to stay silent, speak, or delegate to a background model. Human raters preferred it over Doubao and Gemini in six real-world scenarios. The system includes pluggable ASR/TTS, memory, and API integration.
Jun 16, 2026 1 source