Topic
mllms
Artificial Intelligence #semantic gradients#mllms
SAGA Framework Uses Frozen MLLMs to Boost Visual Embedding Recall by 3-6 Points
Researchers propose SAGA, a framework that converts frozen MLLMs into attribute-aware training signals for vision encoders, replacing uniform scalar distances with semantic gradients. Using Group Relative Policy Optimization (GRPO) and attention distillation, SAGA improves zero-shot image retrieval Recall@1 by 3 to 6 points on benchmark datasets.
Jun 16, 2026 1 source
Artificial Intelligence #sentiment analysis#multimodal
MAF Framework Dynamically Optimizes Prompting for Multimodal Sentiment Analysis
A new research paper proposes the Multimodal Adaptive Few-Shot Prompting (MAF) framework, which improves sentiment analysis in multimodal large language models (MLLMs) by dynamically retrieving and integrating query-relevant demonstrations. The method uses a lightweight coefficient network to fuse multimodal similarity scores and enhances prediction stability via majority voting.
Jun 16, 2026 1 source