iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
New Framework Reduces Visual Hallucinations in Multimodal AI Systems Without Retraining MAF Framework Dynamically Optimizes Prompting for Multimodal Sentiment Analysis Study on Pedestrian Attribute Recognition Identifies Sparsity Wall and Optimizes Edge Deployment AI Framework Targets 50% Water Loss in Jordan with LLM and Digital Twin Integration AnonShield: Scalable On-Premise Pseudonymization Cuts Vulnerability Data Processing from 92 Hours to Under 10 Minutes MoFore: A New Self-Supervised Framework Learns Video Representations by Forecasting Future Latent Embeddings Do LLMs Reliably Identify Correct Information Units in Aphasic Discourse? A New Study Evaluates Four Models AI Video Generation Method for Cardiac MRI Addresses Data Scarcity with Latent Motion Modeling SCAN Framework Helps CTOs Decide When to Use Generative AI for Task Allocation LLM-Encoded Knowledge Guides Federated Graph Recommendation to Improve Accuracy New Framework Reduces Visual Hallucinations in Multimodal AI Systems Without Retraining MAF Framework Dynamically Optimizes Prompting for Multimodal Sentiment Analysis Study on Pedestrian Attribute Recognition Identifies Sparsity Wall and Optimizes Edge Deployment AI Framework Targets 50% Water Loss in Jordan with LLM and Digital Twin Integration AnonShield: Scalable On-Premise Pseudonymization Cuts Vulnerability Data Processing from 92 Hours to Under 10 Minutes MoFore: A New Self-Supervised Framework Learns Video Representations by Forecasting Future Latent Embeddings Do LLMs Reliably Identify Correct Information Units in Aphasic Discourse? A New Study Evaluates Four Models AI Video Generation Method for Cardiac MRI Addresses Data Scarcity with Latent Motion Modeling SCAN Framework Helps CTOs Decide When to Use Generative AI for Task Allocation LLM-Encoded Knowledge Guides Federated Graph Recommendation to Improve Accuracy
Home ›› Topics ›› data science

Topic

data science

3 stories
MMLongEmbed Benchmark Reveals Limitations in Long-Context Multimodal Embedding Models Technology
Artificial Intelligence #multimodal#embedding

MMLongEmbed Benchmark Reveals Limitations in Long-Context Multimodal Embedding Models

MMLongEmbed is the first comprehensive benchmark for evaluating multimodal embedding models (MEMs) in long-context scenarios. It comprises four retrieval tasks covering text, document, and video modalities. The evaluation reveals that current MEMs rely heavily on superficial feature matching and struggle with deep semantic and structural dependencies, with performance degrading systematically based on context length and key information placement.

Jun 16, 2026 1 source
Research Finds Anomalies in Multivariate Time Series Benchmarks Are Mostly Univariate Technology
Artificial Intelligence #time series#anomaly detection

Research Finds Anomalies in Multivariate Time Series Benchmarks Are Mostly Univariate

A study by researchers Pinet, Cumin, Berlemont, and Vaufreydaz on eight public benchmarks for multivariate time series anomaly detection (MTSAD) finds that labeled anomalies are overwhelmingly univariate—no cross-channel rupture occurs without a univariate deviation. The paper's diagnostic framework and synthetic data experiments show that current benchmarks do not justify cross-channel modeling, as channel-dependent detectors offer no measurable gain over channel-independent ones. The authors call for more structurally diverse evaluation sets.

Jun 16, 2026 1 source
New Benchmark IRTS-ToolBench Tests LLMs on Irregular Time Series Question Answering Technology
Artificial Intelligence #ai#artificial intelligence

New Benchmark IRTS-ToolBench Tests LLMs on Irregular Time Series Question Answering

A research paper introduces IRTS-ToolBench, a benchmark of 1,700 questions spanning 10 task types across 13 domains to evaluate large language models (LLMs) and AI agents on irregular time series question answering (TSQA). The benchmark addresses a gap in existing TSQA benchmarks that assume regular sampling, providing standardized inputs and a reproducible evaluation protocol for verifiable agentic data science.

Jun 16, 2026 2 sources