generative models

9 stories

Artificial Intelligence #flow matching#object dynamics

FlowMaps: Modeling Long-Term Multimodal Object Dynamics with Flow Matching

FlowMaps, a latent flow matching model, predicts multimodal distributions of future object locations in 3D space by learning from past human interactions. Tested in over 600 episodes, it outperforms state-of-the-art approaches for dynamic Object Navigation tasks in simulated and real environments. The research, published on arXiv, has potential applications for robotics in changing environments.

Jul 8, 2026 1 source

Residual-Space Evolutionary Optimization via Flow-based Generative Models

Technology

Artificial Intelligence #residual-space#evolutionary optimization

Residual-Space Evolutionary Optimization via Flow-based Generative Models

A new framework called residual-space evolutionary optimization addresses the challenge of data editing with non-differentiable objectives in flow-based generative models. By operating in residual space, it separates local exploitation (self-pollination) from broader exploration (cross-pollination). The method was validated on the MorphoMNIST benchmark and crystal data, showing balanced target alignment, instance preservation, and diversity.

Jun 22, 2026 1 source

New arXiv Paper Outlines Principles for Deterministically Encapsulated Generative Models to De-Risk AI Integration

Technology

Artificial Intelligence #grounded inference#deterministically encapsulated

New arXiv Paper Outlines Principles for Deterministically Encapsulated Generative Models to De-Risk AI Integration

A new manuscript on arXiv establishes foundational principles for incorporating generative models into traditional computational systems. The paper defines four specific primitives of AI blended architecture designed to enable deterministic encapsulation of probabilistic models, and identifies two overarching anti-patterns that serve as warnings for engineers. This framework aims to de-risk AI integration and provide a foundation for future generative model interfaces.

Jun 20, 2026 2 sources

First Billion-Parameter Generative Foundation Model for Chest Radiography Achieves Expert-Level Synthesis Fidelity

Technology

Artificial Intelligence #generative models#foundation models

First Billion-Parameter Generative Foundation Model for Chest Radiography Achieves Expert-Level Synthesis Fidelity

Ribeiro et al. present the largest specialist generative foundation model for chest radiographs, with over 1.3 billion parameters. Trained on 1.2 million radiographs, the model supports controllable generation across demographics, views, and pathologies, advancing synthesis fidelity to clinical indistinguishability.

Jun 20, 2026 1 source

Diffusion Language Models Show Promise but Demand Careful Inference Tuning, Study Finds

Technology

Artificial Intelligence #diffusion#language models

Diffusion Language Models Show Promise but Demand Careful Inference Tuning, Study Finds

A new systematic study from researchers analyzes eight state-of-the-art Diffusion Language Models (DLMs) across eight benchmarks covering reasoning, coding, translation, and more. The research highlights how inference-time choices like denoising steps and context length create trade-offs between generation quality and computational efficiency, offering guidance for enterprise deployment.

Jun 20, 2026 1 source

New Study Challenges Prior Claims on Scaling Context Length in Imitation Learning

Technology

Artificial Intelligence #training#evaluation

New Study Challenges Prior Claims on Scaling Context Length in Imitation Learning

Researchers evaluated diffusion policies for robotic imitation learning across varying context lengths, challenging prior claims that long-context scaling is fragile. They propose a training algorithm that jointly trains policies at multiple context lengths, reducing sample complexity.

Jun 17, 2026 1 source

DySink: Dynamic Frame Sinks Enable Adaptive Long Video Generation Without Context Collapse

Technology

Artificial Intelligence #video generation#autoregressive

DySink: Dynamic Frame Sinks Enable Adaptive Long Video Generation Without Context Collapse

Researchers propose DySink, a retrieval-based framework that replaces static early-frame sinks with dynamic, visually relevant historical frames for autoregressive long video generation. This approach prevents sink collapse and improves temporal quality in minute-long videos.

Jun 16, 2026 1 source

Token Reduction in Generative Models Must Evolve Beyond Efficiency, New Research Argues

Technology

Artificial Intelligence #token reduction#generative models

Token Reduction in Generative Models Must Evolve Beyond Efficiency, New Research Argues

A new paper from arXiv argues that token reduction in Transformer architectures should be reframed from a mere efficiency strategy to a fundamental principle in generative modeling. The authors outline four key benefits beyond efficiency: deeper multimodal integration, reduced overthinking and hallucinations, maintained coherence over long inputs, and enhanced training stability.

Jun 16, 2026 1 source

First Wasserstein-2 Convergence Proof for Decentralized Diffusion Models with ODE Samplers

Technology

Artificial Intelligence #wasserstein convergence#ode-based samplers

First Wasserstein-2 Convergence Proof for Decentralized Diffusion Models with ODE Samplers

A team of researchers has proven the first convergence guarantee in Wasserstein-2 distance for ODE-based samplers in decentralized diffusion models. The work addresses the missing theoretical foundation for decentralized generative architectures that replace a single global velocity field with multiple local experts and a routing mechanism. The result shows distribution converges at rate O(N^{-1/2}+ε), paving the way for privacy-scalable AI deployments.

Jun 16, 2026 1 source