Topic
ai research
AI Scientist Automates Entire Research Lifecycle, Passes First Peer Review
A new AI system called The AI Scientist can autonomously conduct the entire research lifecycle, from idea generation to manuscript writing and peer review. It produced a paper that passed the first round of peer review at a major machine learning conference workshop with a 70% acceptance rate. The system operates in both a focused mode using human-provided templates and a template-free open-ended mode.
New DAG-SHAP Method Improves Feature Attribution Using Edge Intervention in Directed Acyclic Graphs
Researchers introduce DAG-SHAP, a feature attribution method for directed acyclic graphs that uses edge intervention to address limitations of node-centric Shapley value approaches. The method captures both externality and exogenous influence, validated on real and synthetic datasets.
VigilFormer: Deformable Attention for Video Anomaly Detection with Causal Risk Inference
A new AI framework, VigilFormer, uses deformable attention and causal inference to detect anomalies in surveillance video at 41.5 FPS, outperforming prior methods on three benchmarks.
Think-at-Hard: Selective Latent Iterations Boost LLM Reasoning Accuracy by Up to 6.8%
A new research paper proposes Think-at-Hard (TaH), a looped transformer that selectively performs latent iterations only on tokens likely to be incorrect. By skipping iterations on 93% of tokens, TaH outperforms always-iterate models by 3.8-4.4% and single-iteration baselines by up to 6.8%, while requiring negligible extra parameters.
PACT Hybrid Architecture Combines Small Language Model Planning with Reinforcement Learning for Enhanced Decision-Making
Researchers propose Plan, Align, Commit, Think (PACT), a hybrid architecture that couples a fast reactive reinforcement learning policy with a slow deliberative small language model (SLM) planner. The SLM asynchronously generates and validates action plans, which are executed directly once verified as safe through simulation. Evaluated on three FrozenLake configurations, PACT outperformed all baselines using a 2B-parameter SLM backbone, demonstrating that deliberative planning and reactive execution complement each other.
Expert Tying Reduces Memory Footprint of Mixture-of-Experts LLMs by Nearly Half
A new arXiv paper from Jaggi proposes Expert Tying, an architectural modification for Mixture-of-Experts LLMs that shares expert parameters across consecutive transformer layers. Pretraining experiments show memory footprint reduction by almost 2x with virtually no degradation in perplexity or downstream quality, evaluated on OLMoE, Qwen3, and DeepSeek-style architectures.
Causal Model of Theory of Mind in Conflict Offers New Path for AI Mentalizing
A new research paper by Gurney and Nikolos introduces a structural causal model for theory of mind (ToM) in artificial intelligence, addressing the unresolved question of when mentalizing is warranted in conflict situations. The model treats ToM as a mechanism activated by situational and agent-level conditions, offering a resource-rational decision procedure for AI systems. It specifies four exogenous variables, five endogenous mediators, and three causal pathways leading to epistemic accuracy, with implications for efficiency, trust, and robust artificial social intelligence.