large language models

42 stories

Artificial Intelligence #multi-agent debate#dynamic path generation

DynaDebate: Dynamic Path Generation Breaks Homogeneity in Multi-Agent AI Debates

A new research paper introduces DynaDebate, a framework that solves the homogeneity problem in multi-agent AI debates by dynamically generating diverse reasoning paths, shifting to step-by-step logic critique, and activating a verification agent to resolve disagreements. Experiments show superior performance across most benchmarks.

large language models

DynaDebate: Dynamic Path Generation Breaks Homogeneity in Multi-Agent AI Debates

Cordyceps: New Data Poisoning Attack Covertly Controls Large Language Models

Haiku to Opus in Just 10 bits: LLMs Unlock Large Compression Gains

How Scale Design Impacts LLM Metacognition and Enterprise AI Reliability

CircuitLasso Enables Scalable Interpretability for Large Language Models at Lower Cost

From Detection to Recovery: Operational Analysis of LLM Pre-training on 504 NVIDIA B200 GPUs

SDS-LoRA: New Low-Rank Adaptation Method Fixes Gradient Distortion in Large Model Fine-Tuning

MA-ProofBench: New Benchmark Tests LLMs on Formal Theorem Proving in Mathematical Analysis

AuAu Benchmark Audits Authoritarian Alignment in Large Language Models from Four Regions

Graphical-Probabilistic Modeling Brings Rigor to LLM-Native Software Engineering

New Unified Definition of AI Hallucination Pins It on Inaccurate World Modeling

LLM Manuscript Scoring System Validated Against Peer-Review Outcomes at Major AI Conference

New Research Defends LLMs from Extraction Attacks Using 'Knowledge Trap' Honeypot

New Diagnostic for Language-Driven Bandits Determines When Lightweight Models Beat LLMs

Self-Consistency Reranking Boosts Accuracy in Narrative Question Answering for Enterprise AI

Metacognitive Myopia in LLMs: New Framework Reveals Hidden Biases with High-Stakes Implications

LLM-WikiRace Benchmark Reveals Frontier AI Models Still Struggle with Planning Over Knowledge Graphs

Deep Residual Injection Method Enables Full-Spectrum Forensic AI Detection in Multimodal Models

DYNA Framework Uses Temporal Knowledge Graphs to Reduce LLM Forgetting Without Retraining

SPARK Method Activates Latent Security Knowledge in LLMs for Secure Code Generation

New Defense Keeps Attack Success Rate Below 4% for Adaptive Prompt Injection on LLM Agents

Service-Induced Congestion Threatens LLM Serving Throughput, New Model Shows

New Attack FragFuse Exploits LLM Agent Memory to Bypass Access Controls

‘Pretty Crazy’ Token Usage Tests Enterprise AI Bets as Companies Balance Costs and Gains

Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning

UrbanWell Benchmark Puts Multimodal LLMs to Test on Spatio-Temporal Urban Wellbeing Analytics

MAF Framework Dynamically Optimizes Prompting for Multimodal Sentiment Analysis

SCAN Framework Helps CTOs Decide When to Use Generative AI for Task Allocation

LLaMA 3.1's Ethical Reasoning Reveals Frame-Conditioned Moral Computation, Researchers Find

ChatPlanner: LLM Framework Personalizes Public Transit Routing with Fine-Tuning and RAG

SpecAlign Framework Uses Synthetic Data to Align Large Language Models with Specific Policies

New Framework Automates Skill Construction for Agentic Large Language Models

Latent Thought Flow: Efficient Reasoning in LLMs Cuts Cost and Boosts Accuracy

Think-at-Hard: Selective Latent Iterations Boost LLM Reasoning Accuracy by Up to 6.8%

Skill-to-LoRA: Replacing Runtime Skill Text with Trainable Adapters for Token-Efficient LLM Agents

StateGen Platform Generates Synthetic Training Data for Tool-Augmented LLMs with 9.66/10 Hallucination Score

Philosophy Paper Argues Large Language Models Lack Agency for Moral Responsibility

AgentLeak Benchmark Reveals Internal Channel Privacy Leaks in Multi-Agent LLM Systems

New ASRD Method Boosts Diffusion LLM Accuracy by 6.4% and Inference Speed by 7.2×

Training-Free Framework Uses XAI and Multimodal LLMs to Generate Grounded Explanations for Speech Deepfake Detection

Few-Shot Biomedical Relation Extraction with LLMs: A Viable Alternative to Supervised Learning?

New Definition of Good Explanations Highlights Challenges in Explaining LLM Outputs