Topic
decision-making
New LLM-Based Simulator Evaluates Deliberative Polling Information Systems Against Strategic Attacks
Researchers introduce the LLM-based Agentic Bipolar Argumentation Simulator (ABAS) to evaluate information systems for deliberative polling. ABAS simulates autonomous agents voting and submitting justifications, measuring coverage of the reason space. Experiments show that a tag-flood attack collapses coverage, while a reversed-PageRank weighting resists it markedly better than uniform weights.
SCAN Framework Helps CTOs Decide When to Use Generative AI for Task Allocation
A new academic paper introduces SCAN, a decision-making framework for task allocation with generative AI. Based on Vygotsky's Zone of Proximal Development and Metacognition, SCAN defines four sub-zones—Substitute, Complement, Aid, Non-negotiable—to guide knowledge workers and students in effectively using GenAI. The framework also addresses cognitive load, cognitive offloading, sycophancy, and the future of work.
Survey on Medical Embodied AI Highlights Integration of Perception, Decision-Making, and Action
A systematic survey of medical embodied AI examines its core components — perception, decision-making, and action — and their coordinated integration for real-world clinical workflows. The paper reviews representative applications, datasets, and challenges, highlighting the need for unified system-level organization beyond individual functional aspects.
RetailBench Benchmark Tests LLM Agents on Long-Horizon Retail Decisions
Researchers introduced RetailBench, a simulation benchmark for evaluating LLM agents in single-store supermarket management over 180 days. Tests on seven models showed only a subset completed the full horizon, and even the best fell far behind an oracle policy due to incomplete evidence acquisition and lack of consistent strategy.
Causal Model of Theory of Mind in Conflict Offers New Path for AI Mentalizing
A new research paper by Gurney and Nikolos introduces a structural causal model for theory of mind (ToM) in artificial intelligence, addressing the unresolved question of when mentalizing is warranted in conflict situations. The model treats ToM as a mechanism activated by situational and agent-level conditions, offering a resource-rational decision procedure for AI systems. It specifies four exogenous variables, five endogenous mediators, and three causal pathways leading to epistemic accuracy, with implications for efficiency, trust, and robust artificial social intelligence.
Technology Stop Bad AI Projects to Save Your Budget
In the fast-paced world of AI, the ability to stop unproductive projects early is crucial to managing budgets effectively. Implementing a 'kill engine' can help organizations make evidence-based decisions, preventing unnecessary resource allocation.