iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
GAS-Leak-LLM: Genetic Algorithm Jailbreaks Black-Box LLMs, Exposing Safety Gaps New Generative Recommendation Model HoloRec Uses Hierarchical Encoding and Interleaved Reasoning to Boost Accuracy Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning Led by US, exits from gold ETFs continue for the 5th week in a row Domain-Guided Prompting Boosts Segment Anything Model for Seismic Interpretation Spokes Optimizes Diverse Pretraining Data Selection for LLMs, Boosting Performance Medical Heuristic Learning: LLM-Driven Framework for Interpretable Clinical Decision Rules Commodore Callback 8020 Brings Digital Detox With Modern Apps and Retro Design PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks Research Shows 'Retrieve, Don't Retrain' Approach Cuts AI Model Adaptation Costs GAS-Leak-LLM: Genetic Algorithm Jailbreaks Black-Box LLMs, Exposing Safety Gaps New Generative Recommendation Model HoloRec Uses Hierarchical Encoding and Interleaved Reasoning to Boost Accuracy Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning Led by US, exits from gold ETFs continue for the 5th week in a row Domain-Guided Prompting Boosts Segment Anything Model for Seismic Interpretation Spokes Optimizes Diverse Pretraining Data Selection for LLMs, Boosting Performance Medical Heuristic Learning: LLM-Driven Framework for Interpretable Clinical Decision Rules Commodore Callback 8020 Brings Digital Detox With Modern Apps and Retro Design PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks Research Shows 'Retrieve, Don't Retrain' Approach Cuts AI Model Adaptation Costs
Home ›› Technology ›› Ai ›› Ai Ethics ›› Philosophy Paper Argues Large Language Models Lack Agency for Moral Responsibility

Philosophy Paper Argues Large Language Models Lack Agency for Moral Responsibility

A recent academic paper from arXiv argues that attributing agency or moral responsibility to large language models (LLMs) is misguided. The paper maintains that LLMs produce coherent outputs but are fully characterized by probabilistic input-output mappings, lacking intrinsic intentionality and self-attributed action. This challenges claims that LLMs can be moral agents, with direct relevance to how enterprises govern AI in decision-making.

iG
iGEN Editorial
June 16, 2026
Philosophy Paper Argues Large Language Models Lack Agency for Moral Responsibility

Recent advances in large language models (LLMs) have prompted claims that such systems exhibit agency or qualify as moral agents, but a new paper published on arXiv argues these attributions are misguided. According to the paper titled "Why Sampling Is Not Choosing: Intentionality, Agency, and Moral Responsibility in Large Language Models" by Keshet; Joseph, moral responsibility requires commitment-bearing agency grounded in intrinsic intentionality and self-attributed action—a form of free will that LLMs lack. For enterprise technology leaders evaluating AI for supply chain and logistics decisions, this distinction has direct implications: systems that cannot genuinely own their outputs cannot bear moral accountability, shifting the responsibility entirely to human operators and organizations.

The Argument Against Agency in LLMs

The paper argues that although LLMs generate coherent and normatively evaluable outputs, their operation is fully characterized by probabilistic input-output mappings learned from data. Their apparent intentionality is derived rather than intrinsic, and their outputs are neither owned as commitments nor guided by reasons. Variability introduced by stochastic sampling does not amount to choice or authorship. The authors address objections from the intentional stance, functionalism, compatibilism, and the presence of moral reasoning in model outputs, concluding that none suffice to establish genuine agency.

Why Probabilistic Sampling Does Not Equal Choice

A core claim is that the stochastic sampling process that produces variation in LLM outputs is not equivalent to choosing. The paper states: "Variability introduced by stochastic sampling does not amount to choice or authorship." For enterprise applications, this means that when an LLM generates different recommendations for a logistics routing problem due to temperature settings or random seeds, the system is not exercising judgment—it is merely reflecting statistical patterns in training data. Decision-makers cannot interpret output variability as the model weighing options like a human would.

Implications for Enterprise AI Governance

For CTOs and digital transformation leaders, the paper reinforces that LLMs should not be treated as moral agents. As the paper puts it, moral responsibility requires "commitment-bearing agency grounded in intrinsic intentionality and self-attributed action"—criteria no current LLM meets. This means accountability for any harm or error caused by an LLM-enabled system lies with the deploying organization, not the model. Enterprises integrating LLMs into supply chain optimization, trade documentation, or customs compliance must ensure humans retain oversight and that systems are designed with clear boundaries on autonomous decision-making. The paper's philosophical analysis, while abstract, provides a foundation for governance frameworks that treat LLMs as tools, not agents.


Sources:

Keep Reading

Recommended Stories

LLaMA 3.1's Ethical Reasoning Reveals Frame-Conditioned Moral Computation, Researchers Find Technology

LLaMA 3.1's Ethical Reasoning Reveals Frame-Conditioned Moral Computation, Researchers Find

A mechanistic interpretability audit of Meta's LLaMA 3.1-8B-Instruct on 54 moral prompts reveals that the model's ethical reasoning is highly sensitive to surface features of the prompt, a phenomenon called Frame-Conditioned Moral Computation. The study, using the Transluce platform, found domain-specific representations dominate activation lists and that RLHF may re-order surface text without removing underlying biases. The authors call for a new research program, Mechanistic Alignment, to supplement behavioral alignment.

June 16, 2026
Training-Free Framework Uses XAI and Multimodal LLMs to Generate Grounded Explanations for Speech Deepfake Detection Technology

Training-Free Framework Uses XAI and Multimodal LLMs to Generate Grounded Explanations for Speech Deepfake Detection

Researchers propose a training-free explanation framework that integrates XAI evidence with multimodal large language models to generate grounded and specific explanations for speech deepfake detection. Using the PartialSpoof dataset, the method increases inside accuracy by over 45%, verified through human evaluation and faithfulness checks.

June 16, 2026
Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning Technology

Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning

A new research paper introduces Tensor-Coord, a multilinear algebra framework that represents joint plans of multiple LLM agents as a third-order tensor. By decomposing the tensor, it identifies coordination conflicts and enables iterative replanning, achieving 100% conflict-free plans for 2-agent tasks and 80% for 3-agent tasks in simulated delivery scenarios.

June 16, 2026
UrbanWell Benchmark Puts Multimodal LLMs to Test on Spatio-Temporal Urban Wellbeing Analytics Technology

UrbanWell Benchmark Puts Multimodal LLMs to Test on Spatio-Temporal Urban Wellbeing Analytics

Researchers introduce UrbanWell, a large-scale benchmark for evaluating multimodal large language models on spatio-temporal urban wellbeing analytics. The benchmark covers 38 cities, multiple years, and diverse indicators including environment, accessibility, urban form, vitality, and subjective perception. Testing 15 state-of-the-art MLLMs in zero-shot settings reveals substantial performance variations across heterogeneous indicators.

June 16, 2026