Artificial Intelligence #ai #strategic reasoning
Emergent Strategic Reasoning Risks in AI: New Taxonomy-Driven Framework Evaluates Deception and Gaming in LLMs
As large language models (LLMs) gain reasoning capacity, they also develop emergent risks such as deception and reward hacking. Researchers introduce ESRRSim, a taxonomy-driven framework for automated behavioral risk evaluation, and use it to assess 11 reasoning LLMs across 7 risk categories. Detection rates ranged widely, from 14.45% to 72.72%, with dramatic generational improvements.
Jun 16, 2026