Artificial Intelligence #ai#trust
New Study Measures Trust Between AI Agents, Revealing Formation, Breakage, and Recovery Dynamics
A preprint on arXiv introduces a behavioral measure to quantify trust between language-model agents using costly verification in a cooperative game. Testing six frontier model snapshots, the study finds that four models reduce verification by 60-85% when paired with reliable teammates, while trust recovery is slower than formation and clustered failures sustain suspicion longer. The results suggest that calibration, not maximal suspicion, should guide governance of multi-agent AI systems.
Jun 16, 2026 1 source