Artificial Intelligence #prompt injection#ai security
New Defense Keeps Attack Success Rate Below 4% for Adaptive Prompt Injection on LLM Agents
Researchers propose RETA, a training-based defense that grounds LLM agent security on user tasks rather than attack patterns. Using chain-of-thought reasoning and red-teaming with diversity reward, RETA keeps average attack success rate below 4% across six adaptive attacks while preserving utility.
Jun 16, 2026 1 source