Artificial Intelligence #ai#artificial intelligence
KILLBENCH: New Benchmark Tests External Kill Switches to Stop Malicious AI
Researchers propose KILLBENCH, a benchmark for evaluating external AI kill switches that stop malicious web agents without internal access. The benchmark includes four agent configurations, eight harmful scenarios, and ten jailbreak patterns. It was tested on models including GPT-5.2, Grok-4.3, Gemma4, and Qwen variants.
Jun 16, 2026 1 source