iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
FusionRS Dataset Advances Dual-Modal Vision-Language AI for Remote Sensing CAP Achieves 87.6% Improvement in Respiratory Rate Prediction via Patient-Level PPG Learning LLM-WikiRace Benchmark Reveals Frontier AI Models Still Struggle with Planning Over Knowledge Graphs New Research Demystifies Variance in Circuit Discovery of Large Language Models PISA Memory System Draws on Cognitive Psychology to Boost AI Agent Adaptability New Multi-Scale Two-Stream Framework Aims to Decouple Semantics from Distortions in AI-Generated Image Quality Assessment P3B3 Benchmark Reveals Strong Brazilian Portuguese Bias in Large Language Models Controlled Dynamics Attractor Transformer: New Model Targets Graph Anomaly Detection with Biologically Plausible Attention Tamil Nadu OE Spinning Mills Threaten 50% Production Cut Over High Cotton Waste Prices BridgePolicy: New Diffusion Bridge Method Improves Visuomotor Policy Learning in Robotics FusionRS Dataset Advances Dual-Modal Vision-Language AI for Remote Sensing CAP Achieves 87.6% Improvement in Respiratory Rate Prediction via Patient-Level PPG Learning LLM-WikiRace Benchmark Reveals Frontier AI Models Still Struggle with Planning Over Knowledge Graphs New Research Demystifies Variance in Circuit Discovery of Large Language Models PISA Memory System Draws on Cognitive Psychology to Boost AI Agent Adaptability New Multi-Scale Two-Stream Framework Aims to Decouple Semantics from Distortions in AI-Generated Image Quality Assessment P3B3 Benchmark Reveals Strong Brazilian Portuguese Bias in Large Language Models Controlled Dynamics Attractor Transformer: New Model Targets Graph Anomaly Detection with Biologically Plausible Attention Tamil Nadu OE Spinning Mills Threaten 50% Production Cut Over High Cotton Waste Prices BridgePolicy: New Diffusion Bridge Method Improves Visuomotor Policy Learning in Robotics
Home ›› Technology ›› Ai ›› Llms ›› New Automated Jailbreak Attack UNIATTACK Achieves High Success Rate Against Multi-Layered LLM Defenses

New Automated Jailbreak Attack UNIATTACK Achieves High Success Rate Against Multi-Layered LLM Defenses

Researchers present UNIATTACK, an adversarial testing framework that extracts high-impact attack features from existing exploits and uses a specialized attacker LLM to compose flexible templates. The framework achieves an average attack success rate improvement of 64.63% to 248.82% over baselines on models with multi-layered defenses, while costing only 0.03% to 4.96% of baseline costs.

iG
iGEN Editorial
June 16, 2026
New Automated Jailbreak Attack UNIATTACK Achieves High Success Rate Against Multi-Layered LLM Defenses

Large language models (LLMs) have demonstrated remarkable capabilities across a wide range of tasks, but their safety remains a critical concern due to susceptibility to adversarial prompt-based attacks. A new paper published on arXiv presents UNIATTACK, an adversarial testing framework designed from a defense-oriented perspective to systematically construct effective black-box attack prompts. The framework offers enterprise security teams a practical tool for assessing LLM robustness against automated threats.

UNIATTACK Approach: Feature-Centric Construction

Unlike prior approaches that rely on static templates or iterative model-specific tuning, UNIATTACK extracts minimal but high-impact attack features from diverse existing attacks, according to the paper by authors Wang, Qi, Chengcheng, He, Weijia, Li, Yanqing, Sun, Hanqi, Gu, Xiaodong, and Jiangtao. These features are then optimized via a specialized attacker LLM and composed into flexible templates through an automated refinement process. This feature-centric construction enables one-shot attacks that generalize across multiple models and safety categories.

Performance Results

The evaluation results demonstrate significant improvements over baselines. UNIATTACK achieves an average attack success rate (ASR) improvement of 64.63% to 248.82% on models deployed with multi-layered defense mechanisms. Importantly, the attack cost is drastically lower: it only takes 0.03% to 4.96% of the baseline costs. A summary of the key metrics is shown in the table below.

Metric Value
Attack success rate improvement vs. baselines 64.63% – 248.82%
Cost relative to baselines 0.03% – 4.96%
Target models Models with multi-layered defense mechanisms
Attack type Black-box, one-shot, feature-centric
Artifact availability Available at the linked URL

Implications for Enterprise Security

For enterprise technology leaders evaluating LLM deployments in supply chain, customer service, or data analysis, the UNIATTACK framework highlights the ongoing arms race between model safety and adversarial attacks. The paper notes that UNIATTACK is designed from a defense-oriented perspective, providing a practical tool for assessing LLM robustness. The artifact is available at the provided URL, allowing organizations to test their own models.

While the research focuses on LLM security, the implications extend to any AI system handling sensitive business data. Multi-layered defenses are not sufficient by themselves; continuous red-teaming with automated tools like UNIATTACK can help identify vulnerabilities before they are exploited in production environments.


Sources:

Keep Reading

Recommended Stories

OpenClaw AI Agent's Phishing Vulnerability Exposed Technology

OpenClaw AI Agent's Phishing Vulnerability Exposed

Varonis researchers demonstrated that the OpenClaw AI agent, Pinchy, can be tricked into phishing attacks, compromising user data. Despite blocking malicious links, the AI failed to verify identity in urgent requests.

June 10, 2026
AI's Role in Accelerating Cyber Vulnerabilities Technology

AI's Role in Accelerating Cyber Vulnerabilities

AI is significantly reducing the time it takes for adversaries to exploit vulnerabilities, challenging traditional cybersecurity defenses. Organizations must shift focus from prevention to resilience to maintain operations.

June 10, 2026
AIChilles Automatically Unearths Hidden Weaknesses in AI-Evolved Programs Technology

AIChilles Automatically Unearths Hidden Weaknesses in AI-Evolved Programs

Researchers developed AIChilles, an automated tool that uncovers hidden weaknesses in AI-evolved programs. Testing 30 AI-generated programs across five system applications, it found 49 distinct failures in correctness, runtime, memory, and output quality. The tool combines workload extraction, constraint inference, and differential oracles to identify regressions that could undermine AI-generated code reliability.

June 16, 2026
Oracle Warns of Critical PeopleSoft Vulnerability Exploited by ShinyHunters, Affecting Hundreds of Organizations Technology

Oracle Warns of Critical PeopleSoft Vulnerability Exploited by ShinyHunters, Affecting Hundreds of Organizations

Oracle has issued a security advisory for a critical remote code execution vulnerability (CVE-2026-35273, CVSS 9.8) in PeopleSoft versions 8.61 and 8.62. The extortion group ShinyHunters is exploiting it, claiming to have breached over 100 organizations and exfiltrated data from ~300 instances. Google's Mandiant reported zero-day exploitation between May 27 and June 9, 2026, and alerted over 100 potentially vulnerable entities.

June 15, 2026