iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
Trump Lets Sanctions Waiver on Russian Crude Expire as US-Iran Peace Deal Progresses Iran-US Peace Deal Reopens Hormuz: 62 Million Barrels Set to Flood Market, Asia Braces for Oil Glut Vår Energi Approves Seven-Well North Sea Development with 2027 Start-Up Atom XVII Launches ₹75 Crore Consumer Fund to Back Early-Stage Indian Brands Rupee Tumbles 21 Paise to 94.66 Against US Dollar on Fed Hawkish Stance MOL and NYK Sign Long-Term Ammonia Carrier Charters with JERA for US-Japan Low-Carbon Fuel Supply Qatar LNG Tanker Sails for Hormuz as US-Iran Deal Reopens Critical Waterway UK to Scan Asylum-Seekers’ Faces with Flawed AI Age Estimation Despite Internal Warnings US Firms Sue Container Makers Over Alleged Price-Fixing Scheme Impacting Global Dry Container Market Strait of Hormuz Reopens Under US-Iran Deal, Future Transit Fees Uncertain for Shippers Trump Lets Sanctions Waiver on Russian Crude Expire as US-Iran Peace Deal Progresses Iran-US Peace Deal Reopens Hormuz: 62 Million Barrels Set to Flood Market, Asia Braces for Oil Glut Vår Energi Approves Seven-Well North Sea Development with 2027 Start-Up Atom XVII Launches ₹75 Crore Consumer Fund to Back Early-Stage Indian Brands Rupee Tumbles 21 Paise to 94.66 Against US Dollar on Fed Hawkish Stance MOL and NYK Sign Long-Term Ammonia Carrier Charters with JERA for US-Japan Low-Carbon Fuel Supply Qatar LNG Tanker Sails for Hormuz as US-Iran Deal Reopens Critical Waterway UK to Scan Asylum-Seekers’ Faces with Flawed AI Age Estimation Despite Internal Warnings US Firms Sue Container Makers Over Alleged Price-Fixing Scheme Impacting Global Dry Container Market Strait of Hormuz Reopens Under US-Iran Deal, Future Transit Fees Uncertain for Shippers
Home ›› Technology ›› Ai ›› Llms ›› New Attack Forces Costly Model Usage in Multimodal LLM Cascades

New Attack Forces Costly Model Usage in Multimodal LLM Cascades

A research paper introduces the Forced Deferral Attack (FDA), which manipulates confidence thresholds in multimodal large language model cascades, causing queries to be routed to more expensive strong models. The attack raises security concerns for enterprises deploying cost-optimized AI systems.

iG
iGEN Editorial
June 16, 2026
New Attack Forces Costly Model Usage in Multimodal LLM Cascades

Enterprises deploying multimodal large language models (MLLMs) often use cascades to reduce computational costs: a weaker, cheaper model handles most queries, with a stronger model used only when the weak model lacks confidence. However, a new attack exposes a critical vulnerability in this cost-saving architecture.

According to a paper on arXiv titled Forced Deferral: Manipulating Routing Decisions in Multimodal LLM Cascades, researchers Liu, Zhongye, Zeng, Yaopei, Chang, Yurui, Lin, and Lu demonstrated the Forced Deferral Attack (FDA), which lowers the weak model's confidence on purpose, forcing queries to be deferred to the strong model.

How the Forced Deferral Attack Works

The paper explains that MLLM cascades rely on the weak model's confidence score to decide whether to route a query to the strong model. An adversary can introduce a universal border trigger — an adversarial image perturbation — that consistently reduces the weak model's confidence. The FDA learns this trigger by optimizing a temperature-flattened objective, which pushes the weak model's token distribution on triggered inputs toward less concentrated targets derived from its clean responses.

“FDA learns a universal border trigger by optimizing a temperature-flattened objective,” the researchers reported. The attack is designed to work across datasets, model families, and deferral metrics.

Attack Performance Compared to Baselines

The researchers evaluated FDA against image-perturbation and prompt-injection baselines. According to the paper, FDA consistently increases strong-model routing and outperforms the baselines. This shows that MLLM cascades are vulnerable to attacks that manipulate compute allocation, forcing unintended strong-model usage without directly targeting answer correctness.

Attack Method Effectiveness (Strong-Model Routing Increase)
Forced Deferral Attack (FDA) Higher (outperforms baselines)
Image-Perturbation Baseline Lower
Prompt-Injection Baseline Lower

Implications for Enterprise AI Deployments

For technology leaders, this attack represents a new security consideration when deploying cost-optimized AI pipelines. Cascades are used not only in LLM inference but also in multimodal systems where vision and language combine. If left unaddressed, such attacks could lead to unanticipated cost increases as compute is siphoned to more expensive models. The paper notes that the attack does not target answer correctness, making it potentially stealthy.

The findings highlight the need for robust deferral mechanisms that are resistant to adversarial manipulation of confidence scores. Enterprises should evaluate the security posture of their AI routing decisions, particularly when cascades are integrated into customer-facing or revenue-critical applications.


Sources:

Keep Reading

Recommended Stories

SAMark Watermarking Breaks Paraphrase Robustness Barrier for AI-Generated Text Technology

SAMark Watermarking Breaks Paraphrase Robustness Barrier for AI-Generated Text

Researchers propose SAMark, a self-anchored text watermarking framework that achieves up to 90.2% true positive rate under paragraph-level paraphrasing attacks, outperforming the strongest prior baseline by more than 30% on average. The method breaks the robustness-quality trade-off by using multi-channel hyperbolic scoring and diversity-aware filtering.

June 17, 2026
UniT Framework Enables Multimodal Chain-of-Thought Test-Time Scaling for AI Reasoning Technology

UniT Framework Enables Multimodal Chain-of-Thought Test-Time Scaling for AI Reasoning

UniT introduces a framework for unified multimodal models to perform chain-of-thought reasoning at test time, enabling iterative verification and refinement. Key findings show that sequential reasoning is more compute-efficient than parallel sampling and that training on generation/editing trajectories improves out-of-distribution visual reasoning.

June 16, 2026
AutoDojo: Adaptive Attacks Expose Superficial Defenses and Structural Limits in LLM Agents Technology

AutoDojo: Adaptive Attacks Expose Superficial Defenses and Structural Limits in LLM Agents

The AutoDojo framework adaptively optimizes indirect prompt injections against LLM agent defenses, revealing that many current defenses are superficial. Against a filter that reduces static attack success rate to 0%, AutoDojo recovers 28% overall and 64% on action-open tasks due to a structural limitation where injections can pose as ordinary data.

June 16, 2026
SkillVetBench Uses LLM-as-Judge to Evaluate Security Risks in Open-Source Agent Skills Technology

SkillVetBench Uses LLM-as-Judge to Evaluate Security Risks in Open-Source Agent Skills

SkillVetBench, a live Hugging Face leaderboard, uses an LLM-as-Judge approach to vet open-source LLM agent skills for security risks. It introduces the Skill Agentic Risk Score (SARS) and integrates CVSS v4.0, achieving zero false negatives across 78 malicious skills and zero false positives on 22 benign controls, outperforming static baselines like SKILLSIEVE.

June 16, 2026