Visit IGEN World Explore IGEN Expo

EXPLORE UPGRADE PLANS

BREAKING

GeM completes 10 years as India's trusted digital public procurement platform Moody's Assigns First-Time Baa2 Rating to RBL Bank, One Notch Above India's Sovereign Sebi Bars Zee's Subhash Chandra, Punit Goenka From Market for One Year Zepto Defers IPO by Two to Three Quarters After Tepid Investor Response Tim Cook: India Among Apple's Best Global Markets as June Quarter Records Revenue Domestic funds reach record 21% stake in Indian companies as FPI ownership drops to 17% Cybercriminals widen net as assessees rush to meet I-T return filing deadline Bloomberg Delays India's Sovereign Bond Index Inclusion as Market Reforms Need Further Testing Gold loans jump 93.8% y-o-y, fuel bank credit growth in Q1FY27 Snapchat joins YouTube, LinkedIn and Substack in fight against 'AI slop' GeM completes 10 years as India's trusted digital public procurement platform Moody's Assigns First-Time Baa2 Rating to RBL Bank, One Notch Above India's Sovereign Sebi Bars Zee's Subhash Chandra, Punit Goenka From Market for One Year Zepto Defers IPO by Two to Three Quarters After Tepid Investor Response Tim Cook: India Among Apple's Best Global Markets as June Quarter Records Revenue Domestic funds reach record 21% stake in Indian companies as FPI ownership drops to 17% Cybercriminals widen net as assessees rush to meet I-T return filing deadline Bloomberg Delays India's Sovereign Bond Index Inclusion as Market Reforms Need Further Testing Gold loans jump 93.8% y-o-y, fuel bank credit growth in Q1FY27 Snapchat joins YouTube, LinkedIn and Substack in fight against 'AI slop'

Home ›› Topics ›› agent-safety

Topic

agent-safety

2 stories

LedgerAgent: A New Method for Policy-Adherent Tool-Calling AI Agents in Customer Service

Artificial Intelligence #ai#agents

LedgerAgent: A New Method for Policy-Adherent Tool-Calling AI Agents in Customer Service

Researchers introduce LedgerAgent, an inference-time method that maintains observed task states in a separate ledger and checks policy constraints before tool calls, improving pass^k metrics across four customer-service domains. The approach addresses common failure modes where agents use stale or incorrect information or violate domain policies.

Jun 20, 2026 1 source

CmdNeedle Reveals Widespread Fragility in AI Agent Command Denylists

Artificial Intelligence #ai#ai agents

CmdNeedle Reveals Widespread Fragility in AI Agent Command Denylists

A research paper introduces CmdNeedle, an LLM-driven pipeline that systematically detects incompleteness in command denylists used by terminal AI agents. Evaluating 1,709 real-world denylists, the study finds that 69.0–98.6% are fragile, meaning they can be bypassed by alternative commands, undermining security.

Jun 16, 2026 1 source