Visit IGEN World Explore IGEN Expo

EXPLORE UPGRADE PLANS

BREAKING

Indian equities ‘probably oversold’: Foreign investors warm up to India after months of outflows Adani Total Gas hikes CNG prices by ₹4 per kg as LNG costs surge $20M in cocaine found beneath floorboards of commercial truck trailer at California border Indian Oil ramps up spot crude purchases as Middle East disruptions hit supplies WhatsApp tests 'Offers & Updates' folder to declutter business chats Aurora Reports Q2 Loss, Details Per-Mile Pricing for Driverless Truck Services Apple iPad Air OLED display, M5 chip and biggest redesign expected in 2027 India's soyabean acreage recovers as July rains boost Kharif sowing China’s EV Market Surges Past 16 Million as Battery Waste Wave Arrives WIRED Tests Plastic-Free Stainless Steel Water Filters From $199 to $549 Indian equities ‘probably oversold’: Foreign investors warm up to India after months of outflows Adani Total Gas hikes CNG prices by ₹4 per kg as LNG costs surge $20M in cocaine found beneath floorboards of commercial truck trailer at California border Indian Oil ramps up spot crude purchases as Middle East disruptions hit supplies WhatsApp tests 'Offers & Updates' folder to declutter business chats Aurora Reports Q2 Loss, Details Per-Mile Pricing for Driverless Truck Services Apple iPad Air OLED display, M5 chip and biggest redesign expected in 2027 India's soyabean acreage recovers as July rains boost Kharif sowing China’s EV Market Surges Past 16 Million as Battery Waste Wave Arrives WIRED Tests Plastic-Free Stainless Steel Water Filters From $199 to $549

Home ›› Topics ›› policy regret

Topic

policy regret

1 story

Low-Policy-Regret Algorithm for Embedding Model Routing in Contextual Bandits

Artificial Intelligence #policy regret#embedding model

Low-Policy-Regret Algorithm for Embedding Model Routing in Contextual Bandits

A new paper on arXiv formalizes embedding model routing as an adversarial contextual linear bandit problem. The authors propose Hypentropy Policy Gradient (HPG), which provably adapts to unknown low-rank structure and attains low linearized policy regret.

Jun 16, 2026 1 source