Visit IGEN World Explore IGEN Expo

EXPLORE UPGRADE PLANS

BREAKING

Indian equities ‘probably oversold’: Foreign investors warm up to India after months of outflows Adani Total Gas hikes CNG prices by ₹4 per kg as LNG costs surge $20M in cocaine found beneath floorboards of commercial truck trailer at California border Indian Oil ramps up spot crude purchases as Middle East disruptions hit supplies WhatsApp tests 'Offers & Updates' folder to declutter business chats Aurora Reports Q2 Loss, Details Per-Mile Pricing for Driverless Truck Services Apple iPad Air OLED display, M5 chip and biggest redesign expected in 2027 India's soyabean acreage recovers as July rains boost Kharif sowing China’s EV Market Surges Past 16 Million as Battery Waste Wave Arrives WIRED Tests Plastic-Free Stainless Steel Water Filters From $199 to $549 Indian equities ‘probably oversold’: Foreign investors warm up to India after months of outflows Adani Total Gas hikes CNG prices by ₹4 per kg as LNG costs surge $20M in cocaine found beneath floorboards of commercial truck trailer at California border Indian Oil ramps up spot crude purchases as Middle East disruptions hit supplies WhatsApp tests 'Offers & Updates' folder to declutter business chats Aurora Reports Q2 Loss, Details Per-Mile Pricing for Driverless Truck Services Apple iPad Air OLED display, M5 chip and biggest redesign expected in 2027 India's soyabean acreage recovers as July rains boost Kharif sowing China’s EV Market Surges Past 16 Million as Battery Waste Wave Arrives WIRED Tests Plastic-Free Stainless Steel Water Filters From $199 to $549

Home ›› Topics ›› ethical reasoning

Topic

ethical reasoning

1 story

LLaMA 3.1's Ethical Reasoning Reveals Frame-Conditioned Moral Computation, Researchers Find

Artificial Intelligence #mechanistic interpretability#llama

LLaMA 3.1's Ethical Reasoning Reveals Frame-Conditioned Moral Computation, Researchers Find

A mechanistic interpretability audit of Meta's LLaMA 3.1-8B-Instruct on 54 moral prompts reveals that the model's ethical reasoning is highly sensitive to surface features of the prompt, a phenomenon called Frame-Conditioned Moral Computation. The study, using the Transluce platform, found domain-specific representations dominate activation lists and that RLHF may re-order surface text without removing underlying biases. The authors call for a new research program, Mechanistic Alignment, to supplement behavioral alignment.

Jun 16, 2026 1 source