Artificial Intelligence #llms#circuit discovery
New Research Demystifies Variance in Circuit Discovery of Large Language Models
A new research paper explores variance in circuit discovery of large language models, identifying resampling, rephrasing, and sample-wise variance. The authors propose CEAP, an improved method over EAP-IG with theoretical guarantees, and argue that rephrasing variance makes it hard to find comprehensive circuits, suggesting LLMs may be inherently difficult to steer.
Jun 16, 2026 1 source