Artificial Intelligence #llm #explanation
New Definition of Good Explanations Highlights Challenges in Explaining LLM Outputs
A recent arXiv paper by Louis Mahon, Elliot Ford, and Callum Hackett proposes a definition of a good explanation that draws on counterfactual explanations but also takes the interlocutor's prior beliefs into account. The authors explore the implications for AI explainability, in particular why LLM outputs are difficult to explain well.
Jun 16, 2026 1 source