Artificial Intelligence #llm#stance detection
LLM-Assisted Stance Detection in Scientific Discourse Reaches 0.76 Combined Reliability Score
Researchers used GPT-5.1, Claude Sonnet 4.6, and Gemini 3 Pro to detect whether scientific authors treat Bayesian models as realistic or instrumental. The LLMs achieved a held-out combined reliability of 0.76 and near-perfect article-level rank stability (r=0.96-0.97). The study demonstrates a scalable method for theoretically demanding qualitative coding.
Jun 16, 2026 1 source