Topic
peer review
Artificial Intelligence #large language models #llm
LLM Manuscript Scoring System Validated Against Peer-Review Outcomes at Major AI Conference
Researchers validate AIPR, an LLM-based manuscript scoring system, against 300 ICLR submissions. The system achieves an AUROC of 0.82 in separating accepted from rejected papers and shows low score variability, making it a candidate for reliable first-pass assessment.
Jun 16, 2026 1 source
Artificial Intelligence #llms #multi-agent
Multi-Agent Peer-Reviewed Reasoning Boosts LLM Accuracy in Medical Question Answering
Researchers designed a multi-agent peer-reviewed reasoning method for medical question answering, in which multiple LLMs generate and evaluate each other's chain-of-thought reasoning. In experiments with five models on three benchmarks, the approach consistently outperformed single-model reasoning and majority voting, reaching a best accuracy of 0.820. The method scales effectively and improves interpretability.
Jun 16, 2026 1 source