Artificial Intelligence #rotrag#rule of thumb reasoning
RoTRAG Framework Boosts Harm Detection Accuracy by 40% Using Retrieval-Augmented Generation
Researchers propose RoTRAG, a retrieval-augmented framework that incorporates human-written moral norms (Rules of Thumb) into LLM-based conversation harm detection. The method achieves an average relative F1 gain of around 40% across benchmark datasets and an 8.4% reduction in distributional error.
Jun 16, 2026 1 source