iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
GAS-Leak-LLM: Genetic Algorithm Jailbreaks Black-Box LLMs, Exposing Safety Gaps New Generative Recommendation Model HoloRec Uses Hierarchical Encoding and Interleaved Reasoning to Boost Accuracy Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning Led by US, exits from gold ETFs continue for the 5th week in a row Domain-Guided Prompting Boosts Segment Anything Model for Seismic Interpretation Spokes Optimizes Diverse Pretraining Data Selection for LLMs, Boosting Performance Medical Heuristic Learning: LLM-Driven Framework for Interpretable Clinical Decision Rules Commodore Callback 8020 Brings Digital Detox With Modern Apps and Retro Design PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks Research Shows 'Retrieve, Don't Retrain' Approach Cuts AI Model Adaptation Costs GAS-Leak-LLM: Genetic Algorithm Jailbreaks Black-Box LLMs, Exposing Safety Gaps New Generative Recommendation Model HoloRec Uses Hierarchical Encoding and Interleaved Reasoning to Boost Accuracy Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning Led by US, exits from gold ETFs continue for the 5th week in a row Domain-Guided Prompting Boosts Segment Anything Model for Seismic Interpretation Spokes Optimizes Diverse Pretraining Data Selection for LLMs, Boosting Performance Medical Heuristic Learning: LLM-Driven Framework for Interpretable Clinical Decision Rules Commodore Callback 8020 Brings Digital Detox With Modern Apps and Retro Design PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks Research Shows 'Retrieve, Don't Retrain' Approach Cuts AI Model Adaptation Costs
Home ›› Technology ›› Ai ›› Ai Ethics ›› Researchers Tackle Annotator Disagreement to Improve Hate Speech Classification Accuracy

Researchers Tackle Annotator Disagreement to Improve Hate Speech Classification Accuracy

A new research paper from Dehghan, Sen, and Yanikoglu explores the challenge of annotator disagreement in hate speech classification. The authors evaluate aggregation methods like majority voting and ordinal strategies, demonstrating that filtering non-consensus samples leads to over-optimistic results and that leveraging perceived hate speech strength enhances performance. They establish new state-of-the-art results for Turkish tweets.

iG
iGEN Editorial
June 16, 2026
Researchers Tackle Annotator Disagreement to Improve Hate Speech Classification Accuracy

Hate speech detection is a critical task, especially on social media where harmful content spreads quickly. However, the inherently subjective nature of hate speech leads to frequent disagreement among annotators, particularly for subtle or borderline content, according to a new study from researchers Dehghan, Somaiyeh; Sen, Mehmet Umut; and Yanikoglu, Berrin. Their paper, published on arXiv, examines this largely overlooked problem and evaluates a range of aggregation methods for handling annotator disagreement.

The Problem of Annotator Disagreement

The researchers note that traditional approaches often discard non-consensus samples or force a 'gold standard' through expert adjudication, ignoring valuable information about uncertainty and diverse human perspectives. This practice can bias models and produce over-optimistic results. The study analyzes methods including majority voting, ordinal strategies (minimum, maximum, and mean), and their impact across binary, 4-class, and 6-class classification tasks.

Key Findings: Modeling Disagreement Improves Robustness

The paper demonstrates that filtering non-consensus samples results in over-optimistic results. Instead, the authors show that annotator disagreement, when properly modeled, is a valuable resource for building more robust and reliable systems. They also leverage annotators' perceived hate speech strength scores to explore regression-based and hybrid modeling approaches, finding that this perceived strength provides a complementary signal that enhances classification performance.

Aggregation Method Description Impact on Performance
Majority Voting Standard label assignment based on most common annotation Baseline method
Ordinal (min, max, mean) Uses ordered labels from annotators Mixed results across tasks
Regression-based Uses continuous hate speech strength scores Enhances classification
Hybrid Combines classification with strength signals Achieves new state-of-the-art

State-of-the-Art Results for Turkish Tweets

The researchers applied their methods to Turkish tweets and established new state-of-the-art results for hate speech detection in that language. The study highlights that the perceived strength signal, when incorporated, improves model accuracy and robustness.

Implications for Enterprise AI Applications

While focused on hate speech, the findings have broader implications for any classification task where human annotation is subjective — including content moderation, customer feedback analysis, and even areas like supply chain risk assessment where expert judgments may vary. For CTOs and technology leaders building AI systems, the research underscores the importance of preserving annotator disagreement rather than discarding it, as it can lead to more reliable models.

The paper is available on arXiv and was submitted on February 12, 2025, with multiple revisions through June 2026.


Sources:

Keep Reading

Recommended Stories

Psychometric Datasheet Reveals 'Dark Current' Bias in LLM-as-a-Judge Evaluation Systems Technology

Psychometric Datasheet Reveals 'Dark Current' Bias in LLM-as-a-Judge Evaluation Systems

Researchers introduce a Judge Datasheet protocol to measure biases in LLM-as-a-judge systems, including dark current under vacuum inputs and positional false preference. A case study of three open-weight models reveals stark differences in measurement reliability, with implications for enterprise AI evaluation.

June 16, 2026
LLM-Encoded Knowledge Guides Federated Graph Recommendation to Improve Accuracy Technology

LLM-Encoded Knowledge Guides Federated Graph Recommendation to Improve Accuracy

Researchers propose a federated graph recommendation framework that leverages LLM-encoded semantic knowledge to guide cross-client structural aggregation, addressing the challenge of non-IID client data. The method consistently outperforms existing federated graph baselines on standard benchmarks.

June 16, 2026
AdaMame: New Training Recipe Solves Language Collapse in Multilingual Reasoning Models Technology

AdaMame: New Training Recipe Solves Language Collapse in Multilingual Reasoning Models

AdaMame, a two-stage training recipe for multilingual mathematical reasoning, addresses language collapse in large reasoning models. It adaptively aligns reasoning language to the query language without compromising accuracy, achieving Pareto-optimal performance across 12 languages.

June 16, 2026
Koshur Diacritizer: A Byte-Level Model Restores Diacritics for Kashmiri Language NLP Technology

Koshur Diacritizer: A Byte-Level Model Restores Diacritics for Kashmiri Language NLP

Researchers have developed Koshur Diacritizer, a byte-level sequence-to-sequence model based on ByT5-small, to restore missing diacritic marks in Kashmiri digital text. The model, trained on 23,700 sentence pairs, achieves a DERm of 0.2012 and word error rate of 0.2159, with a native expert accuracy of 77.5%. The dataset, model, and source code are publicly released to support low-resource language research.

June 16, 2026