iGEN

Topic

fairness

3 stories
Technology / Artificial Intelligence #synthetic data #auditing

New Auditing Framework Detects Synthetic Data Privacy Leaks Without Model Access

A new causal framework for auditing synthetic data detects privacy leaks by distinguishing true disclosures from phantom ones. It uses statistical hypothesis testing with holdout sets, requires no model access or canary insertion, and is orders of magnitude more efficient than shadow-model approaches.

Jun 16, 2026 1 source
Technology / Artificial Intelligence #llm #ai agents

New Benchmark 'AgentFairBench' Tests Whether LLM Agents Discriminate in Real Actions

Researchers introduce AgentFairBench, a reproducible benchmark for demographic disparity in LLM agent actions. Unlike traditional fairness tests, which grade answers, it evaluates actions across hiring, lending, and medical triage using counterfactual matched sets. A pilot study of 864 decisions shows that naively comparing score spreads can overstate disparity by roughly 2.4x; under a proper null methodology, Claude Haiku 4.5 showed no significant demographic effect.

Jun 16, 2026 1 source
Technology / Artificial Intelligence #hate speech #annotator disagreement

Researchers Tackle Annotator Disagreement to Improve Hate Speech Classification Accuracy

A new research paper from Dehghan, Sen, and Yanikoglu explores the challenge of annotator disagreement in hate speech classification. The authors evaluate aggregation methods such as majority voting and ordinal strategies, showing that filtering out non-consensus samples leads to overly optimistic results and that using the perceived strength of hate speech improves performance. They establish new state-of-the-art results for Turkish tweets.

Jun 16, 2026 1 source