Topic

bias

12 stories

Artificial Intelligence #crime prediction #police

British Police Predictive AI Models Quietly Abandoned After Staff Lost Trust in Results

An investigation by WIRED and partner outlets reveals that Avon and Somerset Police built at least 23 predictive analytics models, including risk scores for burglary, court non-appearance, and domestic abuse. At least two models were quietly abandoned after staff decided they could no longer trust them, and more than 36,000 performance scores showed poor predictive performance. The program, centered on the Think Family Database holding records on half a million people, operated with limited transparency, raising concerns about public trust and algorithmic accountability.

Jun 25, 2026 1 source

TreeTracer Visualizes Hidden LLM Bias Through Stochastic Path Aggregation for Enterprise AI Auditing

Technology

Artificial Intelligence #llm #bias

TreeTracer is a visual analytics tool that exposes hidden biases in large language models by aggregating stochastic generations into syntax-aligned trees. It uses perturbation analysis, ontology-based term replacement, and Sankey diagrams to compare model outputs, and it detected representational harms such as pronoun suppression. Validated against GPT-2 XL and Apertus models, it reduces cognitive load for analysts.

Jun 20, 2026 1 source

P3B3 Benchmark Reveals Strong Brazilian Portuguese Bias in Large Language Models

Technology

Artificial Intelligence #llm #benchmark

A new research paper introduces P3B3, an expert-curated benchmark that measures bias between European and Brazilian Portuguese in large language models. Experiments show that most LLMs strongly prefer Brazilian Portuguese, underscoring the need for more balanced representation of language varieties in conversational AI.

Jun 16, 2026 1 source

Primacy Bias in Multimodal RAG: First Retrieved Items Dominate, Study Finds

Technology

Artificial Intelligence #artificial intelligence #multimodal

A research paper titled 'Lost at the End: Primacy Bias in Multimodal Retrieval-Augmented Question Answering' introduces a controlled probe to measure position bias in multimodal KB-VQA. The study finds a strong primacy effect, where the first retrieved passage significantly outperforms later ones, contrasting with the U-shaped 'lost-in-the-middle' pattern in text-only models. The findings call for reader-side interventions and question the adequacy of recall@k as a metric for deployed systems.

Jun 16, 2026 1 source

AI Pluralism and the Worlds It Misses: New Research Exposes Ontological Flattening

Technology

Artificial Intelligence #ai #pluralism

According to new research by Mushkani and Rashid, AI pluralism efforts often miss the deeper problem of ontological flattening—where AI systems impose restrictive categories that suppress contested meanings. The paper introduces Pluralistic Lifecycle Governance (PLG), a qualitative audit framework to document ontological openness and accountability throughout an AI system's lifecycle.

Jun 16, 2026 1 source

Psychometric Datasheet Reveals 'Dark Current' Bias in LLM-as-a-Judge Evaluation Systems

Technology

Artificial Intelligence #llm #artificial intelligence

Researchers introduce a Judge Datasheet protocol to measure biases in LLM-as-a-judge systems, including dark current under vacuum inputs and positional false preference. A case study of three open-weight models reveals stark differences in measurement reliability, with implications for enterprise AI evaluation.

Jun 16, 2026 1 source

Study Finds Gender Differences in AI Literacy and Deepfake Engagement Among Australian Students

Technology

Artificial Intelligence #ai literacy #gender differences

A study of 199 Australian secondary students found significant gender differences in baseline AI literacy, deepfake engagement, and STEM career aspirations. Male students reported higher STEM career interest, while female students were more likely to use AI for schoolwork and seek advice from AI tools. A one-day AI literacy workshop improved knowledge for both genders, with females showing broader gains including increased confidence and career interest in AI and computer science.

Jun 16, 2026 1 source

Algorithm Audit Reveals LLM Hotel Recommendations Biased by Eco-Labels, Ignore Management Responses

Technology

Artificial Intelligence #algorithm audit #reputation signals

A pre-specified algorithm audit of 12 large language models (LLMs) found that guest rating and price dominate hotel recommendations, while eco-certification is overweighted and management responses are ignored. List position, a content-free artifact, also causally shifts recommendations, with an effect worth about $12 per night. The findings offer empirical grounding for work on generative engine optimization and the accountability of AI infomediaries.

Jun 16, 2026 1 source

New Benchmark 'AgentFairBench' Tests Whether LLM Agents Discriminate in Real Actions

Technology

Artificial Intelligence #llm #ai agents

Researchers introduce AgentFairBench, a reproducible benchmark for demographic disparity in LLM agent actions. Unlike traditional fairness tests that grade answers, it evaluates actions across hiring, lending, and medical triage using counterfactual matched sets. A pilot study with 864 decisions shows that naively comparing score spreads can overstate disparity by a factor of about 2.4; under a proper null methodology, Claude Haiku 4.5 showed no significant demographic effect.

Jun 16, 2026 1 source

Researchers Tackle Annotator Disagreement to Improve Hate Speech Classification Accuracy

Technology

Artificial Intelligence #hate speech #annotator disagreement

A new research paper from Dehghan, Sen, and Yanikoglu explores the challenge of annotator disagreement in hate speech classification. The authors evaluate aggregation methods like majority voting and ordinal strategies, demonstrating that filtering non-consensus samples leads to over-optimistic results and that leveraging perceived hate speech strength enhances performance. They establish new state-of-the-art results for Turkish tweets.

Jun 16, 2026 1 source

Bridging the gender data gap: Why representation in AI is a business imperative

Technology

Artificial Intelligence #gender #data gap

According to the UK government, 1 in 6 UK organizations has already implemented AI tools, but bias from unrepresentative data risks perpetuating discrimination and incurring regulatory penalties. The London School of Economics found that large language models like Google's Gemma may introduce gender bias into care decisions. Experts stress that data integrity, achieved through integration, governance, enrichment, and observability, is critical to mitigating bias and ensuring AI outputs are fair and accurate.

Jun 12, 2026 1 source

Algorithmic Monocultures: Impact on Hiring Diversity

Technology

Artificial Intelligence #algorithmic #hiring

Algorithmic monocultures in hiring are creating homogeneous outcomes, impacting diversity. Over 90% of U.S. employers use similar algorithms, leading to systemic rejections and racial disparities.

Jun 8, 2026 1 source