iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
Modality-Aware Novelty Detection Framework MAND Improves Open-World Egocentric Activity Recognition Boosting Knowledge Graph Foundation Models via Enhanced Negative Sampling Gated QKAN-FWP: Quantum-Inspired Sequence Learning Achieves Parameter Efficiency on NISQ Devices The Robot Vacuums Cleaning My Three-Story Home for Me New Framework TRACED Evaluates LLM Reasoning Using Geometric Stability and Progress Everllence Lands First Order for Next-Gen Methane Dual-Fuel Engine on Car Carriers How Scale Design Impacts LLM Metacognition and Enterprise AI Reliability GMN4AD: New Graph Matching Network Boosts Alzheimer's Diagnosis Accuracy Using Multi-Center MRI Data Adaptive Memory Crystallization: New AI Architecture Slashes Forgetting by 80% While Boosting Knowledge Transfer by 43% RaBiT: Residual-Aware Binarization Training for Accurate and Efficient Large Language Models Modality-Aware Novelty Detection Framework MAND Improves Open-World Egocentric Activity Recognition Boosting Knowledge Graph Foundation Models via Enhanced Negative Sampling Gated QKAN-FWP: Quantum-Inspired Sequence Learning Achieves Parameter Efficiency on NISQ Devices The Robot Vacuums Cleaning My Three-Story Home for Me New Framework TRACED Evaluates LLM Reasoning Using Geometric Stability and Progress Everllence Lands First Order for Next-Gen Methane Dual-Fuel Engine on Car Carriers How Scale Design Impacts LLM Metacognition and Enterprise AI Reliability GMN4AD: New Graph Matching Network Boosts Alzheimer's Diagnosis Accuracy Using Multi-Center MRI Data Adaptive Memory Crystallization: New AI Architecture Slashes Forgetting by 80% While Boosting Knowledge Transfer by 43% RaBiT: Residual-Aware Binarization Training for Accurate and Efficient Large Language Models
Home ›› Technology ›› Ai ›› Open Science Gains Ground: 10-Year AI Study Shows Sharp Rise in Code and Data Sharing

Open Science Gains Ground: 10-Year AI Study Shows Sharp Rise in Code and Data Sharing

A decade-long analysis of 56,800 AI conference papers shows documentation practices improving dramatically, with code and data sharing nearly sixfold from 11% to 64%. Estimated reproducibility also rose from 28% to 64%, improvements that predated formal reproducibility checklists.

iG
iGEN Editorial
June 16, 2026
Open Science Gains Ground: 10-Year AI Study Shows Sharp Rise in Code and Data Sharing

The reproducibility crisis in artificial intelligence research has prompted major conferences to adopt documentation standards, but a new analysis of 56,800 papers from 2014 to 2024 suggests that the field's improvement in sharing code and data predates and far exceeds the impact of these formal requirements. According to a study by Coakley, Snelleman, Hoos, and Gundersen, published on arXiv, the proportion of papers that share both code and data increased nearly sixfold over the decade, from 11% to 64%.

Methodology and Scope

The researchers assessed all published papers from five leading AI conferences over the past decade. They identified seven reproducibility variables, which were quality-assured, and used them to analyze the 56,800 publications. The study focused on documentation practices rather than directly testing reproducibility—the reproducibility estimates were inferred from documentation practices based on empirical reproducibility rates from a prior study.

Key Findings

Metric 2014 2024
Papers sharing both code and data 11% 64%
Estimated reproducibility 28% 64%

According to the study, improvements in documentation practices predate the introduction of reproducibility checklists, suggesting these changes reflect a broader movement toward open science rather than a direct response to formal requirements. The authors noted that in the period 2014 to 2024, documentation practices have improved substantially.

Implications for AI Adoption

For enterprise technology leaders evaluating AI systems, the trend toward increased code and data sharing enhances the ability to verify and reproduce research findings. While the study does not directly assess commercial AI products, the same open-science principles that drive increased reproducibility in academic research can reduce the risk of adopting opaque or non-reproducible models. The shift from 11% to 64% code and data sharing indicates that a majority of AI research now provides the building blocks needed for independent validation.

The broader open science movement, as evidenced by this analysis, is reshaping how AI research is conducted and disseminated. Enterprise buyers of AI solutions should consider whether vendors' claims are grounded in reproducible, openly documented work—a practice that this study shows is becoming the norm rather than the exception.


Sources:

Keep Reading

Recommended Stories

New Framework TRACED Evaluates LLM Reasoning Using Geometric Stability and Progress Technology

New Framework TRACED Evaluates LLM Reasoning Using Geometric Stability and Progress

A new research framework called TRACED evaluates LLM reasoning quality by analyzing geometric progress and stability of reasoning traces. It distinguishes correct reasoning from hallucinations based on trajectory patterns, offering a more robust evaluation method than scalar probabilities.

June 16, 2026
New Unified Definition of AI Hallucination Pins It on Inaccurate World Modeling Technology

New Unified Definition of AI Hallucination Pins It on Inaccurate World Modeling

A new arXiv paper by Liu et al. proposes a unified definition of hallucination in large language models, defining it as inaccurate internal world modeling observable to the user. The framework subsumes prior definitions and distinguishes true hallucinations from planning or reward errors, and introduces the HalluWorld benchmark for stress-testing models.

June 16, 2026
Attention, Not Model Scale, Drives Human-AI Alignment in Multimodal Language Prediction, Research Finds Technology

Attention, Not Model Scale, Drives Human-AI Alignment in Multimodal Language Prediction, Research Finds

A study comparing five vision-language models with 600 human participants found that adding visual context significantly improved human-AI alignment in language prediction, with attention maps explaining up to 70% of inter-participant variance. The research indicates that attention to informative cues, not model scale, is the primary driver of alignment.

June 16, 2026
LLM Manuscript Scoring System Validated Against Peer-Review Outcomes at Major AI Conference Technology

LLM Manuscript Scoring System Validated Against Peer-Review Outcomes at Major AI Conference

Researchers validate AIPR, an LLM-based manuscript scoring system, against 300 ICLR submissions. The system achieves an AUROC of 0.82 in separating accepted from rejected papers and shows low score variability, offering a reliable first-pass assessment tool.

June 16, 2026