MINT Demo 2 Framework Detects Training Data in Vision-Language Models With 90% Accuracy

Researchers introduced MINT Demo 2, a framework to determine if specific data was used to train vision-language models. The system achieves up to 90% accuracy and includes a web platform for auditing multiple model types, aiming to improve AI transparency and regulatory compliance.

iGEN Editorial

June 16, 2026

MINT Demo 2 Framework Detects Training Data in Vision-Language Models With 90% Accuracy

Organizations deploying or purchasing AI models increasingly face a critical question: Was my proprietary or sensitive data used to train this model? A new framework called the Membership Inference Test (MINT) Demo 2, detailed in an arXiv paper by DeAlcala, Mancera, Fierrez, Morales, Tolosana, and Vera-Rodriguez, offers a practical answer. The researchers report that MINT can determine whether specific data was part of a model's training set with up to 90% accuracy, addressing a growing demand for transparency in machine learning.

The Privacy Problem in Vision-Language Models

Training large AI models—especially those combining vision and language—often relies on vast, publicly scraped datasets. Enterprises worry that their confidential images or documents may have been ingested without consent. According to the arXiv paper, MINT is designed to experimentally detect training data membership in models ranging from facial recognition systems to large language models (LLMs). The framework establishes a theoretical foundation and proposes multiple architectures depending on how much information is known about the audited model.

How MINT Works

The researchers tested MINT against a popular face recognition model and four state-of-the-art LLMs, using multiple diverse, large-scale public image and text databases. The results show "promising accuracy levels in the detection of training data of up to 90%," the paper states. The framework includes three variants:

Variant	Description	Use Case
MINT	Standard membership inference	Auditing models with known architecture
aMINT	Adaptive version	Models with limited access or black-box
gMINT	Generalized variant	Broad model families with heterogeneous data

These variants allow auditors to choose the appropriate approach based on their level of access and the model's opacity.

The MINT Demo 2 Platform

Building on earlier research, the team launched a comprehensive web platform that extends MINT's capabilities to both image and text modalities. The platform integrates MINT, aMINT, and gMINT, enabling users to audit a wide range of models through a single interface. This demonstrator aims to "promote AI transparency and provides a practical tool to foster compliance with emerging AI regulations," the paper explains. The platform's technological stack is designed for ease of use, making it accessible to non-specialists in privacy auditing.

Implications for Enterprise AI Governance

For CTOs and technology procurement leaders, MINT offers a way to verify vendor claims about training data. Enterprises can now test whether their proprietary images, customer data, or confidential documents were used without permission. This capability is particularly relevant as regulations like the EU AI Act and other privacy laws demand greater accountability. While the paper does not specify supply chain applications, the same technique could be applied to logistics datasets (e.g., surveillance images, shipment records) if integrated into vision-language models.

The researchers emphasize that the tool is a demonstrator aimed at encouraging responsible AI practices. As companies increasingly rely on third-party models, a standardized membership inference test could become a due diligence requirement. The paper notes that the platform is available for public use, though the exact URL is omitted from the abstract.

Competitive Context and Limitations

Existing membership inference tools exist, but MINT Demo 2 claims to unify multiple inference architectures into one platform. The 90% accuracy is benchmarked on specific datasets and models; real-world performance may vary. The paper does not compare directly with other tools, but it sets a baseline for future transparency benchmarks.

In summary, MINT Demo 2 provides a concrete, tested method for auditing training data in AI models. For enterprise decision-makers, it represents a step toward verifiable AI governance—a necessary tool in an era of increasing scrutiny over data provenance.

Sources:

MINT Demo 2 Framework Detects Training Data in Vision-Language Models With 90% Accuracy

The Privacy Problem in Vision-Language Models

How MINT Works

The MINT Demo 2 Platform

Implications for Enterprise AI Governance

Competitive Context and Limitations

Recommended Stories

From Privacy to Workflow Integrity: Communication-Graph Metadata Threat in Autonomous Agent Interoperability

Trump Tech Adviser Accuses China's Moonshot AI of Stealing from Anthropic via Distillation

OpenAI AI System Goes Rogue, Hacks Startup in 'Unprecedented' Cyber-Attack

AURA: Adaptive Uncertainty-Aware Refinement Framework for Auditing LLM-as-a-Judge Decisions