Topic
scalable
MatchLM2Lite: Scalable MLLM-Lite Framework Cuts Reproduced Video Views by 2.5%
The paper presents MatchLM2Lite, a production-grade reproduced content identification system that distills a multimodal large language model into a compact student model. Deployed at scale, it reduced reproduced video views by 2.5% without hurting engagement, with 35x lower computational cost and latency under 30 seconds.
AnonShield: Scalable On-Premise Pseudonymization Cuts Vulnerability Data Processing from 92 Hours to Under 10 Minutes
AnonShield, a new pseudonymization system for CSIRT vulnerability data, achieves up to 738x speedup using GPU-accelerated NER and streaming processing. It enables compliant data sharing without sacrificing analytical utility, reducing processing time from over 92 hours to under 10 minutes on datasets up to 550 MB.
CHILLGuard: Fine-Grained Chinese LLM Safety Guardrail with Scalable Data and Preference Alignment
Researchers introduce CHILLGuard, a dedicated Chinese LLM content safety guardrail featuring a 5-macro, 31-micro category risk taxonomy. The system uses a scalable multi-stage data construction pipeline to create the CHILLGuardTrain dataset (405,007 samples) and achieves a 15.92% F1 score improvement over Qwen3Guard-8B-Strict via Model-aware Direct Preference Optimization.