Artificial Intelligence #backdoor unlearning#machine learning
InstantForget: New Update-Free Backdoor Unlearning Method Uses Inference-Time Feature Reset for AI Security
A new research paper presents InstantForget, an update-free backdoor unlearning technique that operates at inference time without modifying model parameters. Using a Mahalanobis-based anomaly detector and feature reset, it reduces average attack success rate to 0.071 on CIFAR-10 with a detection AUROC of 0.981, though it fails on certain triggers and adaptive attacks.
Jun 16, 2026 1 source