Artificial Intelligence #ai #quantization
New Automated Quantization Framework AQ4SViT Compresses Spiking Vision Transformers for Embedded AI
Researchers propose AQ4SViT, an automated quantization framework for Spiking Vision Transformers that uses a search gating policy to find compression settings. It comes in two variants: Greedy search, which favors speed, and Beam search, which favors deeper compression. Experiments on ImageNet report search times up to 6.6x faster and memory savings of up to 90%, with accuracy staying within 1.5% of the original model.
Jun 16, 2026 1 source
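
The trade-off between the Greedy and Beam variants can be illustrated with a toy sketch. This is not the AQ4SViT algorithm, whose gating policy is not described here. It is a generic per-layer bit-width search under an accuracy-drop budget. The candidate bit-widths, the integer penalties in `PENALTY`, the layer sizes, and the sensitivity values are all hypothetical, and greedy search is modeled as beam search with a width of 1.

```python
from typing import Dict, List, Sequence, Tuple

Config = Tuple[int, ...]  # one bit-width per layer

# Hypothetical candidate bit-widths mapped to an integer accuracy penalty.
PENALTY: Dict[int, int] = {32: 0, 8: 1, 4: 3}


def memory_bits(config: Config, layer_sizes: Sequence[int]) -> int:
    """Total weight memory in bits for the layers assigned so far."""
    return sum(bits * size for bits, size in zip(config, layer_sizes))


def accuracy_drop(config: Config, sensitivity: Sequence[int]) -> int:
    """Toy proxy for accuracy loss: penalty scaled by per-layer sensitivity."""
    return sum(PENALTY[bits] * s for bits, s in zip(config, sensitivity))


def beam_search(
    layer_sizes: Sequence[int],
    sensitivity: Sequence[int],
    budget: int,
    width: int,
) -> Config:
    """Assign bit-widths layer by layer, keeping the `width` smallest
    partial configurations whose accuracy drop stays within `budget`."""
    if budget < 0 or width < 1:
        raise ValueError("budget must be >= 0 and width must be >= 1")
    beam: List[Config] = [()]
    for _ in layer_sizes:
        candidates = [cfg + (bits,) for cfg in beam for bits in PENALTY]
        feasible = [
            c for c in candidates if accuracy_drop(c, sensitivity) <= budget
        ]
        feasible.sort(key=lambda c: memory_bits(c, layer_sizes))
        beam = feasible[:width]
    return beam[0]


def greedy_search(
    layer_sizes: Sequence[int], sensitivity: Sequence[int], budget: int
) -> Config:
    """Greedy search: commit to the single best choice at every layer."""
    return beam_search(layer_sizes, sensitivity, budget, width=1)


if __name__ == "__main__":
    sizes, sens, budget = (100, 1000), (1, 1), 3
    greedy = greedy_search(sizes, sens, budget)
    beam = beam_search(sizes, sens, budget, width=2)
    print(greedy, memory_bits(greedy, sizes))  # (4, 32) 32400
    print(beam, memory_bits(beam, sizes))      # (8, 8) 8800
```

In this toy setting, greedy spends the whole budget on the small first layer and must leave the large second layer at 32 bits, while a beam of width 2 keeps a cheaper alternative alive and ends with a much smaller model. The wider beam evaluates more candidates per layer, which is the speed-versus-compression trade-off the two variants are said to offer.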