Paper
10 October 2024 Detecting adversarial examples via an orthogonal knowledge-distillation-based approach
Hua Mu, Xing Yang, Anjie Peng, Kang Deng
Author Affiliations +
Proceedings Volume 13278, Seventh Global Intelligent Industry Conference (GIIC 2024); 132780Z (2024) https://doi.org/10.1117/12.3032925
Event: Seventh Global Intelligent Industry Conference (GIIC 2024), 2024, Shenzhen, China
Abstract
Detecting adversarial examples is an important defense against adversarial attacks. Existing supervised learning detectors perform well for known attacks but deteriorate when detecting unseen instances. To mitigate the sensitivity with training instances, we propose a detector based on the output inconsistency between the protected model and a designed dual model to detect unseen attacks. A test image with different predicted labels on the protected model and the dual model is taken as adversarial. To detect highly transferable adversarial examples and defense adaptive ensemble attacks against the proposed detector, an orthogonal knowledge distillation is employed to train the dual model. The distillation suppresses the transferability across the protected and dual model, therefore forcing them to output different labels for strong adversarial examples. Experimental results on CIFAR-10 and ImageNet show that our method detects various adversarial examples effectively. Compared with state-of-the-art methods, our method achieves at least 6.2% higher average detection accuracy in the cross-attack test. Our method is robust to the popular transferability-enhanced methods, with a minor accuracy decrease by up to 4% in the robustness test.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Hua Mu, Xing Yang, Anjie Peng, and Kang Deng "Detecting adversarial examples via an orthogonal knowledge-distillation-based approach", Proc. SPIE 13278, Seventh Global Intelligent Industry Conference (GIIC 2024), 132780Z (10 October 2024); https://doi.org/10.1117/12.3032925
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image classification

Machine learning

Defense and security

Convolutional neural networks

Back to Top