Backdoor Defense with Machine Unlearning - Details

author：

Indexed by：

CPCI-S EI

Abstract：

Backdoor　injection　attack　is　an　emerging　threat　to　the　security　of　neural　networks,　however,　there　still　exist　limited　effective　defense　methods　against　the　attack.　In　this　paper,　we　propose　BAERASER,　a　novel　method　that　can　erase　the　backdoor　injected　into　the　victim　model　through　machine　unlearning.　Specifically,　BAERASER　mainly　implements　backdoor　defense　in　two　key　steps.　First,　trigger　pattern　recovery　is　conducted　to　extract　the　trigger　patterns　infected　by　the　victim　model.　Here,　the　trigger　pattern　recovery　problem　is　equivalent　to　the　one　of　extracting　an　unknown　noise　distribution　from　the　victim　model,　which　can　be　easily　resolved　by　the　entropy　maximization　based　generative　model.　Subsequently,　BAERASER　leverages　these　recovered　trigger　patterns　to　reverse　the　backdoor　injection　procedure　and　induce　the　victim　model　to　erase　the　polluted　memories　through　a　newly　designed　gradient　ascent　based　machine　unlearning　method.　Compared　with　the　previous　machine　unlearning　solutions,　the　proposed　approach　gets　rid　of　the　reliance　on　the　full　access　to　training　data　for　retraining　and　shows　higher　effectiveness　on　backdoor　erasing　than　existing　fine-tuning　or　pruning　methods.　Moreover,　experiments　show　that　BAERASER　can　averagely　lower　the　attack　success　rates　of　three　kinds　of　state-of-the-art　backdoor　attacks　by　99%　on　four　benchmark　datasets.

Keyword：

Backdoor Defense Machine Unlearning Trigger Pattern Recovery

Community：

[ 1 ] [Liu, Yang]Xidian Univ, State Key Lab Integrated Serv Networks ISN, Xian, Peoples R China
[ 2 ] [Ma, Zhuo]Xidian Univ, State Key Lab Integrated Serv Networks ISN, Xian, Peoples R China
[ 3 ] [Ma, Jianfeng]Xidian Univ, State Key Lab Integrated Serv Networks ISN, Xian, Peoples R China
[ 4 ] [Liu, Yang]Xidian Univ, Shaanxi Key Lab Network & Syst Secur, Xian, Peoples R China
[ 5 ] [Ma, Jianfeng]Xidian Univ, Shaanxi Key Lab Network & Syst Secur, Xian, Peoples R China
[ 6 ] [Fan, Mingyuan]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China
[ 7 ] [Liu, Ximeng]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China
[ 8 ] [Chen, Cen]East China Normal Univ, Sch Data Sci & Engn, Shanghai, Peoples R China
[ 9 ] [Wang, Li]Ant Grp, Hangzhou, Peoples R China

Reprint 's Address：

Email：

Show more details

Version：

Backdoor Defense with Machine Unlearning
2022，

Related Keywords：

Learn to Forget: Machine Unlearning via Neuron Masking
2023，IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING
Graph Unlearning Using Knowledge Distillation
2023，25th International Conference on Information and Communications Security, ICICS 2023
Active forgetting via influence estimation for neural networks
2022，INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS

Source ：

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022)

ISSN： 0743-166X

Year： 2022

Page： 280-289

Cited Count：

WoS CC Cited Count： 29

SCOPUS Cited Count： 47

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

计算机与大数据学院、软件学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to