Publication Type

Conference Proceeding Article

Publication Date

2-2021

Abstract

A common problem in machine learning-based malware detection is that training data may contain noisy labels and it is challenging to make the training data noise-free at a large scale. To address this problem, we propose a generic framework to reduce the noise level of training data for the training of any machine learning-based Android malware detection. Our framework makes use of all intermediate states of two identical deep learning classification models during their training with a given noisy training dataset and generate a noise-detection feature vector for each input sample. Our framework then applies a set of outlier detection algorithms on all noise-detection feature vectors to reduce the noise level of the given training data before feeding it to any machine learning based Android malware detection approach. In our experiments with threedifferent Android malware detection approaches, our framework can detect significant portions of wrong labels in different training datasets at different noise ratios, and improve the performance of Android malware detection approaches.

Discipline

Databases and Information Systems | Information Security

Research Areas

Cybersecurity

Publication

Proceedings of the 2021 Network and Distributed System Security Symposium (NDSS 2021)

ISBN

1-891562-66-5

Identifier

10.14722/ndss.2021.24126

City or Country

Online

Citation

XU, Jiayun; LI, Yingjiu; and DENG, Robert H.. Differential training: A generic framework to reduce label noises for Android malware detection. (2021). Proceedings of the 2021 Network and Distributed System Security Symposium (NDSS 2021).
Available at: https://ink.library.smu.edu.sg/sis_research/6551

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Download

Included in

Databases and Information Systems Commons, Information Security Commons

COinS

Research Collection School Of Computing and Information Systems

Differential training: A generic framework to reduce label noises for Android malware detection

Publication Type

Publication Date

Abstract

Discipline

Research Areas

Publication

ISBN

Identifier

City or Country

Citation

Creative Commons License

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Differential training: A generic framework to reduce label noises for Android malware detection

Author

Publication Type

Publication Date

Abstract

Discipline

Research Areas

Publication

ISBN

Identifier

City or Country

Citation

Creative Commons License

Included in

Share

Search

Links

Browse

Links