UNICA IRIS Institutional Research Information System

Machine-learning models have been recently used for detecting malicious Android applications, reporting impressive performances on benchmark datasets, even when trained only on features statically extracted from the application, such as system calls and permissions. However, recent findings have highlighted the fragility of such in-vitro evaluations with benchmark datasets, showing that very few changes to the content of Android malware may suffice to evade detection. How can we thus trust that a malware detector performing well on benchmark data will continue to do so when deployed in an operating environment? To mitigate this issue, the most popular Android malware detectors use linear, explainable machine-learning models to easily identify the most influential features contributing to each decision. In this work, we generalize this approach to any black-box machine- learning model, by leveraging a gradient-based approach to identify the most influential local features. This enables using nonlinear models to potentially increase accuracy without sacrificing interpretability of decisions. Our approach also highlights the global characteristics learned by the model to discriminate between benign and malware applications. Finally, as shown by our empirical analysis on a popular Android malware detection task, it also helps identifying potential vulnerabilities of linear and nonlinear models against adversarial manipulations.

Explaining black-box android malware detection

Marco Melis;Davide Maiorca;Battista Biggio;Giorgio Giacinto;Fabio Roli

2018-01-01

Abstract

Machine-learning models have been recently used for detecting malicious Android applications, reporting impressive performances on benchmark datasets, even when trained only on features statically extracted from the application, such as system calls and permissions. However, recent findings have highlighted the fragility of such in-vitro evaluations with benchmark datasets, showing that very few changes to the content of Android malware may suffice to evade detection. How can we thus trust that a malware detector performing well on benchmark data will continue to do so when deployed in an operating environment? To mitigate this issue, the most popular Android malware detectors use linear, explainable machine-learning models to easily identify the most influential features contributing to each decision. In this work, we generalize this approach to any black-box machine- learning model, by leveraging a gradient-based approach to identify the most influential local features. This enables using nonlinear models to potentially increase accuracy without sacrificing interpretability of decisions. Our approach also highlights the global characteristics learned by the model to discriminate between benign and malware applications. Finally, as shown by our empirical analysis on a popular Android malware detection task, it also helps identifying potential vulnerabilities of linear and nonlinear models against adversarial manipulations.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
			2018
		
	Codice ISBN
	
			978-9-0827-9701-5
		
	Parole chiave
	
			Malware; Feature extraction; Detectors; Machine learning; Support vector machines; Signal processing algorithms; Approximation algorithms
		
	Tipologia:
	
			4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
melis18-eusipco-v2.pdf accesso aperto Tipologia: versione pre-print Dimensione 431.44 kB Formato Adobe PDF Visualizza/Apri	431.44 kB	Adobe PDF	Visualizza/Apri
melis18-eusipco-v3.pdf Solo gestori archivio Tipologia: versione post-print Dimensione 431.78 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	431.78 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/248107

Citazioni

ND

36

29

social impact