
Energy-latency attacks via sponge poisoning

Cinà, Antonio Emanuele; Demontis, Ambra; Biggio, Battista; Roli, Fabio
2025-01-01

Abstract

Sponge examples are test-time inputs optimized to increase energy consumption and prediction latency of deep networks deployed on hardware accelerators. By increasing the fraction of neurons activated during classification, these attacks reduce sparsity in network activation patterns, worsening the performance of hardware accelerators. In this work, we present a novel training-time attack, named sponge poisoning, which aims to worsen energy consumption and prediction latency of neural networks on any test input without affecting classification accuracy. To stage this attack, we assume that the attacker can control only a few model updates during training — a likely scenario, e.g., when model training is outsourced to an untrusted third party or distributed via federated learning. Our extensive experiments on image classification tasks show that sponge poisoning is effective, and that fine-tuning poisoned models to repair them poses prohibitive costs for most users, highlighting that tackling sponge poisoning remains an open issue.
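The abstract describes the mechanism only at a high level, so the following is a minimal PyTorch-style sketch of the idea, assuming the attacker augments the task loss on the few training updates it controls with a differentiable reward for activation density (i.e., reduced sparsity). The function names, the a^2 / (a^2 + eps) surrogate, the ReLU-only hooks, and the lam hyperparameter are illustrative assumptions, not the paper's exact formulation.

```python
import torch.nn as nn
import torch.nn.functional as F


def activation_density(model, x, eps=1e-4):
    """Smooth surrogate of the fraction of non-zero activations.

    Hooks every ReLU module and scores each activation a with
    a^2 / (a^2 + eps), a differentiable stand-in for the ell_0 count;
    higher values mean denser (less sparse) activations. The layer choice
    and surrogate are illustrative assumptions.
    """
    acts = []
    hooks = [m.register_forward_hook(lambda _m, _inp, out: acts.append(out))
             for m in model.modules() if isinstance(m, nn.ReLU)]
    model(x)  # forward pass only to collect activations
    for h in hooks:
        h.remove()
    score = sum((a ** 2 / (a ** 2 + eps)).sum() for a in acts)
    count = sum(a.numel() for a in acts)
    return score / max(count, 1)


def sponge_poisoned_step(model, optimizer, x, y, lam=1.0, controlled=True):
    """One training step. On attacker-controlled updates, the usual
    cross-entropy loss is reduced by the density reward, pushing the model
    toward denser activations while it still fits the labels."""
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x), y)
    if controlled:
        loss = loss - lam * activation_density(model, x)
    loss.backward()
    optimizer.step()
    return float(loss)
```

Because the cross-entropy term is kept on every update, classification accuracy is largely preserved; what changes is that the learned representations become denser, which is precisely what degrades sparsity-exploiting hardware accelerators at test time.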
Keywords: Adversarial machine learning; AI security; Deep neural networks; Energy poisoning; Sponge poisoning attacks
Files in this record:

ve_sponge_poisoning_is25.pdf: published version (VoR), Adobe PDF, 1.93 MB, available to archive administrators only (copy on request).

postprint_InfSciences_2024__Sponge_Poisoning.pdf: post-print (AAM), Adobe PDF, 4.29 MB, under embargo until 01/07/2026 (copy on request).

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11584/437825
Citations
  • PubMed Central: not available
  • Scopus: 0
  • Web of Science: 0