
Super-Sparse Learning in Similarity Spaces

Demontis, Ambra (first author); Melis, Marco; Biggio, Battista; Fumera, Giorgio; Roli, Fabio (last author)
2016-01-01

Abstract

In several applications, input samples are more naturally represented in terms of pairwise similarities than in terms of feature vectors. In these settings, machine-learning algorithms can become very computationally demanding, as they may require matching test samples against a very large set of reference prototypes. To mitigate this issue, different approaches have been developed to reduce the number of required reference prototypes. Current reduction approaches select a small subset of representative prototypes in the space induced by the similarity measure, and then separately train the classification function on the reduced subset. However, decoupling these two steps may prevent the number of prototypes from being reduced effectively without compromising accuracy. We overcome this limitation by jointly learning the classification function along with an optimal set of virtual prototypes, whose number can be either fixed a priori or optimized according to application-specific criteria. Creating a super-sparse set of virtual prototypes yields much sparser solutions, drastically reducing complexity at test time at the expense of slightly increased training complexity. A much smaller set of prototypes also results in easier-to-interpret decisions. We empirically show that our approach can reduce the test-time complexity of Support Vector Machines, LASSO, and ridge regression by up to ten times, while barely affecting their classification accuracy.
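
The abstract describes a decision function of the form f(x) = sum_j beta_j * k(x, z_j), where the virtual prototypes z_j are optimized jointly with the coefficients beta_j instead of being selected beforehand. The following sketch illustrates that joint optimization under assumptions of our own, not taken from the paper: an RBF similarity, a ridge-regression loss, and a plain gradient-descent loop. The function names (rbf, fit_super_sparse, predict) and all hyperparameter values are illustrative, not the authors' implementation.

import numpy as np

def rbf(X, Z, gamma=0.5):
    # Pairwise RBF similarities between samples X (n x d) and prototypes Z (m x d).
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def fit_super_sparse(X, y, n_proto=5, gamma=0.5, lam=1e-2, lr=0.1, n_iter=500, seed=0):
    # Jointly learn coefficients beta and virtual prototypes Z by gradient
    # descent on a ridge loss in the similarity space (illustrative objective).
    rng = np.random.default_rng(seed)
    Z = X[rng.choice(len(X), n_proto, replace=False)].copy()  # init from training data
    beta = np.zeros(n_proto)
    for _ in range(n_iter):
        K = rbf(X, Z, gamma)          # n x m similarity matrix
        r = K @ beta - y              # residuals
        # Gradient of 0.5/n * ||K beta - y||^2 + 0.5 * lam * ||beta||^2 w.r.t. beta.
        g_beta = K.T @ r / len(X) + lam * beta
        # Gradient w.r.t. each prototype z_j, via the chain rule through the RBF.
        for j in range(n_proto):
            w = r * beta[j] * K[:, j]
            Z[j] -= lr * 2 * gamma * (w[:, None] * (X - Z[j])).sum(0) / len(X)
        beta -= lr * g_beta
    return Z, beta

def predict(X, Z, beta, gamma=0.5):
    # Test-time cost scales with n_proto, not with the training-set size.
    return np.sign(rbf(X, Z, gamma) @ beta)

# Toy usage: two Gaussian blobs with labels in {-1, +1}.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(2, 1, (50, 2)), rng.normal(-2, 1, (50, 2))])
y = np.r_[np.ones(50), -np.ones(50)]
Z, beta = fit_super_sparse(X, y, n_proto=2)
print("training accuracy:", (predict(X, Z, beta) == y).mean())

At test time each sample is compared against only n_proto virtual prototypes rather than the whole training set, which is the source of the speed-up the abstract claims; training pays the extra cost of the prototype gradients.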
Theoretical computer science; Artificial intelligence
Files in this record:

demontis16-cim2.pdf (restricted: archive administrators only; copy available on request)
  Description: Main article
  Type: publisher's version
  Size: 3.37 MB
  Format: Adobe PDF

demontis16-cim.pdf (restricted: archive administrators only; copy available on request)
  Description: Main article, pre-print
  Type: pre-print version
  Size: 555.22 kB
  Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11584/189111
Citations
  • PMC: not available
  • Scopus: 6
  • Web of Science: 6