On stability of ensemble gene selection

DESSI, NICOLETTA; PES, BARBARA
2015-01-01

Abstract

When the feature selection process aims at discovering useful knowledge from data, not just producing an accurate classifier, the stability of the selected features is a crucial issue. In recent years, the ensemble paradigm has been proposed as a primary avenue for enhancing the stability of feature selection, especially in high-dimensional/small-sample-size domains such as biomedicine. However, the potential and the implications of the ensemble approach have been investigated only partially, and the indications provided by recent literature are not yet exhaustive. To contribute in this direction, we present an empirical analysis that evaluates the effects of an ensemble strategy in the context of gene selection from high-dimensional microarray data. Our results show that the ensemble paradigm is not always beneficial in itself, but it can be very useful with selection algorithms that are intrinsically less stable.
Year: 2015
ISBN: 978-3-319-24833-2
ISBN: 978-3-319-24834-9
Keywords: Ensemble paradigm; Feature selection stability; Gene selection
Files in this item:
File: IDEAL_2015.pdf
Access: archive managers only
Description: main article
Type: publisher's version (VoR)
Size: 1.65 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11584/181726
Citations
  • PMC: N/A
  • Scopus: 5
  • ISI: 5