A comparative analysis of biomarker selection techniques

DESSI, NICOLETTA;PES, BARBARA
2013-01-01

Abstract

Feature selection has become an essential step in biomarker discovery from high-dimensional genomics data. It is recognized that different feature selection techniques may yield different sets of biomarkers, i.e. different groups of genes highly correlated with a given pathological condition, but few direct comparisons quantify these differences systematically. In this paper, we propose a general methodology for comparing the outcomes of different selection techniques in the context of biomarker discovery. The comparison is carried out along two dimensions: (i) measuring the similarity/dissimilarity of the selected gene sets, and (ii) evaluating the implications of these differences in terms of both predictive performance and stability of the selected gene sets. As a case study, we considered three benchmarks derived from DNA micro-array experiments and conducted a comparative analysis of eight selection methods, representative of different classes of feature selection techniques. Our results show that the proposed approach can provide useful insight into the pattern of agreement among biomarker discovery techniques.
Biomarker Discovery, Micro-array Data Analysis, Feature Selection, Stability in Feature Selection, Similarity Measures
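
The two comparison dimensions named in the abstract can be illustrated with a minimal sketch. The abstract does not say which similarity or stability measures the paper uses, so the Jaccard index and the average pairwise Jaccard below are assumptions chosen for illustration, and the function names and gene identifiers are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard index of two gene sets: |A & B| / |A | B| (assumed measure)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # two empty selections are treated as identical
    return len(a & b) / len(a | b)


def pairwise_similarity(selections):
    """Similarity for every pair of methods.

    selections: dict mapping a method name to the genes it selected.
    Returns a dict keyed by (method_1, method_2) in sorted order.
    """
    return {
        (m1, m2): jaccard(selections[m1], selections[m2])
        for m1, m2 in combinations(sorted(selections), 2)
    }


def stability(subsets):
    """Average pairwise Jaccard over the gene sets one method selects
    on different perturbations (e.g. resamples) of the same data."""
    pairs = list(combinations(subsets, 2))
    if not pairs:
        return 1.0  # a single selection is trivially stable
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical gene sets chosen by two hypothetical selection methods.
selections = {
    "method_a": {"g1", "g2", "g3"},
    "method_b": {"g2", "g3", "g4"},
}
print(pairwise_similarity(selections))

# Hypothetical selections of one method on three resamples of the data.
print(stability([{"g1", "g2"}, {"g1", "g2"}, {"g1", "g3"}]))
```

A value near 1 means two methods (or two runs of the same method) pick largely the same genes, and a value near 0 means they barely overlap. Other set or rank-based measures could be substituted without changing the structure of the comparison.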
File in this record: BIOMED387673.pdf (main article; publisher's version (VoR); open access; Adobe PDF, 224.55 kB)

Use this identifier to cite or link to this document: https://hdl.handle.net/11584/95287