Available document collections are more and more required for supervised text categorization tasks. They typically are collections of documents classified by domain engineers. In this paper, we propose a semantic text categorization approach able to automatically create document collections in which documents are classified according to WordNet Domains taxonomy. Experiments have been performed by training a classifier with an automatic document collection and comparing results with those obtained by training the same classifier on a hand-made document collection. Experimental results point out that, on average, the performances of the automatic approach are quite similar to those obtained on a document collection classified by domain engineers.

A Novel Semantic Approach to Document Collections

ARMANO, GIULIANO;VARGIU, ELOISA
2009-01-01

Abstract

Available document collections are more and more required for supervised text categorization tasks. They typically are collections of documents classified by domain engineers. In this paper, we propose a semantic text categorization approach able to automatically create document collections in which documents are classified according to WordNet Domains taxonomy. Experiments have been performed by training a classifier with an automatic document collection and comparing results with those obtained by training the same classifier on a hand-made document collection. Experimental results point out that, on average, the performances of the automatic approach are quite similar to those obtained on a document collection classified by domain engineers.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/105781
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact