We present a Question Answering system aimed to answer natural language questions over the open RDF spending data provided by LinkedSpeding. We propose an original machine-learning approach to learn generalized SPARQL templates from an existing training set of (NL question, SPARQL query) pairs. In our approach, the generalized SPARQL templates are fed to an instance-based classifier that associates a given user-provided question to an existing pair that is used to answer the user question. We employ an external tagger, delegating the Named-Entity Recognition (NER) task to a service developed for the domain we want to query. The problem is particularly challenging due to the small training set size available, counting only 100 questions/SPARQL queries. We illustrate the results of our new approach using data provided by the Question Answering over Linked Data challenge (QALD-6) task 3, showing that we can provide a correct answer to 14 of the 50 questions of the test set. These results are then compared to existing systems, including our previous system, QA3, where templates were provided by an expert rather than being generated automatically from a training set.
Machine learning of SPARQL templates for question answering over LinkedSpending
Atzori, Maurizio
;
2019-01-01
Abstract
We present a Question Answering system aimed to answer natural language questions over the open RDF spending data provided by LinkedSpeding. We propose an original machine-learning approach to learn generalized SPARQL templates from an existing training set of (NL question, SPARQL query) pairs. In our approach, the generalized SPARQL templates are fed to an instance-based classifier that associates a given user-provided question to an existing pair that is used to answer the user question. We employ an external tagger, delegating the Named-Entity Recognition (NER) task to a service developed for the domain we want to query. The problem is particularly challenging due to the small training set size available, counting only 100 questions/SPARQL queries. We illustrate the results of our new approach using data provided by the Question Answering over Linked Data challenge (QALD-6) task 3, showing that we can provide a correct answer to 14 of the 50 questions of the test set. These results are then compared to existing systems, including our previous system, QA3, where templates were provided by an expert rather than being generated automatically from a training set.| File | Dimensione | Formato | |
|---|---|---|---|
|
Machine_Learning_of_SPARQL_Templates_for_Question_Answering_Over_LinkedSpending.pdf
Solo gestori archivio
Descrizione: VoR
Tipologia:
versione editoriale (VoR)
Dimensione
136.93 kB
Formato
Adobe PDF
|
136.93 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
|
Atzori_Iris (1).pdf
accesso aperto
Descrizione: AAM
Tipologia:
versione post-print (AAM)
Dimensione
839.69 kB
Formato
Adobe PDF
|
839.69 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


