Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark

Lehmann, J.; Meloni, A.; Motta, E.; Osborne, F.; reforgiato Recupero, D.; Salatino, A. A.; Vahdati, S.

doi:10.1007/978-3-031-60626-7_11

The SciQA benchmark for scientific question answering aims to represent a challenging task for next-generation question-answering systems on which vanilla large language models fail. In this article, we provide an analysis of the performance of language models on this benchmark including prompting and fine-tuning techniques to adapt them to the SciQA task. We show that both fine-tuning and prompting techniques with intelligent few-shot selection allow us to obtain excellent results on the SciQA benchmark. We discuss the valuable lessons and common error categories, and outline their implications on how to optimise large language models for question answering over knowledge graphs.

Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark

Lehmann J.;Meloni A.;Motta E.;Osborne F.;reforgiato Recupero D.;Salatino A. A.;Vahdati S.

2024-01-01

Abstract

The SciQA benchmark for scientific question answering aims to represent a challenging task for next-generation question-answering systems on which vanilla large language models fail. In this article, we provide an analysis of the performance of language models on this benchmark including prompting and fine-tuning techniques to adapt them to the SciQA task. We show that both fine-tuning and prompting techniques with intelligent few-shot selection allow us to obtain excellent results on the SciQA benchmark. We discuss the valuable lessons and common error categories, and outline their implications on how to optimise large language models for question answering over knowledge graphs.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Codice ISBN
	
				9783031606250
9783031606267
			
	Parole chiave
	
				Few-shot learning
Fine-tuning
Knowledge graphs
Language models
Question answering

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/426624

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

1

0

UNICA IRIS Institutional Research Information System

Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark

Lehmann J.;Meloni A.;Motta E.;Osborne F.;reforgiato Recupero D.;Salatino A. A.;Vahdati S.

2024-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Attenzione

Citazioni

social impact

UNICA IRIS Institutional Research Information System

Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark

Lehmann J.;Meloni A.;Motta E.;Osborne F.;reforgiato Recupero D.;Salatino A. A.;Vahdati S.

2024-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Attenzione

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)