The convergence of the Internet of Things, artificial intelligence, and semantic technologies is reshaping how digital systems model and reason about human behavior. Human Digital Twins (HDTs) offer dynamic user representations by integrating multimodal data from physiological, behavioral, and linguistic sources. However, most existing HDTs perform low-level data fusion, lacking the semantic consistency needed for coherent reasoning and personalization. This paper proposes a novel pipeline for constructing Personal Knowledge Graphs (PKGs), i.e. adaptive, formal representations of user knowledge, that combine the generative capabilities of Large Language Models (LLMs) with ontology-based validation. The pipeline includes triplet extraction from language data, semantic alignment through embeddings, and ontology-driven graph construction. A comparative analysis between ontology-free and ontology-guided PKG generation demonstrates that semantic grounding substantially reduces spurious relationships and improves reasoning accuracy, establishing ontological validation as essential for reliable knowledge extraction. By bridging symbolic and generative paradigms, the proposed approach advances knowledge-driven HDTs that are proactive, interpretable, and resilient.

Giving voice to digital twins: How LLMs build human knowledge graphs

Pruner A.;Atzori L.;Nitti M.
2026-01-01

Abstract

The convergence of the Internet of Things, artificial intelligence, and semantic technologies is reshaping how digital systems model and reason about human behavior. Human Digital Twins (HDTs) offer dynamic user representations by integrating multimodal data from physiological, behavioral, and linguistic sources. However, most existing HDTs perform low-level data fusion, lacking the semantic consistency needed for coherent reasoning and personalization. This paper proposes a novel pipeline for constructing Personal Knowledge Graphs (PKGs), i.e. adaptive, formal representations of user knowledge, that combine the generative capabilities of Large Language Models (LLMs) with ontology-based validation. The pipeline includes triplet extraction from language data, semantic alignment through embeddings, and ontology-driven graph construction. A comparative analysis between ontology-free and ontology-guided PKG generation demonstrates that semantic grounding substantially reduces spurious relationships and improves reasoning accuracy, establishing ontological validation as essential for reliable knowledge extraction. By bridging symbolic and generative paradigms, the proposed approach advances knowledge-driven HDTs that are proactive, interpretable, and resilient.
2026
Human Digital Twins
Large language models
Ontology
Personal knowledge graph
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/483025
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact