The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper we present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result we obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and named entities. We then show that our corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity.

Raganato, A., Delli Bovi, C., Navigli, R. (2016). Automatic construction and evaluation of a large semantically enriched wikipedia. In Proceedings of 25th International Joint Conference on Artificial Intelligence (pp.2894-2900). USA : AAAI PRESS.

Automatic construction and evaluation of a large semantically enriched wikipedia

Raganato Alessandro
;
2016

Abstract

The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper we present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result we obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and named entities. We then show that our corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity.
Si
paper
wikipedia; hyperlinks; sense annotations; word sense disambiguation; entity linking; semantic similarity;
English
25th International Joint Conference on Artificial Intelligence (IJCAI-16) - 9 July 2016 through 15 July 2016
http://lcl.uniroma1.it/sew/papers/IJCAI16.pdf
Raganato, A., Delli Bovi, C., Navigli, R. (2016). Automatic construction and evaluation of a large semantically enriched wikipedia. In Proceedings of 25th International Joint Conference on Artificial Intelligence (pp.2894-2900). USA : AAAI PRESS.
Raganato, A; Delli Bovi, C; Navigli, R
File in questo prodotto:
File Dimensione Formato  
IJCAI16.pdf

Solo gestori archivio

Dimensione 540.23 kB
Formato Adobe PDF
540.23 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/361547
Citazioni
  • Scopus 23
  • ???jsp.display-item.citation.isi??? ND
Social impact