Accessing or integrating data lexicalized in different languages is a challenge. Multilingual lexical resources play a fundamental role in reducing the language barriers to map concepts lexicalized in different languages. In this paper we present a large-scale study on the effectiveness of automatic translations to support two key cross-lingual ontology mapping tasks: the retrieval of candidate matches and the selection of the correct matches for inclusion in the final alignment. We conduct our experiments using four different large gold standards, each one consisting of a pair of mapped wordnets, to cover four different families of languages. We categorize concepts based on their lexicalization (type of words, synonym richness, position in a subconcept graph) and analyze their distributions in the gold standards. Leveraging this categorization, we measure several aspects of translation effectiveness, such as word-translation correctness, word sense coverage, synset and synonym coverage. Finally, we thoroughly discuss several findings of our study, which we believe are helpful for the design of more sophisticated cross-lingual mapping algorithms.

ABU HELOU, M., Palmonari, M., Jarrar, M. (2016). Effectiveness of automatic translations for cross-lingual ontology mapping. THE JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 55, 165-208 [10.1613/jair.4789].

Effectiveness of automatic translations for cross-lingual ontology mapping

ABU HELOU, MAMOUN
Primo
;
PALMONARI, MATTEO LUIGI
Secondo
;
2016

Abstract

Accessing or integrating data lexicalized in different languages is a challenge. Multilingual lexical resources play a fundamental role in reducing the language barriers to map concepts lexicalized in different languages. In this paper we present a large-scale study on the effectiveness of automatic translations to support two key cross-lingual ontology mapping tasks: the retrieval of candidate matches and the selection of the correct matches for inclusion in the final alignment. We conduct our experiments using four different large gold standards, each one consisting of a pair of mapped wordnets, to cover four different families of languages. We categorize concepts based on their lexicalization (type of words, synonym richness, position in a subconcept graph) and analyze their distributions in the gold standards. Leveraging this categorization, we measure several aspects of translation effectiveness, such as word-translation correctness, word sense coverage, synset and synonym coverage. Finally, we thoroughly discuss several findings of our study, which we believe are helpful for the design of more sophisticated cross-lingual mapping algorithms.
Articolo in rivista - Articolo scientifico
Artificial Intelligence
English
2016
55
165
208
partially_open
ABU HELOU, M., Palmonari, M., Jarrar, M. (2016). Effectiveness of automatic translations for cross-lingual ontology mapping. THE JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 55, 165-208 [10.1613/jair.4789].
File in questo prodotto:
File Dimensione Formato  
10281-136175.pdf

accesso aperto

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 2.54 MB
Formato Adobe PDF
2.54 MB Adobe PDF Visualizza/Apri
J2016-jair.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 2.5 MB
Formato Adobe PDF
2.5 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/136175
Citazioni
  • Scopus 27
  • ???jsp.display-item.citation.isi??? 17
Social impact