Bicocca Open Archive

Accessing or integrating data lexicalized in different languages is a challenge. Multilingual lexical resources play a fundamental role in reducing the language barriers to map concepts lexicalized in different languages. In this paper we present a large-scale study on the effectiveness of automatic translations to support two key cross-lingual ontology mapping tasks: the retrieval of candidate matches and the selection of the correct matches for inclusion in the final alignment. We conduct our experiments using four different large gold standards, each one consisting of a pair of mapped wordnets, to cover four different families of languages. We categorize concepts based on their lexicalization (type of words, synonym richness, position in a subconcept graph) and analyze their distributions in the gold standards. Leveraging this categorization, we measure several aspects of translation effectiveness, such as word-translation correctness, word sense coverage, synset and synonym coverage. Finally, we thoroughly discuss several findings of our study, which we believe are helpful for the design of more sophisticated cross-lingual mapping algorithms.

ABU HELOU, M., Palmonari, M., Jarrar, M. (2016). Effectiveness of automatic translations for cross-lingual ontology mapping. THE JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 55, 165-208 [10.1613/jair.4789].

Effectiveness of automatic translations for cross-lingual ontology mapping

ABU HELOU, MAMOUN^Primo;PALMONARI, MATTEO LUIGI^Secondo;Jarrar, M.

2016

Abstract

Accessing or integrating data lexicalized in different languages is a challenge. Multilingual lexical resources play a fundamental role in reducing the language barriers to map concepts lexicalized in different languages. In this paper we present a large-scale study on the effectiveness of automatic translations to support two key cross-lingual ontology mapping tasks: the retrieval of candidate matches and the selection of the correct matches for inclusion in the final alignment. We conduct our experiments using four different large gold standards, each one consisting of a pair of mapped wordnets, to cover four different families of languages. We categorize concepts based on their lexicalization (type of words, synonym richness, position in a subconcept graph) and analyze their distributions in the gold standards. Leveraging this categorization, we measure several aspects of translation effectiveness, such as word-translation correctness, word sense coverage, synset and synonym coverage. Finally, we thoroughly discuss several findings of our study, which we believe are helpful for the design of more sophisticated cross-lingual mapping algorithms.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Artificial Intelligence
			
	Lingua del contenuto
	
				English
			
	Data di pubblicazione
	
				2016
			
	Rivista
	
				THE JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH
			
	Numero del volume
	
				55
			
	Pagina iniziale
	
				165
			
	Pagina finale
	
				208
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1613/jair.4789
			
	URL alternativo
	
				http://www.jair.org/media/4789/live-4789-9080-jair.pdf
			
	Fulltext
	
				partially_open
			
	Citazione
	
				ABU HELOU, M., Palmonari, M., Jarrar, M. (2016). Effectiveness of automatic translations for cross-lingual ontology mapping. THE JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 55, 165-208 [10.1613/jair.4789].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
10281-136175.pdf accesso aperto Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Dimensione 2.54 MB Formato Adobe PDF Visualizza/Apri	2.54 MB	Adobe PDF	Visualizza/Apri
J2016-jair.pdf Solo gestori archivio Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Dimensione 2.5 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	2.5 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/136175

Citazioni

31

18

Social impact