This paper presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.

Cecchini, F., Sprugnoli, R., Moretti, G., Passarotti, M. (2020). UDante: First steps towards the Universal Dependencies treebank of Dante’s Latin works. In CLiC-it 2020 Italian Conference on Computational Linguistics 2020 Proceedings of the Seventh Italian Conference on Computational Linguistics (pp. 1-7). CEUR-WS.

UDante: First steps towards the Universal Dependencies treebank of Dante’s Latin works

Cecchini F. M.;
2020

Type: paper
Keywords: Computational linguistics
Language: English
Conference: 7th Italian Conference on Computational Linguistics, CLiC-it 2020, 1 March 2021 through 3 March 2021
Proceedings: CLiC-it 2020 Italian Conference on Computational Linguistics 2020 Proceedings of the Seventh Italian Conference on Computational Linguistics
Year: 2020
Volume: 2769
Pages: 1-7
URL: https://ceur-ws.org/Vol-2769/
Access: open
Files in this item:
File: paper_14.pdf
Access: open access
Description: This volume and its papers are published under the Creative Commons License Attribution 4.0 International (CC BY 4.0).
Attachment type: Publisher’s Version (Version of Record, VoR)
License: Creative Commons
Size: 283.13 kB
Format: Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/10281/504970