This paper1 presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.
Cecchini, F., Sprugnoli, R., Moretti, G., Passarotti, M. (2020). UDante: First steps towards the universal dependencies treebank of dante’s latin works. In CLiC-it 2020 Italian Conference on Computational Linguistics 2020 Proceedings of the Seventh Italian Conference on Computational Linguistics (pp.1-7). CEUR-WS.
UDante: First steps towards the universal dependencies treebank of dante’s latin works
Cecchini F. M.;
2020
Abstract
This paper1 presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.File | Dimensione | Formato | |
---|---|---|---|
paper_14.pdf
accesso aperto
Descrizione: This volume and its papers are published under the Creative Commons License Attribution 4.0 International (CC BY 4.0).
Tipologia di allegato:
Publisher’s Version (Version of Record, VoR)
Licenza:
Creative Commons
Dimensione
283.13 kB
Formato
Adobe PDF
|
283.13 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.