This paper1 presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.
UDante: First Steps Towards the Universal Dependencies Treebank of Dante’s Latin Works / Cecchini, Flavio; Sprugnoli, Rachele; Moretti, Giovanni; Passarotti, Marco. - ELETTRONICO. - (2020), pp. 99-105. (Intervento presentato al convegno Seventh Italian Conference on Computational Linguistics (CLiC-it 2020) tenutosi a online nel 1-3 marzo 2021).
UDante: First Steps Towards the Universal Dependencies Treebank of Dante’s Latin Works
Sprugnoli Rachele;
2020-01-01
Abstract
This paper1 presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco.File | Dimensione | Formato | |
---|---|---|---|
9791280136282(2)-111-117.pdf
accesso aperto
Tipologia:
Versione (PDF) editoriale
Licenza:
Creative commons
Dimensione
442.84 kB
Formato
Adobe PDF
|
442.84 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.