This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.
The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts / Sprugnoli, Rachele; Caselli, Tommaso; Tonelli, Sara; Moretti, Giovanni. - 2:(2017), pp. 260-266. (Intervento presentato al convegno 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 tenutosi a Valencia (Spagna) nel 3-7 April 2017).
The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts
Rachele Sprugnoli;
2017-01-01
Abstract
This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.