Desafíos y reflexiones sobre utilización de corpus libres: El caso de PRESEEA en el estudio de la reformulación
Palabras clave:
spoken language, free corpora, corpus segmentation, rephrasingResumen
Our main purpose is to present our experience working on a free spoken-language corpus in Spanish. As part of my doctoral thesis project, concerned with rephrasing used in spoken interaction, I decided to study this phenomenon by analyzing the data available in the PRESEEA corpus. In order to annotate a part of this corpus, we created a TextGrid file on PRAAT associated with each audio file. Then, we imported those files onto ELAN to start the annotation process, with categories related to five language levels relevant for our research: prosodic, syntactic, macrosyntactic, interactional and rephrasing phenomena. In addition to sharing our methodological work experience, we would like to emphasize the importance of free spoken corpora for the development of the research in linguistics.Descargas
Citas
Blanche-Benveniste, C. 2010. Approches de la langue parlée en français. Paris: Ophrys.
Colón de Carvajal, I. 2013. Guide practique pour annoter sur ELAN. Publicado por la Cellule de Corpus Complexes del Laboratorio ICAR UMR 5191. http://www.icar.cnrs.fr/ccc/ressources/ (consultado en junio, 2018).
Feuillard, C. 1989. La Syntaxe fonctionnelle dans le cadre des théories linguistiques contemporaines. Thèse d’État, Université Paris V.
PRESEEA : Corpus del Proyecto para el estudio sociolingüístico del español de España y de América. Alcalá de Henares: Universidad de Alcalá. http://preseea.linguas.net/ (consultado abril, 2018).
Selting, M. 2000. The construction of units in conversational talk. Language in Society 29: 477-517.
Traverso, V. 2008. Analyser un corpus de langue parlée en interaction : questions méthodologiques. Verbum XXX(4): 313-328.
Traverso, V. 2007. Analyser les conversations. Paris: Armand Colin.