A SPANISH MINI-CORPUS FOR IPIC

Carlota Nicolás Martínez, Marina Lombán Somacarrera

Abstract


This paper presents the extension of IPIC to a Spanish mini-corpus, designed and tagged according to the L-AcT criteria and comparable with the Italian and Brazilian Portuguese mini-corpora already stored in the database. IPIC was created by the LABLITA team to ground the cross-linguistic comparison of information structure in spontaneous speech, and it can be applied to corpora collected according to a similar corpus design and size (approx. 5000 reference units). The reference units of speech analysis (utterance, stanza) are identified through terminal prosodic breaks and aligned to the corresponding sound. In turn, reference units are segmented through non-terminal prosodic breaks into information units, which are tagged according to the L-AcT repertory of information functions. The paper shows typical Spanish examples of each information unit type and, where relevant, details their specific terminology. The informationally tagged corpus also supports a solid, in-depth analysis of spoken Spanish at the grammatical and lexical levels.









CHIMERA Romance Corpora and Linguistic Studies

ISSN: 2386-2629