Vol. 3 No. 2 (2016): Approaching Diversity in Speech Studies: New Methodologies under Empirical Perspectives
Corpora and Tools

COLPI. Compilation of an Indigenous Brazilian Portuguese L2 corpus

Gláucia Buratto Rodrigues de Mello
UFMG, FAPEMIG
Heliana Mello
UFMG, FAPEMIG
Published September 21, 2016

Keywords:

Brazilian Portuguese, Indigenous peoples, second language corpus

Abstract

COLPI stands for "Corpus Oral de Língua Portuguesa Indígena" or Indigenous Portuguese Language Oral Corpus. It is a small sized oral corpus which documents Brazilian Portuguese as a second language as spoken by Brazilian Indigenous peoples.  In this paper we describe its compilation process and its main characteristics. This corpus represents a first step in the attempt to document and make available data that so far has been scattered and not accessible to researchers. The recordings were carried by an anthropologist in her fieldwork and mostly document narratives, therefore portraying monologic texts. COLPI is part of a larger project aimed at documenting Brazilian Portuguese spontaneous speech, the C-ORAL-BRASIL corpus.