September 21, 2016
Brazilian Portuguese, Indigenous peoples, second language corpus
COLPI stands for "Corpus Oral de Língua Portuguesa Indígena" or Indigenous Portuguese Language Oral Corpus. It is a small sized oral corpus which documents Brazilian Portuguese as a second language as spoken by Brazilian Indigenous peoples. In this paper we describe its compilation process and its main characteristics. This corpus represents a first step in the attempt to document and make available data that so far has been scattered and not accessible to researchers. The recordings were carried by an anthropologist in her fieldwork and mostly document narratives, therefore portraying monologic texts. COLPI is part of a larger project aimed at documenting Brazilian Portuguese spontaneous speech, the C-ORAL-BRASIL corpus.