SIKOR Kven free corpus

The SIKOR Kven free corpus is a monolingual text corpus of Kven that contains administrative, law, religious, non-fiction, fiction, and news texts. It was created by the Giellatekno and Divvun research groups, Department of Linguistics, UiT The Arctic University of Norway, together with members of the language community. In particular, the following colleagues contributed to the creation of the resource: Ciprian Gerstenberger, Børre Gaup, Sindre Trosterud, Leena Niiranen, Kaisa Maliniemi, Paula Paksuniemi, Mervi Haavisto, Trond Trosterud, and Anne-Kaisa Räisänen. The data set (21,024 sentences; 182,697 tokens) is annotated with word form, lemma, and morphosyntactic analysis. The corpus has been processed and linguistically analyzed automatically with the Giellatekno/Divvun tools, so it may contain incorrect annotations. If you find any errors, the creators would appreciate feedback sent to giellatekno@uit.no and feedback@divvun.no.
Please note that the Giellatekno resources are dynamic in nature. To make sure that you have the most recent version, please contact Giellatekno (see Contact Info in metadata).