Skip to content

The SKRIV Corpus

Texts written by students in upper secondary vocational
education programs. The corpus is especially suitable for the analysis of texts written by students with Norwegian as their second language.
There are approx 225 texts and 112 000 words in the corpus. The texts differ in length, genre and type.

The text corpus have three different versions of each text: one scanned original in pdf format and two transcribed versions in txt format: one original transcription with errors and one version where the errors are corrected.
All versions are linked and it is possible to search in both transcribed versions.

Texts written by students in upper secondary vocational
education programs. The corpus is especially suitable for the analysis of texts written by students with Norwegian as their second language.
There are approx 225 texts and 112 000 words in the corpus. The texts differ in length, genre and type.

The text corpus have three different versions of each text: one scanned original in pdf format and two transcribed versions in txt format: one original transcription with errors and one version where the errors are corrected.
All versions are linked and it is possible to search in both transcribed versions.

Extended metadata

Download resources

Go to resource page