Skip to content

N-gram – Norwegian Nynorsk

This corpus is a collection of n-grams (n=1-6) based on approximately 60 million words of running text from the Nynorsk part of the Norwegian Newspaper Corpus and the text corpus of Nordic Language Technology AS.

This version contains all the n-grams, sorted by frequency and alphabetically, respectively.

A version listing only the 1000 most frequent n-grams can also be downloaded. Frequency lists (unigrams) are also available for download separately.

This corpus is a collection of n-grams (n=1-6) based on approximately 60 million words of running text from the Nynorsk part of the Norwegian Newspaper Corpus and the text corpus of Nordic Language Technology AS.

This version contains all the n-grams, sorted by frequency and alphabetically, respectively.

A version listing only the 1000 most frequent n-grams can also be downloaded. Frequency lists (unigrams) are also available for download separately.

Extended metadata

Download resources

Download metadata