Skip to content

NST N-gram – Norwegian Bokmål

These n-grams are derived from parts of the Text Corpus from Nordic Language Technology AS (NST). The source material consists of 510 million words of running text.

The n-grams are also available as an overview listing only the 1000 most frequent n-grams (n=1-6).

In the full version, all the derived n-grams (n=1-6) are sorted alphabetically and by frequency, respectively. Frequency lists (unigrams) are also available separately.

These n-grams are derived from parts of the Text Corpus from Nordic Language Technology AS (NST). The source material consists of 510 million words of running text.

The n-grams are also available as an overview listing only the 1000 most frequent n-grams (n=1-6).

In the full version, all the derived n-grams (n=1-6) are sorted alphabetically and by frequency, respectively. Frequency lists (unigrams) are also available separately.

Extended metadata

Download resources

Download metadata