Skip to content

Norwegian idioms

This dataset consists of 3537 Norwegian idioms and phrases that appear more than 100 times in the online library of the National Library of Norway. There are 3455 idioms in Norwegian Bokmål and 88 in Norwegian Nynorsk. In the future we will try to add more idioms for Nynorsk. See the documentation file for a description of the dataset. The data can be used to measure a generative language model’s ability to complete well known idioms or as a masked language modeling task.

This dataset consists of 3537 Norwegian idioms and phrases that appear more than 100 times in the online library of the National Library of Norway. There are 3455 idioms in Norwegian Bokmål and 88 in Norwegian Nynorsk. In the future we will try to add more idioms for Nynorsk. See the documentation file for a description of the dataset. The data can be used to measure a generative language model’s ability to complete well known idioms or as a masked language modeling task.

Extended metadata

Download resources

Download metadata