National Library of Norway
Språkbanken
The Norwegian Language Bank
Resource Catalogue
Text
20.04.2020
Norwegian Newspaper Corpus
The Norwegian Newspaper Corpus was a project at the University of Bergen where news websites were crawled for news articles. This version of The Norwegian Newspaper Corpus consists of text from 1998 …
Language:
Norwegian Bokmål, Norwegian Nynorsk
Origin:
Language Bank
Licence:
Creative_Commons-BY-NC (CC-BY-NC)
Type:
Text
Updated:
20.04.2020
Lexicon
02.04.2020
Norwegian Words
Norwegian Words is a searchable lexical database containing approximately 1650 Norwegian nouns, verbs and adjectives. The database contains information about different properties that can affect …
Origin:
CLARINO Text Laboratory Centre
Licence:
CLARIN_PUB-BY
Type:
Lexicon
Updated:
02.04.2020
Speech, Text
02.04.2020
LIA Sápmi – the LIA corpus of Sami dialects
The LIA Sápmi corpus is a speech corpus with recordings from 1960 to 1990 of Sami dialects from the northern parts of Norway, Finland and Sweden, some recordings from NRK Sami radio and some from UiT, …
Language:
Northern Sami
Origin:
CLARINO Text Laboratory Centre
Licence:
CLARIN_ACA-NC-LOC-PRIV-ND-*
Type:
Speech, Text
Updated:
02.04.2020
Text
25.02.2020
Corpus with Book Reviews from Bokelskere.no
This corpus is a dump of user-generated book reviews and discussions from Bokelskere.no (meaning "book lovers"), a web community where users review and discuss new and old literature, both fiction and …
Language:
Norwegian Bokmål, Norwegian Nynorsk
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Text
Updated:
25.02.2020
Text
19.02.2020
The KIAP corpus
KIAP is a corpus of 450 research articles covering three disciplines (economics, linguistics and medicine) and three languages (English, French and Norwegian). It is available in Corpuscle at the …
Language:
Norwegian, French, English
Origin:
CLARINO Bergen Centre
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
19.02.2020
Text
11.12.2019
Discussions from Wikipedia
This corpus is a dump of discussion threads from the Norwegian Wikipedia, where authors discuss various issues regarding the publication of specific Wikipedia articles. The material is split into two …
Language:
Norwegian Bokmål, Norwegian Nynorsk
Origin:
Language Bank
Licence:
Creative_Commons-BY-SA (CC-BY-SA)
Type:
Text
Updated:
11.12.2019
Text
29.05.2019
NorNE – Norwegian Named Entities
NorNE (Norwegian Named Entities) is a text corpus composed of the same texts as the Norwegian Dependency Treebank (NDT), but this version is additionally tagged with named entities. The corpus contains …
Language:
Norwegian Bokmål, Norwegian Nynorsk
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Text
Updated:
29.05.2019
Text
20.05.2019
ASK – The Norwegian Second Language Corpus
ASK is an electronic, searchable text corpus of Norwegian as a second language, with links between linguistic data and personal data. ASK was established by the Norwegian Second Language Corpus …
Language:
Norwegian Bokmål, Norwegian
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_RES-PRIV
Type:
Text
Updated:
20.05.2019
Text
22.03.2019
Texts from Norwegian Wikipedia
This corpus is a dump from approximately March 20, 2019 of all Wikipedia articles written in Norwegian Bokmål, Norwegian Nynorsk and Northern Sami. The corpus contains 492,864 articles for Norwegian …
Language:
Norwegian Bokmål, Norwegian Nynorsk, Northern Sami
Origin:
Language Bank
Licence:
Creative_Commons-BY-SA (CC-BY-SA)
Type:
Text
Updated:
22.03.2019
03.03.2019
Universal Dependencies 2.3 (copy @ INESS)
The “Universal Dependencies 2.3” collection is searchable at the INESS portal; to read further details about the original collection as a whole, or about individual treebanks in the collection, …
Origin:
CLARINO Bergen Centre
Licence:
unspecified
Updated:
03.03.2019