Skip to content
National Library of Norway
|
Språkbanken
Norsk
The Norwegian Language Bank
Resource Catalogue
I samarbeid med
Vis filter
Skjul filter
Type
Origin
Vis filter
Skjul filter
Text
28.10.2021
N-grams from NBdigital 2021
This resource contains n-grams - i.e. unigrams, bigrams and trigrams - from all books and newspapers that had been digitized at the National Library of Norway up to July 2021. The n-grams have been …
Language:
Norwegian Bokmål, Norwegian Nynorsk, Northern Sami, Southern Sami, Lule Sami, Kven
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Text
Updated:
28.10.2021
Lexicon
28.09.2021
ONOMASTICA Pronunciation Lexicon 2
ONOMASTICA Version 2 is an updated version of the original ONOMASTICA Pronunciation Lexicon. To make the lexicon more accessible, Språkbanken has parsed the original files, and generated a version of …
Language:
Norwegian
Origin:
Language Bank
Licence:
Creative_Commons-BY (CC-BY)
Type:
Lexicon
Updated:
28.09.2021
Text
18.08.2021
Translation Memories from EFTA
These translation memories have been made by the EEA Coordination Division at the European Free Trade Association (EFTA) Secretariat in Brussels, where they are used on a daily basis as a work tool in …
Language:
English, Norwegian Bokmål, English, Norwegian Nynorsk
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Text
Updated:
18.08.2021
Speech, Text
04.05.2021
TAUS – The spoken language investigation in Oslo
The material from TAUS (The spoken language investigation in Oslo) is based on informal interviews with people from Oslo. The interviews were made in 1971-73. The informants are mainly from two …
Language:
Norwegian, Norwegian Bokmål
Origin:
CLARINO Text Laboratory Centre
Licence:
CLARIN_ACA-NC-LOC-PRIV-ND-*
Type:
Speech, Text
Updated:
04.05.2021
Text
04.05.2021
TAUS – downloadable transcriptions
TAUS (The spoken language investigation in Oslo) v.3 is a speech corpus with 86 speakers and 387 551 tokens. The downloadable version of the corpus contains the transcriptions, approx. 387 500 tokens, …
Language:
Norwegian, Norwegian Bokmål
Origin:
CLARINO Text Laboratory Centre
Licence:
Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
Type:
Text
Updated:
04.05.2021
Text
30.04.2021
Målfrid 2021 – Freely Available Documents from Norwegian State Institutions
This corpus consists of documents from 339 internet domains run by Norwegian state institutions, and comprises approximately 4.1 billion tokens (words and punctuation) in total, which makes it one of …
Language:
Norwegian Bokmål, Norwegian Nynorsk, Northern Sami, Southern Sami, Lule Sami, English
Origin:
Language Bank
Licence:
Norwegian Licence for Open Government Data (NLOD)
Type:
Text
Updated:
30.04.2021
Speech, Text, Video
16.04.2021
Norsk talespråkskorpus – Oslodelen
NoTa-Oslo is a speech corpus with interviews and conversations from 166 informants born and raised in Oslo and the Oslo area. The informants are carefully selected w.r.t. sociolinguistic variables and …
Language:
Norwegian, Norwegian Bokmål
Origin:
CLARINO Text Laboratory Centre
Licence:
CLARIN_ACA-NC-LOC-PRIV-ND-*
Type:
Speech, Text, Video
Updated:
16.04.2021
Text
16.04.2021
Nordic Dialect Corpus – downloadable transcriptions
Nordic Dialect Corpus v. 4.0 is a corpus of Norwegian, Swedish, Danish, Faroese, Icelandic and Övdalian spoken language. It consists of spontaneous speech data from dialects of the North Germanic …
Language:
Norwegian Bokmål (the orthographic transcriptions), Swedish (Övdalien included), Danish, Icelandic, Faroese
Origin:
CLARINO Text Laboratory Centre
Licence:
Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
Type:
Text
Updated:
16.04.2021
Speech, Text, Video
16.04.2021
Nordic Dialect Corpus v. 4.0
Nordic Dialect Corpus v.4.0 is a corpus of Norwegian, Swedish, Danish, Faroese, Icelandic and Övdalian spoken language. It consists of spontaneous speech data from dialects of the North Germanic …
Language:
Norwegian Bokmål (the orthographic transcriptions), Swedish (Övdalien included), Danish, Icelandic, Faroese
Origin:
CLARINO Text Laboratory Centre
Licence:
CLARIN_ACA-NC-LOC-PRIV-ND-*
Type:
Speech, Text, Video
Updated:
16.04.2021
Text
16.04.2021
NoTa-Oslo – downloadable transcriptions
NoTa-Oslo is a speech corpus with interviews and conversations from 166 informants born and raised in Oslo and the Oslo area. The informants are carefully selected w.r.t. sociolinguistic variables and …
Language:
Norwegian, Norwegian Bokmål
Origin:
CLARINO Text Laboratory Centre
Licence:
Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
Type:
Text
Updated:
16.04.2021
Vis filter
Skjul filter