Skip to content
National Library of Norway
|
Språkbanken
Norsk
The Norwegian Language Bank
Resource Catalogue
I samarbeid med
Vis filter
Skjul filter
Type
Origin
Vis filter
Skjul filter
Text
10.10.2024
Norwegian idioms
This dataset consists of 3537 Norwegian idioms and phrases that appear more than 100 times in the online library of the National Library of Norway. There are 3455 idioms in Norwegian Bokmål and 88 in …
Language:
Norwegian Bokmål, Norwegian Nynorsk
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Text
Updated:
10.10.2024
Speech
10.07.2024
Norwegian Government Press Conference Speech Corpus
The Norwegian Government Press Conference Speech Corpus (NorGovPCC) consists of approximately 138 hours of speech generated from audio with aligned subtitles from press conferences published by the …
Origin:
Language Bank
Licence:
Norwegian Licence for Open Government Data (NLOD)
Type:
Speech
Updated:
10.07.2024
Speech, Text
23.03.2024
TeflonNorL2
This page is currently a placeholder for the Norwegian data in the Teflon project. The Teflon project (https://teflon.aalto.fi/) aims at studying computer assisted language learning for immigrant …
Language:
Norwegian
Origin:
Language Bank
Licence:
unspecified
Type:
Speech, Text
Updated:
23.03.2024
Tool
09.02.2024
Grapheme-to-Phoneme Models for Norwegian Bokmål
This resource contains Grapheme-to-Phoneme (G2P) models for Norwegian Bokmål, which have been adapted to the G2P system Phonetisaurus (https://github.com/AdolfVonKleist/Phonetisaurus). The G2P models …
Language:
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Tool
Updated:
09.02.2024
Tool
11.01.2024
Glossa
Glossa is a tool for researchers who want to search linguistically annotated corpora. Glossa is designed to make it easy for researchers to: - create complex searches - explore the result via e.g. …
Language:
Origin:
CLARINO Text Laboratory Centre
Licence:
MIT license
Type:
Tool
Updated:
11.01.2024
Speech, Text
19.12.2023
NST Norwegian ASR Database (16 kHz) – Reorganized
This database was created by Nordic Language Technology for the development of automatic speech recognition and dictation in Norwegian. In this version (from 2022), the organization of the data has …
Language:
Norwegian
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Speech, Text
Updated:
19.12.2023
Tool
20.11.2023
Mapping between Norwegian municipalities and dialect regions
This resource provides a mapping between Norwegian municipalities and dialect regions, and can be used, e.g., to infer the dialect region of a speaker in a speech dataset based on their place of …
Origin:
Language Bank
Licence:
Creative_Commons-BY (CC-BY)
Type:
Tool
Updated:
20.11.2023
Speech, Text
15.11.2023
Stortinget Speech Corpus version 1.0
The Stortinget Speech Corpus (SSC) is a 5000+ hours speech dataset for weak supervision ASR created from audio and aligned proceedings text from Stortinget, the Norwegian Parliament. It contains …
Language:
Norwegian
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Speech, Text
Updated:
15.11.2023
Text
27.10.2023
NDT 2.0 with Constituent Structure
In this version of the Norwegian Dependency Treebank 2.0 constituent structure (c-structure) similar to the one found in NorGramBank has been added. This can be used to train one syntactic parser for …
Language:
Norwegian Bokmål, Norwegian Nynorsk
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Text
Updated:
27.10.2023
Tool
20.10.2023
spaCy for Norwegian Nynorsk
These spaCy models are trained on the NorNE dataset in a version compatible with Universal Dependencies. spaCy is a widely used library in python for language technology applications. spaCy does not …
Origin:
Language Bank
Licence:
MIT license
Type:
Tool
Updated:
20.10.2023
Vis filter
Skjul filter