Skip to content
National Library of Norway
|
Språkbanken
Norsk
The Norwegian Language Bank
Resource Catalogue
I samarbeid med
Vis filter
Skjul filter
Type
Origin
Vis filter
Skjul filter
27.08.2018
The Corpus of Free Trade Agreements (FTA)
Corpus of Free Trade Agreements (English/Spanish) The FTA corpus consists of 233 XML source files in each language. The corpus contains approximately 1370000 words in the English section and 1483000 …
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Updated:
27.08.2018
27.08.2018
The LOB corpus (POS tagged)
The Lancaster - Oslo/Bergen (LOB) Corpus is a million-word collection of present-day (1961) British English texts. The corpus was compiled under the direction of Geoffrey Leech, University of …
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Updated:
27.08.2018
Text
27.08.2018
The London-Lund Corpus of Spoken English (LLC)
The London-Lund Corpus contains samples of educated spoken British English, in orthographic transcription with detailed prosodic marking. It consists of 100 'texts', each of some 5,000 running words. …
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Type:
Text
Updated:
27.08.2018
Text
01.07.2018
Transcriptions and selected audio files from LIA Norwegian for download
All transcriptions from LIA Norwegian are downloadable in plain text format. A folder containing 553 transcriptions from LIA Norwegian, in ELAN format, along with their corresponding audio, can …
Language:
Norwegian, Norwegian Nynorsk
Origin:
CLARINO Text Laboratory Centre
Licence:
Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
Type:
Text
Updated:
01.07.2018
Speech, Text
01.07.2018
LIA Norwegian – Corpus of historical dialect recordings
LIA Norwegian is a speech corpus with old recordings (1939 - 1996) from four Norwegian universities: NTNU, UoB, UoO and UoT. The recordings are mainly made for dialect and onomastic research and the …
Language:
Norwegian, Norwegian Nynorsk
Origin:
CLARINO Text Laboratory Centre
Licence:
CLARIN_ACA-NC-LOC-PRIV-ND-*
Type:
Speech, Text
Updated:
01.07.2018
Lexicon
13.04.2018
SNORRE Terminology Database
The SNORRE Terminology Database has been developed by Standards Norway in cooperation with The Language Council of Norway and The Ministry of Culture. SNORRE contains technical terms and definitions …
Language:
Norwegian Bokmål, Norwegian Nynorsk, English
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Lexicon
Updated:
13.04.2018
Text
04.04.2018
CORYL (Corpus of Young Learner Language)
Coryl is a young learner corpus, which consists of English texts written by Norwegian pupils. The texts which make up the corpus, were collected in the course of the National Testing of English …
Language:
English
Origin:
CLARINO Bergen Centre
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
04.04.2018
Text
02.04.2018
Norwegian-English Parallel Corpus from Public Web Sites
This is a sentence-aligned parallel corpus built from the public web sites www.nav.no, www.nyinorge.no and skatteetaten.no. These web sites provide information in both Norwegian Bokmål and Nynorsk, …
Language:
Norwegian Nynorsk, English, Norwegian Bokmål, English
Origin:
Language Bank
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
02.04.2018
Text
02.04.2018
Public Nynorsk-English Parallel Corpus (PubNEPC)
PubNEPC is a Nynorsk-English sentence-aligned parallel corpus built from the public web sites www.nav.no and skatteetaten.no. The corpus contains only those sentences that have a corresponding …
Language:
Norwegian Nynorsk, English
Origin:
CLARINO Bergen Centre
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
02.04.2018
Text
02.04.2018
Public Bokmål-English Parallel Corpus (PubBEPC)
PubBEPC is a Bokmål-English sentence-aligned parallel corpus built from the public web sites www.nav.no, www.nyinorge.no and skatteetaten.no. The corpus contains only those sentences that have a …
Language:
Norwegian Bokmål, English
Origin:
CLARINO Bergen Centre
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
02.04.2018
Vis filter
Skjul filter