Resources from the resource bank Archive - Page 11 of 120 - Språkbanken

I samarbeid med

The Corpus of Free Trade Agreements (FTA)

Corpus of Free Trade Agreements (English/Spanish) The FTA corpus consists of 233 XML source files in each language. The corpus contains approximately 1370000 words in the English section and 1483000 …

Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Updated:
27.08.2018
The LOB corpus (POS tagged)

The Lancaster - Oslo/Bergen (LOB) Corpus is a million-word collection of present-day (1961) British English texts. The corpus was compiled under the direction of Geoffrey Leech, University of …

Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Updated:
27.08.2018
The London-Lund Corpus of Spoken English (LLC)

The London-Lund Corpus contains samples of educated spoken British English, in orthographic transcription with detailed prosodic marking. It consists of 100 'texts', each of some 5,000 running words. …

Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Type:
Text
Updated:
27.08.2018
Transcriptions and selected audio files from LIA Norwegian for download

All transcriptions from LIA Norwegian are downloadable in plain text format. A folder containing 553 transcriptions from LIA Norwegian, in ELAN format, along with their corresponding audio, can …

Language:
Norwegian, Norwegian Nynorsk
Origin:
CLARINO Text Laboratory Centre
Licence:
Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
Type:
Text
Updated:
01.07.2018
LIA Norwegian – Corpus of historical dialect recordings

LIA Norwegian is a speech corpus with old recordings (1939 - 1996) from four Norwegian universities: NTNU, UoB, UoO and UoT. The recordings are mainly made for dialect and onomastic research and the …

Language:
Norwegian, Norwegian Nynorsk
Origin:
CLARINO Text Laboratory Centre
Licence:
CLARIN_ACA-NC-LOC-PRIV-ND-*
Type:
Speech, Text
Updated:
01.07.2018
SNORRE Terminology Database

The SNORRE Terminology Database has been developed by Standards Norway in cooperation with The Language Council of Norway and The Ministry of Culture. SNORRE contains technical terms and definitions …

Language:
Norwegian Bokmål, Norwegian Nynorsk, English
Origin:
Language Bank
Licence:
Creative_Commons-ZERO (CC-ZERO)
Type:
Lexicon
Updated:
13.04.2018
CORYL (Corpus of Young Learner Language)

Coryl is a young learner corpus, which consists of English texts written by Norwegian pupils. The texts which make up the corpus, were collected in the course of the National Testing of English …

Language:
English
Origin:
CLARINO Bergen Centre
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
04.04.2018
Norwegian-English Parallel Corpus from Public Web Sites

This is a sentence-aligned parallel corpus built from the public web sites www.nav.no, www.nyinorge.no and skatteetaten.no. These web sites provide information in both Norwegian Bokmål and Nynorsk, …

Language:
Norwegian Nynorsk, English, Norwegian Bokmål, English
Origin:
Language Bank
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
02.04.2018
Public Nynorsk-English Parallel Corpus (PubNEPC)

PubNEPC is a Nynorsk-English sentence-aligned parallel corpus built from the public web sites www.nav.no and skatteetaten.no. The corpus contains only those sentences that have a corresponding …

Language:
Norwegian Nynorsk, English
Origin:
CLARINO Bergen Centre
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
02.04.2018
Public Bokmål-English Parallel Corpus (PubBEPC)

PubBEPC is a Bokmål-English sentence-aligned parallel corpus built from the public web sites www.nav.no, www.nyinorge.no and skatteetaten.no. The corpus contains only those sentences that have a …

Language:
Norwegian Bokmål, English
Origin:
CLARINO Bergen Centre
Licence:
Creative_Commons-BY (CC-BY)
Type:
Text
Updated:
02.04.2018