Skip to content
National Library of Norway
|
Språkbanken
Norsk
The Norwegian Language Bank
Resource Catalogue
I samarbeid med
Vis filter
Skjul filter
Type
Origin
Vis filter
Skjul filter
27.08.2018
The Freiburg – Brown Corpus of American English
The Freiburg - Brown Corpus of American English (Frown) contains texts from 1991. Like the original Brown and LOB corpora, Frown contains 500 texts of around 2000 words each, distributed across 15 …
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Updated:
27.08.2018
27.08.2018
The Corpus of Free Trade Agreements (FTA)
Corpus of Free Trade Agreements (English/Spanish) The FTA corpus consists of 233 XML source files in each language. The corpus contains approximately 1370000 words in the English section and 1483000 …
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Updated:
27.08.2018
27.08.2018
The LOB corpus (POS tagged)
The Lancaster - Oslo/Bergen (LOB) Corpus is a million-word collection of present-day (1961) British English texts. The corpus was compiled under the direction of Geoffrey Leech, University of …
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Updated:
27.08.2018
Text
27.08.2018
The London-Lund Corpus of Spoken English (LLC)
The London-Lund Corpus contains samples of educated spoken British English, in orthographic transcription with detailed prosodic marking. It consists of 100 'texts', each of some 5,000 running words. …
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Type:
Text
Updated:
27.08.2018
Text
27.08.2018
Helsinki Corpus of Older Scots
The Helsinki Corpus of Older Scots was compiled as a supplement to the diachronic part of the Helsinki Corpus of English Texts. The Scottish texts were selected according to the same principles of …
Origin:
CLARINO Bergen Centre
Licence:
CLARIN_ACA
Type:
Text
Updated:
27.08.2018
Tool
05.06.2018
The Oslo-Bergen Tagger
The Oslo-Bergen tagger is a robust morphological and syntactic tagger developed at the University of Oslo and at Uni Computing in Bergen over several years. The tagger consists of three main modules: …
Language:
Norwegian
Origin:
CLARINO Text Laboratory Centre
Licence:
General Public License (GPL)
Type:
Tool
Updated:
05.06.2018
Text
05.06.2018
Nordic Syntax Database
The database consists of judgments by 924 Nordic dialect speakers from 207 places to a list of sentences that illustrate various syntactic phenomena. Many of the speakers are the same in both database …
Language:
Norwegian, Swedish, Danish, Icelandic, Faroese
Origin:
CLARINO Text Laboratory Centre
Licence:
CLARIN_ACA-NC-LOC-PRIV-ND-*
Type:
Text
Updated:
05.06.2018
Text
05.06.2018
NoWaC v 1.0 (Norwegian Web as Corpus)
Web-based corpus of Bokmål Norwegian containing about 700 million tokens. The corpus has been built by crawling, downloading and processing web documents in the .no top-level internet domain between …
Language:
Norwegian Bokmål
Origin:
CLARINO Text Laboratory Centre
Licence:
Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
Type:
Text
Updated:
05.06.2018
Tool
05.06.2018
Glossa (new version)
Glossa is a tool for researchers who want to search linguistically annotated corpora. Glossa is designed to make it easy for researchers to: - create complex searches - explore the result via e.g. …
Language:
Origin:
CLARINO Text Laboratory Centre
Licence:
MIT license
Type:
Tool
Updated:
05.06.2018
Text
05.06.2018
Frequency lists from NoWaC – Norwegian Web as Corpus
Frequency lists from NoWaC - Norwegian Web as Corpus - a web-based corpus of Bokmål Norwegian containing about 700 million tokens. The corpus has been built by crawling, downloading and processing …
Language:
Norwegian, Norwegian bokmål
Origin:
CLARINO Text Laboratory Centre
Licence:
Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
Type:
Text
Updated:
05.06.2018
Vis filter
Skjul filter