Frequency lists from NoWaC – Norwegian Web as Corpus
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: Frequency lists from NoWaC – Norwegian Web as Corpus
- description: Frequency lists from NoWaC (Norwegian Web as Corpus), a web-based corpus of Bokmål Norwegian containing about 700 million tokens. The corpus was built by crawling, downloading and processing web documents in the .no top-level internet domain between November 2009 and January 2010. NoWaC was built with permission from the Norwegian Ministry of Culture (Kulturdepartementet).
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/nowac/index.html
- PID: http://hdl.handle.net/11538/0000-0005-E7C0-D
- distribution Info
- licence Info
- user Category: Public
- distribution Access Medium: downloadable
- download Location: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/tjenester/nowac-frequency.html
- licence
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
- licence Url: http://creativecommons.org/licenses/by-nc-sa/2.0/
- conditions Of Use: BY
- conditions Of Use: NC
- conditions Of Use: SA
- licensor:
- actor Info
- actor Type: organization
- organization Info
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Department of Linguistics and Scandinavian Studies
- department Name: Institutt for lingvistiske og nordiske studier (ILN)
- communication Info
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- distribution Rights Holder
- actor Info
- actor Type: organization
- organization Info
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Department of Linguistics and Scandinavian Studies
- department Name: Institutt for lingvistiske og nordiske studier (ILN)
- communication Info
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- contact
- actor Info
- actor Type: organization
- organization Info
- organization Name: The Text Laboratory
- organization Short Name: Textlab
- department Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- communication Info
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- metadata Info
- metadata Creation Date: 17.02.2015
- metadata Last Date Updated: 05.06.2018
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Hagen
- given Name: Kristin
- sex: female
- organization Info
- organization Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- organization Short Name: ILN
- department Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- communication Info
- email: kristin.hagen@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- version Info
- version: v 1.0
- resource Documentation Info
- documentation Unstructured
- role: documentation
- document Unstructured: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/tjenester/nowac-frequency.html
- resource Creation Info
- creation Start Date: 01.01.2012
- creation End Date: 31.12.2012
- resource Creator
- actor Info
- actor Type: organization
- organization Info
- organization Name: The Text Laboratory
- organization Short Name: Textlab
- department Name: Department of Linguistics and Scandinavian Studies, University of Oslo
- communication Info
- email: tekstlab-post@iln.uio.no
- url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
- address: Box 1102 Blindern
- zip Code: 0317
- city: OSLO
- country: Norway
- relation Info
- resource Relation
- related Resource
- reference Scope: externalResource
- resource Reference: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/nowac/index.html
- related Resource
- reference Scope: thisResource
- relation Type
- relation Name: source
- corpus Info
- corpus Type: Ngram Corpus
- corpus Part Info
- media Type: textNgram
- corpus Text Ngram Info
- ngram Info
- base Item: word
- order: 1
- text Format Info
- mime Type: txt
- character Encoding Info
- character Encoding: utf-8
- corpus Part General Info
- source Work Info
- title: NoWaC v 1.0 (Norwegian Web as Corpus)
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: No
- language Name: Norwegian
- language Info
- language Id: Nb
- language Name: Norwegian bokmål
- time Coverage Info
- time Coverage: November 2009 – January 2010
dc:type | corpus |
dc:title | Frequency lists from NoWaC – Norwegian Web as Corpus |
dc:identifier | oai:tekstlab.uio.no:nowac-freq |
dc:description | Frequency lists from NoWaC (Norwegian Web as Corpus), a web-based corpus of Bokmål Norwegian containing about 700 million tokens. The corpus was built by crawling, downloading and processing web documents in the .no top-level internet domain between November 2009 and January 2010. NoWaC was built with permission from the Norwegian Ministry of Culture (Kulturdepartementet). |
dc:publisher | |
dc:format | downloadable |
dc:date | 2012-01-01 |
dc:date | 2012-12-31 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-BY-NC-SA (CC-BY-NC-SA) |
dc:rights | http://creativecommons.org/licenses/by-nc-sa/2.0/ |
dc:creator | The Text Laboratory |
dc:lang | Norwegian |
dc:lang | Norwegian bokmål |
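
The record above describes the resource as word unigram lists (base item: word, order: 1) stored as UTF-8 text. As a rough illustration of how such a list might be read, here is a minimal Python sketch. The line layout is an assumption: the record does not specify it, so the sketch supposes one entry per line in the form "<count> <word>", with optional leading whitespace. The function names are hypothetical and are not part of the resource; check the documentation page listed in the record for the actual layout before relying on this.

```python
from collections import Counter
from typing import Iterable


def parse_frequency_lines(lines: Iterable[str]) -> Counter:
    """Parse lines of the ASSUMED form '<count> <word>' into a Counter."""
    counts: Counter = Counter()
    for line in lines:
        # Split on the first run of whitespace: count first, word second.
        parts = line.split(None, 1)
        if len(parts) != 2 or not parts[0].isdigit():
            continue  # skip blank or malformed lines
        counts[parts[1].strip()] += int(parts[0])
    return counts


def load_frequency_list(path: str) -> Counter:
    """Read a frequency list file; the record declares UTF-8 encoding."""
    with open(path, encoding="utf-8") as handle:
        return parse_frequency_lines(handle)


# Small in-memory sample in the assumed layout (not real NoWaC counts).
sample = ["  120 og\n", "   95 i\n", "\n", "bad line\n", "    7 språk\n"]
freqs = parse_frequency_lines(sample)
top = freqs.most_common(2)
```

Parsing from an iterable of lines keeps the sketch testable without a downloaded file; `load_frequency_list` only adds the UTF-8 file handling on top.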