Norwegian Newspaper Corpus Bokmål
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: Norwegian Newspaper Corpus Bokmål
- resource Name: Norsk aviskorpus bokmål
- description: The Norwegian Newspaper Corpus (NNC) Bokmål version is a large monitor corpus representing contemporary Norwegian language in the written variety Norwegian Bokmål. A corresponding corpus is available for Norwegian nynorsk, see URL in metadata. The corpus is compiled through daily harvesting and processing of published texts from the web edition of Norwegian newspapers. The version available in Corpuscle is annotated with newspaper title and date, and spans from October 1998 to May 2020. To search in the full corpus until today's date, go to: http://avis.uib.no/avis/sok/copy_of_sok-i-hele-korpuset. That search interface has less functions than in Corpuscle, but allows you to search in the newest material. The Corpuscle material will be updated regularly. For questions about the contents of the corpus, contact Knut Hofland (Uni Research Computing). For questions related to the Corpuscle version, contact Paul Meurer (Uni Research Computing). See Contact information in metadata. To refer to the Norwegian newspaper Corpus, we suggest the following references: Norwegian Newspaper Corpus Bokmål. 2020. Created by the project Norsk aviskorpus. Distributed by the CLARINO Bergen Centre: hdl:11495/D9B5-0349-4330-0 Andersen, Gisle, and Knut Hofland. 2012. “Building a Large Corpus Based on Newspapers from the Web.” In Exploring Newspaper Language: Using the Web to Create and Investigate a Large Corpus of Modern Norwegian, edited by Gisle Andersen, 1–28. Studies in Corpus Linguistics 49. Amsterdam/Philadelphia: John Benjamins Publishing Company
- resource Short Name: NNC
- url: http://clarino.uib.no/korpuskel/landing-page?identifier=avis-plain&view=short
- url: http://avis.uib.no/
- url: http://clarino.uib.no/korpuskel/landing-page?identifier=avis-nno&view=short
- P I D: hdl:11495/D9B5-0349-4330-0
- identifier: avis-plain
- distribution Info
- licence Info
- user Category: Public
- distribution Access Medium: accessibleThroughInterface
- download Location: http://clarino.uib.no/korpuskel/landing-page?identifier=avis-plain
- licence
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-BY-NC (CC-BY-NC)
- licence Url: http://creativecommons.org/licenses/by-nc/4.0/
- conditions Of Use: BY
- conditions Of Use: NC
- ipr Holder
- actor Info
- actor Type: organization
- organization Info
- organization Name: Uni Research AS
- department Name: Uni Research Computing
- communication Info
- email: knut.hofland@uni.no
- url: http://uni.no/nb/staff/directory/knut-hofland/
- city: Bergen
- country: Norway
- telephone Number: +47 5558 9463
- actor Info
- licence Info
- contact
- actor Info
- actor Type: organization
- organization Info
- organization Name: CLARINO Bergen Centre
- communication Info
- email: clarin@uib.no
- url: https://repo.clarino.uib.no/xmlui/
- actor Info
- actor Type: person
- person Info
- surname: Hofland
- given Name: Knut
- sex: male
- position: Fagkonsulent / Specialist Consultant
- affiliation:
- organization Info
- organization Name: Uni Research AS
- department Name: Uni Research Computing
- communication Info
- email: knut.hofland@uni.no
- url: http://uni.no/nb/staff/directory/knut-hofland/
- city: Bergen
- country: Norway
- telephone Number: +47 5558 9463
- actor Info
- actor Info
- actor Type: person
- person Info
- surname: Meurer
- given Name: Paul
- sex: male
- position: Senior researcher
- affiliation:
- organization Info
- organization Name: Uni Research AS
- department Name: Uni Research Computing
- communication Info
- email: paul.meurer@uni.no
- metadata Creation Date: 29.09.2015
- metadata Last Date Updated: 21.12.2022
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: clarin@uib.no
- actor Info
- documentation Unstructured
- role: documentation
- document Unstructured: Andersen, Gisle, and Knut Hofland. 2012. “Building a Large Corpus Based on Newspapers from the Web.” In Exploring Newspaper Language: Using the Web to Create and Investigate a Large Corpus of Modern Norwegian, edited by Gisle Andersen, 1–28. Studies in Corpus Linguistics 49. Amsterdam/Philadelphia: John Benjamins Publishing Company
- creation Start Date: 1998
- funding Project:
- project Info
- project Name: Norsk aviskorpus
- url: http://avis.uib.no/avis/om-aviskorpuset/english
- funding Type: nationalFunds
- funder: Research Council of Norway
- funding Country: Norway
- corpus Info
- corpus Type: Written Corpus
- corpus Part Info
- media Type: text
- corpus Part General Info
- source Work Info
- title: The Newspaper corpus is compiled of text from the following ten newspapers that were incuded from the start on October 13 1998 (listed by newspaper code in the NCC and the full name of the newspaper): AA – Adresseavisen AP – Aftenposten BT – Bergens Tidende DA – Dagsavisen DB – Dagbladet DN – Dagens Næringsliv FV – Fædrelandsvennen NL – Nordlys OD – Odin (public information) SA – Stavanger Aftenblad VG – Verdens Gang
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: no
- language Name: Norwegian
- language Info
- language Id: nb
- language Name: Norwegian Bokmål
- modality Info
- modality Type: writtenLanguage
- size Info
- size: 2 117 226 336
- size Unit: tokens
- annotation Info
- annotation Type: other
- annotation Format: Annotated with newspaper title and date.
- classification Info
- genre Info
- genre Type: textGenre
- genre: newspaper and magazines
- genre Info
- time Coverage Info
- time Coverage: October 1998 – November 2022
- source Work Info
dc:type | corpus |
dc:title | Norwegian Newspaper Corpus Bokmål |
dc:identifier | oai:clarino.uib.no:avis-plain |
dc:description | The Norwegian Newspaper Corpus (NNC) Bokmål version is a large monitor corpus representing contemporary Norwegian language in the written variety Norwegian Bokmål. A corresponding corpus is available for Norwegian nynorsk, see URL in metadata. The corpus is compiled through daily harvesting and processing of published texts from the web edition of Norwegian newspapers. The version available in Corpuscle is annotated with newspaper title and date, and spans from October 1998 to May 2020. To search in the full corpus until today's date, go to: http://avis.uib.no/avis/sok/copy_of_sok-i-hele-korpuset. That search interface has less functions than in Corpuscle, but allows you to search in the newest material. The Corpuscle material will be updated regularly. For questions about the contents of the corpus, contact Knut Hofland (Uni Research Computing). For questions related to the Corpuscle version, contact Paul Meurer (Uni Research Computing). See Contact information in metadata. To refer to the Norwegian newspaper Corpus, we suggest the following references: Norwegian Newspaper Corpus Bokmål. 2020. Created by the project Norsk aviskorpus. Distributed by the CLARINO Bergen Centre: hdl:11495/D9B5-0349-4330-0 Andersen, Gisle, and Knut Hofland. 2012. “Building a Large Corpus Based on Newspapers from the Web.” In Exploring Newspaper Language: Using the Web to Create and Investigate a Large Corpus of Modern Norwegian, edited by Gisle Andersen, 1–28. Studies in Corpus Linguistics 49. Amsterdam/Philadelphia: John Benjamins Publishing Company |
dc:publisher | |
dc:format | accessibleThroughInterface |
dc:date | 1998 |
dc:date | |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-BY-NC (CC-BY-NC) |
dc:rights | http://creativecommons.org/licenses/by-nc/4.0/ |
dc:lang | Norwegian |
dc:lang | Norwegian Bokmål |