Public Domain Texts from NBdigital
Extended metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: Public Domain Texts from NBdigital
- resource Name: Fritt tilgjengelege tekster frå NBdigital
- description: This corpus consists of public domain texts from the National Library's online collection. The corpus contains 26,344 books (and other written material) by 10,756 different authors (including, e.g., public institutions for publically available material). The material is downloadable as compressed tar-files containing the texts in two formats: html and simple text without any markup. The character encoding is UTF-8 for both formats. The quality of the texts varies depending on the quality of the OCR. In addition to texts in Norwegian (Bokmål and Nynorsk), the collection contains texts in several other languages.
- description: Denne tekstsamlinga er sett saman av tekster som ikkje er underlagt opphavsrettslege restriksjonar (lenger). Materialet består av 26.344 OCR-handsama tekster fordelte på 10.756 ulike forfattarar og andre tekstprodusentar (t.d. offentlege institusjonar). Materialet kan lastast ned som komprimerte tar.gz-filer som inneheld tekstene i to format: html- og tekstfiler utan nokon form for koding. Teiknkodinga er UTF-8 for begge formata. Tekstene er henta rett ut frå Nettbiblioteket. Kvaliteten på tekstene er varierande, avhengig av kor god OCR-lesinga er. I tillegg til tekster på norsk (bokmål og nynorsk), inneheld samlinga tekster på fleire andre språk.
- url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-34/
- P I D: hdl:21.11146/34
- identifier: sbr-34
- distribution Info:
- licence Info:
- user Category: Public
- distribution Access Medium: downloadable
- download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-34/
- licence:
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-ZERO (CC-ZERO)
- licence Url: https://creativecommons.org/publicdomain/zero/1.0/
- licensor:
- actor Info:
- actor Type: organization
- role: Licensor
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- distribution Rights Holder
- actor Info:
- actor Type: organization
- role: Distribution Rights Holder
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- actor Info:
- actor Type: organization
- role: Contact
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- actor Info:
- actor Type: person
- role: Metadata Creator
- person Info:
- surname: Ohren
- given Name: Oddrun Pauline
- affiliation:
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: Acquisition and Bibliographic Services
- department Name: Tilvekst og kunnskapsorganisering
- actor Info:
- actor Type: organization
- role: Resource Creator
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- corpus Info:
- corpus Type: Multilingual Corpus
- corpus Part Info:
- media Type: text
- corpus Text Info:
- text Format Info:
- mime Type: text/plain
- size Per Text Format:
- size Info:
- size: 26344
- size Unit: texts
- text Format Info:
- mime Type: text/html
- size Per Text Format:
- size Info:
- size: 26344
- size Unit: texts
- character Encoding Info:
- character Encoding: UTF-8
- corpus Part General Info:
- linguality Info:
- linguality Type: multilingual
- multilinguality Type: other
- multilinguality Type Details: Books and text in various languages
- language Info:
- language Id: nb
- language Name: Norwegian Bokmål
- language Info:
- language Id: nn
- language Name: Norwegian Nynorsk
- language Info:
- language Id: no
- language Name: Norwegian
- modality Info:
- modality Type: writtenLanguage
- size Info:
- size: 26344
- size Unit: texts
dc:type | corpus |
dc:title | Public Domain Texts from NBdigital |
dc:identifier | oai:nb.no:sbr-34 |
dc:description | This corpus consists of public domain texts from the National Library's online collection. The corpus contains 26,344 books (and other written material) by 10,756 different authors (including, e.g., public institutions for publically available material). The material is downloadable as compressed tar-files containing the texts in two formats: html and simple text without any markup. The character encoding is UTF-8 for both formats. The quality of the texts varies depending on the quality of the OCR. In addition to texts in Norwegian (Bokmål and Nynorsk), the collection contains texts in several other languages. |
dc:publisher | |
dc:format | downloadable |
dc:date | 2015-05-26 |
dc:date | 2015-05-28 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-ZERO (CC-ZERO) |
dc:rights | https://creativecommons.org/publicdomain/zero/1.0/ |
dc:creator | National Library of Norway |
dc:lang | Norwegian Bokmål |
dc:lang | Norwegian Nynorsk |
dc:lang | Norwegian |