N-gram – Norwegian Nynorsk
Extended metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: N-gram – Norwegian Nynorsk
- resource Name: N-gram – nynorsk
- description: This corpus is a collection of n-grams (n=1-6) built from approximately 60 million words of running text drawn from the Nynorsk part of the Norwegian Newspaper Corpus and the text corpus of Nordic Language Technology AS. This version contains all the n-grams in two orderings: sorted by frequency and sorted alphabetically. A version listing only the 1000 most frequent n-grams is also available for download, as are separate frequency lists of single words (unigrams).
- description: Based on the Nynorsk texts in the Norwegian Newspaper Corpus (Norsk aviskorpus) and the text corpus of Nordisk språkteknologi, Språkbanken has produced n-grams (n=1-6) for approximately 60 million words of running text. This version contains all the n-grams, sorted alphabetically and by frequency. The material can also be downloaded as a simple overview of the 1000 most frequent n-grams, and additionally as frequency lists of the individual words (unigrams).
- url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-8/
- PID: hdl:21.11146/8
- identifier: sbr-8
- distribution Info:
- licence Info:
- user Category: Public
- distribution Access Medium: downloadable
- download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-8/
- licence:
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-ZERO (CC-ZERO)
- licence Url: https://creativecommons.org/publicdomain/zero/1.0/
- licensor:
- actor Info:
- actor Type: organization
- role: Licensor
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- distribution Rights Holder:
- actor Info:
- actor Type: organization
- role: Distribution Rights Holder
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- actor Info:
- actor Type: organization
- role: Contact
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- actor Info:
- actor Type: person
- role: Metadata Creator
- person Info:
- surname: Birkenes
- given Name: Magnus Breder
- affiliation:
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- actor Info:
- actor Type: person
- role: Resource Creator
- person Info:
- surname: Hofland
- given Name: Knut
- affiliation:
- organization Info:
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UiB
- corpus Info:
- corpus Type: Ngram Corpus
- corpus Part Info:
- media Type: textNgram
- corpus Text Ngram Info:
- ngram Info:
- base Item: word
- order: 6
- text Format Info:
- mime Type: text/plain
- size Per Text Format:
- size Info:
- size: 59300000
- size Unit: words
- size Info:
- size: 7.9
- size Unit: GB
- character Encoding Info:
- character Encoding: Windows
- corpus Part General Info:
- linguality Info:
- linguality Type: monolingual
- language Info:
- language Id: nn
- language Name: Norwegian Nynorsk
- size Per Language:
- size Info:
- size: 59300000
- size Unit: words
- size Info:
- size: 7.9
- size Unit: GB
- language Variety Info:
- language Variety Type: other
- language Variety Name: news text
- size Per Language Variety:
- size Info:
- size: 59300000
- size Unit: words
- size Info:
- size: 7.9
- size Unit: GB
- modality Info:
- modality Type: writtenLanguage
- modality Type Details: news text
- size Per Modality:
- size Info:
- size: 59300000
- size Unit: words
- size Info:
- size: 7.9
- size Unit: GB
- size Info:
- size: 59300000
- size Unit: words
- size Info:
- size: 7.9
- size Unit: GB
dc:type | corpus |
dc:title | N-gram – nynorsk |
dc:identifier | oai:nb.no:sbr-8 |
dc:description | Based on the Nynorsk texts in the Norwegian Newspaper Corpus (Norsk aviskorpus) and the text corpus of Nordisk språkteknologi, Språkbanken has produced n-grams (n=1-6) for approximately 60 million words of running text. This version contains all the n-grams, sorted alphabetically and by frequency. The material can also be downloaded as a simple overview of the 1000 most frequent n-grams, and additionally as frequency lists of the individual words (unigrams). |
dc:publisher | |
dc:format | downloadable |
dc:date | 2012-01-02 |
dc:date | 2012-06-11 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-ZERO (CC-ZERO) |
dc:rights | https://creativecommons.org/publicdomain/zero/1.0/ |
dc:creator | Knut Hofland |
dc:lang | nynorsk |
Download resources
- ngram_nno.zip
- 1gram_nno_abc.zip
- 1gram_nno_f1_abc.zip
- 1gram_nno_f1_freq.zip
- 1gram_nno_abc.zip
- ngram_nno.pdf
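
The description above says the corpus holds word n-grams of order 1 to 6, listed both by frequency and alphabetically. The sketch below shows what that means on a toy sentence. It is not the procedure used to build the corpus: the function names (`count_ngrams`, `by_frequency`, `alphabetically`), the whitespace tokenisation, and the sample sentence are all assumptions made for the example, and nothing here reflects the layout of the downloadable files.

```python
from collections import Counter


def count_ngrams(tokens, max_n=6):
    """Count every word n-gram of order 1..max_n in a token sequence."""
    counts = {n: Counter() for n in range(1, max_n + 1)}
    for n in range(1, max_n + 1):
        # Slide a window of width n over the tokens.
        for i in range(len(tokens) - n + 1):
            counts[n][tuple(tokens[i:i + n])] += 1
    return counts


def by_frequency(counter):
    """Most frequent n-grams first; ties are broken alphabetically."""
    return sorted(counter.items(), key=lambda kv: (-kv[1], kv[0]))


def alphabetically(counter):
    """N-grams in code-point order, each with its count."""
    return sorted(counter.items(), key=lambda kv: kv[0])


# Hypothetical sample text, split on whitespace (an assumption; the
# corpus's own tokenisation is not described in this record).
tokens = "det er ein fin dag og det er ein god dag".split()
counts = count_ngrams(tokens)

for ngram, freq in by_frequency(counts[2])[:3]:
    print(" ".join(ngram), freq)
```

Python sorts strings by code point, which places å before æ and ø, so the alphabetical order produced here can differ from Norwegian collation and from the ordering used in the published lists.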