South Saami lemma frequency list
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: South Saami lemma frequency list
- description: The South Saami lemma frequency list is work done by the Giellatekno and Divvun research groups, Department of Linguistics, UiT The Arctic University of Norway, as well as by members of the language community. In particular, Ciprian-Virgil Gerstenberger compiled the list from the entire SIKOR South Saami corpus version 2015-10-10. The data is in an one-lemma-per-line format with the following values: <ABS_FREQ> <LEMMA> <POS>. Since the list has been derived automatically, it may contain wrong values. In case you find any errors the creators would appreciate your feedback sent to giellatekno@uit.no and feedback@divvun.no. Please note that the Giellatekno resources are dynamic in nature. A stable "snapshot" is deposited with regular intervals at the CLARINO Bergen repository for download. To ensure that you have a completely updated version, please contact Giellatekno (see Contact Info in metadata).
- resource Short Name: sma_lemma_freq_20151010
- distribution Info
- availability Start Date: 17.10.2015
- licence Info
- user Category: Public
- licence
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-BY (CC-BY)
- licence Url: http://creativecommons.org/licenses/by/4.0/
- conditions Of Use: BY
- contact
- actor Info
- actor Type: organization
- role: creator
- role: contact
- organization Info
- organization Name: Giellatekno, Saami Language Technology
- organization Short Name: Giellatekno
- department Name: Department of Linguistics, UiT The Arctic University of Norway
- communication Info
- actor Info
- actor Type: organization
- role: creator
- role: contact
- organization Info
- organization Name: The Divvun group at UiT
- organization Short Name: Divvun
- department Name: Department of Linguistics, UiT The Arctic University of Norway
- communication Info
- email: feedback@divvun.no
- url: http://divvun.no
- actor Info
- metadata Info
- metadata Creation Date: 14.12.2015
- metadata Last Date Updated: 18.03.2016
- metadata Creator
- actor Info
- actor Type: person
- role: contact
- role: creator
- person Info
- surname: Gerstenberger
- given Name: Ciprian-Virgil
- sex: male
- affiliation:
- organization Info
- organization Name: Norges arktiske universitet
- organization Name: The arctic university of Norway
- organization Short Name: UiT
- department Name: Giellatekno – Saami Language Technology
- communication Info
- actor Info
- resource Creation Info
- creation Start Date: 10.10.2015
- creation End Date: 10.10.2015
- resource Creator
- actor Info
- actor Type: person
- role: creator
- person Info
- surname: Gerstenberger
- given Name: Ciprian-Virgil
- affiliation:
- organization Info
- organization Name: Norges arktiske universitet
- organization Name: The arctic university of Norway
- organization Short Name: UiT
- department Name: Giellatekno – Saami Language Technology
- communication Info
- email: ciprian.gerstenberger@uit.no
- url: http://ansatte.uit.no/ciprian.gerstenberger
- city: Tromsø
- country: Norway
- actor Info
- corpus Info
- corpus Type: Written Corpus
- corpus Part Info
- media Type: textNgram
- corpus Text Ngram Info
- ngram Info
- base Item: word
- order: 1
- ngram Info
- corpus Part General Info
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: sma
- language Name: South Saami
- size Info
- size: 65,055
- size Unit: unigrams
- annotation Info
- annotation Type: lemmatization
- annotation Type: morphosyntacticAnnotation-posTagging
- annotation Description: The data format is an one-lemma-per-line with the following values: <ABS_FREQ> <LEMMA> <POS>.
- time Coverage Info
- time Coverage: 1993-2015
- linguality Info
dc:type | corpus |
dc:title | South Saami lemma frequency list |
dc:identifier | oai:repo.clarino.uib.no:11509/105 |
dc:description | The South Saami lemma frequency list is work done by the Giellatekno and Divvun research groups, Department of Linguistics, UiT The Arctic University of Norway, as well as by members of the language community. In particular, Ciprian-Virgil Gerstenberger compiled the list from the entire SIKOR South Saami corpus version 2015-10-10. The data is in an one-lemma-per-line format with the following values: <ABS_FREQ> <LEMMA> <POS>. Since the list has been derived automatically, it may contain wrong values. In case you find any errors the creators would appreciate your feedback sent to giellatekno@uit.no and feedback@divvun.no. Please note that the Giellatekno resources are dynamic in nature. A stable "snapshot" is deposited with regular intervals at the CLARINO Bergen repository for download. To ensure that you have a completely updated version, please contact Giellatekno (see Contact Info in metadata). |
dc:publisher | |
dc:format | |
dc:date | 2015-10-10 |
dc:date | 2015-10-10 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-BY (CC-BY) |
dc:rights | http://creativecommons.org/licenses/by/4.0/ |
dc:creator | Ciprian-Virgil Gerstenberger |
dc:lang | South Saami |