The Humit Tagger
Extended metadata
- resource Common Info:
- resource Type: toolService
- identification Info:
- resource Name: The Humit Tagger
- resource Name: Humit-taggeren
- description: The Humit Tagger is a morphological AI tagger for Norwegian Bokmål and Nynorsk developed at Humit, University of Oslo. The tagger is based on a neural network, more precisely a pre-trained BERT model for Norwegian, developed by the National Library of Norway. The tagger is a so-called sequence classifier, which selects morphological tags but not lemmas. In this first version of the Humit Tagger, the full-form word list from Norsk ordbank is used as a basis for lemma selection.
- resource Short Name: humit-tagger
- url: https://www.hf.uio.no/humit/english/resources/humit-tagger/index.html
- distribution Info:
- licence Info:
- distribution Access Medium: Downloadable
- download Location: https://github.com/humit-oslo/humit-tagger
- execution Location: https://tekstlab.uio.no/humtag_nett/
- licence:
- licence Family: MIT
- licence Name: MIT
- licence Url: http://en.wikipedia.org/wiki/MIT_License
- conditions Of Use: BY
- licensor:
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Humit – senter for digital utvikling på HF
- department Name: Humit – Centre for digital development at HF
- communication Info:
- email: humit@hf.uio.no
- url: https://www.hf.uio.no/humit/english/
- city: OSLO
- country: Norway
- distribution Rights Holder
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: University of Oslo
- organization Name: Universitetet i Oslo
- organization Short Name: UiO
- organization Short Name: UoO
- department Name: Humit – Centre for digital development at HF
- department Name: Humit – senter for digital utvikling på HF
- communication Info:
- email: humit@hf.uio.no
- url: https://www.hf.uio.no/humit/english/
- city: Oslo
- country: Norway
- actor Info:
- actor Type: organization
- organization Info:
- organization Name: Humit – Centre for digital development at HF
- organization Short Name: Humit
- actor Info:
- actor Type: person
- organization Info:
- organization Name: Humit – Centre for digital development at HF
- organization Short Name: Humit
- actor Info:
- actor Type: person
- person Info:
- surname: Hagen
- given Name: Kristin
- actor Info:
- actor Type: organization
- communication Info:
- email: humit@hf.uio.no
- tool Info:
- input Info:
- media Type: text
- resource Type: corpus
- modality Type: writtenLanguage
- language Name: Norwegian
- language Name: Norwegian Bokmål
- language Name: Norwegian Nynorsk
- language Id: No
- language Id: Nb
- language Id: Nn
- mime Type: txt, xml
- character Encoding: utf-8
- annotation Type: lemmatization
- annotation Type: morphosyntacticAnnotation-posTagging
- tagset: http://www.tekstlab.uio.no/obt-ny/english/tagset.html
- segmentation Level: word
- segmentation Level: clause
- output Info:
- media Type: text
- resource Type: corpus
- modality Type: writtenLanguage
- language Name: Norwegian
- language Name: Norwegian Bokmål
- language Name: Norwegian Nynorsk
- language Id: No
- language Id: Nb
- language Id: Nn
- mime Type: txt, xml
- character Encoding: utf-8
- tagset: http://www.tekstlab.uio.no/obt-ny/english/tagset.html
- segmentation Level: clause
- segmentation Level: word
- tool Service Operation Info:
- operating System: See https://github.com/humit-oslo/humit-tagger
dc:type | toolService |
dc:title | The Humit Tagger |
dc:identifier | oai:tekstlab.uio.no:humit-tagger |
dc:description | The Humit Tagger is a morphological AI tagger for Norwegian Bokmål and Nynorsk developed at Humit, University of Oslo. The tagger is based on a neural network, more precisely a pre-trained BERT model for Norwegian, developed by the National Library of Norway. The tagger is a so-called sequence classifier, which selects morphological tags but not lemmas. In this first version of the Humit Tagger, the full-form word list from Norsk ordbank is used as a basis for lemma selection. |
dc:publisher | |
dc:format | Downloadable |
dc:date | 2022 |
dc:date | 2024 |
dc:rights | |
dc:rights | MIT |
dc:rights | MIT |
dc:rights | http://en.wikipedia.org/wiki/MIT_License |
dc:lang | Norwegian |