Norwegian UD Treebank (Norsk UD-trebank)
Extended metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: Norwegian UD Treebank
- resource Name: Norsk UD-trebank
- description: Universal Dependencies (UD) is a framework for annotating grammar consistently across languages. The grammatical annotations include tokenization, part-of-speech (POS) tags, morphological features, and dependency relations. For more information about the annotation standard and guidelines, see the official UD documentation (https://universaldependencies.org/guidelines.html). The UD treebanks for Bokmål (https://universaldependencies.org/treebanks/no_bokmaal/index.html) and Nynorsk (https://universaldependencies.org/treebanks/no_nynorsk/index.html) are based on the Norwegian Dependency Treebank (NDT). The annotations have been automatically converted to the UD standard with Grew (https://grew.fr/). The conversion scripts are publicly available in the GitHub repository grew_ndt2ud (https://github.com/Sprakbanken/grew_ndt2ud).
- description: Universal Dependencies (UD) is a framework for annotating grammar uniformly across different languages. The categories of grammatical annotation include word segmentation (tokenization), parts of speech (POS), morphological features, and syntactic relations (dependency relations). For more information on the annotation standard and guidelines, see https://universaldependencies.org/guidelines.html. The UD treebanks for Bokmål and Nynorsk are based on the Norwegian Dependency Treebank (Norsk Dependenstrebank, NDT). The analyses in NDT have been automatically converted to the UD annotation standard with Grew (https://grew.fr/). The conversion code is publicly available in the GitHub repository grew_ndt2ud (https://github.com/Sprakbanken/grew_ndt2ud).
- resource Short Name: UD-NO
- url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-83/
- PID: hdl:21.11146/83
- identifier: sbr-83
- distribution Info:
- licence Info:
- user Category: Public
- distribution Access Medium: downloadable
- download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-83/
- licence:
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-BY-SA (CC-BY-SA)
- licence Url: https://creativecommons.org/licenses/by-sa/4.0/
- conditions Of Use: BY
- conditions Of Use: SA
- contact:
- actor Info:
- actor Type: organization
- role: Contact
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- actor Info:
- actor Type: organization
- role: Metadata Creator
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- actor Info:
- actor Type: organization
- role: Resource Creator
- organization Info:
- organization Name: National Library of Norway
- organization Name: Nasjonalbiblioteket
- organization Short Name: NLN
- organization Short Name: NB
- department Name: The Language Bank
- department Name: Språkbanken
- corpus Info:
- corpus Type: Treebank
- corpus Part Info:
- media Type: text
- corpus Text Info:
- text Format Info:
- mime Type: text/plain
- size Per Text Format:
- size Info:
- size: 611574
- size Unit: tokens
- size Info:
- size: 37619
- size Unit: sentences
- character Encoding Info:
- character Encoding: UTF-8
- corpus Part General Info:
- linguality Info:
- linguality Type: bilingual
- multilinguality Type: comparable
- language Info:
- language Id: nb
- language Name: Norwegian Bokmål
- size Per Language:
- size Info:
- size: 310221
- size Unit: tokens
- size Info:
- size: 20044
- size Unit: sentences
- language Info:
- language Id: nn
- language Name: Norwegian Nynorsk
- size Per Language:
- size Info:
- size: 301353
- size Unit: tokens
- size Info:
- size: 17575
- size Unit: sentences
- annotation Info:
- annotation Type: syntacticAnnotation-treebanks
- segmentation Level: word
- annotation Format: Universal Dependencies (UD)
- tagset: https://universaldependencies.org/
- time Coverage Info:
- time Coverage: 2000-2013
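The record gives the annotation format as Universal Dependencies with word-level segmentation, and reports sizes in tokens and sentences. As a minimal sketch, assuming the downloaded files use the ten-column, tab-separated CoNLL-U layout in which UD treebanks are normally distributed (the record itself only states text/plain), the following Python reads sentences and counts tokens. The sample sentence, the `Token` class, and the function names are invented for illustration and are not taken from the treebank.

```python
from dataclasses import dataclass

# The ten CoNLL-U columns, in order.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")


@dataclass
class Token:
    """Hypothetical container for the columns used in this sketch."""
    id: int
    form: str
    lemma: str
    upos: str
    feats: dict
    head: int
    deprel: str


def parse_feats(raw):
    """Turn 'Definite=Def|Number=Sing' into a dict; '_' means no features."""
    if raw == "_":
        return {}
    return dict(pair.split("=", 1) for pair in raw.split("|"))


def parse_conllu(text):
    """Return a list of sentences, each a list of Token objects."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comments such as '# text = ...'
        cols = line.split("\t")
        if len(cols) != len(FIELDS):
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        row = dict(zip(FIELDS, cols))
        if not row["id"].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        current.append(Token(
            id=int(row["id"]),
            form=row["form"],
            lemma=row["lemma"],
            upos=row["upos"],
            feats=parse_feats(row["feats"]),
            head=int(row["head"]),
            deprel=row["deprel"],
        ))
    if current:
        sentences.append(current)
    return sentences


# Invented example sentence ("The dog sleeps."), not a treebank excerpt.
SAMPLE_ROWS = [
    ["1", "Hunden", "hund", "NOUN", "_",
     "Definite=Def|Gender=Masc|Number=Sing", "2", "nsubj", "_", "_"],
    ["2", "sover", "sove", "VERB", "_",
     "Mood=Ind|Tense=Pres|VerbForm=Fin", "0", "root", "_", "SpaceAfter=No"],
    ["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
]
SAMPLE = ("# text = Hunden sover.\n"
          + "\n".join("\t".join(row) for row in SAMPLE_ROWS)
          + "\n\n")

sentences = parse_conllu(SAMPLE)
n_tokens = sum(len(sentence) for sentence in sentences)
roots = [tok.form for tok in sentences[0] if tok.head == 0]
print(len(sentences), n_tokens, roots)
```

Run over the full files, the same counting could be compared with the token and sentence figures listed in this record, bearing in mind that the record does not say whether its token counts include punctuation or multiword ranges.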
dc:type | corpus |
dc:title | Norsk UD-trebank |
dc:identifier | oai:nb.no:sbr-83 |
dc:description | Universal Dependencies (UD) is a framework for annotating grammar uniformly across different languages. The categories of grammatical annotation include word segmentation (tokenization), parts of speech (POS), morphological features, and syntactic relations (dependency relations). For more information on the annotation standard and guidelines, see https://universaldependencies.org/guidelines.html. The UD treebanks for Bokmål and Nynorsk are based on the Norwegian Dependency Treebank (Norsk Dependenstrebank, NDT). The analyses in NDT have been automatically converted to the UD annotation standard with Grew (https://grew.fr/). The conversion code is publicly available in the GitHub repository grew_ndt2ud (https://github.com/Sprakbanken/grew_ndt2ud). |
dc:publisher | |
dc:format | downloadable |
dc:date | 2022-06-20 |
dc:date | 2023-05-11 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-BY-SA (CC-BY-SA) |
dc:rights | https://creativecommons.org/licenses/by-sa/4.0/ |
dc:creator | National Library of Norway |
dc:creator | University of Oslo |
dc:lang | Bokmål |
dc:lang | Nynorsk |
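The Dublin Core rows above repeat some keys (two dc:date values, two dc:creator values) and leave dc:publisher empty. As a rough sketch of how such rows could be read into a multi-valued mapping, assuming the `key | value |` layout shown here and a hypothetical helper named `parse_dc_rows`, one option in Python is:

```python
def parse_dc_rows(text):
    """Collect 'dc:key | value |' rows into a dict mapping key -> list of values.

    Assumption: values contain no '|' characters. Keys with an empty value
    are kept with an empty list so that their presence is not lost.
    """
    record = {}
    for line in text.splitlines():
        parts = [part.strip() for part in line.split("|")]
        if len(parts) < 2 or not parts[0].startswith("dc:"):
            continue  # not a Dublin Core row
        key, value = parts[0], parts[1]
        values = record.setdefault(key, [])
        if value:
            values.append(value)
    return record


# A few rows copied from the record above.
ROWS = """\
dc:type | corpus |
dc:publisher | |
dc:date | 2022-06-20 |
dc:date | 2023-05-11 |
dc:creator | National Library of Norway |
dc:creator | University of Oslo |
"""

record = parse_dc_rows(ROWS)
print(record["dc:date"])
```

Lists are used for every key, including single-valued ones, so that callers do not need to distinguish repeated from non-repeated elements.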