Norsk dependenstrebank (NDT)
Extended metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: Norsk dependenstrebank (NDT)
- resource Name: Norwegian Dependency Treebank
- description: The Norwegian Dependency Treebank comprises two separate treebanks, with texts in Bokmål and Nynorsk, annotated morphologically and syntactically. Each treebank contains about 300,000 tokens (word forms, including punctuation). The morphological analysis follows Norsk referansegrammatikk, while dependency grammar is used for the syntactic analysis. The annotation was produced automatically, then quality-checked and manually corrected by two linguists, and thus meets a "gold standard".
- description: The Norwegian Dependency Treebank (NDT) consists of text which is manually annotated with morphological features, syntactic functions and hierarchical structure. The formalism used for the syntactic annotation is dependency grammar. With a few exceptions, the syntactic analysis follows Norsk referansegrammatikk ‘Norwegian Reference Grammar’. The treebank consists of two parts, one for Norwegian Bokmål and one for Nynorsk, each containing about 300,000 tokens (words and punctuation).
- resource Short Name: NDT
- url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-10/
- PID: hdl:21.11146/10
- identifier: sbr-10
- distribution Info:
- licence Info:
- user Category: Public
- distribution Access Medium: downloadable
- download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-10/
- licence:
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-ZERO (CC-ZERO)
- licence Url: https://creativecommons.org/publicdomain/zero/1.0/
- conditions Of Use: *
- non Standard Conditions Of Use: * NORED * No redistribution * The original third-party contents are not included in this CC-0 license, and these individual works may not be republished as stand-alone texts.
- licensor:
- actor Info:
- actor Type: organization
- role: Licensor
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- distribution Rights Holder:
- actor Info:
- actor Type: organization
- role: Distribution Rights Holder
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- actor Info:
- actor Type: organization
- role: Contact
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- actor Info:
- actor Type: person
- role: Metadata Creator
- person Info:
- surname: Ohren
- given Name: Oddrun Pauline
- affiliation:
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Tilvekst og kunnskapsorganisering
- department Name: Acquisition and Bibliographic Services
- actor Info:
- actor Type: person
- role: Resource Creator
- person Info:
- surname: Solberg
- given Name: Per Erik
- affiliation:
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- corpus Info:
- corpus Type: Treebank
- corpus Part Info:
- media Type: text
- corpus Text Info:
- text Format Info:
- mime Type: text/tab-separated-values
- size Per Text Format:
- size Info:
- size: 600000
- size Unit: tokens
- text Format Info:
- mime Type: text/xml
- size Per Text Format:
- size Info:
- size: 600000
- size Unit: tokens
- character Encoding Info:
- character Encoding: UTF-8
- corpus Part General Info:
- linguality Info:
- linguality Type: monolingual
- language Info:
- language Id: nb
- language Name: Norwegian Bokmål
- size Per Language:
- size Info:
- size: 300000
- size Unit: tokens
- language Info:
- language Id: nn
- language Name: Norwegian Nynorsk
- size Per Language:
- size Info:
- size: 300000
- size Unit: tokens
- modality Info:
- modality Type: writtenLanguage
- modality Type Details: Blog text, news text, parliament proceedings, government white papers
- size Info:
- size: 600000
- size Unit: tokens
- annotation Info:
- annotation Type: morphosyntacticAnnotation-posTagging
- segmentation Level: word
- annotation Mode: manual
- annotation Manual Unstructured:
- role: annotationManual
- document Unstructured: https://www.nb.no/sbfil/dok/20140314_guidelines_ndt_english.pdf
- time Coverage Info:
- time Coverage: 2000-2013
dc:type | corpus |
dc:title | Norsk dependenstrebank (NDT) |
dc:identifier | oai:nb.no:sbr-10 |
dc:description | The Norwegian Dependency Treebank comprises two separate treebanks, with texts in Bokmål and Nynorsk, annotated morphologically and syntactically. Each treebank contains about 300,000 tokens (word forms, including punctuation). The morphological analysis follows Norsk referansegrammatikk, while dependency grammar is used for the syntactic analysis. The annotation was produced automatically, then quality-checked and manually corrected by two linguists, and thus meets a "gold standard". |
dc:publisher | |
dc:format | downloadable |
dc:date | 2011-01-03 |
dc:date | 2014-03-28 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-ZERO (CC-ZERO) |
dc:rights | https://creativecommons.org/publicdomain/zero/1.0/ |
dc:creator | Per Erik Solberg |
dc:creator | Kari Kinn |
dc:creator | Pål Kristian Eriksen |
dc:lang | bokmål |
dc:lang | nynorsk |