NDT 2.0 with Constituent Structure (NDT 2.0 med konstituentstruktur)
Extended metadata
- resource Common Info:
- resource Type: corpus
- identification Info:
- resource Name: NDT 2.0 med konstituentstruktur
- resource Name: NDT 2.0 with Constituent Structure
- description: This version of the Norwegian Dependency Treebank 2.0 (Norsk dependenstrebank 2.0) adds a constituent structure (c-structure) similar to the one found in NorGramBank. It can be used to train a single syntactic parser for both grammatical frameworks (dependency and constituency analysis), which has been shown to be potentially beneficial to parser performance. The c-structure was assigned automatically and the analyses have not been checked manually, so the results should be used with caution. Feel free to contact us at sprakbanken@nb.no if you have questions or comments about this resource.
- url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-90/
- PID: hdl:21.11146/90
- identifier: sbr-90
- distribution Info:
- licence Info:
- user Category: Public
- distribution Access Medium: downloadable
- download Location: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-90/
- licence:
- licence Family: Creative Commons (CC)
- licence Name: Creative_Commons-ZERO (CC-ZERO)
- licence Url: https://creativecommons.org/publicdomain/zero/1.0/
- conditions Of Use: *
- non Standard Conditions Of Use: NORED: No redistribution. The original third-party contents are not included in this CC-0 license, and these individual works may not be republished as stand-alone texts.
- licensor:
- actor Info:
- actor Type: organization
- role: Licensor
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- contact:
- actor Info:
- actor Type: organization
- role: Contact
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- communication Info:
- email: sprakbanken@nb.no
- url: https://www.nb.no/sprakbanken/
- address: P.O. Box 2674 Solli
- zip Code: 0203
- city: Oslo
- region: Oslo
- country: Norway
- actor Info:
- actor Type: organization
- role: Metadata Creator
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- actor Info:
- actor Type: organization
- role: Resource Creator
- organization Info:
- organization Name: Nasjonalbiblioteket
- organization Name: National Library of Norway
- organization Short Name: NB
- organization Short Name: NLN
- department Name: Språkbanken
- department Name: The Language Bank
- corpus Info:
- corpus Type: Treebank
- corpus Part Info:
- media Type: text
- corpus Text Info:
- text Format Info:
- mime Type: text/x-conll
- character Encoding Info:
- character Encoding: UTF-8
- corpus Part General Info:
- linguality Info:
- linguality Type: bilingual
- multilinguality Type: multilingualSingleText
- multilinguality Type Details: Blog text, news text, parliament proceedings, government white papers in Norwegian Bokmål and Norwegian Nynorsk
- language Info:
- language Id: nb
- language Name: Norwegian Bokmål
- size Per Language:
- size Info:
- size: 300000
- size Unit: tokens
- language Variety Info:
- language Variety Type: other
- language Variety Name: Blog text, news text, parliament proceedings, government white papers
- language Info:
- language Id: nn
- language Name: Norwegian Nynorsk
- size Per Language:
- size Info:
- size: 300000
- size Unit: tokens
- language Variety Info:
- language Variety Type: other
- language Variety Name: Blog text, news text, parliament proceedings, government white papers
- modality Info:
- modality Type: writtenLanguage
- modality Type Details: Blog text, news text, parliament proceedings, government white papers
- size Info:
- size: 600000
- size Unit: tokens
- annotation Info:
- annotation Type: morphosyntacticAnnotation-posTagging
- annotation Description: conll-u
- annotation Format: conll-u
- annotation Info:
- annotation Type: syntacticAnnotation-treebanks
- annotation Description: C-structure (NorGramBank)
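The record above lists the annotation format as conll-u with UTF-8 encoding. As a rough illustration of how such a file might be read, the sketch below parses tab-separated CoNLL-U token lines into one list of tokens per sentence. The ten column names follow the standard CoNLL-U layout; how this resource stores the c-structure is not stated in the record, so any columns beyond the standard ten are simply kept in a hypothetical `extra` field. The sample sentence and the function name `read_conllu` are invented for illustration and are not taken from the treebank.

```python
import io
from typing import Dict, Iterator, List, TextIO

# The ten standard CoNLL-U columns, in order.
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]


def read_conllu(stream: TextIO) -> Iterator[List[Dict[str, object]]]:
    """Yield each sentence as a list of token dicts.

    Comment lines (starting with '#') are skipped and blank lines end a
    sentence. Columns beyond the standard ten, if any, are collected in
    token["extra"]; this is an assumption, since the record does not say
    where the c-structure is stored.
    """
    sentence: List[Dict[str, object]] = []
    for raw in stream:
        line = raw.rstrip("\n")
        if not line.strip():
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        token: Dict[str, object] = dict(zip(FIELDS, cols))
        token["extra"] = cols[len(FIELDS):]
        sentence.append(token)
    if sentence:
        yield sentence


# Invented sample data, used only to show the expected input shape.
SAMPLE = (
    "# text = Hun leser .\n"
    "1\tHun\thun\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tleser\tlese\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\t.\t$.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
)

sentences = list(read_conllu(io.StringIO(SAMPLE)))
for token in sentences[0]:
    print(token["id"], token["form"], token["head"], token["deprel"])
```

To read a downloaded file instead of the sample, open it with `encoding="utf-8"` and pass the file object to `read_conllu`; the actual file names in the distribution are not given in this record.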
dc:type | corpus |
dc:title | NDT 2.0 med konstituentstruktur |
dc:identifier | oai:nb.no:sbr-90 |
dc:description | This version of the Norwegian Dependency Treebank 2.0 (Norsk dependenstrebank 2.0) adds a constituent structure (c-structure) similar to the one found in NorGramBank. It can be used to train a single syntactic parser for both grammatical frameworks (dependency and constituency analysis), which has been shown to be potentially beneficial to parser performance. The c-structure was assigned automatically and the analyses have not been checked manually, so the results should be used with caution. Feel free to contact us at sprakbanken@nb.no if you have questions or comments about this resource. |
dc:publisher | |
dc:format | downloadable |
dc:date | 2023-01-01 |
dc:date | 2023-10-27 |
dc:rights | Public |
dc:rights | Creative Commons (CC) |
dc:rights | Creative_Commons-ZERO (CC-ZERO) |
dc:rights | https://creativecommons.org/publicdomain/zero/1.0/ |
dc:creator | Nasjonalbiblioteket |
dc:creator | Universitetet i Oslo |
dc:creator | Universitetet i Bergen |
dc:lang | bokmål |
dc:lang | nynorsk |