NorGramBank Annotations of fiction text from ‘Nynorskkorpuset ved Norsk Ordbok 2014’
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: NorGramBank Annotations of fiction text from 'Nynorskkorpuset ved Norsk Ordbok 2014'
- description: The treebank "Annotations of fiction text from 'Nynorskkorpuset ved Norsk Ordbok 2014'" is a syntactically annotated corpus which uses text extracts from Nynorskkorpuset ved Norsk Ordbok 2014 (no2014.uio.no). This treebank is part of the INESS NorGramBank collection (see URL in metadata).
- resource Short Name: Nynorskkorpuset fiction
- resource Short Name: Nynorskkorpuset skjønnlitteratur
- url: http://clarino.uib.no/iness/landing-page?resource=nno-nnk-sk&view=short
- url: http://clarino.uib.no/iness/landing-page?resource=nno-nnk-sk
- url: http://clarino.uib.no/comedi/metadata-editor?&identifier=NorGramBank
- PID: hdl:11495/DA66-216B-8F1A-9
- identifier: nno-nnk-sk
- distribution Info
- licence Info
- user Category: Academic
- distribution Access Medium: accessibleThroughInterface
- execution Location: https://hdl.handle.net/11495/DA66-216B-8F1A-9
- licence
- licence Family: CLARIN
- licence Name: CLARIN_ACA-DEP
- licence Url: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NORED=1&DEP=1
- conditions Of Use: BY
- conditions Of Use: DEP
- conditions Of Use: ID
- conditions Of Use: NORED
- non Standard Conditions Of Use: NOTE: All linguistic annotations in NorGram and NorGramBank are research material based on original texts compiled from various sources. Please note that the original texts, as individual works ('åndsverk'), remain Copyright protected by Norwegian law (http://lovdata.no/lov/1961-05-12-2). Any attempt to use the INESS linguistic annotations to reconstruct and to publish or in any way misuse the original text is a violation of Copyright law.
- ipr Holder
- actor Info
- actor Type: organization
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- department Name: Institutt for lingvistiske, litterære og estetiske studier (LLE)
- communication Info
- email: clarin@uib.no
- url: https://repo.clarino.uib.no/
- url: https://clarin.b.uib.no
- city: Bergen
- country: Norway
- actor Info
- licence Info
- contact
- actor Info
- actor Type: person
- person Info
- surname: Rosén
- given Name: Victoria
- sex: female
- position: Associate Professor
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- actor Info
- metadata Info
- metadata Creation Date: 10.02.2016
- metadata Language Name: English
- metadata Language Id: en
- metadata Last Date Updated: 10.06.2016
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- email: clarin@uib.no
- actor Info
- documentation Unstructured
- role: documentation
- document Unstructured: http://clarino.uib.no/iness/page?page-id=Publications
- creation Start Date: 2011
- creation End Date: 2016
- funding Project:
- project Info
- project Name: Infrastructure for the Exploration of Syntax and Semantics
- project Short Name: INESS
- project ID: 195323
- url: http://clarino.uib.no/iness/
- funding Type: nationalFunds
- funder: The Research Council of Norway under the Infrastruktur program
- funder: University of Bergen
- funding Country: Norway
- corpus Info
- corpus Type: Treebank
- corpus Part Info
- media Type: text
- corpus Text Info
- character Encoding Info
- character Encoding: UTF-8
- character Encoding Info
- corpus Part General Info
- source Work Info
- work Description: The source material for the annotations described in this metadata file is fiction text in Norwegian Nynorsk from Nynorskkorpuset ved Norsk Ordbok 2014 (no2014.uio.no). The IPR holder of Nynorskkorpuset is the Department of Linguistics and Scandinavian Studies, Faculty of Humanities, University of Oslo, Norway. Source work reference: Ridings, Daniel & Oddrun Grønvik. 2012. A corpus based method for a diachronic study of the central vocabulary of New Norwegian. In: Birgit Eaker et al. 2012. Rapport från Konferensen om lexikografi i Norden. Lund 24-27 maj 2011. Nordiska studier i lexikografi 11, pp. 524-33. The IPR holder of the results of INESS’ linguistic annotations is the Department of Linguistic, Literary and Aesthetic Studies, Faculty of Humanities, University of Bergen, Norway. The full list of authors and texts in this treebank can be inspected in INESS via "Treebank overview". The full source texts remain copyright protected and cannot be redistributed by INESS.
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: no
- language Name: Norwegian
- language Info
- language Id: nn
- language Name: Norwegian Nynorsk
- modality Info
- modality Type: writtenLanguage
- size Info
- size: 89963
- size Unit: sentences
- size Info
- size: 915441
- size Unit: words
- annotation Info
- annotation Type: syntacticAnnotation-treebanks
- annotation Standoff: false
- segmentation Level: sentence
- annotation Format: XLE (Packed c- and f-structures in Prolog)
- tagset: http://prosjekt.digital.uni.no/projects/inesspublic/wiki/NorGram_Lexical_Categories_(Preterminals); http://prosjekt.digital.uni.no/projects/inesspublic/wiki/NorGram_Phrase_Structure_Categories; http://prosjekt.digital.uni.no/projects/inesspublic/wiki/NorGram_F-structure_Features
- theoretic Model: Lexical Functional Grammar (LFG)
- annotation Mode: mixed
- annotation Mode Details: Automatic parsing, manual disambiguation using discriminants.
- annotation Manual Unstructured
- role: annotationManual
- document Unstructured: http://clarino.uib.no/iness/page?page-id=_NorGram_annotator_guidelines_
- annotation Tool
- target Resource Name URI: LFG Parsebanker
- annotator:
- actor Info
- actor Type: person
- person Info
- surname: Losnegaard
- given Name: Gyri Smørdal
- sex: female
- position: Researcher
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- source Work Info
- actor Info
- actor Type: person
- person Info
- surname: Thunes
- given Name: Martha
- sex: female
- position: Postdoc in INESS
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- actor Info
- actor Type: person
- person Info
- surname: Haugereid
- given Name: Petter
- sex: male
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- annotation Type: other
- annotation Description: Text Preprocessing: When a corpus is parsed, there will always be words that are unknown to the morphological analyzer and/or the lexicon. Thus, the documents must be preprocessed before syntactic parsing. INESS has therefore developed an intelligent browser-based preprocessing interface which facilitates efficient text cleanup and the treatment of unknown word forms. For more details, cf. Rosén et al. (2012), 'An integrated web-based treebank annotation system': http://clarino.uib.no/iness/page?page-id=Publications.
- segmentation Level: word
- annotation Mode: interactive
- annotator:
- actor Info
- actor Type: person
- person Info
- surname: Losnegaard
- given Name: Gyri Smørdal
- sex: female
- position: Researcher
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- actor Type: person
- person Info
- surname: Thunes
- given Name: Martha
- sex: female
- position: Postdoc in INESS
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- actor Type: person
- person Info
- surname: Haugereid
- given Name: Petter
- sex: male
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- actor Type: person
- person Info
- surname: Fatnes
- given Name: Ingeborg
- sex: female
- position: Scientific assistant in INESS (text preprocessing)
- actor Type: person
- person Info
- surname: Dale
- given Name: Ingerid
- sex: female
- position: Scientific assistant in INESS (text preprocessing)
- actor Type: person
- person Info
- surname: Bergmann
- given Name: Julie
- sex: female
- position: Scientific assistant in INESS (text preprocessing)
- genre Info
- genre Type: textGenre
- genre: fiction and drama
- creation Mode: mixed
- creation Mode Details: The annotation is created through parsebanking. Analyses produced by parsing with the Norwegian LFG grammar NorGram on the XLE platform were manually disambiguated with discriminants, and reparsed after grammar and lexicon updates.
- original Source
- target Resource Name URI: http://no2014.uio.no/
- creation Tool
- target Resource Name URI: XLE
- creation Tool
- target Resource Name URI: NorGram (online demonstrator: http://clarino.uib.no/iness/xle-web)
- creation Tool
- target Resource Name URI: LFG Parsebanker
dc:type | corpus |
dc:title | NorGramBank Annotations of fiction text from 'Nynorskkorpuset ved Norsk Ordbok 2014' |
dc:identifier | oai:clarino.uib.no:nno-nnk-sk |
dc:description | The treebank "Annotations of fiction text from 'Nynorskkorpuset ved Norsk Ordbok 2014'" is a syntactically annotated corpus which uses text extracts from Nynorskkorpuset ved Norsk Ordbok 2014 (no2014.uio.no). This treebank is part of the INESS NorGramBank collection (see URL in metadata). |
dc:publisher | |
dc:format | accessibleThroughInterface |
dc:date | 2011 |
dc:date | 2016 |
dc:rights | Academic |
dc:rights | CLARIN |
dc:rights | CLARIN_ACA-DEP |
dc:rights | https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NORED=1&DEP=1 |
dc:lang | Norwegian |
dc:lang | Norwegian Nynorsk |
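
The Dublin Core rows above follow a simple "key | value |" layout in which a key such as dc:date or dc:rights may repeat and a value may be empty (dc:publisher). A minimal sketch of reading such rows into a multi-valued mapping is given below. The function name parse_dc_rows and the sample string are illustrative assumptions and are not part of the INESS or CLARINO tooling; the sketch also assumes that values contain no "|" character, which holds for the rows listed here.

```python
from collections import defaultdict


def parse_dc_rows(text):
    """Collect 'dc:key | value |' rows into a dict of lists (keys may repeat)."""
    fields = defaultdict(list)
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("dc:"):
            continue  # ignore anything that is not a Dublin Core row
        key, _, rest = line.partition("|")
        value = rest.strip()
        if value.endswith("|"):
            value = value[:-1].strip()  # drop the trailing cell delimiter
        if value:  # skip empty cells such as 'dc:publisher | |'
            fields[key.strip()].append(value)
    return dict(fields)


# Hypothetical sample copied from a few of the rows in this record.
sample = """\
dc:type | corpus |
dc:identifier | oai:clarino.uib.no:nno-nnk-sk |
dc:publisher | |
dc:date | 2011 |
dc:date | 2016 |
"""

record = parse_dc_rows(sample)
print(record["dc:date"])
```

Keeping every value in a list preserves repeated fields such as the two dc:date rows (creation start and end year) instead of letting the later row overwrite the earlier one.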