META-NORD Sofie Parallel Treebank
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: META-NORD Sofie Parallel Treebank
- description: The Sofie Parallel Treebank is a syntactically annotated parallel corpus based on the first chapters of the novel “Sofies verden” by Jostein Gaarder, published by Aschehoug forlag. The treebank is a product of the META-NORD project and its goal to promote the accessability of existing treebanks for the languages in the project. SOURCE TEXT The Norwegian novel Sofies verden (Gaarder 1991) was chosen as a suitable basis for treebanking because it is linguistically rich and professionally translated in many languages, and because some treebanks already existed for text selections from this material in some languages in the META-NORD area. Previous work was done by the Nordic Treebank Network, funded by the Nordic Language Technology Program (2001-2005) but had not been maintained and was no longer accessible. It was decided to gather those treebanks, document them, supplement them with additional treebanks for some languages where this effort was feasible, and make the resulting resources accessible. The resulting work has been a joint effort between META-NORD and the INESS project, which hosts the treebank. The rights for the Finnish treebank have not been cleared, and this treebank is currently unavailable. More information about the treebank development in META-NORD is available in the META-NORD Deliverable 3.4 on Parallel Treebanks (http://www.meta-nord.eu).
- url: http://clarino.uib.no/iness/landing-page?resource=sofie-par
- url: http://clarino.uib.no/iness/landing-page?resource=sofie-par&view=short
- P I D: hdl:11495/D934-F8F4-D36A-7
- identifier: Sofie
- distribution Info
- licence Info
- user Category: Public
- distribution Access Medium: downloadable
- distribution Access Medium: accessibleThroughInterface
- download Location: http://clarino.uib.no/iness
- execution Location: http://clarino.uib.no/iness
- attribution Text: The "Sofie analyses" is research material based on the novel "Sofies verden" [Sophie's world] by Jostein Gaarder, published by Aschehoug Forlag. If you use INESS in your research, please link to the INESS webpage (http://clarino.uib.no/iness) in materials included with your data. We suggest the following reference in your scientific publications: Victoria Rosén, Koenraad De Smedt, Paul Meurer, and Helge Dyvik. An open infrastructure for advanced treebanking. In Jan Hajič, Koenraad De Smedt, Marko Tadić, and António Branco (eds.) META-RESEARCH Workshop on Advanced Treebanking at LREC2012, pages 22–29, Istanbul, Turkey, May 2012.
- licence
- licence Family: none
- licence Name: unspecified
- licence Url: https://clarino.uib.no/comedi/licenses/sofie-parallel-license.txt
- conditions Of Use: BY
- conditions Of Use: LRT
- conditions Of Use: NORED
- ipr Holder
- actor Info
- actor Type: person
- person Info
- surname: Gaarder
- given Name: Jostein
- sex: male
- communication Info
- actor Info
- actor Type: organization
- organization Info
- organization Name: University of Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: post@lle.uib.no
- actor Info
- licence Info
- contact
- actor Info
- actor Type: person
- person Info
- surname: Rosén
- given Name: Victoria
- sex: female
- position: Associate Professor
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- actor Info
- metadata Info
- metadata Creation Date: 25.06.2015
- source: The present metadata are authoritative metadata. They are based on metadata from the project META-NORD (project end date 31.01.2013), published in the META-SHARE catalogue.
- original Metadata Schema: META-SHARE
- original Metadata Link: http://metashare.nb.no/repository/browse/meta-nord-sofie-parallel-treebank/fca71f10348711e29e75001708556d5a1b3257bb3f534b6da57ca5949e8ed9fa/
- metadata Language Name: English
- metadata Language Id: en
- metadata Last Date Updated: 05.10.2022
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- email: clarin@uib.no
- actor Info
- validated: true
- validation Type: content
- validation Mode: mixed
- validation Mode Details: All alignments have been manually validated. See the descriptions of the monolingual treebanks for validation/evaluation of the individual annotations.
- validation Extent: partial
- corpus Info
- corpus Type: Treebank
- corpus Part Info
- media Type: text
- corpus Part General Info
- source Work Info
- title: Sofies verden
- work Description: The novel Sofies verden (Sophie's world), ISBN: 9788203254147.
- author:
- actor Info
- actor Type: person
- person Info
- surname: Gaarder
- given Name: Jostein
- sex: male
- publisher:
- actor Info
- actor Type: organization
- organization Info
- organization Name: Aschehoug forlag
- organization Name: Aschehoug Publishing House
- communication Info
- email: Even.Rakil@aschehoug.no
- url: http://www.aschehoug.no/om/english
- city: Oslo
- country: Norway
- telephone Number: +47 22400449
- source Work Info
- linguality Info
- linguality Type: multilingual
- multilinguality Type: parallel
- multilinguality Type Details: All language pairs have been aligned at sentence (parse unit) level.
- language Info
- language Id: no
- language Name: Norwegian
- size Per Language
- size Info
- size: 255
- size Unit: sentences
- size Info
- language Info
- language Id: sv
- language Name: Swedish
- size Per Language
- size Info
- size: 215
- size Unit: sentences
- size Info
- language Info
- language Id: da
- language Name: Danish
- size Per Language
- size Info
- size: 103
- size Unit: sentences
- size Info
- language Info
- language Id: et
- language Name: Estonian
- size Per Language
- size Info
- size: 52
- size Unit: sentences
- size Info
- language Info
- language Id: ka
- language Name: Georgian
- size Per Language
- size Info
- size: 1025
- size Unit: sentences
- size Info
- language Info
- language Id: de
- language Name: German
- size Per Language
- size Info
- size: 528
- size Unit: sentences
- size Info
- language Info
- language Id: is
- language Name: Icelandic
- size Per Language
- size Info
- size: 194
- size Unit: sentences
- size Info
- language Info
- language Id: en
- language Name: English
- size Per Language
- size Info
- size: 225
- size Unit: sentences
- size Info
- modality Info
- modality Type: writtenLanguage
- classification Info
- genre Info
- genre Type: textGenre
- genre: fiction and drama
- genre Info
- creation Info
dc:type | corpus |
dc:title | META-NORD Sofie Parallel Treebank |
dc:identifier | oai:clarino.uib.no:sofie-par |
dc:description | The Sofie Parallel Treebank is a syntactically annotated parallel corpus based on the first chapters of the novel “Sofies verden” by Jostein Gaarder, published by Aschehoug forlag. The treebank is a product of the META-NORD project and its goal to promote the accessability of existing treebanks for the languages in the project. SOURCE TEXT The Norwegian novel Sofies verden (Gaarder 1991) was chosen as a suitable basis for treebanking because it is linguistically rich and professionally translated in many languages, and because some treebanks already existed for text selections from this material in some languages in the META-NORD area. Previous work was done by the Nordic Treebank Network, funded by the Nordic Language Technology Program (2001-2005) but had not been maintained and was no longer accessible. It was decided to gather those treebanks, document them, supplement them with additional treebanks for some languages where this effort was feasible, and make the resulting resources accessible. The resulting work has been a joint effort between META-NORD and the INESS project, which hosts the treebank. The rights for the Finnish treebank have not been cleared, and this treebank is currently unavailable. More information about the treebank development in META-NORD is available in the META-NORD Deliverable 3.4 on Parallel Treebanks (http://www.meta-nord.eu). |
dc:publisher | |
dc:format | downloadable |
dc:date | |
dc:date | |
dc:rights | Public |
dc:rights | none |
dc:rights | unspecified |
dc:rights | https://clarino.uib.no/comedi/licenses/sofie-parallel-license.txt |
dc:lang | Norwegian |
dc:lang | Swedish |
dc:lang | Danish |
dc:lang | Estonian |
dc:lang | Georgian |
dc:lang | German |
dc:lang | Icelandic |
dc:lang | English |