Icelandic Parsed Historical Corpus
Utvidet metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: Icelandic Parsed Historical Corpus
- description: About 1000000 words of Icelandic text, from every century between the 12th and the 21st centuries inclusive annotated for phrase structure, part-of-speech-tagged and lemmatized. A copy of the treebank is searchable via the INESS portal. The original is downloadable on a LGPL license, see elsewhere in the metadata for a link.
- resource Short Name: icepahc
- url: http://clarino.uib.no/iness/landing-page?resource=isl-icepahc&view=short
- url: http://clarino.uib.no/iness/landing-page?resource=isl-icepahc
- url: http://clarino.uib.no/iness/landing-page?resource=sofie-par&view=short
- P I D: hdl:11495/DB27-725F-305B-1
- identifier: isl-icepahc
- distribution Info
- licence Info
- user Category: Public
- distribution Access Medium: downloadable
- distribution Access Medium: accessibleThroughInterface
- download Location: http://linguist.is/icelandic_treebank/Download
- execution Location: http://hdl.handle.net/11495/DB27-725F-305B-1
- attribution Text: Wallenberg, Joel, Anton Karl Ingason, Einar Freyr Sigurðsson and Eiríkur Rögnvaldsson. 2011. Icelandic Parsed Historical Corpus (IcePaHC). Version 0.9. http://www.linguist.is/icelandic_treebank
- licence
- licence Family: GNU
- licence Name: Lesser General Public License (LGPL)
- licence Url: https://www.gnu.org/licenses/lgpl.html
- conditions Of Use: BY
- conditions Of Use: SA
- ipr Holder
- actor Info
- actor Type: person
- person Info
- surname: Rögnvaldsson
- given Name: Eiríkur
- sex: male
- position: Professor
- affiliation:
- organization Info
- organization Name: University of Iceland
- organization Short Name: HI
- department Name: Department of Icelandic and Comparative Cultural Studies
- communication Info
- email: eirikur@hi.is
- actor Info
- actor Info
- actor Type: person
- person Info
- surname: Ingason
- given Name: Anton Karl
- sex: male
- communication Info
- email: anton.karl.ingason@gmail.com
- actor Info
- actor Type: person
- person Info
- surname: Wallenberg
- given Name: Joel
- sex: male
- communication Info
- email: joel.wallenberg@gmail.com
- actor Info
- actor Type: person
- person Info
- surname: Sigurðsson
- given Name: Einar Freyr
- sex: male
- communication Info
- email: einarfs@gmail.com
- licence Info
- contact
- actor Info
- actor Type: person
- role: For questions regarding the INESS portal
- person Info
- surname: Rosén
- given Name: Victoria
- sex: female
- position: Associate Professor
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- actor Info
- actor Info
- actor Type: person
- role: For questions regarding the resource itself
- person Info
- surname: Rögnvaldsson
- given Name: Eiríkur
- sex: male
- position: Professor
- affiliation:
- organization Info
- organization Name: University of Iceland
- organization Short Name: HI
- department Name: Department of Icelandic and Comparative Cultural Studies
- communication Info
- email: eirikur@hi.is
- metadata Creation Date: 06.07.2016
- source: This metadata is based on the metadata originally created in META-SHARE in 2013. The original metadatain META-SHARE should be considered as authoritative.
- original Metadata Schema: META-SHARE
- original Metadata Link: http://metashare.tilde.com/repository/browse/icelandic-parsed-historical-corpus/5a72f4b07eff11e5aa3b001dd8b71c6659ba6ffd78614d6a82a8b5da47d2a2c3/
- metadata Language Name: English
- metadata Language Id: en
- metadata Last Date Updated: 06.07.2016
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- email: clarin@uib.no
- actor Info
- corpus Info
- corpus Type: Treebank
- corpus Part General Info
- source Work Info
- work Description: Text included in version 0.9: http://linguist.is/icelandic_treebank/Download#Texts_included_in_Version_0.9
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: is
- language Name: Icelandic
- modality Info
- modality Type: writtenLanguage
- size Info
- size: 73014
- size Unit: sentences
- size Info
- size: 1057182
- size Unit: words
- annotation Info
- annotation Type: syntacticAnnotation-treebanks
- annotation Standoff: false
- segmentation Level: sentence
- segmentation Level: word
- tagset: http://www.linguist.is/icelandic_treebank/Tagset
- conformance To Standards Best Practices: pennTreeBank
- theoretic Model: Phrase structure, constituency
- annotation Mode: mixed
- annotation Manual Unstructured
- role: annotationManual
- document Unstructured: http://www.linguist.is/icelandic_treebank/Icelandic_Parsed_Historical_Corpus_%28IcePaHC%29#Annotation_guidelines
- annotation Tool
- target Resource Name U R I: Annotald
- classification Info
- genre Info
- genre Type: textGenre
- genre: fiction and drama
- genre Info
- time Coverage Info
- time Coverage: 12th to 21st centuries
- geographic Coverage Info
- geographic Coverage: Iceland
- source Work Info
dc:type | corpus |
dc:title | Icelandic Parsed Historical Corpus |
dc:identifier | oai:clarino.uib.no:isl-icepahc |
dc:description | About 1000000 words of Icelandic text, from every century between the 12th and the 21st centuries inclusive annotated for phrase structure, part-of-speech-tagged and lemmatized. A copy of the treebank is searchable via the INESS portal. The original is downloadable on a LGPL license, see elsewhere in the metadata for a link. |
dc:publisher | |
dc:format | downloadable |
dc:date | |
dc:date | |
dc:rights | Public |
dc:rights | GNU |
dc:rights | Lesser General Public License (LGPL) |
dc:rights | https://www.gnu.org/licenses/lgpl.html |
dc:lang | islandsk |