LinGO Redwoods Treebank (copy @ INESS)
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: LinGO Redwoods Treebank (copy @ INESS)
- description: The LinGO Redwoods Treebank is a collection of hand-annotated corpora analysed with the LinGO ERG. For each utterance from a corpus, the treebank records (in principle) all analyses hypothesized by the grammar, together with an annotator decision as to which reading is preferred in context. The key innovative aspect of the Redwoods approach to treebanking is the anchoring of all linguistic data captured in the treebank to the HPSG framework and a generally-available broad-coverage grammar of English, viz. the LinGO English Resource Grammar. Unlike existing treebanks, there is no need to define a (new) form of grammatical representation specific to the treebank (and, consequently, less dissemination effort in establishing this representation). Instead, the treebank records complete syntacto-semantic analyses as defined by the LinGO ERG; tools are provided to extract many different types of linguistic information at varying granularity. Other relevant aspects of the Redwoods Treebank include the integration of alternate, though dispreferred analyses for each utterance and the dynamic nature of the annotations: as the underlying grammar evolves and improves its analyses, there is a provision for a (nearly) fully automated update of the treebank against a version of the original corpus analysed with the revised grammar. As a methodological results, part of the Redwoods data are now regularly maintained as part of the grammar regression cycle with each new release of the ERG.
- url: http://clarino.uib.no/iness/landing-page?resource=eng-redwoods&view=short
- url: http://lingo.stanford.edu/redwoods/
- P I D: hdl:11495/DB0A-7D96-098F-5
- identifier: eng-redwoods
- distribution Info
- licence Info
- user Category: Public
- execution Location: http://hdl.handle.net/11495/DB0A-7D96-098F-5
- licence
- licence Family: GNU
- licence Name: General Public License (GPL)
- licence Url: http://www.gnu.org/licenses/gpl.html
- conditions Of Use: BY
- conditions Of Use: SA
- licence Info
- contact
- actor Info
- actor Type: person
- person Info
- surname: Rosén
- given Name: Victoria
- sex: female
- position: Associate Professor
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- actor Info
- metadata Info
- metadata Creation Date: 14.06.2016
- metadata Last Date Updated: 14.06.2016
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: clarin@uib.no
- actor Info
- corpus Info
- corpus Type: Treebank
- corpus Part General Info
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: en
- language Name: English
- modality Info
- modality Type: writtenLanguage
- size Info
- size: 47805
- size Unit: sentences
- size Info
- size: 535006
- size Unit: words
- annotation Info
- annotation Type: syntacticAnnotation-treebanks
- theoretic Model: HPSG
- linguality Info
dc:type | corpus |
dc:title | LinGO Redwoods Treebank (copy @ INESS) |
dc:identifier | oai:clarino.uib.no:eng-redwoods |
dc:description | The LinGO Redwoods Treebank is a collection of hand-annotated corpora analysed with the LinGO ERG. For each utterance from a corpus, the treebank records (in principle) all analyses hypothesized by the grammar, together with an annotator decision as to which reading is preferred in context. The key innovative aspect of the Redwoods approach to treebanking is the anchoring of all linguistic data captured in the treebank to the HPSG framework and a generally-available broad-coverage grammar of English, viz. the LinGO English Resource Grammar. Unlike existing treebanks, there is no need to define a (new) form of grammatical representation specific to the treebank (and, consequently, less dissemination effort in establishing this representation). Instead, the treebank records complete syntacto-semantic analyses as defined by the LinGO ERG; tools are provided to extract many different types of linguistic information at varying granularity. Other relevant aspects of the Redwoods Treebank include the integration of alternate, though dispreferred analyses for each utterance and the dynamic nature of the annotations: as the underlying grammar evolves and improves its analyses, there is a provision for a (nearly) fully automated update of the treebank against a version of the original corpus analysed with the revised grammar. As a methodological results, part of the Redwoods data are now regularly maintained as part of the grammar regression cycle with each new release of the ERG. |
dc:publisher | |
dc:format | |
dc:date | |
dc:date | |
dc:rights | Public |
dc:rights | GNU |
dc:rights | General Public License (GPL) |
dc:rights | http://www.gnu.org/licenses/gpl.html |
dc:lang | English |