Lassy Small Corpus, version 1.2.1 (copy @ INESS)
Utvidet metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: Lassy Small Corpus, version 1.2.1 (copy @ INESS)
- description: The Lassy Small Corpus 1.1 is a 1 million word corpus with manually verified syntactic annotations. The lemma and postag annotations have been automatically assigned using Tadpole. The syntactic dependency annotations have been assigned using the Alpino parser. The automatically assigned lemmas, postags and syntactic dependency annotations were checked and corrected. Organisations involved in the building of the Lassy Large Corpus: Alfa-informatica, University of Groningen; CCL, K.U. Leuven. ACCESS: The INESS copy can be used by all employees and students of University of Bergen, Dep. of Linguistic, Literary and Aesthetic studies. Others need to apply to the rights holders of the original first. The Lassy version at INESS may be used for academic purposes under the following conditions: attribution required, no derivatives, no redistribution, non-commercial.
- url: http://clarino.uib.no/iness/landing-page?resource=nld-lassy-con&view=short
- url: http://clarino.uib.no/iness/landing-page?resource=nld-lassy-con
- P I D: hdl:11372/LRT-1493
- identifier: nld-lassy-con
- distribution Info
- licence Info
- user Category: Restricted
- distribution Access Medium: accessibleThroughInterface
- execution Location: http://clarino.uib.no/iness/landing-page?resource=nld-lassy-con&view=short
- licence
- licence Family: none
- licence Name: unspecified
- conditions Of Use: BY
- conditions Of Use: ND
- conditions Of Use: NORED
- conditions Of Use: NC
- licence Info
- contact
- actor Info
- actor Type: person
- person Info
- surname: De Smedt
- given Name: Koenraad
- position: Professor
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- department Name: Institutt for lingvistiske, litterære og estetiske studier (LLE)
- communication Info
- email: desmedt@uib.no
- email: iness@uib.no
- actor Info
- metadata Info
- metadata Creation Date: 11.06.2016
- source: This metadata, created for the Lassy copy at INESS, is based on the metadata created for the original resource, PID: http://hdl.handle.net/11372/LRT-1493 The original metadata should be considered as authoritative.
- original Metadata Schema: CMDI
- original Metadata Link: https://lindat.mff.cuni.cz/repository/rest/handle/11372/LRT-1493/citations/cmdi
- metadata Language Name: English
- metadata Language Id: en
- metadata Last Date Updated: 04.07.2016
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- email: clarin@uib.no
- actor Info
- validated: true
- validation Type: content
- validation Mode: manual
- validation Extent: full
- validation Extent Details: manually verified syntactic annotations
- resource Creator
- actor Info
- actor Type: person
- person Info
- surname: van Noord
- given Name: Gertjan
- actor Info
- actor Type: person
- person Info
- surname: Schuurman
- given Name: Ineke
- actor Info
- actor Type: person
- person Info
- surname: van Eynde
- given Name: Frank
- actor Info
- actor Type: person
- person Info
- surname: Bouma
- given Name: Gosse
- actor Info
- funding Project:
- project Info
- project Name: Large Scale Syntactic Annotation of written Dutch
- project Short Name: LASSY
- project I D: STEVIN, STE-05-20
- funding Type: nationalFunds
- funding Country: Netherlands
- corpus Info
- corpus Type: Treebank
- corpus Part General Info
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: nl
- language Name: Dutch
- modality Info
- modality Type: writtenLanguage
- size Info
- size: 65200
- size Unit: sentences
- size Info
- size: 990087
- size Unit: words
- annotation Info
- annotation Type: syntacticAnnotation-treebanks
- segmentation Level: sentence
- segmentation Level: word
- theoretic Model: Constraint Grammar
- annotation Mode: automatic
- annotation Mode Details: The syntactic dependency annotations have been assigned using the Alpino parser.
- annotation Tool
- target Resource Name U R I: Alpino parser
- annotation Info
- annotation Type: morphosyntacticAnnotation-posTagging
- annotation Type: lemmatization
- annotation Description: The lemma and postag annotations have been automatically assigned using Tadpole.
- segmentation Level: word
- annotation Tool
- target Resource Name U R I: Tadpole
- linguality Info
dc:type | corpus |
dc:title | Lassy Small Corpus, version 1.2.1 (copy @ INESS) |
dc:identifier | oai:clarino.uib.no:nld-lassy-con |
dc:description | The Lassy Small Corpus 1.1 is a 1 million word corpus with manually verified syntactic annotations. The lemma and postag annotations have been automatically assigned using Tadpole. The syntactic dependency annotations have been assigned using the Alpino parser. The automatically assigned lemmas, postags and syntactic dependency annotations were checked and corrected. Organisations involved in the building of the Lassy Large Corpus: Alfa-informatica, University of Groningen; CCL, K.U. Leuven. ACCESS: The INESS copy can be used by all employees and students of University of Bergen, Dep. of Linguistic, Literary and Aesthetic studies. Others need to apply to the rights holders of the original first. The Lassy version at INESS may be used for academic purposes under the following conditions: attribution required, no derivatives, no redistribution, non-commercial. |
dc:publisher | |
dc:format | accessibleThroughInterface |
dc:date | |
dc:date | |
dc:rights | Restricted |
dc:rights | none |
dc:rights | unspecified |
dc:rights | |
dc:creator | Gertjan van Noord |
dc:creator | Ineke Schuurman |
dc:creator | Frank van Eynde |
dc:creator | Gosse Bouma |
dc:lang | Dutch |