The Morphologically Annotated Part of BulTreeBank
Extended metadata
- resource Common Info
- resource Type: corpus
- identification Info
- resource Name: The Morphologically Annotated Part of BulTreeBank
- description: This distribution represents only the morphological information encoded in BulTreeBank – HPSG-based Treebank of Bulgarian. It contains about 214000 tokens. It was used for the training of the TreeTagger for Bulgarian. It contains sentences from Bulgarian Grammar Textbooks, Newspapers, Literature and other sources of texts. Full documentation (Style Book, Tagset description) of the Treebank can be found at: http://www.bultreebank.org/TechRep.html
- resource Short Name: BulTreeBank-Morph
- url: http://clarino.uib.no/korpuskel/landing-page?resource=bul-treebank&view=short
- url: http://www.bultreebank.org/btbmorf/
- PID: hdl:11495/D93F-C6E9-65D9-2
- identifier: bul-treebank
- distribution Info
- licence Info
- user Category: Public
- distribution Access Medium: downloadable
- distribution Access Medium: accessibleThroughInterface
- download Location: http://www.bultreebank.org/btbmorf/
- execution Location: https://hdl.handle.net/11495/D93F-C6E9-65D9-2
- licence
- licence Family: META-SHARE (MS)
- licence Name: META-SHARE NonCommercial NoRedistribution (MS-NC-NoReD)
- licence Url: http://www.meta-net.eu/meta-share/meta-share-licenses/META-SHARE%20NonCommercial%20NoRedistribution-v%201.0.pdf
- conditions Of Use: BY
- conditions Of Use: ID
- conditions Of Use: LRT
- conditions Of Use: NC
- conditions Of Use: NORED
- ipr Holder
- actor Info
- actor Type: person
- person Info
- surname: Simov
- given Name: Kiril
- sex: male
- position: Associate Professor
- affiliation:
- organization Info
- organization Name: Bulgarian Academy of Sciences
- department Name: BulTreeBank Group, Linguistic Modelling Laboratory, IICT
- communication Info
- email: kivs@bultreebank.org
- url: http://www.bultreebank.org/btbmorf/
- address: Acad. G.Bonchev 25A
- zip Code: 1113
- city: Sofia
- country: Bulgaria
- telephone Number: +359888473413
- contact
- actor Info
- actor Type: person
- person Info
- surname: Simov
- given Name: Kiril
- sex: male
- position: Associate Professor
- affiliation:
- organization Info
- organization Name: Bulgarian Academy of Sciences
- department Name: BulTreeBank Group, Linguistic Modelling Laboratory, IICT
- communication Info
- email: kivs@bultreebank.org
- url: http://www.bultreebank.org/btbmorf/
- address: Acad. G.Bonchev 25A
- zip Code: 1113
- city: Sofia
- country: Bulgaria
- telephone Number: +359888473413
- metadata Creation Date: 26.05.2015
- source: This metadata is based on the metadata originally created in META-SHARE in 2012. The present metadata should be considered as authoritative.
- original Metadata Schema: META-SHARE
- original Metadata Link: http://metashare.nb.no/repository/browse/the-morphologically-annotated-part-of-bultreebank/b3f0ba40395711e2b66e001708556d5a5db5c7f848dc4048b06b47f7835d6956/
- metadata Language Name: English
- metadata Language Id: en
- metadata Last Date Updated: 05.03.2018
- metadata Creator
- actor Info
- actor Type: person
- person Info
- surname: Lyse
- given Name: Gunn Inger
- sex: female
- position: Researcher (Ph.D)
- affiliation:
- organization Info
- organization Name: University of Bergen
- organization Name: Universitetet i Bergen
- organization Short Name: UiB
- organization Short Name: UoB
- department Name: Department of Linguistic, Literary and Aesthetic Studies
- communication Info
- email: iness@uib.no
- email: clarin@uib.no
- actor Info
- actor Type: person
- person Info
- surname: Simov
- given Name: Kiril
- sex: male
- position: Associate Professor
- affiliation:
- organization Info
- organization Name: Bulgarian Academy of Sciences
- department Name: BulTreeBank Group, Linguistic Modelling Laboratory, IICT
- communication Info
- email: kivs@bultreebank.org
- url: http://www.bultreebank.org/btbmorf/
- address: Acad. G.Bonchev 25A
- zip Code: 1113
- city: Sofia
- country: Bulgaria
- telephone Number: +359888473413
- documentation Unstructured
- role: documentation
- document Unstructured: http://www.bultreebank.org/btbmorf/
- corpus Info
- corpus Type: Written Corpus
- corpus Part Info
- media Type: text
- corpus Part General Info
- source Work Info
- work Description: It contains sentences from Bulgarian Grammar Textbooks, Newspapers, Literature and other sources of texts. For a full text acknowledgement, see: http://www.bultreebank.org/TextAcknowledgements.html
- linguality Info
- linguality Type: monolingual
- language Info
- language Id: bg
- language Name: Bulgarian
- size Info
- size: 214000
- size Unit: tokens
- annotation Info
- annotation Type: morphosyntacticAnnotation-posTagging
- tagset: http://www.bultreebank.org/TechRep/BTB-TR03.pdf
- theoretic Model: HPSG
- annotation Mode: mixed
- annotation Mode Details: The morphological analyzer assigns all possible morphosyntactic analyses to each token. Disambiguation is two-fold: first, a set of 'certain' rules is applied, to ensure full precision; then the rest of the corpus is disambiguated manually. (Source: p.2, http://www.bultreebank.org/TechRep/BTB-TR03.pdf)
- annotation Manual Unstructured
- role: annotationManual
- document Unstructured: Several documents can be found at: http://www.bultreebank.org/TechRep.html. Selected document: Kiril Simov, Petya Osenova and Milena Slavcheva. BTB-TR03: BulTreeBank Morphosyntactic Tagset. BulTreeBank Project Technical Report № 03. 2004
dc:type | corpus |
dc:title | The Morphologically Annotated Part of BulTreeBank |
dc:identifier | oai:clarino.uib.no:bul-treebank |
dc:description | This distribution represents only the morphological information encoded in BulTreeBank – HPSG-based Treebank of Bulgarian. It contains about 214000 tokens. It was used for the training of the TreeTagger for Bulgarian. It contains sentences from Bulgarian Grammar Textbooks, Newspapers, Literature and other sources of texts. Full documentation (Style Book, Tagset description) of the Treebank can be found at: http://www.bultreebank.org/TechRep.html |
dc:publisher | |
dc:format | downloadable |
dc:date | |
dc:date | |
dc:rights | Public |
dc:rights | META-SHARE (MS) |
dc:rights | META-SHARE NonCommercial NoRedistribution (MS-NC-NoReD) |
dc:rights | http://www.meta-net.eu/meta-share/meta-share-licenses/META-SHARE%20NonCommercial%20NoRedistribution-v%201.0.pdf |
dc:lang | Bulgarian |
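
The annotation Mode Details entry above describes a mixed workflow: an analyzer proposes every possible analysis for a token, high-precision rules resolve the cases they are certain about, and the remainder is left to human annotators. The following is a minimal Python sketch of that workflow, not the BulTreeBank implementation. The lexicon, the tag names (DET, NOUN, VERB), the English tokens, and the functions `analyze`, `noun_after_det`, and `disambiguate` are all hypothetical and chosen only for illustration; the real tagset is the one documented in BTB-TR03.

```python
from typing import Callable, Dict, List, Optional, Set, Tuple

# A rule sees the previous token's resolved tag and the current candidates.
# It returns a tag only when it is certain, otherwise None. (Hypothetical interface.)
Rule = Callable[[Optional[str], Set[str]], Optional[str]]


def analyze(token: str, lexicon: Dict[str, Set[str]]) -> Set[str]:
    """Return every candidate tag a toy lexicon lists for the token."""
    return set(lexicon.get(token, {"UNKNOWN"}))


def noun_after_det(prev_tag: Optional[str], candidates: Set[str]) -> Optional[str]:
    """Toy 'certain' rule: after a determiner, pick NOUN if it is a candidate."""
    if prev_tag == "DET" and "NOUN" in candidates:
        return "NOUN"
    return None


def disambiguate(
    tokens: List[str],
    lexicon: Dict[str, Set[str]],
    rules: List[Rule],
) -> Tuple[List[Optional[str]], List[int]]:
    """Apply the rules; return resolved tags and indices left for manual work."""
    resolved: List[Optional[str]] = []
    manual_queue: List[int] = []
    for index, token in enumerate(tokens):
        candidates = analyze(token, lexicon)
        prev_tag = resolved[-1] if resolved else None
        tag: Optional[str] = None
        if len(candidates) == 1:
            # Unambiguous token: nothing to disambiguate.
            tag = next(iter(candidates))
        else:
            # Try each rule in order; accept only answers among the candidates.
            for rule in rules:
                choice = rule(prev_tag, candidates)
                if choice is not None and choice in candidates:
                    tag = choice
                    break
        resolved.append(tag)
        if tag is None:
            # No rule was certain: defer to a human annotator.
            manual_queue.append(index)
    return resolved, manual_queue


# Toy data, purely illustrative.
toy_lexicon = {"the": {"DET"}, "book": {"NOUN", "VERB"}, "run": {"NOUN", "VERB"}}
tags, todo = disambiguate(["the", "book", "run"], toy_lexicon, [noun_after_det])
print(tags, todo)
```

In this sketch a rule never guesses: any token it cannot settle stays untagged and is queued for manual disambiguation, which mirrors the record's statement that rules are applied for full precision and the rest is done by hand.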