This distribution represents the BulTreeBank, as distributed via the INESS infrastructure. Integration into INESS means that the treebank is now indexed and searchable within the INESS treebanking infrastructure.
To download the treebank (rather than only search it), use its original site (see http://www.bultreebank.org).
General info about the treebank (taken from http://www.bultreebank.org): This distribution represents the dependency information encoded in BulTreeBank, an HPSG-based treebank of Bulgarian.
It contains about 196,000 tokens in sentences drawn from Bulgarian grammar textbooks, newspapers, literature and other text sources. Full documentation (Style Book, Tagset description) of the treebank is available at http://www.bultreebank.org/TechRep.html. The BulTreeBank-DP is provided in the CoNLL-X shared task table format.
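For readers who download the data, the CoNLL-X table format can be read with a few lines of Python. The sketch below assumes the files follow the standard CoNLL-X layout (ten tab-separated columns per token, sentences separated by blank lines). The function name `read_conllx` and the two sample rows are illustrative only; the sample tokens and tags are placeholders, not taken from the treebank.

```python
from typing import Dict, Iterable, Iterator, List

# The ten columns defined by the CoNLL-X shared task format.
CONLLX_FIELDS = [
    "id", "form", "lemma", "cpostag", "postag",
    "feats", "head", "deprel", "phead", "pdeprel",
]


def read_conllx(lines: Iterable[str]) -> Iterator[List[Dict[str, str]]]:
    """Yield one sentence at a time as a list of token dictionaries.

    Hypothetical helper: assumes tab-separated columns and blank lines
    between sentences, as in the standard CoNLL-X layout.
    """
    sentence: List[Dict[str, str]] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        values = line.split("\t")
        if len(values) != len(CONLLX_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(values)}: {line!r}")
        sentence.append(dict(zip(CONLLX_FIELDS, values)))
    # Emit the last sentence if the input does not end with a blank line.
    if sentence:
        yield sentence


# Placeholder rows for illustration; not real treebank content.
SAMPLE = (
    "1\tToken1\t_\tN\tN\t_\t2\tsubj\t_\t_\n"
    "2\tToken2\t_\tV\tV\t_\t0\tROOT\t_\t_\n"
    "\n"
)

for sent in read_conllx(SAMPLE.splitlines()):
    for tok in sent:
        print(tok["id"], tok["form"], tok["head"], tok["deprel"])
```

To read a downloaded file instead of the sample, pass an open file handle (opened with `encoding="utf-8"`) to `read_conllx`. All values are kept as strings, so `head` must be converted with `int()` before it is used as an index.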
Extended metadata
resource Common Info
resource Type: corpus
identification Info
resource Name: The Dependency Part of BulTreeBank
description: This distribution represents the BulTreeBank, as distributed via the INESS infrastructure. Integration into INESS means that the treebank is now indexed and searchable within the INESS treebanking infrastructure.
To download the treebank (rather than only search it), use its original site (see http://www.bultreebank.org).
General info about the treebank (taken from http://www.bultreebank.org): This distribution represents the dependency information encoded in BulTreeBank, an HPSG-based treebank of Bulgarian.
It contains about 196,000 tokens in sentences drawn from Bulgarian grammar textbooks, newspapers, literature and other text sources. Full documentation (Style Book, Tagset description) of the treebank is available at http://www.bultreebank.org/TechRep.html. The BulTreeBank-DP is provided in the CoNLL-X shared task table format.
attribution Text: To refer to the BulTreeBank, use the following reference in your scientific publications: Kiril Simov. BTB-TR01: BulTreeBank Project Overview. BulTreeBank Project Technical Report № 01. 2004. URL: http://www.bultreebank.org/TechRep.html.
If you use INESS in your research, please link to the INESS webpage (http://clarino.uib.no/iness) in materials included with your data. We suggest the following reference in your scientific publications: Victoria Rosén, Koenraad De Smedt, Paul Meurer, and Helge Dyvik. An open infrastructure for advanced treebanking. In Jan Hajič, Koenraad De Smedt, Marko Tadić, and António Branco (eds.) META-RESEARCH Workshop on Advanced Treebanking at LREC2012, pages 22–29, Istanbul, Turkey, May 2012.
non Standard Conditions Of Use: From http://www.bultreebank.org/dpbtb/: "If you are interested in using BulTreeBank-DP, please, fill in the user agreement form, print it, scan it and send it to Kiril Simov. If not possible to send it electronically, please, send it by regular mail to: [read more at: http://www.bultreebank.org/dpbtb/]"
ipr Holder
actor Info
actor Type: person
person Info
surname: Simov
given Name: Kiril
sex: male
position: Associate Professor
affiliation:
organization Info
organization Name: Bulgarian Academy of Sciences
department Name: BulTreeBank Group, Linguistic Modelling Laboratory, IICT
source: This metadata is based on the metadata originally created in META-SHARE in 2012. The present metadata should be considered as authoritative. Information has been gathered from the bultreebank.org webpages, see the "Resource documentation info" in the present metadata.
work Description: It contains sentences from Bulgarian Grammar Textbooks, Newspapers, Literature and other sources of texts. For a full text acknowledgement, see: http://www.bultreebank.org/TextAcknowledgements.html
linguality Info
linguality Type: monolingual
language Info
language Id: bg
language Name: Bulgarian
size Info
size: 196000
size Unit: tokens
annotation Info
annotation Type: syntacticAnnotation-treebanks
annotation Format: CoNLL-X shared task table format