Skip to content

NorGramBank fiction in Norwegian Nynorsk

The “NorGramBank fiction in Norwegian Nynorsk” treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata).
As of October 2015, the treebank comprises 260285 sentences, 2884376 words and 91 documents. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing.

The “NorGramBank fiction in Norwegian Nynorsk” treebank is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata).
As of October 2015, the treebank comprises 260285 sentences, 2884376 words and 91 documents. The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing.

Extended metadata

Download metadata

Go to resource page