NorGrambank children’s fiction in Norwegian Nynorsk
The treebank «NorGrambank children’s fiction in Norwegian Nynorsk» is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata).
As of October 2015, the treebank comprises 106434 sentences, 1043260 words, 76 documents.
The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing.
The treebank «NorGrambank children’s fiction in Norwegian Nynorsk» is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata).
As of October 2015, the treebank comprises 106434 sentences, 1043260 words, 76 documents.
The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing.
Utvidet metadata
dc:type
corpus
dc:title
NorGrambank children's fiction in Norwegian Nynorsk
dc:identifier
oai:clarino.uib.no:nno-child
dc:description
The treebank "NorGrambank children's fiction in Norwegian Nynorsk" is a syntactically annotated corpus based on data taken from bokhylla.no at the National Library of Norway. This treebank is part of INESS NorGramBank collection (see URL in metadata).
As of October 2015, the treebank comprises 106434 sentences, 1043260 words, 76 documents.
The source text was OCR-read by the National Library of Norway; INESS has preprocessed the source text semi-automatically with regard to OCR errors (misinterpreted letters etc) before syntactic parsing.