NorGramBank – Fiction in Norwegian Bokmål

Clarino UiB

License: CLARIN_ACA

Updated: 2015-10-14

The "NorGramBank fiction in Norwegian Bokmål" treebank is a syntactically annotated corpus based on data from bokhylla.no at the National Library of Norway. It is part of the INESS NorGramBank collection (see the URL in the metadata).

As of October 2015, the treebank comprises 2 469 916 sentences and 26 903 637 words.

The source text was produced by OCR at the National Library of Norway. Before syntactic parsing, INESS preprocessed the text semi-automatically to correct OCR errors (misread letters, etc.).

