Skip to content

Norwegian Dependency Treebank

The Norwegian Dependency Treebank (NDT) consists of text which is manually annotated with morphological features, syntactic functions and hierarchical structure. The formalism used for the syntactic annotation is dependency grammar. With a few exceptions, the syntactic analysis follows Norsk referensegrammatikk ‘Norwegian Reference Grammar’.

The treebank consists of two parts, containing 300.000 tokens (words and punctuation) each for Norwegian Bokmål and Nynorsk, respectively.

The Norwegian Dependency Treebank (NDT) consists of text which is manually annotated with morphological features, syntactic functions and hierarchical structure. The formalism used for the syntactic annotation is dependency grammar. With a few exceptions, the syntactic analysis follows Norsk referensegrammatikk ‘Norwegian Reference Grammar’.

The treebank consists of two parts, containing 300.000 tokens (words and punctuation) each for Norwegian Bokmål and Nynorsk, respectively.

Extended metadata

Download resources

Download metadata