The LIA Treebank

The LIA Treebank includes 5250 speech segments and 55 410 tokens from the speech corpus LIA Norwegian. The treebank is annotated with morphological and dependency-style syntactic analysis and manually corrected. The treebank is available in both conllx-format and conllu-format.

LIA Norwegian is a speech corpus with old recordings (1939 - 1996) from four Norwegian universities: NTNU, UoB, UoO and UoT.

The LIA Treebank includes 5250 speech segments and 55 410 tokens from the speech corpus LIA Norwegian. The treebank is annotated with morphological and dependency-style syntactic analysis and manually corrected. The treebank is available in both conllx-format and conllu-format.

LIA Norwegian is a speech corpus with old recordings (1939 - 1996) from four Norwegian universities: NTNU, UoB, UoO and UoT.

Extended metadata

Download resources

Download metadata