Tagged texts in Norwegian Bokmål from NBdigital (public domain material)

CLARINO NB – Språkbanken

License: Creative Commons Zero (CC-ZERO)

Updated: 2016-03-07

The resource contains 4808 morphologically tagged texts in Norwegian Bokmål from NB's corpus of free texts.

All texts were published after 1960. The texts were automatically tagged with the Oslo-Bergen tagger (see: http://www.tekstlab.uio.no/obt-ny/), with syntactic disambiguation. In theory, this should give a tagging accuracy of approx. 96.5%. However, the texts have been OCRed (average word confidence of about 90%), so the overall accuracy is considerably lower.
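As a rough illustration of how the two figures combine, the sketch below multiplies them. This is a back-of-the-envelope estimate and not a measured result: it assumes that the OCR word confidence can be read as the share of correctly recognised words, that a misrecognised word never receives a correct tag, and that the tagger keeps its nominal accuracy on the correctly recognised words. The function name is hypothetical.

```python
def estimate_overall_accuracy(tagger_accuracy: float, ocr_word_accuracy: float) -> float:
    """Rough estimate of tagging accuracy on OCRed text.

    Assumption (not from the resource description): a word is tagged
    correctly only if it was both recognised correctly by OCR and
    tagged correctly, and the two events are independent.
    """
    for value in (tagger_accuracy, ocr_word_accuracy):
        if not 0.0 <= value <= 1.0:
            raise ValueError("accuracies must be fractions between 0 and 1")
    return tagger_accuracy * ocr_word_accuracy


# Figures quoted in the description: 96.5% tagger accuracy, ca. 90% OCR word confidence.
estimate = estimate_overall_accuracy(0.965, 0.90)
print(f"Estimated overall accuracy: {estimate:.1%}")
```

Under these assumptions the estimate comes out at roughly 87%. The real figure may differ, since OCR errors can also disturb the tagging of neighbouring words.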

