The SKRIV Corpus

Clarino - Textlab

License: CLARIN_ACA-NC-LOC-ND

Updated: 2017-08-14

Texts written by students in upper secondary vocational education programs. The corpus is especially suitable for the analysis of texts written by students with Norwegian as their second language.

The corpus contains approximately 225 texts and 112,000 words. The texts differ in length, genre and type.

The corpus has three versions of each text: a scanned original in PDF format and two transcriptions in TXT format, one that preserves the errors of the original and one in which the errors are corrected.

All versions are linked, and both transcribed versions can be searched.

