Skip to content

Norsk talespråkskorpus – Oslodelen

NoTa-Oslo is a speech corpus with interviews and conversations from 166 informants born and raised in Oslo and the Oslo area. The informants are carefully selected w.r.t. sociolinguistic variables and therefore representative in terms of age, gender, place of residence and education. NoTa-Oslo consists of approx. 957 000 words that are orthographically transcribed and morphologically tagged. The corpus is searchable in a specially designed search interface, and the transcriptions are linked to audio and video files.

NoTa-Oslo is a speech corpus with interviews and conversations from 166 informants born and raised in Oslo and the Oslo area. The informants are carefully selected w.r.t. sociolinguistic variables and therefore representative in terms of age, gender, place of residence and education. NoTa-Oslo consists of approx. 957 000 words that are orthographically transcribed and morphologically tagged. The corpus is searchable in a specially designed search interface, and the transcriptions are linked to audio and video files.

Extended metadata

Download resources

Go to resource page