LIA sápmi – the LIA corpus of Sami dialects

The LIA Sápmi corpus is a speech corpus with recordings from 1960 – 1990 of Sami dialects from the northern part of Norway, Finland and Sweden, some recordings from NRK sami radio and some from UiT, mostly collected by Niels Jernsletten. The the topics of the interviews and conversations are typically about old trades and traditional life.
The corpus have about 190 000 tokens and 122 speakers from 19 places.
Automatic lemmatization, morphological tagging and translation to Norwegian are done by Giellatekno.

Download resources

index.html

Extended metadata

Go to resource page

Go to resource page https://tekstlab.uio.no/glossa3/saami

dc:type	corpus
dc:title	LIA sápmi – the LIA corpus of Sami dialects
dc:identifier	oai:tekstlab.uio.no:lia-sapmi
dc:description	The LIA Sápmi corpus is a speech corpus with recordings from 1960 – 1990 of Sami dialects from the northern part of Norway, Finland and Sweden, some recordings from NRK sami radio and some from UiT, mostly collected by Niels Jernsletten. The the topics of the interviews and conversations are typically about old trades and traditional life. The corpus have about 190 000 tokens and 122 speakers from 19 places. Automatic lemmatization, morphological tagging and translation to Norwegian are done by Giellatekno.
dc:publisher
dc:format	accessibleThroughInterface
dc:date	2014-04-01
dc:date	2019-11-01
dc:rights	Academic
dc:rights	CLARIN
dc:rights	CLARIN_ACA-NC-LOC-PRIV-ND-*
dc:rights	https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&PRIV=1&NORED=1&ND=1
dc:creator	The LIA project (Project participants and employees in the LIA project)
dc:lang	Northern sami

LIA sápmi – the LIA corpus of Sami dialects

Download resources

Extended metadata

Dublin Core (DC)

Go to resource page