Nordic Dialect Corpus – downloadable transcriptions

Nordic Dialect Corpus v. 4.0 is a corpus of Norwegian, Swedish, Danish, Faroese, Icelandic and Övdalian spoken language. It consists of spontaneous speech data from dialects of the North Germanic languages across all of the Nordic countries. The linguistic data in the corpus comes from a variety of sources, (see homepage – Data Collection), recorded in 1998 – 2015. The corpus contains more than 2.75 million words from conversations and interviews by dialect speakers.

The downloadable version of the corpus contains all transcriptions in the corpus, both in txt and html format. The Norwegian and Övdaliantranscriptions are available in to versions: one phonetic and one orthographic. The other transcriptions are orthographically transcribed.

Download resources

Extended metadata

Go to resource page

Go to resource page http://www.tekstlab.uio.no/nota/scandiasyn/

dc:type	corpus
dc:title	Nordic Dialect Corpus – downloadable transcriptions
dc:identifier	oai:tekstlab.uio.no:nordic-dialect-corpus-transcriptions
dc:description	Nordic Dialect Corpus v. 4.0 is a corpus of Norwegian, Swedish, Danish, Faroese, Icelandic and Övdalian spoken language. It consists of spontaneous speech data from dialects of the North Germanic languages across all of the Nordic countries. The linguistic data in the corpus comes from a variety of sources, (see homepage – Data Collection), recorded in 1998 – 2015. The corpus contains more than 2.75 million words from conversations and interviews by dialect speakers. The downloadable version of the corpus contains all transcriptions in the corpus, both in txt and html format. The Norwegian and Övdaliantranscriptions are available in to versions: one phonetic and one orthographic. The other transcriptions are orthographically transcribed.
dc:publisher
dc:format	downloadable
dc:date	2005-01-01
dc:date	2019-09-31
dc:rights	Public
dc:rights	Creative Commons (CC)
dc:rights	Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
dc:rights	http://creativecommons.org/licenses/by-nc-sa/4.0/
dc:lang	Norwegian Bokmål (the orthographic transcriptions)
dc:lang	Swedish (Övdalien included)
dc:lang	Danish
dc:lang	Icelandic
dc:lang	Faroese

Nordic Dialect Corpus – downloadable transcriptions

Download resources

Extended metadata

Dublin Core (DC)

Go to resource page