Skip to content

Nordic Dialect Corpus v. 4.0

Nordic Dialect Corpus v.4.0 is a corpus of Norwegian, Swedish, Danish, Faroese, Icelandic and Övdalian spoken language. It consists of spontaneous speech data from dialects of the North Germanic languages across all of the Nordic countries. The linguistic data in the corpus comes from a variety of sources, (see homepage – Data Collection), recorded in 1998 – 2015. The corpus contains more than 2.75 million words from conversations and interviews by dialect speakers. It is transcribed and linked to audio and video, has a map function, and can be searched in a large variety of ways. Even if the aim of the corpus is Nordic syntax research, the corpus is a general one, a Norwegian Dialect Corpus, a Swedish Dialect Corpus and so on, to be used in a wide range of research areas, such as phonology, morphology and lexicography.

Note! v. 3.0 contains old recordings and transcriptions from Målførearkivet (Oslo Old Dialect Archive. The same transcriptions are now searchable in LIA Norwegian – Corpus of Old Dialect Recordings.
Use v. 4.0 to search the corpus without the old Målførearkiv recordings.

Nordic Dialect Corpus v.4.0 is a corpus of Norwegian, Swedish, Danish, Faroese, Icelandic and Övdalian spoken language. It consists of spontaneous speech data from dialects of the North Germanic languages across all of the Nordic countries. The linguistic data in the corpus comes from a variety of sources, (see homepage – Data Collection), recorded in 1998 – 2015. The corpus contains more than 2.75 million words from conversations and interviews by dialect speakers. It is transcribed and linked to audio and video, has a map function, and can be searched in a large variety of ways. Even if the aim of the corpus is Nordic syntax research, the corpus is a general one, a Norwegian Dialect Corpus, a Swedish Dialect Corpus and so on, to be used in a wide range of research areas, such as phonology, morphology and lexicography.

Note! v. 3.0 contains old recordings and transcriptions from Målførearkivet (Oslo Old Dialect Archive. The same transcriptions are now searchable in LIA Norwegian – Corpus of Old Dialect Recordings.
Use v. 4.0 to search the corpus without the old Målførearkiv recordings.

Extended metadata

Download resources

Go to resource page