Nasjonalbiblioteket | Språkbanken
  • Text 19.02.2020

    The KIAP corpus

    KIAP is a corpus of 450 research articles covering three disciplines (economics, linguistics and medicine) and three languages (English, French and Norwegian). It is available in Corpuscle at the …
    Language:
    Norwegian, French, English
    Distributed by:
    CLARINO Bergen Centre
    Licence:
    Creative_Commons-BY (CC-BY)
    Type:
    Text
    Updated:
    19.02.2020
  • Speech, Text 15.01.2020

    TAUS – The spoken language investigation in Oslo

    The material from TAUS (The spoken language investigation in Oslo) is based on informal interviews with people from Oslo. The interviews were conducted in 1971-73. The informants are mainly from two …
    Language:
    Norwegian, Norwegian Bokmål
    Distributed by:
    CLARINO Text Laboratory Centre
    Licence:
    CLARIN_ACA-NC-LOC-PRIV-ND-*
    Type:
    Speech, Text
    Updated:
    15.01.2020
  • Text 15.01.2020

    TAUS – downloadable transcriptions

    TAUS (The spoken language investigation in Oslo) v.3 is a speech corpus with 86 speakers and 387 551 tokens. The downloadable version of the corpus contains the transcriptions, approx. 387 500 tokens, …
    Language:
    Norwegian, Norwegian Bokmål
    Distributed by:
    CLARINO Text Laboratory Centre
    Licence:
    Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
    Type:
    Text
    Updated:
    15.01.2020
  • Text 11.12.2019

    Discussions from Wikipedia

    This corpus is a dump of discussion threads from the Norwegian Wikipedia, where authors discuss various issues regarding the publication of specific Wikipedia articles. The material is split into two …
    Language:
    Norwegian Bokmål, Norwegian Nynorsk
    Distributed by:
    Language Bank
    Licence:
    Creative_Commons-BY-SA (CC-BY-SA)
    Type:
    Text
    Updated:
    11.12.2019
  • Speech, Text, Video 01.11.2019

    Corpus of American Nordic Speech v.3.1

    CANS v.3.1 (Corpus of American Nordic Speech) is a speech corpus with speakers from the USA and Canada speaking Norwegian and Swedish. Most of the informants learnt to speak their Nordic language as …
    Language:
    Norwegian Bokmål, Swedish
    Distributed by:
    CLARINO Text Laboratory Centre
    Licence:
    CLARIN_ACA-NC-LOC-PRIV-ND-*
    Type:
    Speech, Text, Video
    Updated:
    01.11.2019
  • Text 01.11.2019

    Corpus of American Nordic Speech – downloadable transcriptions

    CANS v.3.1 (Corpus of American Nordic Speech) is a speech corpus with speakers from the USA and Canada speaking Norwegian and Swedish. Most of the informants learnt to speak their Nordic language as …
    Language:
    Norwegian Bokmål, Swedish
    Distributed by:
    CLARINO Text Laboratory Centre
    Licence:
    Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
    Type:
    Text
    Updated:
    01.11.2019
  • Speech, Text 01.11.2019

    LIA Sápmi – the LIA corpus of Sami dialects

    The LIA Sápmi corpus is a speech corpus with recordings from 1960-1990 of Sami dialects from the northern parts of Norway, Finland and Sweden, some recordings from NRK Sami radio and some from UiT, …
    Language:
    Northern Sami
    Distributed by:
    CLARINO Text Laboratory Centre
    Licence:
    CLARIN_ACA-NC-LOC-PRIV-ND-*
    Type:
    Speech, Text
    Updated:
    01.11.2019
  • Text 01.10.2019

    Nordic Dialect Corpus – downloadable transcriptions

    Nordic Dialect Corpus v.4.0 is a corpus of Norwegian, Swedish, Danish, Faroese, Icelandic and Övdalian spoken language. It consists of spontaneous speech data from dialects of the North Germanic …
    Language:
    Norwegian Bokmål (the orthographic transcriptions), Swedish (Övdalian included), Danish, Icelandic, Faroese
    Distributed by:
    CLARINO Text Laboratory Centre
    Licence:
    Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
    Type:
    Text
    Updated:
    01.10.2019
  • Speech, Text, Video 01.10.2019

    Nordic Dialect Corpus v. 4.0

    Nordic Dialect Corpus v.4.0 is a corpus of Norwegian, Swedish, Danish, Faroese, Icelandic and Övdalian spoken language. It consists of spontaneous speech data from dialects of the North Germanic …
    Language:
    Norwegian Bokmål (the orthographic transcriptions), Swedish (Övdalian included), Danish, Icelandic, Faroese
    Distributed by:
    CLARINO Text Laboratory Centre
    Licence:
    CLARIN_ACA-NC-LOC-PRIV-ND-*
    Type:
    Speech, Text, Video
    Updated:
    01.10.2019
  • Text 29.05.2019

    NorNE – Norwegian Named Entities

    NorNE (Norwegian Named Entities) is a text corpus composed of the same texts as the Norwegian Dependency Treebank (NDT), but this version is additionally annotated with named entities. The corpus contains …
    Language:
    Norwegian Bokmål, Norwegian Nynorsk
    Distributed by:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Text
    Updated:
    29.05.2019
