Skip to content
Nasjonalbiblioteket |Språkbanken
  • Norsk
  • The Norwegian Language Bank
  • Resource Catalogue

I samarbeid med CLARINO, illustration

Type

Origin

  • Text, Video 06.05.2025

    Administrative law concepts in Norwegian sign language

    This dataset consists of 32 films with explanations of key administrative law concepts in Norwegian sign language. The films are produced at the Department of Public and International Law, The Faculty …
    Language:
    Norwegian Sign Language
    Origin:
    Language Bank
    Licence:
    Creative_Commons-BY-NC (CC-BY-NC)
    Type:
    Text, Video
    Updated:
    06.05.2025
  • Text 16.04.2025

    Norwegian Newspaper Corpus annotated (2001-2009)

    This is a subpart of the Norwegian Newspaper Corpus for bokmål, grammatically annotated with information about each word’s lemma, part of speech (word class) and morphological analysis based on an …
    Language:
    Norwegian, Norwegian Bokmål
    Origin:
    CLARINO Bergen Centre
    Licence:
    Creative_Commons-BY-NC (CC-BY-NC)
    Type:
    Text
    Updated:
    16.04.2025
  • Text 14.04.2025

    Norwegian Newspaper Corpus Nynorsk

    The Norwegian Newspaper Corpus (Nynorsk) is a freely accessible text corpus representing modern Norwegian in the written variety Norwegian Nynorsk. As of today, the material contains texts from 1998 …
    Language:
    Norwegian, Norwegian Nynorsk
    Origin:
    CLARINO Bergen Centre
    Licence:
    Creative_Commons-BY (CC-BY)
    Type:
    Text
    Updated:
    14.04.2025
  • Text 31.01.2025

    Målfrid 2025 – Freely Available Documents from Norwegian State Institutions

    This corpus consists of documents from 493 domains of Norwegian state institutions and comprises approximately 2.4 billion tokens in total. In addition to Norwegian Bokmål and Nynorsk texts, the …
    Language:
    Norwegian Bokmål, Norwegian Nynorsk, English, Northern Sami, Southern Sami, Lule Sami
    Origin:
    Language Bank
    Licence:
    Norwegian Licence for Open Government Data (NLOD)
    Type:
    Text
    Updated:
    31.01.2025
  • Tool 28.01.2025

    Synthetic text images for North, South, Lule and Inare Sámi

    This dataset contains synthetic line images meant for fitting OCR models for North, South, Lule and Inari Sámi. Clean line images are created using Pillow and they are subsequently distorted using …
    Language:
    Origin:
    Language Bank
    Licence:
    Creative_Commons-BY (CC-BY)
    Type:
    Tool
    Updated:
    28.01.2025
  • Tool 22.01.2025

    OCR Models for Sámi Languages

    This is a collection of models for OCR (optical character recognition) of Sámi languages. These can be used to recognize text in images of printed text (scanned books, magazines, etc.) in North …
    Language:
    Origin:
    Language Bank
    Licence:
    Creative_Commons-BY (CC-BY)
    Type:
    Tool
    Updated:
    22.01.2025
  • Text 10.10.2024

    Norwegian idioms

    This dataset consists of 3537 Norwegian idioms and phrases that appear more than 100 times in the online library of the National Library of Norway. There are 3455 idioms in Norwegian Bokmål and 88 in …
    Language:
    Norwegian Bokmål, Norwegian Nynorsk
    Origin:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Text
    Updated:
    10.10.2024
  • Speech 10.07.2024

    Norwegian Government Press Conference Speech Corpus

    The Norwegian Government Press Conference Speech Corpus (NorGovPCC) consists of approximately 138 hours of speech generated from audio with aligned subtitles from press conferences published by the …
    Origin:
    Language Bank
    Licence:
    Norwegian Licence for Open Government Data (NLOD)
    Type:
    Speech
    Updated:
    10.07.2024
  • Speech, Text 23.03.2024

    TeflonNorL2

    This page is currently a placeholder for the Norwegian data in the Teflon project. The Teflon project (https://teflon.aalto.fi/) aims at studying computer assisted language learning for immigrant …
    Language:
    Norwegian
    Origin:
    Language Bank
    Licence:
    unspecified
    Type:
    Speech, Text
    Updated:
    23.03.2024
  • Tool 09.02.2024

    Grapheme-to-Phoneme Models for Norwegian Bokmål

    This resource contains Grapheme-to-Phoneme (G2P) models for Norwegian Bokmål, which have been adapted to the G2P system Phonetisaurus (https://github.com/AdolfVonKleist/Phonetisaurus). The G2P models …
    Language:
    Origin:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Tool
    Updated:
    09.02.2024

Åpningstider

Mandag–fredag: 09:00–21:00
Lørdag: 10:00–18:00

Les mer om åpningstider og tilbud

Besøksadresse

Henrik Ibsens gate 110, Oslo
Finsetveien 2, Mo i Rana

Kontaktinformasjon

E-post: nb@nb.no
Telefon: 23 27 60 00

Snarveier

  • NB in English
  • Spør Nasjonalbiblioteket
  • Ledige stillinger
  • Standardnummerering
  • Adresseoversikt
  • Hjelp og informasjon
  • Tilgjengelighetserklæring
  • Presse
  • Offentlig postjournal
  • Om oss
  • Pliktavlevering
  • Personvernerklæring

Ansvarlig redaktør

Aslak Sira Myhre

Organisasjonsnummer

976 029 100

Sosiale medier

  • Instagram
  • Facebook