Nasjonalbiblioteket | Språkbanken
  • The Norwegian Language Bank
  • Resource Catalogue

In collaboration with CLARINO

  • Text 31.01.2024

    Målfrid 2024 – Freely Available Documents from Norwegian State Institutions

    This corpus consists of documents from 497 domains of Norwegian state institutions and comprises approximately 2.6 billion tokens in total. In addition to Norwegian Bokmål and Nynorsk texts, the …
    Language:
    Norwegian Bokmål, Norwegian Nynorsk, English, Northern Sami, Southern Sami, Lule Sami
    Origin:
    Language Bank
    Licence:
    Norwegian Licence for Open Government Data (NLOD)
    Type:
    Text
    Updated:
    31.01.2024
  • Tool 11.01.2024

    Glossa

    Glossa is a tool for researchers who want to search linguistically annotated corpora. Glossa is designed to make it easy for researchers to: - create complex searches - explore the result via e.g. …
    Origin:
    CLARINO Text Laboratory Centre
    Licence:
    MIT license
    Type:
    Tool
    Updated:
    11.01.2024
  • Speech, Text 19.12.2023

    NST Norwegian ASR Database (16 kHz) – Reorganized

    This database was created by Nordic Language Technology for the development of automatic speech recognition and dictation in Norwegian. In this version (from 2022), the organization of the data has …
    Language:
    Norwegian
    Origin:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Speech, Text
    Updated:
    19.12.2023
  • Tool 20.11.2023

    Mapping between Norwegian municipalities and dialect regions

    This resource provides a mapping between Norwegian municipalities and dialect regions, and can be used, e.g., to infer the dialect region of a speaker in a speech dataset based on their place of …
    Origin:
    Language Bank
    Licence:
    Creative_Commons-BY (CC-BY)
    Type:
    Tool
    Updated:
    20.11.2023
  • Speech, Text 15.11.2023

    Stortinget Speech Corpus version 1.0

    The Stortinget Speech Corpus (SSC) is a speech dataset of more than 5,000 hours for weakly supervised ASR, created from audio and aligned proceedings text from Stortinget, the Norwegian Parliament. It contains …
    Language:
    Norwegian
    Origin:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Speech, Text
    Updated:
    15.11.2023
  • Text 27.10.2023

    NDT 2.0 with Constituent Structure

    In this version of the Norwegian Dependency Treebank 2.0, a constituent structure (c-structure) similar to the one found in NorGramBank has been added. This can be used to train one syntactic parser for …
    Language:
    Norwegian Bokmål, Norwegian Nynorsk
    Origin:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Text
    Updated:
    27.10.2023
  • Tool 20.10.2023

    spaCy for Norwegian Nynorsk

    These spaCy models are trained on the NorNE dataset in a version compatible with Universal Dependencies. spaCy is a widely used Python library for language technology applications. spaCy does not …
    Origin:
    Language Bank
    Licence:
    MIT license
    Type:
    Tool
    Updated:
    20.10.2023
  • Text 24.08.2023

    Norwegian Dependency Treebank 2.0

    This is version 2.0 of the Norwegian Dependency Treebank (NDT), developed by the National Library of Norway in 2011-2014. In version 2.0 of NDT, the grammatical annotations remain the same as in the …
    Language:
    Norwegian Bokmål, Norwegian Nynorsk
    Origin:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Text
    Updated:
    24.08.2023
  • Speech, Text 18.08.2023

    Norwegian Conversation Speech Corpus

    NB Samtale is a speech corpus made by the Language Bank at the National Library of Norway. The corpus contains orthographically transcribed speech from podcasts and recordings of live events at the …
    Language:
    Norwegian
    Origin:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Speech, Text
    Updated:
    18.08.2023
  • Speech, Text 13.07.2023

    Norwegian Parliamentary Speech Corpus 2.0

    This is version 2.0 of the Norwegian Parliamentary Speech Corpus (NPSC). In version 2.0, a number of changes have been made to the transcriptions, and some identified errors in the corpus have been …
    Language:
    Norwegian
    Origin:
    Language Bank
    Licence:
    Creative_Commons-ZERO (CC-ZERO)
    Type:
    Speech, Text
    Updated:
    13.07.2023

Opening hours

Monday–Friday: 09:00–21:00
Saturday: 10:00–18:00

Visiting address

Henrik Ibsens gate 110, Oslo
Finsetveien 2, Mo i Rana

Contact information

Email: nb@nb.no
Phone: 23 27 60 00

Responsible editor

Aslak Sira Myhre

Organisation number

976 029 100