<OAI-PMH xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.openarchives.org/OAI/2.0/" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/          http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-06-03T05:44:44.313Z</responseDate>
  <request verb="GetRecord">https://www.nb.no/sprakbanken/oai</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:nb.no:sbr-85</identifier>
        <datestamp/>
      </header>
      <metadata>
        <cmd:CMD xmlns:cmd="http://www.clarin.eu/cmd/1" xmlns="http://www.clarin.eu/cmd/" xmlns:cmdp="http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1407745711925" CMDVersion="1.2" xsi:schemaLocation="http://www.clarin.eu/cmd/1 https://infra.clarin.eu/CMDI/1.x/xsd/cmd-envelop.xsd http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1407745711925 https://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/1.1/profiles/clarin.eu:cr1:p_1407745711925/1.2/xsd">
          <cmd:Header>
            <cmd:MdCreator>Arne Martinus Lindstad</cmd:MdCreator>
            <cmd:MdCreationDate>2023-08-18</cmd:MdCreationDate>
            <cmd:MdSelfLink>https://www.nb.no/sprakbanken/oai?verb=GetRecord&amp;identifier=oai:nb.no:sbr-85&amp;metadataPrefix=cmdi</cmd:MdSelfLink>
            <cmd:MdProfile>clarin.eu:cr1:p_1407745711925</cmd:MdProfile>
            <cmd:MdCollectionDisplayName>Språkbanken NB</cmd:MdCollectionDisplayName>
          </cmd:Header>
          <cmd:Resources>
            <cmd:ResourceProxyList>
              <cmd:ResourceProxy id="nbsam_wav">
                <cmd:ResourceType mimetype="application/zip">Resource</cmd:ResourceType>
                <cmd:ResourceRef>https://www.nb.no/sbfil/taledata/nb_samtale.zip</cmd:ResourceRef>
              </cmd:ResourceProxy>
              <cmd:ResourceProxy id="nbsam_txt">
                <cmd:ResourceType mimetype="text/plain">Resource</cmd:ResourceType>
                <cmd:ResourceRef>https://www.nb.no/sbfil/taledata/backslash_notations.txt</cmd:ResourceRef>
              </cmd:ResourceProxy>
              <cmd:ResourceProxy id="nbsam_trans">
                <cmd:ResourceType mimetype="application/pdf">Resource</cmd:ResourceType>
                <cmd:ResourceRef>https://www.nb.no/sbfil/taledata/NB_Samtale_transcription_guidelines.pdf</cmd:ResourceRef>
              </cmd:ResourceProxy>
              <cmd:ResourceProxy id="nbsam_doc">
                <cmd:ResourceType mimetype="application/pdf">Resource</cmd:ResourceType>
                <cmd:ResourceRef>https://www.nb.no/sbfil/taledata/NB_Samtale_About_the_corpus.pdf</cmd:ResourceRef>
              </cmd:ResourceProxy>
            </cmd:ResourceProxyList>
            <cmd:JournalFileProxyList/>
            <cmd:ResourceRelationList/>
          </cmd:Resources>
          <cmd:IsPartOfList/>
          <cmd:Components>
            <cmdp:corpusProfile>
              <cmdp:resourceCommonInfo>
                <cmdp:resourceType>corpus</cmdp:resourceType>
                <cmdp:identificationInfo>
                  <cmdp:resourceName xml:lang="nb">NB Samtale</cmdp:resourceName>
                  <cmdp:resourceName xml:lang="en">Norwegian Conversation Speech Corpus</cmdp:resourceName>
                  <cmdp:description xml:lang="nb">NB Samtale er et talekorpus med ortografisk transkribert lydmateriale hentet fra podkaster og opptak av arrangementer på Nasjonalbiblioteket. Korpuset inneholder samtaler mellom flere personer, og talen er spontan og har typiske trekk ved muntlig språk. Lydmaterialet er valgt ut med tanke på god balanse mellom kjønnene og god dialektvariasjon, og korpuset har transkripsjoner på både bokmål og nynorsk.

NB Samtale er tenkt som et open-source-datasett for trening av automatisk talegjenkjenning, spesifikt gjenkjenning av spontan tale mellom flere personer i samtale. Det er til sammen 24 timer transkribert tale fra 69 talere fordelt på 12.080 segmenter som hver er en individuell WAV-fil. Metadataene inneholder blant annet informasjon om segmentenes kildefil, tidskode og varighet, samt talernes kjønn, dialekt og målform.

NB Samtale er utviklet av Språkbanken ved Nasjonalbiblioteket. Vi setter stor pris på tilbakemeldinger og forslag til forbedringer. Kontakt oss på sprakbanken@nb.no.</cmdp:description>
                  <cmdp:description xml:lang="en">NB Samtale is a speech corpus made by the Language Bank at the National Library of Norway. The corpus contains orthographically transcribed speech from podcasts and recordings of live events at the National Library. The corpus is intended as an open source dataset for Automatic Speech Recognition (ASR) development, and is specifically aimed at improving ASR systems' handle on conversational speech.

The corpus consists of 12,080 segments, a total of 24 hours transcribed speech from 69 speakers. The corpus ensures both gender and dialect variation, and speakers from five broad dialect areas are represented. Both Bokmål and Nynorsk transcriptions are present in the corpus, with Nynorsk making up approximately 25% of the transcriptions.

We greatly appreciate feedback and suggestions for improvements. PLease contact us at sprakbanken@nb.no.</cmdp:description>
                  <cmdp:url cmd:description="resource homepage">https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-85/</cmdp:url>
                  <cmdp:PID cmd:description="handle">hdl:21.11146/85</cmdp:PID>
                  <cmdp:identifier>sbr-85</cmdp:identifier>
                </cmdp:identificationInfo>
                <cmdp:distributionInfo>
                  <cmdp:licenceInfo>
                    <cmdp:userCategory>Public</cmdp:userCategory>
                    <cmdp:distributionAccessMedium>downloadable</cmdp:distributionAccessMedium>
                    <cmdp:downloadLocation cmd:description="resource homepage">https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-85/</cmdp:downloadLocation>
                    <cmdp:licence>
                      <cmdp:licenceFamily>Creative Commons (CC)</cmdp:licenceFamily>
                      <cmdp:licenceName>Creative_Commons-ZERO (CC-ZERO)</cmdp:licenceName>
                      <cmdp:licenceURL>https://creativecommons.org/publicdomain/zero/1.0/</cmdp:licenceURL>
                    </cmdp:licence>
                    <cmdp:licensor>
                      <cmdp:actorInfo>
                        <cmdp:actorType>organization</cmdp:actorType>
                        <cmdp:role xml:lang="en">Licensor</cmdp:role>
                        <cmdp:organizationInfo>
                          <cmdp:organizationName xml:lang="nb">Nasjonalbiblioteket</cmdp:organizationName>
                          <cmdp:organizationName xml:lang="en">National Library of Norway</cmdp:organizationName>
                          <cmdp:organizationShortName xml:lang="nb">NB</cmdp:organizationShortName>
                          <cmdp:organizationShortName xml:lang="en">NLN</cmdp:organizationShortName>
                          <cmdp:departmentName xml:lang="nb">Språkbanken</cmdp:departmentName>
                          <cmdp:departmentName xml:lang="en">The Language Bank</cmdp:departmentName>
                        </cmdp:organizationInfo>
                        <cmdp:communicationInfo>
                          <cmdp:email>sprakbanken@nb.no</cmdp:email>
                          <cmdp:url>https://www.nb.no/sprakbanken/</cmdp:url>
                          <cmdp:address>P.O. Box 2674 Solli</cmdp:address>
                          <cmdp:zipCode>0203</cmdp:zipCode>
                          <cmdp:city>Oslo</cmdp:city>
                          <cmdp:region>Oslo</cmdp:region>
                          <cmdp:country>Norway</cmdp:country>
                        </cmdp:communicationInfo>
                      </cmdp:actorInfo>
                    </cmdp:licensor>
                  </cmdp:licenceInfo>
                </cmdp:distributionInfo>
                <cmdp:contact>
                  <cmdp:actorInfo>
                    <cmdp:actorType>organization</cmdp:actorType>
                    <cmdp:role xml:lang="en">Contact</cmdp:role>
                    <cmdp:organizationInfo>
                      <cmdp:organizationName xml:lang="nb">Nasjonalbiblioteket</cmdp:organizationName>
                      <cmdp:organizationName xml:lang="en">National Library of Norway</cmdp:organizationName>
                      <cmdp:organizationShortName xml:lang="nb">NB</cmdp:organizationShortName>
                      <cmdp:organizationShortName xml:lang="en">NLN</cmdp:organizationShortName>
                      <cmdp:departmentName xml:lang="nb">Språkbanken</cmdp:departmentName>
                      <cmdp:departmentName xml:lang="en">The Language Bank</cmdp:departmentName>
                    </cmdp:organizationInfo>
                    <cmdp:communicationInfo>
                      <cmdp:email>sprakbanken@nb.no</cmdp:email>
                      <cmdp:url>https://www.nb.no/sprakbanken/</cmdp:url>
                      <cmdp:address>P.O. Box 2674 Solli</cmdp:address>
                      <cmdp:zipCode>0203</cmdp:zipCode>
                      <cmdp:city>Oslo</cmdp:city>
                      <cmdp:region>Oslo</cmdp:region>
                      <cmdp:country>Norway</cmdp:country>
                    </cmdp:communicationInfo>
                  </cmdp:actorInfo>
                </cmdp:contact>
                <cmdp:metadataInfo>
                  <cmdp:metadataCreationDate>2023-08-18</cmdp:metadataCreationDate>
                  <cmdp:metadataLanguageName>Norwegian Bokmål</cmdp:metadataLanguageName>
                  <cmdp:metadataLanguageName>English</cmdp:metadataLanguageName>
                  <cmdp:metadataLanguageId>nb</cmdp:metadataLanguageId>
                  <cmdp:metadataLanguageId>en</cmdp:metadataLanguageId>
                  <cmdp:metadataLastDateUpdated>2023-08-18</cmdp:metadataLastDateUpdated>
                  <cmdp:metadataCreator>
                    <cmdp:actorInfo>
                      <cmdp:actorType>organization</cmdp:actorType>
                      <cmdp:role xml:lang="en">Metadata creator</cmdp:role>
                      <cmdp:organizationInfo>
                        <cmdp:organizationName xml:lang="nb">Nasjonalbiblioteket</cmdp:organizationName>
                        <cmdp:organizationName xml:lang="en">National Library of Norway</cmdp:organizationName>
                        <cmdp:organizationShortName xml:lang="nb">NB</cmdp:organizationShortName>
                        <cmdp:organizationShortName xml:lang="en">NLN</cmdp:organizationShortName>
                        <cmdp:departmentName xml:lang="nb">Språkbanken</cmdp:departmentName>
                        <cmdp:departmentName xml:lang="en">The Language Bank</cmdp:departmentName>
                      </cmdp:organizationInfo>
                      <cmdp:communicationInfo>
                        <cmdp:email>sprakbanken@nb.no</cmdp:email>
                        <cmdp:url>https://www.nb.no/sprakbanken/</cmdp:url>
                        <cmdp:address>P.O. Box 2674 Solli</cmdp:address>
                        <cmdp:zipCode>0203</cmdp:zipCode>
                        <cmdp:city>Oslo</cmdp:city>
                        <cmdp:region>Oslo</cmdp:region>
                        <cmdp:country>Norway</cmdp:country>
                      </cmdp:communicationInfo>
                    </cmdp:actorInfo>
                  </cmdp:metadataCreator>
                </cmdp:metadataInfo>
                <cmdp:versionInfo>
                  <cmdp:version>1.0</cmdp:version>
                  <cmdp:lastDateUpdated>2023-08-18</cmdp:lastDateUpdated>
                </cmdp:versionInfo>
                <cmdp:validationInfo>
                  <cmdp:validated>true</cmdp:validated>
                  <cmdp:validationType>content</cmdp:validationType>
                  <cmdp:validationMode>manual</cmdp:validationMode>
                  <cmdp:validationExtent>full</cmdp:validationExtent>
                  <cmdp:validator>
                    <cmdp:actorInfo>
                      <cmdp:actorType>organization</cmdp:actorType>
                      <cmdp:role xml:lang="en">Resource Validator</cmdp:role>
                      <cmdp:organizationInfo>
                        <cmdp:organizationName xml:lang="nb">Nasjonalbiblioteket</cmdp:organizationName>
                        <cmdp:organizationName xml:lang="en">National Library of Norway</cmdp:organizationName>
                        <cmdp:organizationShortName xml:lang="nb">NB</cmdp:organizationShortName>
                        <cmdp:organizationShortName xml:lang="en">NLN</cmdp:organizationShortName>
                        <cmdp:departmentName xml:lang="nb">Språkbanken</cmdp:departmentName>
                        <cmdp:departmentName xml:lang="en">The Language Bank</cmdp:departmentName>
                      </cmdp:organizationInfo>
                      <cmdp:communicationInfo>
                        <cmdp:email>sprakbanken@nb.no</cmdp:email>
                        <cmdp:url>https://www.nb.no/sprakbanken/</cmdp:url>
                        <cmdp:address>P.O. Box 2674 Solli</cmdp:address>
                        <cmdp:zipCode>0203</cmdp:zipCode>
                        <cmdp:city>Oslo</cmdp:city>
                        <cmdp:region>Oslo</cmdp:region>
                        <cmdp:country>Norway</cmdp:country>
                      </cmdp:communicationInfo>
                    </cmdp:actorInfo>
                  </cmdp:validator>
                </cmdp:validationInfo>
                <cmdp:resourceCreationInfo>
                  <cmdp:creationStartDate>2022-07-01</cmdp:creationStartDate>
                  <cmdp:creationEndDate>2023-08-18</cmdp:creationEndDate>
                  <cmdp:resourceCreator>
                    <cmdp:actorInfo>
                      <cmdp:actorType>organization</cmdp:actorType>
                      <cmdp:role xml:lang="en">Resource Creator</cmdp:role>
                      <cmdp:organizationInfo>
                        <cmdp:organizationName xml:lang="nb">Nasjonalbiblioteket</cmdp:organizationName>
                        <cmdp:organizationName xml:lang="en">National Library of Norway</cmdp:organizationName>
                        <cmdp:organizationShortName xml:lang="nb">NB</cmdp:organizationShortName>
                        <cmdp:organizationShortName xml:lang="en">NLN</cmdp:organizationShortName>
                        <cmdp:departmentName xml:lang="nb">Språkbanken</cmdp:departmentName>
                        <cmdp:departmentName xml:lang="en">The Language Bank</cmdp:departmentName>
                      </cmdp:organizationInfo>
                      <cmdp:communicationInfo>
                        <cmdp:email>sprakbanken@nb.no</cmdp:email>
                        <cmdp:url>https://www.nb.no/sprakbanken/</cmdp:url>
                        <cmdp:address>P.O. Box 2674 Solli</cmdp:address>
                        <cmdp:zipCode>0203</cmdp:zipCode>
                        <cmdp:city>Oslo</cmdp:city>
                        <cmdp:region>Oslo</cmdp:region>
                        <cmdp:country>Norway</cmdp:country>
                      </cmdp:communicationInfo>
                    </cmdp:actorInfo>
                  </cmdp:resourceCreator>
                </cmdp:resourceCreationInfo>
              </cmdp:resourceCommonInfo>
              <cmdp:corpusInfo>
                <cmdp:corpusType>Multimodal Corpus</cmdp:corpusType>
                <cmdp:corpusPartInfo>
                  <cmdp:mediaType>audio</cmdp:mediaType>
                  <cmdp:corpusAudioInfo>
                    <cmdp:audioSizeInfo>
                      <cmdp:sizeInfo>
                        <cmdp:size>12080</cmdp:size>
                        <cmdp:sizeUnit>files</cmdp:sizeUnit>
                      </cmdp:sizeInfo>
                      <cmdp:sizeInfo>
                        <cmdp:size>24</cmdp:size>
                        <cmdp:sizeUnit>hours</cmdp:sizeUnit>
                      </cmdp:sizeInfo>
                      <cmdp:sizeInfo>
                        <cmdp:size>3</cmdp:size>
                        <cmdp:sizeUnit>gb</cmdp:sizeUnit>
                      </cmdp:sizeInfo>
                      <cmdp:durationOfAudioInfo>
                        <cmdp:size>24</cmdp:size>
                        <cmdp:durationUnit>hours</cmdp:durationUnit>
                      </cmdp:durationOfAudioInfo>
                    </cmdp:audioSizeInfo>
                    <cmdp:audioContentInfo>
                      <cmdp:speechItems>freeSpeech</cmdp:speechItems>
                    </cmdp:audioContentInfo>
                    <cmdp:audioFormatInfo>
                      <cmdp:mimeType>audio/wav</cmdp:mimeType>
                    </cmdp:audioFormatInfo>
                  </cmdp:corpusAudioInfo>
                </cmdp:corpusPartInfo>
                <cmdp:corpusPartInfo>
                  <cmdp:mediaType>text</cmdp:mediaType>
                  <cmdp:corpusTextInfo>
                    <cmdp:textFormatInfo>
                      <cmdp:mimeType>application/json</cmdp:mimeType>
                    </cmdp:textFormatInfo>
                    <cmdp:characterEncodingInfo>
                      <cmdp:characterEncoding>UTF-8</cmdp:characterEncoding>
                    </cmdp:characterEncodingInfo>
                  </cmdp:corpusTextInfo>
                </cmdp:corpusPartInfo>
                <cmdp:corpusPartGeneralInfo>
                  <cmdp:lingualityInfo>
                    <cmdp:lingualityType>monolingual</cmdp:lingualityType>
                  </cmdp:lingualityInfo>
                  <cmdp:languageInfo>
                    <cmdp:languageId>no</cmdp:languageId>
                    <cmdp:languageName>Norwegian</cmdp:languageName>
                    <cmdp:sizePerLanguage>
                      <cmdp:sizeInfo>
                        <cmdp:size>12080</cmdp:size>
                        <cmdp:sizeUnit>files</cmdp:sizeUnit>
                      </cmdp:sizeInfo>
                      <cmdp:sizeInfo>
                        <cmdp:size>24</cmdp:size>
                        <cmdp:sizeUnit>hours</cmdp:sizeUnit>
                      </cmdp:sizeInfo>
                    </cmdp:sizePerLanguage>
                    <cmdp:languageVarietyInfo>
                      <cmdp:languageVarietyType>dialect</cmdp:languageVarietyType>
                      <cmdp:languageVarietyName>Norwegian dialects</cmdp:languageVarietyName>
                    </cmdp:languageVarietyInfo>
                  </cmdp:languageInfo>
                  <cmdp:modalityInfo>
                    <cmdp:modalityType>spokenLanguage</cmdp:modalityType>
                  </cmdp:modalityInfo>
                  <cmdp:annotationInfo>
                    <cmdp:annotationType>speechAnnotation-orthographicTranscription</cmdp:annotationType>
                    <cmdp:segmentationLevel>word</cmdp:segmentationLevel>
                    <cmdp:annotationMode>mixed</cmdp:annotationMode>
                    <cmdp:annotationModeDetails>Automatic annotation followed by manual correction and proofreading by two linguists.</cmdp:annotationModeDetails>
                    <cmdp:annotationStartDate>2022-07-01</cmdp:annotationStartDate>
                    <cmdp:annotationEndDate>2023-08-18</cmdp:annotationEndDate>
                  </cmdp:annotationInfo>
                </cmdp:corpusPartGeneralInfo>
              </cmdp:corpusInfo>
            </cmdp:corpusProfile>
          </cmd:Components>
        </cmd:CMD>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>