{"id":35318,"date":"2025-02-10T10:49:05","date_gmt":"2025-02-10T09:49:05","guid":{"rendered":"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/"},"modified":"2025-02-10T11:05:26","modified_gmt":"2025-02-10T10:05:26","slug":"oai-clarino-uib-no-lb-2022052002","status":"publish","type":"language-resource","link":"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/","title":{"rendered":""},"content":{"rendered":"<p><?xml version='1.0' encoding='utf-8'?><br \/>\n<record><\/p>\n<header><identifier>oai:clarino.uib.no:lb-2022052002<\/identifier><datestamp>2024-12-04T12:58:22Z<\/datestamp><setSpec>FIN-CLARIN<\/setSpec><\/header>\n<p><metadata><CMD xmlns=\"http:\/\/www.clarin.eu\/cmd\/\"><br \/>\n<Header><br \/>\n<MdCreator>metashareToCmdi.xsl remove_metashare_namespace.xsl<\/MdCreator><br \/>\n<MdCreationDate>2024-03-27<\/MdCreationDate><br \/>\n<MdSelfLink>urn:nbn:fi:lb-2022052002<\/MdSelfLink><br \/>\n<MdProfile>clarin.eu:cr1:p_1361876010571<\/MdProfile><\/p>\n<p><\/Header><Resources><ResourceProxyList><ResourceProxy id=\"_1\"><ResourceType mimetype=\"\">Resource<\/ResourceType><ResourceRef>http:\/\/urn.fi\/urn:nbn:fi:lb-2022052003<\/ResourceRef><\/ResourceProxy><br \/>\n<\/ResourceProxyList><JournalFileProxyList \/><br \/>\n<ResourceRelationList \/><br \/>\n<IsPartOfList \/><\/Resources><br \/>\n<Components><br \/>\n<resourceInfo><br \/>\n    <identificationInfo ComponentId=\"clarin.eu:cr1:c_1349361150743\"><br \/>\n        <resourceName xml:lang=\"fi\">Aallon puheentunnistuskorpus eduskunnan istunnoista 2008-2020, versio 2<\/resourceName><br \/>\n        <resourceName xml:lang=\"en\">Aalto Finnish Parliament ASR Corpus 2008-2020, version 2<\/resourceName><br \/>\n        <description xml:lang=\"fi\">T\u00e4m\u00e4 aineisto tulee saataville Kielipankin latauspalveluun, ks. Access location.<\/description><br \/>\n        <description xml:lang=\"en\"># Aalto Finnish Parliament ASR Corpus 2008-2020, version 2<\/p>\n<p>Short name: `fi-parliament-asr-v2`<br \/>\nPersistent Identifier of this resource: http:\/\/urn.fi\/urn:nbn:fi:lb-2022052002<\/p>\n<p>This corpus is extracted from the Finnish parliament plenary session transcripts and videos by the<br \/>\nAalto Speech Recognition group. The original session transcripts and videos are available at the web<br \/>\nportals of the Parliament of Finland (avoindata.eduskunta.fi and verkkolahetys.eduskunta.fi). The<br \/>\ncorpus is split into three parts:<br \/>\n 1. 2015-2020 set<br \/>\n 2. 2008-2016 set<br \/>\n 3. Development and test sets<\/p>\n<p>A non-overlapping combination of the 2008-2016 set and the 2015-2020 set form a training set of size:<br \/>\n &#8211; 1 422 318 sample pairs<br \/>\n &#8211; 3 130 hours of speech<br \/>\n &#8211; 19 356 831 word tokens<\/p>\n<p>All audio files in this corpus are single-channel wavs with sample rate 16 kHz and 16-bit precision.<br \/>\nThe transcript files (.trn) are plain text files.<\/p>\n<p>See this github repository for data preparation and baseline models using the Kaldi toolkit:<br \/>\nhttps:\/\/github.com\/aalto-speech\/fin-parl-models<\/p>\n<p>&#8212;<\/p>\n<p>## 2015-2020 set<\/p>\n<p>This subset is extracted from the Finnish parliament plenary session transcripts and videos by the<br \/>\nAalto Speech Recognition group in 2021.<\/p>\n<p>The tools and code used to produce this subset:<br \/>\n &#8211; Preprocessing and postprocessing: https:\/\/github.com\/aalto-speech\/fi-parliament-tools<br \/>\n &#8211; Decoding and segmentation: Kaldi, https:\/\/github.com\/kaldi-asr\/kaldi<\/p>\n<p>### Data<\/p>\n<p>This subset contains samples of speech (.wav) and their corresponding transcripts (.trn) from sessions<br \/>\nbetween 1\/2015 and 104\/2020. Few sessions that had broken or empty session transcript are left out,<br \/>\nso the session range has some gaps. Samples are grouped by session. Each filename is formed from the<br \/>\nfollowing components:<\/p>\n<p>&gt; Filename (Kaldi-compatible utterance id): &lt;mpid&gt;-&lt;session_number&gt;-&lt;session_year&gt;-&lt;startsec&gt;-&lt;endsec&gt;<br \/>\n&gt; e.g.: 00259-001-2015-00186868-00187044<\/p>\n<p>Further details:<\/p>\n<p>|     Component    |                                                              Definition                                                      |<br \/>\n|:&#8212;&#8212;&#8212;&#8212;&#8212;-:|:&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-:|<br \/>\n| &lt;mpid&gt;           |              The unique Member of Parliament identifier given to the MPs in the parliament&#8217;s public databases.               |<br \/>\n| &lt;session_number&gt; |      A running number given to the plenary session which together with the working year uniquely identifies the session.     |<br \/>\n| &lt;session_year&gt;   |       The parliamentary working year of the session. In election years, the working year differs from the calendar year.     |<br \/>\n| &lt;startsec&gt;       | The start timestamp of the segment in the full plenary session audio. Format is seconds + two decimals, 00186868 = 1868.68 s |<br \/>\n| &lt;endsec&gt;         |                  Like start timestamp, this marks the end timestamp of the segment in the original audio.                    |<\/p>\n<p>This subset is machine-extracted so there remains some inaccuracies in the samples. The audio quality<br \/>\nalso varies.<\/p>\n<p>### Statistics<\/p>\n<p>In total, there are:<br \/>\n &#8211; 984 676 sample pairs<br \/>\n &#8211; 1 780 hours of speech<br \/>\n &#8211; 11 234 724 word tokens<\/p>\n<p>### Text data<\/p>\n<p>This subset comes with a 10 million word token in-domain text corpus in the file<br \/>\n`parl-full-transcripts-78-2016-104-2020.train`. This 10 million token text corpus can be combined<br \/>\nwith the 20 million token text corpus that comes with the 2008-2016 set to form a 30 million token<br \/>\ntext corpus.<\/p>\n<p>### Note about MPIDs<\/p>\n<p>There is one speaker in this subset that is not an MP, Risto Hiekkataipale (MPID: 00002). His MPID<br \/>\nis arbitrary. The 2015-2020 set and 2008-2016 set use different speaker IDs. A mapping is provided in<br \/>\n`speaker-id-mapping.csv`.<\/p>\n<p>&#8212;<\/p>\n<p>## 2008-2016 set<\/p>\n<p>This subset is extracted from the Finnish parliament plenary session transcripts and videos by the<br \/>\nAalto Speech Recognition group in 2017.<\/p>\n<p>Code used to produce this subset:<br \/>\n &#8211; https:\/\/github.com\/aalto-speech\/finnish-parliament-scripts<\/p>\n<p>### Data<\/p>\n<p>This subset contains samples of speech (.wav) and their corresponding transcripts (.trn) from sessions<br \/>\nbetween 71\/2008 and 77\/2016. A list of samples from sessions held in 2008-2014, that do not overlap<br \/>\nwith samples in the 2015-2020 set, is provided in `2008-2014-samples.list`. Samples are grouped by speaker.<br \/>\nEach filepath is formed from the following components:<\/p>\n<p>&gt; Utterance id: &lt;speaker-id&gt;\/&lt;speaker-name&gt;_&lt;sample-id&gt;<br \/>\n&gt; e.g.: 0004\/aila_paloniemi_00045.wav<\/p>\n<p>Further details:<\/p>\n<p>|   Component    |                   Definition                    |<br \/>\n|:&#8212;&#8212;&#8212;&#8212;&#8211;:|:&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;:|<br \/>\n| &lt;speaker-id&gt;   |      A number identifier for the speaker.       |<br \/>\n| &lt;speaker-name&gt; |  Speaker&#8217;s name in &laquo;firstname_lastname&raquo; order.  |<br \/>\n| &lt;sample-id&gt;    |   A number identifier assigned to each sample.  |<\/p>\n<p>This subset is machine-extracted so there remains some inaccuracies in the samples. The audio quality<br \/>\nalso varies. A mapping to the 2015-2020 set MP IDs is provided in `speaker-id-mapping.csv`.<\/p>\n<p>### Splits<\/p>\n<p>The paper &laquo;Automatic Construction of the Finnish Parliament Speech Corpus&raquo; by Mansikkaniemi et al.<br \/>\n(see citation) uses training splits which are defined in the following files:<br \/>\n &#8211; `parl-all.train.list`<br \/>\n &#8211; `parl-400.train.list`<br \/>\n &#8211; `parl-60min.train.list`<br \/>\n &#8211; `parl-30min.train.list`<\/p>\n<p>### Additional files<\/p>\n<p>There are two additional files provided with the 2008-2016 set:<\/p>\n<p>1. `dropped_duplicates.list` &#8211; There are some utterances in the raw dataset that have overlapping<br \/>\n   utterance id. This file indicates which duplicates were dropped in the paper by Mansikkaniemi et al.<br \/>\n   (see citation). The `local\/data_prep.sh` script in the Github repository https:\/\/github.com\/aalto-speech\/fin-parl-models<br \/>\n   can recreate the Kaldi input files for the 2008-2016 set used in Mansikkaniemi et al.<br \/>\n2. `utt2year` &#8211; This file maps utterance ids to the year they were spoken. This file is compatible<br \/>\n   with the Kaldi input files created by the script `local\/data_prep.sh` mentioned above.<\/p>\n<p>### Statistics<\/p>\n<p>In total, there are:<br \/>\n &#8211; 522 543 sample pairs<br \/>\n &#8211; 1560 hours of speech<br \/>\n &#8211; 9 743 296 word tokens (in .trn files)<\/p>\n<p>In the 2008-2014 subset, there are:<br \/>\n &#8211; 437 642 sample pairs<br \/>\n &#8211; 1 350 hours of speech<br \/>\n &#8211; 8 122 107 word tokens (in .trn files)<\/p>\n<p>### Text data<\/p>\n<p>This subset comes with a 20 million word token in-domain text corpus in the file `parl-transcripts.train`.<br \/>\nThe text corpus is extracted from the 2008-2016 session transcripts.<\/p>\n<p>&#8212;<\/p>\n<p>## Development and test sets<\/p>\n<p>This subset contains the dev and test sets for Finnish Parliament ASR corpus. There are three sets:<\/p>\n<p>  1. 2016-dev<br \/>\n  2. 2016-test<br \/>\n  3. 2020-test<\/p>\n<p>The 2016 sets have been created with the same tools as the 2008-2016 train set. Similarly, the<br \/>\n2020 test set and 2015-2020 train set have been created with the same pipeline. Each dev and test<br \/>\nset has been cleaned and corrected by hand.<\/p>\n<p>### Data<\/p>\n<p>The 2016 sets contain samples of speech (.wav) and their corresponding transcripts (.trn) from the<br \/>\nsame sessions as the 2008-2016 train set. The samples are split to seen and unseen speakers. Read<br \/>\nmore about the seen\/unseen split in the paper &laquo;Automatic Construction of the Finnish Parliament<br \/>\nSpeech Corpus&raquo; by Mansikkaniemi et al. (see citation below). Each filename is formed from the<br \/>\nfollowing components:<\/p>\n<p>&gt; Utterance id: &lt;speaker-name&gt;_&lt;sample-id&gt;<br \/>\n&gt; e.g.: anne_mari_virolainen_04297.wav<\/p>\n<p>Further details:<\/p>\n<p>|   Component    |                   Definition                    |<br \/>\n|:&#8212;&#8212;&#8212;&#8212;&#8211;:|:&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;:|<br \/>\n| &lt;speaker-name&gt; |  Speaker&#8217;s name in &laquo;firstname_lastname&raquo; order.  |<br \/>\n| &lt;sample-id&gt;    |   A number identifier assigned to each sample.  |<\/p>\n<p>A mapping that connects `&lt;speaker-name&gt;` to the speaker IDs used in training sets 2008-2016 and<br \/>\n2015-2020 is provided in `dev-test-speakers.csv`.<\/p>\n<p>The 2020 test set has been created from the sessions held in autumn 2020, ranging from 105\/2020<br \/>\nto 170\/2020. The data is in the same format as the 2015-2020 train set.<\/p>\n<p>### Splits<\/p>\n<p>The paper &laquo;Automatic Construction of the Finnish Parliament Speech Corpus&raquo; uses seen and unseen<br \/>\nspeaker splits for the 2016 dev and test sets. These splits are defined in the files (subset<br \/>\nduration, HH:MM:SS, in parentheses):<br \/>\n &#8211; `seen_dev.list` (2:36:51)<br \/>\n &#8211; `seen_test.list` (2:53:51)<br \/>\n &#8211; `unseen_dev.list` (2:45:27)<br \/>\n &#8211; `unseen_test.list` (2:48:20)<\/p>\n<p>More details in the paper.<\/p>\n<p>&#8212;<\/p>\n<p>## Citation<\/p>\n<p>The 2008-2016 set and dev-test set are detailed in the following publication:<\/p>\n<p>&laquo;`<br \/>\n@conference{Aaltodoc:http:\/\/urn.fi\/URN:NBN:fi:aalto-201710157137,<br \/>\ntitle={Automatic Construction of the Finnish Parliament Speech Corpus},<br \/>\nauthor={Mansikkaniemi, Andre; Smit, Peter; Kurimo, Mikko},<br \/>\nyear={2017-08},<br \/>\nlanguage={en},<br \/>\npages={3762-3766},<br \/>\nkeyword={automatic speech recognition; speech-to-text alignment; DNN acoustic models; parliament speech dat; transcribed speech corpus},<br \/>\nseries={Interspeech 2017},<br \/>\ndoi={10.21437\/Interspeech.2017-1115},<br \/>\nurl={http:\/\/urn.fi\/URN:NBN:fi:aalto-201710157137},<br \/>\n}<br \/>\n&laquo;`<\/p>\n<p>&#8212;<\/p>\n<p>## License<\/p>\n<p>See the `LICENSE.md` file.<\/p>\n<p>&#8212;<\/p>\n<p>## Contact<\/p>\n<p>Authors: Anja Virkkunen, Andr\u00e9 Mansikkaniemi, and Mikko Kurimo of the Aalto Speech Recognition Group<br \/>\nContact via kielipankki@csc.fi<\/description><br \/>\n        <resourceShortName xml:lang=\"en\">fi-parliament-asr-v2<\/resourceShortName><br \/>\n        <url>http:\/\/urn.fi\/urn:nbn:fi:lb-2022052003<\/url><br \/>\n        <metaShareId>NOT_DEFINED_FOR_V2<\/metaShareId><br \/>\n        <identifier>http:\/\/urn.fi\/urn:nbn:fi:lb-2022052002<\/identifier><br \/>\n    <\/identificationInfo><br \/>\n    <distributionInfo ComponentId=\"clarin.eu:cr1:c_1352813745459\"><br \/>\n        <availability>available-unrestrictedUse<\/availability><br \/>\n        <availabilityStartDate>2022-06-10<\/availabilityStartDate>\n        <licenceInfo ComponentId=\"clarin.eu:cr1:c_1352813745464\">\n            <licence>CLARIN_PUB<\/licence>\n            <restrictionsOfUse>attribution<\/restrictionsOfUse><br \/>\n            <distributionAccessMedium>downloadable<\/distributionAccessMedium>\n        <\/licenceInfo>\n        <licenceInfo ComponentId=\"clarin.eu:cr1:c_1352813745464\">\n            <licence>CLARIN_PUB<\/licence>\n            <restrictionsOfUse>attribution<\/restrictionsOfUse><br \/>\n            <restrictionsOfUse>noDerivatives<\/restrictionsOfUse><br \/>\n            <restrictionsOfUse>other<\/restrictionsOfUse><br \/>\n            <distributionAccessMedium>downloadable<\/distributionAccessMedium>\n            <licensorOrganization ComponentId=\"clarin.eu:cr1:c_1361876010643\">\n                <role>licensor<\/role><br \/>\n                <organizationInfo ComponentId=\"clarin.eu:cr1:c_1352813745461\"><br \/>\n                    <organizationName xml:lang=\"en\">FIN-CLARIN<\/organizationName><br \/>\n                    <organizationShortName xml:lang=\"en\">FIN-CLARIN<\/organizationShortName><br \/>\n                    <departmentName xml:lang=\"en\">University of Helsinki<\/departmentName><br \/>\n                    <communicationInfo ComponentId=\"clarin.eu:cr1:c_1352813745460\"><br \/>\n                        <email>finclarin@helsinki.fi<\/email><br \/>\n                        <url>http:\/\/www.helsinki.fi\/fin-clarin<\/url><\/p>\n<address>PO Box 24 (Unioninkatu 40)<\/address>\n<p>                        <zipCode>00014<\/zipCode><br \/>\n                        <city>University of Helsinki<\/city><br \/>\n                        <country>Finland<\/country><br \/>\n                    <\/communicationInfo><br \/>\n                <\/organizationInfo>\n            <\/licensorOrganization>\n            <licensorOrganization ComponentId=\"clarin.eu:cr1:c_1361876010643\">\n                <role>licensor<\/role><br \/>\n                <organizationInfo ComponentId=\"clarin.eu:cr1:c_1352813745461\"><br \/>\n                    <organizationName xml:lang=\"en\">The Parliament of Finland<\/organizationName><br \/>\n                    <communicationInfo ComponentId=\"clarin.eu:cr1:c_1352813745460\"><br \/>\n                        <email>firstname.surname@eduskunta.fi<\/email><br \/>\n                        <url>http:\/\/web.eduskunta.fi\/Resource.phx\/parliament\/index.htx?lng=en<\/url><br \/>\n                        <city>Helsinki<\/city><br \/>\n                        <country>Finland<\/country><br \/>\n                    <\/communicationInfo><br \/>\n                <\/organizationInfo>\n            <\/licensorOrganization>\n            <distributionRightsHolderOrganization ComponentId=\"clarin.eu:cr1:c_1361876010640\"><br \/>\n                <role>distributionRightsHolder<\/role><br \/>\n                <organizationInfo ComponentId=\"clarin.eu:cr1:c_1352813745461\"><br \/>\n                    <organizationName xml:lang=\"en\">FIN-CLARIN<\/organizationName><br \/>\n                    <organizationShortName xml:lang=\"en\">FIN-CLARIN<\/organizationShortName><br \/>\n                    <departmentName xml:lang=\"en\">University of Helsinki<\/departmentName><br \/>\n                    <communicationInfo ComponentId=\"clarin.eu:cr1:c_1352813745460\"><br \/>\n                        <email>finclarin@helsinki.fi<\/email><br \/>\n                        <url>http:\/\/www.helsinki.fi\/fin-clarin<\/url><\/p>\n<address>PO Box 24 (Unioninkatu 40)<\/address>\n<p>                        <zipCode>00014<\/zipCode><br \/>\n                        <city>University of Helsinki<\/city><br \/>\n                        <country>Finland<\/country><br \/>\n                    <\/communicationInfo><br \/>\n                <\/organizationInfo><br \/>\n            <\/distributionRightsHolderOrganization>\n        <\/licenceInfo>\n        <iprHolderOrganization ComponentId=\"clarin.eu:cr1:c_1361876010642\"><br \/>\n            <role>iprHolder<\/role><br \/>\n            <organizationInfo ComponentId=\"clarin.eu:cr1:c_1352813745461\"><br \/>\n                <organizationName xml:lang=\"en\">The Parliament of Finland<\/organizationName><br \/>\n                <communicationInfo ComponentId=\"clarin.eu:cr1:c_1352813745460\"><br \/>\n                    <email>firstname.surname@eduskunta.fi<\/email><br \/>\n                    <url>http:\/\/web.eduskunta.fi\/Resource.phx\/parliament\/index.htx?lng=en<\/url><br \/>\n                    <city>Helsinki<\/city><br \/>\n                    <country>Finland<\/country><br \/>\n                <\/communicationInfo><br \/>\n            <\/organizationInfo><br \/>\n        <\/iprHolderOrganization><br \/>\n    <\/distributionInfo><br \/>\n    <contactPerson ComponentId=\"clarin.eu:cr1:c_1352813745468\"><br \/>\n        <role>contactPerson<\/role>\n        <personInfo ComponentId=\"clarin.eu:cr1:c_1349361150746\">\n            <surname xml:lang=\"en\">The Language Bank of Finland<\/surname><br \/>\n            <givenName xml:lang=\"en\">User support at CSC &#8211; IT Center for Science Ltd.<\/givenName><br \/>\n            <sex>unknown<\/sex><br \/>\n            <communicationInfo ComponentId=\"clarin.eu:cr1:c_1352813745460\"><br \/>\n                <email>kielipankki@csc.fi<\/email><br \/>\n            <\/communicationInfo><br \/>\n            <affiliation ComponentId=\"clarin.eu:cr1:c_1352813745462\"><br \/>\n                <role>affiliation<\/role><br \/>\n                <organizationInfo ComponentId=\"clarin.eu:cr1:c_1352813745461\"><br \/>\n                    <organizationName xml:lang=\"fi\">CSC &#8211; Tieteen tietotekniikan keskus Oy<\/organizationName><br \/>\n                    <organizationName xml:lang=\"en\">CSC \u2014 IT Center for Science Ltd<\/organizationName><br \/>\n                    <organizationShortName xml:lang=\"en\">CSC<\/organizationShortName><br \/>\n                    <departmentName xml:lang=\"fi\">Kielipankki<\/departmentName><br \/>\n                    <departmentName xml:lang=\"en\">Language Bank of Finland<\/departmentName><br \/>\n                    <communicationInfo ComponentId=\"clarin.eu:cr1:c_1352813745460\"><br \/>\n                        <email>kielipankki@csc.fi<\/email><br \/>\n                        <url>http:\/\/www.csc.fi\/english<\/url><\/p>\n<address>P.O. Box 405<\/address>\n<p>                        <zipCode>FI-02101<\/zipCode><br \/>\n                        <city>Espoo<\/city><br \/>\n                        <country>Finland<\/country><br \/>\n                        <telephoneNumber>+358 (0)9 457 2001<\/telephoneNumber><br \/>\n                        <faxNumber>+358 (0)9 457 2302<\/faxNumber><br \/>\n                    <\/communicationInfo><br \/>\n                <\/organizationInfo><br \/>\n            <\/affiliation>\n        <\/personInfo>\n    <\/contactPerson><br \/>\n    <metadataInfo ComponentId=\"clarin.eu:cr1:c_1349361150745\"><br \/>\n        <metadataCreationDate>2022-05-18<\/metadataCreationDate><br \/>\n        <metadataLastDateUpdated>2024-03-26<\/metadataLastDateUpdated><br \/>\n    <\/metadataInfo><br \/>\n    <resourceDocumentationInfo ComponentId=\"clarin.eu:cr1:c_1355150532301\"><br \/>\n        <documentationUnstructured ComponentId=\"clarin.eu:cr1:c_1355150532302\"><br \/>\n            <role>documentation<\/role><br \/>\n            <documentUnstructured>Resource group page: http:\/\/urn.fi\/urn:nbn:fi:lb-2021081105<\/documentUnstructured><br \/>\n        <\/documentationUnstructured><br \/>\n        <documentationUnstructured ComponentId=\"clarin.eu:cr1:c_1355150532302\"><br \/>\n            <role>documentation<\/role><br \/>\n            <documentUnstructured>How to cite: https:\/\/www.kielipankki.fi\/viittaus\/?key=urn:nbn:fi:lb-2022052002&amp;lang=en<\/documentUnstructured><br \/>\n        <\/documentationUnstructured><br \/>\n        <documentationStructured ComponentId=\"clarin.eu:cr1:c_1361876010648\"><br \/>\n            <role>documentation<\/role><br \/>\n            <documentInfo ComponentId=\"clarin.eu:cr1:c_1353678848788\"><br \/>\n                <documentType>other<\/documentType><br \/>\n                <title xml:lang=\"en\">License (eduskunta, audio and video)<\/title><br \/>\n                <editor>FIN-CLARIN<\/editor><br \/>\n                <url>http:\/\/urn.fi\/urn:nbn:fi:lb-2019112822<\/url><br \/>\n                <documentLanguageName>English<\/documentLanguageName><br \/>\n                <documentLanguageId>en<\/documentLanguageId><br \/>\n            <\/documentInfo><br \/>\n        <\/documentationStructured><br \/>\n        <documentationStructured ComponentId=\"clarin.eu:cr1:c_1361876010648\"><br \/>\n            <role>documentation<\/role><br \/>\n            <documentInfo ComponentId=\"clarin.eu:cr1:c_1353678848788\"><br \/>\n                <documentType>other<\/documentType><br \/>\n                <title xml:lang=\"fi\">Lisenssi (eduskunta, \u00e4\u00e4ni ja video)<\/title><br \/>\n                <editor>FIN-CLARIN<\/editor><br \/>\n                <url>http:\/\/urn.fi\/urn:nbn:fi:lb-2019112621<\/url><br \/>\n                <documentLanguageName>Finnish<\/documentLanguageName><br \/>\n                <documentLanguageId>fi<\/documentLanguageId><br \/>\n            <\/documentInfo><br \/>\n        <\/documentationStructured><br \/>\n        <documentationStructured ComponentId=\"clarin.eu:cr1:c_1361876010648\"><br \/>\n            <role>documentation<\/role><br \/>\n            <documentInfo ComponentId=\"clarin.eu:cr1:c_1353678848788\"><br \/>\n                <documentType>other<\/documentType><br \/>\n                <title xml:lang=\"en\">License (eduskunta, text)<\/title><br \/>\n                <editor>FIN-CLARIN<\/editor><br \/>\n                <url>http:\/\/urn.fi\/urn:nbn:fi:lb-2019112823<\/url><br \/>\n                <documentLanguageName>English<\/documentLanguageName><br \/>\n                <documentLanguageId>en<\/documentLanguageId><br \/>\n            <\/documentInfo><br \/>\n        <\/documentationStructured><br \/>\n        <documentationStructured ComponentId=\"clarin.eu:cr1:c_1361876010648\"><br \/>\n            <role>documentation<\/role><br \/>\n            <documentInfo ComponentId=\"clarin.eu:cr1:c_1353678848788\"><br \/>\n                <documentType>other<\/documentType><br \/>\n                <title xml:lang=\"fi\">Lisenssi (eduskunta, teksti)<\/title><br \/>\n                <editor>FIN-CLARIN<\/editor><br \/>\n                <url>http:\/\/urn.fi\/urn:nbn:fi:lb-2019112821<\/url><br \/>\n                <documentLanguageName>Finnish<\/documentLanguageName><br \/>\n                <documentLanguageId>fi<\/documentLanguageId><br \/>\n            <\/documentInfo><br \/>\n        <\/documentationStructured><br \/>\n    <\/resourceDocumentationInfo><br \/>\n    <relationInfo ComponentId=\"clarin.eu:cr1:c_1355150532307\"><br \/>\n        <relationType>IsNewVersionOf<\/relationType><br \/>\n        <relatedResource ComponentId=\"clarin.eu:cr1:c_1355150532308\"><br \/>\n            <targetResourceNameURI>Aalto Finnish Parliament ASR Corpus 2008-2020, http:\/\/urn.fi\/urn:nbn:fi:lb-2021051903<\/targetResourceNameURI><br \/>\n        <\/relatedResource><br \/>\n    <\/relationInfo><br \/>\n    <corpusInfo ComponentId=\"clarin.eu:cr1:c_1355150532309\"><br \/>\n        <resourceType>corpus<\/resourceType><br \/>\n        <corpusMediaType ComponentId=\"clarin.eu:cr1:c_1355150532310\"><br \/>\n            <corpusTextInfo ComponentId=\"clarin.eu:cr1:c_1355150532311\"><br \/>\n                <mediaType>text<\/mediaType>\n                <lingualityInfo ComponentId=\"clarin.eu:cr1:c_1355150532313\">\n                    <lingualityType>monolingual<\/lingualityType>\n                <\/lingualityInfo>\n                <languageInfo ComponentId=\"clarin.eu:cr1:c_1355150532314\"><br \/>\n                    <languageId>fi<\/languageId><br \/>\n                    <languageName>Finnish<\/languageName><br \/>\n                <\/languageInfo><br \/>\n                <sizeInfo ComponentId=\"clarin.eu:cr1:c_1353678848785\"><br \/>\n                    <size>1422318<\/size><br \/>\n                    <sizeUnit>items<\/sizeUnit><br \/>\n                <\/sizeInfo><br \/>\n                <sizeInfo ComponentId=\"clarin.eu:cr1:c_1353678848785\"><br \/>\n                    <size>19356831<\/size><br \/>\n                    <sizeUnit>words<\/sizeUnit><br \/>\n                <\/sizeInfo><br \/>\n            <\/corpusTextInfo><br \/>\n            <corpusAudioInfo ComponentId=\"clarin.eu:cr1:c_1360230992157\"><br \/>\n                <mediaType>audio<\/mediaType>\n                <lingualityInfo ComponentId=\"clarin.eu:cr1:c_1355150532313\">\n                    <lingualityType>monolingual<\/lingualityType>\n                <\/lingualityInfo>\n                <languageInfo ComponentId=\"clarin.eu:cr1:c_1355150532314\"><br \/>\n                    <languageId>fi<\/languageId><br \/>\n                    <languageName>Finnish<\/languageName><br \/>\n                <\/languageInfo><br \/>\n                <audioSizeInfo ComponentId=\"clarin.eu:cr1:c_1360230992160\"><br \/>\n                    <sizeInfo ComponentId=\"clarin.eu:cr1:c_1353678848785\"><br \/>\n                        <size>3130<\/size><br \/>\n                        <sizeUnit>hours<\/sizeUnit><br \/>\n                    <\/sizeInfo><br \/>\n                    <durationOfEffectiveSpeechInfo ComponentId=\"clarin.eu:cr1:c_1360230992158\"><br \/>\n                        <size>3130<\/size><br \/>\n                        <durationUnit>hours<\/durationUnit><br \/>\n                    <\/durationOfEffectiveSpeechInfo><br \/>\n                    <durationOfAudioInfo ComponentId=\"clarin.eu:cr1:c_1360230992159\"><br \/>\n                        <size>3130<\/size><br \/>\n                        <durationUnit>hours<\/durationUnit><br \/>\n                    <\/durationOfAudioInfo><br \/>\n                <\/audioSizeInfo><br \/>\n            <\/corpusAudioInfo><br \/>\n        <\/corpusMediaType><br \/>\n    <\/corpusInfo><br \/>\n<\/resourceInfo><\/Components><\/CMD><\/metadata><\/record><\/p>\n","protected":false},"template":"","categories":[],"tags":[],"language-resource-type":[],"language-resource-origin":[7558],"class_list":["post-35318","language-resource","type-language-resource","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.1 (Yoast SEO v27.1.1) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>- Spr\u00e5kbanken<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/\" \/>\n<meta property=\"og:locale\" content=\"nb_NO\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/\" \/>\n<meta property=\"og:site_name\" content=\"Spr\u00e5kbanken\" \/>\n<meta property=\"article:modified_time\" content=\"2025-02-10T10:05:26+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Ansl. lesetid\" \/>\n\t<meta name=\"twitter:data1\" content=\"8 minutter\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/\",\"url\":\"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/\",\"name\":\"- Spr\u00e5kbanken\",\"isPartOf\":{\"@id\":\"https:\/\/www.nb.no\/sprakbanken\/#website\"},\"datePublished\":\"2025-02-10T09:49:05+00:00\",\"dateModified\":\"2025-02-10T10:05:26+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/#breadcrumb\"},\"inLanguage\":\"nb-NO\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.nb.no\/sprakbanken\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Resources from the resource bank\",\"item\":\"https:\/\/www.nb.no\/sprakbanken\/en\/resource-catalogue\/\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.nb.no\/sprakbanken\/#website\",\"url\":\"https:\/\/www.nb.no\/sprakbanken\/\",\"name\":\"Spr\u00e5kbanken\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.nb.no\/sprakbanken\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"nb-NO\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"- Spr\u00e5kbanken","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/","og_locale":"nb_NO","og_type":"article","og_url":"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/","og_site_name":"Spr\u00e5kbanken","article_modified_time":"2025-02-10T10:05:26+00:00","twitter_card":"summary_large_image","twitter_misc":{"Ansl. lesetid":"8 minutter"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/","url":"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/","name":"- Spr\u00e5kbanken","isPartOf":{"@id":"https:\/\/www.nb.no\/sprakbanken\/#website"},"datePublished":"2025-02-10T09:49:05+00:00","dateModified":"2025-02-10T10:05:26+00:00","breadcrumb":{"@id":"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/#breadcrumb"},"inLanguage":"nb-NO","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.nb.no\/sprakbanken\/ressurskatalog\/oai-clarino-uib-no-lb-2022052002\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.nb.no\/sprakbanken\/"},{"@type":"ListItem","position":2,"name":"Resources from the resource bank","item":"https:\/\/www.nb.no\/sprakbanken\/en\/resource-catalogue\/"}]},{"@type":"WebSite","@id":"https:\/\/www.nb.no\/sprakbanken\/#website","url":"https:\/\/www.nb.no\/sprakbanken\/","name":"Spr\u00e5kbanken","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.nb.no\/sprakbanken\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"nb-NO"}]}},"lang":"nb","translations":{"nb":35318,"en":35321},"pll_sync_post":[],"_links":{"self":[{"href":"https:\/\/www.nb.no\/sprakbanken\/wp-json\/wp\/v2\/language-resource\/35318","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.nb.no\/sprakbanken\/wp-json\/wp\/v2\/language-resource"}],"about":[{"href":"https:\/\/www.nb.no\/sprakbanken\/wp-json\/wp\/v2\/types\/language-resource"}],"wp:attachment":[{"href":"https:\/\/www.nb.no\/sprakbanken\/wp-json\/wp\/v2\/media?parent=35318"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.nb.no\/sprakbanken\/wp-json\/wp\/v2\/categories?post=35318"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.nb.no\/sprakbanken\/wp-json\/wp\/v2\/tags?post=35318"},{"taxonomy":"language-resource-type","embeddable":true,"href":"https:\/\/www.nb.no\/sprakbanken\/wp-json\/wp\/v2\/language-resource-type?post=35318"},{"taxonomy":"language-resource-origin","embeddable":true,"href":"https:\/\/www.nb.no\/sprakbanken\/wp-json\/wp\/v2\/language-resource-origin?post=35318"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}