PubNEPC is a Nynorsk-English sentence-aligned parallel corpus built from the public web sites www.nav.no and skatteetaten.no.
The corpus contains only those sentences that have a corresponding translation.
Extended metadata
resource Common Info
resource Type: corpus
identification Info
resource Name: Public Nynorsk-English Parallel Corpus (PubNEPC)
description: PubNEPC is a Nynorsk-English sentence-aligned parallel corpus built from the public web sites www.nav.no and skatteetaten.no.
The corpus contains only those sentences that have a corresponding translation.
document Unstructured: The resource is documented at the corpus website http://clarino.uib.no/korpuskel.
corpus Info
corpus Type: Written Corpus
corpus Part Info
media Type: text
corpus Text Info
text Format Info
mime Type: application/xml
corpus Part General Info
language Info
  language Id: nno
  language Name: Norwegian Nynorsk
  size Per Language
    size Info
      size: 21056
      size Unit: sentences
    size Info
      size: 289722
      size Unit: tokens
language Info
  language Id: en
  language Name: English
  size Per Language
    size Info
      size: 20998
      size Unit: sentences
    size Info
      size: 353837
      size Unit: tokens
dc:type: corpus
dc:title: Public Nynorsk-English Parallel Corpus (PubNEPC)
dc:identifier: oai:clarino.uib.no:parallel-nno
dc:description: PubNEPC is a Nynorsk-English sentence-aligned parallel corpus built from the public web sites www.nav.no and skatteetaten.no.
The corpus contains only those sentences that have a corresponding translation.
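
The per-language size figures in this record can be turned into a rough mean sentence length. The following is a minimal sketch, not part of the corpus distribution: the `SIZES` dictionary and the function name `mean_tokens_per_sentence` are hypothetical and introduced only for illustration, and the numbers are copied from the size entries of the record.

```python
# Size figures copied from the metadata record (sentences and tokens per language).
# The dictionary layout is an assumption made for this illustration.
SIZES = {
    "nno": {"sentences": 21056, "tokens": 289722},
    "en": {"sentences": 20998, "tokens": 353837},
}


def mean_tokens_per_sentence(sizes):
    """Return the mean sentence length in tokens for each language id."""
    return {
        lang: counts["tokens"] / counts["sentences"]
        for lang, counts in sizes.items()
    }


if __name__ == "__main__":
    for lang, mean in mean_tokens_per_sentence(SIZES).items():
        print(f"{lang}: {mean:.2f} tokens per sentence")
```

The two languages have slightly different sentence counts, so these means describe each side of the corpus separately and are not figures for aligned sentence pairs.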