Skip to content

Translation memories from Målfrid

This corpus derives from the Målfrid corpus, and contains translation memories based on parallel text extracted from 132 public sector internet domains.

The file format is compressed tmx for each domain/language pair (i.e. English-Norwegian Bokmål, English-Norwegian Nynorsk and Norwegian Bokmål-Norwegian Nynorsk).

Number of translation units for each language pair after cleaning of the data:

– English-Norwegian Bokmål: 145,267
– English-Norwegian Nynorsk: 12,268
– Norwegian Bokmål-Norwegian Nynorsk: 50,957

See the documentation file (p. 2) for more detailed information about the corpus.

This corpus derives from the Målfrid corpus, and contains translation memories based on parallel text extracted from 132 public sector internet domains.

The file format is compressed tmx for each domain/language pair (i.e. English-Norwegian Bokmål, English-Norwegian Nynorsk and Norwegian Bokmål-Norwegian Nynorsk).

Number of translation units for each language pair after cleaning of the data:

– English-Norwegian Bokmål: 145,267
– English-Norwegian Nynorsk: 12,268
– Norwegian Bokmål-Norwegian Nynorsk: 50,957

See the documentation file (p. 2) for more detailed information about the corpus.

Extended metadata

Download resources

Download metadata