These n-grams are derived from the Norwegian Newspaper Corpus and from part of the Text Corpus from Nordisk språkteknologi (NST). In total, the source material comprises 1175 million words of running text. This version provides the n-grams in two orderings: sorted alphabetically and sorted by frequency. Frequency lists (unigrams) are published in a separate distribution. A "light" version, listing the 1000 most frequent n-grams, is also available.