N-grams for Norwegian Bokmål (based on NST news text)

CLARINO NB – Språkbanken

License: Creative Commons Zero (CC-ZERO)

Updated: 2016-02-03

These n-grams are derived from part of the text corpus from Nordisk språkteknologi (NST). The source material consists of 510 million words of running text. The n-grams are also available in a "light" version that lists only the 1,000 most frequent n-grams (n=1-6). In the full version, all derived n-grams (n=1-6) are available both sorted alphabetically and sorted by frequency. Frequency lists (unigrams) are also available separately.
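For readers unfamiliar with the terminology, the following is a minimal sketch of how n-gram counts for n=1-6 and the two orderings could be produced from tokenized text. It is not the procedure used to build this resource: whitespace tokenization, the helper names `count_ngrams`, `most_frequent` and `alphabetical`, and the sample sentence are all assumptions made for illustration.

```python
from collections import Counter


def count_ngrams(tokens, max_n=6):
    """Count every n-gram of length 1..max_n in a list of tokens."""
    counts = {n: Counter() for n in range(1, max_n + 1)}
    for n in range(1, max_n + 1):
        # Slide a window of width n over the token sequence.
        for i in range(len(tokens) - n + 1):
            counts[n][tuple(tokens[i:i + n])] += 1
    return counts


def most_frequent(counter, k=1000):
    """Return the k most frequent n-grams; ties are broken alphabetically."""
    ranked = sorted(counter.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:k]


def alphabetical(counter):
    """Return all n-grams with their counts, sorted alphabetically."""
    return sorted(counter.items())


# Hypothetical sample text; whitespace tokenization is an assumption.
tokens = "det er en fin dag og det er en god dag".split()
counts = count_ngrams(tokens)

for ngram, freq in most_frequent(counts[2], k=3):
    print(" ".join(ngram), freq)
```

A "light" list in this sketch corresponds to `most_frequent(counts[n], k=1000)` for each n, while the full lists correspond to `alphabetical(counts[n])` and `most_frequent(counts[n], k=len(counts[n]))`.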
