NST N-gram – Swedish

This collection of n-grams (n = 1 to 6) was produced from approximately 400 million words of running text from the Swedish text corpus of Nordic Language Technology AS. The collection contains all the n-grams in two orderings: sorted alphabetically and sorted by frequency. A second format is also available that makes it possible to select text types; this version contains more texts and is based on approximately 437 million words. A simplified version, listing the 1,000 most frequent n-grams, is also available separately.
