NoWaC v 1.0 (Norwegian Web as Corpus)

Clarino - Textlab

Lisens: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)

Oppdatert: 2017-06-08

Frequency lists from NoWaC - Norwegian Web as Corpus - a web-based corpus of Bokmål Norwegian containing about 700 million tokens. The corpus has been built by crawling, downloading and processing web documents in the .no top-level internet domain between November 2009 and January 2010. NoWaC has been built with permission from the Norwegian Ministry of Culture (Kulturdepartementet).

Vis utvidede metadata

The link will take you to an external site: We take no responsibility whatsoever for the content of external links.