The Lexicographic Corpus for Norwegian Bokmål

The corpus consists of texts collected from available literature/prose from 1985 to 2013. The corpus is composed of texts from five genres: non-fiction prose (45 %) fiction (35 %) newpapers/magazines (10 %), TV subtitles (5 %), and non-standardized, unpublished texts (5 %), all in all 100 mill words.
The corpus is grammatically tagged with the original version of The Oslo-Bergen tagger.

Download resources

Extended metadata

Go to resource page

Go to resource page https://tekstlab.uio.no/glossa3/bokmal

dc:type	corpus
dc:title	The Lexicographic Corpus for Norwegian Bokmål
dc:identifier	oai:tekstlab.uio.no:LBK2013
dc:description	The corpus consists of texts collected from available literature/prose from 1985 to 2013. The corpus is composed of texts from five genres: non-fiction prose (45 %) fiction (35 %) newpapers/magazines (10 %), TV subtitles (5 %), and non-standardized, unpublished texts (5 %), all in all 100 mill words. The corpus is grammatically tagged with the original version of The Oslo-Bergen tagger.
dc:publisher
dc:format	accessibleThroughInterface
dc:date
dc:date	2013-12-31
dc:rights	Academic
dc:rights	CLARIN
dc:rights	CLARIN_ACA-NC-LOC-ND
dc:rights	https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&NORED=1&ND=1
dc:creator	University of Oslo
dc:creator	The Text Laboratory
dc:lang	Norwegian Bokmål

The Lexicographic Corpus for Norwegian Bokmål

Download resources

Extended metadata

Dublin Core (DC)

Go to resource page