The LOB corpus (POS tagged)

Clarino UiB

License: CLARIN_ACA

Updated: 2016-05-20

The Lancaster-Oslo/Bergen (LOB) Corpus is a million-word collection of British English texts published in 1961.

The corpus was compiled under the direction of Geoffrey Leech, University of Lancaster, and Stig Johansson, University of Oslo, in collaboration with Knut Hofland, Norwegian Computing Centre for the Humanities, Bergen. Like its American counterpart, the Brown Corpus (see Francis and Kucera 1979), it contains 500 text samples of approximately 2,000 words each, distributed over 15 text categories.

Part of the ICAME Corpus Collection.
