The Freiburg – LOB Corpus of British English (FLOB)
The Freiburg – LOB Corpus of British English (FLOB) contains texts from 1991.
Like the original Brown and LOB corpora, FLOB contains 500 texts of around 2000 words each, distributed across 15 text categories.
In 1991 a group of students at Freiburg University were engaged in what at first sight must appear as an almost anachronistic activity: they were keying in extracts of roughly 2,000 words from British newspapers. The sampling model was the press section of the LOB corpus. 1992 saw the beginning of a new Brown corpus. The ultimate aim was to compile parallel one-million-word corpora of the early 1990s that matched the original LOB and Brown corpora as closely as possible, and that would thus provide linguists with an empirical basis to study language change in progress.
The main aim in compiling the press section of FLOB was to match the 1991 material as closely as possible with that used in LOB by sampling the same newspapers. For the other sections, the same magazines and periodicals used in LOB were sampled whenever possible. In the sampling of monographs great care was taken to select books on equivalent topics rather than to randomly select titles from bibliographical sources. The main aim was to achieve close comparability with FLOB rather than statistical representativeness.
The Freiburg – LOB Corpus of British English (FLOB) contains texts from 1991.
Like the original Brown and LOB corpora, FLOB contains 500 texts of around 2000 words each, distributed across 15 text categories.
In 1991 a group of students at Freiburg University were engaged in what at first sight must appear as an almost anachronistic activity: they were keying in extracts of roughly 2,000 words from British newspapers. The sampling model was the press section of the LOB corpus. 1992 saw the beginning of a new Brown corpus. The ultimate aim was to compile parallel one-million-word corpora of the early 1990s that matched the original LOB and Brown corpora as closely as possible, and that would thus provide linguists with an empirical basis to study language change in progress.
The main aim in compiling the press section of FLOB was to match the 1991 material as closely as possible with that used in LOB by sampling the same newspapers. For the other sections, the same magazines and periodicals used in LOB were sampled whenever possible. In the sampling of monographs great care was taken to select books on equivalent topics rather than to randomly select titles from bibliographical sources. The main aim was to achieve close comparability with FLOB rather than statistical representativeness.
Utvidet metadata
resource Common Info
resource Type: corpus
identification Info
resource Name: The Freiburg – LOB Corpus of British English (FLOB)
description: The Freiburg – LOB Corpus of British English (FLOB) contains texts from 1991.
Like the original Brown and LOB corpora, FLOB contains 500 texts of around 2000 words each, distributed across 15 text categories.
In 1991 a group of students at Freiburg University were engaged in what at first sight must appear as an almost anachronistic activity: they were keying in extracts of roughly 2,000 words from British newspapers. The sampling model was the press section of the LOB corpus. 1992 saw the beginning of a new Brown corpus. The ultimate aim was to compile parallel one-million-word corpora of the early 1990s that matched the original LOB and Brown corpora as closely as possible, and that would thus provide linguists with an empirical basis to study language change in progress.
The main aim in compiling the press section of FLOB was to match the 1991 material as closely as possible with that used in LOB by sampling the same newspapers. For the other sections, the same magazines and periodicals used in LOB were sampled whenever possible. In the sampling of monographs great care was taken to select books on equivalent topics rather than to randomly select titles from bibliographical sources. The main aim was to achieve close comparability with FLOB rather than statistical representativeness.
attribution Text: The Freiburg-LOB Corpus (FLOB) (original version) compiled by Christian Mair, Albert-Ludwigs-Universität Freiburg
The Freiburg-LOB Corpus (FLOB) (POS-tagged version) compiled by Christian Mair, Albert Ludwigs-Universität Freiburg, and Geoffrey Leech, University of Lancaster
document Unstructured: Original (plain-text) Version: Hundt, Marianne, Andrea Sand, and Rainer Siemund. 1999. Manual of Information to accompany The Freiburg – LOB Corpus of British English (FLOB). Freiburg: Department of English. Albert-Ludwigs-Universität Freiburg.
http://clu.uni.no/icame/manuals/FLOB/INDEX.HTM.
POS-Tagged Version: Hinrichs, Lars, Nicholas Smith, and Birgit Waibel. 2007. The part-of-speech-tagged ‘Brown’ corpora: A manual of information, including pointers for successful use. Freiburg: Department of English. Albert-Ludwigs-Universität Freiburg.
http://clu.uni.no/icame/manuals/FLOB-Manual-tagged.pdf
documentation Structured
role: documentation
document Info
document Type: manual
title: Manual of Information
to accompany
The Freiburg – LOB Corpus of American English (FLOB)
author: Marianne Hundt
and Andrea Sand
and Rainer Siemund
The Freiburg – LOB Corpus of British English (FLOB)
dc:identifier
oai:clarino.uib.no:flob
dc:description
The Freiburg – LOB Corpus of British English (FLOB) contains texts from 1991.
Like the original Brown and LOB corpora, FLOB contains 500 texts of around 2000 words each, distributed across 15 text categories.
In 1991 a group of students at Freiburg University were engaged in what at first sight must appear as an almost anachronistic activity: they were keying in extracts of roughly 2,000 words from British newspapers. The sampling model was the press section of the LOB corpus. 1992 saw the beginning of a new Brown corpus. The ultimate aim was to compile parallel one-million-word corpora of the early 1990s that matched the original LOB and Brown corpora as closely as possible, and that would thus provide linguists with an empirical basis to study language change in progress.
The main aim in compiling the press section of FLOB was to match the 1991 material as closely as possible with that used in LOB by sampling the same newspapers. For the other sections, the same magazines and periodicals used in LOB were sampled whenever possible. In the sampling of monographs great care was taken to select books on equivalent topics rather than to randomly select titles from bibliographical sources. The main aim was to achieve close comparability with FLOB rather than statistical representativeness.