Skip to content

The LOB corpus (POS tagged)

The Lancaster – Oslo/Bergen (LOB) Corpus is a million-word collection of present-day (1961) British English texts.

The corpus was compiled under the direction of Geoffrey Leech, University of Lancaster, and Stig Johansson, University of Oslo, in collaboration with Knut Hofland, Norwegian Computing Centre for the Humanities, Bergen. Like its American counterpart, the Brown Corpus (see Francis and Kucera 1979), it contains 500 text samples of approximately 2,000 words distributed over 15 text categories.

Part of the ICAME Corpus Collection.

The Lancaster – Oslo/Bergen (LOB) Corpus is a million-word collection of present-day (1961) British English texts.

The corpus was compiled under the direction of Geoffrey Leech, University of Lancaster, and Stig Johansson, University of Oslo, in collaboration with Knut Hofland, Norwegian Computing Centre for the Humanities, Bergen. Like its American counterpart, the Brown Corpus (see Francis and Kucera 1979), it contains 500 text samples of approximately 2,000 words distributed over 15 text categories.

Part of the ICAME Corpus Collection.

Extended metadata

Download metadata

Go to resource page