LT World

Written Language Corpora

Corpus Linguistics: investigating language structure and use.
D. Biber, S. Conrad, and R. Reppen.
CUP. Cambridge, 1998.

Corpus Linguistics.
T. McEnery and A. Wilson.
EUP. Edinburgh, 2001.


Any collection of more than one text can be called a corpus (corpus being Latin for "body", hence a corpus is any body of text). In modern linguistics, however, the term "corpus" means a machine-readable collection of texts that is representative of the language use under investigation.

Written Corpora