LT World

You are here: Home kb Information & Knowledge Technologies Written Language Corpora

Written Language Corpora


Corpus Linguistics: investigating language structure and use.
D. Biber and S. Conrad and R. Reppen.
CUP. Cambridge, 1998.

Corpus Linguistics.
T. McEnery and A. Wilson.
EUP. Edinburgh, 2001.

Corpus



http://www.lt-world.org/hlt_survey/ltw-chapter12-2.pdf

Any collection of more than one text can be called a corpus, (corpus being Latin for "body", hence a corpus is any body of text). But the term "corpus" when used in the context of modern linguistics means a machine-readable text collection which is representative for the language use under investigation.


Written Corpora