Written Language Corpora — LT World

Written Language Corpora


Corpus Linguistics: Investigating Language Structure and Use.
D. Biber, S. Conrad and R. Reppen.
Cambridge University Press. Cambridge, 1998.

Corpus Linguistics.
T. McEnery and A. Wilson.
Edinburgh University Press. Edinburgh, 2001.

Corpus



http://www.lt-world.org/hlt_survey/ltw-chapter12-2.pdf

Any collection of more than one text can be called a corpus: the word is Latin for "body", so a corpus is any body of text. In modern linguistics, however, the term "corpus" denotes a machine-readable collection of texts that is representative of the language use under investigation.


Written Corpora