External Links
Google Scholar
provided by
German Research Center for Artificial Intelligence
with support by
as well as by

Multilingual Corpora

definition: Any collection of more than one text in more than one language can be called a multilingual corpus, (corpus being Latin for "body", hence a multilingual corpus is any body of multilingual texts). But the term "multilingual corpus" when used in the context of modern linguistics means a machine-readable text collection of multilingual texts which are representative for the language use under investigation.
related project(s):
related organisation(s):
related person(s):
  • Silvia Hansen-Schirra
  • Serge Sharoff
  • Martin Wynne
  • Silvia Bernardini
  • Knut Hofland
  • Wolfgang Teubert
  • Michael Barlow
  • Douglas Biber
  • Stella Neumann
  • Josef Schmied
  • Stig Johansson
related system(s) / resource(s):
  • XCorpus
  • JOC-CES Multilingual (En-De-Fr-It-Sp) Corpus
  • Multilingual corpora for cooperation (MLCC)
  • Bundesregierung Multilingual (Fr-De-En) Corpus
  • NATO Multilingual (Fr-De-En) Corpus
  • Chemnitz Internet Grammar
  • European Corpus Initiative Multilingual (ECI/MCI 1) Corpus
  • Orwell's 1984 parallel English-Romanian Text
  • TELRI multilingual Plato corpus
  • European Free Trade Organization Multilingual (De-En) Corpus
  • Canadian Hansard
  • Swiss Government Multilingual (Fr-De-It) Corpus
  • ParaConc
  • Bible of University of Maryland Parallel Corpus
  • ET10-63 Parallel Corpus
  • BAF French - English Parallel Corpus
  • Xkwic/CQP (IMS Corpus Workbench)
  • ITU or CRATER Parallel (Sp-Fr-En) Corpus
  • English-Norwegian Parallel Corpus (ENCP)
  • English Turkish Aligned Parallel Corpora
  • TDT2 Multilanguage Text Corpus
  • European Language Newspaper Text
  • Oslo Multilingual Corpus (OMC)
  • Multilingual translation corpus
  • Intellectual Property and Copyright Multilingual (Fr-En) Corpus
related publication(s):

A Guide to ParaConc.
M. Barlow.
Athelstan. Houston. 1995.

Dimensions of register variation: A cross-linguistic comparison.
D. Biber. Cambridge University Press. Cambridge. 1995.

Corpus Linguistics: Investigating Language Structure and Use.
D. Biber and S. Conrad and R. Reppen.
Cambridge University Press. Cambridge. 1998.