Multilingual Corpora
- Evaluations and Language Resources Distribution Agency (ELDA)
- Contragram Research Group
- Trans-European Language Resources Infrastructure (TELRI)
- Pan American Health Organization (PAHO)
- Linguistic Data Consortium (LDC)
- European Corpus Initiative (ECI)
- European Language Resources Association (ELRA)
- International Computer Archive of Modern and Medieval English (ICAME)
- TELRI Research Archive of Computational Tools and Resources (TRACTOR)
- ET10-63 Parallel Corpus
- TDT2 Multilanguage Text Corpus
- Orwell's 1984 parallel English-Romanian Text
- NATO Multilingual (Fr-De-En) Corpus
- Multilingual translation corpus
- JOC-CES Multilingual (En-De-Fr-It-Sp) Corpus
- Swiss Government Multilingual (Fr-De-It) Corpus
- Oslo Multilingual Corpus (OMC)
- The Canadian Hansard Corpus
- Intellectual Property and Copyright Multilingual (Fr-En) Corpus
- English-Norwegian Parallel Corpus (ENCP)
- CHILDES
- BAF French - English Parallel Corpus
- European Corpus Initiative Multilingual (ECI/MCI 1) Corpus
- An Environment for Managing Corpus and Multilingual Web Server (XCorpus)
- Chemnitz Internet Grammar
- European Language Newspaper Text
- Multilingual corpora for cooperation (MLCC)
- ParaConc
- Xkwic/CQP (IMS Corpus Workbench)
- Bundesregierung Multilingual (Fr-De-En) Corpus
- TELRI multilingual Plato corpus
- ITU or CRATER Parallel (Sp-Fr-En) Corpus
- European Free Trade Organization Multilingual (De-En) Corpus
- English Turkish Aligned Parallel Corpora
- Bible of University of Maryland Parallel Corpus
- Concordancing and parallel corpora (ParaConc)
Any collection of more than one text in more than one language can be called a multilingual corpus, (corpus being Latin for "body", hence a multilingual corpus is any body of multilingual texts). But the term "multilingual corpus" when used in the context of modern linguistics means a machine-readable text collection of multilingual texts which are representative for the language use under investigation.







