Language Data for English
- Canadian English Speech Recognition Database ---- Place Name (Desktop)-150 Speakers (King-ASR-102)
- The Penn Treebank (PTB)
- Proposition Bank I
- The Freiburg-LOB Corpus of British English (FLOB)
- The SUSANNE Corpus
- Child Language Data Exchange System (CHILDES)
- European Corpus Initiative Multilingual Corpus I (ECI/MCI)
- Chinese and English Mixing Speech Synthesis Database (Female) (King-TTS-011)
- American English Speech Recognition Database ---- Sentences (Desktop)-150 Speakers (King-ASR-107)
- American English Speech Recognition Database ---- Person Name (Desktop)-150 Speakers (King-ASR-109)
- Canadian English Speech Recognition Database ---- Digit String (Telephone)-150 Speakers (King-ASR-104)
- American English Speech Recognition Database ---- Person Name (Telephone)-150 Speakers (King-ASR-113)
- UK English Speech Corpus for TTS (Female) (King-TTS-006)
- The Bank of English (COBUILD Corpus)
- American English Speech Recognition Database ---- Place Name (Telephone)-150 Speakers (King-ASR-114)
- Lancaster Parsed Corpus (ICAME)
- Simplified Chinese-to-English Dictionary (King-MT-004)
- Vienna-Oxford International Corpus of English (VOICE)
- British National Corpus (BNC)
- Lincoln Lab Speech Enhancement Corpus (LLSEC)
- Dutch Parallel Corpus (DPC)
- The LUCY Corpus
- Canadian English Speech Recognition Database ---- Sentences (Telephone)-150 Speakers (King-ASR-103)
- Japanese - English Personal Names (King-MT-009)
- TIMIT Acoustic-Phonetic Continuous Speech Corpus (TIMIT)
- UK English Speech Recognition Database ---- Sentences (Desktop)-200 Speakers (King-ASR-177)
- Japanese - English Place Names (King-MT-010)
- TDT2 English Audio Corpus
- The HCRC Map Task Corpus
- US English Pronunciation Lexicon (King-Lexicon-004)
- Verbmobil Word Frequency Lists
- International Corpus of Learner English (ICLE)
- Occidental Chinese Speech Recognition Database ---- (Desktop)-300 Speakers (King-ASR-127)
- Canadian English Speech Recognition Database ---- Place Name (Telephone)-150 Speakers (King-ASR-106)
- Project Gutenberg (PG)
- The Brown University Standard Corpus of Present-Day American English (Corpus BROWN)
- Multilingual translation corpus
- American English Speech Recognition Database ---- Place Name (Desktop)-150 Speakers (King-ASR-110)
- Canadian English Speech Recognition Database ---- Person Name (Telephone)-150 Speakers (King-ASR-105)
- Multilingual Aligned Annotated Corpus (CRATER)
- Japanese - English Dictionary of Technical Terms (King-MT-008)
- Prague Czech-English Dependency Treebank 1.0 (PCEDT 1.0)
- Canadian English Speech Recognition Database ---- Person Name (Desktop)-150 Speakers (King-ASR-101)
- Canadian English Speech Recognition Database ---- Digit String (Desktop)-150 Speakers (King-ASR-100)
- US English Speech Recognition Corpus (Desktop) - 50 Speakers (King-ASR-090)
- The PARC 700 Dependency Bank
- Multilingual Proper Noun Database (King-MT-007)
- Australian English Speech Recognition Database ---- Sentences (Desktop)-200 Speakers (King-ASR-176)
- Chinese English Speech Recognition Database ---- (Desktop)-100 Speakers (King-ASR-126)
- US English Speech Recognition Database ---- (Mobile)-150 Speakers (King-ASR-139)
- UK English Pronunciation Lexicon (King-Lexicon-005)
- US English Speech Recognition Database ---- (in-car)-300 Speakers (King-ASR-131)
- Chinese-English Parallel Corpus of SMS (King-MT-002)
- American English Speech Recognition Database ---- Sentences (Telephone)-150 Speakers (King-ASR-111)
- American English Speech Recognition Database ---- Digit String (Desktop)-150 Speakers (King-ASR-108)
- The CHRISTINE Corpus
- Chinese-English-Korean-Japanese Parallel Corpus (King-MT-001)
- The International Corpus of English (ICE)
- London-Lund Corpus of Spoken English (LLC)
- English-to-Simplified Chinese Dictionary (King-MT-005)
- Canadian English Speech Recognition Database ---- Sentences (Desktop)-150 Speakers (King-ASR-099)
- Penn Discourse Treebank (PDTB)
- English Parser Evaluation Corpus
- USENET corpus
- The British component of the International Corpus of English (ICE-GB)
- Simplified Chinese—English Computer Terms (King-MT-003)
- The American National Corpus (ANC)
- US TTS speech database (Female)
- Chinese and English Mixing Speech Synthesis Database (Male) (King-TTS-012)
- American English Speech Recognition Database ---- Digit String (Telephone)-150 Speakers (King-ASR-112)
- The LinGO Redwoods Treebank