LT World


Russian Speech Recognition Database ---- Sentences (Desktop)-200 Speakers (King-ASR-183)

All of the data has been manually proofread and precisely labeled.

  • Speech Corpus

  • Multimodal
  • Spoken
  • Written

This Russian speech recognition database was collected in Russia and contains the voices of 200 native speakers, demographically balanced by age group (12~18, 19~29, 30~49, 50~60), gender (100±5% males, 100±5% females) and regional accent.
A script pool of 20,000 simple sentences was phonetically designed for both training and testing of speech recognizers. Each speaker recorded 300 sentences randomly selected from the script pool. All speakers were recorded in a quiet office room with two professional microphones.
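The prompt selection described above can be illustrated with a minimal sketch. It assumes that each speaker's 300 sentences are drawn without repetition from the 20,000-sentence pool; the description only says "randomly selected", so the exact procedure, the seed, and the function name `assign_prompts` are hypothetical. The total it computes is the nominal number of prompted utterances implied by the figures above, not a count stated by the provider.

```python
import random

POOL_SIZE = 20_000         # sentences in the script pool (from the description)
NUM_SPEAKERS = 200         # speakers in the database (from the description)
PROMPTS_PER_SPEAKER = 300  # sentences recorded by each speaker (from the description)


def assign_prompts(seed: int = 0) -> dict[int, list[int]]:
    """Draw PROMPTS_PER_SPEAKER distinct sentence indices for every speaker.

    Assumption: sampling is without replacement within one speaker;
    different speakers may still share sentences.
    """
    rng = random.Random(seed)
    return {
        speaker: rng.sample(range(POOL_SIZE), PROMPTS_PER_SPEAKER)
        for speaker in range(NUM_SPEAKERS)
    }


assignments = assign_prompts()

# Nominal number of prompted utterances: 200 speakers x 300 sentences.
total_utterances = sum(len(prompts) for prompts in assignments.values())
print(total_utterances)  # 60000
```

Because the recordings were made with two microphones, the number of signal files may differ from this nominal utterance count.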
Each prompted utterance is stored in a separate file and each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
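As a rough illustration of how such a label file might be read, the sketch below assumes the common SAM layout of one `MNEMONIC: value` entry per line. The function name `parse_sam_label` and the sample text are hypothetical and invented for illustration; the actual field set of this database is not given in the description.

```python
def parse_sam_label(text: str) -> dict[str, list[str]]:
    """Parse 'MNEMONIC: value' lines into a dict of value lists.

    A mnemonic may occur more than once, so values are collected in lists.
    Lines without a colon or without a mnemonic are skipped.
    """
    fields: dict[str, list[str]] = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep or not key.strip():
            continue
        fields.setdefault(key.strip(), []).append(value.strip())
    return fields


# Invented sample content, not taken from the database.
sample = """LHD: SAM, 6.0
SPK: hypothetical-speaker-0001
LBO: 0,,,example prompt text
"""

fields = parse_sam_label(sample)
print(fields["LBO"])
```

Collecting values in lists keeps repeated mnemonics intact, which matters if a label file carries more than one entry of the same kind.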
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
All the data was transcribed and labeled.

  • Russian

  • Monolingual

  • Dialectology
  • Phonology
  • Phonetics

  • Speech Recognizer
  • Speech Recognition Applications
  • Speech Recognition System

The database is intended for tuning and testing speech recognition systems used in ASR applications.