UK English Speech Recognition Database ---- Sentences (Desktop)-200 Speakers (King-ASR-177)
All audio files were manually transcribed and annotated.
- Speech Corpus
This UK English desktop speech recognition database was collected by Speechocean’s project team in the UK. It is one of the databases in the Speech Data ---- Desktop Project (SDD), which currently contains database collections for 30 languages.
It contains the voices of 200 different native speakers, distributed evenly by age (mainly 16-30, 31-45 and 46-60), gender (106 males, 94 females) and regional accent (for details, please see the technical document). The script was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet office environment, and 300 phonetically rich sentences were randomly selected from a specially designed pool of sentences.
The speech data are stored as uncompressed 16-bit audio sampled at 48.1 kHz. A pronunciation lexicon with phonemic transcriptions in SAMPA is also included. The corpus contains 189.1 hours of pure recording. Phoneme labelling was done manually for 6843 sentences (100034 words) chosen from 24 speakers.
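As a rough illustration of how a user might check the stated audio format after receiving the data, the sketch below reads the header of one file and reports its sample rate, bit depth, channel count and duration. It assumes the recordings are delivered as PCM WAV files, which the description above does not state (it only says the audio is uncompressed). The function names `describe_wav` and `total_hours` are hypothetical and not part of any Speechocean tooling.

```python
import wave


def describe_wav(path):
    """Return (sample_rate_hz, bits_per_sample, channels, duration_s).

    Assumption: the file is an uncompressed PCM WAV file; the corpus
    description does not name the container format.
    """
    with wave.open(path, "rb") as wav:
        sample_rate = wav.getframerate()
        bits_per_sample = wav.getsampwidth() * 8  # bytes per sample -> bits
        channels = wav.getnchannels()
        duration_s = wav.getnframes() / sample_rate
    return sample_rate, bits_per_sample, channels, duration_s


def total_hours(paths):
    """Sum the durations of several WAV files and return the total in hours."""
    return sum(describe_wav(p)[3] for p in paths) / 3600.0
```

Running `describe_wav` on a few files and comparing the result against the technical document is a quick sanity check before training; `total_hours` over all files gives a figure that can be compared with the stated recording hours.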
Pure Recording Hours: 189.1 hours
The database was made for training and testing speech recognition systems in English ASR applications.