Train Size: 214
Test Size: 1896
Length: 1024
Number of Classes: 39
Type: SOUND
Data Source: Link Here
Donated By: H. Hamooni, A. Mueen
Description: This data set is a subsample of the data used in the paper "Dual-domain Hierarchical Classification of Phonetic Time Series". Each series is extracted from the segmented audio collected from Google Translate and the Merriam-Webster online dictionary. Each of these sources has different features. Audio files collected from the Google Translate, Oxford, and Merriam-Webster dictionaries are recorded at 22050, 44100, and 11025 samples per second, respectively. All of them have male and female speakers in different ratios. The Oxford dictionary includes British and American accent pronunciations for each word. After data collection, the authors segmented the waveforms of the words into phonemes using the Forced Aligner tool from the Penn Phonetics Laboratory.
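To give a feel for what the different sampling rates mean, the short sketch below computes how much audio a 1024-point series would cover at each rate quoted above. It assumes that each point is one raw audio sample at the source rate, which the description does not state; the series may have been resampled or padded, so treat the numbers as illustrative only. The names `SAMPLE_RATES_HZ`, `SERIES_LENGTH`, and `duration_ms` are made up for this example.

```python
# Sampling rates (Hz) quoted in the description for each audio source.
SAMPLE_RATES_HZ = {
    "Google Translate": 22050,
    "Oxford": 44100,
    "Merriam-Webster": 11025,
}

# Series length from the dataset summary.
# Assumption: one point corresponds to one raw audio sample.
SERIES_LENGTH = 1024


def duration_ms(n_samples: int, rate_hz: int) -> float:
    """Return the duration in milliseconds of n_samples recorded at rate_hz."""
    return 1000.0 * n_samples / rate_hz


# Duration covered by one series under each sampling rate.
durations = {
    source: duration_ms(SERIES_LENGTH, rate)
    for source, rate in SAMPLE_RATES_HZ.items()
}

for source, ms in durations.items():
    print(f"{source}: {ms:.1f} ms")
```

Under that assumption, the same series length spans a fourfold range of durations across sources, which is one way the sources could differ in their features.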

Best Algorithm: COTE
Best Accuracy: 36.20%