ISSN 0975-3583
 

Journal of Cardiovascular Disease Research



    CNN and MFCC based Speech Net: Children Speech Recognition model


    C Ramadevi, Kommu Anusha, Pavankumar Thummeti
    JCDR. 2019: 463-474

    Abstract

    Children speech recognition based on short-term spectral features is a challenging task. One of the reasons is that children speech has high fundamental frequency that is comparable to formant frequency values. Furthermore, as children grow, their vocal apparatus also undergoes changes. This presents difficulties in extracting standard short-term spectral-based features reliably for speech recognition. In recent years, novel acoustic modeling methods have emerged that learn both the feature and phone classifier in an end-to-end manner from the raw speech signal. Through an investigation on PF-STAR corpus we show that children speech recognition can be improved using end-to-end acoustic modeling methods. Finally, the simulations revealed that the proposed MFCC-CNN resulted in superior performance as compared to GMM model

    Description

    » PDF

    Volume & Issue

    Volume 10 Issue 4

    Keywords