Abstract
In this article, epoch-based dynamic features such as sequence of epoch interval values and epoch strength values are explored to classify infant cries. Epoch is the instant of significant excitation of the vocal tract system during the production of speech. For voiced speech, the most significant excitation takes place around the instant of glottal closure. The different types of infant cries considered in this work are hunger, pain, and wet diaper. In this work, epoch strength and epoch interval features are used to represent infant cry-specific information from the acoustic signal. In this study, the proposed features such as epoch interval and epoch strength values are determined using zero-frequency filter-based method. Gaussian mixture models are used to classify the above-mentioned cries from the features proposed in this work. GMMs are developed separately for each of the cries using the proposed features. The infant cry database collected under a telemedicine project at the Indian Institute of Technology Kharagpur has been used for this study. In the first step, infant cry recognition accuracy is investigated separately using epoch interval and epoch strength features. To enhance recognition performance, GMMs developed using various features are combined through score level fusion techniques. The recognition performance using a combination of evidence is found to be superior over individual systems.