A journal of IEEE and CAA, publishing high-quality papers in English on original theoretical and experimental research and development in all areas of automation
Volume 6 Issue 3
May  2019

IEEE/CAA Journal of Automatica Sinica

Lichuan Liu, Wei Li, Xianwen Wu and Benjamin X. Zhou, "Infant Cry Language Analysis and Recognition: An Experimental Approach," IEEE/CAA J. Autom. Sinica, vol. 6, no. 3, pp. 778-788, May 2019. doi: 10.1109/JAS.2019.1911435

Infant Cry Language Analysis and Recognition: An Experimental Approach

doi: 10.1109/JAS.2019.1911435
Funds: This work was supported by the Gerber Foundation and the Northern Illinois University Research Foundation.
  • Much recent research has been directed towards natural language processing. However, the baby's cry, which serves as the primary means of communication for infants, has not yet been extensively explored, because it is not a language that can be easily understood. Since cry signals carry information about a baby's well-being and can be understood, to an extent, by experienced parents and experts, recognition and analysis of an infant's cry is not only possible but also has profound medical and societal applications. In this paper, we obtain and analyze audio features of infant cry signals in the time and frequency domains. Based on the related features, we classify given cry signals into specific cry meanings for cry language recognition. Features extracted from the audio feature space include linear predictive coding (LPC), linear predictive cepstral coefficients (LPCC), Bark frequency cepstral coefficients (BFCC), and Mel frequency cepstral coefficients (MFCC). A compressed sensing technique was used for classification, and practical data were used to design and verify the proposed approaches. Experiments show that the proposed infant cry recognition approaches offer accurate and promising results.
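
    As an illustration of two of the features named in the abstract, the following is a minimal sketch of how LPC coefficients can be estimated from one signal frame with the autocorrelation method, and how LPCC values follow from them through the standard cepstral recursion. It is not the authors' implementation: the function names `lpc_autocorr`, `lpc_to_lpcc`, and `synth_ar2`, the prediction order, and the synthetic test signal are all assumptions made for this example, and the abstract does not state the frame length, windowing, or order used in the paper.

```python
import numpy as np


def lpc_autocorr(frame, order):
    """Estimate LPC predictor coefficients a_1..a_p (hypothetical helper).

    Model assumed here: s[n] ~ sum_{k=1..p} a_k * s[n-k].
    Solves the autocorrelation normal equations R a = r directly.
    """
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation values r[0], ..., r[order].
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    # Symmetric Toeplitz matrix with entries R[i, j] = r[|i - j|].
    lags = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    R = r[lags]
    return np.linalg.solve(R, r[1:order + 1])


def lpc_to_lpcc(a):
    """Convert predictor coefficients to cepstral coefficients c_1..c_p.

    Recursion: c_n = a_n + sum_{k=1..n-1} (k / n) * c_k * a_{n-k}.
    """
    p = len(a)
    c = np.zeros(p)
    for n in range(1, p + 1):
        acc = a[n - 1]
        for k in range(1, n):
            acc += (k / n) * c[k - 1] * a[n - k - 1]
        c[n - 1] = acc
    return c


def synth_ar2(num_samples, a1=1.3, a2=-0.6, seed=0):
    """Synthetic stand-in for a signal frame: a stable AR(2) process."""
    rng = np.random.default_rng(seed)
    e = rng.standard_normal(num_samples)
    s = np.zeros(num_samples)
    for n in range(2, num_samples):
        s[n] = a1 * s[n - 1] + a2 * s[n - 2] + e[n]
    return s


if __name__ == "__main__":
    signal = synth_ar2(20000)
    a_hat = lpc_autocorr(signal, order=2)
    print("estimated LPC:", a_hat)
    print("LPCC:", lpc_to_lpcc(a_hat))
```

    On the synthetic AR(2) signal the estimated coefficients should land close to the generating values (1.3 and -0.6). In practice a real cry recording would first be split into short, windowed frames, and a higher prediction order would typically be chosen; those choices are left open here because the abstract does not specify them.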





    Figures(11)  / Tables(4)
