Reseach Themes

Current Location: Home > Research Area > Reseach Themes > Content

Research Area

Auditory, Speech, and Music

Time:2022-12-19    Click:
  • Responsible Person:Xihong Wu
  • Member:Tianshu Qu, Jing Chen, Shan Gao
负责人 Xihong Wu 成员 Tianshu Qu, Jing Chen, Shan Gao

The field of Auditory, Speech, and Music focuses on artificial general intelligence, employing research paradigms based on neuroscience and brain-inspired intelligence. It adopts an embodied learning approach through body-environment interaction, with the goal of conducting theoretical, methodological, and applied research in areas such as scene analysis of complex acoustic environments, human-computer speech interaction, and intelligent music composition. The aim is to construct and refine new frameworks for auditory perception processing models, speech knowledge representation, and cognitive development.

The main research areas include the mechanisms and modeling of auditory perception, active/passive target source detection and localization and enhancement in complex acoustic environments for robots, speech perception and understanding and generation, music signal source separation and analysis, as well as automatic composition, orchestration, performance, mixing, and 3D virtual sound field. The research aims to provide intelligent robots with autonomous auditory perception and cognitive abilities for environment analysis and understanding, developing general objectives, adaptive strategies, and effective methods in real-world settings.

This research has received support from various national, provincial, international, and industry-funded projects, including the National Key R&D Program of China, National Program on Key Basic Research Project (973 Program), National High-Tech R&D Program (863 Program), Major or Key Program of National Natural Science Foundation of China, Major Program of National Social Science Fund of China, Key Basic Research Projects of the Science and Technology Commission of the Central Military Commission, and Science and Technology Innovation 2030-Major Project. Significant progress has been made in related research and technology fields, with over 70 papers published in renowned domestic and international journals and top conferences, including IEEE TASLP, Hearing Research, JASA, JAES, ICASSP, Interspeech, AAAI, AES Convention, and more. Additionally, more than 50 national invention patents have been filed.

Close

Address: No. 5, Yiheyuan Road, Haidian District, Beijing Feedback: its@pku.edu.cn

Copyright © All Rights Reserved.