Research Area

Current Location: Home > Research Area > Content

Research Area

Research progress of Xihong Wu's team in the field of sound field analysis and environmental modeling

Time:2022-12-28    Click:

The acoustic environment is one of the most familiar environments to humans, and interacting with the environment is an important characteristic of embodied intelligence. In the analysis of three-dimensional sound fields, the decomposition of spherical harmonics of sound enables the decoupling of source information and environmental information, making it widely used in three-dimensional sound field analysis based on spherical arrays. In this study, based on the signals received by a spherical array, a spherical harmonic domain expansion is performed. An iterative inversion model training method is employed to couple the acoustic environment with the target signal, separating the spherical harmonic domain signals in multiple dimensions, including multiple sound sources, early reverberation, late reverberation, and noise. The separated spherical harmonic function signals can be used for tasks such as source analysis and acoustic environmental description. The related technological achievements have been applied in various fields, including three-dimensional sound field recording and playback, three-dimensional sound field control, sound source detection, localization and enhancement, and multi-channel audio coding and decoding.

This research has been supported by the National Key Research and Development Program of China, the National High-Tech Research and Development Program (863 Program), the National Natural Science Foundation of China, and other projects. Multiple findings have been published in well-known domestic and international journals and top conferences, including IEEE TASLP, JAES, ICASSP, Interspeech, AAAI, AES Convention, and have been applied for national invention patents. Some patented technologies have been adopted by standards such as 3GPP IVAS and AVS3-P3, and have been implemented in Huawei's MateView, headphones, and other products. The AVS3-P3 codec was used for the 2022 Mid-Autumn Festival Gala and the FIFA World Cup - Qatar 2022 broadcast.

Live Recording of the Concert


Close

Address: No. 5, Yiheyuan Road, Haidian District, Beijing Feedback: its@pku.edu.cn

Copyright © All Rights Reserved.