Earticle

현재 위치 Home

A DNN-Based Personalized HRTF Estimation Method for 3D Immersive Audio

첫 페이지 보기
  • 발행기관
    국제인공지능학회(구 한국인터넷방송통신학회) 바로가기
  • 간행물
    International Journal of Internet, Broadcasting and Communication 바로가기
  • 통권
    Vol.13 No.1 (2021.02)바로가기
  • 페이지
    pp.161-167
  • 저자
    Ji Su Son, Seung Ho Choi
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A391048

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

원문정보

초록

영어
This paper proposes a new personalized HRTF estimation method which is based on a deep neural network (DNN) model and improved elevation reproduction using a notch filter. In the previous study, a DNN model was proposed that estimates the magnitude of HRTF by using anthropometric measurements [1]. However, since this method uses zero-phase without estimating the phase, it causes the internalization (i.e., the insidethe- head localization) of sound when listening the spatial sound. We devise a method to estimate both the magnitude and phase of HRTF based on the DNN model. Personalized HRIR was estimated using the anthropometric measurements including detailed data of the head, torso, shoulders and ears as inputs for the DNN model. After that, the estimated HRIR was filtered with an appropriate notch filter to improve elevation reproduction. In order to evaluate the performance, both of the objective and subjective evaluations are conducted. For the objective evaluation, the root mean square error (RMSE) and the log spectral distance (LSD) between the reference HRTF and the estimated HRTF are measured. For subjective evaluation, the MUSHRA test and preference test are conducted. As a result, the proposed method can make listeners experience more immersive audio than the previous methods.

목차

Abstract
1. Introduction
2. Proposed Personalized HRTF Estimation Method
2.1. The CIPIC database and preprocessing
2.2. A DNN-based 3D immersive audio reproduction model
3. Performance Evaluation
3.1 Objective evaluation
3.2. Subjective evaluation
4. Conclusions
Acknowledgement
References

키워드

3D immersive audio Head-related transfer function (HRTF) Head-related impulse response (HRIR) Deep Neural Network (DNN) notch filter personalization Anthropometric measurement

저자

  • Ji Su Son [ Dept. of Electronic and IT Media Engineering, Seoul National University of Science and Technology, Seoul, Korea ]
  • Seung Ho Choi [ Dept. of Electronic and IT Media Engineering, Seoul National University of Science and Technology, Seoul, Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    국제인공지능학회(구 한국인터넷방송통신학회) [The International Association for Artificial Intelligence]
  • 설립연도
    2000
  • 분야
    공학>전자/정보통신공학
  • 소개
    인터넷방송, 인터넷 TV , 방송 통신 네트워크 및 관련 분야에 대한 국내는 물론 국제적인 학술, 기술의 진흥발전에 공헌하고 지식 정보화 사회에 기여하고자 한다.

간행물

  • 간행물명
    International Journal of Internet, Broadcasting and Communication
  • 간기
    계간
  • pISSN
    2288-4920
  • eISSN
    2288-4939
  • 수록기간
    2009~2025
  • 십진분류
    KDC 326 DDC 380

이 권호 내 다른 논문 / International Journal of Internet, Broadcasting and Communication Vol.13 No.1

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장