Efficient Urban Sound Classification : A Fused Visual Feature Approach

Poster Session 1 : IT Fusion Technologies etc.

간행물

한국차세대컴퓨팅학회 학술대회 바로가기
권호(발행년)

ICNGC 2025 The 11th International Conference on Next Generation Computing 2025 (2025.12) 바로가기
페이지

pp.172-173
저자

Myeonghoe Lee, Pankoo Kim, Chang Choi
언어

영어(ENG)
URL

https://www.earticle.net/Article/A478488

영어: This research analyzes methods to enhance Urban Sound Classification performance by converting audio into Melspectrogram and MFCC images. Using the UrbanSound8K dataset, we compared single-representation and feature-level fusion strategies across CNN architectures. Results show that ResNet achieved the highest accuracy of 0.9594. However, a DenseNet-based fusion model proved more efficient, reaching a competitive accuracy of 0.9456 with fewer resources, demonstrating the potential for practical models that balance performance and efficiency.

Myeonghoe Lee [ Department of Computer Engineering Gachon University Seongnam-si, Republic of Korea ]
Pankoo Kim [ Department of Computer Engineering Chosun University Gwangju, Republic of Korea ]
Chang Choi [ Department of Computer Engineering Gachon University Seongnam-si, Republic of Korea ] Corresponding Author

자료제공 : 네이버학술정보

Earticle