PathoVoiceFAI : Enhancing Voice Pathology Classification in Human Voices

Oral Session I - III : Multi-Modality and Recommendation Systems

Publication

Korean Institute of Next Generation Computing Conference
Volume/Issue (Year)

The 10th International Conference on Next Generation Computing 2024 (2024.11)
Pages

pp.179-182
Authors

Srinidhi Kanagachalam, Rasim Mahmudov, Deok-Hwan Kim
Language

English (ENG)
URL

https://www.earticle.net/Article/A468838

English: Voice pathology classification has become one of the primary research objectives in biomedical engineering. This paper proposes PathoVoiceFAI, a technique that enhances multiclass voice pathology classification by using attention layers together with a suitable fusion technique to combine multimodal inputs. Preliminary results show that mid-level fusion with attention layers improves classification accuracy by 5% compared with the standard decision-level fusion technique. This highlights the effect of powerful feature extraction on classification outcomes for applications in clinical environments.

Srinidhi Kanagachalam [ Department of Electrical and Computer Engineering, Inha University, Incheon, South Korea ]
Rasim Mahmudov [ Department of Electrical and Computer Engineering, Inha University ]
Deok-Hwan Kim [ Department of Electrical and Computer Engineering, Inha University ] (Corresponding Author)
