The 10th International Conference on Next Generation Computing 2024 (2024.11)바로가기
페이지
pp.179-182
저자
Srinidhi Kanagachalam, Rasim Mahmudov, Deok-Hwan Kim
언어
영어(ENG)
URL
https://www.earticle.net/Article/A468838
원문정보
초록
영어
Voice pathology classification has become one of the primary objectives of research in biomedical engineering. This paper proposes PathoVoiceFAI, a technique that enhances the multiclass pathology classification by leveraging the power of attention layers and appropriate fusioning technique to fuse the multimodal inputs. The preliminary results show that use of mid-level fusion with attention layers improves the classification accuracy by 5% in comparison to the standard decision-level fusion technique. This highlights the effect of powerful feature extraction in enhancing the classification outcomes for application in clinical environment.
목차
Abstract I. INTRODUCTION II. BACKGROUND A. Multiclass pathology classification B. Deep learning techniques III. PATHOVOICEFAI ARCHITECTURE IV. EXPERIMENTAL EVALUATION V. CONCLUSION ACKNOWLEDGEMENT REFERENCES