Multimodal Emotion Recognition based on Feature-level fusion of Facial Expression-Audio Modalities

Deoghwa KIM; Han Wang; Deok-Hwan Kim

216.73.216.223

개인회원 가입

개인회원
기관회원

개인회원 로그인

개인회원 가입으로 더욱 편리하게 이용하세요. 개인회원 가입

아이디/비밀번호를 잊으셨나요? 아이디/비밀번호 찾기

기관회원 로그인

소속기관에서 검색되지 않는 기관은 무료원문다운이 불가능합니다. 개인회원 가입 후 유료구매를 하시거나 소속기관 도서관에 이용문의해 주세요.

Home

Oral Session I - III : Multi-Modality and Recommendation Systems

Multimodal Emotion Recognition based on Feature-level fusion of Facial Expression-Audio Modalities

발행기관

한국차세대컴퓨팅학회 바로가기
간행물

한국차세대컴퓨팅학회 학술대회 바로가기
통권

The 10th International Conference on Next Generation Computing 2024 (2024.11)바로가기
페이지

pp.187-190
저자

Deoghwa KIM, Han Wang, Deok-Hwan Kim
언어

영어(ENG)
URL

https://www.earticle.net/Article/A468840

원문정보

초록

영어: This paper proposed a Feature-level fusion technique that combines facial expression and audio modalities for multimodal emotion recognition. The learning model utilizes a hybrid approach combining CNN and LSTM to learn the spatiotemporal characteristics of video and audio modalities effectively. Compared to a unimodal approach, speech emotion recognition achieved 74% accuracy, and facial emotion recognition achieved 83% accuracy, while the proposed multimodal approach achieved 93% accuracy, demonstrated that multimodal emotion recognition is more accurate than unimodal emotion recognition. Furthermore, in tests using the RAVDESS dataset, the proposed model achieved higher emotion recognition rates compared to related studies. This study demonstrated the possibility of multimodal emotion recognition and designed a model capable of recognizing emotions in various environments and situations. Through this, we aim to contribute to the advancement of emotion recognition technology.

Abstract
I. INTRODUCTION
II. RELATED WORK
A. Facial Emotion Recognition
B. Speech Emotion Recognition
C. Multimodal(Speech + Facial Emotion Recognition, Facial+ EEG Emotion Recognition)
III. PROPOSED METHOD
A. Preprocessing Process for Video and Audio Data
B. Structure of Proposed Model
IV. EXPERIMENTS
A. Used DATASET
B. EXPERIMENTS RESULTS
V. CONCLUSION
ACKNOWLEDGMENT
REFERENCES

키워드

Multimodal Emotion Recognition Feature-level fusion LSTM CNN

저자

Deoghwa KIM [ Department of Electrical and Computer Engineering Inha University Incheon, South Korea ]
Han Wang [ Department of Electrical and Computer Engineering Inha University Incheon, South Korea ]
Deok-Hwan Kim [ Department of Electrical and Computer Engineering Inha University Incheon, South Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

발행기관명

한국차세대컴퓨팅학회 [Korean Institute of Next Generation Computing]
설립연도
2005
분야
공학>컴퓨터학
소개
본 학회는 차세대 PC 및 그 관련분야의 학술활동을 통하여 차세대 PC의 학문 및 기술발전을 도모하고 산업발전 및 국제협력 증진을 목적으로 한다.

간행물

간행물명

한국차세대컴퓨팅학회 학술대회
간기
반년간
수록기간
2021~2025
십진분류
KDC 566 DDC 004

이 권호 내 다른 논문 / 한국차세대컴퓨팅학회 학술대회 The 10th International Conference on Next Generation Computing 2024

피인용수 : 0건 (자료제공 : 네이버학술정보)

함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

출처 : 네이버학술정보

0개의 논문이 장바구니에 담겼습니다.

페이지 저장

소속기관 조회

이용자님의 소속기관(단체)이 서비스에 가입되어 있는지 확인해 보십시오.
기관회원에 소속되어 있는 이용자는 원문을 무료로 이용할 수 있습니다.

상호: 주식회사 학술교육원 I 대표: 노방용 I 사업자등록번호: 122-81-88227 I 통신판매업신고번호: 제2008-인천부평-00176호 I 정보보호책임자: 이두영
주소: (21319)인천광역시 부평구 영성중로 50 미래타워 701호 I 전화: 0505-555-0740 I 팩스: 0505-555-0741 I 이메일: earticle@earticle.net

음성지원 및 돋보기 서비스

Earticle

Multimodal Emotion Recognition based on Feature-level fusion of Facial Expression-Audio Modalities

원문정보

초록

목차

키워드

저자

참고문헌

간행물 정보

발행기관

간행물

이 권호 내 다른 논문 / 한국차세대컴퓨팅학회 학술대회 The 10th International Conference on Next Generation Computing 2024

피인용수 : 0건 (자료제공 : 네이버학술정보)

함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.