Earticle

현재 위치 Home

Human-Machine Interaction Technology (HIT)

Command Control System Based on User Authentication through Visual and Voice Fusion

첫 페이지 보기
  • 발행기관
    국제인공지능학회(구 한국인터넷방송통신학회) 바로가기
  • 간행물
    The International Journal of Advanced Smart Convergence 바로가기
  • 통권
    Volume 14 Number 3 (2025.09)바로가기
  • 페이지
    pp.43-54
  • 저자
    Sanghoon Lee, Dongjin Kwon
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A474313

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

원문정보

초록

영어
A Recently AI systems have increasingly focused on integration with various systems for classification and recognition, including IoT applications. This paper introduced to integrate to speech recognition and object detection for user recognition system. The speech recognition model incorporates preprocessing techniques based on voice signal processing, utilizing features such as Mel spectrogram, Mel-frequency Cepstral Coefficients (MFCC), and chroma. These signal processing was important in recently speech recognition research field. also, it can be able to makes elaborate to word classification. so ours model was consist of Convolutional Neural Network(CNN) based model. according to CNN model was simple architecture, it was used to low memory and high inference time. The chroma analysis was consist of voice Pitch data. So, we can classifier to user gender using this analysis. The Your Only Look Once(YOLO) object-based detection research has been actively conducted recently. this model has low memory, high inference speed and great performance accuracy. ours system has integrate to word classification, gender classification and YOLO object detection system. this system worked in user authentication in the administrator system. the user vocalize a word to issue a simple command, and the user’s voice pattern and characteristics are classified, and the gender classification system classifies the gender after determining the voice pitch for further user recognition. Finally we used the QT framework to construct applications and fuse systems to make them easily accessible to users.

목차

Abstract
1. Introduction
2. Background knowledge
2-1. AI System
2-2 Speech recognition.
2-3 Object detection.
3. Suggestion
3-1. Setup environment
3-2. System Ui
4. Real-time object detection result
5. Speech recognition result
6. Conclusion
References

키워드

Instruction classification Object detection Gender classification AI system

저자

  • Sanghoon Lee [ Master, Independent Researcher ]
  • Dongjin Kwon [ Associate Professor, Department of Computer Electronics Engineering, Seoil University, Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    국제인공지능학회(구 한국인터넷방송통신학회) [The International Association for Artificial Intelligence]
  • 설립연도
    2000
  • 분야
    공학>전자/정보통신공학
  • 소개
    인터넷방송, 인터넷 TV , 방송 통신 네트워크 및 관련 분야에 대한 국내는 물론 국제적인 학술, 기술의 진흥발전에 공헌하고 지식 정보화 사회에 기여하고자 한다.

간행물

  • 간행물명
    The International Journal of Advanced Smart Convergence
  • 간기
    계간
  • pISSN
    2288-2847
  • eISSN
    2288-2855
  • 수록기간
    2012~2025
  • 십진분류
    KDC 326 DDC 380

이 권호 내 다른 논문 / The International Journal of Advanced Smart Convergence Volume 14 Number 3

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장