The 8th International Conference on Next Generation Computing 2022 (2022.10)바로가기
페이지
pp.153-156
저자
Min Je Kim, Tanveer Hussain, Waseem Ullah, Hikmat Yar, Mi Young Lee, Muhammad Sajjad, Sung Wook Baik
언어
영어(ENG)
URL
https://www.earticle.net/Article/A419763
원문정보
초록
영어
The analysis, recognition and perception of behavior has usually been a crucial task for researchers. The goal of this paper is to address the problem to recognize animal species, which has numerous applications in zoology, ecology, biology, and entertainment. Researchers used different machine learning approach for animal species recognition, however the researchers mostly used image data for this purpose and ignore the importance of audio data. In this work, our focus is to process multi modality (image and voice) dataset for animal species recognition. We proposed two different networks for animals’ audio and visual representation to recongize animals’ species. First network for animals’ audios classification that extract MFCC features, and these features is passed from four VGG style blocks while the second network extract visual features from images to classify according to their species. The experimental results demonstrated the effectiveness of the proposed model of achieved better performance in terms of classification accuracies.
목차
Abstract I. INTRODUCTION II. The proposed model A. Visual Classification of Animal Species. III. RESULTS AND EXPERIMENTS A. Training Detail B. Implementation Detail C. Dataset Explanation D. Experimental Results of the Proposed Model IV. CONCLUSION REFERENCES
키워드
Deep learninganimal classificationMFCCWildlife
저자
Min Je Kim [ Sejong University ]
Tanveer Hussain [ Sejong University ]
Waseem Ullah [ Sejong University ]
Hikmat Yar [ Sejong University ]
Mi Young Lee [ Sejong University ]
Muhammad Sajjad [ Islamia College Peshawar Peshawar, Pakistan ]
Sung Wook Baik [ Sejong University ]
Corresponding Author