Accurate audio segmentation has recently received increasing attention for its applications in automatic indexing, content analysis and information retrieval. Hence, this paper proposes a highly accurate audio segmentation methodology using a genetic algorithm-based approach to adapting and optimizing segmentation window lengths. Specifically, this paper analyzes the parameter sequence of the root-mean-square values of an input audio stream with optimal sliding window (or segmentation window) lengths found and adapted by a genetic algorithm. In addition, this paper determines whether an audio-cut occurs or not by utilizing the parameter sequences as inputs of a support vector machine. Experimental results indicate that the proposed approach achieves 100.00% and 98.69% in the average precision and recall rates of segmentation performance, respectively.
Myeongsu Kang [ Department of Electrical, Electronic and Computer Engineering, University of Ulsan, De 93 Daehak –ro, Nam-gu, Ulsan 680749, Korea ]
Jong-Myon Kim [ Department of Electrical, Electronic and Computer Engineering, University of Ulsan, De 93 Daehak –ro, Nam-gu, Ulsan 680749, Korea ]
Corresponding author.
보안공학연구지원센터(IJMUE) [Science & Engineering Research Support Center, Republic of Korea(IJMUE)]
설립연도
2006
분야
공학>컴퓨터학
소개
1. 보안공학에 대한 각종 조사 및 연구
2. 보안공학에 대한 응용기술 연구 및 발표
3. 보안공학에 관한 각종 학술 발표회 및 전시회 개최
4. 보안공학 기술의 상호 협조 및 정보교환
5. 보안공학에 관한 표준화 사업 및 규격의 제정
6. 보안공학에 관한 산학연 협동의 증진
7. 국제적 학술 교류 및 기술 협력
8. 보안공학에 관한 논문지 발간
9. 기타 본 회 목적 달성에 필요한 사업
간행물
간행물명
International Journal of Multimedia and Ubiquitous Engineering
간기
월간
pISSN
1975-0080
수록기간
2008~2016
등재여부
SCOPUS
십진분류
KDC 505DDC 605
이 권호 내 다른 논문 / International Journal of Multimedia and Ubiquitous Engineering Vol.10 No.1