An Analysis of the Errors in the Auto-Generated Captions of University  Commencement Speeches on YouTube

Jeong-Hwa Lee; Kyung-Whan Cha

216.73.216.182

개인회원 가입

개인회원
기관회원

개인회원 로그인

개인회원 가입으로 더욱 편리하게 이용하세요. 개인회원 가입

아이디/비밀번호를 잊으셨나요? 아이디/비밀번호 찾기

기관회원 로그인

소속기관에서 검색되지 않는 기관은 무료원문다운이 불가능합니다. 개인회원 가입 후 유료구매를 하시거나 소속기관 도서관에 이용문의해 주세요.

Home

An Analysis of the Errors in the Auto-Generated Captions of University Commencement Speeches on YouTube

발행기관

아시아영어교육학회 바로가기
간행물

The Journal of AsiaTEFL SCOPUS KCI 등재 바로가기
통권

Vol.17 No.1 (2020.03)바로가기
페이지

pp.143-159
저자

Jeong-Hwa Lee, Kyung-Whan Cha
언어

영어(ENG)
URL

https://www.earticle.net/Article/A372123

※ 기관로그인 시 무료 이용이 가능합니다.

5,100원

원문정보

초록

영어: Auto-generated captions on YouTube have proven useful in helping viewers better understand the words being spoken. However, at times they fail to contain accurate captions. In these cases, they lead to confusion. The aim of this paper is to identify and analyze errors in the auto-generated captions of 20 commencement speeches on YouTube. These speeches were presented over a period of 12 years by speakers from different walks of life. The researchers selected ten male and ten female icons. Only the first 10 minutes of the speeches were utilized for this investigation. All the captioned errors were collected and analyzed. Upon completion of the analysis, it was discovered that the frequency of errors in each speech ranged between 10 and 46 cases, with an average of one error occurring about every 26 seconds. Among the different error categories, nouns record the highest number with 144 cases (31.3%). The second is verbs with 93 cases (20.2%), then prepositions with 37 cases (8.1%). Among the four subcategories, namely omission, addition, substitution, and word order, substitution recorded the highest amount of errors with 357 cases (77.6%). Furthermore, the errors were classified into two major groups. The first, involving function words, appeared in 169 cases (36.7%). The second, involving content words, appeared in 291 cases (63.3%). The results of this research suggest that a continuous development of the voice recognition software that automatically generates captions is necessary for more efficient and accurate data that will help viewers and listeners better comprehend the video contents.

Abstract
Introduction
Literature Review
Auto-generated Caption Errors
Machine Translation Errors
Method
Data Collection
Data Analysis
Results
Auto–Generated Caption Errors Based on 10 Categories and Four Sub-Categories
Function Word and Content Word Errors
Frequency Rates of Auto-generated Caption Errors as Recorded from the 20 Commencement Speeches
Discussion and Implication
Relating to the 10 Categories and Four Sub-Categories
Relating to Function Words and Content Words
Relating to the Frequency Rates of Each of the 20 Commencement Speeches
Summary and Limitations
Acknowledgments
The Authors
References
Appendix A
Appendix B

키워드

auto-generated caption errors YouTube university commencement speeches function words content words omission addition substitution word order

저자

Jeong-Hwa Lee [ Hansung University, Korea ]
Kyung-Whan Cha [ Chung-Ang University, Korea ]

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

발행기관명

아시아영어교육학회 [Asia TEFL]
설립연도
2004
분야
사회과학>교육학
소개
The goals of Asia TEFL are to promote scholarship, disseminate information, and facilitate cross-cultural understanding among persons concerned with the teaching and learning of English in Asia. In order to accomplish this, Asia TEFL will pursue the following goals: 1. To link ELT professionals in joint research on issues and concerns regarding English teaching and learning in the Asian context. 2. To publish an academic journal, The Asia TEFL Journal, as an internationally recognized journal in the field of English language teaching. 3. To host conferences and seminars addressing important issues concerning ELT in Asia. 4. To develop proficiency guidelines and assessment methods designed for the needs of the Asian context. 5. To develop programs for Asian learners and teachers of English to build their English language proficiency and cultural understanding and provide them with the skills required to be efficient English teaching professionals.

간행물

간행물명

The Journal of AsiaTEFL
간기
계간
pISSN
1738-3102
eISSN
2466-1511
수록기간
2004~2026
등재여부
SCOPUS,KCI 등재
십진분류
KDC 740 DDC 420

이 권호 내 다른 논문 / The Journal of AsiaTEFL Vol.17 No.1

피인용수 : 0건 (자료제공 : 네이버학술정보)

함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

출처 : 네이버학술정보

0개의 논문이 장바구니에 담겼습니다.

페이지 저장

소속기관 조회

이용자님의 소속기관(단체)이 서비스에 가입되어 있는지 확인해 보십시오.
기관회원에 소속되어 있는 이용자는 원문을 무료로 이용할 수 있습니다.

상호: 주식회사 학술교육원 I 대표: 노방용 I 사업자등록번호: 122-81-88227 I 통신판매업신고번호: 제2008-인천부평-00176호 I 정보보호책임자: 이두영
주소: (21319)인천광역시 부평구 영성중로 50 미래타워 701호 I 전화: 0505-555-0740 I 팩스: 0505-555-0741 I 이메일: earticle@earticle.net

음성지원 및 돋보기 서비스

Earticle