Automatic classification of prompts in the TOEFL11 corpus

Tae-Jin Yoon (윤태진)


Section: III. Language and Cognition


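Publisher: International Association for Humanistic Studies in Language (국제언어인문학회)
Journal: Lingua Humanitatis (인문언어), KCI-listed
Volume/Issue: Vol. 20, No. 1 (June 2018)
Pages: pp. 157-177
Author: Tae-Jin Yoon (윤태진)
Language: Korean (KOR)
URL: https://www.earticle.net/Article/A334650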


Abstract

The aim of this paper is to develop models that classify the prompts of the TOEFL essays in the TOEFL11 corpus. The corpus is a collection of TOEFL essays, each written in response to one of 8 prompts on various topics, by test-takers at different proficiency levels from 11 different native-language backgrounds. The corpus contains 1,100 essays for each language (that is, 12,100 in total). The prompt classification models are built with a Support Vector Machine (SVM), to which several kinds of features are fed: high-frequency words observed in the raw essay texts, and high-frequency nouns extracted from POS-tagged essay texts. High-frequency nouns from each of three proficiency levels are also used as input features. The results indicate that although high-frequency words taken from the raw texts performed quite well, with an accuracy of 90.4%, words tagged as nouns did even better, with an accuracy of 97.3%. Inspection of the high-frequency nouns revealed that they were distributed almost independently among prompts, with nearly no overlap across prompts. Classification tests on essay samples of different proficiency levels confirmed that the accuracy of automatically classifying prompts from the frequency of nouns in the texts generally increased with the proficiency level of the essay samples. The paper serves as a foundation for more detailed studies of topic modeling in texts written by learners of English.

Keywords

TOEFL11, prompt classification, SVM, learner corpus, writing samples
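
The feature-construction step described in the abstract (picking high-frequency words per prompt, checking how much they overlap across prompts, and turning an essay into a count vector) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the two toy "prompts" and their essays are invented and are not TOEFL11 data, the function names are hypothetical, and whitespace tokenization stands in for the POS tagging the paper uses to isolate nouns. The SVM itself is not implemented here; the resulting vectors are the kind of input one might pass to an off-the-shelf SVM such as scikit-learn's `LinearSVC`.

```python
from collections import Counter
from itertools import combinations

# Toy essays grouped by prompt. Invented for illustration; not TOEFL11 data.
ESSAYS_BY_PROMPT = {
    "P1": [
        "young people enjoy life more than older people",
        "older people enjoy life with family",
    ],
    "P2": [
        "cars will be fewer in twenty years",
        "fewer cars means cleaner cities in twenty years",
    ],
}


def top_words(essays, k):
    """Return the k most frequent lowercase tokens across the given essays."""
    counts = Counter(tok for essay in essays for tok in essay.lower().split())
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:k]]


def prompt_overlap(top_by_prompt):
    """For every pair of prompts, list the high-frequency words they share."""
    return {
        (a, b): sorted(set(top_by_prompt[a]) & set(top_by_prompt[b]))
        for a, b in combinations(sorted(top_by_prompt), 2)
    }


def featurize(essay, vocabulary):
    """Map an essay to a vector of counts, one entry per vocabulary word."""
    counts = Counter(essay.lower().split())
    return [counts[word] for word in vocabulary]


# High-frequency words per prompt, and the shared feature vocabulary.
top_by_prompt = {p: top_words(essays, 2) for p, essays in ESSAYS_BY_PROMPT.items()}
vocabulary = sorted({w for words in top_by_prompt.values() for w in words})

if __name__ == "__main__":
    print(top_by_prompt)
    print(prompt_overlap(top_by_prompt))
    print(vocabulary, featurize("People enjoy cars", vocabulary))
```

An empty overlap list for a pair of prompts corresponds to the situation the abstract reports for nouns: when each prompt's frequent words barely occur under other prompts, the count vectors for different prompts are easy for a linear classifier to separate.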

Author

Tae-Jin Yoon (윤태진), Sungshin Women's University


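Publication information

Publisher

Name: International Association for Humanistic Studies in Language (국제언어인문학회)
Founded: 2000
Field: Humanities > Linguistics
About: The International Association for Humanistic Studies in Language brings together scholars from many disciplines who share a belief in the need for "humanities research through language." The focus on language is a device for overcoming the "heterogeneous assemblage" that can arise when diverse disciplines take part. For now, only a small spark has been kindled. The association is confident, however, that the day will come when this flame, fanned by the winds of a new scholarly tradition, shines as a great light. The association and its journal are open to scholars at home and abroad who recognize the enduring value of the humanities and their mission for the times. They offer an opportunity to discuss the nature of language beyond the bounds of any single discipline, from the perspectives of philosophy, literature, linguistics, religion, history, culture, and the arts.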

Journal

Title: Lingua Humanitatis (인문언어)
Frequency: Semiannual
pISSN: 1598-2130
Coverage: 2000-2025
Indexing: KCI-listed
Decimal classification: KDC 705, DDC 405
