Earticle

다운로드

버트(BERT)를 활용한 인간번역의 자동평가 : 여러 모델의 성능 비교 및 활용 가능성
Automatic Evaluation of Human Translation using BERT.

  • 간행물
    통번역학연구 KCI 등재 바로가기
  • 권호(발행년)
    제26권 4호 (2022.10) 바로가기
  • 페이지
    pp.117-137
  • 저자
    정혜연, 서수영
  • 언어
    한국어(KOR)
  • URL
    https://www.earticle.net/Article/A420664

원문정보

초록

영어
The most recent language model in NLP, known as BERT, has numerous advantages over other language models. It ‘understands’ human language in a subword unit and can recognize rare words and out-of-vocabulary words. Furthermore, it considers syntax and context during the text-understanding process. We applied the evaluation metric “BERTscore” using this BERT model to evaluate human translation (HT). 120 translated texts were evaluated with five evaluation metrics: BLEU, METEOR, emBLEU, emMETEOR, as well as BERTscore and the result was compared with professional translators’ evaluation. The comparison examines the validity and reliability of these metrics, particularly the BERTscore, for future application for HT evaluation. BERTscore demonstrated a stable performance, taking first place in scores, and third in ranks. The validity of metrics of word2vec models, especially that of emBLEU, was somewhat disappointing, probably owing to the domain difference between the training corpus and test corpus.

목차


1. 서론
2. BERTscore와 다른 자동평가 모델
2.1. 기본 모델 – BLEU, METEOR
2.2. 워드투벡터 활용 모델 – emBLEU, emMETEOR
2.3. 버트 활용 모델 - BERTscore
3. 인간평가와의 비교
4. 실험
4.1. 실험 코퍼스 구축
4.2. 평가와 분석
5. 결과 및 토론
5.1. 인간평가
5.2. 자동평가
5.3. 인간평가와 자동평가 비교
6. 요약 및 결론
참고문헌

저자

  • 정혜연 [ Chung, Hye-yeon | 한국외국어대학교 ] 주저자 및 교신저자
  • 서수영 [ Seo, Soo-young | 한림대학교 ] 공동저자

참고문헌

자료제공 : 네이버학술정보

    간행물 정보

    • 간행물
      통번역학연구 [Interpreting and Translation Studies]
    • 간기
      계간
    • pISSN
      1975-6321
    • eISSN
      2713-8372
    • 수록기간
      1997~2026
    • 등재여부
      KCI 등재
    • 십진분류
      KDC 717 DDC 400