Earticle

다운로드

Explaining the translation error factors of Korean-Chinese machine translation services using self-attention visualization

  • 간행물
    한국경영정보학회 정기 학술대회 바로가기
  • 권호(발행년)
    2021 한국경영정보학회 추계통합학술대회 (2021.11) 바로가기
  • 페이지
    pp.214-215
  • 저자
    Chenglong Zhang, Hyunchul Ahn
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A402821

원문정보

초록

영어
This study analyzed the Korean-Chinese translation error factors of machine translation services such as Papago and Google translate through self-attention path visualization. Self-Attention is a key method of the Transformer and Bert NLP model and recently widely used in machine translation. The study analyzes the difference in the Attention path between ST (source text) and ST' which meaning does not change and the translation output appears more accurately. Understanding these attention path pattern differences and each translation result TT (target text), and TT', the study summarized three types of errors: symbols missing error, grammar error, and multiple meaning words error. The study analyzed using the xlm-ReBerta multilingual NLP model provided by exBERT, for self-attention visualization, and suggesting some suggestions for error resolution. Through the study, it was found that the cause of translation error can be well identified using the Self-Attention visualization method, and through the explanation, machine translation algorithm developers can use it to improve service quality, and researchers can more understand machine translation algorithms.

목차

Abstract
Introduction
exBERT
Methods
Conclusion
Implications
References

저자

  • Chenglong Zhang [ Graduate School of Business IT, Kookmin University ]
  • Hyunchul Ahn [ Graduate School of Business IT, Kookmin University ]

참고문헌

자료제공 : 네이버학술정보

    간행물 정보

    • 간행물
      한국경영정보학회 정기 학술대회 [KMIS Conference]
    • 간기
      반년간
    • 수록기간
      1990~2025
    • 십진분류
      KDC 325 DDC 658