Earticle

현재 위치 Home

IT Marketing and Policy

On the Analysis of Natural Language Processing Morphology for the Specialized Corpus in the Railway Domain

첫 페이지 보기
  • 발행기관
    국제인공지능학회(구 한국인터넷방송통신학회) 바로가기
  • 간행물
    International Journal of Internet, Broadcasting and Communication 바로가기
  • 통권
    Vol.14 No.4 (2022.11)바로가기
  • 페이지
    pp.189-197
  • 저자
    Jong Un Won, Hong Kyu Jeon, Min Joong Kim, Beak Hyun Kim, Young Min Kim
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A421046

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

원문정보

초록

영어
Today, we are exposed to various text-based media such as newspapers, Internet articles, and SNS, and the amount of text data we encounter has increased exponentially due to the recent availability of Internet access using mobile devices such as smartphones. Collecting useful information from a lot of text information is called text analysis, and in order to extract information, it is performed using technologies such as Natural Language Processing (NLP) for processing natural language with the recent development of artificial intelligence. For this purpose, a morpheme analyzer based on everyday language has been disclosed and is being used. Pre-learning language models, which can acquire natural language knowledge through unsupervised learning based on large numbers of corpus, are a very common factor in natural language processing recently, but conventional morpheme analysts are limited in their use in specialized fields. In this paper, as a preliminary work to develop a natural language analysis language model specialized in the railway field, the procedure for construction a corpus specialized in the railway field is presented.

목차

Abstract
1. Introduction
1.1 Background
1.2 Problem Definition
1.3 Composition of the Paper
2. Feasibility Analysis for Construction a Specialized Field Corpus
2.1 Comparison of Types and Characteristics of Korean Morpheme Analyzers
2.2 Limitations of the Existing Morpheme Analyzer and the Need to Build a Specialized Field Corpus
2.3 The Procedure for Construction a Corpus Specialized in the Railway Domain
3. Construction of a Specialized Natural Language Corpus for the Railway Domain
3.1 Selection and Collection of Data to Build a Corpus of Railway Specialized Domain
3.2 Data Cleaning and Preprocessing for Morpheme Analysis
3.3 Procedure for Obtaining Specialty Corpus in Railway Domain
4. Results of Building a Railway Corpus
5. Conclusion
References

키워드

Natural Language Processing (NLP) Morphology Corpus Artificial Intelligence (AI) Intelligent Railway and Transportation Technologies

저자

  • Jong Un Won [ Principal Researcher, Artificial Intelligence Railroad Research Department, Korea Railroad Research Institute, Korea ]
  • Hong Kyu Jeon [ Senior Researcher, Artificial Intelligence Railroad Research Department, Korea Railroad Research Institute, Korea ]
  • Min Joong Kim [ Ph. D. Candidate, Department of Systems Engineering, Ajou University, Korea ]
  • Beak Hyun Kim [ Principal Researcher, Artificial Intelligence Railroad Research Department, Korea Railroad Research Institute, Korea ]
  • Young Min Kim [ Associate professor, Department of Systems Engineering, Ajou University, Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    국제인공지능학회(구 한국인터넷방송통신학회) [The International Association for Artificial Intelligence]
  • 설립연도
    2000
  • 분야
    공학>전자/정보통신공학
  • 소개
    인터넷방송, 인터넷 TV , 방송 통신 네트워크 및 관련 분야에 대한 국내는 물론 국제적인 학술, 기술의 진흥발전에 공헌하고 지식 정보화 사회에 기여하고자 한다.

간행물

  • 간행물명
    International Journal of Internet, Broadcasting and Communication
  • 간기
    계간
  • pISSN
    2288-4920
  • eISSN
    2288-4939
  • 수록기간
    2009~2025
  • 십진분류
    KDC 326 DDC 380

이 권호 내 다른 논문 / International Journal of Internet, Broadcasting and Communication Vol.14 No.4

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장