Oral Session II - III : Emerging Topics in AI

Classification and Comparative Analysis of LIME-based Machine Learning Models

  • Publisher
    Korean Institute of Next Generation Computing (한국차세대컴퓨팅학회)
  • Publication
    Korean Institute of Next Generation Computing Conference (한국차세대컴퓨팅학회 학술대회)
  • Issue
    The 10th International Conference on Next Generation Computing 2024 (2024.11)
  • Pages
    pp. 378-381
  • Authors
    Won-Young Jo, Chan-Uk Yeom, Keun-Chang Kwak
  • Language
    English (ENG)
  • URL
    https://www.earticle.net/Article/A468890

Full-Text Information

Abstract

English
This study compares the performance of four LIME-based machine learning methods, Gaussian Naive Bayes (GNB), Highly-Efficient Logistic Regression (LR), Linear Support Vector Machine (SVM), and Triple-layer Neural Network (TNN), on three medical datasets. High-dimensional data increases the likelihood that a learning algorithm overfits because of the curse of dimensionality. To address this, LIME is used to compute the importance of the features that contribute most to a model's predictions, and features are selected on that basis. LIME generates multiple samples by perturbing the data in a local region around an instance, then fits a simple linear model to those samples to evaluate the impact of each feature on the prediction. The features with high importance are selected, and the model is retrained on them. The experiments confirmed that training time could be reduced while performance was maintained or even improved with fewer features. Selecting only the necessary features thus alleviates the curse of dimensionality, and accuracy is maintained or improved with fewer features on the Hepatitis C Prediction Dataset, the Breast Cancer Wisconsin (Prognostic) Dataset, and the Glioma Grading Clinical and Mutation Features Dataset.
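The perturb, weight, and fit-a-linear-surrogate procedure described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation, which this page does not show. The function names `lime_importance` and `select_top_k`, the Gaussian perturbation, the exponential proximity kernel and its width, the averaging of importances over several instances, and the synthetic data are all hypothetical choices made for the example.

```python
import numpy as np

def lime_importance(predict_fn, x, scale, n_samples=500, rng=None):
    """LIME-style local importance of each feature around one instance x."""
    rng = np.random.default_rng(rng)
    d = x.shape[0]
    kernel_width = 0.75 * np.sqrt(d)  # assumed heuristic, not from the paper

    # 1. Perturb the instance in its local region (noise in standardized units).
    noise = rng.normal(size=(n_samples, d))
    Z = x + noise * scale

    # 2. Query the black-box model on the perturbed samples.
    y = predict_fn(Z)

    # 3. Weight each sample by its proximity to x.
    dist = np.linalg.norm(noise, axis=1)
    w = np.exp(-(dist ** 2) / kernel_width ** 2)

    # 4. Fit a weighted linear surrogate (intercept + one slope per feature).
    A = np.hstack([np.ones((n_samples, 1)), noise])
    sw = np.sqrt(w)
    coef, *_ = np.linalg.lstsq(A * sw[:, None], y * sw, rcond=None)

    # The absolute slope is taken as the feature's local importance.
    return np.abs(coef[1:])

def select_top_k(predict_fn, X, k, n_instances=20, seed=0):
    """Average local importances over sampled instances and keep the top k."""
    rng = np.random.default_rng(seed)
    scale = X.std(axis=0) + 1e-12
    idx = rng.choice(len(X), size=min(n_instances, len(X)), replace=False)
    scores = np.mean(
        [lime_importance(predict_fn, X[i], scale, rng=rng) for i in idx], axis=0
    )
    return np.argsort(scores)[::-1][:k]

# Synthetic demonstration: only features 0 and 3 influence the black box.
data_rng = np.random.default_rng(0)
X = data_rng.normal(size=(200, 10))

def black_box(Z):
    # Hypothetical stand-in for a trained classifier's probability output.
    return 1.0 / (1.0 + np.exp(-(3.0 * Z[:, 0] - 2.0 * Z[:, 3])))

selected = select_top_k(black_box, X, k=2)
X_reduced = X[:, selected]  # reduced feature matrix to retrain a model on
print(sorted(selected.tolist()))
```

In a workflow like the one the abstract outlines, `X_reduced` would then be used to retrain each classifier, and accuracy and training time would be compared against the full-feature model.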

Table of Contents

Abstract
I. INTRODUCTION
II. LIME
III. MACHINE LEARNING MODELS AND LIME-BASED FEATURE SELECTION METHODS
A. Gaussian Naive Bayes (GNB)
B. Highly-Efficient Logistic Regression (LR)
C. Linear Support Vector Machine (SVM)
D. Triple-layer Neural Network (TNN)
E. LIME-based Machine Learning Method
IV. EXPERIMENTS AND RESULTS ANALYSIS
V. CONCLUSION
ACKNOWLEDGMENT
REFERENCE

Keywords

LIME, feature selection, validation accuracy, efficiency

Authors

  • Won-Young Jo [ Department of Electronics Engineering, Chosun University, Gwangju, South Korea ]
  • Chan-Uk Yeom [ Division of AI Convergence College, Chosun University, Gwangju, South Korea ]
  • Keun-Chang Kwak [ Department of Electronics Engineering, Chosun University, Gwangju, South Korea ]

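Publication Information

Publisher

  • Publisher name
    Korean Institute of Next Generation Computing (한국차세대컴퓨팅학회)
  • Year founded
    2005
  • Field
    Engineering > Computer Science
  • About
    The society aims to advance the scholarship and technology of next-generation PCs through academic activities in next-generation PCs and related fields, and to promote industrial development and international cooperation.

Publication

  • Publication name
    Korean Institute of Next Generation Computing Conference (한국차세대컴퓨팅학회 학술대회)
  • Frequency
    Semiannual
  • Coverage period
    2021-2025
  • Decimal classification
    KDC 566, DDC 004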
