Improving Accuracy of Machine Learning-Based Prediction Model for Heart Disease Classification Using Information Gain and DBSCAN

Norma Latif Fitriyani; Muhammad Syafrudin; Ganjar Alfian


Publisher: 한국경영정보학회 [The Korea Society of Management Information Systems]
Publication: 한국경영정보학회 정기 학술대회 [KMIS Conference]
Issue: 2022 경영정보관련학회 춘계통합학술대회 [2022 Spring Joint Conference of Management Information Societies] (2022.06)
Pages: pp.506-509
Language: English (ENG)
URL: https://www.earticle.net/Article/A416370


Abstract

Improving the accuracy of classification models has become a main research objective in various fields. Selecting important features and removing outliers from a dataset are two effective ways to improve model accuracy. Information Gain is a feature selection method that can be used to select the important features of a dataset. It selects the variable that maximizes the information gain, which in turn minimizes the entropy and best splits the dataset into groups for effective classification. Besides selecting important features, removing outliers is also necessary for improving the accuracy of a classification model. Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is a powerful outlier removal method that can identify, with significant accuracy, clusters of arbitrary shape and size in large databases corrupted with noise. Therefore, in this study, we propose improving the accuracy of a heart disease classification model by applying Information Gain and DBSCAN to various machine learning algorithms. One publicly available heart disease dataset (Cleveland) is used to build the classification model. The results show that after applying Information Gain, the accuracy of the Gaussian Naïve Bayes, Logistic Regression, Multi-Layer Perceptron, Support Vector Machine, Decision Tree, Random Forest, and Extreme Gradient Boosting models increases by 1.31% on average. The accuracy increases further, by around 0.62%, when DBSCAN is applied after Information Gain.
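The two techniques named in the abstract can be illustrated with a minimal sketch. The code below is not the authors' implementation, which this record does not describe. It assumes categorical feature values for the information gain computation (continuous attributes would need to be discretized first) and Euclidean distance for DBSCAN. The function names, the toy data, and the `eps` and `min_samples` values are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(feature, labels):
    """Label entropy minus the weighted entropy after splitting on a
    categorical feature. A higher value means a more informative feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def dbscan_noise(points, eps, min_samples):
    """Return the indices of points that DBSCAN would label as noise.

    A point is a core point if at least min_samples points (itself
    included) lie within distance eps of it. A noise point is not a core
    point and has no core point within distance eps.
    """
    neighbours = [
        [j for j, q in enumerate(points) if math.dist(p, q) <= eps]
        for p in points
    ]
    core = {i for i, nb in enumerate(neighbours) if len(nb) >= min_samples}
    return [i for i, nb in enumerate(neighbours)
            if i not in core and not core.intersection(nb)]


# Hypothetical toy data: four samples with binary labels and two features.
labels = [1, 1, 0, 0]
feature_a = ["x", "x", "y", "y"]  # separates the classes perfectly
feature_b = ["x", "y", "x", "y"]  # carries no information about the class
print(information_gain(feature_a, labels), information_gain(feature_b, labels))

# Hypothetical 2-D points: a tight group of four and one isolated point.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10)]
print(dbscan_noise(points, eps=1.5, min_samples=3))
```

In a pipeline like the one the abstract outlines, features would be ranked by information gain and the low-scoring ones dropped, then the samples flagged as noise would be removed before training the classifiers.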

Keywords

Accuracy improvement; Information Gain; feature selection; DBSCAN; outlier removal; machine learning algorithms

Authors

Norma Latif Fitriyani [ Department of Data Science, Sejong University ]
Muhammad Syafrudin [ Department of Artificial Intelligence, Sejong University ]
Ganjar Alfian [ Department of Electrical Engineering and Informatics, Vocational College, Universitas Gadjah Mada ]


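Publication information

Publisher

Name: 한국경영정보학회 [The Korea Society of Management Information Systems]
Established: 1989
Field: Social sciences > Business administration
About: The society aims to promote research and exchange in management information systems and to contribute to the development and application of the discipline.

Publication

Title: 한국경영정보학회 정기 학술대회 [KMIS Conference]
Frequency: Semiannual
Coverage: 1990-2025
Classification: KDC 325, DDC 658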
