Earticle

현재 위치 Home

Applications of Machine Learning Models on Yelp Data

첫 페이지 보기
  • 발행기관
    한국경영정보학회 바로가기
  • 간행물
    Asia Pacific Journal of Information Systems KCI 등재 SCOPUS 바로가기
  • 통권
    제29권 제1호 (2019.03)바로가기
  • 페이지
    pp.35-49
  • 저자
    Ruchi Singh, Jongwook Woo
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A350487

※ 기관로그인 시 무료 이용이 가능합니다.

4,800원

원문정보

초록

영어
The paper attempts to document the application of relevant Machine Learning (ML) models on Yelp (a crowd-sourced local business review and social networking site) dataset to analyze, predict and recommend business. Strategically using two cloud platforms to minimize the effort and time required for this project. Seven machine learning algorithms in Azure ML of which four algorithms are implemented in Databricks Spark ML. The analyzed Yelp business dataset contained 70 business attributes for more than 350,000 registered business. Additionally, review tips and likes from 500,000 users have been processed for the project. A Recommendation Model is built to provide Yelp users with recommendations for business categories based on their previous business ratings, as well as the business ratings of other users. Classification Model is implemented to predict the popularity of the business as defining the popular business to have stars greater than 3 and unpopular business to have stars less than 3. Text Analysis model is developed by comparing two algorithms, uni-gram feature extraction and n-feature extraction in Azure ML studio and logistic regression model in Spark. Comparative conclusions have been made related to efficiency of Spark ML and Azure ML for these models.

목차

ABSTRACT
Ⅰ. Introduction
1.1. Related Works
1.2. Motivation and Goals
Ⅱ. Background
2.1. Microsoft Azure Machine Learning Studio
2.2. Databricks
2.3. Preliminary Work on the Machine Learning Models
Ⅲ. Methodology
3.1. Data Description
3.2. Hardware Specification
3.3. Machine Learning Workflow
3.4. Process in Azure
3.5. Process in Databricks
Ⅳ. Results and Discussions
4.1. Matchbox Recommender
4.2. Collaborative Filtering Recommender
4.3. Classification Models
4.4. K-Means Clustering
4.5. Text Analysis
Ⅴ. Conclusions and Future Work

키워드

Machine learning Yelp Recommender Predictive analytics Text analysis

저자

  • Ruchi Singh [ Data Analyst, CISCO, USA ]
  • Jongwook Woo [ Professor, IS, College of Business and Economics, California State University Los Angeles, USA ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    한국경영정보학회 [The Korea Society of Management information Systems]
  • 설립연도
    1989
  • 분야
    사회과학>경영학
  • 소개
    이 학회는 경영정보학의 연구 및 교류를 촉진하고 학문의 발전과 응용에 공헌함을 목적으로 합니다.

간행물

  • 간행물명
    Asia Pacific Journal of Information Systems
  • 간기
    계간
  • pISSN
    2288-5404
  • eISSN
    2288-6818
  • 수록기간
    1990~2026
  • 등재여부
    KCI 등재,SCOPUS
  • 십진분류
    KDC 325 DDC 658

이 권호 내 다른 논문 / Asia Pacific Journal of Information Systems 제29권 제1호

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장