Sentiment dictionaries, or lexicons, are core elements of "bag-of-words" approaches to opinion mining and sentiment analysis. Compared with general-purpose sentiment dictionaries, domain-specific sentiment lexicons can improve performance because they reflect domain-specific terms and meanings. This paper presents four methods for constructing domain-specific sentiment dictionaries for opinion mining and reports performance evaluation results on a practical data set. The methods compared are SO-PMI (Semantic Orientation from Pointwise Mutual Information) and three term frequency-based methods with different term polarity measures. To evaluate the four methods, a movie review data set was collected from a representative Internet movie community site, IMDb (Internet Movie Database), using a web crawling program and analyzed using R programs. Domain-specific sentiment dictionaries were constructed from the training data set using each of the four methods, and their sentiment analysis performance was compared. The experimental results show that domain-specific sentiment dictionaries perform better than general-purpose dictionaries in every genre except one, "animation". In addition, the term frequency-based approaches perform better than SO-PMI.
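As a rough illustration of the SO-PMI measure named in the abstract, the sketch below scores a term by how much more strongly it co-occurs with positive seed words than with negative ones. It follows the commonly cited formulation of SO-PMI and is not necessarily the exact variant used in the paper. The function name `so_pmi`, the seed words, the toy reviews, the document-level co-occurrence window, and the `smoothing` constant are all assumptions made for this example.

```python
import math


def so_pmi(word, docs, pos_seeds, neg_seeds, smoothing=0.01):
    """Hypothetical SO-PMI sketch using document-level co-occurrence.

    SO-PMI(word) = PMI(word, positive seeds) - PMI(word, negative seeds),
    which simplifies to the log ratio computed below.
    """
    token_sets = [set(d.lower().split()) for d in docs]
    pos_seeds, neg_seeds = set(pos_seeds), set(neg_seeds)

    # Number of documents containing at least one seed of each polarity.
    n_pos = sum(1 for t in token_sets if t & pos_seeds)
    n_neg = sum(1 for t in token_sets if t & neg_seeds)

    # Number of documents where `word` co-occurs with a seed of each polarity.
    co_pos = sum(1 for t in token_sets if word in t and t & pos_seeds)
    co_neg = sum(1 for t in token_sets if word in t and t & neg_seeds)

    # The smoothing term (an assumption) avoids division by zero and log(0).
    return math.log2(
        ((co_pos + smoothing) * (n_neg + smoothing))
        / ((co_neg + smoothing) * (n_pos + smoothing))
    )


# Toy reviews, invented purely for illustration.
reviews = [
    "a gripping and excellent film",
    "excellent cast gripping plot",
    "dull and poor script",
    "poor pacing dull ending",
]

score_gripping = so_pmi("gripping", reviews, {"excellent"}, {"poor"})
score_dull = so_pmi("dull", reviews, {"excellent"}, {"poor"})
```

A positive score suggests the term leans positive in this corpus and a negative score suggests the opposite. A domain-specific dictionary could then keep terms whose absolute score exceeds a chosen threshold.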
Table of Contents
Abstract
1. Introduction
2. Related Works
2.1. Sentiment Analysis
2.2. Sentiment Lexicon Construction
2.3. PMI and SO-PMI
3. Domain Specific Lexicon Building Methods
4. Experimental Design
4.1. Experiment Data Set
4.2. Four Different Term Polarity Determination Measures
5. Experiment Results
6. Conclusion Remarks
Acknowledgments
References
Science & Engineering Research Support Center, Republic of Korea (IJDTA)
Founded
2006
Field
Engineering > Computer Science
About
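1. Surveys and research on security engineering
2. Research on and presentation of applied security engineering technologies
3. Hosting academic conferences and exhibitions on security engineering
4. Mutual cooperation and information exchange on security engineering technology
5. Standardization projects and establishment of specifications for security engineering
6. Promotion of industry-academia-research cooperation in security engineering
7. International academic exchange and technical cooperation
8. Publication of journals on security engineering
9. Other activities necessary to achieve the society's objectives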
Publication
Publication title
International Journal of Database Theory and Application
Frequency
Bimonthly
pISSN
2005-4270
Coverage period
2008~2016
Decimal classification
KDC 505, DDC 605
Other papers in this issue / International Journal of Database Theory and Application Vol.9 No.8