Earticle

현재 위치 Home

Oral Session B-3 : Biomedical Applications

Comparing Embedding-Based Approaches for Complex Emotion Detection in Online Comments:

첫 페이지 보기
  • 발행기관
    한국차세대컴퓨팅학회 바로가기
  • 간행물
    한국차세대컴퓨팅학회 학술대회 바로가기
  • 통권
    ICNGC 2025 The 11th International Conference on Next Generation Computing 2025 (2025.12)바로가기
  • 페이지
    pp.284-286
  • 저자
    Jiwon Kim, Eungkyo Suh
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A478514

원문정보

초록

영어
This study compares two embedding-based natural language processing techniques—Sentence-BERT (SBERT) combined with HDBSCAN clustering and BERTopic modeling—for detecting complex emotions in short Korean online comments. Using 33,531 comments collected from a YouTube relationship counseling channel, we examined how each method captures nuanced and overlapping sentiments such as affection, avoidance, and conflict. Both models used identical SBERT embeddings and UMAP-based dimensionality reduction, and their clustering performance was quantitatively evaluated using Silhouette Score, Davies–Bouldin Index (DBI), and Calinski–Harabasz Index (CHI). The results show that BERTopic achieved higher coherence and clearer topic boundaries (Silhouette = 0.40, DBI = 0.85, CHI = 15,157) compared to SBERT–HDBSCAN (Silhouette = –0.23, DBI = 1.49, CHI = 1,230). Although both methods yielded high noise ratios due to the leaf-based density clustering, BERTopic effectively reclassified semantically relevant comments through its ClassTF-IDF weighting, improving topic stability and interpretability. These findings suggest that BERTopic provides superior performance for analyzing short, emotion-rich Korean text and offers methodological insight for future sentiment analysis research. This electronic document is a “live” template and already defines the components of your paper [title, text, heads, etc.] in its style sheet.

목차

Abstract
I. INTRODUCTION
II. METHODS
A. Data Composition and Preprocessing
B. Analytical Procedure
C. Evaluation
III. RESULTS
A. HDBSCAN Clustering Based on Sentence-BERT
B. BERTopic Modeling
IV. CONCLUSION
REFERENCES

키워드

Sentence-BERT (SBERT) BERTopic HDBSCAN UMAP Embedding-based analysis Sentiment analysis Online comments analysis.

저자

  • Jiwon Kim [ Data and Knowledge Service Engineering Dankook University Gyeonggi-do, South Korea ]
  • Eungkyo Suh [ Data and Knowledge Service Engineering Dankook University Gyeonggi-do, South Korea ]

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    한국차세대컴퓨팅학회 [Korean Institute of Next Generation Computing]
  • 설립연도
    2005
  • 분야
    공학>컴퓨터학
  • 소개
    본 학회는 차세대 PC 및 그 관련분야의 학술활동을 통하여 차세대 PC의 학문 및 기술발전을 도모하고 산업발전 및 국제협력 증진을 목적으로 한다.

간행물

  • 간행물명
    한국차세대컴퓨팅학회 학술대회
  • 간기
    반년간
  • 수록기간
    2021~2025
  • 십진분류
    KDC 566 DDC 004

이 권호 내 다른 논문 / 한국차세대컴퓨팅학회 학술대회 ICNGC 2025 The 11th International Conference on Next Generation Computing 2025

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장