Earticle

현재 위치 Home

Convergence of Internet, Broadcasting and Communication

Optimizing Information Retrieval in Dark Web Academic Literature : A Study Using KeyBERT for Keyword Extraction and Clustering

첫 페이지 보기
  • 발행기관
    국제인공지능학회(구 한국인터넷방송통신학회) 바로가기
  • 간행물
    International Journal of Internet, Broadcasting and Communication 바로가기
  • 통권
    Vol.16 No.4 (2024.12)바로가기
  • 페이지
    pp.203-208
  • 저자
    Yosua Setyawan Soekamto, Leonard Christopher Limanjaya, Yoshua Kaleb Purwanto, Bongjun Choi, Seung-Keun Song, Dae-Ki Kang
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A459073

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

원문정보

초록

영어
The exponential increase in publications and the interconnected nature of sub-domains make traditional methods of information extraction and organization inadequate. This inefficiency can impede scientific progress and innovation. To address these challenges, this research leverages the ability of Bidirectional Encoder Representations from Transformers for keyword extraction (KeyBERT) and integrates with K-Means clustering to organize topics from large datasets effectively. Analyzing a dataset of 47,627 articles from SCOPUS in the domains of Reinforcement Learning and Computer Vision. An ablation study demonstrates the generalizability of the approach across these fields, with the optimal number of clusters determined to be three using the Elbow Method. The results demonstrate that KeyBERT is effective in extracting and organizing topics within these domains, with a particular focus on applications such as medical imaging, autonomous driving, and real-time detection systems. This methodology offers a scalable solution for organizing vast academic datasets, enabling researchers to extract meaningful insights efficiently and apply this approach to other domains.

목차

Abstract
1. Introduction
2. Literature Review
2.1 Keyword Extraction with BERT
2.2 Topic Clustering
3. Methodology
3.1 Data Collection and Preprocessing
3.2 Keyword Extraction and Topic Clustering
4. Result and Discussion
5. Conclusion
Acknowledgement
References

키워드

K-Means KeyBERT Keyword Extraction Text Mining Topic Clustering.

저자

  • Yosua Setyawan Soekamto [ PhD Student at Department of Computer Engineering, Dongseo University, Busan, South Korea/Lecturer at Department of Information Systems, Universitas Ciputra Surabaya ]
  • Leonard Christopher Limanjaya [ Master Student at Department of Computer Engineering, Dongseo University, Busan, South Korea ]
  • Yoshua Kaleb Purwanto [ Master Student at Department of Computer Engineering, Dongseo University, Busan, South Korea ]
  • Bongjun Choi [ Professor at Department of Software, Dongseo University, Busan, South Korea ]
  • Seung-Keun Song [ Professor at Department of Visual Contents, Graduate School, Dongseo University, Busan, South Korea ]
  • Dae-Ki Kang [ Professor at Department of Computer Engineering, Dongseo University, Busan, South Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    국제인공지능학회(구 한국인터넷방송통신학회) [The International Association for Artificial Intelligence]
  • 설립연도
    2000
  • 분야
    공학>전자/정보통신공학
  • 소개
    인터넷방송, 인터넷 TV , 방송 통신 네트워크 및 관련 분야에 대한 국내는 물론 국제적인 학술, 기술의 진흥발전에 공헌하고 지식 정보화 사회에 기여하고자 한다.

간행물

  • 간행물명
    International Journal of Internet, Broadcasting and Communication
  • 간기
    계간
  • pISSN
    2288-4920
  • eISSN
    2288-4939
  • 수록기간
    2009~2025
  • 십진분류
    KDC 326 DDC 380

이 권호 내 다른 논문 / International Journal of Internet, Broadcasting and Communication Vol.16 No.4

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장