Earticle

다운로드

Learning Disentangled Webpage Representation with Triplet-style Transformer for Phishing Attack Detection

  • 간행물
    한국차세대컴퓨팅학회 학술대회 바로가기
  • 권호(발행년)
    The 10th International Conference on Next Generation Computing 2024 (2024.11) 바로가기
  • 페이지
    pp.299-302
  • 저자
    Jun-Ho Yoon, Seok-Jun Buu
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A468868

원문정보

초록

영어
Phishing attacks have surged with the expansion of the internet, becoming a major cause of personal information leaks. A significant challenge in phishing detection lies in the shared characteristics between phishing and benign webpages, as attackers intentionally design phishing pages to closely resemble benign ones. These shared features often result in ambiguous representations in the embedding space, which complicates accurate detection. To address this issue, we propose a method that introduces feature disentanglement into deep learning models for phishing webpage detection. By leveraging both URL and HTML data, our method employs a triplet loss function to better separate phishing and benign classes in the embedding space, thus reducing the overlap of shared features. This disentanglement effectively decreases the rate of false positives and false negatives. Experimental results show that our approach improves phishing detection accuracy by up to 9.85 percentage points and increases the F1 score by up to 11.92 percentage points compared to existing methods.

목차

Abstract
I. INTRODUCTION
II. RELATED WORKS
III. PROPOSED METHOD
A. Character-based URL Analysis using CNN
B. Word-based URL Analysis using Transformer
C. HTML DOM Structure Analysis using GCN
D. Triplet Network for Webpage Disentanglement
IV. EXPERIMENTAL RESULTS
A. Dataset and Preprocessing
B. Confusion Matrix Analysis
C. Performance Comparison
D. t-SNE Visualization of Embeddings
V. CONCLUSION
ACKNOWLEDGMENT
REFERENCES

저자

  • Jun-Ho Yoon [ Dept. of Computer Engineering Gyeongsang National University Republic of Korea ]
  • Seok-Jun Buu [ Dept. of Computer Engineering Gyeongsang National University Republic of Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

    간행물 정보

    • 간행물
      한국차세대컴퓨팅학회 학술대회
    • 간기
      반년간
    • 수록기간
      2021~2025
    • 십진분류
      KDC 566 DDC 004