Abstract
I. INTRODUCTION
II. RELATED WORK
III. PROPOSED METHOD
A. Character-based URL Analysis using CNN
B. Word-based URL Analysis using Transformer
C. HTML DOM Structure Analysis using GCN
D. Triplet Network for Webpage Disentanglement
IV. EXPERIMENTAL RESULTS
A. Dataset and Preprocessing
B. Confusion Matrix Analysis
C. Performance Comparison
D. t-SNE Visualization of Embeddings
V. CONCLUSION
ACKNOWLEDGMENT
REFERENCES