Abstract
I. INTRODUCTION
II. METHODS
A. Architecture
B. Dataset
C. Video Augmentation
D. Landmark Estimation
E. Landmark Ensemble Process and Data Preprocessing
F. Transformer-Based Classification Model
III. RESULTS
IV. DISCUSSION
V. CONCLUSION
ACKNOWLEDGMENT
REFERENCES