ICNGC 2025 The 11th International Conference on Next Generation Computing 2025 (2025.12)바로가기
페이지
pp.287-290
저자
Eun-jeong, Kim, Eung-Kyo Suh
언어
영어(ENG)
URL
https://www.earticle.net/Article/A478515
원문정보
초록
영어
Early diagnosis of dementia, such as Alzheimer's disease, is clinically crucial, and deep learning-based brain MRI analysis has gained significant attention. However, acquiring large-scale datasets remains challenging in medical imaging, necessitating learning from small datasets. This study quantitatively compares the dementia classification performance of CNN-based ResNet-18 and various Transformer architectures (Swin Transformer, Vision Transformer, UNETR) using the OASIS-3 brain MRI dataset (3,428 samples) and systematically evaluates their suitability for small-scale medical data. Methods: Data comprising 2,943 normal (85.8%) and 485 dementia (14.2%) cases were split 80:20, with 5-fold cross-validation. Focal Loss was applied to mitigate class imbalance, and metrics including Sensitivity, Specificity, Balanced Accuracy, and AUC-ROC were evaluated. Results: ResNet-18 achieved the most balanced performance with Sensitivity of 79.38%, Balanced Accuracy of 77.38%, and AUC-ROC of 85.40%. Transformer models showed distinctly different patterns: Swin Transformer (Sensitivity 42.27%, AUC 81.78%) exhibited normal-class bias, Vision Transformer (Sensitivity 22.68%, AUC 47.89%) nearly failed to learn with pure global attention, and UNETR (Sensitivity 88.66%, AUC 73.00%) achieved highest sensitivity but severely low specificity (41.77%). ResNet-18 also demonstrated superior learning efficiency (parameters: 33M, training time: 2.1h) compared to Transformers (Swin: 60-70M/3.1h, ViT: 18M/3.0h, UNETR: 10- 12M/4.0h). This study confirms that CNN's inductive bias and structural efficiency are more effective than Transformer's global attention or hybrid approaches in small-scale brain MRI datasets, with ResNet-18 proving most suitable for dementia screening due to balanced sensitivity-specificity trade-off.
목차
Abstract I. INTRODUCTION II. RELATED WORK III. A METHOD OF RESEARCH A. Datasets and preprocessing B. Model Architecture C. Learning Settings D. Evaluation Indicators IV. THE RESULTS OF AN EXPERIMENT A. Test Set Performance Comparison V. DISCUSSION A. Analysis of Performance Differences B. Clinical Implications and Limitations VI. CONCLUSION REFERENCES
키워드
Brain MRIDementia diagnosisResNet-18Vision TransformerSwin TransformerUNETRDeep learningMedical imagingSmall-scale datasetFocal Loss
저자
Eun-jeong, Kim [ Dept. of Department of Data and Knowledge Service Engineering Graduate school, Dankook University Yongin, South Korea ]
Eung-Kyo Suh [ Dept. of Department of Data and Knowledge Service Engineering Graduate school, Dankook University Yongin, South Korea ]
Corresponding Author