Earticle

현재 위치 Home

Human-Machine Interaction Technology (HIT)

A Study on LLM Fine-tuning Strategies for Improving the Performance of Generative AI

첫 페이지 보기
  • 발행기관
    국제인공지능학회(구 한국인터넷방송통신학회) 바로가기
  • 간행물
    The International Journal of Advanced Smart Convergence 바로가기
  • 통권
    Volume 14 Number 4 (2025.12)바로가기
  • 페이지
    pp.105-113
  • 저자
    Jee Young Lee
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A481180

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

원문정보

초록

영어
Recently, Large Language Models (LLMs) have led to rapid performance improvements across natural language processing (NLP) and are expanding into various domains. However, pre-trained LLMs are based on general-purpose data, which can lead to domain bias and learning inefficiencies when applied to specific domains. Consequently, efficient fine-tuning techniques that can optimize specific domains while maintaining the model's generalizability are emerging as a key research topic. To overcome these limitations, this study proposed an Adaptive Layer-wise Fine-tuning (ALF) technique that performs selective parameter updates based on layer importance. ALF calculates the weight change rate of each layer during the learning process and dynamically updates only the layers with high importance, thereby improving efficiency without performance degradation while learning only about 20-30% of the total model parameters. The results of the attention weight analysis confirmed that ALF had improved focus on semantically central words, thereby improving contextual consistency and model interpretability (interpretable fine-tuning). These results suggest that ALF is an effective approach that simultaneously improves efficiency and generalization performance compared to existing fine-tuning techniques. Future research will develop a framework for optimizing generative AI models that is both sustainable and reliable, through expansion into multimodal data, reinforcement learning-based auto-tuning, and ethical fine-tuning.

목차

Abstract
1. Introduction
2. Related Work
2.1 Overview of Large Language Models (LLMs)
2.2 Basic concepts and classification of fine-tuning
2.3 Recent Research Trends in Adaptive Fine-tuning
3. Method
3.1 Overview of Research Framework
3.2 Experimental Models and Datasets
3.3 Proposed Fine-tuning Approach: Adaptive Layer-wise Fine-tuning (ALF)
3.4 Implementation Environment
4. Results and Discussion
4.1 Quantitative Performance Comparison
4.2 Attention Weight Distribution Analysis
4.3 Efficiency Evaluation
4.4 Discussion
5. Conclusion
References

키워드

LLM Fine-tuning Generative AI Parameter-efficient Tuning Adaptive Layer-wise Fine-tuning Domain Transfer

저자

  • Jee Young Lee [ Associate Professor, Department of Software, SeoKyeong University, Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    국제인공지능학회(구 한국인터넷방송통신학회) [The International Association for Artificial Intelligence]
  • 설립연도
    2000
  • 분야
    공학>전자/정보통신공학
  • 소개
    인터넷방송, 인터넷 TV , 방송 통신 네트워크 및 관련 분야에 대한 국내는 물론 국제적인 학술, 기술의 진흥발전에 공헌하고 지식 정보화 사회에 기여하고자 한다.

간행물

  • 간행물명
    The International Journal of Advanced Smart Convergence
  • 간기
    계간
  • pISSN
    2288-2847
  • eISSN
    2288-2855
  • 수록기간
    2012~2025
  • 십진분류
    KDC 326 DDC 380

이 권호 내 다른 논문 / The International Journal of Advanced Smart Convergence Volume 14 Number 4

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장