Earticle

현재 위치 Home

Workshop Session_KETI

Efficient Dataflow for SwiGLU

첫 페이지 보기
  • 발행기관
    한국차세대컴퓨팅학회 바로가기
  • 간행물
    한국차세대컴퓨팅학회 학술대회 바로가기
  • 통권
    The 9th International Conference on Next Generation Computing 2023 (2023.12)바로가기
  • 페이지
    pp.38-39
  • 저자
    Yunpyo Hong, Seokhun Jeon, Young-Jong Jang, Byung-Soo Kim
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A448112

원문정보

초록

영어
As many LLMs have been released, modified network layers based on transformer have been researched to improve performance. However, it is essential to design LLMs in a large size for performance, and as a result, current LLMs can only be executed on large servers, and various attempts have been made to reduce the amount of computation. In this paper, we present a method to reduce the amount of computation by using the data attribute of the SwiGLU layer used by meta and google. Since SwiGLU contains an activation function, it generates a large number of near-zero values, and we try to reduce the amount of computation by skipping unnecessary operations. Our experiments show that our algorithm can reduce the computation by 13.3% when there are 20% zeros from activation function.

목차

Abstract
I. INTRODUCTION
II. BACKGROUND
III. PROPOSED ARCHITECTURE
IV. RESULT & CONCLUSION
ACKNOWLEDGMENT
REFERENCES

키워드

GLU SwiGLU dataflow LLM FFN zero skip

저자

  • Yunpyo Hong [ Korea Electronics Technology Institute SoC Platform Research Center ]
  • Seokhun Jeon [ Korea Electronics Technology Institute SoC Platform Research Center ]
  • Young-Jong Jang [ Korea Electronics Technology Institute SoC Platform Research Center ]
  • Byung-Soo Kim [ Korea Electronics Technology Institute SoC Platform Research Center ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    한국차세대컴퓨팅학회 [Korean Institute of Next Generation Computing]
  • 설립연도
    2005
  • 분야
    공학>컴퓨터학
  • 소개
    본 학회는 차세대 PC 및 그 관련분야의 학술활동을 통하여 차세대 PC의 학문 및 기술발전을 도모하고 산업발전 및 국제협력 증진을 목적으로 한다.

간행물

  • 간행물명
    한국차세대컴퓨팅학회 학술대회
  • 간기
    반년간
  • 수록기간
    2021~2025
  • 십진분류
    KDC 566 DDC 004

이 권호 내 다른 논문 / 한국차세대컴퓨팅학회 학술대회 The 9th International Conference on Next Generation Computing 2023

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장