Earticle

다운로드

Swin Transformer-based multi-scale crowd localization method

  • 간행물
    한국차세대컴퓨팅학회 학술대회 바로가기
  • 권호(발행년)
    The 8th International Conference on Next Generation Computing 2022 (2022.10) 바로가기
  • 페이지
    pp.118-121
  • 저자
    Yi Ren, Xin He
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A419754

원문정보

초록

영어
In this paper, we propose a new framework that enables an object detector trained with only point-level annotations to estimate the centroids and sizes of objects in dense scenes. Specifically, the framework is based on the Swin Transformer structure and introduces a self-designed resolution feature fusion module in the hierarchical structure, where the estimation of object centroids is done directly by point supervision, and the object pseudo-size is initialized based on the assumption of local uniform distribution, and the regression of object size is guided by an improved congestion-aware loss function. In the NWPU-Crowd dataset, our method outperformed the existing state-of-the-art detection counting methods in F1-measure, precision, MSE evaluation criteria.

목차

Abstract
I. INTRODUCTION
II. METHOD
A. Swin Transformer
B. Resolution feature fusion module
C. Congestion-aware loss function
III. EXPERIMENTS
A. Evaluation Criteria
B. Dataset
C. Parameter Setting
D. Ablation experiments
E. Experiment results
IV. CONCLUSION
REFERENCES

저자

  • Yi Ren [ Computer Science Chongqing University of Posts and Telecommunications Chongqing, China ] Corresponding Author
  • Xin He [ Computer Science Chongqing University of Posts and Telecommunication Chongqing, China ]

참고문헌

자료제공 : 네이버학술정보

    간행물 정보

    • 간행물
      한국차세대컴퓨팅학회 학술대회
    • 간기
      반년간
    • 수록기간
      2021~2025
    • 십진분류
      KDC 566 DDC 004