Earticle

현재 위치 Home

Oral Session B-3 : Biomedical Applications

Robust Out-of-Stock Detection by Localizing Products with Open-Vocabulary Models

첫 페이지 보기
  • 발행기관
    한국차세대컴퓨팅학회 바로가기
  • 간행물
    한국차세대컴퓨팅학회 학술대회 바로가기
  • 통권
    ICNGC 2025 The 11th International Conference on Next Generation Computing 2025 (2025.12)바로가기
  • 페이지
    pp.296-299
  • 저자
    Rakwon Choi, Junwon Son, Wonsuk Kim, Wonsuk Kim
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A478518

원문정보

초록

영어
Out-of-Stock (OOS) detection is a critical task in retail management, yet traditional automated methods suffer from significant limitations. Existing approaches, whether based on classical image processing or closed-set deep learning models, lack scalability and robustness, often requiring extensive retraining for every new shelf layout or product type. This paper proposes a novel and flexible OOS detection pipeline that leverages the power of Open- Vocabulary Object Detection (OVD). Instead of attempting to directly detect "empty space," our method first identifies all present products using an OVD model guided by flexible text prompts. An inverse occupancy mask is then generated to identify potential OOS regions, which are subsequently refined through a robust multi-stage post-processing filter. Experiments on our custom dataset of 149 diverse retail shelf images demonstrate the superiority of this approach. Our method achieves an OOS detection Accuracy of 87.9%, vastly outperforming the baseline approach (70.1% Accuracy). Furthermore, by applying per-shelf optimized prompts and parameters, our model's Accuracy increases to 96.84%, highlighting its high adaptability and effectiveness for realworld retail environments.

목차

Abstract
I. INTRODUCTION
II. METHODOLOGY
A. Geometric Preprocessing
B. Open-Vocabulary Product Detection
C. Inverse Occupancy Mask Generation
D. Mask Postprocessing
III. EXPERIMENTS AND RESULTS
A. Experiment Settings
B. Experimental Results
IV. CONCLUSION AND FUTURE WORK
ACKNOWLEDGMENT
REFERENCES

키워드

Out-of-stock Vision Language Model Open Vocabulary Object Detection

저자

  • Rakwon Choi [ Safe AI Seoul, South Korea ]
  • Junwon Son [ Safe AI Seoul, South Korea ]
  • Wonsuk Kim [ Safe AI Seoul, South Korea ]
  • Wonsuk Kim [ Safe AI Seoul, South Korea ]

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    한국차세대컴퓨팅학회 [Korean Institute of Next Generation Computing]
  • 설립연도
    2005
  • 분야
    공학>컴퓨터학
  • 소개
    본 학회는 차세대 PC 및 그 관련분야의 학술활동을 통하여 차세대 PC의 학문 및 기술발전을 도모하고 산업발전 및 국제협력 증진을 목적으로 한다.

간행물

  • 간행물명
    한국차세대컴퓨팅학회 학술대회
  • 간기
    반년간
  • 수록기간
    2021~2025
  • 십진분류
    KDC 566 DDC 004

이 권호 내 다른 논문 / 한국차세대컴퓨팅학회 학술대회 ICNGC 2025 The 11th International Conference on Next Generation Computing 2025

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장