Earticle

현재 위치 Home

Oral Session A-1: Computer Vision

Multi-Stage Cascade R-CNN with Feature Pyramid Network for Pothole Detection

첫 페이지 보기
  • 발행기관
    한국차세대컴퓨팅학회 바로가기
  • 간행물
    한국차세대컴퓨팅학회 학술대회 바로가기
  • 통권
    ICNGC 2025 The 11th International Conference on Next Generation Computing 2025 (2025.12)바로가기
  • 페이지
    pp.22-24
  • 저자
    Thinh Nguyen, Thuan Bui, An Vu, Hieu Tran, Minh Duc Le, Ngoc Dung Bui
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A478451

원문정보

초록

영어
Pothole detection remains a critical challenge in road maintenance and safety management, as potholes deteriorate road surfaces, compromise vehicle safety, and increase maintenance costs. Traditional pothole detection methods relying on manual inspection or simple image processing are often labor-intensive, prone to human error, and lack adaptability to varying road conditions. Meanwhile, modern approaches utilizing single-stage object detectors such as YOLO variants have provided real-time detection capabilities but tend to suffer in accurately localizing potholes at higher Intersection over Union (IoU) thresholds, especially when faced with the irregular shapes and scale variability characteristic of real-world potholes. To overcome these limitations, a multi-stage detection framework based on Cascade Region-based Convolutional Neural Network (Cascade R-CNN) with a ResNet-50 backbone and a Feature Pyramid Network (FPN) was developed. This framework employs progressive bounding box refinement through multiple detection stages with increasingly strict IoU thresholds, resulting in improved localization precision. The model was trained and evaluated on a meticulously curated dataset of more than 30,000 images featuring diverse pothole instances. It achieves a mean Average Precision (mAP) of 0.653 across IoU thresholds from 0.5 to 0.95, surpassing the baseline Faster RCNN by 4.3 points and outperforming YOLOv8 by 5 points. On an NVIDIA RTX 4090 GPU, the proposed model runs at approximately 80–90 frames per second, which enables nearreal- time execution and renders it practical for integration into automated road inspection and maintenance systems. These results indicate that the proposed Cascade R-CNN framework offers a robust and effective solution for high-accuracy pothole detection, addressing the shortcomings of existing detection methods in complex road environments.

목차

Abstract
I. INTRODUCTION
II. PROPOSED METHOD
III. EXPERIMENTS AND RESULTS
A. Experimental Setup
B. Results
IV. CONCLUSION
REFERENCES

키워드

pothole detection cascade R-CNN ResNet-50 backbone feature pyramid network object localization

저자

  • Thinh Nguyen [ Swinburne Vietnam, FPT University Hanoi, Vietnam ]
  • Thuan Bui [ Swinburne Vietnam, FPT University Hanoi, Vietnam ]
  • An Vu [ Swinburne Vietnam, FPT University Hanoi, Vietnam ]
  • Hieu Tran [ Swinburne Vietnam, FPT University Hanoi, Vietnam ]
  • Minh Duc Le [ Swinburne Vietnam, FPT University Hanoi, Vietnam ]
  • Ngoc Dung Bui [ University of Transport and Communications Hanoi, Vietnam ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    한국차세대컴퓨팅학회 [Korean Institute of Next Generation Computing]
  • 설립연도
    2005
  • 분야
    공학>컴퓨터학
  • 소개
    본 학회는 차세대 PC 및 그 관련분야의 학술활동을 통하여 차세대 PC의 학문 및 기술발전을 도모하고 산업발전 및 국제협력 증진을 목적으로 한다.

간행물

  • 간행물명
    한국차세대컴퓨팅학회 학술대회
  • 간기
    반년간
  • 수록기간
    2021~2025
  • 십진분류
    KDC 566 DDC 004

이 권호 내 다른 논문 / 한국차세대컴퓨팅학회 학술대회 ICNGC 2025 The 11th International Conference on Next Generation Computing 2025

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장