Leveraging Visual Language Models for Information Extraction from Semi-Structured Business Documents

Bongjin Sohn; Gunwoong Lee

216.73.217.117

개인회원 가입

개인회원
기관회원

개인회원 로그인

개인회원 가입으로 더욱 편리하게 이용하세요. 개인회원 가입

아이디/비밀번호를 잊으셨나요? 아이디/비밀번호 찾기

기관회원 로그인

소속기관에서 검색되지 않는 기관은 무료원문다운이 불가능합니다. 개인회원 가입 후 유료구매를 하시거나 소속기관 도서관에 이용문의해 주세요.

Home

Leveraging Visual Language Models for Information Extraction from Semi-Structured Business Documents

발행기관

한국경영정보학회 바로가기
간행물

한국경영정보학회 정기 학술대회 바로가기
통권

2025 한국겨영정보학회 추계학슬대회 (2025.10)바로가기
페이지

pp.81-84
저자

Bongjin Sohn, Gunwoong Lee
언어

영어(ENG)
URL

https://www.earticle.net/Article/A476014

※ 기관로그인 시 무료 이용이 가능합니다.
※ 학술발표대회집, 워크숍 자료집 중 4페이지 이내 논문은 '요약'만 제공되는 경우가 있으니, 구매 전에 간행물명, 페이지 수 확인 부탁 드립니다.

4,000원

원문정보

초록

영어: Modern enterprises maintain extensive repositories of business documents within their intranet systems, creating a critical need for automated processing capabilities of image-based documents to enhance operational efficiency. Unlike standardized forms, most business documents are semi-structured, with layouts and field positions varying widely across organizations and document types. This complexity has generated substantial demand for advanced information extraction and organization technologies, capable of handling irregular structures and diverse schemas. However, conventional Optical Character Recognition (OCR) approaches, which prioritize textual recognition, encounter significant limitations when processing complex forms due to their reliance on location-based extraction. Similarly, Key Information Extraction (KIE) techniques often require domain-specific pre-training, resulting in considerable learning and adapting costs for novel document formats. To address these challenges, this study proposes an innovative process for effectively extracting and organizing key elements from semi-structured documents by employing Visual Language Models (VLMs) that process documents as image inputs and concurrently analyze visual and linguistic information. The proposed framework determines superior extraction accuracy, economic efficiency, and even user satisfaction by exploiting both semantic textual content and spatial positioning as visual cues. Experimental results demonstrate that the VLM-based framework outperforms existing OCR and KIE solutions across multiple evaluation dimensions, while the integration of human-in-the-loop verification processes establishes a practical framework for semi-structured document automation (e.g., commercial invoice) with immediate applicability in fast-changing enterprise environments.

키워드

Vision Language Model Semi-structured Document Optical Character Recognition Key Information Extraction Human-in-the-Loop

저자

Bongjin Sohn [ Korea University Business School, Information Systems ]
Gunwoong Lee [ Korea University Business School, Information Systems ]

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

발행기관명

한국경영정보학회 [The Korea Society of Management information Systems]
설립연도
1989
분야
사회과학>경영학
소개
이 학회는 경영정보학의 연구 및 교류를 촉진하고 학문의 발전과 응용에 공헌함을 목적으로 합니다.

간행물

간행물명

한국경영정보학회 정기 학술대회 [KMIS Conference]
간기
반년간
수록기간
1990~2025
십진분류
KDC 325 DDC 658

이 권호 내 다른 논문 / 한국경영정보학회 정기 학술대회 2025 한국겨영정보학회 추계학슬대회

피인용수 : 0건 (자료제공 : 네이버학술정보)

함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

출처 : 네이버학술정보

0개의 논문이 장바구니에 담겼습니다.

페이지 저장

소속기관 조회

이용자님의 소속기관(단체)이 서비스에 가입되어 있는지 확인해 보십시오.
기관회원에 소속되어 있는 이용자는 원문을 무료로 이용할 수 있습니다.

상호: 주식회사 학술교육원 I 대표: 노방용 I 사업자등록번호: 122-81-88227 I 통신판매업신고번호: 제2008-인천부평-00176호 I 정보보호책임자: 이두영
주소: (21319)인천광역시 부평구 영성중로 50 미래타워 701호 I 전화: 0505-555-0740 I 팩스: 0505-555-0741 I 이메일: earticle@earticle.net

음성지원 및 돋보기 서비스

Earticle

Leveraging Visual Language Models for Information Extraction from Semi-Structured Business Documents

원문정보

초록

목차

키워드

저자

참고문헌

간행물 정보

발행기관

간행물

이 권호 내 다른 논문 / 한국경영정보학회 정기 학술대회 2025 한국겨영정보학회 추계학슬대회

피인용수 : 0건 (자료제공 : 네이버학술정보)

함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.