The use of electronic documents has increased rapidly in recent decades, and PDF is one of the most commonly used electronic document formats. A scanned PDF is an image and does not actually contain any text. For a vision-impaired user who depends on a screen reader to access this information, this format is not useful. Addressing PDF accessibility through assistive technology has therefore become an important concern. PDF layout analysis provides valuable formatting information that supports the classification of PDF components. This classification in turn facilitates tag generation, and accurate tagging produces a searchable and navigable scanned PDF document. This paper describes several practical segmentation methods for PDF layout analysis that are easy to implement and efficient, so that a scanned PDF document can be navigated or searched using assistive technologies.
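The abstract does not say which segmentation algorithms the paper uses. As an illustration only, one widely used way to split a binarized page into text lines is a horizontal projection profile: count the ink pixels in each row and treat runs of non-empty rows as lines. The sketch below assumes the page is already binarized into a list of rows of 0/1 values (1 = ink); the function name `segment_lines` and the `min_ink` threshold are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple


def segment_lines(binary: List[List[int]], min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges of text lines; bottom is exclusive.

    `binary` is an assumed input format: one list per pixel row, 1 = ink.
    """
    # Horizontal projection profile: number of ink pixels in each row.
    profile = [sum(row) for row in binary]

    lines = []
    start = None  # first row of the line currently being scanned, if any
    for y, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = y                      # a text line begins
        elif ink < min_ink and start is not None:
            lines.append((start, y))       # a blank row closes the line
            start = None
    if start is not None:
        lines.append((start, len(profile)))  # line touching the bottom edge
    return lines


# Tiny synthetic "page": two text lines separated by blank rows.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
print(segment_lines(page))  # [(1, 3), (5, 6)]
```

Applying the same idea to column sums within a detected line gives a simple word-level split. Real scans usually need noise filtering and skew correction first, which is why a threshold such as `min_ink` is exposed here.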
Table of Contents
Abstract
1. Introduction
3. Pre Processing
  3.1. Format Conversion
  3.2. Binarization
  3.3. Scaling Image
  3.4. Margin Removal
  3.5. Skew Detection and Correction
4. Block Segmentation
5. Text-Image Segmentation
6. Line Segmentation
7. Word Segmentation
8. Vertical-Horizontal-Recursive-Segmentation
9. Conclusion
References
Keywords
PDF layout analysis; Optical character recognition (OCR); Vision-impaired
Authors
Azadeh Nazemi [ Electrical and Computer Engineering Spatial Sciences, Curtin University, Perth, WA, Australia ]
Iain Murray [ Electrical and Computer Engineering Spatial Sciences, Curtin University, Perth, WA, Australia ]
David A. McMeekin [ Electrical and Computer Engineering Spatial Sciences, Curtin University, Perth, WA, Australia ]
Science & Engineering Research Support Center, Republic of Korea (IJSIP)
Established
2006
Field
Engineering > Computer Science
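About
1. Surveys and research on security engineering
2. Research and presentation of applied technologies in security engineering
3. Hosting academic conferences and exhibitions on security engineering
4. Mutual cooperation and information exchange on security engineering technology
5. Standardization work and establishment of specifications for security engineering
6. Promotion of industry-academia-research cooperation in security engineering
7. International academic exchange and technical cooperation
8. Publication of journals on security engineering
9. Other activities necessary to achieve the society's objectives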
Publication
Title
International Journal of Signal Processing, Image Processing and Pattern Recognition
Frequency
Bimonthly
pISSN
2005-4254
Coverage period
2008 to 2016
Decimal classification
KDC 505, DDC 605
Other articles in this issue: International Journal of Signal Processing, Image Processing and Pattern Recognition, Vol. 7, No. 4