In the modern Korean vocabulary system, compared to other idiomatic expressions, four-character idioms can be considered a remarkable variety equipped with several distinctive functions in various linguistic fields, although they account for a relatively small proportion. To expand the classification framework of four-character idioms in Korean with detailed morphological and syntactic criteria, this paper aims to conduct a corpus-based quantitative analysis on four-character idioms, not just collecting thousands of realistic samples, but also extracting adequate morphemic co-occurrence frequency information from both the Sejong colloquial and written POS Tagged Corpus. The key contributions of this paper are threefold: (1) a comprehensive review of existing definitions and classifications of four-character idioms, with particular attention to identifying distinctive morphological and semantic traits; (2) the application of corpus-linguistic methodologies to propose an improved framework supplemented with frequency-based idiom lists; and (3) a multi-stage frequency analysis to explore the internal linguistic and contextual factors affecting the deployment of four-character idioms, culminating in a set of syntactic and pragmatic usage constraints and corresponding classification strategies.
목차
ABSTRACT 1. 서론 2. 연구 배경 3. 사자성어의 범주 및 분류 3.1. 개념적 범주 비교 3.2. 형태·의미적 구분 특징 4. 말뭉치 기반 분석 결과 4.1. 사자성어 빈도 분포 4.2. 사자성어 사용 제약 5. 결론 참고문헌
키워드
Korean 4-character-idiomSejong POS Tagged CorpusLexical ClassificationCo-occurrence Frequency ListUsage Constraint