Leilei Kong, Zhimao Lu, Haoliang Qi, Zhongyuan Han
언어
영어(ENG)
URL
https://www.earticle.net/Article/A231858
※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.
원문정보
초록
영어
Providing effective methods of identification of high-obfuscation plagiarism seeds presents a significant research problem in the field of plagiarism detection. The conventional methods of plagiarism detection are based on single type of features to capture plagiarism seeds. But for high-obfuscation plagiarism detection, these single type features are not sufficient for identifying the plagiarism seeds effectively because of the varied plagiarism methods used in high-obfuscation plagiarism. This paper presents a multi-features fusion method for the high-obfuscation plagiarism seeds identification. This method exploits Logical Regression model to integrate lexicon features, syntax features, semantics features and structure features which extracted from suspicious document and source document. A multi-feature fusion classifier based on Logical Regression model is proposed to decide whether a text fragment pair can be regarded as plagiarism seeds or not. Experimental results on the PAN@CLEF2013 summary-obfuscation corpus show that the fusion of different types of features produces more accurate results.
목차
Abstract 1. Introduction 2. Related Work 3. Multi-features fusion via Logical Regression model 3.1. Problem 3.2. Multi-Features Fusion 3.3. Features set for high-obfuscation plagiarism detection 4. Result 4.1. Corpus 4.2. Measures 4.3. Baseline 4.4. Experimental results 5. Conclusion Acknowledgements References
키워드
plagiarism detectionhigh-obfuscation plagiarismplagiarism seedsmulti-feature fusionlogical regression model
저자
Leilei Kong [ Harbin Engineering University, Heilongjiang Institute of Technology ]
Zhimao Lu [ Harbin Engineering University ]
Haoliang Qi [ Heilongjiang Institute of Technology ]
Zhongyuan Han [ Heilongjiang Institute of Technology ]
보안공학연구지원센터(IJUNESST) [Science & Engineering Research Support Center, Republic of Korea(IJUNESST)]
설립연도
2006
분야
공학>컴퓨터학
소개
1. 보안공학에 대한 각종 조사 및 연구
2. 보안공학에 대한 응용기술 연구 및 발표
3. 보안공학에 관한 각종 학술 발표회 및 전시회 개최
4. 보안공학 기술의 상호 협조 및 정보교환
5. 보안공학에 관한 표준화 사업 및 규격의 제정
6. 보안공학에 관한 산학연 협동의 증진
7. 국제적 학술 교류 및 기술 협력
8. 보안공학에 관한 논문지 발간
9. 기타 본 회 목적 달성에 필요한 사업
간행물
간행물명
International Journal of u- and e- Service, Science and Technology
간기
격월간
pISSN
2005-4246
수록기간
2008~2016
십진분류
KDC 505DDC 605
이 권호 내 다른 논문 / International Journal of u- and e- Service, Science and Technology Vol.7 No.4