Earticle

현재 위치 Home

유사도 알고리즘을 활용한 시맨틱 프로세스 검색방안
Semantic Process Retrieval with Similarity Algorithms

첫 페이지 보기
  • 발행기관
    한국경영정보학회 바로가기
  • 간행물
    Asia Pacific Journal of Information Systems KCI 등재 바로가기
  • 통권
    제18권 제1호 (2008.03)바로가기
  • 페이지
    pp.79-96
  • 저자
    이홍주, Mark Klein
  • 언어
    한국어(KOR)
  • URL
    https://www.earticle.net/Article/A91758

※ 기관로그인 시 무료 이용이 가능합니다.

5,200원

원문정보

초록

영어
One of the roles of the Semantic Web services is to execute dynamic intra-organizational services including the integration and interoperation of business processes. Since different organizations design their processes differently, the retrieval of similar semantic business processes is necessary in order to support inter-organizational collaborations. Most approaches for finding services that have certain features and support certain business processes have relied on some type of logical reasoning and exact matching.
This paper presents our approach of using imprecise matching for expanding results from an exact matching engine to query the OWL(Web Ontology Language) MIT Process Handbook. MIT Process Handbook is an electronic repository of best-practice business processes. The Handbook is intended to help people: (1) redesigning organizational processes, (2) inventing new processes, and (3) sharing ideas about organizational practices.
In order to use the MIT Process Handbook for process retrieval experiments, we had to export it into an OWL-based format. We model the Process Handbook meta-model in OWL and export the processes in the Handbook as instances of the meta-model. Next, we need to find a sizable number of queries and their corresponding correct answers in the Process Handbook. Many previous studies devised artificial dataset composed of randomly generated numbers without real meaning and used subjective ratings for correct answers and similarity values between processes. To generate a semantic-preserving test data set, we create 20 variants for each target process that are syntactically different but semantically equivalent using mutation
operators. These variants represent the correct answers of the target process.
We devise diverse similarity algorithms based on values of process attributes and structures of business processes. We use simple similarity algorithms for text retrieval such as TF-IDF and Levenshtein edit distance to devise our approaches, and utilize tree edit distance measure because semantic processes are appeared to have a graph structure. Also, we design similarity algorithms considering similarity of process structure such as part process, goal, and exception. Since we can identify relationships between semantic process and its subcomponents, this information can be utilized for calculating similarities between processes. Dice’s coefficient and Jaccard similarity measures are utilized to calculate portion of overlaps between processes
in diverse ways.
We perform retrieval experiments to compare the performance of the devised similarity algorithms. We measure the retrieval performance in terms of precision, recall and F measure? the harmonic mean of precision and recall. The tree edit distance shows the poorest performance in terms of all measures. TF-IDF and the method incorporating TF-IDF measure and Levenshtein edit distance show better performances than other devised methods. These two measures are focused on similarity between name and descriptions of process.
In addition, we calculate rank correlation coefficient, Kendall’s tau b, between the number of process mutations and ranking of similarity values among the mutation sets. In this experiment, similarity measures based on process structure, such as Dice’s, Jaccard, and derivatives of these measures, show greater coefficient than measures based on values of process attributes. However, the Lev-TFIDF-JaccardAll measure considering process structure and attributes’ values together shows reasonably better performances in these two experiments. For retrieving semantic process, we can think that it’s better to consider diverse aspects of process similarity such as process structure and values of process attributes.
We generate semantic process data and its dataset for retrieval experiment from MIT Process Handbook repository. We suggest imprecise query algorithms that expand retrieval results from exact matching engine such as SPARQL, and compare the retrieval performances of the similarity algorithms. For the limitations and future work, we need to perform experiments with other dataset from other domain. And, since there are many similarity values from diverse measures, we may find better ways to identify relevant processes by applying these values simultaneously.

목차

Abstract
 Ⅰ. 서론
 Ⅱ. 관련 연구
 Ⅲ. 시맨틱 프로세스 표현
  3.1 MIT 프로세스 핸드북
  3.2 프로세스 핸드북 온톨로지와 OWL 파일
 Ⅳ. 시맨틱 비즈니스 프로세스 검색방안
  4.1 유사도 알고리즘
  4.2 실험
 Ⅴ. 토의 및 결론
  5.1 토의
  5.2 결론 및 향후 연구 방향
 참고문헌

키워드

Semantic Business Process Process Retrieval Similarity Semantic Web

저자

  • 이홍주 [ Hong Joo Lee | 가톨릭대학교 경영학부 ] 교신저자
  • Mark Klein [ Principal Research Scientist at the MIT Center for Collective Intelligence ]

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    한국경영정보학회 [The Korea Society of Management information Systems]
  • 설립연도
    1989
  • 분야
    사회과학>경영학
  • 소개
    이 학회는 경영정보학의 연구 및 교류를 촉진하고 학문의 발전과 응용에 공헌함을 목적으로 합니다.

간행물

  • 간행물명
    Asia Pacific Journal of Information Systems
  • 간기
    계간
  • pISSN
    2288-5404
  • eISSN
    2288-6818
  • 수록기간
    1990~2026
  • 등재여부
    KCI 등재,SCOPUS
  • 십진분류
    KDC 325 DDC 658

이 권호 내 다른 논문 / Asia Pacific Journal of Information Systems 제18권 제1호

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장