Earticle

현재 위치 Home

International Journal of Database Theory and Application

간행물 정보
  • 자료유형
    학술지
  • 발행기관
    보안공학연구지원센터(IJDTA) [Science & Engineering Research Support Center, Republic of Korea(IJDTA)]
  • pISSN
    2005-4270
  • 간기
    격월간
  • 수록기간
    2008 ~ 2016
  • 주제분류
    공학 > 컴퓨터학
  • 십진분류
    KDC 505 DDC 605
Vol.9 No.10 (32건)
No
31

Query Evaluation on Probabilistic Databases Using Indexing and MapReduce

Kavita K. Beldar, M. D. Gayakwad, Debnath Bhattacharyya, Hye-jin Kim

보안공학연구지원센터(IJDTA) International Journal of Database Theory and Application Vol.9 No.10 2016.10 pp.363-378

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

Entity resolution technique is used for recognize the duplicate tuples which signify similar real world entities. Existing resolution technique is unable to solve the problems of higher level of heterogeneity and additional continual data alteration. Working on this type of database, there is necessitated to enumerate the integrity of data. The new approach is introduced here on probabilistic databases by unmerged duplicates for processing complex queries. This is achieved by using probabilistic databases. For competent access toward entity resolution data over a large collection of possible resolution worlds, new indexing technique is presented here. Also, a computation of query processing is reduced by using indexing structure. The focus is on set similarity relation on very big probabilistic database by using MapReduce technique. MapReduce is a popular paradigm that can process large volume data more efficiently. In this paper, different approaches proposed using MapReduce to deal with this task: 1. merge data set with MapReduce and merge data set without MapReduce, 2. Merge data set with MapReduce using Hadoop. This approaches implemented on windows and Hadoop framework and performed compressing experiments to their performances. Also the speedup ratio for both is tested.

32

Chinese Sentence Similarity Computational Model Based on Multi-Features Combination

Peiying Zhang, Qiuming Li, Huayu Li

보안공학연구지원센터(IJDTA) International Journal of Database Theory and Application Vol.9 No.10 2016.10 pp.379-386

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

Combined with the issue of single direction of the solution of the existing sentence similarity algorithms, a Chinese sentence similarity computational model based on multi-features combination was proposed. The approach combines word overlap similarity, word order similarity, dependency relationship similarity, semantic similarity, structure similarity, sentence similarity, and keyword distance similarity to calculate the similarity between sentences, using the weight to describe the contribution of each feature of the sentence, and then gets a better experiment result. Experimental results shows that this approach can fully describe the features of the sentence, therefore can improve the sentence similarity computation accuracy.

 
1 2
페이지 저장