2016 (314)
2015 (176)
2014 (107)
2013 (62)
2012 (33)
2011 (24)
2010 (19)
2009 (18)
2008 (7)
Query Evaluation on Probabilistic Databases Using Indexing and MapReduce
보안공학연구지원센터(IJDTA) International Journal of Database Theory and Application Vol.9 No.10 2016.10 pp.363-378
※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.
Entity resolution technique is used for recognize the duplicate tuples which signify similar real world entities. Existing resolution technique is unable to solve the problems of higher level of heterogeneity and additional continual data alteration. Working on this type of database, there is necessitated to enumerate the integrity of data. The new approach is introduced here on probabilistic databases by unmerged duplicates for processing complex queries. This is achieved by using probabilistic databases. For competent access toward entity resolution data over a large collection of possible resolution worlds, new indexing technique is presented here. Also, a computation of query processing is reduced by using indexing structure. The focus is on set similarity relation on very big probabilistic database by using MapReduce technique. MapReduce is a popular paradigm that can process large volume data more efficiently. In this paper, different approaches proposed using MapReduce to deal with this task: 1. merge data set with MapReduce and merge data set without MapReduce, 2. Merge data set with MapReduce using Hadoop. This approaches implemented on windows and Hadoop framework and performed compressing experiments to their performances. Also the speedup ratio for both is tested.
Chinese Sentence Similarity Computational Model Based on Multi-Features Combination
보안공학연구지원센터(IJDTA) International Journal of Database Theory and Application Vol.9 No.10 2016.10 pp.379-386
※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.
Combined with the issue of single direction of the solution of the existing sentence similarity algorithms, a Chinese sentence similarity computational model based on multi-features combination was proposed. The approach combines word overlap similarity, word order similarity, dependency relationship similarity, semantic similarity, structure similarity, sentence similarity, and keyword distance similarity to calculate the similarity between sentences, using the weight to describe the contribution of each feature of the sentence, and then gets a better experiment result. Experimental results shows that this approach can fully describe the features of the sentence, therefore can improve the sentence similarity computation accuracy.
0개의 논문이 장바구니에 담겼습니다.
선택하신 파일을 압축중입니다.
잠시만 기다려 주십시오.