Quora search engine redirects to different discussion pages based on the search terms searched by the user. So, when questions that are semantically similar are searched on Quora, it sometimes redirects a user to different discussion pages even if there exists a page to the dedicated search. In such a case, Semantic Similarity among the questions carries highest weightage. So, for text, using traditional methods for calculating similarity, usually the text is considered as sequence of words and they just count the number of words that occurred in a sentence, on which some distance measures are applied to find the similarity, while missing the semantic level knowledge of the text during calculation. Considering such traditional methods, it will also require a huge training set as well as time to produce an accurate model. But in this Research Paper, Siamese Based Network is used that can train itself on a single example of each text to provide an accurate similarity output.I have used different types of pre-trained word embedding models like word-2-vec and glove to understand the semantics of the question pairs present in Quora Question Pair dataset. This paper introduces a new approach to calculate sentence similarity and gives astronishing results outperforming the current state of art Siamese Based LSTM models. Along with this new approach of using Manhattan LSTM with attention mechanism for similarity calculation, a comparative analysis is performed on the embedded question pairs, among different Siamese based LSTM models like LSTM and Manhattan LSTM, to predict whether the questions are similar or not and get the best model combination for Quora Question Pair.
목차
ABSTRACT 1. Deep Learning and Text Similarity 1.1 Similariy between Questions Searched in Quora 1.2 Relevance and Importance of the Research 2. Related Research 3. Research Methodology 3.1. Siamese Neural Network Flow 3.2 Model Architecture 3.3 Dataset Description 3.4. Algorithm Design and Components 4. Analysis and Evaluation 5. Conclusion References
Yan LI [ School of Textile Garment and Design, Changshu Institute of Techonology, No.99 South Third Ring Road (Donghu Campus), Changshu 215500, Jiangsu, China ]
Corresponding author
1. 게임산업을 활성화 하고,
2. 게임기술과 기술 인력을 양산할 수 있도록 교육기관의 교과과정을 개발하고,
3. 관련기술에 대한 연구발표회, 강연회, 강습회 등을 개최하며,
4. 학회지, 논문지 및 관련 문헌을 발간하고,
5. 게임 기술 개발을 위한 국제화, 표준화 등을 지원하고,
6. 산.학.연.관이 협동할 수 있는 국제적 학술교류 및 협력을 지원하고,
7. 회원 상호간의 공동 이익과 친목을 증진시킨다.
간행물
간행물명
컴퓨터게임및콘텐츠논문지(구 한국컴퓨터게임학회논문지) [Journal of Computer Games and Contents]
간기
월간
pISSN
3091-7409
eISSN
3092-3638
수록기간
2002~2026
등재여부
KCI 등재
십진분류
KDC 691DDC 793
이 권호 내 다른 논문 / 컴퓨터게임및콘텐츠논문지(구 한국컴퓨터게임학회논문지) 제35권 제4호