On exploiting content and citations together to compute similarity of scientific papers

Masoud Reyhani Hamedani, Sang Wook Kim, Sang Chul Lee, Dong Jin Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

In computing the similarity of scientific papers, previous text-based and link-based similarity measures look at only a single side of the content and citations. In this paper, we propose a novel approach called SimCC that effectively combines the content and citation information to accurately compute the similarity of scientific papers. Unlike previous approaches, SimCC effectively represents both authority and context of a scientific paper simultaneously in computing similarities. Also, we propose SimCC+A to consider recently-published papers. The effectiveness of our proposed method is demonstrated via extensive experiments on a real-world dataset of scientific papers, with more than 100% improvement in accuracy compared with previous methods.

Original languageEnglish
Title of host publicationCIKM 2013 - Proceedings of the 22nd ACM International Conference on Information and Knowledge Management
Pages1553-1556
Number of pages4
DOIs
StatePublished - 2013
Event22nd ACM International Conference on Information and Knowledge Management, CIKM 2013 - San Francisco, CA, United States
Duration: 27 Oct 20131 Nov 2013

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Conference

Conference22nd ACM International Conference on Information and Knowledge Management, CIKM 2013
Country/TerritoryUnited States
CitySan Francisco, CA
Period27/10/131/11/13

Keywords

  • Authority
  • Citation
  • Content
  • Scientific papers
  • Similarity

Fingerprint

Dive into the research topics of 'On exploiting content and citations together to compute similarity of scientific papers'. Together they form a unique fingerprint.

Cite this