C-Rank and its variants: A contribution-based ranking approach exploiting links and content

Dong Jin Kim, Sang Chul Lee, Ho Yong Son, Sang Wook Kim, Jae Bum Lee

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

This paper addresses the problem in Web page ranking of effectively combining link and content information with efficiency high enough to be applicable to real-world search engines. Unlike previous surfer models, our approach is based on the viewpoint of a Web page author. Based on this viewpoint, we formulate the concept of contribution score, which indicates the amount to which a term in each page is utilized by other pages. To improve efficiency without loss of effectiveness, we exploit the expectations of both a Web page author and a Web search engine user on retrieval results, and restrict candidate terms that can contribute to other pages to a set of keywords of each page. In this paper, we propose three contribution-based models: C-Rank, PC-Rank and HC-Rank. Experimental results show that C-Rank provides the best precision among the models and is very effective for topic distillation tasks on the .GOV collection in TREC. Most importantly, the proposed models are efficient enough to be applicable to real-world search engines.

Original languageEnglish
Pages (from-to)761-778
Number of pages18
JournalJournal of Information Science
Volume40
Issue number6
DOIs
StatePublished - 12 Dec 2014

Bibliographical note

Publisher Copyright:
© The Author(s) 2014.

Keywords

  • C-Rank
  • Content and link ranking
  • Contribution based ranking
  • Contribution constraints
  • Web information retrieval

Fingerprint

Dive into the research topics of 'C-Rank and its variants: A contribution-based ranking approach exploiting links and content'. Together they form a unique fingerprint.

Cite this