Capturing Page Freshness for Web Search

Na Dai and Brian D. Davison.

Poster (2 pages)
Official ACM published version: http://doi.acm.org/10.1145/1835449.1835658
Author's version: PDF (55KB)

Abstract
Commercial search engines increasingly recognize freshness as an important criterion for measuring the quality of search results. However, most information retrieval methods focus on the relevance of page content to given queries without considering recency. In this work, we mine page freshness from web user maintenance activities and incorporate this feature into web search. We first quantify how fresh the web is over time from two distinct perspectives (the page itself and its in-linked pages) and then exploit a temporal correlation between the two types of freshness measures to quantify the confidence of page freshness. Results demonstrate that page freshness can be better quantified when combined with temporal freshness correlation. Experiments on a real-world archival web corpus show that incorporating the combined page freshness into the search process can improve ranking performance significantly on both relevance and freshness.

In Proceedings of the 33rd Annual ACM SIGIR Conference on Research and Development in Information Retrieval, pages 871-872, Geneva, Switzerland, ACM Press, July 2010.



Last modified: 21 July 2010
Brian D. Davison