Full Paper (8 pages)
Official ACM published version: http://dx.doi.org/10.1145/1148170.1148189
Author's version: PDF (273KB)
Traditional link-based web ranking schemes use a single score to measure a page's authority, without regard to the community from which that authority is derived. As a result, a resource that is highly popular for one topic may dominate the results for another topic in which it is less authoritative. To address this problem, we suggest calculating a score vector for each page to distinguish the contributions from different topics, using a random walk model that probabilistically combines page topic distribution and link structure. We show how to incorporate the topical model into both PageRank and HITS without affecting their overall properties, while still providing insight into topic-level transitions. Experiments on multiple datasets indicate that our technique outperforms other ranking approaches that incorporate textual analysis.
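To make the idea of a per-page score vector concrete, here is a minimal Python sketch of a topic-aware random walk layered on PageRank. It is an illustration under stated assumptions, not necessarily the paper's exact formulation: the function names, the `stay` parameter (the probability of keeping the current topic when following a link), the choice to draw a new topic from the source page's topic distribution, the damping value, and the toy graph are all hypothetical. It also assumes every page has at least one out-link.

```python
def topical_pagerank(links, topics, damping=0.85, stay=0.5, iters=100):
    """Return a score vector (one entry per topic) for each page.

    links:  dict page -> list of pages it links to (assumed non-empty)
    topics: dict page -> list of topic probabilities summing to 1
    stay:   hypothetical probability of keeping the current topic on a link
    """
    pages = list(links)
    n = len(pages)
    k = len(next(iter(topics.values())))
    # Start with uniform mass per page, split by that page's topic mix.
    score = {p: [topics[p][t] / n for t in range(k)] for p in pages}
    for _ in range(iters):
        # Random jump: land on any page, in a topic drawn from its content.
        nxt = {p: [(1 - damping) * topics[p][t] / n for t in range(k)]
               for p in pages}
        for src in pages:
            total = sum(score[src])
            share = damping / len(links[src])
            for dst in links[src]:
                for t in range(k):
                    # Follow a link: keep topic t, or switch to a topic
                    # drawn from the source page's distribution (assumption).
                    nxt[dst][t] += share * (
                        stay * score[src][t]
                        + (1 - stay) * topics[src][t] * total
                    )
        score = nxt
    return score


def pagerank(links, damping=0.85, iters=100):
    """Plain single-score PageRank, used here only for comparison."""
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iters):
        nxt = {p: (1 - damping) / n for p in pages}
        for src in pages:
            for dst in links[src]:
                nxt[dst] += damping * rank[src] / len(links[src])
        rank = nxt
    return rank


# Toy graph and topic distributions (made up for illustration).
links = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
topics = {"a": [1.0, 0.0], "b": [0.5, 0.5], "c": [0.0, 1.0]}

scores = topical_pagerank(links, topics)
ranks = pagerank(links)
for page in links:
    print(page, scores[page], sum(scores[page]), ranks[page])
```

In this sketch, summing a page's topic scores gives the same value as the plain PageRank score, which is one way to read the claim that the topical model leaves the overall ranking property unchanged while exposing how authority splits across topics.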
In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 91-98, Seattle, WA, August 6-11, 2006.
Back to Brian Davison's publications