Explorations in Tag Suggestion and Query Expansion

Jian Wang and Brian D. Davison

Full Paper (8 pages)
Official ACM published version: http://doi.acm.org/10.1145/1458583.1458592
Author's copy: PDF (571KB)

Abstract
The query used in a search system is only an approximation to the user's true information need, and as a result, many factors can reduce the quality of search results. One is query ambiguity, causing searchers with different needs to issue the same query. For example, for the query java, some users may want to find java tutorial while others may want to download java software. Other factors include a vocabulary mismatch and a lack of knowledge regarding the contents of the document collection. In any case, many users benefit from assistance in forming a good query. As a result, some commercial services provide query suggestions for many queries.

In this paper, we propose a Tag Suggestion System that takes advantage of tags associated with query results to expand a searcher's query. Since not every web page is associated with existing tags, we first build an auto-tagging system which can assign multiple tags to web pages, including news, blogs, etc. The current system contains the most popular 140 tags in del.icio.us, with high precision performance.

A small user study is performed to evaluate anecdotally the performance of our Tag Suggestion System, showing better quality than the query suggestion mechanisms provided by Yahoo! and Google. The result pages of expanded queries generated by the Tag Suggestion System are also significantly better than those of the Google original system.

Citation: In Proceedings of the CIKM 2008 Workshop on Search in Social Media (SSM 2008), pages 43-50, Napa Valley, CA, October 2008.

© ACM, 2008. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution.

Back to Brian Davison's publications


Last modified: 25 November 2008 Brian D. Davison