Publication Type
Report
Version
submittedVersion
Publication Date
2007
Abstract
This paper presents the duality hypothesis of search and tagging, two important behaviors of web users. The hypothesis states that if a user views a document D in the search results for query Q, the user would tend to assign document $D$ a tag identical to or similar to Q; similarly, if a user tags a document D with a tag T, the user would tend to view document D if it is in the search results obtained using T as a query. We formalize this hypothesis with a unified probabilistic model for search and tagging, and show that empirical results of several tasks on search log and tag data sets, including ad hoc search, query suggestion, and query trend analysis, all support this duality hypothesis. Since the availability of search log is limited due to the privacy concern, our study opens up a highly promising direction of using tag data to approximate or supplement search log data for studying user behavior and improving search engine accuracy.
Keywords
social bookmarking, query logs, tagging records, duality analysis
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Research Areas
Data Science and Engineering
Issue
2919
First Page
1
Last Page
9
Publisher
University of Illinois at Urbana-Champaign
City or Country
Urbana-Champaign, IL
Citation
MEI, Qiaozhu; JIANG, Jing; SU, Hang; and ZHAI, ChengXiang.
Searching and Tagging: Two Sides of the Same Coin?. (2007). 1-9.
Available at: https://ink.library.smu.edu.sg/sis_research/1043
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://www.ideals.illinois.edu/handle/2142/10998
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons