Publication Type
Journal Article
Version
publishedVersion
Publication Date
5-2012
Abstract
Users are increasingly pursuing complex task-oriented goals on the web, such as making travel arrangements, managing finances, or planning purchases. To this end, they usually break down the tasks into a few codependent steps and issue multiple queries around these steps repeatedly over long periods of time. To better support users in their long-term information quests on the web, search engines keep track of their queries and clicks while searching online. In this paper, we study the problem of organizing a user's historical queries into groups in a dynamic and automated fashion. Automatically identifying query groups is helpful for a number of different search engine components and applications, such as query suggestions, result ranking, query alterations, sessionization, and collaborative search. In our approach, we go beyond approaches that rely on textual similarity or time thresholds, and we propose a more robust approach that leverages search query logs. We experimentally study the performance of different techniques, and showcase their potential, especially when combined together.
Keywords
User history, search history, query clustering, query reformulation, click graph, task identification.
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing
Publication
IEEE Transactions on Knowledge and Data Engineering
Volume
24
Issue
5
First Page
912
Last Page
925
ISSN
1041-4347
Identifier
10.1109/TKDE.2010.251
Publisher
IEEE
Citation
HWANG, Heasoo; LAUW, Hady W.; GETOOR, Lise; and NTOULAS, Alexandros.
Organizing User Search Histories. (2012). IEEE Transactions on Knowledge and Data Engineering. 24, (5), 912-925.
Available at: https://ink.library.smu.edu.sg/sis_research/1548
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/TKDE.2010.251
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons