Users are increasingly pursuing complex task-oriented goals on the web, such as making travel arrangements, managing finances, or planning purchases. To this end, they usually break down the tasks into a few codependent steps and issue multiple queries around these steps repeatedly over long periods of time. To better support users in their long-term information quests on the web, search engines keep track of their queries and clicks while searching online. In this paper, we study the problem of organizing a user's historical queries into groups in a dynamic and automated fashion. Automatically identifying query groups is helpful for a number of different search engine components and applications, such as query suggestions, result ranking, query alterations, sessionization, and collaborative search. In our approach, we go beyond approaches that rely on textual similarity or time thresholds, and we propose a more robust approach that leverages search query logs. We experimentally study the performance of different techniques, and showcase their potential, especially when combined together.
User history, search history, query clustering, query reformulation, click graph, task identification.
Databases and Information Systems | Numerical Analysis and Scientific Computing
Data Management and Analytics
IEEE Transactions on Knowledge and Data Engineering
HWANG, Heasoo; LAUW, Hady Wirawan; Getoor, Lise; and Ntoulas, Alexandros.
Organizing User Search Histories. (2012). IEEE Transactions on Knowledge and Data Engineering. 24, (5), 912-925. Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/1548