Publication Type

Journal Article

Version

publishedVersion

Publication Date

5-2012

Abstract

Users are increasingly pursuing complex task-oriented goals on the web, such as making travel arrangements, managing finances, or planning purchases. To this end, they usually break down the tasks into a few codependent steps and issue multiple queries around these steps repeatedly over long periods of time. To better support users in their long-term information quests on the web, search engines keep track of their queries and clicks while searching online. In this paper, we study the problem of organizing a user's historical queries into groups in a dynamic and automated fashion. Automatically identifying query groups is helpful for a number of different search engine components and applications, such as query suggestions, result ranking, query alterations, sessionization, and collaborative search. In our approach, we go beyond approaches that rely on textual similarity or time thresholds, and we propose a more robust approach that leverages search query logs. We experimentally study the performance of different techniques, and showcase their potential, especially when combined together.

Keywords

User history, search history, query clustering, query reformulation, click graph, task identification.

Discipline

Databases and Information Systems | Numerical Analysis and Scientific Computing

Publication

IEEE Transactions on Knowledge and Data Engineering

Volume

24

Issue

5

First Page

912

Last Page

925

ISSN

1041-4347

Identifier

10.1109/TKDE.2010.251

Publisher

IEEE

Copyright Owner and License

Authors

Additional URL

https://doi.org/10.1109/TKDE.2010.251

Share

COinS