Title

Fuzzy Matching of Web Queries to Structured Data

Publication Type

Conference Proceeding Article

Publication Date

3-2010

Abstract

Recognizing the alternative ways people use to reference an entity, is important for many Web applications that query structured data. In such applications, there is often a mismatch between how content creators describe entities and how different users try to retrieve them. In this paper, we consider the problem of determining whether a candidate query approximately matches with an entity. We propose an off-line, data-driven, bottom-up approach that mines query logs for instances where Web content creators and Web users apply a variety of strings to refer to the same Web pages. This way, given a set of strings that reference entities, we generate an expanded set of equivalent strings for each entity. The proposed method is verified with experiments on real-life data sets showing that we can dramatically increase the queries that can be matched.

Discipline

Databases and Information Systems | Numerical Analysis and Scientific Computing

Research Areas

Data Management and Analytics

Publication

IEEE International Conference on Data Engineering

Identifier

10.1109/ICDE.2010.5447817

Publisher

IEEE

Additional URL

http://doi.ieeecomputersociety.org/10.1109/ICDE.2010.5447817

This document is currently not available here.

Share

COinS