Generating synonyms based on query log data
An approach is described for generating synonyms to supplement at least one information item, such as, in one case, a set of related items. The approach can involve an expansion phase, a clean-up phase, and a reduction phase. In the expansion phase, the approach identifies, for each related item, a set of initial synonym candidates. In the clean-up phase, the approach removes noise from the set of initial synonym candidates (if such noise exists), to provide a set of filtered synonym candidate items. In the reduction phase, the approach ranks and applies a threshold (or thresholds) to the set of filtered synonym candidate items, to generate, for each information item, a set of selected synonyms. The approach uses query log data at various points in its operation. The selected synonyms can be used to improve the effectiveness of user searches.
Computer Sciences | Databases and Information Systems
Data Management and Analytics
PAPARIZOS, Stelios; CHENG, Tao; and LAUW, Hady Wirawan.
Generating synonyms based on query log data. (2010). Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/3315