Application of automatic topic identification on Excite Web search engine data logs


Ozmutlu H. C. , Cavdur F.

INFORMATION PROCESSING & MANAGEMENT, vol.41, no.5, pp.1243-1262, 2005 (Journal Indexed in SCI) identifier identifier

  • Publication Type: Article / Article
  • Volume: 41 Issue: 5
  • Publication Date: 2005
  • Doi Number: 10.1016/j.ipm.2004.04.018
  • Title of Journal : INFORMATION PROCESSING & MANAGEMENT
  • Page Numbers: pp.1243-1262

Abstract

The analysis of contextual information in search engine query logs enhances the understanding of Web users' search patterns. Obtaining contextual information on Web search engine logs is a difficult task, since users submit few number of queries, and search multiple topics. Identification of topic changes within a search session is an important branch of search engine user behavior analysis. The purpose of this study is to investigate the properties of a specific topic identification methodology in detail, and to test its validity. The topic identification algorithm's performance becomes doubtful in various cases. These cases are explored and the reasons underlying the inconsistent performance of automatic topic identification are investigated with statistical analysis and experimental design techniques. (c) 2004 Elsevier Ltd. All rights reserved.