Application of automatic topic identification on Excite Web search engine data logs


Ozmutlu H. C., Cavdur F.

INFORMATION PROCESSING & MANAGEMENT, vol.41, no.5, pp.1243-1262, 2005 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 41 Issue: 5
  • Publication Date: 2005
  • DOI: 10.1016/j.ipm.2004.04.018
  • Journal Name: INFORMATION PROCESSING & MANAGEMENT
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), Scopus
  • Pages: pp.1243-1262
  • Affiliated with Bursa Uludağ University: Yes

Abstract

The analysis of contextual information in search engine query logs enhances the understanding of Web users' search patterns. Obtaining contextual information from Web search engine logs is a difficult task, since users submit only a few queries and search on multiple topics. Identification of topic changes within a search session is an important branch of search engine user behavior analysis. The purpose of this study is to investigate the properties of a specific topic identification methodology in detail, and to test its validity. The topic identification algorithm's performance is questionable in various cases. These cases are explored, and the reasons underlying the inconsistent performance of automatic topic identification are investigated with statistical analysis and experimental design techniques. (c) 2004 Elsevier Ltd. All rights reserved.