Neural network applications for automatic new topic identification on excite web search engine data logs


Ozmutlu H. C. , Cavdur F., Ozmutlu S. , Spink A.

67th Annual Meeting of the American-Society-for-Information-Science-and-Technology, Rhode Island, United States Of America, 12 - 17 November 2004, vol.41, pp.310-316 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 41
  • Doi Number: 10.1002/meet.1450410137
  • City: Rhode Island
  • Country: United States Of America
  • Page Numbers: pp.310-316

Abstract

The analysis of contextual information in search engine query logs is an important, yet difficult task. Users submit few queries, and search multiple topics sometimes with closely related context. Identification of topic changes within a search session is an important branch of contextual information analysis. The purpose of this study is to propose a topic identification algorithm using neural networks. A sample from the Excite data log is selected to train the neural network and then the neural network is used to identify topic changes in the data log. As a result, 76% of topic shifts and 92% of topic continuations are identified correctly.