A Machine Learning Architecture for Optimizing Web Search Engines

Justin Boyan, D. Freitag, and T. Joachims

Workshop Paper, AAAI '96 Workshop on Internet-Based Information Systems, August 1996

Abstract

Indexing systems for the World Wide Web, such as Lycos and Alta Vista, play an essential role in making the Web useful and usable. These systems are based on Information Retrieval methods for indexing plain text documents, but also include heuristics for adjusting their document rankings based on the special HTML structure of Web documents. In this paper, we describe a wide range of such heuristics (including a novel one inspired by reinforcement learning techniques for propagating rewards through a graph) which can be used to affect a search engine’s rankings. We then demonstrate a system which learns to combine these heuristics automatically, based on feedback collected unintrusively from users, resulting in much improved rankings.

Notes
AAAI Technical Report WS-96-06

BibTeX

@workshop{Boyan-1996-16324,
author = {Justin Boyan and D. Freitag and T. Joachims},
title = {A Machine Learning Architecture for Optimizing Web Search Engines},
booktitle = {Proceedings of AAAI '96 Workshop on Internet-Based Information Systems},
year = {1996},
month = {August},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.