Document Type
Conference Proceeding
Publisher
IEEE Press
Faculty
Faculty of Computing, Health and Science
School
School of Computer and Information Science
RAS ID
3069
Abstract
Web caching is intended to reduce network traffic, server load and user-perceived retrieval latency. Web pre-fetching, which can be considered as "active" caching, builds on regular Web caching, minimizing further a Web user's access delay. To be effective, however, the pre-fetching techniques must be able to predict subsequent Web access with minimum computational overheads. This paper presents a similarity-based mechanism to support similarity-aware Web document pre-fetching between proxy caches and browsing clients. We first define a set of measures to assess similarities between Web documents, and then propose a multi-cache architecture to cache Web documents based on those similarities. A predictor is developed to support the similarity-aware document pre-fetching algorithm. Preliminary experiments have shown that our predictor offers superior performance when compared with some existing prediction algorithms.
DOI
10.1109/ICMLC.2005.1527329
Access Rights
free_to_read
Comments
This is an Author's Accepted Manuscript of: Xiao, J. , & Collins, M. (2005). Similarity-aware Web Content Management and Document Pre-fetching. Proceedings of International Conference on Machine Learning and Cybernetics. (pp. 2307-2312). Guangzhou, China. IEEE Press. Available here
© 2005 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.