Document Type

Conference Proceeding

Publisher

IEEE Press

Faculty

Computing, Health and Science

School

School of Computer and Information Science

RAS ID

3069

Comments

This conference paper was originally published as: Xiao, J., & Collins, M. (2005). Similarity-aware Web Content Management and Document Pre-fetching. Proceedings of International Conference on Machine Learning and Cybernetics (pp. 2307-2312). Guangzhou, China. IEEE Press.

Abstract

Web caching is intended to reduce network traffic, server load, and user-perceived retrieval latency. Web pre-fetching, which can be regarded as "active" caching, builds on regular Web caching to further minimize a Web user's access delay. To be effective, however, pre-fetching techniques must be able to predict subsequent Web accesses with minimal computational overhead. This paper presents a similarity-based mechanism to support similarity-aware Web document pre-fetching between proxy caches and browsing clients. We first define a set of measures to assess similarities between Web documents, and then propose a multi-cache architecture that caches Web documents based on those similarities. A predictor is developed to support the similarity-aware document pre-fetching algorithm. Preliminary experiments show that our predictor performs better than some existing prediction algorithms.

DOI

10.1109/ICMLC.2005.1527329

 
