Document Type

Conference Proceeding

Publisher

IEEE Press

Faculty

Computing, Health and Science

School

School of Computer and Information Science

RAS ID

3069

Comments

This conference paper was originally published as: Xiao, J. , & Collins, M. (2005). Similarity-aware Web Content Management and Document Pre-fetching. Proceedings of International Conference on Machine Learning and Cybernetics. (pp. 2307-2312). Guangzhou, China. IEEE Press. Original article available here

© 2005 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Abstract

Web caching is intended to reduce network traffic, server load and user-perceived retrieval latency. Web pre-fetching, which can be considered as "active" caching, builds on regular Web caching, minimizing further a Web user's access delay. To be effective, however, the pre-fetching techniques must be able to predict subsequent Web access with minimum computational overheads. This paper presents a similarity-based mechanism to support similarity-aware Web document pre-fetching between proxy caches and browsing clients. We first define a set of measures to assess similarities between Web documents, and then propose a multi-cache architecture to cache Web documents based on those similarities. A predictor is developed to support the similarity-aware document pre-fetching algorithm. Preliminary experiments have shown that our predictor offers superior performance when compared with some existing prediction algorithms.

DOI

10.1109/ICMLC.2005.1527329

Access Rights

free_to_read

 
COinS
 

Link to publisher version (DOI)

10.1109/ICMLC.2005.1527329