TR-03-11.pdf

``Efficient Distributed Disk Caching in Data Grid Management"

Song Jiang and Xiaodong Zhang

Proceedings of IEEE International COnference on Cluster Computing
(Cluster2003), Hong Kong, China, December 1-4, 2003.

Abstract

Effectively utilizing disk caches is critical for delivering and sharing
data in data-grids considering the large sizes of requested files and
excessively prolonged file transmission time. An essential component
in the disk cache management is its replacement policy that determines
which file(s) are least valuable and should be evicted to create space
for incoming files. Though a large number of replacement algorithms
for data objects of different sizes have been proposed recently in the
domain of Web-caching and disk caching in data grids, they inherit the
shortcomings of the LRU and LFU replacements in characterizing access
patterns.  In order to address this limit, we propose a technique to
measure relative file access locality strength -- how soon a file is to
be re-accessed before being evicted compared with other files.  When we
estimate the in-cache re-access probability, we take the disk space
consumed by accessed files as well as disk cache size into consideration.
Using a relative locality strength estimation, we are able to accurately
rank the value of each file for being cached, and select the file(s)
with least values for replacement.

Our simulation results show that our proposed policy is the most
effective one among existing policies in interpreting access patterns,
and consistently achieves performance improvement measured by hit ratios
and byte hit ratios.

Back to the Publication Page.

Back to the HPCS Main Page at the Ohio State University.