TR-05-9.pdf
``Fast and low-cost search schemes by exploiting localities in P2P networks"
Lei Guo, Song Jiang, Li Xiao, and Xiaodong Zhang
Journal of Parallel and Distributed Computing, Vol. 65, Issue 6, pp. 729-742.
Abstract
Existing peer-to-peer (P2P) search algorithms generally target either the
performance objective of improving search quality from a client's perspective,
or the objective of reducing search cost from an Internet management
perspective. Most existing work of designing and optimizing search algorithms
in unstructured P2P networks addresses the trade-off between the two
performance objectives. In contrast, our goal in this study is to attempt
to achieve both objectives. Motivated by our observations on the content
locality in the peer community and the localities of search interests of
individual peers, we propose content-abundant cluster-selectively
prefetching indices from responding peers (CAC-SPIRP), a fast and low-cost
P2P searching algorithm. Our algorithm consists of two components. The
first component aims to reduce the search cost by constructing a CAC, where
content-abundant peers self-identify, and self-organize themselves into an
inter-connected cluster providing a pool of popular objects to be frequently
accessed by the peer community. A query will be first routed to the CAC,
and most likely to be satisfied there, significantly reducing the amount of
network traffic and the search scope. The second component in our algorithm
is client oriented and aims to improve the quality of P2P search, called
SPIRP. A client individually identifies a small group of peers who have
the same interests as itself to prefetch their entire file indices of the
related interests, minimizing unnecessary outgoing queries and significantly
reducing query response time. Building SPIRP on the CAC Internet
infrastructure, our algorithm combines both merits of the two components
to achieve both performance objectives. Our trace-driven simulations show
that CAC-SPIRP significantly improves the overall performance from both
client's perspective and Internet management perspective.