Documented Scientific Discoveries and Technical Innovations

Welcome to the High Performance Computing and Software Laboratory Technical Report Browser

This document lists the titles of selected technical reports (published or to be published) of the High Performance Computing and Software Laboratory (since 1994) with links to corresponding ./publications/abstracts. Included in the heading of each ./publications/abstract is a link to download the actual technical report.

Papers sorted by Years


2008

``The stretched exponential distribution of Internet media access patterns" , Proceedings of 27th ACM Symposium on Principles of Distributed Computing (PODC 2008), Toronto, Canada, August 18-21, 2008.

``Caching for Bursts (C-Burst): let hard disks sleep well and work energetically", Proceedings of 13th ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED'08), Bangalore, India, August 11-13, 2008.

``LightFlood: minimizing redundant messages and maximizing scope of peer-to-peer search", IEEE Transactions on Parallel and Distributed Systems, Vol. 19, No. 5, 2008.

``Gaining insights into multicore cache partitioning: bridging the gap between simulation and real systems", Proceedings of the 14th International Symposium on High Performance Computer Architecture (HPCA'08), Salt Lake City, Utah, February 16-20, 2008.


2007

``PSM-Throttling: minimizing energy comsumption for bulk data communications in WLANs", Proceedings of the 15th International Conference on Network Protocols, (ICNP'07), Beijing, China, October 16-19, 2007.

``SProxy: a caching infrastructure to support Internet streaming", IEEE Transactions on Multimedia, Vol. 9, No. 5, 2007.

``Cost-aware caching algorithms for distributed storage servers", Proceedings of the 21st International Symposium on Distributed Computing (DISC'07), Lemesos, Cyprus, September 24-26, 2007.

``Maintaining strong cache consistency for the Domain Name System", IEEE Transactions on Knowledge and Data Engineering, Vol. 19, No. 8, 2007.

``SCAP: Smart Caching in wireless Access Points to improve P2P streaming", Proceedings of the 27 International Conference on Distributed Computing Systems (ICDCS'07), Toronto, Canada, June 25-29, 2007.

``STEP: Sequentiality and Thrashing Detection based Prefetching to improve performance of networked storage servers", Proceedings of the 27 International Conference on Distributed Computing Systems (ICDCS'07), Toronto, Canada, June 25-29, 2007.

``DiskSeen: exploiting disk layout and access history to enhance I/O prefetch", Proceedings of 2007 USENIX Annual Technical Conference (USENIX'07), Santa Clara, California, June 17-22, 2007.

``Does Internet media traffic really follow Zipf-like distribution?", (an extended abstract), Proceedings of 2007 ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'07), San Diego, California, June 12-16, 2007.

``Design and Analysis of Sensing Scheduling Algorithms under Partial Coverage for Object Detection in Sensor Networks", IEEE Transactions on Parallel and Distributed Systems, Vol. 18, No. 3, 2007.

``Cooperative Relay Service in a Wireless LAN", IEEE Journal on Selected Areas in Communications, Vol. 25, No. 2, 2007.

``A Performance Study of BitTorrent-like Peer-to-Peer Systems", IEEE Journal on Selected Areas in Communications, Vol. 25, No. 1, 2007.

``Coordinated multilevel buffer cache management with consistent access locality quantification", IEEE Transactions on Computers, Vol. 56, No. 1, 2007.


2006

``Delving into Internet streaming media delivery: a quality and resource utilization perspective", Proceedings of ACM SIGCOMM Internet Measurement Conference (IMC'06), Rio de Janeiro, Brazil, October 25-27, 2006.

``SmartSaver: turning flash drive into a disk energy saver for mobile computers", Proceedings of 11th ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED'06), Tegernsee, Germany, October 4-6, 2006.

``DNScup: a strong cache consistency protocol for DNS", Proceedings of the 26th International Conference on Distributed Computing Systems (ICDCS'06), Lisbon, Portugal, July 4-7, 2006.

``ASAP: an AS-Aware Peer-relay protocol for high quality VoIP", Proceedings of the 26th International Conference on Distributed Computing Systems (ICDCS'06), Lisbon, Portugal, July 4-7, 2006.

``A locality-aware cooperative cache management protocol to improve network file system performance", Proceedings of the 26th International Conference on Distributed Computing Systems (ICDCS'06), Lisbon, Portugal, July 4-7, 2006.

``Segment-based streaming media proxy: modeling and optimization", IEEE Transactions on Multimedia, Vol. 8, No. 2, 2006.

``Design and evaluation of a scalable and reliable P2P assisted proxy for on-demand streaming media delivery", IEEE Transactions on Knowledge and Data Engineering, Vol. 18, No. 5, 2006.

``MESA: reducing cache conflicts by increasing static and run-time methods", Proceedings of International Symposium on Performance Analysis of Systems and Software (ISPASS-2006), Austin, Texas, March 19-21, 2006.

``Exploiting idle communication power to improve wireless network performance and energy efficiency", Proceedings of INFOCOM'06, Barcelona, Spain, April 23-29, 2006.


2005

``Fast proxy delivery of multiple streaming sessions in shared running buffers", IEEE Transactions on Multimedia, Vol. 7, No. 6, December 2005.

``DULO: an effective buffer cache management scheme to exploit both temporal and spatial localities", Proceedings of the 4th USENIX Conference on Files and Storage Technologies (FAST'05), San Francisco, CA, December 14-16, 2005.

``Coordinated data prefetching for Web contents", Computer Communications, Vol. 28, Issue 17, October 2005.

``Look-ahead architecture adaptation to reduce processor power consumption" IEEE Micro, Vol. 25, No. 4, July/August, 2005.

``Measurement, analysis, and modeling of BitTorrent-like systems" Proceedings of ACM SIGCOMM Internet Measurement Conference (IMC'05), New Orleans, LA, October 19-21, 2005.

``Making LRU friendly to weak locality workloads: a novel replacement algorithm to improve buffer cache performance", IEEE Transactions on Computers, Vol. 54, No. 8, 2005.

``Segment-based proxy caching for Internet streaming media delivery", IEEE Multimedia, Vol. 12, No. 3, July-September, 2005.

``Fast and low-cost search schemes by exploiting localities in P2P networks", Journal of Parallel and Distributed Computing, Vol. 65, Issue 6, 2005.

``Design and analysis of wave sensing scheduling protocols for object-tracking applications", Proceedings of the First International Conference on Distributed Computing in Sensor Systems (DCOSS '05), Marina del Rey, California, June 30 - July 1, 2005.

``Analyzing object detection quality under probabilistic coverage in sensor networks", Proceedings of the 13th International Workshop on Quality of Service, (IWQoS'05), Passau, Germany, June 21 - 23, 2005.

``Analysis of multimedia workloads with implications for internet streaming" , Proceedings of the 14th International World Wide Web Conference, (WWW'2005), Chiba, Japan, May 10-14, 2005.

``DISC: Dynamic Interleaved Segment Caching for interactive steaming accesses", Proceedings of the 25th International Conference on Distributed Computing Systems, (ICDCS'2005), Columbus, Ohio, June 6-9, 2005.

``Locality awareness in unstructured peer-to-peer systems", IEEE Transactions on Parallel and Distributed Systems, Vol. 16, No. 2, February 2005.

``CLOCK-Pro: an effective improvement of the CLOCK replacement", Proceedings of 2005 USENIX Annual Technical Conference (USENIX'05), Anaheim, CA, April 10-15, 2005.

Clock-Pro patches are available in Linux Kernel:

``SCOPE: scalable consistency maintenance in structured P2P systems", Proceedings of IEEE INFOCOM 2005 Conference, Miami, Florida, March 13-17, 2005.

``Token-ordered LRU: an effective page replacement policy and its implementation in Linux systems", Performance Evaluation, Vol. 60, Issue 1-4, 2005.

The token algorithm has been adopted in Linux Kernel


2004

``A study on object tracking quality under probabilistic coverage in sensor networks", a poster presentation in MobiCom'04, Philadelphia, Pennsylvania, September 26 to October 1, 2004; an extended abstract published in ACM Mobile Computing and Communication Review (MC2R), Vol. 9, No. 1, pp 73-76, January 2005.

``Enforcing direct communications between clients and Web servers to improve proxy performance and security", Software: Practice and Experience, Vol. 34, Issue 12, October 2004.

``Exploiting content localities for efficient search in P2P systems", Proceedings of the 18th International Symposium on Distributed Computing (DISC 2004), Amsterdam, Netherlands, October 4 - 8, 2004.

``Strong cache consistency support for domain name system", a poster presentation in SIGCOMM'04, Portland, Oregon, August 31 - September 3, 2004.

``Design and optimization of large size and low overhead off-chip caches", IEEE Transactions on Computers, Vol. 53, No. 7, 2004.

``Building a large and efficient hybrid peer-to-peer Internet caching system" , IEEE Transactions on Knowledge and Data Engineering, Vol. 16, No. 6, 2004.

``Adaptive memory allocations in clusters to handle unexpectedly large data-intensive jobs" , IEEE Transactions on Parallel and Distributed Systems, Vol. 15, No. 7, 2004.

``SAT-Match: a self-adaptive topology matching method to achieve low lookup latency in structured P2P overlay networks" , Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS'04), Santa Fe, New Mexico, April 26-30, 2004.

``PROP: a scalable and reliable P2P assisted streaming proxy system" , Proceedings of the 24th International Confernece on Distributed Computing Systems, (ICDCS'04), Tokyo, Japan, March 23-26, 2004.

``ULC: A file block placement and replacement protocol to effectively exploit hierarchical locality in multi-level buffer caches" , Proceedings of the 24th International Confernece on Distributed Computing Systems, (ICDCS'04), Tokyo, Japan, March 23-26, 2004.

``SRB: Shared Running Buffers in proxy to exploit memory locality of multiple streaming media sessions" , Proceedings of the 24th International Confernece on Distributed Computing Systems, (ICDCS'04), Tokyo, Japan, March 23-26, 2004.

``Locality-aware topology matching in P2P systems" , Proceedings of IEEE INFOCOM'04, Hong Kong, March 7-11, 2004.

``Designs of high quality streaming proxy systems" , Proceedings of IEEE INFOCOM'04, Hong Kong, March 7-11, 2004.

``Investigating performance insights of segment-based proxy caching of streaming media strategies",, Proceedings of ACM International Conference on Multimedia Computing and Networking (MMCN'04), January 21-22, 2004.


2003

``Auto-CFD: efficiently parallelizing CFD applications on clusters" Proceedings of IEEE International Confernece on Cluster Computing, (Cluster'03), December 1-4, 2003.

``Efficient Distributed Disk Caching in Data Grid Management" Proceedings of IEEE International Confernece on Cluster Computing, (Cluster'03), December 1-4, 2003.

``On scalable and locality aware Web file sharing", Journal of Parallel and Distributed Computing, Vol. 63, No. 10, 2003.

``Low cost and reliable mutual anonymity protocols in peer-to-peer networks", IEEE Transactions on Parallel and Distributed Systems, Vol. 14, No. 9, 2003.

``Accurately modeling workload interactions for deploying prefetching in Web servers", , Proceedings of 2003 International Conference on Parallel Processing, (ICPP'03), Kaohsiung, Taiwan, China, October 6-9, 2003.

``LighFlood: an efficient flooding scheme for file search in unstructured peer-to-peer systems", Proceedings of 2003 International Conference on Parallel Processing, (ICPP'03), Kaohsiung, Taiwan, China, October 6-9, 2003.

``Adaptive and lazy segmentation based proxy caching for streaming media delivery" , Proceedings of 13th ACM International Workshop on Network and Operating Systems Support for Design Audio and Video, (NOSSDAV'03), Monterey, California, USA, June 1-3, 2003.

``Mutual anonymity protocols for hybrid peer-to-peer systems" , Proceedings of 23rd International Conference on Distributed Computing Systems, (ICDCS'03), Providence, Rhode Island, May 19-22, 2003.

``A popularity-based prediction model for Web prefetching", IEEE Computer, Vol. 36, No. 3, March, 2003.


2002

``Detective borwsers: a software technique to improve Web access performance and security", Proceedings of the 7th International Workshop on Web Content Caching and Distribution, (WCW'02), Boulder, Colorado, August 14-16, 2002.

``Access-mode predictions for low-power cache design", IEEE Micro, Vol. 22, No. 2, March/April, 2002.

``LIRS: an efficient low inter-reference recency set replacement to improve buffer cache performance" , Proceedings of the 2002 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems, (SIGMETRICS'02), Marina Del Rey, California, June 15-19, 2002.

``Adaptive and virtual reconfigurations for dynamic job scheduling in clusters" , Proceedings of 22nd International Conference on Distributed Computing Systems, (ICDCS'02), Vienna, Austria, July 2-5, 2002.

``On reliable and scalable peer-to-peer web document sharing", Proceedings of 2002 International Parallel and Distributed Processing Symposium, (IPDPS'02), Fort Lauderdale, Florida, April 15-19, 2002.

``TPF: a system thrashing protection facility", Software: Practice and Experience, Vol. 32, Issue 3, 2002.

``Dynamic cluster resource allocations for jobs with known and unknown memory demands", IEEE Transactions on Parallel and Distributed Systems, Vol. 13, No. 3, 2002.

``Fine-grain priority scheduling on multi-channel memory systems", Proceedings of the 8th International Symposium on High Performance Computer Architecture, (HPCA-8), Cambridge, Massachusetts, February 2-6, 2002.


2001

``Breaking address mapping symmetry at multi-level of memory hierarchy to reduce DRAM row-buffer conflicts", Journal of Instruction-Level Parallelism, Vol. 3, 2001.

``Adaptive page replacement to protect thrashing in Linux", Proceedings of the 5th USENIX Annual Linux Showcase and Conference, (ALS'01), Oakland, California, November 5-10, 2001.

``Cached DRAM for ILP processor memory access latency reduction", IEEE Micro, Vol. 21, No. 4, July/August, 2001.

``Coordinated data prefetching by utilizing reference information at both proxy and Web servers", Proceedings of the ACM Workshop on Performance and Architecture of Web Servers, (PAWS-2001), Boston, Massachusetts, June 16-17, 2001.

``Exploiting neglected data locality in browsers", Proceedings of the 10th International World Wide Web Conference, (WWW10), Hong Kong, May 1-5, 2001, (an extended abstract).

``Dynamic load sharing with unknown memory demands in clusters", Proceedings of the 21st International Conference on Distributed Computing Systems, (ICDCS'2001), Phoenix, Arizona, April 16-19, 2001.

``Fast bit-reversals on uniprocessors and shared-memory multiprocessors", SIAM Journal on Scientific Computing, Vol. 22, No. 6, 2001.

``Architectural effects of symmetric multiprocessors on TPC-C commercial workload", Journal on Parallel and Distributed Computing, Vol. 61, 2001.


2000

``A permutation-based page interleaving scheme to reduce row-buffer conflicts and exploit data locality", Proceedings of the 33rd Annual International Symposium on Microarchitecture, (Micro-33), Monterey, California, December 10-13, 2000.

The permutation technique has been adopted in the memory controller in the Sun MicroSystems' UltraSPARC IIIi processor.

``Improving memory performance of sorting algorithms", ACM Journal on Experimental Algorithmics, Vol. 5, 2000.

``Memory hierarchy considerations for cost-effective cluster computing", IEEE Transactions on Computers, Vol. 49, No. 9, 2000.

``Incorporating job migration and network RAM to share memory resources", Proceedings of the 9th IEEE International Symposium on High Performance Distributed Computing, (HPDC-9), Pittsburgh, Pennsylvania, August 1-4, 2000.

``Effective Load Sharing on Heterogeneous Networks of Workstations", Proceedings of the 2000 International Parallel and Distributed Processing Symposium, (IPDPS'2000), Cancun, Mexico, May 1-5, 2000.

``Cacheminer: a runtime approach to exploit cache locality on SMP", IEEE Transactions on Parallel and Distributed Systems, Vol. 11, No. 4, 2000.

``Improving distributed workload performance by sharing both CPU and memory resources", Proceedings of the 20th International Conference on Distributed Computing Systems, (ICDCS'2000), Taipei, Taiwan, April 10-13, 2000.


1999

``Cache-optimal methods for bit-reversals", Proceedings of Supercomputing'99, (SC'99), November, Portland, Oregon, 1999.

``Analysis of commercial workload on SMP multiprocessors", Proceedings of Performance'99 August, 1999.

``Profit-effective parallel computing", IEEE Concurrency, Vol. 7, No. 2, 1999.

``The impact of memory hierarchies on cluster computing", Proceedings of 13th International Parallel Processing Symposium & 10th Symposium on Parallel and Distributed Processing, (Second Merged Symposium IPPS/SPDP'99), April, 1999.

``Engineering workstations", Encyclopedia of Electrical and Electronics Engineering, John Wiley & Sons, Publishers, February, 1999.

``Performance models and simulation", Chapter 6, High Performance Cluster Computing, Volume 1, edited by R. Buyya, Prentice Hall, New Jersey, 1999.

``Comparative evaluation and case studies of shared-memory and data-parallel execution patterns", Scientific Programming, Vol. 7, No. 1, 1999.


1998

``Lock Bypassing: an efficient algorithm for concurrently accessing priority heaps", ACM Journal on Experimental Algorithmics, Vol. 3, No. 3, 1998.

``A memory-layout oriented run-time technique for locality optimization", Proceedings of 1998 International Conference on Parallel Processing, (ICPP'98), August 1998.

``Characterizing and scheduling communication tasks of parallel and sequential jobs on networks of workstations", Computer Communications, Vol. 21, Issue. 5, 1998.

``An Integrated Approach of Performance Prediction on Networks of Workstations" , Chapter 4, Advanced Computer System Design, K. Bagchi, J. Walrand and G.Zobrist, Eds, Gordon and Breach Publishers, 1998.

Exploiting Cache Locality on Symmetric Multiprocessors: A Run-Time Approach, Ph.D. Dissertation, College of William and Mary, May 1998.


1997

``Two fast and high-associativity cache schemes", IEEE Micro, October, 1997.

``A comparative evaluation of hierarchical network architecture of the HP-Convex Exemplar", Proceedings of ICCD'97.

``Coordinating parallel processes on networks of workstations", Journal of Parallel and Distributed Computing, Vol. 46, No. 2, 1997.

``Effectively scheduling parallel tasks and communications on networks of workstations", Proceedings of Euro-Par'97.

``Nova visualization for optimization of data-parallel programs", Proceedings of Euro-Par'97.

``Distributed edge detection: issues and implementations", IEEE Computational Science and Engineering, Spring Issue, 1997.

``Software support for multiprocessor latency measurement and evaluation", IEEE Transactions on Software Engineering , Vol. 23, No. 1, 1997.

``Adaptively scheduling parallel loops on distributed shared-memory systems", IEEE Transactions on Parallel and Distributed Systems, Vol. 8, No. 1, 1997.


1996

``Semi-empirical multiprocessor performance predictions" , Journal of Parallel and Distributed Computing, Vol. 39, No. 1, 1996.

``An effective and practical performance prediction model for parallel computing on non-dedicated heterogeneous NOW" , Journal of Parallel and Distributed Computing, Vol. 38, No. 1, 1996.

``An adaptive loop scheduling algorithm on shared-memory systems" , Proceedings of the 8th Symposium on Parallel and Distributed Processing, IEEE Computer Society Press, October, 1996.

``Evaluating and designing software mutual exclusion algorithms on shared-memory multiprocessors" , IEEE Parallel & Distributed Technology, Spring Issue, 1996.

``Simulation of heterogeneous networks of workstations" , Proceedings of MASCOTS'96, IEEE Computer Society Press, February, 1996.

``A fast token-chasing mutual exclusion algorithm in arbitrary network topologies" , Journal of Parallel and Distributed Computing, Vol. 35, No. 2, 1996.

``Parallelizing FDTD Methods for Solving Electromagnetic Scattering Problems" , Applications on Advanced Architecture Computers, G. Astfalk Eds., SIAM Press, 1996.


1995

``Comparative modeling and evaluation of CC-NUMA and COMA on hierarchical ring architectures", IEEE Transactions on Parallel and Distributed Systems, Vol. 6, No. 12, 1995.

``Modeling and characterizing parallel computing performance on heterogeneous networks of workstations" , Proceedings of the 7th IEEE Symposium on Parallel and Distributed Processing, IEEE Computer Society Press, October, 1995.

``*Graph: a tool for visualizing communication and optimizing layout in data-parallel programs" , Proceedings of the 1995 International Conference on Parallel Processing, CRC Press, Vol. 2, August, 1995.

``Software support for asynchronous computing across networks" , Proceedings of the 19th Annual International Computer Software and Application Conference , IEEE Computer Society Press, August, 1995.

``Multiprocessor scalability predictions through detailed program execution analysis" , Proceedings of the 9th ACM International Conference on Supercomputing, ACM Press, July, 1995. (Best Paper Award ).

``Comparative performance analysis and evaluation of hot spots on network-based shared-memory architectures", IEEE Transactions on Parallel and Distributed Systems, Vol. 6, No. 8, 1995.

``Parallelizing an oil refining simulation: numerical methods, implementations and experience", Parallel Computing , Vol. 21, No. 4, 1995.


1994

"Distributed image edge detection methods and performance", Proceedings of the Sixth IEEE Symposium on Parallel and Distributed Processing , IEEE Computer Society Press, October, 1994.

"Performance predictions on implicit communication systems", Proceedings of the Sixth IEEE Symposium on Parallel and Distributed Processing , IEEE Computer Society Press, October, 1994.

``Distributed computation of electromagnetic scattering problems using finite-difference time-domain decompositions", Proceedings of the Third IEEE International Symposium on High-Performance Distributed Computing , IEEE Computer Society Press, August, 1994.

``Latency metric: an experimental method for measuring and evaluating parallel program and architecture scalability", Journal of Parallel and Distributed Computing , Vol. 22, No. 3, 1994.

``Measuring and analyzing parallel computing scalability", Proceedings of the 1994 International Conference of Parallel Processing , CRC Press, Vol. II, August, 1994.

``Comparative performance evaluation of spin-lock synchronization on MIN-based and HR-based multiprocessors", IEEE Parallel and Distributed Technology, Spring Issue, 1994.

``Computation and communication patterns of large-scale image convolutions on parallel architectures", Proceedings of the 8th International Parallel Processing Symposium, IEEE Computer Society Press, April, 1994.

``Evaluation and measurement of multiprocessor latency patterns", Proceedings of the 8th International Parallel Processing Symposium , IEEE Computer Society Press, April, 1994.

Tutorial on Multiprocessor Performance Measurement and Evaluation , IEEE Computer Society Press, 1994.

``Triangular decomposition methods for solving reducible nonlinear systems of equations", SIAM Journal on Optimization , Vol. 5. No. 2, 1994.

``Spin-lock synchronization on the Butterfly and KSR1", IEEE Parallel & Distributed Technology, Vol. 2, Spring Issue, 1994.


Find the hidden treasure, eh?!