Publications of Xiaodong Zhang


Selected Publications

  • ``Understanding insights into the basic structure and essential issues of table placement methods in clusters", Proceedings of 40th International Conference on Very Large Data Bases (VLDB 2014), Hangzhou, China, September 1-5, 2014. (with Y. Huai, R. Lee, S. Ma, and O. O'Malley, the paper was accepted in PVLDB 2013)

  • ``Major technical advancements in Apache Hive", Proceedings of 2014 ACM SIGMOD Conference on Management of Data (SIGMOD 2014), Snowbird, Utah, June 22-27, 2014. (with A. Chauhan, A. Gates, G. Hagleitner, E. N. Hanson, Y. Huai, R. Lee, O. O'Malley, J. Pandey, and Y. Yuan)

  • ``UNIK: unsupervised social network spam detection", Proceedings of 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, October 27 - November 1, 2013. (with S. Chen, L. Guo, E. Tan, and Yihong (Eric) Zhao)

  • ``S-CAVE: effective SSD caching to improve virtual machine storage performance", Proceedings of 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT 2013), Edinburgh, Scotland, September 7-11, 2013. (with R. Lee, D. Liu, T. Luo, Siyuan Ma, and Li Zhou)

  • ``Hadoop-GIS: a high performance spatial data warehousing system over MapReduce", Proceedings of 39th International Conference on Very Large Data Bases (VLDB 2013), Riva del Garda, Trento, Italy, August 26-30, 2013. (with A. Aji, R. Lee, Q. Liu, J. H. Saltz, H. Vo, and F. Wang)

  • ``The Yin and Yang of processing data warehousing queries on GPU devices", Proceedings of 39th International Conference on Very Large Data Bases (VLDB 2013), Riva del Garda, Trento, Italy, August 26-30, 2013. (with R. Lee and Y. Yuan)

  • ``LDPC-in-SSD: making advanced error correction codes work effectively in solid state drives", Proceedings of 11th USENIX Conference on File and Storage Technologies (FAST'13), San Jose, California, February 12-15, 2013. (with H. Sun, T. Zhang, K. Zhao, W. Zhao, and N. Zheng)

  • ``Accelerating pathology image data cross-comparison on CPU-GPU hybrid systems", Proceedings of 38th International Conference on Very Large Data Bases (VLDB 2012), Istanbul, Turkey, August 27-31, 2012. (with Y. Huai, R. Lee, J. H. Saltz, F. Wang, and K. Wang).

  • ``hStorage-DB: heterogeneity-aware data management to exploit full capacity of hybrid storage systems", Proceedings of 38th International Conference on Very Large Data Bases (VLDB 2012), Istanbul, Turkey, August 27-31, 2012. (with F. Chen, R. Lee, Michael Mesnier, and T. Luo).

  • ``Spam behavior analysis and detection in user generated content on social networks", Proceedings of 32nd International Conference on Distributed Computing Systems (ICDCS 2012), Macau, China, June 18-21, 2012. (with S. Chen, L. Guo, E. Tan, and Yihong (Eric) Zhao).

  • ``BWS: Balanced Work Stealing for time-sharing multicores", Proceedings of ACM EuroSys Conference (EuroSys 2012), Bern, Switzerland, April 10-13, 2012. (with X. Ding, P. B. Gibbons and K. Wang)
    BWS is open source software.

  • ``DOT: a matrix model for analyzing, optimizing and deploying software for big data analytics in distributed systems", Proceedings of 2nd ACM Symposium on Cloud Computing (SOCC 2011), Cascais, Portugal, October 27-28, 2011. (with Y. Huai, R. Lee, C. H. Xia, and S. Zhang).

  • ``YSmart: Yet another SQL-to-MapReduce Translator", Proceedings of 31st International Conference on Distributed Computing Systems (ICDCS 2011), Minneapolis, Minnesota, June 20-24, 2011. Best Paper Award. (with Y. He, Y. Huai, R. Lee, and F. Wang).

    YSmart has been merged into big data warehousing production systems

  • ``Hystor: making the best use of solid state drives in high performance storage systems", Proceedings of 25th ACM International Conference on Supercomputing (ICS 2011), Tucson, Arizona, May 31 - June 4, 2011. Best Paper Award. (with F. Chen and D. A. Koufaty).

    Hystor has had an impact on commercial hybrid storage products, including Apple's Fusion Drive

  • ``SRM-Buffer: An OS Buffer Management Technique to Prevent Last Level Caches from Thrashing in multicores", Proceedings of ACM EuroSys Conference (EuroSys 2011), Salzburg, Austria, April 10-13, 2011. (with X. Ding, and K. Wang)

  • ``RCFile: a fast and space-efficient data placement structure in MapReduce-based warehouse systems", Proceedings of International Conference on Data Engineering (ICDE'11), Hannover, Germany, April 11-16, 2011. (with Y. He, Y. Huai, N. Jain, R. Lee, Z. Shao, and Z. Xu).

    RCFile has been adopted in big data warehouse production systems

  • ``CAFTL: a content-aware flash translation layer enhancing the lifespan of flash memory based solid state drives", Proceedings of 9th USENIX Conference on File and Storage Technologies (FAST'11), San Jose, California, February 15-17, 2011. (with F. Chen, and T. Luo)

  • ``Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing", Proceedings of 17th International Symposium on High Performance Computer Architecture (HPCA-17), San Antonio, Texas, February 12-16, 2011. (with F. Chen, and R. Lee)

  • ``ULCC: a user-level facility for optimizing shared cache performance on multicores", Proceedings of 16th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP 2011), San Antonio, Texas, February 12-16, 2011. (with X. Ding, and K. Wang)
    ULCC is open source software.

  • ``TopBT: a topology-aware and infrastructure-independent BitTorrent client", Proceedings of INFOCOM'10, San Diego, California, March 15-19, 2010. (with S. Chen, L. Guo, T. Luo, S. Ren, and E. Tan).
    TopBT is open source software.

  • ``CUBS: coordinated upload bandwidth sharing in residential networks", Proceedings of 17th International Conference on Network Protocols (ICNP 2009), Princeton, NJ, October 13-16, 2009. (with S. Chen, L. Guo, and E. Tan).

  • ``Enabling software management for multicore caches with a lightweight hardware support", Proceedings of 22nd ACM/IEEE Annual Conference on Supercomputing (SC09), Portland, Oregon, November 14-20, 2009. (with X. Ding, J. Lin, Q. Lu, P. (Saday) Sadayappan, and Z. Zhang).

  • ``Soft-OLP: improving hardware cache performance through software-controlled object-level partitioning", Proceedings of 18th International Conference on Parallel Architectures and Compilation Techniques (PACT 2009), Raleigh, North Carolina, September 12-16, 2009. (with X. Ding, J. Lin, Q. Lu, P. (Saday) Sadayappan, and Z. Zhang).

  • ``MCC-DB: minimizing cache conflicts in multi-core processors for databases", Proceedings of 35th International Conference on Very Large Data Bases (VLDB 2009), Lyon, France, August 24-28, 2009 (with F. Chen, X. Ding, R. Lee, and Q. Lu).

  • ``Analyzing patterns of user content generation in online social networks", Proceedings of 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD-2009), Paris, France, June 28 - July 1, 2009. (with S. Chen, L. Guo, E. Tan, and Y. E. Zhao).

  • ``Understanding intrinsic characteristics and system implications of flash memory based solid state drives", Proceedings of 2009 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems (SIGMETRICS/Performance 2009), Seattle, WA, June 15-19, 2009 (with F. Chen, and D. A. Koufaty).

  • ``BP-Wrapper: a system framework making any replacement algorithms (almost) lock contention free", Proceedings of 25th International Conference on Data Engineering (ICDE'09), Shanghai, China, March 29 - April 4, 2009. (with X. Ding, and S. Jiang).

  • ``The stretched exponential distribution of Internet media access patterns", Proceedings of 27th ACM Symposium on Principles of Distributed Computing (PODC 2008), Toronto, Canada, August 18-21, 2008. (with S. Chen, L. Guo, E. Tan, and Z. Xiao).

  • ``Caching for Bursts (C-Burst): let hard disks sleep well and work energetically", Proceedings of the 13th ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED'08), Bangalore, India, August 11-13, 2008. (with F. Chen).

  • ``Gaining insights into multicore cache partitioning: bridging the gap between simulation and real systems", Proceedings of the 14th International Symposium on High Performance Computer Architecture (HPCA-14), Salt Lake City, Utah, February 16-20, 2008. (with X. Ding, J. Lin, Q. Lu, P. (Saday) Sadayappan, and Z. Zhang).

    The OS-based cache partitioning method in this paper has been used in the Linux kernel for production systems

  • ``PSM-Throttling: minimizing energy consumption for bulk data communications in WLANs", Proceedings of the 15th International Conference on Network Protocols (ICNP'07), Beijing, China, October 16-19, 2007. (with S. Chen, L. Guo, and E. Tan).

  • ``SProxy: a caching infrastructure for Internet streaming", IEEE Transactions on Multimedia, Vol. 9, Issue 5, 2007, pp. 1062-1072. (with S. Chen, B. Shen, and S. Wee).

  • ``DiskSeen: exploiting disk layout and access history to enhance I/O prefetch", Proceedings of 2007 USENIX Annual Technical Conference (USENIX'07), Santa Clara, June 17-22, 2007. (with F. Chen, K. Davis, X. Ding, and S. Jiang).

  • ``Delving into Internet streaming media delivery: a quality and resource utilization perspective", Proceedings of ACM SIGCOMM Internet Measurement Conference (IMC'06), Rio de Janeiro, Brazil, October 25-27, 2006. (with S. Chen, L. Guo, O. Spatscheck, E. Tan, and Z. Xiao)

  • ``SmartSaver: turning flash drive into a disk energy saver for mobile computers", Proceedings of the 11th ACM International Symposium on Low Power Electronics and Design (ISLPED'06), Tegernsee, Germany, October 4-6, 2006. (with F. Chen and S. Jiang).

  • ``ASAP: an AS-Aware Peer-relay protocol for high quality VoIP", Proceedings of the 26th International Conference on Distributed Computing Systems (ICDCS'06), Lisbon, Portugal, July 4-7, 2006. (with L. Guo and S. Ren).

  • ``Exploiting idle communication power to improve wireless network performance and energy efficiency", Proceedings of INFOCOM'06, Barcelona, Spain, April 23-29, 2006. (with S. Chen, X. Ding, L. Guo, Q. Li, and H. Wang).
    The extended version has been published in IEEE Journal on Selected Areas in Communications, Vol. 25, No. 2, 2007.

  • ``Fast proxy delivery of multiple streaming sessions in shared running buffers", IEEE Transactions on Multimedia, Vol. 7, No. 6, December, 2005, pp 1157-1169. (with S. Basu, S. Chen, B. Shen, and Y. Yan).

  • ``DULO: an effective buffer cache management scheme to exploit both temporal and spatial localities", Proceedings of the 4th USENIX Conference on File and Storage Technologies (FAST'05), San Francisco, CA, December 14-16, 2005. (with F. Chen, X. Ding, S. Jiang, and E. Tan).

  • ``Measurement, analysis, and modeling of BitTorrent-like systems", Proceedings of ACM SIGCOMM Internet Measurement Conference (IMC'05), Berkeley, CA, October 19-21, 2005. (with S. Chen, X. Ding, L. Guo, E. Tan, and Z. Xiao).
    The extended version has been published in IEEE Journal on Selected Areas in Communications, Vol. 25, No. 1, 2007.

  • ``Coordinated data prefetching for Web contents", Computer Communications, Vol. 28, Issue 17, 2005, pp. 1947-1958. (with X. Chen).

  • ``Look-ahead architecture adaptation to reduce processor power consumption", IEEE Micro, Vol. 25, No. 4, 2005, pp. 10-19. (with Z. Zhu).

  • ``Segment-based proxy caching for Internet streaming media delivery", IEEE Multimedia, Vol. 12, No. 3, July/September, 2005. (with S. Chen, B. Shen, H. Wang, and S. Wee).

  • ``Analysis of multimedia workloads with implications for Internet streaming", Proceedings of the 14th International World Wide Web Conference, (WWW'05), Chiba, Japan, May 10-14, 2005. (with S. Chen, L. Guo, and Z. Xiao)

  • ``CLOCK-Pro: an effective improvement of the CLOCK replacement", Proceedings of USENIX Annual Technical Conference (USENIX'05), Anaheim, CA, April 10-15, 2005, pp. 323-336. (with F. Chen, and S. Jiang).

    Clock-Pro has been adopted in OS kernels and other data processing systems

  • ``Token-ordered LRU: an effective page replacement policy and its implementation in Linux systems", Performance Evaluation, Vol. 60, Issue 1-4, 2005, pp. 5-29. (with S. Jiang).

    The token algorithm is a part of the Linux kernel

  • ``Design and optimization of large size and low overhead off-chip caches", IEEE Transactions on Computers, Vol. 53, No. 7, 2004, pp. 843-855. (with Z. Zhang and Z. Zhu)

  • ``ULC: A file block placement and replacement protocol to effectively exploit hierarchical locality in multi-level buffer caches", Proceedings of the 24th International Conference on Distributed Computing Systems (ICDCS'04), Tokyo, Japan, March 23-26, 2004. (with S. Jiang).
    The extended version has been published in IEEE Transactions on Computers, Vol. 56, No. 1, 2007.

  • ``Designs of high quality streaming proxy systems", Proceedings of IEEE INFOCOM'04, Hong Kong, March 7-11, 2004. (with S. Chen, B. Shen, and S. Wee).

  • ``Low cost and reliable mutual anonymity protocols in peer-to-peer networks", IEEE Transactions on Parallel and Distributed Systems, Vol. 14, No. 9, 2003, pp. 829-840. (with L. Xiao and Z. Xu).

  • ``A popularity-based prediction model for Web prefetching", IEEE Computer, Vol. 36, No. 3, March 2003, pp. 59-66. (with X. Chen).

  • ``Access-mode predictions for low-power cache design", IEEE Micro, Vol. 22, No. 2, March/April, 2002, pp. 58-71. (with Z. Zhu).

  • ``LIRS: an efficient low inter-reference recency set replacement to improve buffer cache performance", Proceedings of the 2002 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'02), Marina Del Rey, California, June 15-19, 2002. (with S. Jiang).
    The extended version has been published in IEEE Transactions on Computers, Vol. 54, No. 8, 2005.

    The LIRS algorithm has been adopted in major database and software systems

  • ``Fine-grain priority scheduling on multi-channel memory systems", Proceedings of the 8th International Symposium on High Performance Computer Architecture, (HPCA-8), Cambridge, Massachusetts, February 2-6, 2002, pp. 107-116. (with Z. Zhang and Z. Zhu).

  • ``Fast bit-reversals on uniprocessors and shared-memory multiprocessors", SIAM Journal on Scientific Computing, Vol. 22, No. 6, 2001, pp. 2113-2134. (with Z. Zhang).

  • ``A permutation-based page interleaving scheme to reduce row-buffer conflicts and exploit data locality", Proceedings of the 33rd Annual International Symposium on Microarchitecture (Micro-33), Monterey, California, December 10-13, 2000, pp. 32-41. (with Z. Zhang and Z. Zhu).

    The permutation technique has been widely adopted in different commercial processors.

  • ``Memory hierarchy considerations for cost-effective cluster computing", IEEE Transactions on Computers, Vol. 49, No. 9, 2000, pp. 915-933. (with X. Du and Z. Zhu).

  • ``Improving memory performance of sorting algorithms", ACM Journal on Experimental Algorithmics, Vol. 5, No. 3, 2000, pp. 1-22. (with S. Kubricht and L. Xiao).

  • ``Cacheminer: a runtime approach to exploit locality on SMP", IEEE Transactions on Parallel and Distributed Systems, Vol. 11, No. 4, 2000, pp. 357-374. (with Y. Yan and Z. Zhang).

  • ``Two fast and high-associativity cache schemes", IEEE Micro, Vol. 17, No. 5, 1997, pp. 40-49. (with Y. Yan and C. Zhang).

  • ``Coordinating parallel processes on networks of workstations", Journal of Parallel and Distributed Computing, Vol. 46, No. 2, 1997, pp. 125-135. (with X. Du).

  • ``Adaptively scheduling parallel loops in distributed shared memory systems", IEEE Transactions on Parallel and Distributed Systems, Vol. 8, No. 1, 1997, pp. 70-81. (with C. Jin and Y. Yan).

  • ``Semi-empirical multiprocessor performance predictions", Journal of Parallel and Distributed Computing, Vol. 39, No. 1, 1996, pp. 14-28. (with L. Sun and Z. Xu).

  • ``Evaluating and designing software mutual exclusion algorithms on shared-memory multiprocessors", IEEE Parallel & Distributed Technology, Spring Issue, 1996, pp. 25-42. (with R. Castaneda and Y. Yan).

  • ``Multiprocessor scalability predictions through detailed program execution analysis", Proceedings of the 9th ACM International Conference on Supercomputing (ICS'95), Barcelona, Spain, July 1995. Best Paper Award . (with Z. Xu).

  • ``Triangular decomposition methods for solving reducible nonlinear systems of equations", SIAM Journal on Optimization, Vol. 4, No. 2, 1994. (with J. E. Dennis, Jr. and J. M. Martinez).

  • ``Latency metric: an experimental method for measuring and evaluating parallel program and architecture scalability", Journal of Parallel and Distributed Computing, Vol. 22, No. 3, 1994. (with K. He and Y. Yan).

  • ``Parallel methods for solving nonlinear block bordered systems of equations", SIAM Journal on Scientific and Statistical Computing, Vol. 13, No. 4, 1992. (with R. Byrd and R. B. Schnabel).

  • ``System effects of interprocessor communication latency in multicomputers", IEEE Micro, Vol. 11, No. 2, 1991.