Hello and welcome to my home-page!!

I am a fourth year Ph.D student at the Ohio State University. I am working as a Graduate Research Assistant with Dr. D.K.Panda . I am currently involved in the MVAPICH project - a high performance MPI implementation over InfiniBand and iWARP. I am specifically interested in designing efficient algorithms for the collective operations supported in the MPI standard. Most of my work involves leveraging InfiniBand network offload to design non-blocking collective algorithms and understanding their impact on the performance and scalability of parallel scientific applications. I also work on designing multi-core aware, network-toplogy aware and power-aware algorithms for MPI collective operations.

Previously, I pursued my undergraduate studies at the National Institute of Technology Karnataka, Surathkal , India. I graduated in April, 2008. During my undergrad studies, I have also interned at the Indian Institute of Science, Verari Systems, India and Raman Research Institute, India.





Research Publications

    Invited Papers (Book Chapters)

  • Sayantan Sur, Sreeram Potluri, Krishna Chaitanya Kandalla , Hari Subramoni, Karen Tomko, Dhabaleswar K. Panda, "Co-Designing MPI Library and Applications for InfiniBand Clusters," Computer, 06 Sept. 2011. IEEE computer Society Digital Library

  • D. K. Panda, S. Sur, H. Subramoni and K. Kandalla, "Network Support for Collective Communication, Encyclopedia of Parallel Computing, 2011.
  • Conferences

  • J. Jose, H. Subramoni, K. Kandalla, M. Wasi-ur-Rahman, H. Wang, and D. K. Panda, "Scalable Memcached design for InfiniBand Clusters using Hybrid Transports" - Accepted for publication at IEEE CCGrid 2012, USA
  • K. Kandalla, U. Yang, J. Keasler, T. Kolev, A. Moody, H. Subramoni, K. Tomko, J. Vienne and D. K. Panda, "Designing Non-blocking Allreduce with Collective Offload on InfiniBand Clusters: A Case Study with Conjugate Gradient Solvers", Accepted for publication at IPDPS 2012.
  • H. Subramoni, K. Kandalla, J. Vienne, S. Sur, B. Barth, K. Tomko, R. McLay, K. Schulz and D.K. Panda, "Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Cluster" - Accepted for publication at IEEE Cluster 2011, USA
  • Krishna Kandalla, Hari Subramoni, Jerome Vienne, S. Pai Raikar, Karen Tomko, Sayantan Sur and Dhabaleswar Panda, "Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL" The 19th IEEE Annual Symposium
    on High Performance Interconnects, HotI 2011, USA Conference Slides
  • Krishna Kandalla, Hari Subramoni, Karen Tomko, Dmitry Pekurovsky, Sayantan Sur and Dhabaleswar K. Panda, "High-Performance and Scalable Non-Blocking All-to-All with Collective Offload on InfiniBand Clusters: A Study with Parallel 3D FFT" - The International Supercomputing Conference (ISC 2011), Hamburg, Germany Conference Slides Bibtex
  • Hari Subramoni, Krishna Kandalla, Sayantan Sur and Dhabaleswar K. Panda, "Design and Evaluation of Generalized Collective Communication Primitives with Overlap using ConnectX-2 Offload Engine" The 18th Annual Symposium on High Performance Interconnects, HotI 2010.
  • Krishna Kandalla, Emilio P. Mancini, Sayantan Sur and Dhabaleswar K. Panda, Designing Power-Aware Collective Communication Algorithms for InfiniBand Clusters," The 39th International Conference on Parallel Processing, ICPP 2010. Conference Slides. Bibtex

  • Workshops

  • A. Singh, S. Potluri, H. Wang, K. Kandalla, S. Sur and D. K. Panda, "MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and Benefits", Workshop on Parallel Programming on Accelerator Clusters (PPAC '11), held in conjunction with Cluster '11, Sept. 2011.
  • N. Dandapanthula, H. Subramoni, J. Vienne, K. Kandalla , S. Sur, D. K. Panda, and R. Brightwell, "INAM - A Scalable InfiniBand
    Network Analysis and Monitoring Tool", 4th Int'l Workshop on Productivity and Performance (PROPER 2011), in conjunction with EuroPar, Aug. 2011.
  • M. Luo, S. Potluri, P. Lai, E. P. Mancini, H. Subramoni, K. Kandalla , S. Sur and D. K. Panda, "High Performance Design and Implementation of Nemesis Communication Layer for Two-sided and One-Sided MPI" The Workshop on Parallel Programming Models and System Software (P2S2), 2010, held in Conjunction with the 39th International Conference on Parallel Processing, ICPP 2010.
  • Krishna Kandalla, Hari Subramoni, Abhinav Vishnu and Dhabaleswar K. Panda, "Designing Topology-Aware Collective Communication Algorithms for Large Scale Infiniband Clusters : Case Studies with Scatter and Gather" , The 10th Workshop on Communication Architecture for Clusters (CAC 10), held in conjunction with Int'l Parallel and Distributed Processing Symposium (IPDPS 2010). Conference Slides. Bibtex
  • K. Kandalla, H. Subramoni, G. Santhanaraman, M. Koop and D. K. Panda, "Designing Multi-Leader-Based Allgather Algorithms for Multi-Core Clusters", The 9th Workshop on Communication Architecture for Clusters (CAC 09), held in conjunction with Int'l Parallel and Distributed Processing Symposium (IPDPS 2009), Rome, Italy, May 2009 Conference Slides. Bibtex



  • Technical Reports

  • Hao Wang, Miao Luo, Krishna Kandalla, Sayantan Sur, and Dhabaleswar K. Panda, "Can Streaming SIMD Non-Temporal Instructions
    Benefit Intra-node MPI Communication on Modern Multi-core Platforms?" 14 pp. OSU-CISRC-12/10-TR29. Electronic report under
    TR29.pdf OSU-CISRC-12/10--TR29.
  • Krishna Kandalla, Sreeram Potluri, Miao Luo and Dhabaleswar K. Panda, "Designing Multi-Core Aware Inter-Communicator Operations for MPI-2 Dynamic Process Management," 10 pp. OSU-CISRC-11/09-TR54



kandalla@cse.ohio-state.edu