Hello and welcome to my home-page!!
I am a fourth year Ph.D student at the Ohio State University. I am working as a Graduate Research Assistant with Dr. D.K.Panda .
I am currently involved in the
MVAPICH project - a high performance MPI implementation over InfiniBand
and iWARP. I am specifically interested in designing efficient
algorithms for the collective operations supported in the MPI standard.
Most of my work involves leveraging InfiniBand network offload to design
non-blocking collective algorithms and understanding their impact on the performance and scalability of parallel scientific applications. I also work on designing multi-core aware, network-toplogy aware and power-aware algorithms for MPI collective operations.
Previously, I pursued my undergraduate studies at the
National Institute of Technology Karnataka, Surathkal , India.
I graduated in April, 2008. During my undergrad studies, I have also interned at the
Indian Institute of Science,
Verari Systems, India and Raman Research Institute, India.
Invited Papers (Book Chapters)
-
Sayantan Sur, Sreeram Potluri, Krishna Chaitanya Kandalla , Hari Subramoni,
Karen Tomko, Dhabaleswar K. Panda, "Co-Designing MPI Library and Applications
for InfiniBand Clusters," Computer, 06 Sept. 2011. IEEE computer Society
Digital Library
-
D. K. Panda, S. Sur, H. Subramoni and K. Kandalla, "Network Support for Collective Communication,
Encyclopedia of Parallel Computing, 2011.
Conferences
-
J. Jose, H. Subramoni, K. Kandalla, M. Wasi-ur-Rahman,
H. Wang, and D. K. Panda, "Scalable Memcached
design for InfiniBand Clusters using Hybrid Transports" - Accepted for
publication at IEEE CCGrid 2012, USA
-
K. Kandalla, U. Yang, J. Keasler, T. Kolev, A. Moody,
H. Subramoni, K. Tomko, J. Vienne and D. K. Panda,
"Designing
Non-blocking Allreduce with Collective Offload on
InfiniBand
Clusters: A Case Study with Conjugate Gradient Solvers",
Accepted for publication at IPDPS 2012.
-
H. Subramoni, K. Kandalla, J. Vienne, S. Sur, B. Barth, K. Tomko,
R. McLay, K. Schulz and D.K. Panda, "Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Cluster" - Accepted for
publication at IEEE Cluster 2011, USA
-
Krishna Kandalla, Hari Subramoni, Jerome Vienne, S. Pai Raikar, Karen Tomko, Sayantan Sur and Dhabaleswar Panda, "Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL" The 19th IEEE Annual Symposium
on High Performance Interconnects, HotI 2011, USA
Conference Slides
-
Krishna Kandalla, Hari Subramoni, Karen Tomko, Dmitry Pekurovsky, Sayantan Sur and Dhabaleswar K. Panda, "High-Performance and Scalable Non-Blocking All-to-All with Collective Offload on InfiniBand Clusters: A Study with Parallel 3D FFT" - The International Supercomputing Conference (ISC 2011), Hamburg, Germany
Conference Slides
Bibtex
-
Hari Subramoni, Krishna Kandalla, Sayantan Sur and Dhabaleswar K. Panda, "Design and Evaluation of Generalized Collective Communication Primitives
with Overlap using ConnectX-2 Offload Engine" The 18th Annual Symposium on High Performance Interconnects, HotI 2010.
-
Krishna Kandalla, Emilio P. Mancini, Sayantan Sur and Dhabaleswar K. Panda,
Designing
Power-Aware Collective Communication Algorithms for InfiniBand Clusters,"
The 39th International Conference on Parallel Processing, ICPP 2010.
Conference Slides. Bibtex
Workshops
-
A. Singh, S. Potluri, H. Wang, K. Kandalla, S. Sur and D. K. Panda, "MPI
Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and
Benefits", Workshop on Parallel Programming on Accelerator Clusters (PPAC
'11), held in conjunction with Cluster '11, Sept. 2011.
-
N. Dandapanthula, H. Subramoni, J. Vienne, K. Kandalla , S. Sur, D. K.
Panda, and R. Brightwell, "INAM - A Scalable InfiniBand
Network Analysis
and Monitoring Tool", 4th Int'l Workshop on Productivity and Performance
(PROPER 2011), in conjunction with EuroPar, Aug. 2011.
-
M. Luo, S. Potluri, P. Lai, E. P. Mancini, H. Subramoni, K.
Kandalla , S. Sur and D. K. Panda, "High Performance Design and Implementation
of Nemesis Communication Layer for Two-sided and One-Sided MPI" The Workshop on Parallel Programming Models and System Software (P2S2), 2010, held in Conjunction
with the 39th International Conference on Parallel Processing, ICPP 2010.
-
Krishna Kandalla, Hari Subramoni, Abhinav Vishnu and Dhabaleswar K. Panda,
"Designing Topology-Aware Collective Communication Algorithms for Large
Scale Infiniband Clusters : Case Studies with Scatter and Gather" ,
The 10th Workshop on Communication Architecture for Clusters (CAC
10), held in conjunction with Int'l Parallel and Distributed Processing
Symposium (IPDPS 2010). Conference Slides. Bibtex
-
K. Kandalla, H. Subramoni, G. Santhanaraman, M. Koop and D. K. Panda, "Designing Multi-Leader-Based Allgather Algorithms for Multi-Core Clusters", The 9th Workshop on Communication Architecture for Clusters (CAC 09), held in conjunction with Int'l Parallel and Distributed Processing Symposium (IPDPS 2009), Rome, Italy, May 2009 Conference Slides. Bibtex
Technical Reports
-
Hao Wang, Miao Luo, Krishna Kandalla, Sayantan Sur, and Dhabaleswar K. Panda,
"Can Streaming SIMD Non-Temporal Instructions
Benefit Intra-node MPI
Communication on Modern Multi-core Platforms?" 14 pp. OSU-CISRC-12/10-TR29.
Electronic report under
TR29.pdf OSU-CISRC-12/10--TR29.
-
Krishna Kandalla, Sreeram Potluri, Miao Luo and Dhabaleswar K.
Panda, "Designing Multi-Core Aware Inter-Communicator Operations for MPI-2 Dynamic Process Management," 10 pp. OSU-CISRC-11/09-TR54
|
kandalla@cse.ohio-state.edu
|