Research Interests
My research interests cover different aspects of
PC cluster technology - architecture, system software, applications -
and the intersection of cluster architecture and Grid computing.
One important focus of my investigation is how to integrate
Terabyte-sized disk storage built using off-the-shelf components into a
parallel machine with supercomputer performance and enhanced I/O
capabilities. The node parallelism of cluster architectures present an
unique opportunity to increase the performance of file access if
augmented with a well designed striped file system that can leverage
the parallelism of disk access. Another focus of my research is the
investigation of mechanisms to increase the efficiency of data
transport between remote clusters over wide area networks. The purpose
of this project is to overcome current performance limitations seen by
data intensive Grid applications in accessing remote data repositories.
On the application side I am studying computationally intensive
Computational Biology problems involving large amounts of data such as
genome assemblies, whole genome comparisons, and tools for high
sensitivity queries to biological databases.
Many of our experiments start on a small
cluster (called Datacluster) that dr. Srini Parthasarathy and I use for
our research. For larger runs we use the Ohio Supercomputer Center
machines, particularly the larger Itanium2 cluster. Our Myrinet
interconnected cluster consists of
eight nodes plus a plus one filewerver. Each node is a Dell
PowerEdge1400
equipped with two 1GHz Pentium III, 1 GB of RAM, 18 GB SCSI disk, two
60
GB IBM Deskstar 75GXP IE disks connected to a 3Ware controller in Raid
0 configuration (hardware striping). The the file server has the same
features
except 2GB of RAM, three 40GB SCSI disks and no IDE disks. We are
currently
running the Red Hat distribution of Linux on the cluster. A dedicated 1
Gb/s link to the Ohio Supercomputer Center is used for high throughput
data transfer and Grid
computing experiments.
Current Projects
High Performance I/O
Today several TBs of disk storage can be easily added to a cluster for
less than $1,000/TB by installing a few IDE disks on each node. Besides
the low cost, such a configuration has other benefits such as a large
aggregate disk access bandwidth and the availability of resources for
distributed preprocessing and caching of data. Currently however there
are no robust system tools capable of reaping all the potential
benefits of this type of distributed storage. We are interested
in developing tools and system software to enable parallel applications
to efficiently and transparently access storage on multiple cluster
nodes. We are looking at existing parallel file systems (PVFS) and
parallel I/O libraries (MPI-IO) and studying how they can be adapted
and optimized for our purposes. Some of this work is done in
collaboration with the Ohio Supercomputer Center in the context of the
OSC Mass Storage project.
The Organic Grid
Desktop grids have recently been used to perform some of the largest
computations in the world and have the potential to grow by several
more orders of magnitude. However, current approaches to utilizing
desktop resources require either centralized servers or extensive
knowledge of the underlying system, limiting their scalability.
We propose a biologically inspired and fully-decentralized approach to
the organization of computation that is based on the autonomous
scheduling of strongly mobile agents on a peer-to-peer network. Our
approach achieves the following design objectives: near-zero knowledge
of network topology, zero knowledge of system status, autonomous
scheduling, distributed computation, lack of specialized nodes. Every
node is equally responsible for scheduling and computation, both of
which are performed with practically no prior knowledge about the
system.
We have implemented an extension of Java with strong mobility that
allows multi-threaded agents to migrate with all of their execution
state. We built a grid infrastructure, the Organic Grid, in which an
application is scheduled by encapsulating it in an agent together with
a scheduler specific to the application characteristics. We are
currently working on a screen saver for deploying the Organic Grid to
desktop PCs. This work is a collaboration with Dr. Gerald Baumgartner.
Cellular Computation
We have recently started exploring new concepts at the intersection of
computing and biology. Using quantitative models of protein-DNA
binding found
in the literature, we are studying how to build sequential circuits
using the transcriptional machinery of the cell. The basic idea is that
the recruitment of RNA polymerase to a gene promoter region (an event
that starts the expression of the gene and then the production of the
protein it encodes) is modulated by the action of multiple
transcription factors (a class of DNA-binding proteins). Using this
principle, properly designed configurations of transcription factors
implement elementary
gate functionality (such as AND, OR, XOR) in terms of protein
concentrations. Starting with the definition of a finite state machine,
we are investigating how to define a configuration of transcription
factors that implements the given machine. Other researchers have
studied the case of combinatorial logics, and we build upon their
results in tackling the challenge of implementing arbitrary sequential
circuits.
Data transport over the Grid
The increasing need to access distant large data sets is the motivation
behind the study of novel techniques to increase the throughput of data
transfer between remote sites. One of the tools we are developing
is an enhanced version of a remote storage access tool called Storage
Resource Broker (SRB); SRB is a production quality tool developed at
the San Diego Supercomputer Center. The performance enhanced version of
SRB employs several strategies to increase the amount of data moved per
unit of time between two remote machines. One such strategy is to
introduce a notion of pipelining in handling the data and try to
overlap different stages of the transfer. Another strategy is to stripe
data across several parallel connections between the two remote sites.
In a collaboration with researchers at the San Diego Supercomputer
Center I am developing a MPI-IO interface to SRB that would enable
parallel applications to take full advantage of the aggregate bandwidth
of the network striping.
Data intensive Computational Biology
I keep experimenting with our cluster to learn new ways of building and
programming clusters specifically to solve demanding computational
biology problems. For example in a recent project on the assembly and
annotation of a complete mammalian genome directed by Dr. Bo Yuan at
the OSU Medical College, the development of a parallel version of a
popular bioinformatic tool (BLAST) and an enhanced 1 TB storage system
resulted in an order of magnitude improvement in the speed of the large
scale computation required for the assembly.
In a separate project in collaboration with Dr. Ralf Bundschuh in the
Physics Department, we are studying how to enhance the sensitivity of
PSI-BLAST using a novel statistical theory of sequence alignments.
PSI-BLAST is the program of choice for the search of large protein
databases. This new theory should enable the creation of a version of
PSI-BLAST in which the use of position-specific gap costs should enable
the detection of very weak and thus previously undetectable alignments.
Projects I have been
involved with in the past
- High-performance messaging layers for a
cluster of
workstations interconnected with a Myrinet
network (FM
project).
- High performance implementation of MPI
for
Myrinet
(MPI-FM
project)
- Algorithms for collective communications
on
clusters.
- Scalability of large scale HPVM
clusters including the NT
supercluster at the NCSA site in Urbana.
- Design of a terabyte storage server based
on
HPVM
cluster technology.
Donwloads
- Supplementary information for the Biochemical CPU paper can be
found here
- The SEMPLAR library for high performance remote access to SRB
data repositories can be donwloaded here. SEMPLAR is an
implementation of MPI-IO over SRB that extends the parallel I/O
interface to enable direct access of remote data servers.
Publications
Copyright disclaimer: This material is
presented
to ensure timely dissemination of scholarly and technical work.
Copyright
and all rights therein are retained by authors or by other copyright
holders.
All persons copying this information are expected to adhere to the
terms
and constraints invoked by each author's copyright. In most cases,
these
works may not be reposted without the explicit permission of the
copyright
holder.
Journals
-
M. Lauria, F. Iorio, D. di Bernardo, "NIRest: a tool for gene network and
mode of action inference", Annals of the New York Academy of Sciences, in press, 2008.
- A.J.
Chakravarti, G. Baumgartner, M. Lauria., "Self-Organizing
Scheduling on the Organic
Grid", Int.
Journal. of High Performance Computing Applications, Vol. 20, No. 1,
Spring 2006, pp. 115-130.
- A.J.
Chakravarti, G. Baumgartner, M. Lauria.
“The Organic
Grid: Self-Organizing
Computation on a Peer-to-Peer Network”, IEEE Transactions on
Systems, Man, and
Cybernetics, Part A, Vol. 35, No. 3, May 2005, pp. 373-384
- M. Pillai, M. Lauria, "A High
Performance Redundancy Scheme for Cluster File Systems", Int.
Journal. of High Performance Computing and Networking, Vol. 2, Nos.
2/3/4, 2004
- Yuheng Li, M. Lauria, R. Bundschuh, "Using
Hybrid Alignment for Iterative Sequence Database Searches", Concurrency
and Computation: Practice and Experience, Volume 16, Issue 9, August
2004, pp. 841-853
- M. Bernaschi, G. Iannello, M. Lauria, "Efficient
Implementation of Reduce-Scatter in MPI", Journal of System
Architecture
(2003), doi:10.1016/S1383-7621(03)00059-6
, Vol. 49, No. 3, August 2003, pp.
89-108
- M. Lauria, S. Pakin, A. Chien "Efficient
Layering for High Speed Communication: the MPI over Fast Messages (FM)
Experience", Cluster Computing, 2 (1999), pp. 107-116.
- A. Chien, M. Lauria, R. Pennington, M.
Showerman,
G. Iannello, M. Buchanan, K. Connelly, L. Giannini, G. Koenig, S.
Krishnamurthy,
Q. Liu, S. Pakin, G. Sampemane "Design
and Evaluation of an HPVM-based Windows NT Supercomputer",
The
International Journal of High-Performance Computing Applications, Vol.
13, No. 3, Fall 1999, pp. 201-219
- M. Lauria, A. Chien. "MPI-FM:
High Performance MPI on Workstation Clusters" , Journal of
Parallel
and Distributed Computing, Vol 40, No. 1, January
1997, pp. 4-18.
Conferences
-
Y. Dimitrov, C. Giovine, G. Mango, M. Lauria, "A combinatorial model for self-organizing networks", Workshop on Large-Scale and Volatile Desktop Grids (PCGrid 2007), held in conjunction with IPDPS 2007, Long Beach, CA, March 2007.
- N. Ali, M. Lauria. "Improving the
Performance of Remote I/O
using Asynchronous Primitives", HPDC-15, June 2006, Paris.
- Yuheng Li, M. Lauria, R. Bundschuh. "Suboptimal Alignments
Improve the Detection of Weak Homologs in Sequence Database Searches",
IEEE 5th Symposium on Bioinformatic and Bioengineering (BIBE 2005),
Minneapolis, MN, October 19-21, 2005.
- D. Cotroneo, G. Paolillo, S. Russo, M. Lauria. "CSAR-2: a
Case Study of Parallel File System Dependability Analysis",
2005 International Conference on High Performance Computing and
Communications (HPCC-05), Sorrento, Italy, 21-24 September, 2005,
Lecture Notes in Computer Science
series,
Springer-Verlag, Vol. 3726, pp. 190-199.
- N. Ali, M. Lauria. “SEMPLAR: High-Performance Remote Parallel
I/O
over SRB”, IEEE International Symposium on Cluster Computing
and the Grid (CCGrid2005), Cardiff, UK, May 2005
- A.J. Chakravarti, G. Baumgartner, M. Lauria. "Application-Specific
Scheduling for the
Organic Grid", 5th IEEE/ACM
International Workshop on Grid Computing (GRID 2004),
Pittsburgh, November 2004. Available as Technical Report
OSU-CISRC-4/04-TR23, Dept. of Computer and Information Science, The
Ohio State University, April 2004.
- M. Pillai, M. Lauria. "RAAC:
An Architecture for Scalable, Reliable Storage in Clusters",
IEEE Int.
Conference on Cluster Computing (Cluster '04),
20-23 September 2004, San Diego, CA.
- Mario Lauria, Kaustubh Bhalerao, Muthu M Pugalanthiran, Bo
Yuan, "Building
blocks of a biochemical CPU based on DNA transcription logic",
3rd Workshop on Non-Silicon Computation (NSC-3), Munich, June 2004
(held in conjunction with ISCA 2004). [Supplementary
information]
- A.J. Chakravarti, G. Baumgartner, M. Lauria. "The Organic
Grid: Self-Organizing
Computation on a Peer-to-Peer Network", International
Conference on Autonomic Computing (ICAC '04), New York, NY, 17-18 May
2004.
An extended version of this paper is available as Technical Report
OSU-CISRC-10/03-TR55, Dept. of Computer and Information Science,
The Ohio State University, October 2003.
- Daolong Wang, Mario Lauria, Bo Yuan, Fred A. Wright, "Mega
Weaver: A Simple Iterative Approach for BAC Consensus Assembly",
The Second Asia-Pacific Bioinformatics Conference (APBC2004),
Dunedin, New Zealand, CRPIT, Vol 29, pp. 145-153, January 2004.
- M. Pillai, M. Lauria, "A High
Performance
Redundancy Scheme for Cluster File Systems", IEEE Int.
Conference on Cluster Computing (Cluster 2003), p. 216, December 2003,
Hong
Kong.
- M. Pillai, M. Lauria, "CSAR: Cluster
Storage with Adaptive Redundancy", ICPP-03, pp. 223-230,
October
2003, Kaohsiung, Taiwan, ROC
- R. Sundaresan, M. Lauria, T. Kurc, S.
Parthasarathy,
and J. Saltz, "Adaptive polling of Grid
resource monitors using a
slacker
coherence model", HPDC-12, pp. 260-269, Seattle, June 2003
- R. Sundaresan, T. Kurc, M. Lauria, S.
Parthasarathy,
and J. Saltz, "A Slacker
Coherence
Protocol for Pull-based Monitoring of On-line Data Sources",
CCGrid
2003, pp. 250-257, May 2003, Tokyo, Japan.
- Yuheng Li, M. Lauria, R. Bundschuh, "Using
Hybrid Alignment for Iterative Sequence Database Searches", 2nd
IEEE International Workshop on High Performance Computational Biology
(HiCOMB
2003), April 22, 2003, Nice, France (held in conjunction with IPDPS
2003), p. 152.
- E. Nallipogu, F. Ozguner, M.
Lauria, "Improving
the Throughput of Remote Storage Acccess through pipelining",
3rd
International Workshop on Grid Computing (GRID 2002), Baltimore,
November,
2002 (held in conjunction with Supercomputing 2002), Lecture
Notes
in Computer Science series, Springer-Verlag, Vol.
2536, pp. 305-316.
- K. Bell, A. Chien, M. Lauria, "A
High-Performance Cluster Storage Server", HPDC-11, Edinburgh,
July
2002, pp. 311-320.
- M. Bernaschi, G. Iannello, M. Lauria, "Efficient
Implementation of Reduce-Scatter in MPI", 10th EUROMICRO
Workshop
on Parallel and Distributed Processing (PDP2002), January 2002.
- M. Lauria, S. Pakin, A. Chien "Efficient
Layering for High Speed Communication: Fast Messages 2.x" ,
HPDC7,
Chicago, July 98, pp. 10-20.
- G. Iannello, M. Lauria, S. Mercolino, "Cross-platform
Analysis of Fast Messages for Myrinet" , CANPC '98 Workshop
(held in conjunction with HPCA4),
Las Vegas, February 1998, Lecture Notes in Computer Science
series,
Springer-Verlag, Vol.
1362, pp. 217-231.
- G. Iannello, M. Lauria, S. Mercolino, "LogP
Performance Characterization of Fast Messages atop Myrinet",
6th
EUROMICRO Workshop on Parallel and Distributed Processing (PDP98),
January,
1998.
- Chien A.A., Pakin S., Lauria M., M.
Buchanan,
K.
Hane, L. Giannini, and J. Prusakova, "High
Performance Virtual Machines (HPVM): Clusters with Supercomputing APIs
and Performance" , Eighth SIAM Conference on Parallel
Processing
for Scientific Computing (PP97), March, 1997.
- S. Pakin, M. Lauria, A. Chien "High
Performance Messaging on Workstations: Illinois Fast Messages (FM) for
Myrinet" Supercomputing 95, San Diego, November, 1995.
Other
- B. Falsafi, M. Lauria (Eds.), Proceedings
of CANPC'00, 4th International workshop on Network-Based
Parallel
Computing - Communication, Architecture, and Applications, Toulouse,
France,
Jan 8, 2000, Lecture Notes in Computer Science series, Springer-Verlag,
Vol 1797, ISBN 3-540-67879-4
- A. Sivasubramaniam, M. Lauria (Eds.), Proceedings
of CANPC'99, 3rd International workshop on Network-Based Parallel
Computing
- Communication, Architecture, and Applications, Orlando, Florida, USA,
Jan 9, 1999, Lecture Notes in Computer Science series, Springer-Verlag,
Vol 1602 (1999), ISBN 3-540-65915-3