CSE, OSU

I am a PhD student working in HPCS Lab, advised by Xiaodong Zhang.

You can contact me at huai@cse.ohio-state.edu

Projects


YSmart (homepage): YSmart is a correlation aware SQL-to-MapReduce translator. It can automatically generate MapReduce programs for a SQL query. Its optimizer tries to generate less number of MapReduce jobs by eliminating unnecessary data shuffling. The core optimizer of YSmart has been integrated into Apache Hive, a data warehouse system for Hadoop. This Hive optimizer is called Correlation Optimizer.

Publications


Talks


DOT
  • SoCC 2011 Conference Talk, October 2011.
YSmart

Code


I am a committer of Apache Hive. (My Hive Jiras)