Hakan Ferhatosmanoglu

 

            Associate Professor                                                     Department of Computer Science and Engineering

            The Ohio State University                                          Phone:  (614) 292-6377         

            2015 Neil Ave. Room 689                                          Email: hakan@cse.ohio-state.edu

            Columbus, Ohio 43210-1277                                      http://www.cse.ohio-state.edu/~hakan

 

http://www.cse.ohio-state.edu/../Owner/Desktop/cv-bio/

RESEARCH INTERESTS:

            High performance databases for multi-dimensional and scientific applications

            Storage, retrieval, and efficient I/O in large-scale systems

            Spatial, multimedia, and high dimensional data

            Bioinformatics and biomedical databases

 

WORK EXPERIENCE:

            Associate Professor, Computer Science and Engineering, Ohio State University (OSU), 2007-

            Assistant Professor, Computer Science and Engineering, OSU, 2001-2007.

Courtesy Faculty, Biomedical Informatics, OSU, 2001-

Participating Faculty, Biophysics, Mathematical Biosciences Institute, OSU, 2003-

Graduate Research Associate, Computer Science, University of California, Santa Barbara, 1997-2001.

Research Scientist at AT&T Labs- Research, Florham Park, New Jersey. Design and Implementation of a Framework for Decision Support and Data Mining for Large Data Warehouses, 1999.

Graduate Teaching Associate of Operating Systems, Database Management Systems, Foundations of Computer Science, Discrete Mathematics, Automata Theory and Formal Languages courses. 1997-1999.

Software Engineer at Bilkent University Computer Center, Ankara, Turkey. Design and Implementation of Database Management Systems for Bilkent University Dormitories. 1997.


EDUCATION:

Ph.D. Computer Science, University of California, Santa Barbara. 2001.

Dissertation: Efficient Retrieval and Scalable Storage of Multi-dimensional Data.

Co-advisors: Divyakant Agrawal and Amr El Abbadi.

B.S. Computer Science, Bilkent University, Ankara, Turkey. 1997.

Full scholarship awarded by the university for all undergraduate education.

 

HONORS AND AWARDS:

·         Lumley Research Award, OSU College of Engineering, 2007.

·         NSF CAREER Award, 2006.

·         Early Career Principal Investigator Award, US Department of Energy, 2003.

·         Large Interdisciplinary Grant Award, Office of Research, OSU, 2004.

·         Ranked 1st, Mathematics Competition organized by Turkish National Science and Research Foundation, 1990.

·         Finalist, Nationwide Science and Knowledge Contest organized by Milliyet Education Foundation, 1990.

·         Ranked 45th among around 1.4 million examinees in the centralized National University Entrance Exam in Turkey, 1993.

 

RESEARCH FUNDING:

 

·         NSF Collaborative Systems, NSF Advances in Bio-Informatics, NSF Cyberinfrastructure, US Department of Energy, NASA/ Ohio Aerospace Institute, NSF Research Infrastructure, Ohio Board of Regents, Pfizer, Inc.

 

PUBLICATIONS:

1.      Secondary Bitmap Indexes with Vertical and Horizontal Partitioning . G. Canahuate, T. Apaydin, A. Sacan, H. Ferhatosmanoglu , EDBT, March 2009, pp 600-611.

2.      Mutual Information Based Extrinsic Similarity for Microarray Analysis. D. Ucar, F. Altiparmak, H. Ferhatosmanoglu, S. Parthasarathy: BICoB, April 2009, pp. 424-436.

3.      Integrated Search and Alignment of Protein Structures. A. Sacan, I. Toroslu, H. Ferhatosmanoglu. Bioinformatics, vol. 24, no. 14, December 2008, pp. 2872-9.

4.      CellTrack: An Open-Source Software for Cell Tracking and Motility Analysis. A. Sacan, H. Ferhatosmanoglu, H. Coskun, Bioinformatics, vol. 24, no. 14, July 2008, pp. 1647-1649.

5.      Distance-based indexing of residue contacts for protein structure retrieval and alignment. Sacan, A.; Toroslu, I.H.; Ferhatosmanoglu, H.; BioInformatics and BioEngineering, BIBE, 8th IEEE International Conference, Oct. 2008, pp. 1 – 7.

6.      An Enhanced Partial Order Curve Comparison Algorithm and its Application to Analyzing Protein Folding Trajectories. H. Sun, H. Ferhatosmanoglu, M. Ota, Y. Wang. BMC BioInformatics 2008, Vol. 9 : 344, August 2008 doi:10.1186/1471-2105-9-344.

7.      Dynamic Data Organization for Bitmap Indices. T. Apaydin, G. Canahuate, H. Ferhatosmanoglu, A. Tosun, 3rd International ICST Conference on Scalable Information Systems, InfoScale, June 2008.

8.      Analysis of Basic Data Reordering Techniques. T. Apaydin, A. Tosun, H. Ferhatosmanoglu, 20th Conference on Scientific and Statistical Data Management, SSDBM, July 2008, pp. 517-524.

9.      Incremental Maintenance of Online Summaries Over Multiple Streams. F. Altiparmak, E. Tuncel, H. Ferhatosmanoglu. IEEE Transactions on Knowledge and Data Engineering (TKDE), Volume 20, Issue 2, Feb. 2008, pp. 216-229.

10.      Online Index Recommendations for High-Dimensional Databases using Query Workloads. M. Gibas, G. Canahuate, H. Ferhatosmanoglu. IEEE Transactions on Knowledge and Data Engineering (TKDE), Volume 20, Issue 2, Feb. 2008, pp. 246-260.

11.  Automated Data Discovery in Similarity Score Queries. Altiparmak F., Tosun A. S., Ferhatosmanoglu H., Sacan A., 13th International Conference on Database Systems for Advance Applications (DASFAA), New Delhi, India, March 2008, pp. 440-451.

12.  Michael Gibas, Hakan Ferhatosmanoglu: Indexing, High Dimensional. Encyclopedia of GIS, pp. 502-507, February 2008.

13.  A General Framework for Modeling and Processing Optimization Queries. M. Gibas, N. Zheng, H. Ferhatosmanoglu, VLDB ’07, Vienna, Austria, September, 2007, pp. 1069-1080. (Accept-rate: 17%)

14.  Enhanced Partial Order Curve Comparison over Multiple Protein Folding Trajectories. H. Sun, H. Ferhatosmanoglu, M. Ota, Y. Wang, Life Sciences Society Computational Systems Biology (CSB ’07), San Diego, CA, August 2007,  pp. 299-310.

15.  Update Conscious Bitmap Indices. G. Canahuate, M. Gibas, H. Ferhatosmanoglu. International Conference on Scientific and Statistical Database Management (SSDBM ‘07), Banff, Canada, July 2007, 12 pages.

16.  LFM-Pro: A Tool for Detecting Significant Local Structural Sites in Proteins. A. Sacan, O. Ozturk, H. Ferhatosmanoglu, Y. Wang. Bioinformatics, Volume 23, Number 6, pp. 709-716, March 2007.

17.  A Multi-Metric Similarity Based Analysis of Microarray Data. F. Altiparmak, S. Erdal, O.  Ozturk, H. Ferhatosmanoglu. IEEE International Conference on Bioinformatics and Biomedicine (BIBM),  2007, pp. 317-324.

18.  Data Space Mapping for Efficient I/O in Large Multi-dimensional Databases. H. Ferhatosmanoglu, A. Ramachandran, D. Agrawal, A. El Abbadi. Information Systems Journal, vol. 32, no. 1, pp. 83-103, March 2007.

19.  Access Structures for Angular Similarity Queries. T. Apaydin and H. Ferhatosmanoglu. IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 16, no. 6, pp. 1512-1525, November 2006.

20.  Approximate Encoding for Direct Access and Query Processing over Compressed Bitmaps. T. Apaydin, G. Canahuate, H. Ferhatosmanoglu, A. Tosun. 32nd International Conference on Very Large Data Bases (VLDB ’06), Seoul, Korea, September 2006, pp. 846-857. (Accept-rate: 13.2%)

21.  High Dimensional Nearest Neighbor Searching. H. Ferhatosmanoglu, E. Tuncel, D. Agrawal, A. El Abbadi. Information Systems Journal, vol. 31, no. 6, pp. 512-540, September 2006.

22.  Information Mining over Heterogeneous and High Dimensional Time Series Data in Clinical Trials Databases. F. Altiparmak, H. Ferhatosmanoglu, S. Erdal, C. Trost. IEEE Transactions on Information Technology in Biomedicine, 10(2): 254- 263 (April 2006). 

23.  Predicting the Binding Affinity of MHC class II Peptides. F. Altiparmak, A. Akalin, H. Ferhatosmanoglu. LSS Computational Systems Bioinformatics Conference (CSB ’06). Stanford, CA, pp. 331-334. (Acceptance rate: 19%).

24.  Indexing Incomplete Databases. G. Canahuate, M. Gibas, H. Ferhatosmanoglu. 10th International Conference on Extending Database Technology (EDBT ’06). Munich, Germany, March 2006, pp. 884-901. (Acceptance rate: 16%).

25.  Online Summarization of Dynamic Time Series Data. U. Ogras and H. Ferhatosmanoglu. VLDB Journal, 15(1): 84-98 (January 2006).

26.  Compressing Bitmap Indices by Data Reorganization. A. Pinar, T. Tao, H. Ferhatosmanoglu. 21st IEEE International Conference on Data Engineering (ICDE ’05). Tokyo, Japan, April 2005, pp. 310-321. (Acceptance rate: 13%).

27.  Vector Space Indexing for Biosequence Similarity Searches. O. Ozturk and H. Ferhatosmanoglu. International Journal on Artificial Intelligence Tools, 14(5):  811-826 (2005).

28.  Optimal Data-Space Partitioning of Spatial Data for Parallel I/O. H. Ferhatosmanoglu, D. Agrawal, O. Egecioglu, and A. El Abbadi. Distributed and Parallel Databases (DAPD), 17(1): 75-101 (2005).

29.  Replicated Declustering of Spatial Data. H. Ferhatosmanoglu, A. Tosun, A. Ramachandran. 23rd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS ‘04), June 2004, Paris, France, pp. 125-135. (Acceptance rate: 16%)

30.  Dimensionality Reduction and Similarity Computation using Inner Product Approximations. O. Egecioglu, H. Ferhatosmanoglu, U. Ogras. IEEE Transactions on Knowledge and Data Engineering (TKDE). June 2004 (vol. 16, no.6), pp. 714-726.

31.  A Time-Series Analysis of Gene Expression Data. S. Erdal, O. Ozturk, D. Armbruster, H. Ferhatosmanoglu, W. Ray. IEEE International Symp. On Bioinformatics and Bioengineering (BIBE ’04), Taiwan, March 2004, pp. 366-378.

32.  High Dimensional Reverse Nearest Neighbor Queries. A. Singh, H. Ferhatosmanoglu, A. Tosun. 12th ACM International Conference on Information and Knowledge Management (CIKM'03), pp. 91-98. New Orleans, LA, November 2003. (Acceptance rate: 15%)

33.  Dimensionality Reduction using Magnitude and Shape Approximations. Umit Ogras, Hakan Ferhatosmanoglu. 12th ACM International Conference on Information and Knowledge Management (CIKM'03), pp. 99-107. New Orleans, LA, November 2003. (Acceptance rate: 15%)

34.  Peer-to-Peer Spatial Queries in Sensor Networks. M. Demirbas and H. Ferhatosmanoglu. 3rd IEEE International Conference on Peer-to-Peer Computing (P2P '03), Linkφping, Sweden. September 2003, pp. 32-39.

35.  Efficient k-NN Search on Streaming Data Series. X. Liu and H. Ferhatosmanoglu. 8th International Symposium on Spatial and Temporal Databases (SSTD '03), Santorini, Greece. July 2003, pp. 83-101.

36.  CoMRI: A Compressed Multi-Resolution Index Structure for Sequence Similarity Queries. H. Sun, O. Ozturk and H. Ferhatosmanoglu. IEEE Computer Society Bioinformatics Conference (CSB '03). Stanford, CA. August 2003, pp. 553-558.

37.  Effective Indexing and Filtering for Similarity Search in Large Biosequence Databases. O. Ozturk and H. Ferhatosmanoglu. IEEE International Symp. on Bioinformatics and Bioengineering (BIBE '03), pp. 359-366. Washington, DC. March 2003.

38.  VQ-Index: An Index Structure for Similarity Searching in Multimedia Databases. E. Tuncel, H. Ferhatosmanoglu, K. Rose. ACM Multimedia, Juan Les Pins, France, pp. 543-552. December 2002. (Acceptance rate (A.r.): 14%)

39.  Vulnerabilities in Similarity Search based Systems. A. Tosun and H. Ferhatosmanoglu. ACM International Conference on Information and Knowledge Management (CIKM ‘02), Mc Lean, VA, pp. 110-117. November 2002.

40.  Optimal Parallel I/O Using Replication. A. Tosun, H. Ferhatosmanoglu. In proceedings of International Workshops on Parallel Processing (ICPPW ‘02), Vancouver, Canada, pp. 506-513. August 2002.

41.  Approximate Nearest Neighbor Searching in Multimedia Databases. H. Ferhatosmanoglu, E. Tuncel, D. Agrawal, A. El Abbadi.  In proceedings of the 17th IEEE International Conference on Data Engineering (ICDE ‘01), Heidelberg, Germany. April 2001. (A.r.: 15%)

42.  Efficient Processing of Conical Queries. H. Ferhatosmanoglu, D. Agrawal and A. El Abbadi. In proceedings of the 10th ACM International Conference on Information and Knowledge Management (CIKM ‘01), Atlanta, pp. 1-8. November 2001.

43.  Optimal Partitioning for Efficient I/O in Spatial Databases. H. Ferhatosmanoglu, D. Agrawal, A. El Abbadi. European Conference on Parallel Computing (Euro-Par ‘01), Parallel I/O and Storage Technology, Manchester, United Kingdom, pp. 889-900. August 2001.

44.  Constrained Nearest Neighbor Queries. H. Ferhatosmanoglu, I. Stanoi, D. Agrawal, A. El Abbadi. 7th International Symposium on Spatial and Temporal Database (SSTD ‘01), Los Angeles, CA, pp. 257-276. July 2001.

45.  Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets. H. Ferhatosmanoglu, E. Tuncel, D. Agrawal, A. El Abbadi. In proceedings of the 9th ACM International Conference on Information and Knowledge Management (CIKM ‘00), Washington, DC, pp. 202-209. November 2000.

46.  Dimensionality Reduction and Similarity Computation by Inner Product Approximations. O. Egecioglu and H. Ferhatosmanoglu.  In proceedings of the 9th ACM International Conference on Information and Knowledge Management (CIKM ‘00), Washington, DC, pp. 219-226.  November 2000.

47.  Circular Data-space Partitioning for Similarity Queries and Parallel Disk Allocation. O. Egecioglu and H. Ferhatosmanoglu. In proceedings of the 11th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS ‘99), Boston, MA, pp. 194-200. November 1999.

48.  Clustering Declustered Data for Efficient Retrieval. H. Ferhatosmanoglu, D. Agrawal, A. El Abbadi. In proceedings of the 8th ACM International Conference on Information and Knowledge Management (CIKM ‘99), Kansas City, Missouri, pp. 343-350. November 1999.

49.  Concentric Hyperspaces and Disk Allocation for Fast Parallel Range Searching. H. Ferhatosmanoglu, D. Agrawal, A. El Abbadi. In proceedings of the 15th IEEE International Conference on Data Engineering (ICDE ‘99), Sydney, Australia, pp. 608-615. March 1999. (Acceptance rate: 15%)

Recent Workshops

50.  Investigating the use of Extrinsic Similarity Measures for Microarray Analysis. D. Ucar, F. Altiparmak, H. Ferhatosmanoglu, S. Parthasarathy. Proceedings of the BIOKDD workshop at the ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2007.

51.  Combining Mining Results from Multiple Sources in Clinical Trials and Microarray Applications. F. Altiparmak, O. Ozturk, S. Erdal, H. Ferhatosmanoglu, D. C. Trost. MMIS workshop at the ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2007.

52.  Incremental Quantization for Aging Data Streams. F. Altiparmak, D. Chiu, H. Ferhatosmanoglu. DSMM workshop at IEEE International Conference on Data Mining (ICDM), 2007.

STUDENT SUPERVISION:

 

·         PhD Advisees (6 completed, 3 current)

·         MS Advisees (6 completed, 3 current)

·         PhD Dissertation/Candidacy Committee (40 completed)

·         M.S. Committee (2 completed, 3 current)

·         Undergraduate Students (6 completed)

·         Information Systems advisor in BS CSE program.

 

PROFESSIONAL SERVICES:

 

Program Committee Member

  • SIGMOD 2010, 21 ACM SIGMOD International Conference on Management of Data, Indianapolis, June 2010.
  • ISCIS 2009, Track Chair, Bioinformatics and Bioengineering, 24th International Symp. On Computer and Information Sciences, 2009.
  • SSDBM 2009, 21st International Conference on Scientific and Statistical Database Management, New Orleans, USA, June 2009.
  • ICDE 2009, 25th International Conference on Data Engineering, Shanghai, China, March 2009.
  • SSTD 2009, 21st International Symp. on Spatial and Temporal Databases, Aalborg, Denmark, July 2009.
  • ICDM 2008, IEEE International Conference on Data Mining, Pisa, Italy, December 2008.
  • ACM SAC 2008, 23rd Annual ACM Symposium on Applied Computing, Mobile Computing and Applications, Brazil, March 2008.
  • IEEE CSE 2008, 9th IEEE International Conference on Computational Science and Engineering, Brazil, June 2008
  • DASFAA 2008. 13th International Conference on Database Systems for Advanced Applications, Delhi, India, March, 2008.
  • KDD 2007. 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, August, 2007.
  • ICDE 2007. 23rd IEEE International Conference on Data Engineering, Istanbul, Turkey, April 2007.
  • DASFAA 2007. 12th International Conference on Database Systems for Advanced Applications, Bangkok , Thailand, April 2007.
  • KDD 2006. 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, August 2006.
  • IEEE WirelessCom 2006, Mobile Computing Symposium, Toronto, Canada, June 2006.
  • MDM 2006. Workshop on Mobile Location-Aware Sensor Networks, Nara, Japan, May 2006.
  • DASFAA 2006. 11th International Conference on Database Systems for Advanced Applications, Singapore, April 2006.
  • IEEE BIBE 2005. 5th IEEE Symposium on Bioinformatics and Bioengineering, Minneapolis, Minnesota, October 2005.
  • AINA 2006. 20th IEEE Advanced Information Networking and Applications, Vienna, Austria, April 2006.
  • ACM SAC 2006. 20th ACM Symposium on Applied Computing, Mobile Computing and Applications, Dijon, France, March 2006.
  • ICESS 2005  2nd International Conference on Embedded Software and Systems, Xi'an, P. R. China, December 2005.
  • CISSE 2005. International Joint Conferences on Computer, Information, and Systems Sciences, and Engineering, December 2005.
  • IEEE BIBE 2005. 5th IEEE Symposium on Bioinformatics and Bioengineering, Minneapolis, Minnesota, October 2005.
  • BIOKDD 2005. 5th International Workshop on Data Mining in Bioinformatics (BIOKDD), Chicago, IL, August 2005.
  • ACM SAC 2005. 20th ACM Symposium on Applied Computing, Mobile Computing and Applications, Santa Fe, New Mexico, March 2005.
  • IEEE WirelessCom 2005. Symposium on Mobile Computing. Maui, Hawaii, June 2005.
  • IEEE ICDE 2004. 20th IEEE International Conference on Data Engineering, Boston, Mass., USA, March 2004.
  • IEEE BIBE 2004. IEEE International Symposium on Bioinformatics and Bioengineering, Taiwan, Taichung, Taiwan, ROC, May 2004.
  • ACM SAC 2004. ACM Symposium on Applied Computing, special track on Mobile Computing and Applications (SAC-MCA), Nicosia, Cyprus, March 2004.
  • IEEE ICDE 2003. 19th IEEE International Conference on Data Engineering, Bangalore, India, March 2003.
  • ACM SAC 2003. ACM Symposium on Applied Computing, special track on Mobile Computing and Applications (SAC-MCA), Melbourne, Florida, March 2003.
  • IEEE BIBE 2003. IEEE International Symposium on Bioinformatics and Bioengineering, Washington, DC, March 2003.
  • ACM CIKM 2002. Conference on Information and Knowledge Management, McLean, VA, November 2002.
  • Organization Committee, ACM CIKM 2002.

Panelist and Reviewer for Proposals

National Science Foundation Information & Intelligent Systems (IIS), Information Integration and Informatics, Division of Biological Infrastructure.

Canada National Research Council, Social Sciences and Humanities Research.

Ohio Supercomputer Center.

Reviewer for Journals and Conferences

·         ACM Transactions on Database Systems (TODS)

·         IEEE Transactions on Knowledge and Data Engineering (TKDE)

·         Very Large Databases Journal (VLDB Journal)

·         Oxford Journal on Bioinformatics

·         IEEE Transactions on Information Technology in Biomedicine

·         Parallel and Distributed System

·         International Journal on Information Processing Letters (IPL)

·         Information Sciences

·         Data and Knowledge Engineering

·         The Computer Journal

·         And conferences: VLDB, PODS, ICDE, SIGMOD, CIKM, ICDCS, COMAD, DaWaK, ICPP.

 

Reviewer for Books

 

·         Fundamentals of Database Systems, by Elmasri & Navathe, Addison Wesley.

·         An Information Systems Project Guide (working title), Prentice Hall.

 

      Departmental Service

·      Member, Advisory Committee

·      Member, Faculty Search, Graduate Studies, and Computer Systems Committees

·      External Service, Grad Admission, Undergraduate Studies Committees

 

PROJECTS:

 

    . High-performance multi-dimensional databases: Systems and tools for indexing, querying, and mining of multi-dimensional data.

    . Data Stream Management: Online and real-time compression, indexing, and analysis of streaming data.

    . Data Management and Mining for Coastal Monitoring: A large-scale data management system for monitoring physical and ecological coastal changes.

    . Protein Structure Modeling: Computational geometry and data mining tools for modeling protein functional sites.

    . Microarray and Proteomics Data Management: Management of protein sequences and structures, and gene expression data sets; and methods for integration of these databases.

    . Exploration of Dynamic Sequences in Scientific Databases: Online structures and algorithms to dynamically maintain and analyze data sequences for scientific discovery and monitoring purposes.

    . Real-time dissemination of weather data: Formatting, management, and online compression techniques for effective and efficient dissemination of real-time meteorological data.

    . Data Management in Multi-server architectures: Partitioning and data placement for fast searching of multi-dimensional data in multi-server architectures.

    . Data warehouse: A client-server data warehouse and efficient view maintenance system in a multi-user concurrent environment.

    . Trademark search engine: A web-based search engine for similarity searching of trademark images.

    . Web-based DB integration: A query engine that gathers information from multiple web pages and builds its own up-to-date relational tables.

    . Web transaction monitor: A TP-Monitor (for transactions initiated from web browsers) that provides deadlock-free transactional access to multiple data repositories employing synchronization, concurrency  control and recovery.

    . Cyberinfrastructure System for Coastal Analysis: A data-intensive cyberinfrastructure component for coastal forecasting and change analysis.

    . Clinical Data Mining: Outlier detection and modeling tools in pharmaceutical clinical trials databases.

    . Similarity-based Indexing and Integration of Protein Sequences and Structure.