Hakan Ferhatosmanoglu
Associate Professor
Department of Computer Science and Engineering
The
2015 Neil Ave., Room 689
Email: hakan@cse.ohio-state.edu
RESEARCH INTERESTS:
High performance databases for multi-dimensional and scientific applications
Storage, retrieval, and efficient I/O in large-scale systems
Spatial, multimedia, and high-dimensional data
Bioinformatics and biomedical databases
WORK EXPERIENCE:
Associate Professor, Computer Science and Engineering,
Assistant Professor, Computer Science and Engineering, OSU, 2001-2007.
Courtesy Faculty, Biomedical Informatics, OSU, 2001-
Participating Faculty, Biophysics, Mathematical Biosciences Institute, OSU, 2003-
Graduate Research Associate, Computer Science, University of California, Santa Barbara, 1997-2001.
Research Scientist at AT&T Labs-Research, Florham Park, New Jersey. Design and Implementation of a Framework for Decision Support and Data Mining for Large Data Warehouses, 1999.
Graduate Teaching Associate of Operating Systems, Database Management Systems, Foundations of Computer Science, Discrete Mathematics, Automata Theory and Formal Languages courses, 1997-1999.
Software Engineer at Bilkent
EDUCATION:
Ph.D. Computer Science,
Dissertation: Efficient Retrieval and Scalable Storage of Multi-dimensional Data.
Co-advisors: Divyakant Agrawal and Amr El Abbadi.
B.S. Computer Science, Bilkent University,
Full scholarship awarded by the university for all undergraduate education.
HONORS AND AWARDS:
· Lumley Research Award,
· NSF CAREER Award, 2006.
· Early Career Principal Investigator Award, US Department of Energy, 2003.
· Large Interdisciplinary Grant Award, Office of Research, OSU, 2004.
· Ranked 1st, Mathematics Competition organized by Turkish National Science and Research Foundation, 1990.
· Finalist, Nationwide Science and Knowledge Contest organized by Milliyet Education Foundation, 1990.
· Ranked 45th among around 1.4 million examinees in the centralized National University Entrance Exam in
RESEARCH FUNDING:
· NSF Collaborative Systems, NSF Advances in Bio-Informatics, NSF Cyberinfrastructure, US Department of Energy, NASA/Ohio Aerospace Institute, NSF Research Infrastructure, Ohio Board of Regents, Pfizer, Inc.
PUBLICATIONS:
2. Mutual Information Based Extrinsic Similarity for Microarray Analysis. D. Ucar, F. Altiparmak, H. Ferhatosmanoglu, S. Parthasarathy. BICoB, April 2009, pp. 424-436.
3. Integrated Search and Alignment of Protein Structures. A. Sacan, I. Toroslu, H. Ferhatosmanoglu. Bioinformatics, vol. 24, no. 24, December 2008, pp. 2872-2879.
4. CellTrack: An Open-Source Software for Cell Tracking and Motility Analysis. A. Sacan, H. Ferhatosmanoglu, H. Coskun. Bioinformatics, vol. 24, no. 14, July 2008, pp. 1647-1649.
5. Distance-based Indexing of Residue Contacts for Protein Structure Retrieval and Alignment. A. Sacan, I. H. Toroslu, H. Ferhatosmanoglu. 8th IEEE International Conference on BioInformatics and BioEngineering (BIBE), Oct. 2008, pp. 1-7.
6. An Enhanced Partial Order Curve Comparison Algorithm and its Application to Analyzing Protein Folding Trajectories. H. Sun, H. Ferhatosmanoglu, M. Ota, Y. Wang. BMC Bioinformatics, vol. 9:344, August 2008, doi:10.1186/1471-2105-9-344.
7. Dynamic Data Organization for Bitmap Indices. T. Apaydin, G. Canahuate, H. Ferhatosmanoglu, A. Tosun. 3rd International ICST Conference on Scalable Information Systems (InfoScale), June 2008.
8. Analysis of Basic Data Reordering Techniques. T. Apaydin, A. Tosun, H. Ferhatosmanoglu. 20th Conference on Scientific and Statistical Data Management (SSDBM), July 2008, pp. 517-524.
9. Incremental Maintenance of Online Summaries Over Multiple Streams. F. Altiparmak, E. Tuncel, H. Ferhatosmanoglu. IEEE Transactions on Knowledge and Data Engineering (TKDE), Volume 20, Issue 2, Feb. 2008, pp. 216-229.
10. Online Index Recommendations for High-Dimensional Databases using Query Workloads. M. Gibas, G. Canahuate, H. Ferhatosmanoglu. IEEE Transactions on Knowledge and Data Engineering (TKDE), Volume 20, Issue 2, Feb. 2008, pp. 246-260.
11. Automated Data Discovery in Similarity Score Queries. F. Altiparmak, A. S. Tosun, H. Ferhatosmanoglu, A. Sacan. 13th International Conference on Database Systems for Advanced Applications (DASFAA), New Delhi, India, March 2008, pp. 440-451.
12. Michael Gibas, Hakan Ferhatosmanoglu: Indexing, High Dimensional. Encyclopedia of GIS, pp. 502-507, February 2008.
13. A General Framework for Modeling and Processing Optimization Queries. M. Gibas, N. Zheng, H. Ferhatosmanoglu. VLDB '07, Vienna, Austria, September 2007, pp. 1069-1080. (Accept-rate: 17%)
14. Enhanced Partial Order Curve Comparison over Multiple Protein Folding Trajectories. H. Sun, H. Ferhatosmanoglu, M. Ota, Y. Wang. Life Sciences Society Computational Systems Biology (CSB '07), San Diego, CA, August 2007, pp. 299-310.
15. Update Conscious Bitmap Indices. G. Canahuate, M. Gibas, H. Ferhatosmanoglu. International Conference on Scientific and Statistical Database Management (SSDBM '07), Banff, Canada, July 2007, 12 pages.
16. LFM-Pro: A Tool for Detecting Significant Local Structural Sites in Proteins. A. Sacan, O. Ozturk, H. Ferhatosmanoglu, Y. Wang. Bioinformatics, Volume 23, Number 6, pp. 709-716, March 2007.
17. A Multi-Metric Similarity Based Analysis of Microarray Data. F. Altiparmak, S. Erdal, O. Ozturk, H. Ferhatosmanoglu. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2007, pp. 317-324.
18. Data Space Mapping for Efficient I/O in Large Multi-dimensional Databases. H. Ferhatosmanoglu, A. Ramachandran, D. Agrawal, A. El Abbadi. Information Systems Journal, vol. 32, no. 1, pp. 83-103, March 2007.
19. Access Structures for Angular Similarity Queries. T. Apaydin and H. Ferhatosmanoglu. IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 18, no. 11, pp. 1512-1525, November 2006.
20. Approximate Encoding for Direct Access and Query Processing over Compressed Bitmaps. T. Apaydin, G. Canahuate, H. Ferhatosmanoglu, A. Tosun. 32nd International Conference on Very Large Data Bases (VLDB '06),
21. High Dimensional Nearest Neighbor Searching. H. Ferhatosmanoglu, E. Tuncel, D. Agrawal, A. El Abbadi. Information Systems Journal, vol. 31, no. 6, pp. 512-540, September 2006.
22. Information Mining over Heterogeneous and High Dimensional Time Series Data in Clinical Trials Databases. F. Altiparmak, H. Ferhatosmanoglu, S. Erdal, C. Trost. IEEE Transactions on Information Technology in Biomedicine, 10(2): 254-263 (April 2006).
23. Predicting the Binding Affinity of MHC class II Peptides. F. Altiparmak, A. Akalin, H. Ferhatosmanoglu. LSS Computational Systems Bioinformatics Conference (CSB '06).
24. Indexing Incomplete Databases. G. Canahuate, M. Gibas, H. Ferhatosmanoglu. 10th International Conference on Extending Database Technology (EDBT '06).
25. Online Summarization of Dynamic Time Series Data. U. Ogras and H. Ferhatosmanoglu. VLDB Journal, 15(1): 84-98 (January 2006).
26. Compressing Bitmap Indices by Data Reorganization. A. Pinar, T. Tao, H. Ferhatosmanoglu. 21st IEEE International Conference on Data Engineering (ICDE '05).
27. Vector Space Indexing for Biosequence Similarity Searches. O. Ozturk and H. Ferhatosmanoglu. International Journal on Artificial Intelligence Tools, 14(5): 811-826 (2005).
28. Optimal Data-Space Partitioning of Spatial Data for Parallel I/O. H. Ferhatosmanoglu, D. Agrawal, O. Egecioglu, and A. El Abbadi. Distributed and Parallel Databases (DAPD), 17(1): 75-101 (2005).
29. Replicated Declustering of Spatial Data. H. Ferhatosmanoglu, A. Tosun, A. Ramachandran. 23rd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS '04), June 2004,
30. Dimensionality Reduction and Similarity Computation using Inner Product Approximations. O. Egecioglu, H. Ferhatosmanoglu, U. Ogras. IEEE Transactions on Knowledge and Data Engineering (TKDE), June 2004 (vol. 16, no. 6), pp. 714-726.
31. A Time-Series Analysis of Gene Expression Data. S. Erdal, O. Ozturk, D. Armbruster, H. Ferhatosmanoglu, W. Ray. IEEE International Symp. on Bioinformatics and Bioengineering (BIBE '04),
32. High Dimensional Reverse Nearest Neighbor Queries. A. Singh, H. Ferhatosmanoglu, A. Tosun. 12th ACM International Conference on Information and Knowledge Management (CIKM'03), pp. 91-98.
33. Dimensionality Reduction using Magnitude and Shape Approximations. U. Ogras, H. Ferhatosmanoglu. 12th ACM International Conference on Information and Knowledge Management (CIKM'03), pp. 99-107.
34. Peer-to-Peer Spatial Queries in Sensor Networks. M. Demirbas and H. Ferhatosmanoglu. 3rd IEEE International Conference on Peer-to-Peer Computing (P2P '03), Linköping, Sweden. September 2003, pp. 32-39.
35. Efficient k-NN Search on Streaming Data Series. X. Liu and H. Ferhatosmanoglu. 8th International Symposium on Spatial and Temporal Databases (SSTD '03),
36. CoMRI: A Compressed Multi-Resolution Index Structure for Sequence Similarity Queries. H. Sun, O. Ozturk and H. Ferhatosmanoglu. IEEE Computer Society Bioinformatics Conference (CSB '03). Stanford, CA. August 2003, pp. 553-558.
37. Effective Indexing and Filtering for Similarity Search in Large Biosequence Databases. O. Ozturk and H. Ferhatosmanoglu. IEEE International Symp. on Bioinformatics and Bioengineering (BIBE '03), pp. 359-366.
38. VQ-Index: An Index Structure for Similarity Searching in Multimedia Databases. E. Tuncel, H. Ferhatosmanoglu, K. Rose. ACM Multimedia, Juan Les Pins,
39. Vulnerabilities in Similarity Search based Systems. A. Tosun and H. Ferhatosmanoglu. ACM International Conference on Information and Knowledge Management (CIKM '02),
40. Optimal Parallel I/O Using Replication. A. Tosun, H. Ferhatosmanoglu. In proceedings of International Workshops on Parallel Processing (ICPPW '02),
41. Approximate Nearest Neighbor Searching in Multimedia Databases. H. Ferhatosmanoglu, E. Tuncel, D. Agrawal, A. El Abbadi. In proceedings of the 17th IEEE International Conference on Data Engineering (ICDE '01),
42. Efficient Processing of Conical Queries. H. Ferhatosmanoglu, D. Agrawal and A. El Abbadi. In proceedings of the 10th ACM International Conference on Information and Knowledge Management (CIKM '01),
43. Optimal Partitioning for Efficient I/O in Spatial Databases. H. Ferhatosmanoglu, D. Agrawal, A. El Abbadi. European Conference on Parallel Computing (Euro-Par '01), Parallel I/O and Storage Technology,
44. Constrained Nearest Neighbor Queries. H. Ferhatosmanoglu,
45. Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets. H. Ferhatosmanoglu, E. Tuncel, D. Agrawal, A. El Abbadi. In proceedings of the 9th ACM International Conference on Information and Knowledge Management (CIKM '00),
46. Dimensionality Reduction and Similarity Computation by Inner Product Approximations. O. Egecioglu and H. Ferhatosmanoglu. In proceedings of the 9th ACM International Conference on Information and Knowledge Management (CIKM '00),
47. Circular Data-space Partitioning for Similarity Queries and Parallel Disk Allocation. O. Egecioglu and H. Ferhatosmanoglu. In proceedings of the 11th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS '99), Boston, MA, pp. 194-200. November 1999.
48. Clustering Declustered Data for Efficient Retrieval. H. Ferhatosmanoglu, D. Agrawal, A. El Abbadi. In proceedings of the 8th ACM International Conference on Information and Knowledge Management (CIKM '99),
49. Concentric Hyperspaces and Disk Allocation for
Recent Workshops
50. Investigating the use of Extrinsic Similarity Measures for Microarray Analysis. D. Ucar, F. Altiparmak, H. Ferhatosmanoglu, S. Parthasarathy. Proceedings of the BIOKDD workshop at the ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2007.
51. Combining Mining Results from Multiple Sources in Clinical Trials and Microarray Applications. F. Altiparmak, O. Ozturk, S. Erdal, H. Ferhatosmanoglu, D. C. Trost. MMIS workshop at the ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2007.
52. Incremental Quantization for Aging Data Streams. F. Altiparmak, D. Chiu, H. Ferhatosmanoglu. DSMM workshop at IEEE International Conference on Data Mining (ICDM), 2007.
STUDENT SUPERVISION:
· PhD Advisees (6 completed, 3 current)
· MS Advisees (6 completed, 3 current)
· PhD Dissertation/Candidacy Committee (40 completed)
· M.S. Committee (2 completed, 3 current)
· Undergraduate Students (6 completed)
· Information Systems advisor in BS CSE program.
PROFESSIONAL SERVICES:
Program Committee Member
National Science Foundation Information & Intelligent Systems (IIS), Information Integration and Informatics, Division of Biological Infrastructure.
Canada National Research Council, Social Sciences and Humanities Research.
· An Information Systems Project Guide (working title), Prentice Hall.
Departmental Service
· Member, Advisory Committee
· Member, Faculty Search, Graduate Studies, and Computer Systems Committees
· External Service, Grad Admission, Undergraduate Studies Committees
PROJECTS:
· High-performance multi-dimensional databases: Systems and tools for indexing, querying, and mining of multi-dimensional data.
· Data Stream Management: Online and real-time compression, indexing, and analysis of streaming data.
· Data Management and Mining for Coastal Monitoring: A large-scale data management system for monitoring physical and ecological coastal changes.
· Protein Structure Modeling: Computational geometry and data mining tools for modeling protein functional sites.
· Microarray and Proteomics Data Management: Management of protein sequences and structures, and gene expression data sets; and methods for integration of these databases.
· Exploration of Dynamic Sequences in Scientific Databases: Online structures and algorithms to dynamically maintain and analyze data sequences for scientific discovery and monitoring purposes.
· Real-time dissemination of weather data: Formatting, management, and online compression techniques for effective and efficient dissemination of real-time meteorological data.
· Data Management in Multi-server architectures: Partitioning and data placement for fast searching of multi-dimensional data in multi-server architectures.
· Data warehouse: A client-server data warehouse and efficient view maintenance system in a multi-user concurrent environment.
· Trademark search engine: A web-based search engine for similarity searching of trademark images.
· Web-based DB integration: A query engine that gathers information from multiple web pages and builds its own up-to-date relational tables.
· Web transaction monitor: A TP-Monitor (for transactions initiated from web browsers) that provides deadlock-free transactional access to multiple data repositories employing synchronization, concurrency control, and recovery.
· Cyberinfrastructure System for Coastal Analysis: A data-intensive cyberinfrastructure component for coastal forecasting and change analysis.
· Clinical Data Mining: Outlier detection and modeling tools in pharmaceutical clinical trials databases.
· Similarity-based Indexing and Integration of Protein Sequences and Structure.