The Ohio State University Computer Science and Engineering
Search CSE
    Home  |  Events |  People
Around Campus
  Department
  Calendar
    Calendar Index
    Upcoming Speakers
   
  News
  Research
  Undergrad Programs
  Graduate Programs
  People
  Courses
  Computing Services
 

Guest Speaker

Name That Tune: Finding a Song from a Sung Query

Bryan Pardo
University of Michigan

Thurs., Mar. 4th
3:30pm; 480 Dreese Labs
All interested parties are invited.
Refreshments will be served immediately preceding the talk.

Abstract:
Music Information Retrieval has become an active area of research motivated by the increasing importance of internet-based music distribution. Online catalogs are already approaching one million songs, so it is important to study new techniques for searching these vast stores of audio. One approach to finding music that has received much attention is “Query by Humming” (QBH). This approach enables users to retrieve songs and information about them by singing, humming, or whistling a melodic fragment. In QBH systems, the query is a digital audio recording of a melodic fragment, and the ultimate target is a complete digital audio recording of a piece.

We have created a QBH system for music search and retrieval. A user sings a theme from the desired piece of music. The sung theme (query) is converted into a sequence of pitch-intervals and rhythms. This sequence is compared to musical themes (targets) stored in a database. The top pieces are returned to the user in order of similarity to the sung theme. We describe two approaches to measuring similarity between database themes and the sung query. In the first, queries are compared to database themes using probabilistic string-alignment algorithms. Here, similarity between target and query is determined by edit cost. In the second approach, pieces in the database are represented as hidden Markov models (HMMs). In this approach, the query is treated as an observation sequence and a target is judged similar to the query if its HMM has a high likelihood of generating the query. Experiments show that while no approach is clearly superior in retrieval ability, string matching often has a significant speed advantage. Moreover, neither approach surpasses human performance.

Host: DeLiang (Leon) Wang

 

 

Home  |  Department  |  Calendar  |  News  |  Research  |  Undergrad Programs  |  Graduate Programs
People  |  Courses  |  Computing Services  |  Ohio State Home  |  Diversity Program  |  College of Engineering  |  E-mail Us

The Ohio State University, Copyright 2004.