Speech Discrimination Based on Multiscale Spectro-Temporal Modulations (ISR IP)

For more information, contact ISR External Relations Director
Jeff Coriale at coriale@umd.edu or 301.405.6604.

ISR intellectual property available to license

Inventors
Nima Mesgarani, Shihab Shamma

U.S. Patent: 7,505,902

Description
Researchers at the University of Maryland have developed a content-based audio classification algorithm based on novel multiscale spectro-temporal modulation features inspired by cortical processing.

A potential use for the classification system is to discriminate speech from non-speech. Non-speech, for example, could consist of animal vocalizations, music, or environmental sounds. In head-to-head comparisons with two other state-of the-art approaches to discriminating between speech and non-speech, the multiscale spectro-temporal system performed significantly better.

These algorithms also have applications in audio and data retrieval, archival management, modern human-computer interfaces, and in the entertainment and security industries. The researchers are also working on developing algorithms to enhance speech in noisy environments using an auditory model.

For more information
If you would like to license this intellectual property, have questions, would like to contact the inventors, or need more information, contact ISR External Relations Director Jeff Coriale at coriale@umd.edu or 301.405.6604.

Find more ISR IP
You can go to our main IP search page to search by research category or faculty name. Or view the entire list of available IP on our complete IP listing page.

ISR-IP-Shamma ISR-IP-biological ISR-IP-audio

Published June 21, 2007