Clark School Home UMD

ISR News Story

Espy-Wilson awarded NSF grant; subcontractor for Navy STTR grant

Professor Carol Espy-Wilson (ECE/ISR) has been awarded a National Science Foundation (NSF) grant and is a sub-contractor on a Phase II Navy STTR award.

NSF: Multipitch Tracking
The one-year, $125,000 NSF award is for Robust Intelligence: Extension of the APP Detector for Multipitch Tracking and Speaker Separation.

Because of the loss of fine temporal structure, users of cochlear implants encounter problems in separating speakers in multi-speaker environments. Speech recognition and speaker identification systems must deal with simultaneous speech from several talkers in conversations. A crucial preprocessing step for such systems is segregating speech according to its constituent sources.

Because of the differences in people's vocal chords, different speakers have characteristic pitch ranges. Pitch tracks can be used to help separate the combined signal into different speech streams. The research will help systems recognize the number of speakers and the separation of their pitch tracks based on the periodic portions of the speech signal (i.e., voiced regions).

Current popular multi-pitch tracking approaches are susceptible to artifacts caused by the interaction between the periodic regions of the different speech signals. Consequently, the periodicity of the combined signal can be different from that of the individual components. The major new idea is the extension of an existent periodicity and pitch estimation process to higher dimensions, arriving at a multi-dimensional periodicity function which is not susceptible to the harmonic interaction artifacts.

The multiple pitch tracks obtained are more accurate even when one speaker is considerably more dominant than the other. The approach is easily generalized to non-speech audio and should be robust in noisy channels. This project is a step towards separating actual speech streams from each other based on multi-pitch information.

Navy STTR: Speech Separation
Espy-Wilson is also a subcontractor for a Navy Phase II Small Business Technology Transfer (STTR) award to Signal Processing, Inc. of Rockville, Md., A Novel Speech Separation Approach for Enhanced Speaker Identification and Speech Recognition. This is an 18-month award for $103,066.

In this grant, Signal Processing, Inc. will use a speech enhancement algorithm developed by Espy-Wilson in a noise suppression system it is developing. Espy-Wilson’s algorithm is based on modified phase opponency (MPO) and does not require noise estimation.

Once developed, the company’s system could suppress high background noise and achieve high performance speech separation, speaker identification, and speech recognition. It could also be used for bird monitoring to avoid birdstrikes to aircraft, and speech enhancement in communication centers, conference rooms, aircraft cockpits, cars and buses. Further uses could be in security monitoring in airport terminals, bus and train stations, where the system could pick up multiple conversations from different people and different angles and as a front-end processor to an automatic speech recognition system.

September 12, 2008


Prev   Next

 

 

Current Headlines

Stephen Trimberger in residence as ISR visiting research engineer

Brin Family Prize Celebrates Student Innovation

Alumnus Ravi Tandon Receives 2018 Keysight Early Career Professor Award 

Alum Mingyan Liu is PI for Multiscale Network Games of Collusion and Competition MURI

Haptic Safety for Unmanned Vehicles

Three ISR faculty receive UMD Brain and Behavior Initiative seed grants

Aneesh Raghavan wins Ann G. Wylie Dissertation Fellowship

Book edited by Ghodssi, Lin in top 25 percent of most downloaded Springer eBooks

Why a robot can't yet outjump a flea

Bill Regli wins Washington Academy of Sciences Excellence in Research in Computer Science Award

News Resources

Return to Newsroom

Search News

Archived News

Events Resources

Events Calendar