Coupling Pattern Recognition and Signal Processing


Ahmed Hussen Abdelaziz

Date

  • 15:00, Friday, July 19th, 2013
  • Room 020, INESC-ID

Speaker

  • Ahmed Hussen Abdelaziz, Institut für Kommunikationsakustik, Ruhr-Universität Bochum

Abstract

Signal processing and pattern recognition are often treated as separate problems. However, coupling them tightly can significantly improve performance in both tasks. In this talk, we introduce two new approaches to such coupling, which provide more precise input from signal processing to pattern recognition and vice versa. We start by coupling pattern recognition models with signal processing algorithms through a new statistical model for speech enhancement, the twin hidden Markov model (THMM). With the THMM, hidden Markov models (HMMs) can be exploited to enhance speech signals in a recognize-and-synthesize scheme that uses the most appropriate features for recognition and for synthesis. We then introduce a new approach for coupling signal processing with pattern recognition, called significance decoding (SD). SD is a new uncertainty-of-observation technique that uses the feature uncertainties estimated by the signal processing algorithm to improve the accuracy of automatic speech recognition under adverse environmental conditions. Finally, we combine the two schemes in the context of audio-visual speech recognition to improve its performance in very noisy environments.