AUDIMUS

From HLT@INESC-ID

Revision as of 16:28, 6 June 2007 by Jpn (talk | contribs)

AUDIMUS is a platform for research of new techniques and development of new applications for Automatic Speech Recognition in the Portuguese language.

Summary

AUDIMUS is the name of a generic platform for an Automatic Speech Recognition System specifically tailored to the European Portuguese language. This platform is used as a research base for new techniques of the different components of a speech recognition system. The improvements that result from the research work are fully integrated in AUDIMUS in order to give rise to new and better applications.

Description

AUDIMUS is a hybrid speech recognizer that combines the temporal modeling capabilities of Hidden Markov Models (HMMs) with the pattern discriminative classification capabilities of multilayer perceptrons (MLPs). This same recognizer is being used for different complexity tasks based on a common structure but with different components. MLPs are used to estimate the context-independent posterior phone probabilities given the acoustic data at each input frame. The phone probabilities generated at the output of the MLP classifiers are combined using an appropriate algorithm. The acoustic models are dependent on the input facilities since we are using separated models for telephone speech, for microphone speech or broadcast news. The same occurs with the lexical and the language models which are dependent on the specific application domain. The AUDIMUS decoder is based on a weighted finite-state-transducer (WFST) approach to large vocabulary speech recognition.

Downloads