The most challenging aspects of speech recognition are those related to processing speech from widely different domains, spoken in a variety of dialects and in potentially adverse environments, and to dealing with the characteristics of spontaneous speech: no punctuation, disfluencies, emotions, and overlapping turns. In this context, L2F’s activities have recently concentrated on several research strands:
* Broadcast News (BN) recognition
:Our work in this area started in the scope of the European project ALERT. There are currently two PhD theses on this topic: one covering [[Audio indexation|BN Audio Indexing]] and [[BN Speech Recognition]], and another covering [[BN Language Models]]. To showcase these developments, several prototypes and demos have been built, among them a prototype resulting from the ALERT project: [[SSNT - Summarization of Broadcast News Services]].
* Recognition in adverse environments
:The field of robust speech recognition is relatively new at L2F. We are currently working on speech enhancement techniques using beamforming for a multi-user speaker environment. Our approach uses a single array of 64 linearly spaced microphones.
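To make the beamforming idea concrete, the following is a minimal sketch of a delay-and-sum beamformer for a uniform linear array. It is an illustration under stated assumptions, not a description of the method used at L2F: the microphone spacing, sample rate, speed of sound, far-field (plane wave) model, and rounding of delays to whole samples are all hypothetical choices, and the function names are invented for this example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def steering_delays(num_mics, spacing_m, angle_deg, sample_rate):
    """Arrival delay (in whole samples) at each microphone of a uniform
    linear array, for a plane wave coming from angle_deg off broadside.
    Delays are shifted so that the earliest microphone has delay 0."""
    sin_theta = math.sin(math.radians(angle_deg))
    delays = [
        m * spacing_m * sin_theta / SPEED_OF_SOUND * sample_rate
        for m in range(num_mics)
    ]
    earliest = min(delays)
    # Rounding to whole samples is a simplification; real systems
    # usually apply fractional delays.
    return [round(d - earliest) for d in delays]


def delay_and_sum(channels, delays):
    """Undo each channel's arrival delay and average across channels.
    Signals from the steered direction add coherently; others do not."""
    usable = len(channels[0]) - max(delays)
    output = []
    for t in range(usable):
        total = sum(ch[t + d] for ch, d in zip(channels, delays))
        output.append(total / len(channels))
    return output


if __name__ == "__main__":
    # Hypothetical setup: 64 microphones, 4 cm apart, 16 kHz sampling,
    # talker located 30 degrees off broadside.
    delays = steering_delays(64, 0.04, 30.0, 16000)
    source = [math.sin(0.1 * t) for t in range(200)]
    # Simulate what each microphone records: a delayed copy of the source.
    channels = [
        [source[t - d] if t >= d else 0.0 for t in range(200)]
        for d in delays
    ]
    enhanced = delay_and_sum(channels, delays)
    print(len(enhanced), "aligned samples recovered")
```

In a multi-speaker setting, one such beam could be steered toward each talker by computing a separate set of delays per direction; how the directions are estimated is outside the scope of this sketch.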