SLT 2012 presentations

From HLT@INESC-ID

Revision as of 16:15, 26 November 2012 by Rdmr


Date

  • 15:00, Friday, November 16th, 2012
  • Room 336

Intent Transfer in Speech-to-Speech Machine Translation

Speaker

Abstract

In this talk I will present the English-Portuguese bilingual speech database based on the TAP-UP corpus. I will then describe an approach to analyzing and transferring speaker intent in speech translation.

Recovery of Acronyms, Out-Of-Lattice Words and Pronunciations from Parallel Multilingual Speech

Speaker

Abstract

In this work we present a set of techniques that exploit information from multiple versions of the same speech in different languages to improve Automatic Speech Recognition (ASR) performance. Using this redundant information we are able to recover acronyms, words that cannot be found in the multiple hypotheses produced by the ASR systems, and pronunciations absent from their pronunciation dictionaries. When used together, the three techniques yield a relative improvement of 5.0% in word error rate (WER) over our baseline system, and of 24.8% over standard speech recognition, on a Europarl Committee dataset in three languages (Portuguese, Spanish and English).
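
For readers unfamiliar with the metric, the figures above are relative reductions in WER. The following is a minimal sketch of how WER and a relative improvement are usually computed, assuming the standard word-level edit distance definition. The function names and the example numbers are illustrative assumptions and are not taken from the talk or its dataset.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Row-by-row Levenshtein distance over words
    # (substitutions, deletions and insertions all cost 1).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_improvement(baseline_wer: float, new_wer: float) -> float:
    """Fraction of the baseline WER that the new system removes."""
    return (baseline_wer - new_wer) / baseline_wer


# Hypothetical numbers, chosen only to illustrate the arithmetic:
# going from 20.0% WER to 19.0% WER is a 5% relative improvement.
example = relative_improvement(0.20, 0.19)
```

Note that a relative improvement is measured against the baseline error rate, so the same relative figure corresponds to a smaller absolute change when the baseline WER is already low.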