Speaker and content identification
Latest revision as of 09:33, 11 April 2012
Xavier Anguera
Date
- 15:00, Friday, April 13th, 2012
- Room 336
Speaker
- Xavier Anguera, Telefonica Research
Abstract
In this talk I will cover two topics I have recently been working on. First, regarding speaker identification, I will introduce the use of binary fingerprints to model a speaker's voice. Standard acoustic vectors are projected onto a special GMM that represents the speaker acoustic space, yielding high-dimensional binary vectors that have proven successful at identifying speakers in speaker verification and diarization tasks. Second, I will talk about current developments in pattern matching approaches that enable content-centric applications when little or no training data is available for a particular language. In particular, I will describe a query-by-example system I presented at the MediaEval 2011 evaluation, which uses a novel feature extraction front-end that I describe further in a paper at ICASSP 2012.
Note: This seminar will be held in English.