Projects

From HLT@INESC-ID

Revision as of 12:48, 9 February 2014 by Imt

Ongoing

International

DIRHA (2012-2014)
The DIRHA project addresses the development of voice-enabled automated home environments based on distant-speech interaction in different languages. A distributed microphone network is installed in the rooms of a house to selectively monitor acoustic and speech activity in any space and, eventually, to run a spoken dialogue session with a given user in order to provide a service or give access to appliances and other devices. [1]
COST Action IC1206 (2013-2017)
De-identification in multimedia content can be defined as the process of concealing the identities of individuals captured in a given set of data (images, video, audio, text), for the purpose of protecting their privacy. This Action aims to facilitate coordinated interdisciplinary efforts (related to scientific, legal, ethical and societal aspects) in the introduction of person de-identification and reversible de-identification in multimedia content. [2]

National

PoSTPort (2008-2010)
PoSTPort's goal is to port spoken language technologies originally developed for European Portuguese to the South-American and African varieties of Portuguese. The two main technologies to be investigated are speech synthesis and recognition. The project also involves corpora collection and characterization of the main differences between varieties, as well as their automatic identification, for switching among specific recognition systems.
FalaComigo (2010-2013)
FalaComigo aims to "Enhance the Cultural Tourism through the Interaction with Virtual Characters" by providing a set of applications that will be deployed in various places of touristic interest. Through these applications, the FalaComigo team will offer visitors new and compelling ways of interacting, supplying a remarkable sensory experience. The solutions build on a spoken dialogue system with speech recognition and synthesis, 3D facial animation, dialogue management, and question answering technologies.
OOBIAN (2010-2012)
Construction of lexical and grammatical resources for local integration into the mechanism that recognizes syntactic-semantic relationships in text. Integration with an indexing system.
AVoz (2012-2013)
The AVoz project proposes to conduct an in-depth study of automatic speech recognition for this type of speech, in order to improve ASR performance. Project Website: https://avoz.l2f.inesc-id.pt
SUSPECT (2012-2015)
The goal of this project is to develop privacy-preserving frameworks for processing voice data. Processing will be performed without access to the voice, i.e., without access to any form of the speech that could be analyzed to obtain information about the talker or what was said. Using a combination of tools from cryptography and secure multiparty computation, we will render voice processing algorithms secure, so that the privacy of all parties is preserved. Specifically, we propose to develop solutions for the secure speaker verification and keyword spotting problems.

Finished

International

  • euTV - Adaptive Channels in Europe (2010-2012)
  • METANET4U - Network of Excellence forging the multilingual Europe Technology Alliance (2010-2012)
  • LIREC - LIving with Robots and InteractivE Companions (2008-2012)
  • I-DASH - The Investigator’s Dashboard (2008-2010)
  • Vidivideo - Interactive semantic video search with a large thesaurus of machine learned audio-visual concepts (2007-2010)
  • COST-2102 - Cross-Modal Analysis of Verbal and Non-verbal Communication (2006-2010)
  • COST-2103 - Advanced Voice Function Assessment (2006-2010)
  • E-Circus - Education through Characters with Emotional Intelligence and Role-playing Capabilities that Understand Social Interaction (2006-2008)
  • COST 277 - Nonlinear Speech Processing (2001-2005)
  • COST 278 - Spoken Language Interaction in Telecommunication (2001-2005)
  • ALERT - Alert System for Selective Dissemination of Multimedia Information (2000-2002)
  • AUDIOLING-LP - Multimedia course for foreign students of the Portuguese language (Socrates Program)
  • SPEECHDAT - Speech Databases for Creation of Voice Driven Teleservices (1994-1999)
  • VODIS - Advanced Speech Technologies for Voice Operated Driver Information Systems (1995-1999)
  • SPRACH - Speech Recognition ALgorithms for Connectionist Hybrids (1995-1998)
  • ELSNET - European Network in Language and Speech
  • ECESS - European Center of Excellence on Speech Synthesis

National

  • VITHEA - Virtual Therapist for Aphasia treatment (2010-2012)
  • REAP.PT - Computer Assisted Language Learning: Reading Practice (2009-2012)
  • PT-STAR - Speech Translation Advanced Research to and from Portuguese (2009-2012)
  • ARIA - Ambient-assisted Reading Interfaces for the Ageing-society (2010-2012)
  • FleetMod - Simulating and predicting the behavior of the skippers of fishing vessels to provide a framework to test the effectiveness of different management policies (2008-2011)
  • StopFire - A distributed intelligent system for forest fire combat aid (2007-2011)
  • Tecnovoz - Tecnologia de Reconhecimento e Síntese de Voz (2006-2008)
  • RiCoBa - Rich Content Books for All (2005-2007)
  • LECTRA - Rich Transcription of Lectures for E-Learning Applications (2005-2007)
  • NLE GRID - Natural Language Engineering on a Computational Grid (2005-2007)
  • WFST - Weighted Finite State Transducers Applied to Spoken Language Processing (2004-2007)
  • DIGA - Dialog Interface for Global Access (2004-2007)
  • PAPOUS - The Story Teller (2003-2005)
  • IPSOM - Indexing, Integration and Sound Retrieval in Multimedia Documents (2000-2004)
  • ATA - Automatic Terms Acquisition (2001-2003)
  • FALA2 (2000-2003)
  • CITE-IV - Augmentative Communication Tools in Portuguese (1999-2000)
  • DIXI+ - A Text-to-Speech Synthesizer in Portuguese for Alternative and Augmentative Communication (1999-2001)
  • REC - Speech Recognition Applied to Telecommunications (1997-2000)
  • PRAXIS_FALA - Reconhecimento de fala de Alto Desempenho em Português (1997-1999)
  • CORAL - Labelled Spoken Dialogue Corpus (1997-1999)
  • BDFALA (in Portuguese) - Spoken Database for European Portuguese (1994-1998)
  • EDIFALA (in Portuguese) - Vocal Support System for Oral and Motor Handicapped (1993-1997)

Bilateral contracts

  • LIFAPOR - Spoken Books in European and Brazilian Portuguese (2005-2007)
  • ARARA - Automatic directory assistant service for Portuguese Telecom, together with Philips Speech Technology (1999-2001)
  • SVIT (in Portuguese) - Partial Automation of Directory Services Based on Synthesis of Telephone Numbers (1995-1999)

Finished before 1995

  • WERNICKE - A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System (1992-1995)
  • RELATOR - A European Network of Repositories for Linguistic Resources (1993-1994)
  • SAM_A (ESPRIT III) - Multi-Lingual Speech Input/Output Assessment, Methodology and Standardization (1992-1993)
  • ONOMASTICA (Language Research Engineering) - Multi-Language Pronunciation Dictionary of Proper Names and Place Names (1993-1995)
  • SUNSTAR (ESPRIT II) - Integration and Design of Speech Understanding Interfaces (1989-1992)
  • HCM-ELSNET (Human Capital Mobility) - Phrase Level Phonology and Dialogue & Discourse (1994-1996)
  • EUREKA 151 - High quality speech coding at medium-to-low bit rates (1987-1990)
  • COST 229 - Applications of Digital Signal Processing to Telecommunications (1990-1993)
  • COMETT - A Trans-European Platform for Transferable Continuing Education in Digital Signal Processing (1990-1993)