Projects

From HLT@INESC-ID

Ongoing

International

VidiVideo (2007-2010)
Summary: VIDI-Video project takes on the challenge of creating a substantially enhanced semantic access to video, implemented in a search engine. The engine will boost the performance of video search by forming a 1000 element thesaurus detecting instances of audio, visual or mixed-media content.
External site: http://www.vidivideo.eu
E-Circus (2006-2008)
Summary: E-Circus, will develop a new approach in the use of ICT to support social and emotional learning within Personal and Social Education (PSE). This will be achieved through virtual role-play with synthetic characters that establish credible and empathic relations with the learners.
External site: http://www.e-circus.org/
COST-2102 (2006-2010)
Summary: The main objective of this COST Action is to develop an advanced acoustical, perceptual, and psychological analysis of verbal and nonverbal communication signals originating in spontaneous face-to-face interaction, in order to identify algorithms and automatic procedures capable of identifying the human emotional states.
External site: http://www.cost2102.eu
COST-2103 (2006-2010)
Summary: The main objective of the Action is to combine previously unexploited techniques with new theoretical developments to improve the assessment of voice for as many European languages as possible, while acquiring in parallel data with a view to elaborating better voice production models.
External site: http://www.cost.esf.org/index.php?id=110&action_number=2103
ECESS ()
Summary: ECESS is a European initiative to foster the European research area in the field of Language Technology. As partner in this network where major industry partners such as IBM, Siemens and Nokia as well as European key research institutes are connected, L2F plays an active role in the R&D activities. We are responsible for the intelligibility and naturalness of the synthesized utterances while developing the acoustic module of the speech synthesis software.
External site: http://www.ecess.eu/

National

Tecnovoz (2006-2008)
Summary: The Tecnovoz consortium was built with the goal of creating a national technological centre capable of industrialising innovative systems based on speech technologies, at the same rhythm, and at equal technological level, that occurs in others countries. Simultaneously, it is intended that Portugal affirms itself as a real actor in the development of voice and speech technologies and a defender of its application to the Portuguese Language.
External site: http://www.tecnovoz.pt/web/home.asp
PoSTPort (2008-2010)
Summary: PoSTPort's goal is to port spoken language technologies originally developed for European Portuguese to the South-American and African varieties of Portuguese. The two main technologies to be investigated are speech synthesis and recognition. Instead of porting complete systems, we concentrate on linguistically relevant modules. Prior to this main work, the project will involve two tasks: corpora collection and characterization of the main differences between the studied varieties. The last task concerns the automatic identification of spoken varieties of Portuguese, which will be used as a pre-processing stage for switching among recognition systems developed for specific varieties.

Bilateral

Finished

International

  • COST 277 - Nonlinear Speech Processing
  • COST 278 - Spoken Language Interaction in Telecommunication
  • ALERT - Alert System for Selective Dissemination of Multimedia Information (2005-)
  • AUDIOLING-LP - Multimedia course for foreign students of the Portuguese language (Socrates Program)
  • SPEECHDAT - Speech Databases for for Creation of Voice Driven Teleservices (1994-1999)
  • VODIS - Advanced Speech Technologies for Voice Operated Driver Information Systems (1995-1999)
  • SPRACH - Speech Recognition ALgorithms for Connectionist Hybrids (1995-1998)
  • ELSNET - European Network in Language and Speech

National

  • RiCoBa - Rich Content Books for All (2005-2007)
  • LECTRA - Rich Transcription of Lectures for E-Learning Applications (2005-2007)
  • NLE GRID - Natural Language Engineering on a Computational Grid (2005-2007)
  • WFST - Weighted Finite State Transducers Applied to Spoken Language Processing (2004-2007)
  • DIGA - Dialog Interface for Global Access (2004-2007)
  • PAPOUS - The Story Teller (2003-2005)
  • IPSOM - Indexing, Integration and Sound Retrieval in Multimedia Documents (2000-2004)
  • ATA - Automatic Terms Acquisition (2001-2003)
  • FALA2 (2000-2003)
  • CITE-IV - Augmentative Communication Tools in Portuguese (1999-2000)
  • DIXI+ - A Text-to-Speech Synthesizer in Portuguese for Alternative and Augmentative Communication (1999-2001)
  • REC - Speech Recognition Applied to Telecommunications (1997-2000)
  • PRAXIS_FALA - Reconhecimento de fala de Alto Desempenho em Português (1997-1999)
  • CORAL - Labelled Spoken Dialogue Corpus (1997-1999)
  • BDFALA (in Portuguese) - Spoken Database for European Portuguese (1994-1998)
  • EDIFALA (in Portuguese) - Vocal Support System for Oral and Motor Handicapped (1993-1997)

Bilateral contracts

  • LIFAPOR - Spoken Books in European and Brazilian Portuguese (2005-2007)
  • ARARA - Automatic directory assistant service for Portuguese Telecom, together with Philips Speech Techonology (1999-2001)
  • SVIT (in Portuguese) - Partial Automation of Directory Services Based on Synthesis of Telephone Numbers (1995-1999)

Finished before 1995

  • WERNICKE - A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System (1992-1995)
  • RELATOR - A European Network of Repositories for Linguistic Resources (1993-1994)
  • SAM_A (ESPRIT III) - Multi-Lingual Speech Input/Output Assessment, Methodology and Standardization (1992-1993)
  • ONOMASTICA (Language Research Engineering) - Multi-Language Pronounciation Dictionary of Proper Names and Place Names (1993-1995)
  • SUNSTAR (ESPRIT II) - Integration and Design of Speech Understanding Interfaces (1989-1992)
  • HCM-ELSNET (Human Capital Mobility) - Phrase Level Phonology and Dialogue & Discourse (1994-1996)
  • EUREKA 151 - High quality speech coding at medium-to-low bit rates (1987-1990)
  • COST 229 - Applications od Digital Signal Processing to Telecommunications (1990-1993)
  • COMETT - A Trans-European Platform for Transferable Continuing Education in Digital Signal Processing (1990-1993)