Difference between revisions of "Projects"

From HLT@INESC-ID

(National)
 
(174 intermediate revisions by 8 users not shown)
Line 1: Line 1:
 
__NOTOC__
 
__NOTOC__
== Ongoing==
+
== International Projects ==
 +
Please see https://www.inesc-id.pt.
  
=== International ===
+
== National Projects ==
 +
Please see https://www.inesc-id.pt.
  
{{Project|logo=logo-vidivideo.png
+
== Internal Projects ==
| shorttitle=VidiVideo
+
* [[COVID19]] - Detecção de COVID-19 a partir de tosse e fala
| title=Vidi-Video (Interactive semantic video search with a large thesaurus of machine learned audio-visual concepts)
+
| date=2007-2010
+
| information='''Summary:''' VIDI-Video project takes on the challenge of creating a substantially enhanced semantic access to video, implemented in a search engine. The engine will boost the performance of video search by forming a 1000 element thesaurus detecting instances of audio, visual or mixed-media content.<br/> '''External site:''' http://www.vidivideo.eu}}
+
{{Project|logo=logo-e-circus.png
+
| shorttitle=E-Circus
+
| title=E-Circus (Education through Characters with Emotional Intelligence and Role-playing Capabilities that Understand Social Interaction)
+
| date=2006-2008
+
| information='''Summary:''' E-Circus, will develop a new approach in the use of ICT to support social and emotional learning within Personal and Social Education (PSE). This will be achieved through virtual role-play with synthetic characters that establish credible and empathic relations with the learners.<br/> '''External site:''' http://www.e-circus.org/}}
+
{{Project|logo=logo-cost-2102.png
+
| shorttitle=COST-2102
+
| title=COST-2102 (Cross-Modal Analysis of Verbal and Non-verbal Communication)
+
| date=2006-2010
+
| information='''Summary:''' The main objective of this COST Action is to develop an advanced acoustical, perceptual, and psychological analysis of verbal and nonverbal communication signals originating in spontaneous face-to-face interaction, in order to identify algorithms and automatic procedures capable of identifying the human emotional states. <br/> '''External site:''' http://www.cost2102.eu}}
+
{{Project|logo=logo-cost-2103.png
+
| shorttitle=COST-2103
+
| title=COST-2103 (Advanced Voice Function Assessment)
+
| date=2006-2010
+
| information='''Summary:''' The main objective of the Action is to combine previously unexploited techniques with new theoretical developments to improve the assessment of voice for as many European languages as possible, while acquiring in parallel data with a view to elaborating better voice production models. <br/> '''External site:''' http://www.cost.esf.org/index.php?id=110&action_number=2103}}
+
{{Project|logo=logo-ecess.png
+
| shorttitle=ECESS
+
| title=ECESS (European Center of Excellence on Speech Synthesis)
+
|date=
+
| information='''Summary:''' ECESS is a European initiative to foster the European research area in the field of Language Technology. As partner in this network where major industry partners such as IBM, Siemens and Nokia as well as European key research institutes are connected, L2F plays an active role in the R&D activities. We are responsible for the intelligibility and naturalness of the synthesized utterances while developing the acoustic module of the speech synthesis software. <br/> '''External site:''' http://www.ecess.eu/}}
+
  
=== National ===
+
== Past International Projects ==
  
{{Project|logo=logo-tecnovoz.png
+
* [[The European Network on Integrating Vision and Language (iV&L Net)|COST Action IC1307]] - The European Network on Integrating Vision and Language (iV&L Net) (2013-2017)
| shorttitle=Tecnovoz
+
* [[De-identification for privacy protection in multimedia content|COST Action IC1206]] - De-identification for privacy protection in multimedia content (2013-2017)
| title=Tecnovoz (Tecnologia de Reconhecimento e Síntese de Voz)  
+
* [[TextLink: Structuring Discourse in Multilingual Europe|COST Action IS1312]] - TextLink: Structuring Discourse in Multilingual Europe (2013-2017)
| date=2006-2008
+
* [[Realising an Applied Gaming Ecosystem|RAGE]] - Realising an Applied Gaming Ecosystem (2015-2018)
| information='''Summary:''' The Tecnovoz consortium was built with the goal of creating a national technological centre capable of industrialising innovative systems based on speech technologies, at the same rhythm, and at equal technological level, that occurs in others countries. Simultaneously, it is intended that Portugal affirms itself as a real actor in the development of voice and speech technologies and a defender of its application to the Portuguese Language.<br/> '''External site:''' http://www.tecnovoz.pt/web/home.asp}}
+
* [[SPEDIAL (Spoken Dialogue Analytics)|SPEDIAL]] - Spoken Dialogue Analytics [http://www.spedial.eu] (2013-2015)
{{Project|logo=logo-postport.png
+
* [[DIRHA (Distant-speech Interaction for Robust Home Applications)|DIRHA]] - Voice-enabled automated home environments based on distant-speech interaction in different languages (2012-2015)
| shorttitle=PoSTPort
+
* [http://www.eutvweb.eu/ euTV] - Adaptive Channels in Europe (2010-2012)
| title=PoSTPort (POrting Speech Technologies to other varieties of Portuguese)
+
* [http://www.meta-net.eu/ METANET4U] - Network of Excellence forging the multilingual Europe Technology Alliance (2010-2012)
| date=2008-2010
+
* [http://www.lirec.eu/ LIREC] - LIving with Robots and InteractivE Companions (2008-2012)
| information='''Summary:''' PoSTPort's goal is to port spoken language technologies originally developed for European Portuguese to the South-American and African varieties of Portuguese. The two main technologies to be investigated are speech synthesis and recognition. Instead of porting complete systems, we concentrate on linguistically relevant modules. Prior to this main work, the project will involve two tasks: corpora collection and characterization of the main differences between the studied varieties. The last task concerns the automatic identification of spoken varieties of Portuguese, which will be used as a pre-processing stage for switching among recognition systems developed for specific varieties.}}
+
* [https://www.l2f.inesc-id.pt/wiki/index.php/I-Dash_%28The_Investigator%27s_Dashboard%29 I-DASH] - The Investigator’s Dashboard (2008-2010)
{{Project|logo=logo-ricoba.png
+
* [https://www.l2f.inesc-id.pt/wiki/index.php/Vidi-Video_%28Interactive_semantic_video_search_with_a_large_thesaurus_of_machine_learned_audio-visual_concepts%29 VIDIVIDEO] - Interactive semantic video search with a large thesaurus of machine learned audio-visual concepts(2007-2010)
| shorttitle=RiCoBa
+
* [http://www.cost.eu/domains_actions/ict/Actions/2102 COST-2102] - Cross-Modal Analysis of Verbal and Non-verbal Communication (2006-2010)
| title=RiCoBa (Rich Content Books for All)
+
* [http://www.cost.eu/domains_actions/ict/Actions/2103 COST-2103] - Advanced Voice Function Assessment (2006-2010)
| date=2005-2007
+
* [http://www.macs.hw.ac.uk/EcircusWeb/ E-Circus] - Education through Characters with Emotional Intelligence and Role-playing Capabilities that Understand Social Interaction (2006-2008)
| information='''Summary:''' The project aims at making books more accessible and appealing to different audiences. The goals are supporting rich Digital Talking Book (DTB) generation, through the development of a production framework to assist in the building and enriching of the books, and rich DTB playback, by developing tools for non-visual platforms, and tools that adapt the book presentation and interaction, reacting to changes in the user, playback devices, and environment.<br/> '''External site:''' http://hcim.di.fc.ul.pt/ricoba/index.html}}
+
* [http://www.see.ed.ac.uk/~cost277/ COST 277] - Nonlinear Speech Processing (2001-2005)
{{Project|logo=logo-lectra.png
+
* [http://www.cost.eu/domains_actions/ict/Actions/278 COST 278] - Spoken Language Interaction in Telecommunication (2001-2005)
| shorttitle=LECTRA
+
* [[ALERT]] - Alert System for Selective Dissemination of Multimedia Information (2000-2002)
| title=LECTRA (Rich Transcription of Lectures for E-Learning Applications)
+
| date=2005-2007
+
| information='''Summary:''' The goal of this project is the production of multimedia lecture contents for e-learning applications. We shall take as a pilot study a course for which the didactic material is already electronically available and in Portuguese. Our contribution to these contents will be to add, for each lecture in the course, the recorded video signal and the synchronized lecture transcription. We believe that this synchronized transcription may be specially important for hearing-impaired students.}}
+
{{Project|logo=logo-nlegrid.png
+
| shorttitle=NLE GRID
+
| title=NLE GRID (Natural Language Engineering on a Computational Grid)
+
| date=2005-2007
+
| information='''Summary:''' Computational grids enable the sharing and aggregation of geographically distributed resources for solving large-scale and data intensive problems. In this project we propose a software architecture for building component-based applications, targeted for the computational processing of the Portuguese language, deployed as a computational grid.}}
+
{{Project|logo=logo-wfst.png
+
| shorttitle=WFST
+
| title=WFST (Weighted Finite State Transducers Applied to Spoken Language Processing)  
+
| date=2004-2007
+
| information='''Summary:''' An interesting feature of FSMs is that they can be automatically built or "learned" from training data using corpus­based techniques. Compared to more traditional knowledge­based approaches, these techniques are attractive for their potential of much lower development costs. Another interesting property of FSMs is their feasibility for implementing or approximating knowledge-based techniques. The main goal of this project is the application of this framework to speech recognition and synthesis.}}
+
{{Project|logo=logo-diga.png
+
| shorttitle=DIGA
+
| title=DIGA (Dialog Interface for Global Access)  
+
| date=2004-2007
+
| information='''Summary:''' This project will integrate teams with very different expertise - speech processing, neural networks and natural language processing - who will join efforts to develop a conversational interface for accessing and retrieving online information. Building a prototype of a spoken dialog interface using state of the art core language technologies is therefore the first step towards being able to address in the future innovative research areas such as multilingual information access or animated multimodal conversational agents.}}
+
 
+
=== Bilateral ===
+
 
+
{{Project|logo=logo-lifapor.png
+
| shorttitle=LIFAPOR
+
| title=LIFAPOR (Spoken Books in European and Brazilian Portuguese)
+
| date=2005-2007
+
| information='''Summary:''' O projecto LiFaPor tem como principal objectivo potenciar o desenvolvimento de Livros Falados Digitais para o Português Europeu e para o Português Brasileiro. Trata-se, portanto, de realizar um conjunto de actividades que culminem num objectivo comum: a produção de Livros Falados Digitais nas duas variantes da Língua Portuguesa.<br/>}}
+
 
+
== Finished ==
+
 
+
=== International ===
+
 
+
* [[COST 277]] - Nonlinear Speech Processing
+
* [[COST 278]] - Spoken Language Interaction in Telecommunication
+
* [[ALERT]] - Alert System for Selective Dissemination of Multimedia Information (2005-)
+
 
* [[AUDIOLING-LP]] - Multimedia course for foreign students of the Portuguese language (Socrates Program)
 
* [[AUDIOLING-LP]] - Multimedia course for foreign students of the Portuguese language (Socrates Program)
 
* [[SPEECHDAT - Speech Databases for for Creation of Voice Driven Teleservices|SPEECHDAT]] - Speech Databases for for Creation of Voice Driven Teleservices (1994-1999)
 
* [[SPEECHDAT - Speech Databases for for Creation of Voice Driven Teleservices|SPEECHDAT]] - Speech Databases for for Creation of Voice Driven Teleservices (1994-1999)
Line 88: Line 33:
 
* [http://hebb.inesc.pt/NN/RFC/projectos.htm#SPRACH SPRACH] - Speech Recognition ALgorithms for Connectionist Hybrids (1995-1998)
 
* [http://hebb.inesc.pt/NN/RFC/projectos.htm#SPRACH SPRACH] - Speech Recognition ALgorithms for Connectionist Hybrids (1995-1998)
 
* [http://www.elsnet.org/ ELSNET] - European Network in Language and Speech
 
* [http://www.elsnet.org/ ELSNET] - European Network in Language and Speech
 +
* [http://www.ecess.eu/ ECESS] - European Center of Excellence on Speech Synthesis
  
=== National ===
+
== Past National Projects ==
  
 +
* [[TRAnslation and Transcription Assisted by Humans on the Internet|TRATAHI]] - TRAnslation and Transcription Assisted by Humans on the Internet (2015)
 +
* [[Intelligent Networked Robot Systems for Symbiotic Interaction with Children with Impaired Development|INSIDE]] - Intelligent Networked Robot Systems for Symbiotic Interaction with Children with Impaired Development (2014-2018)
 +
* [[Decision support systems for preventing ICU readmissions|IC4U]] - Decision support systems for preventing ICU readmissions (2013-2015)
 +
* [[MISNIS (Intelligent Mining of Public Social Networks’ Influence in Society)|MISNIS]] - Intelligent Mining of Public Social Networks’ Influence in Society (2013-2015)
 +
* [[Machine Translation for Microblogs|MT4M]] - Machine Translation for Microblogs (2015)
 +
* [[Voice coaching for reduced stress|VOCE]] - Voice coaching for reduced stress (2012-2015)
 +
* [[SUSPECT (SecUre SPEeCh Technologies)|SUSPECT]] - SecUre SPEeCh Technologies (2012-2015)
 +
* [http://www.vithea.org VITHEA] - Virtual Therapist for Aphasia treatment (2010-2012)
 +
* [https://avoz.l2f.inesc-id.pt AVOZ] - Models for automatic speech recognition for the Elderly (2012-2013)
 +
* [https://www.l2f.inesc-id.pt/w/PoSTPort_(POrting_Speech_Technologies_to_other_varieties_of_Portuguese) PoSTPort] - POrting Speech Technologies to other varieties of Portuguese (2008-2010)
 +
* [https://www.l2f.inesc-id.pt/wiki/index.php/OOBIAN_%28%29 OBIAN]  Lexical and grammatical resources for recognizing syntactic-semantic relationships on text (2010-2012)
 +
* [[Enhance the Cultural Tourism through the Interaction with Virtual Characters|FalaComigo]] - Enhance the Cultural Tourism through the Interaction with Virtual Characters (2010-2013)
 +
* [https://www.l2f.inesc-id.pt/wiki/index.php/REAP.PT_%28Computer_Aided_Language_Learning_-_Reading_Practice%29 REAP.PT] - Computer Assisted Language Learning: Reading Practice (2009-2012)
 +
* [https://www.l2f.inesc-id.pt/wiki/index.php/PT-STAR_%28Speech_Translation_Advanced_Research_to_and_from_Portuguese%29 PT-STAR] - Speech Translation Advanced Research to and from Portuguese (2009-2012)
 +
* [https://www.l2f.inesc-id.pt/wiki/index.php/ARIA_-_Ambient-assisted_Reading_Interfaces_for_the_Ageing-society ARIA] - Ambient-assisted Reading Interfaces for the Ageing-society (2010-2012)
 +
* [[FleetMod (simulate and predict the behavior of the skippers of fishing vessels to provide a framework to test the effectiveness of different management policies)|FleetMod]] - FleetMod (simulate and predict the behavior of the skippers of fishing vessels to provide a framework to test the effectiveness of different management policies) (2008-2011)
 +
* [[StopFire (a distributed intelligent system for forest fire combat aid)|StopFire]] - StopFire (a distributed intelligent system for forest fire combat aid) (2007-2011)
 +
* [[Tecnovoz (Tecnologia de Reconhecimento e Síntese de Voz)|Tecnovoz]] - Tecnologia de Reconhecimento e Síntese de Voz (2006-2008)
 +
* [[RiCoBa (Rich Content Books for All)|RiCoBa]] - Rich Content Books for All (2005-2007)
 +
* [[LECTRA (Rich Transcription of Lectures for E-Learning Applications)|LECTRA]] - Rich Transcription of Lectures for E-Learning Applications (2005-2007)
 +
* [[NLE GRID (Natural Language Engineering on a Computational Grid)|NLE GRID]] - Natural Language Engineering on a Computational Grid (2005-2007)
 +
* [[WFST (Weighted Finite State Transducers Applied to Spoken Language Processing)|WFST]] - Weighted Finite State Transducers Applied to Spoken Language Processing (2004-2007)
 +
* [[DIGA (Dialog Interface for Global Access) |DIGA]] - Dialog Interface for Global Access (2004-2007)
 
* [[PAPOUS - The Story Teller|PAPOUS]] - The Story Teller (2003-2005)
 
* [[PAPOUS - The Story Teller|PAPOUS]] - The Story Teller (2003-2005)
 
* [[IPSOM]] - Indexing, Integration and Sound Retrieval in Multimedia Documents (2000-2004)
 
* [[IPSOM]] - Indexing, Integration and Sound Retrieval in Multimedia Documents (2000-2004)
Line 103: Line 72:
 
* [[EDIFALA]] (in Portuguese) - Vocal Support System for Oral and Motor Handicapped (1993-1997)
 
* [[EDIFALA]] (in Portuguese) - Vocal Support System for Oral and Motor Handicapped (1993-1997)
  
=== Bilateral contracts ===
+
== Past Bilateral contracts ==
  
 +
* [[LIFAPOR (Spoken Books in European and Brazilian Portuguese)|LIFAPOR]]  - Spoken Books in European and Brazilian Portuguese (2005-2007)
 
* [[ARARA]] - Automatic directory assistant service for Portuguese Telecom, together with Philips Speech Techonology (1999-2001)
 
* [[ARARA]] - Automatic directory assistant service for Portuguese Telecom, together with Philips Speech Techonology (1999-2001)
 
* [[SVIT]] (in Portuguese) - Partial Automation of Directory Services Based on Synthesis of Telephone Numbers (1995-1999)
 
* [[SVIT]] (in Portuguese) - Partial Automation of Directory Services Based on Synthesis of Telephone Numbers (1995-1999)
  
== Finished before 1995 ==
+
== Projects finished before 1995 ==
  
 
* [[WERNICKE]] - A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System (1992-1995)
 
* [[WERNICKE]] - A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System (1992-1995)

Latest revision as of 17:06, 4 February 2021

International Projects

Please see https://www.inesc-id.pt.

National Projects

Please see https://www.inesc-id.pt.

Internal Projects

  • COVID19 - Detecção de COVID-19 a partir de tosse e fala

Past International Projects

  • COST Action IC1307 - The European Network on Integrating Vision and Language (iV&L Net) (2013-2017)
  • COST Action IC1206 - De-identification for privacy protection in multimedia content (2013-2017)
  • COST Action IS1312 - TextLink: Structuring Discourse in Multilingual Europe (2013-2017)
  • RAGE - Realising an Applied Gaming Ecosystem (2015-2018)
  • SPEDIAL - Spoken Dialogue Analytics [1] (2013-2015)
  • DIRHA - Voice-enabled automated home environments based on distant-speech interaction in different languages (2012-2015)
  • euTV - Adaptive Channels in Europe (2010-2012)
  • METANET4U - Network of Excellence forging the multilingual Europe Technology Alliance (2010-2012)
  • LIREC - LIving with Robots and InteractivE Companions (2008-2012)
  • I-DASH - The Investigator’s Dashboard (2008-2010)
  • VIDIVIDEO - Interactive semantic video search with a large thesaurus of machine learned audio-visual concepts(2007-2010)
  • COST-2102 - Cross-Modal Analysis of Verbal and Non-verbal Communication (2006-2010)
  • COST-2103 - Advanced Voice Function Assessment (2006-2010)
  • E-Circus - Education through Characters with Emotional Intelligence and Role-playing Capabilities that Understand Social Interaction (2006-2008)
  • COST 277 - Nonlinear Speech Processing (2001-2005)
  • COST 278 - Spoken Language Interaction in Telecommunication (2001-2005)
  • ALERT - Alert System for Selective Dissemination of Multimedia Information (2000-2002)
  • AUDIOLING-LP - Multimedia course for foreign students of the Portuguese language (Socrates Program)
  • SPEECHDAT - Speech Databases for for Creation of Voice Driven Teleservices (1994-1999)
  • VODIS - Advanced Speech Technologies for Voice Operated Driver Information Systems (1995-1999)
  • SPRACH - Speech Recognition ALgorithms for Connectionist Hybrids (1995-1998)
  • ELSNET - European Network in Language and Speech
  • ECESS - European Center of Excellence on Speech Synthesis

Past National Projects

  • TRATAHI - TRAnslation and Transcription Assisted by Humans on the Internet (2015)
  • INSIDE - Intelligent Networked Robot Systems for Symbiotic Interaction with Children with Impaired Development (2014-2018)
  • IC4U - Decision support systems for preventing ICU readmissions (2013-2015)
  • MISNIS - Intelligent Mining of Public Social Networks’ Influence in Society (2013-2015)
  • MT4M - Machine Translation for Microblogs (2015)
  • VOCE - Voice coaching for reduced stress (2012-2015)
  • SUSPECT - SecUre SPEeCh Technologies (2012-2015)
  • VITHEA - Virtual Therapist for Aphasia treatment (2010-2012)
  • AVOZ - Models for automatic speech recognition for the Elderly (2012-2013)
  • PoSTPort - POrting Speech Technologies to other varieties of Portuguese (2008-2010)
  • OBIAN Lexical and grammatical resources for recognizing syntactic-semantic relationships on text (2010-2012)
  • FalaComigo - Enhance the Cultural Tourism through the Interaction with Virtual Characters (2010-2013)
  • REAP.PT - Computer Assisted Language Learning: Reading Practice (2009-2012)
  • PT-STAR - Speech Translation Advanced Research to and from Portuguese (2009-2012)
  • ARIA - Ambient-assisted Reading Interfaces for the Ageing-society (2010-2012)
  • FleetMod - FleetMod (simulate and predict the behavior of the skippers of fishing vessels to provide a framework to test the effectiveness of different management policies) (2008-2011)
  • StopFire - StopFire (a distributed intelligent system for forest fire combat aid) (2007-2011)
  • Tecnovoz - Tecnologia de Reconhecimento e Síntese de Voz (2006-2008)
  • RiCoBa - Rich Content Books for All (2005-2007)
  • LECTRA - Rich Transcription of Lectures for E-Learning Applications (2005-2007)
  • NLE GRID - Natural Language Engineering on a Computational Grid (2005-2007)
  • WFST - Weighted Finite State Transducers Applied to Spoken Language Processing (2004-2007)
  • DIGA - Dialog Interface for Global Access (2004-2007)
  • PAPOUS - The Story Teller (2003-2005)
  • IPSOM - Indexing, Integration and Sound Retrieval in Multimedia Documents (2000-2004)
  • ATA - Automatic Terms Acquisition (2001-2003)
  • FALA2 (2000-2003)
  • CITE-IV - Augmentative Communication Tools in Portuguese (1999-2000)
  • DIXI+ - A Text-to-Speech Synthesizer in Portuguese for Alternative and Augmentative Communication (1999-2001)
  • REC - Speech Recognition Applied to Telecommunications (1997-2000)
  • PRAXIS_FALA - Reconhecimento de fala de Alto Desempenho em Português (1997-1999)
  • CORAL - Labelled Spoken Dialogue Corpus (1997-1999)
  • BDFALA (in Portuguese) - Spoken Database for European Portuguese (1994-1998)
  • EDIFALA (in Portuguese) - Vocal Support System for Oral and Motor Handicapped (1993-1997)

Past Bilateral contracts

  • LIFAPOR - Spoken Books in European and Brazilian Portuguese (2005-2007)
  • ARARA - Automatic directory assistant service for Portuguese Telecom, together with Philips Speech Techonology (1999-2001)
  • SVIT (in Portuguese) - Partial Automation of Directory Services Based on Synthesis of Telephone Numbers (1995-1999)

Projects finished before 1995

  • WERNICKE - A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System (1992-1995)
  • RELATOR - A European Network of Repositories for Linguistic Resources (1993-1994)
  • SAM_A (ESPRIT III) - Multi-Lingual Speech Input/Output Assessment, Methodology and Standardization (1992-1993)
  • ONOMASTICA (Language Research Engineering) - Multi-Language Pronounciation Dictionary of Proper Names and Place Names (1993-1995)
  • SUNSTAR (ESPRIT II) - Integration and Design of Speech Understanding Interfaces (1989-1992)
  • HCM-ELSNET (Human Capital Mobility) - Phrase Level Phonology and Dialogue & Discourse (1994-1996)
  • EUREKA 151 - High quality speech coding at medium-to-low bit rates (1987-1990)
  • COST 229 - Applications od Digital Signal Processing to Telecommunications (1990-1993)
  • COMETT - A Trans-European Platform for Transferable Continuing Education in Digital Signal Processing (1990-1993)