HLT@INESC-ID - User contributions [en]

VoxCeleb-PT

2022-06-28T18:39:28Z

Jmen:

VoxCeleb-PT is a small dataset of voices of Portuguese celebrities that can be used as a language-specific extension of the widely used VoxCeleb corpus.

== Statistics ==

The dataset contains 51 celebrities, of which 23 are female. Altogether, Voxceleb-PT contains 26,663 automatically transcribed utterances (.wav 16kHz, pcm_s16le). The total duration is 17:55:14, with an average of 20 min/spk, and utterance duration 2-5s.

The dataset can be obtained in its original form and with both development (train/val) and test sets, all containing the full speaker cohort. All sets contain speaker id, age, gender and manually corrected transcriptions. As such, VoxCeleb-PT should prove useful for a variety of tasks, namely ASR, Speaker Verification and Age/Gender Recognition.

== Download ==

Please fill out [https://forms.gle/EuLPwgVLWQdBBPqYA this form ]to access the data.

* Original. The dataset follows the kaldi file system: each folder contains speech files from a given speaker and the following: ''text'', ''utt2spk'' and ''wav.scp''. The ''spk_info.csv'' file maps the speaker id to the celebrities' age and gender.

* With splits. Contains Train,dev and hidden test splits.

* Raw. Raw .mp4 and .wav files together with subtitle files.

== License ==

This dataset is available to download for research purposes under a [https://creativecommons.org/licenses/by/4.0/ Creative Commons Attribution 4.0 International License]. The copyright remains with the original owners of the video.

The views and opinions expressed by speakers in the dataset are those of the individual speakers and do not necessarily reflect the positions of the University of Lisbon, INESC-ID, or the authors.

VoxCeleb-PT

2022-06-28T18:36:04Z

Jmen:

VoxCeleb-PT is a small dataset of voices of Portuguese celebrities that can be used as a language-specific extension of the widely used VoxCeleb corpus.

== Statistics ==

The dataset contains 51 celebrities, of which 23 are female. Altogether, Voxceleb-PT contains 26,663 automatically transcribed utterances (.wav 16kHz, pcm_s16le). The total duration is 17:55:14, with an average of 20 min/spk, and utterance duration 2-5s.

The dataset can be obtained in its original form and with both development (train/val) and test sets, all containing the full speaker cohort. All sets contain speaker id, age, gender and manually corrected transcriptions. As such, VoxCeleb-PT should prove useful for a variety of tasks, namely ASR, Speaker Verification and Age/Gender Recognition.

== Download ==

Please fill out [https://forms.gle/EuLPwgVLWQdBBPqYA this form ]to access the data.

* Original. The dataset follows the kaldi file system: each folder contains speech files from a given speaker and the following: ''text'', ''utt2spk'' and ''wav.scp''. The ''spk_info.csv'' file maps the speaker id to the celebrities' age and gender.

* With splits. Contains Train,dev and hidden test splits.

* Raw. Raw .mp4 and .wav files together with subtitle files.

== License ==

This dataset is available to download for research purposes under a Creative Commons Attribution 4.0 International License[https://creativecommons.org/licenses/by/4.0/]. The copyright remains with the original owners of the video.

The views and opinions expressed by speakers in the dataset are those of the individual speakers and do not necessarily reflect the positions of the University of Lisbon, INESC-ID, or the authors.

VoxCeleb-PT

2022-06-21T16:07:42Z

Jmen: /* License */

VoxCeleb-PT is a small dataset of voices of Portuguese celebrities that can be used as a language-specific extension of the widely used VoxCeleb corpus.

== Statistics ==

The dataset contains 51 celebrities, of which 23 are female. Altogether, Voxceleb-PT contains 26,663 automatically transcribed utterances (.wav 16kHz, pcm_s16le). The total duration is 17:55:14, with an average of 20 min/spk, and utterance duration 2-5s.

The dataset can be obtained in its original form and with both development (train/val) and test sets, all containing the full speaker cohort. All sets contain speaker id, age, gender and manually corrected transcriptions. As such, VoxCeleb-PT should prove useful for a variety of tasks, namely ASR, Speaker Verification and Age/Gender Recognition.

== Download ==

* Original. The dataset follows the kaldi file system: each folder contains speech files from a given speaker and the following: ''text'', ''utt2spk'' and ''wav.scp''. The ''spk_info.csv'' file maps the speaker id to the celebrities' age and gender.

* With splits

== License ==

This dataset is available to download for research purposes under a Creative Commons Attribution 4.0 International License[https://creativecommons.org/licenses/by/4.0/]. The copyright remains with the original owners of the video.

The views and opinions expressed by speakers in the dataset are those of the individual speakers and do not necessarily reflect the positions of the University of Lisbon, INESC-ID, or the authors.

VoxCeleb-PT

2022-06-21T16:06:43Z

Jmen:

VoxCeleb-PT is a small dataset of voices of Portuguese celebrities that can be used as a language-specific extension of the widely used VoxCeleb corpus.

== Statistics ==

The dataset contains 51 celebrities, of which 23 are female. Altogether, Voxceleb-PT contains 26,663 automatically transcribed utterances (.wav 16kHz, pcm_s16le). The total duration is 17:55:14, with an average of 20 min/spk, and utterance duration 2-5s.

The dataset can be obtained in its original form and with both development (train/val) and test sets, all containing the full speaker cohort. All sets contain speaker id, age, gender and manually corrected transcriptions. As such, VoxCeleb-PT should prove useful for a variety of tasks, namely ASR, Speaker Verification and Age/Gender Recognition.

== Download ==

* Original. The dataset follows the kaldi file system: each folder contains speech files from a given speaker and the following: ''text'', ''utt2spk'' and ''wav.scp''. The ''spk_info.csv'' file maps the speaker id to the celebrities' age and gender.

* With splits

== License ==

This dataset is available to download for research purposes under a Creative Commons Attribution 4.0 International License. The copyright remains with the original owners of the video.

The views and opinions expressed by speakers in the dataset are those of the individual speakers and do not necessarily reflect the positions of the University of Lisbon, INESC-ID, or the authors.

VoxCeleb-PT

2022-06-21T15:37:03Z

Jmen: /* Dowload */

John Mendonca

2022-06-15T15:06:15Z

Jmen:

{{infobox|name=John Mendonça
|username=jmen
|contact=John.mendonca
}}
[[File:Portrait.jpg|200px|thumb|right]]

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca(at)inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

<inesc-id what='person' id='20720'></inesc-id>

[[category:People]]
[[category:Researchers]]

User:Jmen

2022-06-07T15:40:29Z

Jmen:

[[File:Portrait.jpg|200px|thumb|right]]

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca[at]inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Affiliate Student PhD Researcher: Language Technologies Institute, Carnegie Mellon University.

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Mendonca, Rui Correia, Mariana Lourenço, João Freitas and Isabel Trancoso, '''Towards Speaker Verification for Crowdsourced Speech Collections''', ''In'' LREC 2022, June 2022

* John Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020

User:Jmen

2022-06-07T15:40:09Z

Jmen: /* Academic Activities */

[[File:Portrait.jpg|200px|thumb|right]]

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca@inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Affiliate Student PhD Researcher: Language Technologies Institute, Carnegie Mellon University.

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Mendonca, Rui Correia, Mariana Lourenço, João Freitas and Isabel Trancoso, '''Towards Speaker Verification for Crowdsourced Speech Collections''', ''In'' LREC 2022, June 2022

* John Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020

User:Jmen

2022-06-07T15:34:52Z

Jmen:

[[File:Portrait.jpg|200px|thumb|right]]

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca@inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Mendonca, Rui Correia, Mariana Lourenço, João Freitas and Isabel Trancoso, '''Towards Speaker Verification for Crowdsourced Speech Collections''', ''In'' LREC 2022, June 2022

* John Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020

Resources

2022-06-07T15:32:32Z

Jmen:

{{TOCright}}
L²F has been particularly active in the creation of linguistic resources for European Portuguese. The cooperation with CLUL has been of paramount importance in this activity. The resources listed are in inverse chronological order. The corresponding webpages are in Portuguese.

== Corpora ==

=== Speech ===

* POSTPORT - European, Brazilian and African varieties of Portuguese
* [[LECTRA Corpus|LECTRA]] - Classroom lectures
* [[IPSOM Pilot Corpus|IPSOM]] - Aligned spoken books
* [[ALERT Corpus|ALERT]] - Broadcast news
* [[CORAL Corpus|CORAL]] - Spoken dialogues (map task)
* [[BD-PÚBLICO Corpus|BD-PÚBLICO]]- Large vocabulary, speaker-independent, continuous speech
* [[SPEECHDAT Corpus|SPEECHDAT]] - Multi-purpose telephone speech database
* [[BDFALA Corpus|BDFALA]] - Speech analysis / synthesis
* [[EUROM.1 Corpus|EUROM.1]] - Multi-Lingual speech corpus for phonetic comparison
* '''[[VoxCeleb-PT]]''' - annotated corpus of European Portuguese celebrities.

=== Bilingual Corpus ===

* [[Word_Alignments|Golden collection of parallel multi-language word alignments]] - Manually annotated word alignments between six european languages taken from the Europarl common test set (more information on the [[Speech-to-speech Translation]] information page)

== Lexica ==
Pronunciation lexica (besides the ones included in the above corpora documentation):
* '''ONOMASTICA''' (Proper names of 11 European languages, in cooperation with TLP - Telefones de Lisboa e Porto): ~ 100.000 names of people, streets, towns and companies
* '''PF''' (Português Fundamental): ~ 26.000 citation forms

The pronunciation lexica developed by L²F use the SAMPA phonetic alphabet. See the [[SAMPA Table for European Portuguese|SAMPA table for European Portuguese]] and some comments about its design.

== See Also ==

* [[Resource Links]]

=== Newspapers ===
* [http://www.ims.uni-stuttgart.de/info/Newspapers.html List of Newspapers on the Internet] produced by [[Isabel Trancoso]] and maintained jointly with IMS Stuttgart.

=== Language Resource Centers ===
* [http://www.linguateca.pt Linguateca] (Distributed language resource center for Portuguese)
* [http://www.elra.info ELRA] (European Language Resources Association)
* [http://morph.ldc.upenn.edu/ LDC] (Linguistic Data Consortium)

=== Dictionaries ===
* [http://crnvmc.cern.ch/FIND/DICTIONARY? English/Technical Dictionary]
* [gopher://uts.mcc.ac.uk/77/gopherservices/enquire.english American English Dictionary]
* [gopher://gopher.princeton.edu:5003/7 Webster's Dictionary]
* [gopher://info.mcc.ac.uk/77/miscellany/acronyms/.index/index Acronyms Dictionary]
* [http://www.fmi.uni-passau.de/htbin/lt/lte English-German Dictionary]
* [http://www.fmi.uni-passau.de/htbin/lt/ltd German-English Dictionary]
* [http://nova.sti.nasa.gov/nasa-thesaurus.html NASA Thesaurus]

[[category:Resources]]

Downloads

2022-06-07T15:32:20Z

Jmen:

{{TOCright}}
These are tools and resources made available by the L²F.

== Tools ==

* [http://qa.l2f.inesc-id.pt/wiki/index.php/Systems#Just.Ask Just.Ask] is a Question-Answering system for English

* [http://www.l2f.inesc-id.pt/~lco/eugenio/index.html Eugenio] is a word predictor for European Portuguese

== Corpora Resources ==

=== Automatic Key Phrase Extraction ===
* '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.

=== Translation ===

* '''[[Word_Alignments|Golden collection of parallel multi-language word alignments]]''' - Manually annotated word alignments between six european languages taken from the Europarl common test set (more information on the [[Speech-to-speech Translation]] information page)

=== Recommendation ===

* '''[[Fairy tale corpus]]''' - Corpus of fairy tales: the corpus is divided in semantically related clusters.

== Lexical Resources ==

=== Other ===

* [http://www.l2f.inesc-id.pt/resources/Portug.Dict.sit Portuguese Dictionary] for [http://www.eg.bucknell.edu/~excalibr/excalibur.html Excalibur], a spell checker for the Macintosh. Assembled by [[Nuno Mamede]] in association with [http://label2.ist.utl.pt/label/ LabEL]

VoxCeleb-PT

2022-06-07T14:49:07Z

Jmen:

VoxCeleb-PT

2022-06-07T14:48:31Z

Jmen:

Downloads

2022-06-07T14:34:36Z

Jmen:

{{TOCright}}
These are tools and resources made available by the L²F.

== Tools ==

* [http://qa.l2f.inesc-id.pt/wiki/index.php/Systems#Just.Ask Just.Ask] is a Question-Answering system for English

* [http://www.l2f.inesc-id.pt/~lco/eugenio/index.html Eugenio] is a word predictor for European Portuguese

== Corpora Resources ==

=== Speech ===
* '''[[VoxCeleb-PT]]''' - annotated corpus of European Portuguese celebrities.

=== Automatic Key Phrase Extraction ===
* '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.

=== Translation ===

* '''[[Word_Alignments|Golden collection of parallel multi-language word alignments]]''' - Manually annotated word alignments between six european languages taken from the Europarl common test set (more information on the [[Speech-to-speech Translation]] information page)

=== Recommendation ===

* '''[[Fairy tale corpus]]''' - Corpus of fairy tales: the corpus is divided in semantically related clusters.

== Lexical Resources ==

=== Other ===

* [http://www.l2f.inesc-id.pt/resources/Portug.Dict.sit Portuguese Dictionary] for [http://www.eg.bucknell.edu/~excalibr/excalibur.html Excalibur], a spell checker for the Macintosh. Assembled by [[Nuno Mamede]] in association with [http://label2.ist.utl.pt/label/ LabEL]

VoxCeleb-PT

2022-06-07T14:34:31Z

Jmen: Created page with "VoxCeleb-PT is a small dataset of voices of Portuguese celebrities that can be used as a language-specific extension of the widely used VoxCeleb corpus. The dataset contain..."

VoxCeleb-PT is a small dataset of voices of Portuguese celebrities that can be used as a language-specific extension of the widely used VoxCeleb corpus.

The dataset contains 51 celebrities, of which 23 are female. Altogether, Voxceleb-PT contains 26,663 automatically transcribed utterances (.wav 16kHz, pcm_s16le). The total duration is 17:55:14, with an average of 20 min/spk, and utterance duration 2-5s.

The dataset can be obtained in its original form and with both development (train/val) and test sets, all containing the full speaker cohort. All sets contain speaker id, age, gender and manually corrected transcriptions. As such, VoxCeleb-PT should prove useful for a variety of tasks, namely ASR, Speaker Verification and Age/Gender Recognition.

John Mendonca

2021-12-10T15:27:15Z

Jmen:

[[File:Portrait.jpg|200px|thumb|right]]

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca(at)inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Daniel Fidalgo Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020

User:Jmen

2021-12-10T15:26:49Z

Jmen:

[[File:Portrait.jpg|200px|thumb|right]]

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca@inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Daniel Fidalgo Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020

File:Portrait.jpg

2021-12-10T15:22:57Z

Jmen:

John Mendonca

2021-12-10T15:20:08Z

Jmen: /* Contacts */

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca(at)inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Daniel Fidalgo Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020

John Mendonca

2021-12-10T15:19:57Z

Jmen: Created page with "John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of..."

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca@inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Daniel Fidalgo Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020

John Daniel Fidalgo Mendonça

2021-02-19T16:50:41Z

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca@inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Daniel Fidalgo Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020

User:Jmen

2021-02-19T16:48:57Z

John Mendonça is a Junior Researcher of the HLT (Human Language Technologies Laboratory) and a PhD Student of the Doctoral Programme in Electrical and Computer Engineering of [https://tecnico.ulisboa.pt/pt/ IST]. He obtained his MSc degree in 2020 with the thesis "Automatic Detection of Profile Features". In it, he applied speech processing techniques, more specifically in speaker recognition and paralinguistics, for the collection of crowdsourced and health related datasets. More recently he joined a PhD Programme under the MAIA project, a joint CMU/INESC-ID/IT/Unbabel collaboration for Intelligent AI Agents in chatbots. His PhD work is closely related to conversational quality estimation.

== Contacts ==

* '''John Mendonça'''
:- '''Email''': John.mendonca@inesc-id.pt
:- '''Role''': Junior Researcher | PhD Student
:- Human Language Technologies Lab, INESC-ID | Instituto Superior Técnico
:- Rua Alves Redol, 9, 1000-029 Lisboa, Portugal
:- inesc-id.pt | tecnico.ulisboa.pt

== Research Interests ==

In 2020, he joined the HLT team for his Master Thesis Research. During this time, he led the HLT Team's contribution to the ComParE 2020 Breathing sub-challenge, pertaining the estimation of breathing signals. In 2021, his research focus shifted to Natural Language Processing (NLP), with special emphasis in conversational agent evaluation metrics.

== Academic Activities ==

* PhD Student: PDEEC

* Teaching Assistant: [https://fenix.tecnico.ulisboa.pt/disciplinas/PF364511132646/2020-2021/2-semestre Spoken Language Processing]

== Publications ==

* John Daniel Fidalgo Mendonça, Francisco Teixeira, Isabel Trancoso, Alberto Abad, '''Analyzing Breath Signals for the Interspeech 2020 ComParE Challenge''', ''In'' Interspeech 2020, October 2020