Fernando Batista

From HLT@INESC-ID

Fernando Batista received his PhD (2011) in Computer Science and Engineering from Instituto Superior Técnico (IST). He is currently an Associate Professor at Iscte - University Institute of Lisbon, and a researcher at INESC-ID, Lisbon. He was President of the Pedagogical Council of ISCTE-IUL (2017-2019), a member of the Permanent Committee of the Pedagogical Council of ISCTE-IUL (2015-2017), and a member of the Pedagogical Committee of the ISTA School (2015-2017). He has participated in several European and national projects, and coordinated the INESC-ID team in the SpeDial project (2014-2015). His current research focuses on spoken and written Natural Language Processing, Machine Learning, and Text Mining for social media. He was a member of the organisation team of the Lisbon Machine Learning Summer School (LxMLS) from 2016 to 2021, and a member of the LxMLS technical staff between 2011 and 2015. He was editorial co-chair of PROPOR 2020, editorial chair of EAMT 2020, web chair of IPMU 2020, co-chair of the Human-Human Languages track of SLATE 2019, co-chair of the demos session at PROPOR 2018, publication chair of IberSPEECH 2016, co-chair of the PROPOR 2016 Student Workshop, and handbook chair of EMNLP 2015. He has been a member of the Program Committee of several national and international conferences. He is a Senior Member of the IEEE and a member of ISCA.

Research Interests

  • Machine learning
  • Natural Language Processing
  • Text and Speech processing
  • Shallow Parsing
  • Also: Operating Systems and Computer Architectures

Total Publications: 163

Total Supervisions: 46

Internships

  • Domain Specific Conversational Agents using Machine Learning and Natural Language Generation
    Stefania Budulan
    Helena Moniz (advisor), Fernando Batista (coadvisor)
    Internship, POLITEHNICA University of Bucharest - Romania, 2024-05-01 - 2024-08-31

 

Total Projects: 18

International Projects

  • LAW TRAIN - Mixed-reality environment for training cross-national teams in joint investigative interrogation - Intelligent interrogation training simulator
    INESC-ID Lisboa, Horizon 2020, 2015-05-01 - 2018-04-29
  • SpeDial - Spoken Dialogue Analytics
    Athena Research and Innovation Center in Information Communication and Knowledge Technologies, EC FP7-ICT-2013-SME-DCA, 2013-12-01 - 2015-11-30
  • DIRHA - Distant-speech Interaction for Robust Home Applications
    Fondazione Bruno Kessler, European Commission - 7th Framework Programme, 2012-01-01 - 2014-12-01

National Projects

  • CRAI - Center for Responsible AI
    Unbabel, IAPMEI, Societal Digital Transformation, 2023-01-01 - 2025-12-31
  • MAICT - A Multimodal Approach for Identifying Conspiracy Theories in Social Media
    INESC-ID Lisboa, FCT, Societal Digital Transformation, 2022-01-01 - 2023-06-30
  • HATE COVID - HATE COVID-19.PT - Detecting Overt and Covert Hate Speech in Social Media
    INESC-ID Lisboa, FCT, Societal Digital Transformation, 2021-05-01 - 2022-07-30
  • U4V - Unbabel for Video
    INESC-ID Lisboa, Unbabel, Societal Digital Transformation, 2019-02-01 - 2022-12-31
  • eSPERTo - Sistema de Parafraseamento para Edição e Revisão de Texto
    INESC-ID Lisboa, FCT, 2014-10-01 - 2015-09-30
  • MISNIS - Intelligent Mining of Public Social Networks’ Influence in Society
    INESC-ID Lisboa, FCT, 2013-04-01 - 2015-03-31
  • PT-STAR - Speech Translation Advanced Research to and from Portuguese
    INESC-ID Lisboa, FCT Carnegie Mellon, 2009-05-01 - 2012-07-31
  • POSTPORT - POrting Speech Technologies to other varieties of PORTuguese
    INESC-ID Lisboa, FCT, 2008-01-01 - 2010-12-31
  • NLE-GRID - Natural Language Engineering on a Computational GRID
    INESC-ID Lisboa, FCT, Life and Health Technology, 2005-06-01 - 2007-01-31
  • LECTRA - Rich Transcription of Lectures for E-Learning Applications
    INESC-ID Lisboa, FCT, 2005-03-01 - 2007-09-30
  • ATA - Automatic Term Acquisition
    INESC-ID Lisboa, FCT, Life and Health Technology, 2001-01-01 - 2003-03-31

Contracts

  • Porto Editora 2 - Porto Editora 2
    INESC-ID Lisboa, Porto Editora, 2006-03-17 - 2007-12-31
  • Porto Editora - Porto Editora
    INESC-ID Lisboa, Porto Editora, Societal Digital Transformation, 2004-06-01 - 2004-11-06

Projects

Past Projects

  • SpeDial - Spoken Dialogue Analytics
  • COPAS - Contrast and Parallelism in Speech (03-2012 to 07-2015)
  • MISNIS - Intelligent Mining of Public Social Networks’ Influence in Society (04-2013 to 07-2015)
  • DIRHA - Distant-speech Interaction for Robust Home Applications (01-2012 to 12-2014)
  • METANET4U - European project aiming at supporting language technology for European languages and multilingualism (02-2011 to 02-2013)
  • PT-STAR - Speech Translation Advanced Research to and from Portuguese (05-2009 to 07-2012)
  • POSTPORT - Porting Speech Technologies to other Varieties of Portuguese (01-2008 to 06-2011)
  • Automatic Punctuation and Capitalization for automatic speech transcripts
  • Implementation of a shallow parser
