Fernando Batista

From HLT@INESC-ID


Fernando Batista

Fernando Batista received his PhD (2011) in Computer Science and Engineering from Instituto Superior Técnico (IST). He is currently Associate Professor at Iscte - University Institute of Lisbon, and a researcher at INESC-ID, Lisbon. He was President of the Pedagogical Council of ISCTE-IUL (2017-2019), member of the Permanent Committee of the Pedagogical Council of ISCTE-IUL (2015-2017), and member of the Pedagogical Committee of the ISTA School (2015-2017). He participated in several European and National projects, and coordinated the INESC-ID team in the SpeDial project (2014-2015). His current research focuses on spoken and written Natural Language Processing, Machine Learning, and Text Mining for social media. Member of the organisation team of the Lisbon Machine Learning Summer School (LXMLS) from 2016 to 2021; and he was also member of the LxMLS technical staff between 2011 and 2015. He was the editorial co-chair of PROPOR 2020, editorial chair of EAMT 2020, web chair of the IPMU 2020, co-chair of the Human-Human languages track of SLATE 2019, co-chair of the demos session in PROPOR 2018, publication chair of IberSPEECH 2016, co-chair of the PROPOR 2016 Student Workshop, publication chair of IberSPEECH 2016, co-chair of the PROPOR 2016 Student Workshop, and handbook chair of EMNLP 2015. He was member of the Program Committee of several national and international conferences. He is Senior Member of the IEEE, and member of the ISCA Speech.

Publications

Books

2020

Edited Books

2016

  • Alberto Abad, Alfonso Ortega, Antonio Teixeira, Carmen Mateo, Carlos Hinarejos, Fernando Perdigão, Fernando Batista, Nuno J. Mamede, Advances in Speech and Language Technologies for Iberian Languages: Third International Conference, IberSPEECH 2016, Springer International Publish, Lecture Notes in Computer Science, 10077, Lisboa, Portugal, November 2016

Book Chapters

2022

  • Tanara Zingano Kuhn, Ida Rebelo-Arnold, Anabela Barreiro, Isabel Garcez, Fernando Batista, História, política e cultura no mundo lusófono, chapter Análise Comparativa das Edições Portuguesa e Brasileira da Obra Os Livros Que Devoraram o Meu Pai, de Afonso Cruz, Editora LiberArs Ltda, December 2022

2019

  • Marco Vicente, Fernando Batista, Joao P. Carvalho, Gender Detection of Twitter Users Based on Multiple Information Sources, chapter Gender Detection of Twitter Users Based on Multiple Information Sources, Springer International Publishing, May 2019

2018

  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Interactions Between Computational Intelligence and Mathematics Part 2, chapter Gender Detection of Twitter Users Based on Multiple Information Sources, Springer International Publish, November 2018
  • Hugo Rosa, Fernando Batista, Joao P. Carvalho, Interactions Between Computational Intelligence and Mathematics (Special issue of ESCIM2015), chapter Page Rank vs. Katz: Is the centrality algorithm choice relevant to measure user influence in Twitter?, Springer, October 2018

2016

  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Information Processing and Management of Uncertainty in Knowledge-Based Systems - Volume 611 of the series Communications in Computer and Information Science, chapter Creating Extended Gender Labelled Datasets of Twitter Users, pp 690-702, Springer, June 2016

2015

  • Marco Paulo Fernandes Vicente, Joao P. Carvalho, Fernando Batista, Communications in Computer and Information Science Vol 563 - International Languages, Applications and Technologies, chapter Using Unstructured Profile Information for Gender Classification of Portuguese and English Twitter Users, pp 57-64, Springer, December 2015
  • Mariana Juliao, Jorge Silva, Ana Aguiar, Helena Moniz, Fernando Batista, Languages, Applications and Technologies, chapter Speech Features for Discriminating Stress Using Branch and Bound Wrapper Search, Springer International Publish, , December 2015
  • Jose Moura, Fernando Batista, Elsa Alexandra Cabral da Rocha Cardoso, Luis Nunes, Chapter 6. Intelligent Management and Efficient Operation of Big Data, chapter Handbook of Research on Trends and Future Directions in Big Data and Web Intelligence, IGI Global, September 2015

2004

International Journals

2023

  • Ana Rita Peixoto, Ana de Almeida, Nuno António, Fernando Batista, Ricardo Ribeiro, Diachronic profile of startup companies through social media, Social Network Analysis and Mining, Springer Nature, vol. 13, n. 1, pages 52, doi: https://doi.org/10.1007/s13278-023-01055-2, https://doi.org/10.1007/s13278-023-01055-2, March 2023

2022

  • Mariana Cavique, Ricardo Ribeiro, Fernando Batista, Antónia Correia, Examining Airbnb guest satisfaction tendencies: a text mining approach, Current Issues in Tourism, Routledge, vol. 0, n. 0, pages 1-16, doi: 10.1080/13683500.2022.2115877, September 2022
  • Mariana Cavique, Antónia Correia, Ricardo Ribeiro, Fernando Batista, What are Airbnb hosts advertising? A longitudinal essay in Lisbon, Consumer Behavior in Tourism and Hospitality, Emerald, vol. 17, n. 3, pages 312--325, doi: 10.1108/CBTH-10-2021-0253, July 2022

2021

  • Nuno M Guerreiro, Ricardo Rei, Fernando Batista, Towards better subtitles: A multilingual approach for punctuation restoration of speech transcripts, Expert Systems with Applications, vol. 186, pages 115740, doi: https://doi.org/10.1016/j.eswa.2021.115740, December 2021
  • Joana Azinhaes, Fernando Batista, J. C. Ferreira, eWOM for public institutions: application to the case of the Portuguese Army. Social Network Analysis and Mining, Social Network Analysis and Mining, vol. 11, n. 1, pages 118, doi: 10.1007/s13278-021-00837-w, November 2021
  • Elizabeth Fernandes, Sérgio Moro, Paulo Cortez, Fernando Batista, Ricardo Ribeiro, A Data-driven Approach to Measure Restaurant Performance Combining Online Reviews and Historical Sales Data, International Journal of Hospitality Management, Elsevier, vol. 94, doi: 10.1016/j.ijhm.2020.102830, April 2021

2019

  • Sérgio Moro, Fernando Batista, Paulo Rita, Cristina Oliveira, Ricardo Ribeiro, Are the States United? An analysis of US hotels’ offers through TripAdvisor’s eyes, Journal of Hospitality and Tourism Research, SAGE, doi: 10.1177/1096348019854793, June 2019

2018

  • Sérgio Moro, Paulo Rita, Cristina Oliveira, Fernando Batista, Ricardo Ribeiro, Leveraging national tourist offices through data analytics, International Journal of Culture, Tourism and Hospitality Research, vol. 14, n. 4, doi: 10.1108/IJCTHR-04-2018-005, July 2018
  • Nuno António, Ana de Almeida, Luís Nunes, Fernando Batista, Ricardo Ribeiro, Hotel online reviews: different languages, different opinions, Information Technology & Tourism, Springer Berlin Heidelberg, vol. 18, n. 1, pages 157--185, doi: 10.1007/s40558-018-0107-x, April 2018
  • Nuno António, Ana Almeida, Luis Nunes, Fernando Batista, Ricardo Ribeiro, Hotel online reviews: Creating a multi-source aggregated index, International Journal of Contemporary Hospitality Management, vol. 30, n. 10, doi: 10.1108/IJCHM-05-2017-0302, March 2018

2017

  • Joao P. Carvalho, Hugo Rosa, Gaspar Manuel Rocha Brogueira, Fernando Batista, MISNIS: An Intelligent Platform for Twitter Topic Mining, Expert Systems With Applications, Elsevier, vol. 89, pages 374-388, doi: https://doi.org/10.1016/j.eswa.2017.08.001, December 2017
  • Catarina Rebelo, Inês Pereira, Hugo Rosa, Fernando Batista, Joao P. Carvalho, Twitter: from platform for mobilization to platform for commentary. The 2014 meet in Lisbon, (OBS*)ERVATORIO, Obercom, vol. 11, n. 4, pages 19-41, Portugal, December 2017

2016

  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, A Smart System for Twitter Corpus Collection, Management and Visualization, International Journal of Technology and Human Interaction (IJTHI), IGI Global, vol. 13, n. 3, pages 13-32, December 2016
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Using geolocated tweets for characterization of Twitter in Portugal and the Portuguese administrative regions, Social Network Analysis and Mining, Springer, vol. 6, n. 1, pages 1-20, doi: DOI: 10.1007/s13278-016-0347-8, June 2016

2014

  • Helena Moniz, Fernando Batista, Ana Isabel Mata da Silva, Isabel Trancoso, Speaking style effects in the production of disfluencies, Speech Communication, vol. 65, pages 20-35, doi: 10.1016/j.specom.2014.05.004, November 2014
  • Ana Isabel Mata da Silva, Helena Moniz, Fernando Batista, Comparing phrase-final patterns across speech styles and groups in European Portuguese, Noveaux cahiers de linguistique francaise, n. 31, pages 171-176, Genève, Swiss, September 2014

2013

  • Fernando Batista, Ricardo Ribeiro, Sentiment Analysis and Topic Classification based on Binary Maximum Entropy Classifiers, Procesamiento de Lenguaje Natural, Sociedad Española para el Procesamiento de Lenguaje Natural, vol. 50, n. 1, pages 77–84, March 2013

2012

  • Fernando Batista, Helena Moniz, Isabel Trancoso, Nuno J. Mamede, Bilingual Experiments on Automatic Recovery of Capitalization and Punctuation of Automatic Speech Transcripts, IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, n. 2, pages 474 -- 485, doi: 10.1109/TASL.2011.2159594, February 2012

2008

  • Fernando Batista, Diamantino António Caseiro, Nuno J. Mamede, Isabel Trancoso, Recovering Capitalization and Punctuation Marks for Automatic Speech Recognition: Case Study for the Portuguese Broadcast News, Speech Communication, vol. 50, n. 10, pages 847-862, doi: 10.1016/j.specom.2008.05.008, October 2008

Edited Proceedings

2022

  • Vládia Pinheiro, Pablo Gamallo, Raquel Amaro, Carolina Scarton, Fernando Batista, Diego Silva, Catarina Magro, Hugo Pinto, Computational Processing of the Portuguese Language: 15th International Conference, PROPOR 2022, Fortaleza, Brazil, March 2123, 2022, Proceedings, Springer International Publishing, Lecture Notes in Artificial Intelligence, 13208, December 2022

2020

  • Paulo Quaresma, Renata Vieira, Sandra Aluísio, Helena Moniz, Fernando Batista, Teresa Gonçalves, Computational Processing of the Portuguese Language, 14th International Conference, PROPOR 2020, Springer, Lecture Notes in Artificial Intelligence, March 2020

2019

  • Ricardo Rodrigues, Jan Janousek, Luís Ferreira, Luísa Coheur, Fernando Batista, Hugo Gonçalo Oliveira, 8th Symposium on Languages, Applications and Technologies (SLATE 2019), Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, OpenAccess Series in Informatics (OASIcs), 74, http://www.dagstuhl.de/dagpub/978-3-95977-114-6, July 2019

International Conferences

2022

  • João Guarda, Marco Vicente, Fernando Batista, Gutenbrain: An Architecture for Equipment Technical Attributes Extraction from Piping and Instrumentation Diagrams, In KDIR 2022, vol. 1, pages 204-211, doi: 10.5220/0000165700003335, October 2022
  • Maria Inês Bico, Jorge Baptista, Fernando Batista, Esperança Cardeira, Early Experiments on Automatic Annotation of Portuguese Medieval Texts, In TPDL 2022, Springer International Publishing, vol. 13541, series Lecture Notes in Computer Science, pages 442--449, doi: https://doi.org/10.1007/978-3-031-16802-4_44, Italy, September 2022
  • Bernardo Cunha Matos, Raquel Bento Santos, Paula Carvalho, Ricardo Ribeiro, Fernando Batista, Comparing Different Approaches for Detecting Hate Speech in Online Portuguese Comments, In 11th Symposium on Languages, Applications and Technologies (SLATE 2022), Schloss Dagstuhl -- Leibniz-Zentrum für Informatik, vol. 104, series Open Access Series in Informatics (OASIcs), pages 10:1--10:12, doi: https://doi.org/10.4230/OASIcs.SLATE.2022.10, https://drops.dagstuhl.de/opus/volltexte/2022/16756, July 2022
  • Raquel Bento Santos, Bernardo Cunha Matos, Paula Carvalho, Fernando Batista, Ricardo Ribeiro, Semi-Supervised Annotation of Portuguese Hate Speech Across Social Media Domains, In 11th Symposium on Languages, Applications and Technologies (SLATE 2022), Schloss Dagstuhl -- Leibniz-Zentrum für Informatik, vol. 104, series Open Access Series in Informatics (OASIcs), pages 11:1--11:14, doi: https://doi.org/10.4230/OASIcs.SLATE.2022.11, https://drops.dagstuhl.de/opus/volltexte/2022/16757, July 2022
  • Paula Carvalho, Bernardo Cunha Matos, Raquel Bento Santos, Fernando Batista, Ricardo Ribeiro, Hate Speech Dynamics Against African descent, Roma and LGBTQI Communities in Portugal, In LREC 2022 - 13th Conference on Language Resources and Evaluation, European Language Resources Association (ELRA), June 2022
  • Beatriz Paula, João Coelho, Diogo Mano, Carlos Coutinho, João P. Oliveira, Ricardo Ribeiro, Fernando Batista, Collaborative Filtering for Mobile Application Recommendation with Implicit Feedback, In IEEE ICE - IAMOT Conference 2022, pages 1065 -- 1073, December 2022

2021

  • Rosária Bunga, Fernando Batista, Ricardo Ribeiro, From Implicit Preferences to Ratings: Video Games Recommendation based on Collaborative Filtering, In 13th International Joint Conference on Knowledge Discovery and Information Retrieval, SciTePress, October 2021
  • João Coelho, António Neto, Miguel Tavares, Carlos Coutinho, João Oliveira, Ricardo Ribeiro, Fernando Batista, Transformer-based language models for semantic search and mobile applications retrieval, In 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, SciTePress, pages 225-232, doi: 10.5220/0010657300003064, October 2021
  • João Coelho, António Neto, Miguel Tavares, Carlos Coutinho, Ricardo Ribeiro, Fernando Batista, Semantic Search of Mobile Applications Using Word Embeddings, In 10th Symposium on Languages, Applications and Technologies (SLATE 2021), Schloss Dagstuhl -- Leibniz-Zentrum für Informatik, vol. 94, series Open Access Series in Informatics (OASIcs), pages 12:1--12:12, doi: 10.4230/OASIcs.SLATE.2021.12, https://drops.dagstuhl.de/opus/volltexte/2021/14429, July 2021
  • Cátia Tavares, Ricardo Ribeiro, Fernando Batista, Sentiment Analysis of Portuguese Economic News, In 10th Symposium on Languages, Applications and Technologies (SLATE 2021), Schloss Dagstuhl -- Leibniz-Zentrum für Informatik, vol. 94, series Open Access Series in Informatics (OASIcs), pages 17:1--17:13, doi: 10.4230/OASIcs.SLATE.2021.17, July 2021
  • Ricardo Rei, Fernando Batista, Nuno Miguel Guerreiro, Luísa Coheur, Multilingual Simultaneous Sentence End and Punctuation Prediction, In Proceedings of the 1st Shared Task on Sentence End and Punctuation Prediction in NLG Text (SEPP-NLG 2021) held at SwissText 2021, June 2021

2020

  • Afonso Pinto, Helena Moniz, Fernando Batista, Detection of Emerging Words in Portuguese Tweets, In 9th Symposium on Languages, Applications and Technologies (SLATE 2020), Schloss Dagstuhl--Leibniz-Zentrum für Informatik, vol. 83, series OpenAccess Series in Informatics (OASIcs), pages 3:1--3:10, doi: https://doi.org/10.4230/OASIcs.SLATE.2020.3, September 2020
  • Soraia Filipe, Fernando Batista, Ricardo Ribeiro, Different Lexicon-Based Approaches to Emotion Identification in Portuguese Tweets (Short Paper), In 9th Symposium on Languages, Applications and Technologies (SLATE 2020), Schloss Dagstuhl--Leibniz-Zentrum für Informatik, vol. 83, series OpenAccess Series in Informatics (OASIcs), pages 12:1--12:8, doi: 10.4230/OASIcs.SLATE.2020.12, September 2020
  • João Rodrigues, Ricardo Ribeiro, Fernando Batista, Towards the Identification of Fake News in Portuguese, In 9th Symposium on Languages, Applications and Technologies (SLATE 2020), Schloss Dagstuhl--Leibniz-Zentrum für Informatik, vol. 83, series OpenAccess Series in Informatics (OASIcs), pages 7:1--7:14, doi: 10.4230/OASIcs.SLATE.2020.7, September 2020
  • Ricardo Rei, Nuno Miguel Guerreiro, Fernando Batista, Automatic Truecasing of Video Subtitles Using BERT: A Multilingual Adaptable Approach, In IPMU2020 - International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Springer International Publishing, vol. 1237, series Communications in Computer and Information Science book series (CCIS), pages 708--721, doi: https://doi.org/10.1007/978-3-030-50146-4_52, June 2020
  • Ricardo Rei, Nuno M Guerreiro, Fernando Batista, Automatic truecasing of video subtitles using BERT: a multilingual adaptable approach, In International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Springer, Cham, vol. 1237, pages 708-721, Lisbon, June 2020
  • Marco Felgueiras, Fernando Batista, Joao P. Carvalho, Creating Classification Models from Textual Descriptions of Companies Using Crunchbase, In IPMU2020 - International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Springer, vol. 1237, series Communications in Computer and Information Science, pages 695-707, Lisbon, Portugal, June 2020
  • Eugénio Ribeiro, Ricardo Ribeiro, Fernando Batista, João Oliveira, Using Topic Information to Improve Non-exact Keyword-Based Search for Mobile Applications, In Information Processing and Management of Uncertainty in Knowledge-Based Systems, 18th International Conference, IPMU 2020, Springer, vol. 1237, series Communications in Computer and Information Science, pages 373-386, June 2020
  • Anabela Barreiro, Ida Rebelo-Arnold, Fernando Batista, Isabel Garcez, Tanara Zingano Kuhn, One Book, Two Language Varieties, In Computational Processing of the Portuguese Language (PROPOR), Springer International Publishing, pages 379-389, doi: https://doi.org/10.1007/978-3-030-41505-1_36, Évora, Portugal, April 2020

2019

  • Vera Cabarrão, Mariana Julião, Rubén Solera Ureña, Helena Moniz, Fernando Batista, Isabel Trancoso, Ana Isabel Mata da Silva, Affective analysis of customer service calls, In 10th International Conference of Experimental Linguistics (ExLing 2019), pages 37-40, Lisbon, Portugal, September 2019
  • Helena Moniz, Rubén Solera Ureña, Vera Cabarrão, Mariana Julião, Fernando Batista, Isabel Trancoso, Affective computing based on acoustic-prosodic cues, In 14th Annual INGRoup Conference (Interdisciplinary Network for Group Research), , Lisbon, Portugal, July 2019

2018

  • Diana Lopes-Teixeira, Fernando Batista, Ricardo Ribeiro, Discovering Trends in Brand Interest through Topic Models, In Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, SciTePress, vol. 1, pages 245-252, doi: https://doi.org/10.5220/0006936202450252, October 2018
  • Anabela Barreiro, Fernando Batista, Contractions: To Align or Not to Align, That Is the Question, In The First Workshop on Linguistic Resources for NLP (LR4NLP) co-located with COLING, pages 122-130, Santa Fe, New Mexico, USA, August 2018
  • Diana Lopes-Teixeira, Fernando Batista, Ricardo Ribeiro, Spatio-Temporal Analysis of Brand Interest using Social Networks, In CISTI'2018 - 13th Iberian Conference on Information Systems and Technologies, IEEE, doi: 10.23919/CISTI.2018.8399241, Cáceres, Spain, June 2018

2017

  • Joao P. Carvalho, Hugo Rosa, Fernando Batista, Detecting relevant tweets in very large tweet collections: the London Riots case study, In FUZZ-IEEE, 2017 IEEE International Conference on Fuzzy Systems, IEEE Xplorer, Naples, Italy, July 2017

2016

  • Eugénio Ribeiro, Fernando Batista, Isabel Trancoso, José David Lopes, Ricardo Ribeiro, David Martins de Matos, Assessing User Expertise in Spoken Dialog System Interactions, In IberSPEECH 2016, Springer International Publish, vol. 10077, series Lecture Notes in Computer Science, pages 245--254, doi: 10.1007/978-3-319-49169-1_24, , Lisbon, November 2016
  • Eugénio Ribeiro, Fernando Batista, Isabel Trancoso, Ricardo Ribeiro, David Martins de Matos, Automatic Detection of Hyperarticulated Speech, In IberSPEECH 2016, Springer International Publish, vol. 10077, series Lecture Notes in Computer Science, pages 182--191, doi: http://dx.doi.org/10.1007/978-3-319-49169-1_18, , Lisbon, November 2016
  • Vera Cabarrão, Isabel Trancoso, Ana Isabel Mata da Silva, Helena Moniz, Fernando Batista, Global analysis of entrainment in dialogues, In IberSPEECH 2016, Springer, vol. 10077, series Lecture Notes in Computer Science, pages 215--223, doi: 10.1007/978-3-319-49169-1_21, , Lisbon, November 2016
  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Improving Twitter gender classification using multiple classifiers, In 8th European Symposium on Computational Intelligence and Mathematics (ESCIM 2016), pages 121 - 127, Sofia, Bulgaria, October 2016
  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Creating Extended Gender Labelled Datasets of Twitter Users, In IPMU2016 - 16th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based systems, Springer, vol. 611, series Communications in Computer and Information Science, pages 690-702, TU Eindhoven, The Netherlands, June 2016
  • Fernando Batista, Pedro dos Santos Lopes Curto, Isabel Trancoso, Alberto Abad, Jaime Rodrigues Ferreira, Eugénio Ribeiro, Helena Moniz, David Martins de Matos, Ricardo Ribeiro, SPA: Web-based Platform for easy Access to Speech Processing Modules, In LREC, European Language Resources Association (ELRA), pages 3886--3892, doi: ISBN: 978-2-9517408-9-1, Portorož, Slovenia, May 2016

2015

  • Hugo Rosa, Joao P. Carvalho, Fernando Batista, Detecting User Influence in Twitter: PageRank vs Katz, a case study, In ESCIM - 7th European Symposium on Computational Intelligence and Mathematics, pages 212-217, Cádiz, Spain, October 2015
  • Vera Cabarrão, Helena Moniz, Jaime Rodrigues Ferreira, Fernando Batista, Isabel Trancoso, Ana Isabel Mata da Silva, Sérgio dos Santos Lopes Curto, Prosodic Classification of Discourse Markers, In International Congress of Phonetic Sciences (ICPhS 2015), Glasgow, Scotland, UK, August 2015
  • Fernando Batista, Joao P. Carvalho, Text based classification of companies in CrunchBase, In FUZZ-IEEE2015 IEEE International Conference on Fuzzy Systems, IEEE, pages , Istambul, Turkey, August 2015
  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Twitter gender classification using user unstructured information, In FUZZ-IEEE, 2015 IEEE International Conference on Fuzzy Systems, IEEE, Istambul, Turkey, August 2015
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Using Geolocated Tweets for Characterization of Portuguese Administrative Regions, In 18th AGILE International Conference on Geographic Information Science, Lisboa, Portugal, June 2015
  • Marco Paulo Fernandes Vicente, Joao P. Carvalho, Fernando Batista, Using Unstructured Profile Information for Gender Classification of Portuguese and English Twitter Users, In SLATE'15, IV Symposium on Languages, Applications and Technologies, Springer, pages 143-148, Madrid, Spain, June 2015

2014

  • Hugo Rosa, Fernando Batista, Joao P. Carvalho, Twitter Topic Fuzzy Fingerprints, In WCCI2014, FUZZ-IEEE, 2014 IEEE World Congress on Computational Intelligence, International Conference on Fuzzy Systems, IEEE, series IEEE Xplorer, pages 776-783, Beijing, China, July 2014
  • Hugo Rosa, Joao P. Carvalho, Fernando Batista, Detecting a Tweet’s Topic within a Large Number of Portuguese Twitter Trends, In SLATE'14 - 3rd Symposium on Languages, Applications and Technologies, Schloss Dagstuhl, vol. 4659, series OpenAccess Series in Informatics (OASIcs), pages 185-199, doi: http://dx.doi.org/10.4230/OASIcs.SLATE.2014.185, June 2014
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Helena Moniz, Expanding a Database of Portuguese Tweets, In SLATE'14 3rd Symposium on Languages, Applications and Technologies, Schloss Dagstuhl, vol. 4569, series OpenAccess Series in Informatics (OASIcs), pages 275-282, Bragança, Portugal, June 2014
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Helena Moniz, Portuguese geolocated tweets: an overview, In ISDOC2014 - Proceedings of the International Conference on Information Systems and Design of Communication, ACM, pages 178-179, Lisbon, Portugal, May 2014
  • Vera Cabarrão, Helena Moniz, Fernando Batista, Ricardo Ribeiro, Nuno J. Mamede, Hugo Meinedo, Isabel Trancoso, Ana Isabel Mata, David Martins de Matos, Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach, In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), European Language Resources Association (ELRA), pages 3908-3913, Reykjavik, Iceland, May 2014

2013

  • Anabela Barreiro, Johanna Monti, Brigitte Orliac, Fernando Batista, When Multiwords Go Bad in Machine Translation, In Workshop on Multi-word Units in Machine Translation and Translation Technology, http://www.mt-archive.info/10/MTS-2013-W4-Barreiro, September 2013
  • Joao P. Carvalho, Vasco Pedro, Fernando Batista, Towards Intelligent Mining of Public Social Networks’ Influence in Society, In IFSA-NAFIPS2013 - 2013 IFSA World Congress and NAFIPS Annual Meeting, IEEE Xplore, pages 478-483, Edmonton, Canada, June 2013

2012

2011

  • Helena Moniz, Fernando Batista, Isabel Trancoso, Ana Isabel Mata da Silva, Analysis of interrogatives in different domains, In Towards Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues. Third COST 2102 International Training School, Springer Berlin / Heidelberg, series Book series: Lecture Notes in Computer Science, pages 136-148, Caserta, Italy, January 2011

2010

2009

2008

  • Ana Cristina Mendes, Luísa Coheur, Nuno J. Mamede, Ricardo Ribeiro, David Martins de Matos, Fernando Batista, QA@L2F, first steps at QA@CLEF, Springer-Verlag, vol. 5152, series Lecture Notes in Computer Science, September 2008

2007

  • Ana Cristina Mendes, Luísa Coheur, Nuno J. Mamede, Luís Carlos da Silva Romão, João Miguel Sanches Loureiro, Ricardo Ribeiro, Fernando Batista, David Martins de Matos, QA@L2F@QA@CLEF, In Working Notes for the CLEF 2007 Workshop, September 2007
  • Fernando Batista, Diamantino António Caseiro, Nuno J. Mamede, Isabel Trancoso, Recovering Punctuation Marks for Automatic Speech Recognition, In Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), ISCA, vol. 1, series Interspeech, pages 2153-2156, Antwerp, Belgium, September 2007

2006

  • Ricardo Ribeiro, Fernando Batista, Joana Paulo Pardal, Nuno J. Mamede, H. Sofia Pinto, Cooking an Ontology, In The Twelfth International Conference on Artificial Intelligence: Methodology, Systems, Applications, Springer Berlin / Heidelberg, vol. 4183, series Lecture Notes in Computer Science, pages 213-221, Varna, Bulgaria, September 2006
  • Jorge Baptista, Fernando Batista, Nuno J. Mamede, Building a Dictionary of Anthroponyms, In PROPOR'2006 - Computational Processing of the Portuguese Language, Springer Verlag, Berlin / Heidelberg, vol. 3960, series Lecture Notes in Computer Science, pages 21-30, Itatiaia, Brazil, May 2006

2004

  • Luísa Coheur, Fernando Batista, Nuno J. Mamede, Towards a flexible syntax/semantics interface, In Proceedings of the Herramientas y Recursos Linguísticos para el Español y el Portugués workshop, a satelite of the Ninth, pages 265-272, Puebla, Mexico, November 2004

2003

  • Fernando Batista, Nuno J. Mamede, Flexible Module for Shallow Parsing, Using Preferences, In TASHA'2003 - Workshop on Tagging and Shallow Processing of Portuguese, Faculdade de Ciências da Universidade de Lisboa, series Technical Reports, pages 5-6, Lisboa, Portugal, October 2003
  • Luísa Coheur, Fernando Batista, Joana Paulo Pardal, JaVaLI! undestanding real questions, In Proc. EUROLAN'2003 - Student Workshop on Applied Natural Processing, Hamburg, Germany, July 2003

2002

2000

  • Luzia Helena Wittmann, Ricardo Ribeiro, Tânia Pego, Fernando Batista, Some Language Resources and Tools for Computational Processing of Portuguese at INESC, In LREC2000 – Second International Conference on Language Resources and Evaluation, vol. 1, pages 347, Athens, Greece, June 2000

National Journals

2019

2018

  • Vera Cabarrão, Helena Moniz, Fernando Batista, Isabel Trancoso, Ana Isabel Mata da Silva, Adaptação acústico-prosódica entre falantes, Revista da Associação Portuguesa de Linguística, vol. 4, July 2018

2016

  • Vera Cabarrão, Helena Moniz, Jaime Rodrigues Ferreira, Fernando Batista, Isabel Trancoso, Ana Isabel Mata da Silva, Sérgio dos Santos Lopes Curto, Classificação prosódica de marcadores discursivos, Revista da Associação Portuguesa de Linguística, n. 2, pages 69 -- 95, July 2016

National Conferences

2018

2017

2016

  • Angela Jusupova, Fernando Batista, Ricardo Ribeiro, Characterizing the Personality of Twitter Users based on their Timeline Information, In 16 Conferência da Associacao Portuguesa de Sistemas de Informação, pages 292 - 299, Porto, Portugal, October 2016
  • Fernando Rebelo, Fernando Batista, Ricardo Ribeiro, Cascatas de Classificação de Sentimento em Microblogues, In INFORUM 2016 - Atas do 8.o Simpósio de Informática, pages 203 -- 214, Lisboa, Portugal, September 2016
  • Luís Dias, Tomás Brandão, Fernando Batista, Detecting violence on movie excerpts: A machine-learning approach based on audio and video features, In INForum 2016, Gestão de Dados e Conhecimento, Lisboa, Portugal, September 2016

2015

  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Sistema Inteligente de Recolha, Armazenamento e Visualização de Informação proveniente do Twitter, In CAPSI2015 - 15ª Conferência da Associação Portuguesa de Sistemas de Informação, Lisboa, Portugal, October 2015
  • Mariana Juliao, Jorge Silva, Ana Aguiar, Helena Moniz, Jaime Rodrigues Ferreira, Fernando Batista, Speech Features for Discriminating Stress, In 10th Conference on Telecommunications, Conftele 2015, IT, https://www.it.pt/Publications/PaperConference/224, September 2015
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Arquitetura e Desenvolvimento de um Repositório de Tweets em Português Europeu, In 5as Jornadas de Informática da Universidade de Évora - JIUE 2015, Springer, Évora, Portugal, February 2015

2014

2013

2012

2005

  • Jorge Baptista, Fernando Batista, Nuno J. Mamede, Cristina Mota, Npro: um novo recurso para o processamento computacional do Português, In XXI Encontro APL, Porto, Portugal, December 2005

Technical Reports

2014

  • Anabela Barreiro, Luísa Coheur, Tiago Luís, Angela Costa, Fernando Batista, João Graça, Isabel Trancoso, Multiword and Semantico-Syntactic Unit Alignments, Tech. Rep. 23 / 2014 INESC-ID Lisboa, December 2014

2008

  • David Martins de Matos, Ricardo Ribeiro, Sérgio Paulo, Fernando Batista, Luísa Coheur, Joana Paulo Pardal, Natural Language Engineering on a Computational Grid (NLE-GRID) T2 - Encapsulation of Reusable Components, Tech. Rep. 31 / 2008 INESC-ID Lisboa, January 2008

2006

Doctoral Theses

2011

Masters Theses

2003

Other Publications

2017

  • Eugénio Ribeiro, Fernando Batista, Isabel Trancoso, José Lopes, Ricardo Ribeiro, David Martins de Matos, Assessing User Expertise in Spoken Dialog System Interactions, https://arxiv.org/abs/1701.05011, January 2017

2014

  • Vera Cabarrão, Helena Moniz, Fernando Batista, Isabel Trancoso, Ana Isabel Mata, Sérgio dos Santos Lopes Curto, Discourse markers in spontaneous speech in European Portuguese: a first approach, Università dell'Insubria, October 2014

2012

PhD theses

Ongoing

  • Complexity of Language Variation and Sentence Structures, Eduardo M. Freitas. ISCTE (2020-). Fernando Batista, advisor. Ricardo Ribeiro, co-advisor.
  • Analysis of service quality in peer-to-peer accommodation by modeling the hidden aspects and sentiments embedded in online texts, Mariana Sofia Barreira Cavique Santos. Iscte (2019-). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.

MSc theses

Ongoing

  • Anonimização Inteligente de Dados, Miguel Pereira. ISCTE (2022-). Fernando Batista, advisor. Ricardo Ribeiro, co-advisor.
  • Sistema de recolha e anotação semiautomática de discurso de ódio nas redes sociais, . Instituto Superior Técnico, Universidade de Lisboa (2021-). Paula Carvalho, advisor. Fernando Batista, co-advisor.

Finished

  • Analysis of the Tourists behavior in Lisbon using Data from a Mobile Operator, Bruno Alexandre Mateus Francisco. Iscte - Instituto Universitário de Lisboa (2021-2022). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.
  • Humor and Offense Speech Classification and scoring using Natural Language Processing, Marcelo Custódio Mathias. Iscte - Instituto Universitário de Lisboa (2021-2022). Fernando Batista, advisor. Ricardo Ribeiro, co-advisor.
  • Identificação de Fatores de Risco na Gravidade dos Acidentes Rodoviários, Inês Mendes dos Santos Rodrigues de Carvalho. Iscte - Instituto Universitário de Lisboa (2021-2022). Fernando Batista, advisor. Luís Nunes, co-advisor.
  • Twitter's content trends depending on users profile and time of the year, Érica Rosa. Iscte - Instituto Universitário de Lisboa (2021-2022). Fernando Batista, advisor. Ricardo Ribeiro, co-advisor.
  • Influencers, are they responsible for Bitcoin's volatility? Transfer Entropy and Granger causality in prol of an answer, Jana Luisa Teixeira Lage. Iscte - Instituto Universitário de Lisboa (2021-2022). Diana Mendes, advisor. Fernando Batista, co-advisor.
  • Semi-Automatic Selection and Annotation of Hate Speech from Social Media, Raquel Bento Santos. Instituto Superior Técnico (2021-2022). Fernando Batista, advisor. Paula Carvalho, co-advisor.
  • Deteção Automática de Sinais de Depressão em Redes Sociais, Daniela Cristina da Silva Carlos. Iscte - Instituto Universitário de Lisboa (2020-2021). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.
  • Previsão automática de fraude em transações financeiras, Appio Indiano do Brazil Americano Neto. ISCTE-IUL (2020-2021). Fernando Batista, advisor. Sérgio Miguel Carneiro Moro, co-advisor.
  • Sentiment Analysis to Predict the Portuguese Economic Sentiment Based on Economic News, Cátia Daniela Lopes Tavares. ISCTE-IUL (2020-2021). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.
  • Student Data Prediction, Nuno Miguel Soares Fialho de Carvalho. ISCTE-IUL (2020-2021). Elsa Alexandra Cabral da Rocha Cardoso, advisor. Fernando Batista, co-advisor.
  • Automatic Aggression Detection in Social Media, Tiago Filipe Pardal de Almeida. Iscte - Instituto Universitário de Lisboa (2020-2021). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.
  • Classificação de Emoções em Redes Sociais, Soraia Alexandra Cardoso Filipe. Iscte - Instituto Universitário de Lisboa (2018-2021). Fernando Batista, advisor. Ricardo Ribeiro, co-advisor.
  • A Book-oriented Chatbot, Nuno Alexandre Mestre Barradas. Iscte - Instituto Universitário de Lisboa (2020). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.
  • Sistema de Recomendação de Videojogos, Rosária Patrícia Firmino Bunga. Iscte - Instituto Universitário de Lisboa (2020). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.
  • Análise de criptomoeda baseada em conteúdo gerado por utilizadores de redes sociais, Miguel de Guerra Narciso. Iscte - Instituto Universitário de Lisboa (2019-2020). Luís Nunes, advisor. Fernando Batista, co-advisor.
  • Avaliação do impacto emocional do vídeo usando técnicas de aprendizagem automática, André Filipe Lopes Maia. Iscte - Instituto Universitário de Lisboa (2019-2020). Tomás Gomes Silva Serpa Brandão, advisor. Fernando Batista, co-advisor.
  • Classificação de Fake News na Língua Portuguesa Europeia, João Filipe Carriço Rodrigues. Iscte - Instituto Universitário de Lisboa (2019-2020). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.
  • Detecção de Sentimentos Psíquicos em Utilizadores de Redes Sociais, Patrícia de Sousa dos Santos. Iscte - Instituto Universitário de Lisboa (2019-2020). Ricardo Ribeiro, advisor. Fernando Batista, co-advisor.
  • Multilabel classification of unstructured data using Crunchbase, Marco Filipe Madeira Felgueiras. Iscte - Instituto Universitário de Lisboa (2018-2020). Fernando Batista, advisor.
  • Deteção de palavras emergentes em tweets portugueses e análise do seu percurso na redes sociais, Afonso do Carmo Marques Mendes Pinto. Iscte - Instituto Universitário de Lisboa (2019-2020). Fernando Batista, advisor.
  • eWOM para instituições públicas: aplicação ao caso do Exército Português, Joana de Azinhaes Horta. Iscte - Instituto Universitário de Lisboa (2019-2020). Fernando Batista, advisor. João Ferreira, co-advisor.
  • Characterizing the Personality of Twitter Users Based on their Timeline Information, Anzhela Zhusupova. ISCTE-IUL (2015-2016). Fernando Batista, advisor.
  • Análise de Sentimento em Microblogues com base em Cascatas de Classificação, Fernando Manuel Dias Rebelo. ISCTE-IUL (2015-2016). Fernando Batista, advisor. Ricardo Ribeiro, co-advisor.
  • Detecting Violent Excerpts in Movies using Audio and Video Features, Luís Jorge Gregório Dias. ISCTE-IUL (2015-2016). Tomás Brandão, advisor. Fernando Batista, co-advisor.
  • Sistema Inteligente de Recolha e Armazenamento de Informação provenienter do Twitter, Gaspar Manuel Rocha Brogueira. ISCTE-IUL (2014-2015). Fernando Batista, advisor. Joao P. Carvalho, co-advisor.
  • Topic Detection within Public Social Networks, Hugo Rosa. Instituto Superior Técnico, Universidade de Lisboa (2013-2014). Joao P. Carvalho, advisor. Fernando Batista, co-advisor.

Research Interests

  • Machine learning
  • Natural Language Processing
  • Text and Speech processing
  • Shallow Parsing
  • Also: Operating Systems and Computer Architectures

Projects

Past Projects

  • SpeDial - Spoken Dialogue Analytics
  • COPAS - Contrast and Parallelism in Speech (03-2012 a 07-2015)
  • MISNIS - Intelligent Mining of Public Social Networks’ Influence in Society (04-2013 a 07-2015)
  • DIRHA - Distant-speech Interaction for Robust Home Applications (01-2012 a 12-2014)
  • METANET4U - European project aiming at supporting language technology for European languages and multilingualism (02-2011 a 02-2013)
  • PT-STAR - Speech Translation Advanced Research to And From Portuguese (05-2009 a 07-2012)
  • POSTPORT - Porting Speech Technologies to other Varieties of Portuguese (01-2008 a 06-2011)
  • Automatic Punctuation and Capitalization for automatic speech transcripts
  • Implementation of a shallow parser

Other Activities