Jorge Baptista

From HLT@INESC-ID

Jorge Baptista

Jorge Baptista got his "Licenciatura" and MA in Linguistics from the Faculty of Letters of the University of Lisbon, in 1990 and 1995, respectively. He has a PhD in Linguistics (syntax) from University of Algarve (2001). He is an Associate Professor at University of Algarve and an invited researcher at L2F, INESC ID Lisbon.

Jorge Baptista’s main research interests are in computational and theoretical linguistics (syntax, grammar, large coverage lexica, corpus linguistics, machine translation).

Publications

Books

2020

  • Jorge Baptista, Nuno J. Mamede, Dicionário Gramatical de Verbos do Português, Universidade do Algarve Editora, October 2020

Book Chapters

2021

2018

  • Cristina Mota, Jorge Baptista, Anabela Barreiro, The Lexicon-Grammar of Predicate Nouns with 'ser de' in Port4NooJ., chapter Formalizing NaturalLanguages with NooJ 2018 and Its Natural Language Processing Applications, Springer, , December 2018

2016

  • Jorge Baptista, Ilia Markov, Perspectives Harrissiennes, chapter Morphosyntactic processes involving body-part nouns in Portuguese, in Claire Martinot, Christiane Marque-Pucheu, and Sonia Gerolimich (Eds.), pp. 255-, CRL, November 2016

2010

  • Jorge Baptista, Les Tables. La grammaire du français par le menu (Mélanges en hommage à Christian Leclère), chapter Verba dicendi: a structure looking for verbs. In: Nakamura, Takuya; Laporte, Éric; Dister, Anne; Fairon, Cédrick (eds.). pp.11-20, CENTAL (Cahiers du CENTAL 6) /, June 2010

2008

International Journals

2022

  • Jorge Baptista, Sónia Reis, Nuno J. Mamede, Nomes predicativos e construções neutras, Revista da Associação Portuguesa de Linguística, APL, vol. 9, n. 10, pages 13-30, doi: https://doi.org/10.26334/2183-9077/rapln9ano2022a2, Lisboa, October 2022
  • Amanda Maraschin Bruscato, Jorge Baptista, The synchronous and asynchronous learning of anaphora: A corpus-based analysis with learners of English and Spanish, vol. 11, n. 1, pages 1-28, January 2022

2021

2020

  • Matilde do Carmo Lages Gonçalves, Luísa Coheur, Jorge Baptista, Ana Mineiro, Avaliação de recursos computacionais para o português, Linguamática, Universidade do Minho e Universidade de Vigo, vol. 12, n. 2, pages 51-68, doi: https://doi.org/10.21814/lm.12.2.331, December 2020
  • Sónia Reis, Jorge Baptista, Determinação de um mínimo paremiológico do português europeu (Establishing the paremiological minimum of European Portuguese), Acta Scientiarum: Language and Culture, vol. 42, pages e52114, doi: 10.4025/actascilangcult.v42i2.52114, July 2020

2019

2018

  • Anabela Barreiro, Rebelo-Arnold, Ida, Jorge Baptista, Cristina Mota, Parafraseamento Automático de Registo Informal em Registo Formal na Língua Portuguesa, Linguamática, Universidade do Minho and Universidade de Vigo, vol. 10, n. 2, pages 53-61, December 2018
  • Anabela Barreiro, Jorge Baptista, Renata Vieira, Paulo Quaresma, Prefácio - POP - Por Outras Palavras, Linguamática, Universidade do Minho and Universidade de Vigo, vol. 10, n. 2, September 2018

2017

  • Ilia Markov, Jorge Baptista, Obdulia Pichardo-Lagunas, Authorship Attribution in Portuguese Using Character N-grams, Acta Polytechnica Hungarica, vol. 14, n. 3, doi: DOI10.12700/APH.14.3.2017.3.4, November 2017

2015

  • Amanda Pontes Rassi, Jorge Baptista, Oto Araújo Vale, Oto Vale. Um corpus anotado de construções com verbo-suporte em Português, Gragoatá , Instituto de Letras - Universidade Federal Fluminense, vol. 38, n. 1, pages 207-230, Rio de Janeiro, June 2015

2013

  • Thomas Pellegrini, Rui Correia, Isabel Trancoso, Jorge Baptista, Nuno J. Mamede, Maxine Eskenazi, ASR-based exercises for listening comprehension practice in European Portuguese, Computer Speech and Language, doi: http://dx.doi.org/10.1016/j.csl.2013.02.004, February 2013

2012

2010

  • Caroline Hagège, Jorge Baptista, Nuno J. Mamede, Caracterização e Processamento de Expressões Temporais em Português, Linguamática, vol. 2, n. 1, pages 63-76, April 2010

2005

  • Jorge Baptista, Graça Fernandes, Anabela Correia, Léxico-gramática das frases fixas do Portugués europeu. Breve presentatión, Cadernos de Fraseoloxía Galega, Xunta de Galicia, vol. 7, pages 41-53, Galicia, Spain, December 2005

Edited Proceedings

2018

  • Anabela Barreiro, Jorge Baptista, Renata Vieira, Paulo Quaresma, Linguistic Tools and Resources for Paraphrasing in Portuguese, PROPOR 2018, Canela, Brazil, September 2018

2014

  • Jorge Baptista, Nuno J. Mamede, Sara Candeias, Ivandré Paraboni, Thiago Pardo, Maria das Graça Nunes, Computational Processing of the Portuguese Language. 11th International Conference PROPOR’2014, Springer-Verlag, Lecture Notes in Computer Science / Lecture Notes in Artific, 8775, São Carlos – SP, Brazil, July 2014

International Conferences

2022

  • Izabela Müller, Nuno J. Mamede, Jorge Baptista, Bootstrapping a Lexicon of Multiword Adverbs for Brazilian Portuguese, In Conference: EUROPHRAS International Conference on Computational and Corpus-Based Phraseology (EUROPHRAS 2022), Springer, series LNCS-LNAI, pages 160-174, doi: https://doi.org/10.1007/978-3-031-15925-1_12, September 2022
  • Maria Inês Bico, Jorge Baptista, Fernando Batista, Esperança Cardeira, Early Experiments on Automatic Annotation of Portuguese Medieval Texts, In TPDL 2022, Springer International Publishing, vol. 13541, series Lecture Notes in Computer Science, pages 442--449, doi: https://doi.org/10.1007/978-3-031-16802-4_44, Italy, September 2022
  • Sónia Reis, Jorge Baptista, Automatic Classification of Portuguese Proverbs, In Conference: 11th Symposium on Languages, Applications and Technologies (SLATE 2022), Schloss Dagstuhl -- Leibniz-Zentrum fur Informatik, pages 2:1--2:8, doi: 10.4230/OASIcs.SLATE.2022.2, July 2022
  • Jorge Baptista, Nuno J. Mamede, Sónia Reis, Support Verb Constructions across the Ocean Sea, In 18th Workshop on Multiword Expressions (MWE 2022) n@LREC2022, European Language Resources Association (ELRA),, pages 26-36, Marseille, June 2022

2021

  • Amanda Maraschin Bruscato, Jorge Baptista, BRANEN and BRANES Corpora, In The European Conference on Language Learning 2021, doi: 10.22492/issn.2188-112X.2021.3, September 2021

2020

  • Paula Carvalho, Bruno Martins, Hugo Rosa, Sílvio Moreira, Jorge Baptista, Mário J. Silva, Situational Irony in Farcical News Headlines, In Computational Processing of the Portuguese Language. PROPOR 2020., Springer, Cham, series LNCS, pages 65-75, doi: https://doi.org/10.1007/978-3-030-41505-1_7, March 2020

2019

2018

  • Cristina Mota, Jorge Baptista, Anabela Barreiro, The Lexicon-Grammar of Predicate Nouns with 'ser de' in Port4NooJ, In The 12th International Conference on Automatic Processing of Natural-Language Electronic Texts with NooJ, Springer, Cham, series Formalizing Natural Languages with NooJ 2018 and Its Natural Language Processing Applications, pages 124-137, doi: https://doi.org/10.1007/978-3-030-10868-7, Palermo, Italy, June 2018

2017

  • Helena Gómez-Adorno, Ilia Markov, Jorge Baptista, Grigori Sidorov, David Pinto, Discriminating between Similar Languages Using a Combination of Typed and Untyped Character N-grams and Words, In 4th Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2017), ACL, , April 2017

2016

  • Francisco Dias, Jorge Baptista, Nuno J. Mamede, Automated Anonymization of Text Documents, In 2016 IEEE Congress on Evolutionary Computation (CEC), IEEE, pages 1287-1294, doi: 10.1109/CEC.2016.7743936, Vancouver, BC, Canada, July 2016
  • Jorge Baptista, Sandra Maria de Babo Lourenço, Nuno J. Mamede, Automatic Generation of Exercises on Passive Transformation in Portuguese, In 2016 IEEE Congress on Evolutionary Computation (CEC), IEEE, pages 4962-4972, doi: 10.1109/CEC.2016.7744427, Vancouver, BC, Canada, July 2016
  • José Miguel Pinheiro Correia, Jorge Baptista, Nuno J. Mamede, Syntax Deep Explorer, In 12th International Conference PROPOR’2016, Springer, vol. 9727, series Lecture Notes in Computer Science / Lecture Notes in Artific, pages 189-201, doi: 10.1007/978-3-319-41552-9_19, Tomar, Lisboa, July 2016
  • José Miguel Pinheiro Correia, Jorge Baptista, Nuno J. Mamede, Syntax Deep Explorer (demo), In 12th International Conference PROPOR’2016, Proceedings of the Demo Session, pages 27-29, Tomar, Portugal, July 2016
  • Rui Correia, Nuno J. Mamede, Maxine Eskenazi, Jorge Baptista, metaTED: a Corpus of Metadiscourse for Spoken Language, In LREC 2016 - Proceedings of the 10th International Conference on Language Resources and Evaluation, European Language Resources Association (ELRA), pages 23-28, Portorož, Slovenia, May 2016
  • Pedro Curto, Nuno J. Mamede, Jorge Baptista, Assisting European Portuguese Teaching: Linguistic features extraction and automatic readability classifier, Springer, vol. 583, series Communications in Computer and Information Science, pages 81-96, doi: 10.1007/978-3-319-29585-5_5, February 2016

2015

  • Amanda Pontes Rassi, Nuno J. Mamede, Jorge Baptista, Oto Araújo Vale, Integrating support verb constructions into a parser, In STIL 2015, X Brazilian Symposium in Information and Human Language Technology, Sociedade Brasileira de Computação, pages 57--62, , Natal, Rio Grande do Norte, Brasil, November 2015
  • Tatyana Fukova, Svitlana Chornobay, Jorge Baptista, Lexicon-Grammar of Russian verbal Idioms, In Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives, Gloria Corpas Pastor. Proceedings of Conference of the European Society of Phraseology (EuroPhras’2015). Málaga (Spain), June 28-July 2, 2015, European Society of Phraseology (EuroPhras’2015)/Editions Tr, pages 139-153, , June 2015
  • Sónia Reis, Jorge Baptista, Portuguese Proverbs: Types and Variants, In Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives. Gloria Corpas Pastor (ed.). Proceedings of Conference, Málaga (Spain), June 28-July 2, 2015, European Society of Phraseology (EuroPhras’2015)/Editions Tr, pages 208--217, Geneva, June 2015

2014

  • Amanda P. Rassi, Jorge Baptista, Oto Vale, Automatic Detection of Proverbs and their Variants, In 11th International Conference PROPOR’2014, Springer International Publishing, vol. 8775, series Lecture Notes in Computer Science / Lecture Notes in Artific, pages 141-152, doi: 10.1007/978-3-319-09761-9_14, São Carlos – SP, Brazil, October 2014
  • Ilia Markov, Nuno J. Mamede, Jorge Baptista, Body-Part Nouns and Whole-Part Relations in Portuguese, In Computational Processing of the Portuguese Language. 11th International Conference PROPOR’2014, Springer International Publishing, vol. 8775, series Lecture Notes in Computer Science / Lecture Notes in Artific, pages 125-136, doi: 10.1007/978-3-319-09761-9_13, São Carlos SP, Brazil, October 2014
  • Jorge Baptista, Nuno J. Mamede, Ilia Markov, Integrating Verbal Idioms into an NLP System, In Computational Processing of the Portuguese Language. 11th International Conference PROPOR’2014, Springer International Publishing, vol. 8775, series Lecture Notes in Computer Science / Lecture Notes in Artific, pages 250-255, doi: 10.1007/978-3-319-09761-9_28, São Carlos SP, Brazil, October 2014
  • Rui Correia, Nuno J. Mamede, Jorge Baptista, Maxine Eskenazi, Toward Automatic Classification of Metadiscourse, In 9th International Conference on NLP, PolTAL 2014, Springer International Publishing, vol. 8686, series Lecture Notes in Computer Science / Lecture Notes in Artific, pages 262-269, doi: 10.1007/978-3-319-10888-9_27, Warsaw, Poland, September 2014
  • Amanda Rassi, Jorge Baptista, Oto Vale, Proverb Variation: Experiments on Automatic Detection in Brazilian Portuguese Texts, In Computational Processing of the Portuguese Language. 11th International Conference PROPOR’2014, Springer, vol. 8775, series Lecture Notes in Computer Science / Lecture Notes in Artific, pages 141-152, Carlos – SP, Brazil, July 2014
  • Amanda Rassi, Oto Vale, Jorge Baptista, Automatic Detection of Proverbs and their Variants, In Symposium on Languages, Applications and Technologies (SLATE‘14), Dagstuhl Publishing, series Schloss Dagstuhl - Leibniz-Zentrum fur Informatik, pages 235-250, Bragança, Portugal, June 2014
  • Rui Correia, Nuno J. Mamede, Jorge Baptista, Maxine Eskenazi, Using the Crowd to Annotate Metadiscursive Acts, In 10th Joint ACL – ISO Workshop on Interoperable Semantic Annotation (LREC Workshop), pages 102-108, Reykjavik, Iceland, May 2014

2013

  • Rui Pedro Talhadas dos Santos, Nuno J. Mamede, Jorge Baptista, Semantic Roles for Portuguese Verbs, In 32nd International Conference on Lexis and Grammar, pages 127-132, UALG/FCHS: Faro, Portugal, September 2013

2012

  • Lucas Nunes Vieira, Cláudio Diniz, Nuno J. Mamede, Jorge Baptista, A Lexicon of Verb and -mente Adverb Collocations in Portuguese, In Proceedings of the 31st International Conference on Lexis and Grammar, Università degli Studi di Salerno /University of South Bohem, pages 155-161, Czech Republic, September 2012
  • Rui Correia, Jorge Baptista, Maxine Eskenazi, Nuno J. Mamede, Maxine Eskenazi, Automatic Generation of Cloze Question Stems, In International Conference on Computational Processing of Portuguese (Propor 2012), Springer-Verlag, vol. 7243, series Lecture Notes in Artificial Intelligence, pages 168–178, Coimbra, Portugal, April 2012
  • André Freire Silva, Cristiano José Lopes Marques, Jorge Baptista, Alfredo Ferreira, Nuno J. Mamede, REAP.PT SYSTEM: Serious Games for Learning Portuguese, In International Conference on Computational Processing of Portuguese (Propor 2012), Springer-Verlag, vol. 7243, series Lecture Notes in Artificial Intelligence, pages 248–259, Coimbra, Portugal, April 2012

2011

  • André Freire Silva, Nuno J. Mamede, Alfredo Ferreira, Jorge Baptista, João Fernandes, Towards a Serious Game for Portuguese Learning, In 2nd International Conference on Serious Games Development and Applications (SGDA 2011), Springer-Verlag, vol. 6944, series Lecture Notes in Computer Science, pages 83 - 94, Lisbon, Portugall, October 2011

2010

  • Rui Correia, Jorge Baptista, Nuno J. Mamede, Isabel Trancoso, Maxine Eskenazi, Automatic Generation of Cloze Question Distractors, In Second Language Studies: Acquisition, Learning, Education and Technology, SLaTE: the ISCA SIG on Speech and Language Technology in Edu, Waseda University, Tokyo, Japan, September 2010
  • Jorge Baptista, Nuno J. Mamede, Fernando Gomes, Auxiliary verbs and verbal chains in European Portuguese, In 9th International Conference on Computational Processing of the Portuguese Language (Propor 2010), Springer Berlin / Heidelberg, vol. 6001, series LNAI, pages 110-119, doi: 10.1007/978-3-642-12320-7_14, Porto Alegre, Brazil, April 2010
  • Jorge Baptista, Neuza Costa, Joaquim Guerra, Marcos Zampieri, Maria de Lurdes Cabral, Nuno J. Mamede, P-AWL: Academic Word List for Portuguese, In 9th International Conference on Computational Processing of the Portuguese Language (Propor 2010), Springer Berlin / Heidelberg, vol. 6001, series LNAI, pages 120-123, doi: 10.1007/978-3-642-12320-7_15, Porto Alegre, Brazil, April 2010

2009

  • Luís Marujo, José Lopes, Nuno J. Mamede, Isabel Trancoso, Juan Pino, Maxine Eskenazi, Jorge Baptista, Céu Viana, Porting REAP to European Portuguese, In ISCA International Workshop on Speech and Language Technology in Education (SLaTE 2009), ISCA, , Wroxall Abbey Estate, Warwickshire, England, September 2009

2006

  • Jorge Baptista, Fernando Batista, Nuno J. Mamede, Building a Dictionary of Anthroponyms, In PROPOR'2006 - Computational Processing of the Portuguese Language, Springer Verlag, Berlin / Heidelberg, vol. 3960, series Lecture Notes in Computer Science, pages 21-30, Itatiaia, Brazil, May 2006

2004

  • Jorge Baptista, Anabela Correia, Graça Fernandes, Frozen Sentences of Portuguese: Formal Descriptions for NLP, In Workshop on Multiword Expressions: Integrating Processing, International Conference of the European Chapter of the sociation for Computational Linguistics, pages 72-79, Barcelona, Spain, July 2004

National Conferences

2017

  • Sónia Reis, Anna Maria Pompili, Alberto Abad, Jorge Baptista, O provérbio como estímulo num terapeuta virtual, In VI Simpósio Mundial de Estudos sobre o Português (SIMELP), Santarém, Portugal, December 2017

2016

2014

2013

2011

  • Ricardo Jorge Rosa Portela, Nuno J. Mamede, Jorge Baptista, Multiword Identification, In Terceiro Simpósio de Informáctica (INFORUM 2011), Dep. de Eng. Informática da Universidade de Coimbra, pages 663-674, Coimbra, Portugal, October 2011

2010

2005

  • Jorge Baptista, Fernando Batista, Nuno J. Mamede, Cristina Mota, Npro: um novo recurso para o processamento computacional do Português, In XXI Encontro APL, Porto, Portugal, December 2005

Technical Reports

2022

  • Pedro Santos, João Miguel de Sousa de Assis Dias, Jorge Baptista, Ana Teresa Jerónimo Antunes, Nuno Cordeiro, Bruno Martins, State of the Art in Artificial Intelligence applied to the Legal Domain, / 2022 INESC-ID Lisboa, December 2022

PhD theses

Ongoing

  • Compound Adverbs in Brazilian Portuguese., Izabela Müller. Universidade do Algarde (2022-). Jorge Baptista, advisor.
  • Lexical and Syntactical Patterns of Community College Native English Speakers in Developmental Education,, Miguel Da Corte. Universidade do Algarde (2022-). Jorge Baptista, advisor.

MSc theses

Ongoing

  • Repositório de Entidades Morfológicas, Eduardo António Gonçalves Castanho. (2013-). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.

Finished

  • Atualização Semi-Automática dos Recursos da STRING, Catarina Vieira de Mendonça. (2017-2019). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Event Identification in STRING, José Paulo de Oliveira Rodrigues Marques Dias. (2017-2019). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Adivinhador de palavras desconhecidas, João Humberto Moncóvio Rebelo. (2017-2018). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Legibilidade de livros de Ensino de Português, Gonçalo André Ramos Carvalho Pinto. (2016-2018). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Portuguese Verb Sense Disambiguation Using Parallel Corpus, Valentyn Hulevych. (2017-2018). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Desambiguação semântica de nomes, José Pedro de Almeida Arvela. (2016-2017). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Desambiguação da classe de construção dos verbos, Ricardo Filipe Mendes Pires. (2016-2017). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Syntax Deep Explorer, José Miguel Pinheiro Correia. Instituto Superior Técnico, Universidade de Lisboa (2014-2015). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Desambiguação semântica de nomes, Rita Alexandra Correia Soares Amaro Policarpo. Instituto Superior Técnico, Universidade de Lisboa (2013-2015). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Verb Sense Classification, Gonçalo André Rodrigues Suissas. Instituto Superior Técnico, Universidade de Lisboa (2013-2014). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Linguistic Expression of Irony in Social Media, . (2012-2014). Paula Carvalho, advisor. Jorge Baptista, co-advisor.
  • Verb Sense Disambiguation, Tiago Manuel Paulo Travanca. Instituto Superior Técnico, Universidade Técnica de Lisboa (2011-2013). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • REAP.PT Sintáctico, Cristiano José Lopes Marques. Instituto Superior Técnico, Universidade Técnica de Lisboa (2010-2011). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Resolução de Expressões Anafóricas, Nuno Ricardo Pedruco Nobre. Instituto Superior Técnico, Universidade Técnica de Lisboa (2009-2011). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.

Graduation theses

Finished

Internships

Finished

  • Estágio de Portofólio, Catarina Vieira de Mendonça. (2017-2019). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.
  • Estágio de Portofólio, Ana Isabel Silva Galvão. (2017). Nuno J. Mamede, advisor. Jorge Baptista, co-advisor.