Fernando Batista

From L²F


Fernando Batista

Fernando Batista received his PhD in Computer Science and Engineering in 2011 from Instituto Superior Técnico (IST). He previously received the Mestre degree in Electrical and Computer Engineering in 2003 also from IST, and the Licenciado degree in in 1997 in Mathematics and Computer Sciences from Universidade da Beira Interior (UBI). He is currently a lecturer at Lisbon University Institute (ISCTE-IUL), being Assistant Professor from 2011 onwards and team coordinator of several courses, namely: Operating Systems (2011-2015), Probabilistic Learning for Natural Language Processing (2015/2016); and Computational Language Processing (2014/2015). Since early 2015 he has been elected member of the Permanent Committee of the ISCTE-IUL Pedagogical Council and member of Pedagogical Committee of the ISTA School. He is also researcher at the Spoken Language Systems Lab. of INESC-ID, Lisbon, where he has participated in 3 European projects: METANET4U (2011-2013), DIRHA (2012-2014), and SpeDial (2014-2015); and 4 national projects: POSTPORT (2008-2012), PT-STAR (2009-2012), COPAS (2012-2015), and MISNIS (2013-2015). He coordinated the INESC-ID team in the SpeDial project during 2014 and 2015. Recently, he was the handbook chair of the EMNLP 2015 that was held in Lisbon in September 17-21. He has been a member of the Lisbon Machine Learning Summer School (LxMLS) technical staff from 2011 to 2015, and he is now part of the organisation team for LXMLS 2016. He was member of the Program Committee of several conferences, including: Interspeech (2014, and 2015), SLATE (2013, 2014, and 2015), KDIR 2013, ICT 2014. He was also a reviewer for the Neurocomputing Journal, EPIA 2013, ACL 2013, Interspeech 2013, ACL 2014, Speech Prosody 2014, and for several other national conferences. He is member of ISCA, and in 2016 was elevated to IEEE Senior Member.


INESC ID Information

Publications

Edited Books

2016

  • Alberto Abad, Alfonso Ortega, Antonio Teixeira, Carmen Mateo, Carlos Hinarejos, Fernando Perdigão, Fernando Batista, Nuno J. Mamede, Advances in Speech and Language Technologies for Iberian Languages: Third International Conference, IberSPEECH 2016, Springer International Publish, Lecture Notes in Computer Science, 10077, Lisboa, Portugal, November 2016

Book Chapters

2016

  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Information Processing and Management of Uncertainty in Knowledge-Based Systems - Volume 611 of the series Communications in Computer and Information Science, chapter Creating Extended Gender Labelled Datasets of Twitter Users, pp 690-702, Springer, June 2016

2015

  • Marco Paulo Fernandes Vicente, Joao P. Carvalho, Fernando Batista, Communications in Computer and Information Science Vol 563 - International Languages, Applications and Technologies, chapter Using Unstructured Profile Information for Gender Classification of Portuguese and English Twitter Users, pp 57-64, Springer, December 2015
  • Mariana Juliao, Jorge Silva, Ana Aguiar, Helena Moniz, Fernando Batista, Languages, Applications and Technologies, chapter Speech Features for Discriminating Stress Using Branch and Bound Wrapper Search, Springer International Publish, 4th International Symposium, SLATE 2015, Madrid, Spain, June 18-19, 2015, Revised Selected Papers, December 2015
  • Jose Moura, Fernando Batista, Elsa Alexandra Cabral da Rocha Cardoso, Luis Nunes, Chapter 6. Intelligent Management and Efficient Operation of Big Data, chapter Handbook of Research on Trends and Future Directions in Big Data and Web Intelligence, IGI Global, September 2015

2004

International Journals

2017

  • Joao P. Carvalho, Hugo Rosa, Gaspar Manuel Rocha Brogueira, Fernando Batista, MISNIS: An Intelligent Platform for Twitter Topic Mining, Expert Systems With Applications, Elsevier, doi: https://doi.org/10.1016/j.eswa.2017.08.001, December 2017

2016

  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, A Smart System for Twitter Corpus Collection, Management and Visualization, International Journal of Technology and Human Interaction (IJTHI), IGI Global, vol. 13, n. 3, pages 13-32, December 2016
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Using geolocated tweets for characterization of Twitter in Portugal and the Portuguese administrative regions, Social Network Analysis and Mining, Springer, vol. 6, n. 1, pages 1-20, doi: DOI: 10.1007/s13278-016-0347-8, June 2016

2014

  • Helena Moniz, Fernando Batista, Ana Isabel Mata da Silva, Isabel Trancoso, Speaking style effects in the production of disfluencies, Speech Communication, vol. 65, pages 20-35, doi: 10.1016/j.specom.2014.05.004, November 2014
  • Ana Isabel Mata da Silva, Helena Moniz, Fernando Batista, Comparing phrase-final patterns across speech styles and groups in European Portuguese, Noveaux cahiers de linguistique francaise, n. 31, pages 171-176, Genève, Swiss, September 2014

2013

  • Fernando Batista, Ricardo Ribeiro, Sentiment Analysis and Topic Classification based on Binary Maximum Entropy Classifiers, Procesamiento de Lenguaje Natural, Sociedad Española para el Procesamiento de Lenguaje Natural, vol. 50, n. 1, pages 77–84, March 2013

2012

  • Fernando Batista, Helena Moniz, Isabel Trancoso, Nuno J. Mamede, Bilingual Experiments on Automatic Recovery of Capitalization and Punctuation of Automatic Speech Transcripts, IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, n. 2, pages 474 -- 485, doi: 10.1109/TASL.2011.2159594, February 2012

2008

  • Fernando Batista, Diamantino António Caseiro, Nuno J. Mamede, Isabel Trancoso, Recovering Capitalization and Punctuation Marks for Automatic Speech Recognition: Case Study for the Portuguese Broadcast News, Speech Communication, vol. 50, n. 10, pages 847-862, doi: 10.1016/j.specom.2008.05.008, October 2008

International Conferences

2017

  • Joao P. Carvalho, Hugo Rosa, Fernando Batista, Detecting relevant tweets in very large tweet collections: the London Riots case study, In FUZZ-IEEE, 2017 IEEE International Conference on Fuzzy Systems, IEEE Xplorer, Naples, Italy, July 2017

2016

  • Rubén Solera Ureña, Helena Moniz, Fernando Batista, Ramon Fernandez Astudillo, Joana Carvalho Filipe de Campos, Ana Paiva, Isabel Trancoso, Acoustic-prosodic automatic personality trait assessment for adults and children, In IberSPEECH 2016, Springer International Publishing, vol. 10077, series Lecture Notes in Computer Science, pages 192--201, doi: 10.1007/978-3-319-49169-1_19, Advances in Speech and Language Technologies for Iberian Languages: Third International Conference, IberSPEECH 2016, Lisbon, Portugal, November 23-25, 2016, Proceedings, Lisbon, November 2016
  • Eugénio Alves Ribeiro, Fernando Batista, Isabel Trancoso, Ricardo Ribeiro, David Martins de Matos, Automatic Detection of Hyperarticulated Speech, In IberSPEECH 2016, Springer International Publish, vol. 10077, series Lecture Notes in Computer Science, pages 182--191, doi: http://dx.doi.org/10.1007/978-3-319-49169-1_18, Advances in Speech and Language Technologies for Iberian Languages: Third International Conference, IberSPEECH 2016, Lisbon, Portugal, November 23-25, 2016, Proceedings, Lisbon, November 2016
  • Vera Cabarrão, Isabel Trancoso, Ana Isabel Mata da Silva, Helena Moniz, Fernando Batista, Global analysis of entrainment in dialogues, In IberSPEECH 2016, Springer, vol. 10077, series Lecture Notes in Computer Science, pages 215--223, doi: 10.1007/978-3-319-49169-1_21, Advances in Speech and Language Technologies for Iberian Languages: Third International Conference, IberSPEECH 2016, Lisbon, Portugal, November 23-25, 2016, Proceedings, Lisbon, November 2016
  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Improving Twitter gender classification using multiple classifiers, In 8th European Symposium on Computational Intelligence and Mathematics (ESCIM 2016), pages 121 - 127, Sofia, Bulgaria, October 2016
  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Creating Extended Gender Labelled Datasets of Twitter Users, In IPMU2016 - 16th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based systems, Springer, vol. 611, series Communications in Computer and Information Science, pages 690-702, TU Eindhoven, The Netherlands, June 2016
  • Fernando Batista, Pedro dos Santos Lopes Curto, Isabel Trancoso, Alberto Abad, Jaime Rodrigues Ferreira, Eugénio Alves Ribeiro, Helena Moniz, David Martins de Matos, Ricardo Ribeiro, SPA: Web-based Platform for easy Access to Speech Processing Modules, In LREC, European Language Resources Association (ELRA), pages 3886--3892, doi: ISBN: 978-2-9517408-9-1, Portorož, Slovenia, May 2016

2015

  • Hugo Rosa, Joao P. Carvalho, Fernando Batista, Detecting User Influence in Twitter: PageRank vs Katz, a case study, In ESCIM - 7th European Symposium on Computational Intelligence and Mathematics, pages 212-217, Cádiz, Spain, October 2015
  • Vera Cabarrão, Helena Moniz, Jaime Rodrigues Ferreira, Fernando Batista, Isabel Trancoso, Ana Isabel Mata da Silva, Sérgio dos Santos Lopes Curto, Prosodic Classification of Discourse Markers, In International Congress of Phonetic Sciences (ICPhS 2015), Glasgow, Scotland, UK, August 2015
  • Fernando Batista, Joao P. Carvalho, Text based classification of companies in CrunchBase, In FUZZ-IEEE2015 IEEE International Conference on Fuzzy Systems, IEEE, pages , Istambul, Turkey, August 2015
  • Marco Paulo Fernandes Vicente, Fernando Batista, Joao P. Carvalho, Twitter gender classification using user unstructured information, In FUZZ-IEEE, 2015 IEEE International Conference on Fuzzy Systems, IEEE, Istambul, Turkey, August 2015
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Using Geolocated Tweets for Characterization of Portuguese Administrative Regions, In 18th AGILE International Conference on Geographic Information Science, Lisboa, Portugal, June 2015
  • Marco Paulo Fernandes Vicente, Joao P. Carvalho, Fernando Batista, Using Unstructured Profile Information for Gender Classification of Portuguese and English Twitter Users, In SLATE'15, IV Symposium on Languages, Applications and Technologies, Springer, pages 143-148, Madrid, Spain, June 2015

2014

  • Hugo Rosa, Fernando Batista, Joao P. Carvalho, Twitter Topic Fuzzy Fingerprints, In WCCI2014, FUZZ-IEEE, 2014 IEEE World Congress on Computational Intelligence, International Conference on Fuzzy Systems, IEEE, series IEEE Xplorer, pages 776-783, Beijing, China, July 2014
  • Hugo Rosa, Joao P. Carvalho, Fernando Batista, Detecting a Tweet’s Topic within a Large Number of Portuguese Twitter Trends, In SLATE'14 - 3rd Symposium on Languages, Applications and Technologies, Schloss Dagstuhl, vol. 4659, series OpenAccess Series in Informatics (OASIcs), pages 185-199, doi: http://dx.doi.org/10.4230/OASIcs.SLATE.2014.185, June 2014
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Helena Moniz, Expanding a Database of Portuguese Tweets, In SLATE'14 3rd Symposium on Languages, Applications and Technologies, Schloss Dagstuhl, vol. 4569, series OpenAccess Series in Informatics (OASIcs), pages 275-282, Bragança, Portugal, June 2014
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Helena Moniz, Portuguese geolocated tweets: an overview, In ISDOC2014 - Proceedings of the International Conference on Information Systems and Design of Communication, ACM, pages 178-179, Lisbon, Portugal, May 2014
  • Vera Cabarrão, Helena Moniz, Fernando Batista, Ricardo Ribeiro, Nuno J. Mamede, Hugo Meinedo, Isabel Trancoso, Ana Isabel Mata, David Martins de Matos, Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach, In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), European Language Resources Association (ELRA), pages 3908-3913, Reykjavik, Iceland, May 2014

2013

  • Anabela Barreiro, Johanna Monti, Brigitte Orliac, Fernando Batista, When Multiwords Go Bad in Machine Translation, In Workshop on Multi-word Units in Machine Translation and Translation Technology, http://www.mt-archive.info/10/MTS-2013-W4-Barreiro, September 2013
  • Joao P. Carvalho, Vasco Pedro, Fernando Batista, Towards Intelligent Mining of Public Social Networks’ Influence in Society, In IFSA-NAFIPS2013 - 2013 IFSA World Congress and NAFIPS Annual Meeting, IEEE Xplore, pages 478-483, Edmonton, Canada, June 2013

2012

2011

  • Helena Moniz, Fernando Batista, Isabel Trancoso, Ana Isabel Mata da Silva, Analysis of interrogatives in different domains, In Towards Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues. Third COST 2102 International Training School, Springer Berlin / Heidelberg, series Book series: Lecture Notes in Computer Science, pages 136-148, Caserta, Italy, January 2011

2010

2009

2008

  • Ana Cristina Mendes, Luísa Coheur, Nuno J. Mamede, Ricardo Ribeiro, David Martins de Matos, Fernando Batista, QA@L2F, first steps at QA@CLEF, Springer-Verlag, vol. 5152, series Lecture Notes in Computer Science, September 2008

2007

  • Ana Cristina Mendes, Luísa Coheur, Nuno J. Mamede, Luís Carlos da Silva Romão, João Miguel Sanches Loureiro, Ricardo Ribeiro, Fernando Batista, David Martins de Matos, QA@L2F@QA@CLEF, In Working Notes for the CLEF 2007 Workshop, September 2007
  • Fernando Batista, Diamantino António Caseiro, Nuno J. Mamede, Isabel Trancoso, Recovering Punctuation Marks for Automatic Speech Recognition, In Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), ISCA, vol. 1, series Interspeech, pages 2153-2156, Antwerp, Belgium, September 2007

2006

  • Ricardo Ribeiro, Fernando Batista, Joana Paulo Pardal, Nuno J. Mamede, H. Sofia Pinto, Cooking an Ontology, In The Twelfth International Conference on Artificial Intelligence: Methodology, Systems, Applications, Springer Berlin / Heidelberg, vol. 4183, series Lecture Notes in Computer Science, pages 213-221, Varna, Bulgaria, September 2006
  • Jorge Baptista, Fernando Batista, Nuno J. Mamede, Building a Dictionary of Anthroponyms, In PROPOR'2006 - Computational Processing of the Portuguese Language, Springer Verlag, Berlin / Heidelberg, vol. 3960, series Lecture Notes in Computer Science, pages 21-30, Itatiaia, Brazil, May 2006

2004

  • Luísa Coheur, Fernando Batista, Nuno J. Mamede, Towards a flexible syntax/semantics interface, In Proceedings of the Herramientas y Recursos Linguísticos para el Español y el Portugués workshop, a satelite of the Ninth, pages 265-272, Puebla, Mexico, November 2004

2003

  • Fernando Batista, Nuno J. Mamede, Flexible Module for Shallow Parsing, Using Preferences, In TASHA'2003 - Workshop on Tagging and Shallow Processing of Portuguese, Faculdade de Ciências da Universidade de Lisboa, series Technical Reports, pages 5-6, Lisboa, Portugal, October 2003
  • Luísa Coheur, Fernando Batista, Joana Paulo Pardal, JaVaLI! undestanding real questions, In Proc. EUROLAN'2003 - Student Workshop on Applied Natural Processing, Hamburg, Germany, July 2003

2002

2000

  • Luzia Helena Wittmann, Ricardo Ribeiro, Tânia Pego, Fernando Batista, Some Language Resources and Tools for Computational Processing of Portuguese at INESC, In LREC2000 – Second International Conference on Language Resources and Evaluation, vol. 1, pages 347, Athens, Greece, June 2000

National Journals

2016

  • Vera Cabarrão, Helena Moniz, Jaime Rodrigues Ferreira, Fernando Batista, Isabel Trancoso, Ana Isabel Mata da Silva, Sérgio dos Santos Lopes Curto, Classificação prosódica de marcadores discursivos, Revista da Associação Portuguesa de Linguística, n. 2, pages 69 -- 95, July 2016

2014

National Conferences

2016

  • Angela Jusupova, Fernando Batista, Ricardo Ribeiro, Characterizing the Personality of Twitter Users based on their Timeline Information, In 16 Conferência da Associacao Portuguesa de Sistemas de Informação, pages 292 - 299, Porto, Portugal, October 2016
  • Fernando Rebelo, Fernando Batista, Ricardo Ribeiro, Cascatas de Classificação de Sentimento em Microblogues, In INFORUM 2016 - Atas do 8.o Simpósio de Informática, pages 203 -- 214, Lisboa, Portugal, September 2016
  • Luís Dias, Tomás Brandão, Fernando Batista, Detecting violence on movie excerpts: A machine-learning approach based on audio and video features, In INForum 2016, Gestão de Dados e Conhecimento, Lisboa, Portugal, September 2016

2015

  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Sistema Inteligente de Recolha, Armazenamento e Visualização de Informação proveniente do Twitter, In CAPSI2015 - 15ª Conferência da Associação Portuguesa de Sistemas de Informação, Lisboa, Portugal, October 2015
  • Mariana Juliao, Jorge Silva, Ana Aguiar, Helena Moniz, Jaime Rodrigues Ferreira, Fernando Batista, Speech Features for Discriminating Stress, In 10th Conference on Telecommunications, Conftele 2015, IT, https://www.it.pt/Publications/PaperConference/224, September 2015
  • Gaspar Manuel Rocha Brogueira, Fernando Batista, Joao P. Carvalho, Arquitetura e Desenvolvimento de um Repositório de Tweets em Português Europeu, In 5as Jornadas de Informática da Universidade de Évora - JIUE 2015, Springer, Évora, Portugal, February 2015

2014

2013

2012

2005

  • Jorge Baptista, Fernando Batista, Nuno J. Mamede, Cristina Mota, Npro: um novo recurso para o processamento computacional do Português, In XXI Encontro APL, Porto, Portugal, December 2005

Technical Reports

2014

  • Anabela Barreiro, Luísa Coheur, Tiago Luís, Angela Costa, Fernando Batista, João Graça, Isabel Trancoso, Multiword and Semantico-Syntactic Unit Alignments, Tech. Rep. 23 / 2014 INESC-ID Lisboa, December 2014

2008

  • David Martins de Matos, Ricardo Ribeiro, Sérgio Paulo, Fernando Batista, Luísa Coheur, Joana Paulo Pardal, Natural Language Engineering on a Computational Grid (NLE-GRID) T2 - Encapsulation of Reusable Components, Tech. Rep. 31 / 2008 INESC-ID Lisboa, January 2008

2006

Doctoral Theses

2011

Masters Theses

2003

Other Publications

2014

  • Vera Cabarrão, Helena Moniz, Fernando Batista, Isabel Trancoso, Ana Isabel Mata, Sérgio dos Santos Lopes Curto, Discourse markers in spontaneous speech in European Portuguese: a first approach, Università dell'Insubria, October 2014

2012

MSc theses

Finished

  • Characterizing the Personality of Twitter Users Based on their Timeline Information, Anzhela Zhusupova. ISCTE-IUL (2015-2016). Fernando Batista, advisor.
  • Análise de Sentimento em Microblogues com base em Cascatas de Classificação, Fernando Manuel Dias Rebelo. ISCTE-IUL (2015-2016). Fernando Batista, advisor. Ricardo Ribeiro, co-advisor.
  • Detecting Violent Excerpts in Movies using Audio and Video Features, Luís Jorge Gregório Dias. ISCTE-IUL (2015-2016). Fernando Batista, co-advisor.
  • Sistema Inteligente de Recolha e Armazenamento de Informação provenienter do Twitter, Gaspar Manuel Rocha Brogueira. ISCTE-IUL (2014-2015). Fernando Batista, advisor. Joao P. Carvalho, co-advisor.
  • Topic Detection within Public Social Networks, Hugo Rosa. Instituto Superior Técnico, Universidade de Lisboa (2013-2014). Joao P. Carvalho, advisor. Fernando Batista, co-advisor.

Research Interests

  • Machine learning
  • Natural Language Processing
  • Text and Speech processing
  • Shallow Parsing
  • Also: Operating Systems and Computer Architectures

Projects

Past Projects

  • SpeDial - Spoken Dialogue Analytics
  • COPAS - Contrast and Parallelism in Speech (03-2012 a 07-2015)
  • MISNIS - Intelligent Mining of Public Social Networks’ Influence in Society (04-2013 a 07-2015)
  • DIRHA - Distant-speech Interaction for Robust Home Applications (01-2012 a 12-2014)
  • METANET4U - European project aiming at supporting language technology for European languages and multilingualism (02-2011 a 02-2013)
  • PT-STAR - Speech Translation Advanced Research to And From Portuguese (05-2009 a 07-2012)
  • POSTPORT - Porting Speech Technologies to other Varieties of Portuguese (01-2008 a 06-2011)
  • Automatic Punctuation and Capitalization for automatic speech transcripts
  • Implementation of a shallow parser

Other Activities

  • Handbook Chair, EMNLP2015 - 2015 Empirical Methods in Natural Language Processing, Lisbon, Portugal, September 2015
  • Organisation Committee, LxMLS 2016
  • Student Workshop Co-chair, PROPOR 2016