Name: Sandra Antunes

Academic Degree: Master

Professional Status: PhD Student

CLUL's Research Sector: Grammar & Resources

Research Areas: Multiword Expressions, Second Language Acquisition, Corpus Linguistics, Machine Translation


Academic Degrees:


Other courses:

  • 2017 - Training School: Computational methods and statistics in the annotation and analysis of discourse relations. Second TextLink Training School. February 6-10. Prague, Czech Republic.
  • 2016 - Training School: Methods and tools for the analysis of discourse relational devices. First TextLink Training School. January 18-22. Valencia, Spain.
  • 2014 - Learner Corpus Research Summer School: Corpus Approaches to Phraseology. September 8-12. Louvain-la-Neuve, Belgium.
  • 2013 - I Curso Paleografia Portuguesa Moderna (Séculos XVI-XVIII), “Escrivaninhas Judiciais, Notariais e Outras”. June, 11 - July, 13. Centro de História da Faculdade de Letras da Universidade de Lisboa.
  • 2000 - Speech Analysis, by Janet Slifka (Massachusetts Institute of Technology). CLUL.

Activities:

  • PhD Student, Faculty of Arts, University of Lisbon (Unidades Multilexicais do Português. Uma abordagem corpus driven para o estabelecimento de uma tipologia e de um léxico anotado).
  • Collaborator at CLUL.
  • Quality Manager at Unbabel.

Projects:


Participation in Cost Actions:

  • Parsing and multi-word expressions. Towards linguistic precision and computational efficiency in natural language processing – PARSEME (ICT COST Action IC1207).
  • Structuring Discourse in Multilingual Europe – TextLink (ISCH COST Action IS1312).

Participation in Scientific Committees:

  • Language Resources and Evaluation Conference (LREC 2018).
  • Language Resources and Evaluation Conference (LREC 2016).

Organization Committees:

  • CLUL-LingMe (Conference for graduate students and young researchers). June 16th, 2016. Faculty of Arts, University of Lisbon.
  • ConSOLE XXII - 22nd Conference of the Student Organization of Linguistics in Europe. January 8-10, 2014. Faculty of Arts, University of Lisbon.
  • Meeting Commemorating the 75th Anniversary of CLUL. November 2-5, 2007. CLUL.

Publications:

Book Chapters:

2016

  • Mendes, A. & S. Antunes
    “Collocations in Portuguese: A corpus-based approach to lexical patterns”
    In: B. S. Vilas (ed.). Collocations Cross-Linguistically. Corpora, Dictionaries and Language Teaching. Mémoires de la Société Néophilologique de Helsinki C. Helsinki: Société Néophilologique. Pp. 141-166.
  • Mendes, A., A. Estrela, M. F. Bacelar do Nascimento, L. Pereira, S. Antunes
    “New words, old suffixes: Nominal derivation in the African varieties of Portuguese compared to European Portuguese”
    In: Rudolf Muhr, Eugênia Duarte, Amália Mendes, Carla Amóros Negre, Juan Testa (eds.). Pluricentric Languages and non-dominant Varieties worldwide: Volume 2. The pluricentricity of Portuguese and Spanish: New concepts and descriptions. Frankfurt a.M. / Wien u.a., Peter Lang Verlag. Pp. 121-136.

2014

  • Bacelar do Nascimento, M. F., A. Mendes, S. Antunes & L. Pereira
    “The Reference Corpus of Contemporary Portuguese and related projects at Centre of Linguistics of the University of Lisbon”
    In: Sardinha, T. B. & T. Ferreira (eds.). Working with Portuguese Corpora. London/New York: Continuum Publishers. Pp. 237-256.

2006

  • Bacelar do Nascimento, M. F., A. Mendes & S. Antunes
    “Typologies of MultiWord Expressions Revisited: A Corpus-driven Approach”
    In: Kawaguchi, Yuji, Susumu Zaima, Toshihiro Takagaki (eds.). Spoken Language Corpus and Linguistic Informatics. Coll. Usage-Based Linguistic Informatics, Vol. V. Amsterdam/Philadelphia: John Benjamins Publishing Company. Pp. 227-244.

2005

  • Bacelar do Nascimento, M. F., J. Bettencourt Gonçalves, R. Veloso, S. Antunes, F. Barreto & R. Amaro
    “The Portuguese Corpus”
    In: Cresti, E. and M. Moneglia (eds.), C-ORAL-ROM: Integrated Reference Corpora for Spoken Romance Languages. Studies in Corpus Linguistics nº 15. Amsterdam/Philadelphia: John Benjamins Publishing Company. Pp. 163-207.

Proceedings:

2017

  • Antunes, S.
    “A Lexical Database for the Analysis of Portuguese Multiword Expressions”
    In: Proceedings of EUROPHRAS 2017 – Computational and Corpus-based Phraseology: Recent Advances and Interdisciplinary Approaches. November 13-14. London, UK.
  • Mendes, A., S. Antunes & P. Quaresma
    “The annotation coreference task at IberEval’17: the experience of CLUL/UE”
    In: Proceedings of the IBEREVAL workshop, held together with SEPLN 2017. September 19. Murcia, Spain.
  • Estrela, A. & S. Antunes  
    “A sufixação num corpus de aquisição de PLE/L2”
    In: Proceedings of Terceiro Congresso Internacional Pelos Mares da Língua Portuguesa. 4-6 May. Aveiro.

2016

  • Del Río, I., S. Antunes, A. Mendes & M. Janssen
    “Towards error annotation in a learner corpus of Portuguese”
    In: Proceedings of the 1st NLP4LA. November 16. Umeå, Sweden. Pp. 8-17.
  • Mendes, A., S. Antunes, M. Janssen & A. Gonçalves
    “COPLE2 corpus: a learner corpus of Portuguese”
    In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC). 25-27 May. Portorož, Slovenia.

2015

  • Antunes, S., A. Mendes, A. Gonçalves, M. Janssen, N. Alexandre, A. Avelar, A. Castelo, I. Duarte, M. J. Freitas, J. Pascoal & J. Pinto
    “Apresentação do Corpus de Português Língua Estrangeira/Língua Segunda – COPLE2”
    In: Textos Selecionados do XXX Encontro Nacional da Associação Portuguesa de Linguística. Porto, Portugal. Pp. 77-87.

2014

  • Antunes, S. & A. Mendes
    “An evaluation of the role of statistical measures and frequency for MWE identification”
    In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC). Reykjavik, Iceland. Pp. 4046-4051.

2013

  • Antunes, S. & A. Mendes
    “MWE in Portuguese: Proposal for a Typology for Annotation in Running Text”
    In: Proceedings of the 9th Workshop on Multiword Expressions. North American Chapter of the Association for Computational Linguistics. Atlanta, Georgia, USA. Pp. 87-92.
  • Antunes, S.
    “Para uma tipologia de associações de palavras do português”
    In: Textos Selecionados do XXVIII Encontro Nacional da Associação Portuguesa de Linguística. Faro, Portugal. Pp. 79-97.

2012

  • Cardoso, A., S. Pereira, S. Antunes & R. Veloso
    “A Comment on Comment Clauses: data from European Portuguese”
    In: Proceedings of the VII GSPC International Conference: Speech and Corpora. Belo Horizonte, Brasil.
  • Mendes, A., M. Généreux, I. Hendrickx, L. Pereira, M. F. Bacelar do Nascimento & S. Antunes
    “CQPWeb: uma nova plataforma de pesquisa para o CRPC”
    In: Textos Selecionados do XXVII Encontro Nacional da Associação Portuguesa de Linguística. Lisboa, Portugal. Pp. 466-477.

2010

  • Hendrickx, I., A. Mendes & S. Antunes
    “Proposal for Multi-word Expression Annotation in Running Text”
    In: Proceedings of the fourth Linguistic Annotation Workshop (LAW IV). Association for Computational Linguistics. Uppsala, Sweden. Pp. 100-108.

2008

  • Antunes, S., M. F. Bacelar do Nascimento, A. Mendes, L. Pereira & T. Sá
    “COMBINA-PT: uma base de dados de combinatórias lexicais do português”
    In: Textos Selecionados do XXIII Encontro Nacional da Associação Portuguesa de Linguística. Évora, Portugal. Pp. 33-45.

2006

  • Mendes, A., S. Antunes, M. F. Bacelar do Nascimento, J. M. Casteleiro, L. Pereira & T. Sá
    “COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions”
    In: Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC). Genoa, Italy. Pp. 1900-1905.
  • Antunes, S., M. F. Bacelar do Nascimento, J. M. Casteleiro, A. Mendes, L. Pereira & T. Sá
    “A Lexical Database of Portuguese Multiword Expressions”
    In: Vieira, R. et alii (eds.), Computational Processing of the Portuguese Language. Proceedings of the 7th International Workshop of PROPOR. IME-RJ/Itatiaia, Rio de Janeiro, Brasil. Berlin Heidelberg: Springer-Verlag. Pp. 238-243.
  • Antunes, S., M. F. Bacelar do Nascimento, J. M. Casteleiro, A. Mendes, L. Pereira & T. Sá
    “Corpus-based extraction and identification of Portuguese Multiword Expressions”
    In: Mertens, P. et alii (eds.). Actes de la 13ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN). Vol. I. Leuven: Presses Universitaires de Louvain. Pp. 389-397.

2003

  • Antunes, S. & R. P. Chaves
    “On the Licensing Conditions of Co-predication”
    In: Bouillon P. & K. Kanzaki (eds.). Proceedings of the 2nd International Workshop on Generative Approaches to the Lexicon. Université de Genève, Suisse. Pp. 161-168.

Presentations without publication:

2016

  • Del Río, I., A. Mendes, M. Janssen & S. Antunes
    “Towards error annotation in a learner corpus of Portuguese”
    IV Jornadas de PLE. 9-10 December. FLUC. Coimbra, Portugal.

2015

  • Mendes, A., S. Antunes & A. Gonçalves
    “Corpus de Português LE/L2 – COPLE2”
    III Jornadas de PLE. Aquisição e didática. 11-12 December. FLUL. Lisboa, Portugal.
  • Gonçalves, A., A. Mendes, S. Antunes, N. Alexandre, J. Pinto, M. J. Freitas, J. Pascoal & A. Avelar 
    “Apresentação do Corpus de Português Língua Estrangeira/Língua Segunda” 
    Workshop Learner Corpus: da construção de um corpus à análise de dados de PL2. 16 October. FLUL. Lisboa, Portugal.
  • Antunes, S. & A. Mendes
    “O uso de unidades multilexicais num corpus de aprendizagem de Português L2”
    Workshop Learner Corpus: da construção de um corpus à análise de dados de PL2. 16 October. FLUL. Lisboa, Portugal.
  • Mendes, A., S. Antunes & A. Gonçalves
    “COPLE2 – Corpus of Portuguese FL/L2”
    Learner Corpus Research Conference (LCR2015). 11-13 September. Nijmegen, The Netherlands.
  • Antunes, S. & A. Mendes 
    “Portuguese Multiword Expressions: data from a learner corpus”
    Learner Corpus Research Conference (LCR2015). 11-13 September. Nijmegen, The Netherlands.
  • Antunes, S. & A. Mendes  
    “Portuguese Multiword Expressions: data from a learner corpus”
    Corpus Linguistics Conference (CL2015). 21-24 July. Lancaster, United Kingdom.

2014

  • Mendes, A., S. Antunes, L. Pereira & M. F. Bacelar do Nascimento
    “Workflow of a Portuguese Dictionary of Multiword Expressions”
    Workflow of Corpus-based Lexicography. COST ENeL WG3 Meeting. 19 July. Bolzano, Italy.
  • Mendes, A. & S. Antunes
    “A Lexical database of Multiword Expressions in Portuguese” 
    PARSEME 2nd General Meeting. 10-11 March. Athens, Greece.

2007

  • Antunes, S.
    “Combinatórias do Português: uma abordagem corpus-driven com fins lexicográficos”
    Encontro Comemorativo do 75.º Aniversário do CLUL. 5 November. Lisboa, Portugal.
  • Antunes, S., M. F. Bacelar do Nascimento & A. Mendes
    “A Corpus-driven Study of the Variation of Co-occurrence Patterns in Written and Spoken Registers” 
    Corpus Linguistics Conference. 28-30 May. Birmingham, United Kingdom.