Grammar & Resources

The group focuses on modeling linguistic knowledge, integrating the interfaces between different areas of grammar with knowledge about how language is put to use. Joint work in formal phonology, the lexicon, syntax and semantics makes it possible to build an integrated model of grammar that considers both how grammar is represented in the human mind and how it can be modeled computationally; work on L1 and L2 acquisition is at the core of this effort. Models of language representation and models of language use are integrated through the study of corpora.

The group produces corpora and resources in order to document and describe contemporary European Portuguese as well as understudied contact languages and varieties (Portuguese-based creoles and national varieties of Portuguese in Africa and Asia). It also produces resources for the study of L1 and L2 acquisition in different settings. The group is part of CLARIN LP.

Research on L1 and L2 acquisition contributes to CLUL’s general aim of effectively linking fundamental and applied research, particularly in the areas of Educational Linguistics and Clinical Linguistics.

General goals:

- To produce new resources for the study of Portuguese and Portuguese-based creoles;

- To pursue basic research on natural language modeling, integrating knowledge on interfaces between language modules;

- To continue the documentation and description of understudied creoles and new varieties of Portuguese that emerged in a context of language contact;

- To develop the study of language acquisition, with an emphasis on language contact situations (see the new international Heritage Language Consortium) and on the comparison between typical and atypical development;

- To explore the potential of comparative linguistics in the production of resources for translation and to promote connections with the industry in the area of translation.

 

Resource | Type
A Lexicon of Child European Portuguese - CEPLEXicon | Lexicon
A Portuguese Native Language Identification Dataset - NLI-PT | Database
Acquisition of European Portuguese Databank - AcEP | Database
Child-Adult Interaction Corpus - CAI | Corpus
Child-Adult interaction European Portuguese | Database
Consonantic Sequences Oral and Written Production Tasks - PORESC | Tool
Controlled Portuguese - CLG | Database
Corpora of PLE | Corpus
Corpus Almeida - European Portuguese / French | Corpus
Corpus Angolar | Corpus
Corpus C-ORAL-ROM | Corpus
Corpus CCF | Corpus
Corpus CINTIL | Corpus
Corpus Fadambo | Corpus
Corpus Leiria (1991) | Corpus
Corpus of Cape Verdean Portuguese | Corpus
Corpus of Sri Lanka Portuguese | Corpus
Corpus of the Diaries of the Portuguese Parliament annotated with PoS - PTPARL | Corpus
Corpus PESTRA | Corpus
Corpus Português Fundamental - Corpus PF | Corpus
Corpus Principense | Corpus
Corpus REDIP | Corpus
Corpus Santome | Corpus
Corpus SANTOS - European Portuguese | Corpus
Crosslinguistic Child Phonology Project - Português Europeu - CLCP-PE | Tool
Dados Orais de Cabo Verde - CV Words | Database
Demo de Subespecificação e Desambiguação de Escopo | Tool
Dictionary of Hindi-Portuguese-Hindi | Database
Diu Indo-Portuguese Data Set | Database
Learner Corpus of Portuguese L2 - COPLE2 | Corpus
LT Corpus (Literary Corpus) - LT Corpus | Corpus
Multifunctional Computational Lexicon of Contemporary Portuguese | Lexicon
Nominal Multiword Lexical Units in European Portuguese | Lexicon
NPChunks: Corpus of 1000 sentences annotated with PoS and nominal chunks - NPChunks | Corpus
Online Corpus of Writing and Speech of Children in the Early Years of Schooling - EFFE-On | Corpus
Online Dictionary Portuguese-Slovak/Slovak-Portuguese | Database
Pereira&Freitas - EP | Corpus
Person-Machine Interaction in Natural Language - INQUER | Database
PhonoDis | Corpus
Phonological Awareness Tasks for First Grade School Children - TCFC | Tool
Portuguese Corpus Annotated for Modality - MODAL | Corpus
Portuguese Lexicon of Discourse Markers - LDM-PT | Lexicon
Portuguese Technical Lexica - LEXTEC | Lexicon
Ramalho – EP | Corpus
Reference Corpus of Contemporary Portuguese - CRPC | Corpus
Santome Structure Dataset | Database
Spoken Corpus Mozambique 1986-87 - SCM | Corpus
Spoken Portuguese - Geographical and Social Varieties | Corpus
Vocatives in European Portuguese | Corpus
Word Combination in European Portuguese - LEX-MWE-PT | Lexicon
WordNet.PT | Lexicon
Book Chapters
Duarte, I., & Matos, G. (2000). Romance Clitics and the Minimalist Program. In New Comparative Studies in Portuguese Syntax. Oxford: Oxford University Press.
Viana, M. C., Trancoso, I., Mascarenhas, I., Duarte, I., Matos, G., Oliveira, L., et al. (1999). Apresentação do Projecto CORAL – Corpus de Diálogo Etiquetado. In Linguística Computacional: Investigação Fundamental e Aplicações. Lisboa: APL/Edições Colibri.
Duarte, I. (1998). Chomsky e Descartes: O Uso Estratégico de um Argumento Cartesiano e a Fundação das Ciências da Cognição. In Descartes, Leibniz e a Modernidade. Ribeiro dos Santos, Alves & Cardoso. Lisboa: Edições Colibri.
Duarte, I. (1997). Ordem de Palavras: Sintaxe e Estrutura Discursiva. In Sentido que a Vida Faz. Estudos para Óscar Lopes. Brito et al. Porto: Campo das Letras.
Duarte, I. (1996). Se a língua materna se tem de ensinar, que professores temos de formar? In Formar Professores de Português Hoje. Lisboa: Edições Colibri.
Duarte, I., & Brito, A. M. (1996). Sintaxe. In Introdução à Linguística Geral e Portuguesa. Colecção Universitária. Série Linguística. Faria, Ribeiro, Duarte & Gouveia. Lisboa: Caminho.
Duarte, I., & Hagège, C. (1996). Construção de Gramáticas Formais para o Processamento da Linguagem Natural. In . Mateus & Branco. Lisboa: Edições Colibri.
Duarte, I., Matos, G., & Faria, I. H. (1996). Specificity of European Portuguese Clitics in Romance. In Studies on the Acquisition of Portuguese. Lisboa: APL/Edições Colibri.
Duarte, I., & Delgado-Martins, M. R. (1993). Brincar com a Linguagem, Conhecer a Língua, Fazer Gramática. In Linguagem e Desenvolvimento. Braga: Instituto de Educação.
Duarte, I. (1993). O Ensino da Gramática como Explicitação do Conhecimento Linguístico. In Ensino-Aprendizagem da Língua Portuguesa. Leiria: ESEL-IPL.
Duarte, I. (1992). Oficina Gramatical: Contextos de Uso Obrigatório do Conjuntivo. In Para a Didáctica do Português. Seis Estudos de Linguística. Delgado‑Martins et al. Lisboa: Edições Colibri.
Duarte, I. (1991). Funcionamento da Língua: a Periferia dos NPP. In Documentos do Encontro sobre Novos Programas de Português. Lisboa: Edições Colibri.
Afonso, C., Gonçalves, A., & Freitas, M. J. (2013). Como é que as crianças contam as palavras? Dados sobre a consciência lexical em Português europeu. In Textos Selecionados, XXVIII Encontro Nacional da Associação Portuguesa de Linguística (pp. 23-39). Silva, I. Falé & I. Pereira. Coimbra: Associação Portuguesa de Linguística.
Ramalho, A. M., & Freitas, M. J. (2012). Morphophonological complexity in the acquisition of EP: the case of nominal plural forms with final nasal diphthongs. In Selected Proceedings of the Romance Turn IV Workshop on the Acquisition of Romance Languages (pp. 27-52). Ferré, P. Prévost, L. Tuller & R. Zebib. Newcastle: Cambridge Scholars Publishing.
Freitas, M. J., Miguel, M., & Faria, I. (2001). Interaction between Prosody and morphosyntax: plurals within Codas in the acquisition of European Portuguese. In Approaches to Bootstrapping. Phonological, Lexical, Syntactic and Neurological Aspects of Early Language Acquisition (pp. 45-58). B. Höhle & J. Weissenborn. Amsterdam: John Benjamins Publishers.
Duarte, I., & Freitas, M. J. (2000). O oral e o escrito. In Língua Portuguesa. Instrumentos de Análise (pp. 379-420). I. Duarte. Lisboa: Universidade Aberta.
Hagemeijer, T. (2013). The Gulf of Guinea creoles: genetic and typological relations. In Creole languages and linguistic typology (pp. 163-206). Bhatt & T. Veenstra. Amsterdam, Philadelphia: John Benjamins.
Hagemeijer, T. (2013). Santome. In The Survey of Pidgin and Creole Languages (Vol. II: Portuguese-based, Spanish-based, and French-based Languages, pp. 50-58). Michaelis, P. Maurer, M. Haspelmath & M. Huber. Oxford: Oxford University Press.
Hagemeijer, T. (2009). Initial vowel agglutination in the Gulf of Guinea creoles. In Complex processes in new languages (pp. 29-50). Aboh & N. Smith. Amsterdam/Philadelphia: John Benjamins.
Hagemeijer, T. (2009). Aspects of discontinuous negation in Santome. In Negation patterns in West African languages and beyond (pp. 139-165). Cyffer, E. Ebermann & G. Ziegelmeyer. Amsterdam/Philadelphia: John Benjamins.
Hagemeijer, T. (2008). Languages in São Tomé and Príncipe. In Bradt travel guide for S. Tomé e Príncipe.
Hagemeijer, T., & Lima, C. (2008). Lungwa Santome vocabulary. In Bradt travel guide for S. Tomé e Príncipe (pp. 201-210).
Hagemeijer, T. (2003). Languages in São Tomé and Príncipe. In Bradt guide for Gabon and S. Tomé e Príncipe (pp. 188-189).
Hagemeijer, T. (2001). Semi-lexicality and underspecification in serial verb constructions. In Semi-lexical categories (pp. 415-451). Corver & H. van Riemsdijk. New York: Mouton de Gruyter.
Lejeune, P. (2012). Le discours d’expert de l’analyse conjoncturelle au Monde et à l’INSEE: De Sirius à Knock. In Discours d’experts et d’expertise (pp. 47-73). Garric & I. Léglise. Berne: Peter Lang.
Lejeune, P. (2012). Le mot marché(s) dans les comptes rendus boursiers : entre métonymie et personnalisation. In Les discours de la bourse et de la finance. Forum Für Fachsprachen-Forschung (pp. 159-177). L. Gautier. Berlin: Frank und Timme.
Lejeune, P. (2009). Quand já ne se traduit pas par déjà. In Estudos linguísticos 2 (pp. 123-140). Lisboa: Colibri.
Lejeune, P. (2009). Kärcher et racaille, ou quand un événement énonciatif déteint sur le sens des mots. In La circulation des discours (pp. 163-185). Munoz, S. Marnette, L. Rosier & D. Vincent. Québec: Editions Nota Bene.
Lejeune, P. (2006). Le brouillage énonciatif dans le compte rendu de documents techniques : le cas du Monde et des Notes de conjoncture de l’INSEE. In Dans la jungle des discours. Genres de discours et discours rapportés (pp. 237-248). J. M. Lopez-Muñoz, S. Marnette & L. Rosier. Cadiz: Publicaciones de la Universidad de Cadiz.
Marques, R. (2013). Modo. In Eduardo Paiva Raposo et al., Gramática do Português (cap. 19, pp. 671-693). Lisboa: Fundação Calouste Gulbenkian.