Research interests
- Multiword expressions. Automatic extraction, lexical description and formalization in wide-coverage computational grammars.
- Lexical semantics and computational lexicography.
- Corpus linguistics.
- Syntax-semantics interface. Psychological predicates. Pronominal clitics.
Projects and other academic activities
- Guest editor of the Special Issue of the International Journal of Language Resources and Evaluation: "Multiword Expressions: hard going or plain sailing?" with Paul Rayson, Serge Sharoff, Scott Piao and Stefan Evert.
- Co-organizer of the COLING/ACL 2006 Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties in Sydney.
- June 2005 -- May 2007 member of the STEVIN project Identification and Representation of Multiword Expressions(IRME), a collaboration between Utrecht Institute of Linguistics (Uil-OTS) and Alfa-Informatica.
- 2002 -- 2004 member of ``Finding and Processing Multi-word Lexemes'', a collaboration between Alfa-Informatica and Morphologic funded by OTKA-NWO.
- 2000 -- 2005 member of the PIONIER project Algorithms for Linguistic Processing (ALP)
Publications
- Tim van de Cruys and Villada Moirón, Begoña (2007). Lexico-Semantic Multiword Expression Extraction (.pdf) In F. Van Eynde, P. Dirix, I. Schuurman, and V. Vandeghinste (eds.) Proceedings of Computational Linguistics in The Netherlands 2006.
- Tim van de Cruys and Villada Moirón, Begoña (2007). Semantics-based Multiword Expression Extraction (.pdf) In Proceedings of the ACL Workshop A Broader Perspective on Multiword Expressions, pp. 25-32. Prague, Czech Republic.
- Villada Moirón, B., Aline Villavicencio, Diana McCarthy, Stefan Evert and Suzanne Stevenson (eds.) 2006. Proceedings of the ACL-SIGLEX 2006 Workshop Multiword Expressions: Identifying and Exploiting Underlying Properties.
- Villada Moirón, Begoña and Joerg Tiedemann (2006). Identifying idiomatic expressions using automatic word-alignment(.pdf). In Proceedings of the EACL 2006 Workshop on Multiword Expressions in a Multilingual Context. Trento, Italy.
- Villada Moirón, Begoña (2005). Linguistically enriched corpora for establishing variation in support verb constructions (.pdf). In Proceedings of the 6th International Workshop on Linguistically Interpreted Corpora (LINC-2005). Jeju Island, Republic of Korea.
- Villada Moirón, Begoña (2005). Data-driven identification of fixed expressions and their modifiability (.pdf). PhD Thesis. Alfa-Informatica. University of Groningen.
- Villada Moirón, Begoña (2004). Distinguishing prepositional complements from fixed arguments (.rtf) In Proceedings of the 11th EURALEX International Congress. Vol. III, pp. 935-942. Lorient, France.
- B. Kis, B. Villada, G. Bouma, G. Ugray, T. Bíró, G. Pohl and J. Nerbonne (2004). Methods for the extraction of Hungarian Multi-Word Lexemes (.pdf) In B. Decadt, V. Hoste and G. de Pauw (eds.) Proceedings of Computational Linguistics in the Netherlands 2003. pp. 47--62. Antwerp Papers in Linguistics. The Netherlands
- B. Kis, B. Villada, G. Bouma, G. Ugray, T. Bíró, G. Pohl and J. Nerbonne (2004). A New Approach to the Corpus-based Statistical Investigation of Hungarian Multi--word Lexemes (.pdf) In Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC) 2004. Vol. V, pp 1677--1681. Lisbon, Portugal
- Villada Moirón, Begoña (2004). Discarding noise in an automatically acquired lexicon of support verb constructions (.pdf) In Proceedings of the 4th International Conference on Language Resources and Evaluation LREC 2004. Vol. V, pp 1859--1862. Lisbon, Portugal
- Villada, Begoña & Bouma, Gosse(2002). A corpus-based approach to the acquisition of collocational prepositional phrases. Proceedings of EURALEX 2002, Copenhagen. Denmark
- Bouma, Gosse & Villada, Begoña (2002). Corpus-based acquisition of collocational prepositional phrases. Computational Linguistics in the Netherlands (CLIN) 2001, University of Twente, 2002
- Villada, Begoña and Carl Vogel (2001). Grammatical Relations and Semantic Argument Structure in Spanish Emotion Predicates in Gutierrez-Rexach, Javier (ed.) Meaning and the Components of Grammar/El Significado y los componentes de la Gramatica. LINCOM Studies in Theoretical Linguistics 26, Muenchen, Germany:LINCOM Europa.
- Villada, Begoña (2000). Status of Galician and Spanish Pronominal Clitics. abstract.ps MSc dissertation submitted to the University of Dublin.
- Vogel, Carl and Begoña Villada (2000). Spanish Psychological Predicates in Ronnie Cann, Claire Grover, and Philip Miller (eds.) Grammatical Interfaces in HPSG. Stanford, CA:CSLI Publications.
- Vogel, Carl and Begoña Villada (1999). An HPSG Analysis of Grammatical Relations, Syntactic Valency and Semantic Argument Structure in Spanish Psychological Predicates and other Instances of Quirky Case and Agreement Tech. rep. TCD-CS-1999-77, Computational Linguistics Lab. Department of Computer Science. Trinity College, University of Dublin.
Teaching
- Corpus Linguistics, 2006-2007 (2nd semester)
- Together with Gosse Bouma, Introduction to Natural Language Processing, 2002-2003 Natuurlijke-Taalverwerking
Links
Computational Linguistics
Corpus Linguistics and Statistical NLP
- IMS Stuttgart |Michael Barlow |School of Cognitive and Computing Sciences Susx |CL Sydney University
Dictionaries, Thesauri, etc.
Grammatical frameworks
- HPSG Ohio State | Stanford | Tubingen
- LFG Stanford
- Construction Grammar
- XTAG
- Minimalist Syntax
Publishers
Multi-word expressions
Miscellaneous
- ELDA |elsnet | The Linguist List
- ACL
- Linguistic Exploration by S. Bird
- Python Resources by Michael A. Covington
News
Fun
http://www.let.rug.nl/~begona
Last modified: Sun May 18 09:20:53 CEST 2008