Thesis (Selection of subject)Thesis (Selection of subject)(version: 368)
Thesis details
   Login via CAS
An online collaborative platform for the development of empirical grammars
Thesis title in Czech: On-line platforma pro spolupráci na vývoji empirických gramatik
Thesis title in English: An online collaborative platform for the development of empirical grammars
Key words: gramatika; spolupráce na vývoji; webová platforma; HPSG
English key words: grammar; collaborative development; web-based platform; HPSG
Academic year of topic announcement: 2014/2015
Thesis type: diploma thesis
Thesis language: angličtina
Department: Institute of Formal and Applied Linguistics (32-UFAL)
Supervisor: Ing. Alexandr Rosen, Ph.D.
Author: Mgr. Antonio Fernando Garcia Sevilla - assigned and confirmed by the Study Dept.
Date of registration: 09.03.2015
Date of assignment: 14.03.2015
Confirmed by Study dept. on: 15.07.2015
Date and time of defence: 03.02.2016 09:00
Date of electronic submission:21.01.2016
Date of submission of printed version:04.12.2015
Date of proceeded defence: 03.02.2016
Opponents: RNDr. Jiří Hana, Ph.D.
 
 
 
Guidelines
The thesis is based on the assumption that the development of a formal grammar and its integration with data-driven methods and testing environment becomes significantly easier if the grammar developers could use a single platform, unifying available resources and results and allowing for collaborative development. More specifically, the goals of the thesis are twofold:

(1) To develop a web-based, collaborative user-friendly platform for the development of HPSG grammars, where users could build, test and share their grammars, data and results. A typical scenario would include the option of collaborative development within a large project.

(2) To build and implement an HPSG grammar of Spanish of non-trivial coverage, using existing resources as the starting point and the proposed platform as the grammar writing and testing environment.

The platform should allow for extending the rule-based components by data-driven modules to make the system more robust and adaptive, including applications such as dynamic lexicon induction and constraint application weighing.
References
Abeillé, A., Borsley, R. D., and Espinal, M.-T. (2006). The syntax of comparative correlatives in French and Spanish. In Müller, S., editor, The Proceedings of the 13th International Conference on Head-Driven Phrase Structure Grammar, pages 6–26, Stanford. CSLI Publications.

Bildhauer, F. (2007). Representing Information Structure in an HPSG Grammar of Spanish. PhD thesis, Universität Bremen.

Bildhauer, F. (2008). Clitic left dislocation and focus projection in Spanish. In Müller, S., editor, Proceedings of the 15th International Conference on Head-Driven Phrase Structure Grammar, National Institute of Information and Communications Technology, Keihanna, pages 346–357, Stanford, CA. CSLI Publications.

Baldwin, T., Bender, E. M., Flickinger, D., Kim, A., and Oepen, S. (2004). Road-testing the English Resource Grammar over the British National Corpus. In In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pages 2047–2050.

Brew, C. (1995). Stochastic HPSG. In Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, Dublin, Ireland, March 28–31. University College, pages 83–89.

Copestake, A. and Flickinger, D. (2000). An open-source grammar development environment and broad-coverage English grammar using HPSG. In Proceedings of the Second conference on Language Resources and Evaluation (LREC-2000), Athens, Greece.

Crysmann, B., Frank, A., Kiefer, B., Krieger, H.-U., Müller, S., Neumann, G., Piskorski, J., Schäfer, U., Siegel, M., Uszkoreit, H., and Xu, F. (2002). An integrated architecture for shallow and deep processing. In Proceedings of ACL-2002, 40th Anniversary Meeting, Philadelphia, USA. Association for Computational Linguistic, Association for Computational Linguistics.

Gilcub, M. M. and Marimon, M. (2002). Integrating shallow linguistic processing into a unification-based Spanish grammar. In Proceedings of COLING-2002.

Marimon, M., Bel, N., Espeja, S., and Seghezzi, N. (2007). The Spanish Resource Grammar: Pre-processing strategy and lexical acquisition. In Proceedings of the Workshop on Deep Lin- guistic Processing, DeepLP ’07, pages 105–111, Stroudsburg, PA, USA. Association for Computational Linguistics.

Marimon, M. (2010). The Spanish Resource Grammar. In Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., and Tapias, D., editors, Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10), Valletta, Malta. European Language Resources Association (ELRA).

Meza, I. and Pineda, L. (2002). The Spanish auxiliary verb system in HPSG. In Proceedings of CICLing-2002. Springer-Verlag.

Meza, I. and Pineda, L. (2005). Syntax-driven bindings of Spanish clitic pronoun. Procesamiento del Lenguaje Natural, 35.

Pineda, L. and Meza, I. (2000). Una gramática básica del español en HPSG. Technical report, Universidad Nacional Autónoma de México.

Smith, T. C. and Cleary, J. G. (1997). Probabilistic unification grammars. In Australasian Natural Language Processing Workshop, pages 25–32. Macquarie University.

Torruella, M. C. and Antonín, A. M. (2002). Design principles for a Spanish treebank. In 1st Workshop on Treebanks and Linguistic Theories (TLT), Sozopol, Bulgaria.
 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html