Morphosyntactic Annotation of Torwali
Thesis title in Czech: | Morfosyntaktická anotace torwalštiny |
---|---|
Thesis title in English: | Morphosyntactic Annotation of Torwali |
Keywords (Czech): | treebank, universal dependencies, low-resource languages |
Keywords (English): | treebank, universal dependencies, low-resource languages |
Academic year of topic announcement: | 2023/2024 |
Thesis type: | diploma thesis |
Thesis language: | |
Department: | Institute of Formal and Applied Linguistics (32-UFAL) |
Supervisor / advisor: | doc. RNDr. Daniel Zeman, Ph.D. |
Author: | hidden |
Date of registration: | 20.05.2024 |
Date of assignment: | 20.05.2024 |
Date confirmed by the Study Department: | 21.05.2024 |
Guidelines |
The goal of the thesis is to create a small dependency treebank of Torwali, an Indo-Aryan language with very few pre-existing digital resources. The treebank will follow the Universal Dependencies standard; however, certain language-specific guidelines for Torwali will have to be specified. For the morphological part of the annotation, a finite-state morphological analyzer and lexicon will be prepared. The annotation of the sample can be supported by known techniques for low-resource languages, such as parser bootstrapping and multilingual or cross-lingual parsing. |
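As an illustration of the Universal Dependencies format that the treebank would follow, the minimal Python sketch below reads one sentence in CoNLL-U (ten tab-separated columns per token) and runs two basic sanity checks on the dependency tree. The function names `parse_sentence` and `check_tree` are hypothetical and not part of the assignment, and the sample sentence is English rather than Torwali, since no Torwali data is given here.

```python
# Sketch only: a tiny CoNLL-U reader with basic tree checks.
# Function names and the English sample are illustrative assumptions.

# The ten CoNLL-U columns, in order.
CONLLU_FIELDS = ["id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc"]


def parse_sentence(block):
    """Parse one CoNLL-U sentence block into a list of token dicts."""
    tokens = []
    for line in block.strip().splitlines():
        if line.startswith("#"):  # sentence-level comment line
            continue
        cols = line.split("\t")
        if len(cols) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        tok = dict(zip(CONLLU_FIELDS, cols))
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if not tok["id"].isdigit():
            continue
        tok["id"] = int(tok["id"])
        # An unannotated head is written "_"; keep it as None.
        tok["head"] = int(tok["head"]) if tok["head"].isdigit() else None
        tokens.append(tok)
    return tokens


def check_tree(tokens):
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    ids = {t["id"] for t in tokens}
    roots = [t for t in tokens if t["head"] == 0]
    if len(roots) != 1:
        problems.append(f"expected exactly one root, found {len(roots)}")
    for t in tokens:
        if t["head"] is None:
            problems.append(f"token {t['id']} has no head")
        elif t["head"] != 0 and t["head"] not in ids:
            problems.append(f"token {t['id']} points to missing head {t['head']}")
        # In UD, the relation 'root' is used exactly for the token with head 0.
        if (t["head"] == 0) != (t["deprel"] == "root"):
            problems.append(f"token {t['id']}: head 0 and deprel 'root' must co-occur")
    return problems


# English placeholder sentence (not Torwali data).
SAMPLE = "\n".join([
    "# text = Dogs bark.",
    "1\tDogs\tdog\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_",
    "2\tbark\tbark\tVERB\t_\t_\t0\troot\t_\tSpaceAfter=No",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

if __name__ == "__main__":
    for problem in check_tree(parse_sentence(SAMPLE)):
        print(problem)
```

These checks cover only a small part of what the official UD validation tooling verifies; they are meant as a quick filter during manual annotation, not as a replacement for it.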
References |
Marie-Catherine de Marneffe, Christopher Manning, Joakim Nivre, Daniel Zeman (2021): Universal Dependencies. In: Computational Linguistics, ISSN 1530-9312, vol. 47, no. 2, pp. 255-308
Kenneth R. Beesley, Lauri Karttunen (2003): Finite State Morphology. CSLI Publications
Daniel Zeman, Philip Resnik (2008): Cross-Language Parser Adaptation between Related Languages. In: IJCNLP 2008 Workshop on NLP for Less Privileged Languages, pp. 35-42, International Institute of Information Technology, Hyderabad, India
Željko Agić, Dirk Hovy, Anders Søgaard (2015): If all you have is a bit of the Bible: Learning POS taggers for truly low-resource languages. In: The 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2015)
Universal Dependencies v2 guidelines (2014-2018): http://universaldependencies.org/ |