Course, academic year 2023/2024
Dependency Grammars and Treebanks - NPFX075
Czech title: Závislostní gramatiky a korpusy
Guaranteed by: Student Affairs Department (32-STUD)
Faculty: Faculty of Mathematics and Physics
Valid: from 2022
Semester: summer
E-Credits: 6
Hours per week, examination: summer s.:2/2, C+Ex [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: taught
Language: Czech
Teaching methods: full-time
Is provided by: NPFL075
Guarantor: doc. RNDr. Markéta Lopatková, Ph.D.
Class: Informatics (Master's) - Mathematical Linguistics
Classification: Informatics > Computer and Formal Linguistics
Pre-requisite: {NXXX011, NXXX012, NXXX013, NXXX070, NXXX071}
Incompatibility: NPFL075
Interchangeability: NPFL075
Is incompatible with: NPFL075
Is interchangeable with: NPFL075
Annotation -
Last update: doc. Mgr. Barbora Vidová Hladká, Ph.D. (19.05.2020)
The goal of the course is to introduce the dependency-based description of natural languages, the principles of dependency-based grammar formalisms, and their application in morphologically and syntactically annotated corpora. The course focuses on the Prague Dependency Treebank project and the Universal Dependencies project. Emphasis is also placed on annotation schemata and data formats, and on practical work with treebanks and related tools. The course is designed for students with a computer science background as well as for linguists with some CS experience.
Course completion requirements -
Last update: RNDr. Jiří Mírovský, Ph.D. (10.05.2022)

To pass the course, you are required to participate actively in the classes and to submit all of the homework tasks. The quality of your homework solutions determines your grade. More detailed information on grading can be found on the course webpage: https://ufal.mff.cuni.cz/courses/npfl075

Literature -
Last update: RNDr. Jiří Mírovský, Ph.D. (10.05.2022)
  • Hajič J., Hajičová E., Mikulová M., Mírovský J.: Prague Dependency Treebank. In N. Ide & J. Pustejovsky (eds.): Handbook of Linguistic Annotation. Dordrecht 2017, pp. 555-594.
  • PDT Guide - http://ufal.mff.cuni.cz/pdt2.0/
  • PDT documentation - https://ufal.mff.cuni.cz/pdt3.5/documentation
  • Nivre, J., de Marneffe, M.C., Ginter, F., Goldberg, Y., Hajič, J., Manning, C., McDonald, R., Petrov, S., Pyysalo, S., Silveira, N., Tsarfaty, R., Zeman, D. (2016) Universal Dependencies v1: A Multilingual Treebank Collection. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), ELRA, Paris, pp. 1659-1666.
  • Universal Dependencies - http://universaldependencies.org/
  • Zeman, D.: The World of Tokens, Tags and Trees. Studies in Computational and Theoretical Linguistics, vol. 19. ÚFAL, Praha, 2018, ISBN 978-80-88132-09-7.
  • Partee, B. H., ter Meulen, A., Wall, R.E.: Mathematical Methods in Linguistics. Studies in Linguistics and Philosophy, vol. 30, Kluwer Academic Publishers, Dordrecht, Boston, London, 1990 (chapter 16)

Syllabus -
Last update: doc. Mgr. Barbora Vidová Hladká, Ph.D. (29.01.2019)
  1. Trees as a data structure (dependency trees, phrase-based trees, word order, projectivity)
  2. Dependency and non-dependency relations in natural languages
  3. Family of Prague Dependency Treebanks - introduction and principles; Functional Generative Description as their theoretical basis
  4. Universal Dependencies - introduction and principles
  5. Stratificational approach to natural language description:
    • morphology and its annotation in dependency treebanks
    • (surface) syntax and its annotation in dependency treebanks
    • (deep) syntax and its annotation in dependency treebanks
  6. Annotation of selected deep syntactic phenomena
  7. Annotation schemata, data formats
  8. Tools (TrEd, PML-TQ, Udapi)
 