|
|
|
||
The goal of the course is to provide students with the survey of the field of Language Data Resources. Selected types of linguistic annotations will be described, with
emphasis on annotating corpus data and lexical data. Students will gain practice in using software tools for processing such data, especially in the programming language
Python. Leading projects for English, Czech, and some other languages will be used for illustration.
Last update: Vidová Hladká Barbora, doc. Mgr., Ph.D. (25.01.2019)
|
|
||
To pass the course, you need to get at least 50% of the total points from the written test and submit all homework assignments. Your grade is based on the average of your performance; the test and the homework assignments are weighted 1:1. The final grade is assigned according to the following table: 1: ≥ 90% 2: ≥ 70% 3: ≥ 50% 4: < 50% For example, if you get 600 out of 1000 points for homework assignments (60%) and 36 out of 40 points for the test (90%), your total performance is 75% and you get a 2. For details, see https://ufal.mff.cuni.cz/courses/npfl070#grading Last update: Popel Martin, Mgr., Ph.D. (12.06.2019)
|
|
||
Last update: Vidová Hladká Barbora, doc. Mgr., Ph.D. (25.01.2019)
|
|
||
1. Introduction
2. Corpora
3. Treebanks
4. Computer lexicography
5. Other types of language data resources
6. Authors’ rights perspective on building language data resources; licenses Last update: Vidová Hladká Barbora, doc. Mgr., Ph.D. (25.01.2019)
|