Course, academic year 2023/2024
Introduction to Language Technologies - NPFL125
Title (Czech): Základy jazykových technologií
Guaranteed by: Institute of Formal and Applied Linguistics (32-UFAL)
Faculty: Faculty of Mathematics and Physics
Valid: from 2021
Semester: winter
E-Credits: 3
Hours per week, examination: winter s.:0/2, MC [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: taught
Language: Czech, English
Teaching methods: full-time
Additional information: https://ufal.mff.cuni.cz/courses/npfl125
Guarantor: doc. Ing. Zdeněk Žabokrtský, Ph.D.
Mgr. Rudolf Rosa, Ph.D.
Incompatibility: NPFL092
Interchangeability: NPFL092
Is incompatible with: NPFL092
Is interchangeable with: NPFL092
Annotation -
Last update: doc. Mgr. Barbora Vidová Hladká, Ph.D. (03.05.2019)
Students will acquire hands-on experience with basic software tools that are useful for Natural Language Processing in UNIX systems.
Course completion requirements -
Last update: doc. Ing. Zdeněk Žabokrtský, Ph.D. (12.06.2019)

To pass the course, you need to submit homework assignments and pass a written test. A brief overview follows; detailed information on grading can be found at https://ufal.mff.cuni.cz/courses/npfl092#grading

Homework assignments

  • Assignments will be set in class and specified on the website.
  • To pass the course, you need to get at least 50% of the total achievable points for the assignments.
  • If you miss the deadline, you can still submit by a second deadline two weeks later, but your points for the assignment will be multiplied by 0.5; after the second deadline, you get 0 points.

Test

  • There will be a written test at the end of the semester.
  • To pass the course, you need to get at least 50% of the total points from the test.

Grading

  • Your grade is based on the average of your test and homework assignment percentages, weighted 1:1.
  • ≥ 90%: grade 1 (excellent)
  • ≥ 70%: grade 2 (very good)
  • ≥ 50%: grade 3 (good)
  • < 50%: grade 4 (fail)
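
The rules above can be sketched as a short Python calculation. This is only an illustration, not the official grading script: the function names are hypothetical, percentages are assumed to be on a 0-100 scale, the second deadline is assumed to fall exactly 14 days after the first, and a student who misses the 50% minimum in either component is assumed to receive grade 4 regardless of the average.

```python
def assignment_points(raw_points, days_late):
    """Points credited for one assignment (hypothetical helper).

    Assumes the second deadline is exactly 14 days after the first.
    """
    if days_late <= 0:
        return raw_points          # submitted by the first deadline
    if days_late <= 14:
        return raw_points * 0.5    # submitted by the second deadline
    return 0.0                     # submitted after the second deadline


def final_grade(homework_pct, test_pct):
    """Map homework and test percentages (0-100) to a grade from 1 to 4.

    Assumes that missing the 50% minimum in either part means a fail.
    """
    if homework_pct < 50 or test_pct < 50:
        return 4
    average = (homework_pct + test_pct) / 2   # 1:1 weighting
    if average >= 90:
        return 1
    if average >= 70:
        return 2
    return 3


if __name__ == "__main__":
    print(assignment_points(10, 7))
    print(final_grade(80, 70))
```

For instance, under these assumptions a student with 100% on the homework but 45% on the test would fail, even though the plain average is above 70%.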
Literature -
Last update: doc. Mgr. Barbora Vidová Hladká, Ph.D. (25.01.2018)

Mendel Cooper: Advanced Bash-Scripting Guide, http://tldp.org/LDP/abs/html/

Allen B. Downey: Think Python 2e, O'Reilly Media, http://greenteapress.com/wp/think-python-2e/

Steven Bird, Ewan Klein, and Edward Loper: Natural Language Processing with Python, O'Reilly Media, 2015, http://www.nltk.org/book/

Dive into NLTK: http://textminingonline.com/dive-into-nltk-part-i-getting-started-with-nltk

Syllabus -
Last update: doc. Mgr. Barbora Vidová Hladká, Ph.D. (25.01.2018)

Survival in Linux, Bash command line and scripting, text-processing commands.

Introduction to Python, text processing, regular expressions.

Representing linguistic structures in XML and JSON, processing XML and JSON in Python.

Selected Natural Language Processing tools, processing morphologically and syntactically annotated data, visualization, search.

 