NLP Applications - NPFX093

Title:	Aplikace NLP
Guaranteed by:	Student Affairs Department (32-STUD)
Faculty:	Faculty of Mathematics and Physics
Valid:	from 2022
Semester:	summer
E-Credits:	5
Hours per week, examination:	summer s.:2/1, MC [HT]
Capacity:	unlimited
Min. number of students:	unlimited
4EU+:	no
Virtual mobility / capacity:	no
State of the course:	taught
Language:	Czech
Teaching methods:	full-time
Is provided by:	NPFL093

Guarantor:	doc. RNDr. Vladislav Kuboň, Ph.D.
Class:	Informatics (Master's) - Mathematical Linguistics
Classification:	Informatics > Computer and Formal Linguistics
Pre-requisite :	{NXXX011, NXXX012, NXXX013, NXXX070, NXXX071}
Incompatibility :	NPFL093
Interchangeability :	NPFL093

Annotation

Last update: T_UFAL (10.05.2010)

The main goal of the course is to introduce the basic types of natural language processing (NLP) applications and to give students a chance to work with some of them in seminars. The course covers machine translation, machine-aided human translation tools, localization tools, information retrieval and extraction, question answering, speech recognition, spelling and grammar checking, text generation, and related applications.

Course completion requirements

Last update: doc. RNDr. Vladislav Kuboň, Ph.D. (22.04.2020)

The course requires continuous work from students in the form of reports on the topics of individual lectures. A report is required even if a student misses a lecture; in that case, he or she submits a general report on the topic of the missed lecture. Attendance is strongly recommended. After submitting all reports, the student receives a grade based on their quality.

Literature

Last update: T_UFAL (10.05.2010)

Handbook of Natural Language Processing, eds. N. Indurkhya and F. Damerau, CRC Press, 2010.

Foundations of Statistical Natural Language Processing, C. Manning and H. Schütze, MIT Press, 1999.

Syllabus

Last update: T_UFAL (10.05.2010)

1. Introduction - an overview of basic application components.

2. Spelling checker

Dictionary-based methods vs. detection of illegal character combinations, string similarity metrics, communication with the user.

3. Grammar checking

Error patterns vs. syntactic analysis, types of detectable errors, attitude towards the user, RFODG and LanGR.

4. Machine-assisted human translation

Translation memory and its variants in commercial products, controlled language, glossary hierarchies.

5. Machine Translation

Google Translate vs. rule-based commercial systems (Systran, PC Translator), quality evaluation methods, evaluation of translation competitions, the EuroMatrix project.

6. Localization

Differences between translation and localization, commercial localization tools.

7. Text generation

Text generation from the tectogrammatical layer.

8. Information retrieval and extraction

Basic models, evaluation metrics, text similarity metrics, lemmatization, stop words, the role of linguistic tools, the Malach project.

9. Question answering

Dialog systems, multimodal communication.

10. Speech synthesis and recognition

Basic problems and algorithms.

11. Semantic web

Exploitation of linguistic methods for searching for information on the web, the role of the tectogrammatical layer.