SubjectsSubjects(version: 945)
Course, academic year 2023/2024
   Login via CAS
Information Retrieval - NPFL103
Title: Vyhledávání informací
Guaranteed by: Institute of Formal and Applied Linguistics (32-UFAL)
Faculty: Faculty of Mathematics and Physics
Actual: from 2020
Semester: winter
E-Credits: 5
Hours per week, examination: winter s.:2/2, C+Ex [HT]
Capacity: unlimited
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: taught
Language: English, Czech
Teaching methods: full-time
Teaching methods: full-time
Additional information: https://ufal.mff.cuni.cz/courses/npfl103
Note: course can be enrolled in outside the study plan
enabled for web enrollment
Guarantor: doc. RNDr. Pavel Pecina, Ph.D.
Incompatibility : NPFX103
Interchangeability : NPFX103
Is incompatible with: NPFX103
Is interchangeable with: NPFX103
Annotation -
Last update: T_UFAL (13.05.2014)
The course introduces modern algorithms and principles used in the field of information retrieval in large data collections. The students will gain practical knowledge and experience with experimentation and evaluation on real data. A special focus is given to web search.
Course completion requirements -
Last update: doc. RNDr. Pavel Pecina, Ph.D. (04.05.2022)

Both the course credit and exam are required to complete the course.

The course credit will be given after completing two homework assignments.

The final grade will be based on the results of the exam and homework.

Literature -
Last update: doc. RNDr. Pavel Pecina, Ph.D. (04.05.2022)

Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008. ISBN 0521865719.

Requirements to the exam -
Last update: doc. RNDr. Pavel Pecina, Ph.D. (04.05.2022)

The exam is in a form of a written test with approximately 20 short-answer questions covered by the topics discussed during the course.

Syllabus -
Last update: doc. RNDr. Pavel Pecina, Ph.D. (04.05.2022)

Introduction, basic concepts and principles

Boolean retrieval

Indexing

Vector space model

Evaluation in information retrieval

Query expansion

Probabilistic information retrieval

Language models for information retrieval

Text classification

Clustering

Web search

Near-duplicate detection

 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html