Thesis (Selection of subject)

Enrich named entities annotated in the Prague Dependency Treebank with information automatically extracted from Wikipedia and other sources

Thesis title in Czech:	Obohatit pojmenované entity anotované v Pražském závislostním korpusu o automaticky extrahované informace z Wikipedie a dalších zdrojů
Thesis title in English:	Enrich named entities annotated in the Prague Dependency Treebank with information automatically extracted from Wikipedia and other sources
Key words:	pojmenované entity, information retrieval, wikipedie
English key words:	Named Entities, Information Retrieval, Wikipedia
Academic year of topic announcement:	2015/2016
Thesis type:	Bachelor's thesis
Thesis language:
Department:	Institute of Formal and Applied Linguistics (32-UFAL)
Supervisor:	Mgr. Bc. Pavel Straňák, Ph.D.
Author:

Guidelines

Enrich the named entities that are part of the Prague Dependency Treebank 2.5 data with automatically extracted glosses and add them to a dictionary.

- analyze what can be obtained for which entities using regular expressions (see Feng et al.)
- carry out and evaluate experiments at least for the most promising type of named entities, e.g. "personal names" or "locations"
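
As an illustration of the regular-expression approach mentioned in the first point, the following is a minimal sketch, not the method of Feng et al. and not a prescribed solution. It assumes that the gloss is taken from the opening sentence of an encyclopedia article and that the sentence follows the English pattern "NAME (optional parenthetical) is/was a/an/the GLOSS." The pattern, the function name `extract_gloss`, and the example sentences are all hypothetical; a real system would need language-specific patterns (for example for Czech) and handling of abbreviations.

```python
import re

# Hypothetical pattern for definitional opening sentences such as
# "NAME (dates) was a GLOSS." It is an assumption for illustration only.
GLOSS_PATTERN = re.compile(
    r"^(?P<name>[^()]+?)\s*"           # entity name, up to an optional parenthesis
    r"(?:\((?P<paren>[^)]*)\)\s*)?"    # optional parenthetical, e.g. life dates
    r"\b(?:is|was)\s+"                 # copula as a whole word
    r"(?P<gloss>(?:an?|the)\s+[^.]+)"  # gloss: article plus text up to the first period
    r"\."
)


def extract_gloss(sentence):
    """Return (name, gloss) if the sentence fits the pattern, else None."""
    match = GLOSS_PATTERN.match(sentence.strip())
    if match is None:
        return None
    return match.group("name").strip(), match.group("gloss").strip()


if __name__ == "__main__":
    examples = [
        "Karel Čapek (1890-1938) was a Czech writer, playwright and critic.",
        "Paris is the capital and largest city of France.",
        "He went home.",
    ]
    for sentence in examples:
        print(extract_gloss(sentence))
```

Because the gloss stops at the first period, sentences containing abbreviations such as "U.S." would be truncated; this is one of the cases the analysis would have to quantify per entity type.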

References

D. Feng, D. Ravichandran, and E. H. Hovy. Mining and re-ranking for answering biographical queries on the web. In Proceedings of the conference of the American Association of Artificial Intelligence (AAAI-06), Boston, MA, 2006.