Course, academic year 2023/2024
Work with Language Corpora - ADE120020
Title: Práce s jazykovými korpusy
Guaranteed by: Institute of Germanic Studies (21-UGS)
Faculty: Faculty of Arts
Actual: from 2023
Semester: winter
Points: 0
E-Credits: 3
Examination process: winter s.:
Hours per week, examination: winter s.:0/2, C [HT]
Capacity: unknown / unlimited (unknown)
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
Key competences:  
State of the course: taught
Language: Czech
Teaching methods: full-time
Level:  
Note: course can be enrolled in outside the study plan
enabled for web enrollment
Guarantor: Mgr. Věra Hejhalová, Ph.D.
Teacher(s): Mgr. Věra Hejhalová, Ph.D.
Annotation -
Last update: Mgr. Věra Hejhalová, Ph.D. (04.10.2023)
Working with Language Corpora is a course in which Bachelor's students become acquainted with language corpora and the ways they can be used in linguistic practice. The course focuses on practical work and prepares students for the Seminar in Corpus Linguistics, which follows in the follow-up Master's (NMgr.) programme. Besides introducing the basic types and properties of corpora, it stresses their practical applicability in a linguist's everyday work. The main focus is on corpora of Czech (CNK) and German (DeReKo and DWDS), including the associated applications: Kookkurrenzanalyse, CCDB and SOM for German, and Treq, SyD, Morfio, WaG, etc. for Czech. With regard to usability, the course takes into account students' needs in their research and in writing seminar papers and theses in other disciplines.
Aim of the course -
Last update: Mgr. Věra Hejhalová, Ph.D. (04.10.2023)

A basic introduction to working with language corpora, so that students understand how corpora function and can use them in practice in their further studies.

Course completion requirements -
Last update: Mgr. Věra Hejhalová, Ph.D. (04.01.2024)

1) No more than 3 absences

2) Continuous preparation for class

3) Active participation in solving assignments

4) Either preparation of one assigned publication for the needs of InterCorp, or a test of practical corpus skills in the last week of the semester

MODIFIED ARRANGEMENTS FOR THE EXAM PERIOD OF THE WINTER SEMESTER (ZS) 2023/24: Students complete the credit assignment online on the date for which they registered in SIS. While writing the credit test, they can consult the instructor synchronously online via MS Teams with any questions.

Literature -
Last update: Mgr. Věra Hejhalová, Ph.D. (04.10.2023)

Recommended reading:

BUBENHOFER, Noah: Einführung in die Korpuslinguistik: Praktische Grundlagen und Werkzeuge. – Online-Kurs

ČERMÁK, F. et al.: Studie z korpusové lingvistiky, úvod, edice a překlad, Karolinum 2000.

ČERMÁK, F./BLATNÁ, R. (ed.): Studie z korpusové lingvistiky 1, Korpusová lingvistika, stav a modelové přístupy, NLN Praha 2006.

ČERMÁK, F./BLATNÁ, R.: Jak využívat Český národní korpus, NLN Praha 2005.

ČERMÁK, F./ŠULC, M.(ed.): Studie z korpusové lingvistiky 2, Kolokace, NLN Praha 2006.

ČERMÁK, F.: Korpusová lingvistika Praha 2011. 1st ed. Praha: NLN, Nakladatelství Lidové noviny, 2011, 372 pp. ISBN 9788074221149.

ČERMÁK, František. Korpusová lingvistika Praha 2011. 1st ed. Praha: NLN, Nakladatelství Lidové noviny, 2011, 323 pp. ISBN 9788074221156.

KOCEK, J./KOPŘIVOVÁ, M./KUČERA, K. (ed.): Český národní korpus. Úvod a příručka uživatele, Praha 2000.

KOPŘIVOVÁ, Marie: Jak využívat Český národní korpus, Praha 2005.

LEMNITZER, Lothar and Heike ZINSMEISTER. Korpuslinguistik. Eine Einführung. Tübingen: Gunter Narr, 2006, 220 pp. ISBN 3-8233-6210-0.

LÜDELING, Anke / WALTER, Maik: Korpuslinguistik für Deutsch als Fremdsprache. Sprachvermittlung und Spracherwerbsforschung. (PDF, available at https://www.linguistik.hu-berlin.de/de/institut/professuren/korpuslinguistik/mitarbeiter-innen/anke/pdf/LuedelingWalterDaF.pdf)

LÜDELING, Anke and Merja KYTÖ. Corpus linguistics: an international handbook. Berlin: Walter de Gruyter, c2008, xvii, 776 pp. ISBN 9783110180435.

LÜDELING, Anke and Merja KYTÖ. Corpus linguistics: an international handbook. Berlin: Walter de Gruyter, c2008, xvii, pp. 778-1353. ISBN 9783110207330.

PERKUHN, Rainer and Cyril BELICA. Korpuslinguistik - das unbekannte Wesen oder Mythen über Korpora und Korpuslinguistik. In: Sprachreport 1/2006, pp. 2-8. Mannheim: 2006.

PERKUHN, Rainer, Holger KEIBEL and Marc KUPIETZ. Korpuslinguistik. Paderborn: Wilhelm Fink, 2012, 144 pp. ISBN 9783825234331.

PETKEVIČ, Vladimír and Alexandr ROSEN. Korpusová lingvistika Praha 2011. 1st ed. Praha: Nakladatelství Lidové noviny, 2011, 225 pp. ISBN 9788074221163.

ŠEBESTA, K./ŠORMOVÁ, K. et al.: Korpusy v jazykovém vyučování. Praha: FF UK, 2019, free to download at https://e-shop.ff.cuni.cz/e_knihy/e_publikace/korpusy_v_jazykovem_vyucovani_pdf_-1761

ŠEBESTA, K./ŠORMOVÁ, K. et al.: Korpusy v jazykovém vyučování II. Praha: FF UK, 2019, free to download at https://e-shop.ff.cuni.cz/e_knihy/e_publikace/korpusy_v_jazykovem_vyucovani_ii_pdf_-1908

ŠULC, Michal: Korpusová lingvistika. První vstup, Praha 1999.

 

Internet links:

www.ids-mannheim.de

https://cosmas2.ids-mannheim.de/cosmas2-web/

http://corpora.ids-mannheim.de/ccdb/

http://ucnk.ff.cuni.cz

https://www.korpus.cz/

 

Other literature:

ABEL, Andrea/Renata ZANIN: Korpora in Lehre und Forschung. Bozen-Bolzano University Press, Bozen 2011, ISBN 978-88-6046-040-0

BELICA, Cyril and Kathrin STEYER. Korpusanalytische Zugänge zu sprachlichem Usus. In: Beiträge zur bilingualen Lexikographie, M. Vachková (Hg.), AUC (Acta Universitatis Carolinae), GERMANISTICA PRAGENSIA XX, Praha: Karolinum, 2008.

BUHOFER, Annelies and Gertrud GRÉCIANO (eds.): Flut von Texten - Vielfalt der Kulturen. Ascona 2001 zur Methodologie und Kulturspezifik der Phraseologie. (=Phraseologie und Parömiologie 14). Baltmannsweiler, S. 33-46. 2003.

KÁŇA, Tomáš and Hana PELOUŠKOVÁ (eds.): Deutsch und Tschechisch im Vergleich – korpusbasierte linguistische Studien, Brno: Masarykova univerzita, 2009.

KOPŘIVOVÁ, Marie: Využití korpusu při zpracování frazeologie ve výkladovém slovníku. In: Frazeografia słowiańska. 1st ed. Opole: Wydawnictwo Uniwersytetu Opolskiego, 2001.

KRATOCHVÍLOVÁ, Iva and Norbert Richard WOLF (eds.): Kompendium Korpuslinguistik: eine Bestandsaufnahme aus deutsch-tschechischer Perspektive, Heidelberg, Winter, 2010.

KRATOCHVÍLOVÁ, Iva. Zum aktuellen Thema: Die Korpuslinguistik im deutsch-tschechischen Sprachvergleich. In: Acta Facultatis Ostraviensis, Studia Germanistica, Ostrava, Ostravská univerzita v Ostravě, 2006, S. 35–45.

MOON, Rosamund: Fixed Expressions and Idioms in English: A Corpus-Based Approach. Oxford Studies In Lexicography & Lexicology. Oxford University Press, 1998.

PELOUŠKOVÁ, Hana and Tomáš KÁŇA: Elektronische Korpora in Tschechien und das tschechisch-deutsche Parallelkorpus. In: Planning, Gluing and Painting Corpora, B. Kettemann - G. Marko (eds.), Frankfurt am Main, Peter Lang GmbH, 2006, S. 27–46.

STEYER, Kathrin: Korpus, Statistik, Kookkurrenz. Lässt sich Idiomatisches "berechnen"? In: Burger, Harald/Häcki Buhofer, Annelies/Gréciano, Gertrud (eds.): Flut von Texten - Vielfalt der Kulturen. Ascona 2001 zur Methodologie und Kulturspezifik der Phraseologie. (=Phraseologie und Parömiologie 14). Baltmannsweiler, S. 33-46. 2003.

Teaching methods -
Last update: Mgr. Věra Hejhalová, Ph.D. (04.10.2023)

exercises, seminar, joint work, discussion

Syllabus -
Last update: Mgr. Věra Hejhalová, Ph.D. (04.10.2023)

1. What is a corpus? How is it built? What is it used for, and how can it be used? Basic typology of corpora. Czech and German corpora (DeReKo, CNK, InterCorp), introduced through a presentation and joint work. Setting up the conditions for working with corpora (laptops, internet access, registration for COSMAS II and CNK). Searching for simple expressions and brainstorming together about the possible uses of corpora, their limits, and the interpretation of results.

2. Text processing in corpora (presentation), a brief history of the discipline, basic terminology (token, type, lemma, parser, disambiguation, corpus vs. corpus manager, concordance, etc.). Basic search workflow in COSMAS II: archive selection, corpus selection, formulating the simplest queries.

3. DeReKo (IDS Mannheim) and COSMAS II: basic functions of the corpus search engine; practising basic searches in COSMAS II, regular characters, settings (Optionen)

4. Advanced search and its practice in COSMAS II; multi-word phrases

5. Extended functions from the COSMAS II menu; working with tagged corpora

6. Cooccurrence analysis: how to enter queries and evaluate the generated data; practising CQL (Corpus Query Language); phrases and collocations as a special task for corpus linguistics

7. Associated applications - CCDB, SOM, etc.; DWDS

8. Czech National Corpus - basic information, basic search

10. Czech National Corpus - practice searching, explanation of extended functions

11. InterCorp

12. SyD, Morfio, Treq, KWords, WaG

13. Final discussion.
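
The basic terms from sessions 2 and 6 (token, type, concordance, cooccurrence) can be illustrated with a few lines of Python. This is only a minimal sketch under stated assumptions: the tokenizer is deliberately naive, the sample sentences are invented for illustration, and the function names (tokenize, kwic, cooccurrences) are hypothetical. It does not reproduce how COSMAS II or the CNK tools work internally; real corpus managers operate on lemmatized, tagged data and rank cooccurrences with statistical measures rather than raw counts.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase word tokens (a deliberately naive tokenizer)."""
    return re.findall(r"\w+", text.lower())


def kwic(tokens, keyword, width=3):
    """Return keyword-in-context lines: up to `width` tokens on each side of a hit."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


def cooccurrences(tokens, keyword, window=2):
    """Count the tokens found within `window` positions of each keyword hit."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == keyword:
            context = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
            counts.update(context)
    return counts


# Invented sample text, not taken from any real corpus.
text = "Der Hund bellt. Der Hund schläft. Die Katze schläft."
tokens = tokenize(text)   # every running word form counts as a token
types = set(tokens)       # distinct word forms are the types

print(len(tokens), "tokens,", len(types), "types")
for line in kwic(tokens, "hund", width=2):
    print(line)
print(cooccurrences(tokens, "hund", window=1).most_common())
```

Lemmatization is left out on purpose, because mapping "schläft" to "schlafen" requires a morphological analyser; this is one reason why tagged corpora and a corpus manager are used in class instead of ad hoc scripts.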

Entry requirements -
Last update: Mgr. Věra Hejhalová, Ph.D. (04.10.2023)

The course is conducted partly in Czech and partly in German and works with Czech and German corpora. Enrolled students should have at least B1-level German according to the CEFR.

 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html