SubjectsSubjects(version: 945)
Course, academic year 2023/2024
   Login via CAS
Genome-oriented applications in plant evolution - MB120C54
Title: Genome-oriented applications in plant evolution
Czech title: Genomické metody pro studium evoluce rostlin
Guaranteed by: Department of Botany (31-120)
Faculty: Faculty of Science
Actual: from 2022
Semester: winter
E-Credits: 3
Examination process: winter s.:
Hours per week, examination: winter s.:0/2, C [HT]
Capacity: 15
Min. number of students: unlimited
4EU+: no
Virtual mobility / capacity: no
State of the course: taught
Language: English
Note: enabled for web enrollment
Guarantor: Levi Jeremiah Yant, Dr. rer. nat.
Teacher(s): Levi Jeremiah Yant, Dr. rer. nat.
Annotation
Last update: Mgr. Michal Štefánek (17.01.2022)
Annotation:
This is a practical, plant genomics-based computational course taking students from beginning command line scripting and basic HPC cluster computing to pipeline reproducibility and container management, using real data from current research.

Participants will develop basic command line skills in Unix to manipulate data types commonly used in genomic studies. They will learn how to run scripts from others as well as write and develop their own. The module will cover Unix, HPC cluster computing, reproducibility, and basic python scripting and provide students with the tools to develop their own code and pipelines. Participants will work within interactive coding environments to maintain reproducibility in code and practice. By the end of the module, students will be equipped with the practical skills to write simple code to address diverse biological problems.

Learning Aims:
This module aims to introduce fundamental concepts of programming and data manipulation to students without prior experience of the command line. Participants will learn how to write useful, simple code in an interactive environment. Sessions will introduce fundamental programming concepts and approaches. Participants will then have sessions of directed learning to practice and develop the skills required to complete a series of programming challenges.

Learning Outcomes:
• Competence in the Unix environment, command line operations and scripting.
• Knowledge of information sources and guidance for solving common problems in computational biology.
• The ability to install and run scripts within an appropriate environment and install necessary co-requisites.
• Knowledge of how to packages to solve common problems in programming.
• Understanding how to use environments (e.g. Anaconda-based) and package managers for code portability.
• Knowledge of typical biological data file formats and ability to write new files or edit existing files in place.
• The ability to write simple scripts, employ the concepts of class-oriented programming and develop simple applications to reproducibly solve common problems.

The core of the work is thus hands-on practical experience with analysis of empirical data, supervised by Professor Yant - the format is approximating real work on own data generated during their independent research work (e.g. during a Masters or PhD project). Sample datasets will be provided, an analysis of own data in the project work is possible and welcomed but not required. Previous experience with scripting is welcome but not required.

Every student selects a topic and will run a project: i.e. will process a provided sample/own dataset using the presented tools and share their results with others during a short presentation. Credits will be given for the presentations and simple programming challenges.

This course will be held in English.
Literature
Last update: Mgr. Michal Štefánek (17.01.2022)

Literature

Bradnam K & Korf I (2012) UNIX and Perl to the Rescue!: A Field Guide for the Life Sciences (and Other Data-rich Pursuits) 1st edition. Cambridge University Press.

Saetre GP & Ravinet M (2019): Evolutionary Genetics. 1st edition. Oxford University Press.

For course materials please see  https://botany.natur.cuni.cz/TO_BE_GENERATED

Requirements to the exam
Last update: Mgr. Michal Štefánek (17.01.2022)

Requirements to the exam

- individual and group-based practical challenges (in class and take-home)

Syllabus
Last update: Mgr. Michal Štefánek (17.01.2022)

Syllabus:

* Preliminary schedule:

Week 1 Introduction to Unix

Week 3 Common Data Types & Tools Part 1: common genomic data types and tools.

Week 5 Common Data Types & Tools Part 2: HPC computing, package managers, containers, and pipelines

Week 7 Python Basics

Week 9 In-class Presentations and assessment

Week 11 Code and pipeline reproducibility

(2-3 hour blocks every two weeks for better immersion.)

 
Charles University | Information system of Charles University | http://www.cuni.cz/UKEN-329.html