Bioinformatics

15 credits

Syllabus, Master's level, 3MR103

A revised version of the syllabus is available.

Code: 3MR103
Education cycle: Second cycle
Main field(s) of study and in-depth level: Medical Science A1F
Grading system: Fail (U), Pass (G)
Finalised by: The Master Programmes Board of the Faculty of Medicine, 22 August 2018
Responsible department: Department of Medical Biochemistry and Microbiology

General provisions

The course is offered as part of the Master's programs in the Faculty of Medicine.

Entry requirements

Admitted to the Master Programme in Medical Research.

For admission to freestanding course, 180 credits are required in life sciences as the main field of study (e.g. biomedicine, biotechnology, medicine, veterinary medicine or similar).

Knowledge in English equivalent to that required for basic eligibility to Swedish higher education on basic level.

Learning outcomes

On completion of the course, the student should be able to:

Work in a UNIX/LINUX operating system, including manipulation of files and directories, working with text files, performing basic system administration tasks, installing bioinformatics software/tools, writing shell scripts, manage jobs on desktop computers and servers. Understand how to develop UNIX/LINUX skills.

Use basic scripting in Perl/Python for efficient handling of large biological datasets, including writing and understanding scripts, how to store data in data structures, how to process data using loops, functions and built-in operators, how to sort data, use regular expressions (pattern matching), references and multi-dimensional data structures. Understand how to develop scripting skills.

Perform standard analyses of Next Generation Sequencing data, including variant calling, RNAseq, de novo assembly. Understanding of NGS platforms including advantages and limitations. Use of NGS data files and formats. Understand and design NGS workflow steps from raw data. Perform quality control, mapping, visualisation, and downstream analysis. Use relevant bioinformatics software and tools for analysis of NGS data understand advantages and limitations of each tool. Deposit and retrieve NGS data from public databases (e.g. NCBI).

Use of R for statistical data analysis, including data import/export, summary statistics, graphics, statistical testing, and installing packages. Understand how to develop skills in R.

Perform standard linkage/association (QTL/GWAS) analyses. Be able to use common analysis software and create required input data files and formats using scripting. Understand the underlying modeling assumptions of the most commonly used analysis approaches. Interpret obtained results and understand the advantages and limitations of linkage vs association analysis to identify candidate genes for Mendelian and complex traits.

Bioinformatic functional prediction based on non-synonymous amino-acid substitutions. Deleteriousness and conservation scores. Variant annotation and effect prediction. Understanding of experiments involved in ENCODE project to determine genome function (i.e. transcription factor bind sites, methylation, chromatin structure) and comparative genomics to determine genome function and how to incorporate these into data analysis.

Gain an understanding for metabolomics and proteomics data analysis.

Content

The course utilises current research problems to illustrate different statistical and bioinformatics data analysis methods used for genomics data and their applications in studies of human genetics, model organism biology and natural variation and evolution. Data that are analysed include those from: large scale genetic polymorphism data from next generation sequencing and SNP-chip genotyping, RNAsequencing, genotype to phenotype associations, and functional prediction from sequence data. Students will gain proficiency in the entire data analysis process from installation of software to efficient summarization of results using advanced graphics. The course gives insight in the central role of statistical and bioinformatics analysis in current genomics and other omics research and experience in using state-of-the art methodologies within the analysis of such data. The course covers working in a UNIX/LINUX command line environment, scripting using Perl/Python, statistical data analysis with applications in R, processing and analysis of next-generation sequence data of various types, and analysis and interpretation of results in genomics research.

Instruction

The teaching is performed as lectures, mandatory seminars and workshops in English.

Assessment

Examination includes a written exam graded fail (U) or pass (G). The bioinformatics problem solving ability will be examined by practical assignments relating to each of the course sections, that are to be solved individually or in groups, and that will be graded fail (U) or pass (G) only. To pass the course the students have to successfully complete all practical assignments and pass the written exam. Possibility for completion of not approved practical assignments may be given at the earliest at next course and only in case of a vacancy. Students who have failed the first examination are allowed five re-examinations.

If there exist special reasons the examiner can give allowance for alternative sets of assessment to examine an individual student. Specific conditions may e.g. be special pedagogic support described by the university’s coordinator for special support.

Reading list

No reading list found.