- Sheffield - 6th/7th April 2020
- 09:30am - 5pm
- Bartolome House, Seminar Room EG03
Please book your place on the course using this link
Healthcare systems around the globe are transforming medical practice to incorporate genetic variation as a vital information used for diagnosis and treatment planning. Genome sequencing technologies allow us to detect all variants in a patient’s genome and international collaborative efforts such as The 100,000 Genomes Project, The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) have begun to catalogue and release data on genomic variation in a variety of disease types.
However, such datasets pose new challenges in the way the data have to be analyzed, annotated and interpreted which are not trivial and are daunting to the clinician or biomedical scientist. This course covers state-of-the-art and best-practice tools for the analysis of genomes. We describe, and give hands-on experience of, the entire analysis workflow from raw data generated by a sequencing machine to deriving variant calls (e.g. Single Nucleotide Variants) that are ready for downstream analysis, interpretation and prioritisation.
We will describe the steps involved to go from sequencing library to a prioritised, clinically-relevant list of DNA variants. Practical sessions will use the user-friendly Galaxy interface (https://usegalaxy.org/) to demonstrate tasks such as alignment, quality control, variant-calling and annotation.
Who should attend this course?
Healthcare providers and researchers who want to get an appreciation for how variants are identified and interpreted, but not neccesarily working on this problem routinely.
Objectives:- After this course you should be able to:
- Recall the key pipeline steps in calling variants from NGS data
- Understand the contents of fastq, bam and vcf file formats for NGS data
- Execute key steps of the pipeline using the online Galaxy services presented
- Interpret a set of variant-calls using best-practice guidelines
Aims:- During this course you will learn about:
- Common file formats for representing NGS data
- Aligning genome data to reference sequence using modern alignment programmes (e.g. bwa)
- How to determine the analytical sensitiviy of genomic tests
- Use of tools to call sequence variants (e.g. GATK) and annotation of variant call files using established databases
- Principles of integration of laboratory and clinical information, and place of best-practice guidelines for indicating the clinical significance of results (eg. ACMG guidelines and standards)
- No prior programming experience is required. Prior knowledge of NGS sequencing technologies would be an advantage, but not required.
- Dr Matthew Parker, Clinical Bioinformatics Core Scientist, Course Leader
- Dr Emily Chambers, Bioinformatics Core Scientist
- Dr Mark Dunning, Bioinformatics Core Director
- Dr Dennis Wang, Bioinformatics Lecturer
- Michael Spiller, Sheffield Childrens Hospital
- 09:30 - 09:40 Introduction to course and Learning Objectives
- 09:40 - 10:15 Overview of NGS
- 10:15 - 10:45 Introduction to Galaxy
- 11:00 - 12:45 Fastq files and alignment Practical
- 12:45 - 13:45 LUNCH (not provided)
- 13:45 - 14:30 Viewing Alignments in IGV
- 14:30 - 15:30 Variant Calling and VCF files
- 15:30 - 17:00 Variant Calling in Galaxy
- 17:00 Day 1 Wrap-up