When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 4 modules in this course
We will learn computational methods -- algorithms and data structures -- for analyzing DNA sequencing data. We will learn a little about DNA, genomics, and how DNA sequencing is used. We will use Python to implement key algorithms and data structures and to analyze real genomes and DNA sequencing datasets.
This module we begin our exploration of algorithms for analyzing DNA sequencing data. We'll discuss DNA sequencing technology, its past and present, and how it works.
What's included
19 videos7 readings2 assignments
Show info about module content
19 videos•Total 112 minutes
Module 1 Introduction•2 minutes
Lecture: Why study this?•4 minutes
Lecture: DNA sequencing past and present•3 minutes
Lecture: Genomes as strings, reads as substrings•5 minutes
Lecture: String definitions and Python examples•4 minutes
Practical: String basics •7 minutes
Practical: Manipulating DNA strings •7 minutes
Practical: Downloading and parsing a genome •6 minutes
Lecture: How DNA gets copied•4 minutes
Optional lecture: How second-generation sequencers work •7 minutes
Optional lecture: Sequencing errors and base qualities •7 minutes
Lecture: Sequencing reads in FASTQ format•5 minutes
Practical: Working with sequencing reads •11 minutes
Practical: Analyzing reads by position •7 minutes
Lecture: Sequencers give pieces to genomic puzzles•6 minutes
Lecture: Read alignment and why it's hard•3 minutes
Lecture: Naive exact matching•10 minutes
Practical: Matching artificial reads •7 minutes
Practical: Matching real reads •7 minutes
7 readings•Total 70 minutes
Welcome to Algorithms for DNA Sequencing•10 minutes
In this module, we learn useful and flexible new algorithms for solving the exact and approximate matching problems. We'll start by learning Boyer-Moore, a fast and very widely used algorithm for exact matching
What's included
15 videos1 reading2 assignments
Show info about module content
15 videos•Total 114 minutes
Week 2 Introduction •2 minutes
Lecture: Boyer-Moore basics•9 minutes
Lecture: Boyer-Moore: putting it all together•6 minutes
Lecture: Diversion: Repetitive elements•5 minutes
Practical: Implementing Boyer-Moore •10 minutes
Lecture: Preprocessing•7 minutes
Lecture: Indexing and the k-mer index•11 minutes
Lecture: Ordered structures for indexing•8 minutes
Lecture: Hash tables for indexing•7 minutes
Practical: Implementing a k-mer index •7 minutes
Lecture: Variations on k-mer indexes•9 minutes
Lecture: Genome indexes used in research•10 minutes
Lecture: Approximate matching, Hamming and edit distance•7 minutes
Lecture: Pigeonhole principle•7 minutes
Practical: Implementing the pigeonhole principle •10 minutes
This week we finish our discussion of read alignment by learning about algorithms that solve both the edit distance problem and related biosequence analysis problems, like global and local alignment.
What's included
13 videos1 reading2 assignments
Show info about module content
13 videos•Total 92 minutes
Module 3 Introduction •1 minute
Lecture: Solving the edit distance problem•13 minutes
Lecture: Using dynamic programming for edit distance•12 minutes
Practical: Implementing dynamic programming for edit distance •6 minutes
Lecture: A new solution to approximate matching•9 minutes
Lecture: Meet the family: global and local alignment•11 minutes
Practical: Implementing global alignment •8 minutes
Lecture: Read alignment in the field•4 minutes
Lecture: Assembly: working from scratch•3 minutes
Lecture: First and second laws of assembly•8 minutes
Lecture: Overlap graphs•8 minutes
Practical: Overlaps between pairs of reads •4 minutes
Practical: Finding and representing all overlaps •4 minutes
In the last module we began our discussion of the assembly problem and we saw a couple basic principles behind it. In this module, we'll learn a few ways to solve the alignment problem.
What's included
13 videos1 reading2 assignments
Show info about module content
13 videos•Total 83 minutes
Module 4 introduction •1 minute
Lecture: The shortest common superstring problem•8 minutes
Practical: Implementing shortest common superstring •5 minutes
Lecture: Greedy shortest common superstring•8 minutes
Practical: Implementing greedy shortest common superstring •7 minutes
Lecture: Third law of assembly: repeats are bad•6 minutes
Lecture: De Bruijn graphs and Eulerian walks•9 minutes
Practical: Building a De Bruijn graph •5 minutes
Lecture: When Eulerian walks go wrong•10 minutes
Lecture: Assemblers in practice•8 minutes
Lecture: The future is long?•10 minutes
Lecture: Computer science and life science•5 minutes
Lecture: Thank yous •1 minute
1 reading•Total 10 minutes
Post Course Survey•10 minutes
2 assignments•Total 60 minutes
Programming Homework 4•30 minutes
Module 4•30 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Instructor ratings
Instructor ratings
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
The mission of The Johns Hopkins University is to educate its students and cultivate their capacity for life-long learning, to foster independent and original research, and to bring the benefits of discovery to the world.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.