When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 8 modules in this course
Large-scale biology projects such as the sequencing of the human genome and gene expression surveys using RNA-seq, microarrays and other technologies have created a wealth of data for biologists. However, the challenge facing scientists is analyzing and even accessing these data to extract useful information pertaining to the system being studied. This course focuses on employing existing bioinformatic resources – mainly web-based programs and databases – to access the wealth of data to answer questions relevant to the average biologist, and is highly hands-on.
Topics covered include multiple sequence alignments, phylogenetics, gene expression data analysis, and protein interaction networks, in two separate parts.
The first part, Bioinformatic Methods I (this one), deals with databases, Blast, multiple sequence alignments, phylogenetics, selection analysis and metagenomics.
The second part, Bioinformatic Methods II, covers motif searching, protein-protein interactions, structural bioinformatics, gene expression data analysis, and cis-element predictions.
This pair of courses is useful to any student considering graduate school in the biological sciences, as well as students considering molecular medicine. Both provide an overview of the many different bioinformatic tools that are out there.
These courses are based on one taught at the University of Toronto to upper-level undergraduates who have some understanding of basic molecular biology. If you're not familiar with this, something like BIO101 from Saylor Academy (https://learn.saylor.org/course/view.php?id=889) might be helpful. No programming is required for this course.
Bioinformatic Methods I is regularly updated, and was completely updated for January 2026.
In this module we'll be exploring the amazing resources available at NCBI, the National Centre for Biotechnology Information, run by the National Library of Medicine in the USA. We'll also be doing a Blast search to find similar sequences in the enormous NR sequence database. We can use similar sequences to infer homology, which is the primary predictor of gene or protein function.
What's included
4 videos4 readings1 assignment
Show info about module content
4 videos•Total 52 minutes
Introduction•2 minutes
Lecture•24 minutes
Lab Discussion•25 minutes
Summary•1 minute
4 readings•Total 120 minutes
Acknowledgements•10 minutes
Course Logistics•10 minutes
Lecture Materials•10 minutes
Lab 1 -- Exploring NCBI•90 minutes
1 assignment•Total 30 minutes
Lab 1 Quiz•30 minutes
Blast II/Comparative Genomics
Module 2•3 hours to complete
Module details
In this module we'll continue exploring the incredible resources available at NCBI, the National Centre for Biotechnology Information. We will be performing several different kinds of Blast searches: BlastP, PSI-Blast, and Translated Blast. We can use similar sequences identified by such methods to infer homology, which is the primary predictor of gene or protein function. We'll also be comparing parts of the genomes of a couple of different species, to see how similar they are.
What's included
4 videos2 readings1 assignment
Show info about module content
4 videos•Total 58 minutes
Introduction•2 minutes
Lecture•32 minutes
Lab Discussion•23 minutes
Summary•1 minute
2 readings•Total 100 minutes
Lecture Materials•10 minutes
Lab 2 -- Advanced Blast and Comparative Genomics•90 minutes
1 assignment•Total 30 minutes
Lab 2 Quiz•30 minutes
Multiple Sequence Alignments
Module 3•3 hours to complete
Module details
In this module we'll be doing multiple sequence alignments with Clustal and MUSCLE (as implemented in MEGA), and MAFFT. Multiple sequences alignments can tell you where in a sequence the conserved and variable regions are, which is important for understanding the biology of the sequences under investigation. It also has practical applications, such as being able to design PCR primers that will amplify sequences from a number of different species, for example.
What's included
4 videos2 readings1 assignment
Show info about module content
4 videos•Total 46 minutes
Introduction•2 minutes
Lecture•25 minutes
Lab Discussion•18 minutes
Summary•1 minute
2 readings•Total 100 minutes
Lecture Materials•10 minutes
Lab 3 -- Multiple Sequence Alignment•90 minutes
1 assignment•Total 30 minutes
Lab 3 Quiz•30 minutes
Review: NCBI/Blast I, Blast II/Comparative Genetics, and Multiple Sequence Alignments
Module 4•1 hour to complete
Module details
What's included
1 assignment
Show info about module content
1 assignment•Total 30 minutes
Quiz: Modules 1-3•30 minutes
Phylogenetics
Module 5•3 hours to complete
Module details
In this module we'll be using the multiple sequence alignments we generated last lab to do some phylogenetic analyses with both neighbour-joining and maximum likelihood methods. The tree-like structure generated by such analyses tells us how closely sequences are related one to another, and suggests when in evolutionary time a speciation or gene duplication event occurred.
What's included
4 videos2 readings1 assignment
Show info about module content
4 videos•Total 43 minutes
Introduction•1 minute
Lecture•27 minutes
Lab Discussion•15 minutes
Summary•1 minute
2 readings•Total 100 minutes
Lecture Materials•10 minutes
Lab 4 -- Phylogenetics•90 minutes
1 assignment•Total 30 minutes
Lab 4 Quiz•30 minutes
Selection Analysis
Module 6•3 hours to complete
Module details
In this module we'll take a set of orthologous sequences from bacteria and use DataMonkey to analyze them for the presence of certain sites under positive, negative or neutral selection. Such an analysis can help understand the biology of a set of protein coding sequences by identifying residues that might be important for biological function (those residues under negative selection) or those that might be involved in response to external influences, such as drugs, pathogens or other factors (residues under positive selection).
In this module we'll explore some of the data that have been generated as a result of the rapid decrease in the cost of sequencing DNA. We'll be exploring a couple of RNA-Seq data sets that can tell us where any given gene is expressed, and also how that gene might be alternatively spliced. We'll also be looking at a couple of metagenome data sets that can tell us about the kinds of species (especially microbial species that might otherwise be hard to culture) that are in a given environmental niche.
What's included
4 videos2 readings1 assignment
Show info about module content
4 videos•Total 49 minutes
Introduction•3 minutes
Lecture•24 minutes
Lab Discussion•18 minutes
Summary•3 minutes
2 readings•Total 100 minutes
Lecture Materials•10 minutes
Lab 6 -- Next Generation Sequencing Applications: RNA-Seq and Metagenomics•90 minutes
1 assignment•Total 30 minutes
Lab 6 Quiz•30 minutes
Review: Phylogenetics, Selection Analysis, and 'Next Gen' Sequence Analysis (RNA-seq)/Metagenomics + Final Assignment
Module 8•1 hour to complete
Module details
What's included
1 reading2 assignments
Show info about module content
1 reading•Total 10 minutes
Final Assignment Instructions•10 minutes
2 assignments•Total 60 minutes
Review: Modules 5-7 •30 minutes
Final Assignment•30 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Instructor ratings
Instructor ratings
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
Established in 1827, the University of Toronto is one of the world’s leading universities, renowned for its excellence in teaching, research, innovation and entrepreneurship, as well as its impact on economic prosperity and social well-being around the globe.
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Learner reviews
4.7
1,786 reviews
5 stars
77.77%
4 stars
18.08%
3 stars
3.24%
2 stars
0.33%
1 star
0.55%
Showing 3 of 1786
P
PG
5·
Reviewed on Jan 21, 2019
This course is so accurate and useful to the bioinformatic students and the science students who needs help with the computational biology. I would recommend this course to everyone.
N
NS
4·
Reviewed on Oct 2, 2015
A thorough course for beginners with very interesting labs! I was pleasantly surprised with how much and how easily I learned bioinformatics skills with this course. Well done!
R
RG
5·
Reviewed on Nov 4, 2017
I enjoyed doing the course. It is exciting to see how much one can learn from a few gene or protein sequences. Thank you for making the course understandable to a beginner in bioinformatics!
What will I actually learn in this bioinformatics course?
You'll learn how to use bioinformatics tools to access biological data, compare sequences, and draw sensible conclusions from the results. It starts with databases and BLAST, then builds into broader sequence analysis and evolutionary interpretation, with later work on selection and sequencing-based data. In the guided labs, you'll do tasks such as searching GenBank, interpreting BLAST output, and examining related sequences side by side.
Do I need to know molecular biology or programming before taking this course?
No, you don't need programming for this course. Some familiarity with basic molecular biology is helpful because the course works with genes, proteins, and sequence data from the start. It moves quickly into using databases and analysis tools rather than teaching biology fundamentals from scratch.
Is this course beginner-friendly for bioinformatics?
Yes, if you're new to bioinformatics but already know some basic molecular biology, this is a good beginner course. The material starts with core tasks like using biological databases and interpreting BLAST results, then adds more advanced analysis through guided lessons and labs. If terms like genes, proteins, or homology are completely new to you, the pace may feel faster.
How long does it take to complete this course?
Plan on about 20 hours in total. That's roughly two weeks at around 10 hours a week, with enough time to move through the lessons and readings and then complete the guided labs and quizzes. The course includes lessons, readings, labs, quizzes, and a final assignment.
Are there hands-on exercises or labs in this course?
Yes, the course includes guided labs throughout rather than a single end-of-course project. You'll search biological databases, run BLAST analyses, build multiple sequence alignments, and generate phylogenetic trees in step-by-step exercises using established tools. That hands-on work helps you apply each method as you learn it.
What skills and topics are covered in this course?
You'll cover the main bioinformatics tasks used to work with sequence data: finding records, comparing sequences, and studying how sequences are related. The course also introduces selection analysis and sequencing-based data, including RNA-seq and metagenomics, so you can see how bioinformatics supports modern biology research. Overall, the emphasis is on using existing resources to answer biological questions, not on writing code.
What can I actually do after finishing this course?
After finishing, you should be able to use common bioinformatics resources to investigate a gene or protein sequence and interpret the main results. A realistic task would be finding related sequences, comparing them in an alignment, and using that information to read or build a simple phylogenetic tree. You'll also be better prepared to make sense of selection analyses and basic RNA-seq or metagenomic outputs.
Is this course more focused on theory or hands-on learning?
It's a hands-on course, but the practice is guided rather than project-heavy. Each topic is explained in lessons and then reinforced through labs and quizzes, so it suits learners who want to use real bioinformatics tools step by step.
Why would I choose this course over other bioinformatics courses?
Choose this course if you want bioinformatics taught as a working method for biologists, not as a programming course. It focuses on using established web-based resources and guided labs to answer real questions about sequences, relationships, and sequencing data, with Professor Nicholas Provart leading the instruction. If you want a broad, applied introduction that stays accessible to beginners, this course is a strong fit.