Do I need to take the courses in a specific order?

We recommend taking the courses in the order presented, as each subsequent course will build on material from previous courses.

Will I earn university credit for completing the Specialization?

Coursera courses and certificates don't carry university credit, though some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

What will I be able to do upon completing the Specialization?

You will understand the ideas behind many different software tools that are used every day by biotech researchers, and you will know how to apply these tools to real datasets.

Is this course really 100% online? Do I need to attend any classes in person?

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Can I just enroll in a single course?

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Can I take the course for free?

No, you cannot take this course for free. When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you cannot afford the fee, you can apply for financial aid.

Bioinformatics Specialization

Journey to the Frontier of Computational Biology.

Master bioinformatics software and computational approaches in modern biology.

Instructors: Pavel Pevzner

80,826 already enrolled

7 course series

Get in-depth knowledge of a subject

from 1,282 reviews of courses in this program

Beginner level

No prior experience required

3 months to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Join Us in a Top 50 MOOC of All Time!

How do we sequence and compare genomes? How do we identify the genetic basis for disease? How do we construct an evolutionary Tree of Life for all species on Earth?

When you complete this Specialization, you will know how to answer many questions in modern biology that have become inseparable from the computational approaches used to solve them. You will also obtain a toolkit of existing software resources that are built on these computational approaches and used by thousands of biologists every day in one of the fastest growing fields in science.

Although this Specialization centers on computational topics, you do not need to know how to program in order to complete it. If you are interested in programming, we feature an "Honors Track" (called "hacker track" in previous runs of the course). The Honors Track allows you to implement the bioinformatics algorithms that you will encounter along the way in dozens of automatically graded coding challenges. By completing the Honors Track, you will be a bioinformatics software professional!

Learn more about the Bioinformatics Specialization (including why we are wearing these crazy outfits) by watching our introductory video.

You can purchase the Specialization's print companion, Bioinformatics Algorithms: An Active Learning Approach, from the textbook website.

Our first course, "Finding Hidden Messages in DNA", was named a top-50 MOOC of all time by Class Central!

Skills you'll gain

Tools you'll learn

Python Programming

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

Advance your subject-matter expertise

Learn in-demand skills from university and industry experts
Master a subject or tool with hands-on projects
Develop a deep understanding of key concepts
Earn a career certificate from University of California San Diego

Specialization - 7 course series

Finding Hidden Messages in DNA (Bioinformatics I)

Course 1, 16 hours

What you'll learn

Named a top 50 MOOC of all time by Class Central!

This course begins a series of classes illustrating the power of computing in modern biology. Please join us on the frontier of bioinformatics to look for hidden messages in DNA without ever needing to put on a lab coat. In the first half of the course, we investigate DNA replication and ask: where in the genome does DNA replication begin? We will see that we can answer this question for many bacteria using only some straightforward algorithms to look for hidden messages in the genome. In the second half of the course, we examine a different biological question: which DNA patterns play the role of molecular clocks? The cells in your body manage to maintain a circadian rhythm, but how is this achieved on the level of DNA? Once again, we will see that by knowing which hidden messages to look for, we can start to understand the amazingly complex language of DNA. Perhaps surprisingly, we will apply randomized algorithms, which roll dice and flip coins in order to solve problems. Finally, you will get your hands dirty and apply existing software tools to find recurring biological motifs within genes that are responsible for helping Mycobacterium tuberculosis go "dormant" within a host for many years before causing an active infection.
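As a rough illustration of what a "straightforward algorithm" for hidden messages might look like, the sketch below counts every substring of length k (a k-mer) in a DNA string and reports the most frequent ones. This is an assumption about the flavor of the material, not the course's own code; the function name most_frequent_kmers is hypothetical.

```python
from collections import Counter


def most_frequent_kmers(text: str, k: int) -> list[str]:
    """Return the k-mers that occur most often in text (overlaps allowed)."""
    if k <= 0 or k > len(text):
        return []
    # Slide a window of width k across the text and count each substring.
    counts = Counter(text[i:i + k] for i in range(len(text) - k + 1))
    top = max(counts.values())
    # Sort so the result is deterministic when several k-mers tie.
    return sorted(kmer for kmer, n in counts.items() if n == top)


print(most_frequent_kmers("ACGTTGCATGTCGCATGATGCATGAGAGCT", 4))
```

A k-mer that appears surprisingly often in a short stretch of a genome is a candidate "hidden message", such as a protein binding site.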

Skills you'll gain

Category: Algorithms

Category: Bioinformatics

Category: Molecular Biology

Category: Molecular, Cellular, and Microbiology

Category: Biology

Category: Statistical Methods

Category: Python Programming

Category: Data Analysis Software

Category: Microbiology

Genome Sequencing (Bioinformatics II)

Course 2, 17 hours

What you'll learn

You may have heard a lot about genome sequencing and its potential to usher in an era of personalized medicine, but what does it mean to sequence a genome?

Biologists still cannot read the 3 billion nucleotides of a human genome as you would read a book from beginning to end. However, they can read short fragments of DNA. In the first half of the course, we will see how graph theory can be used to assemble genomes from these short pieces in what amounts to the largest jigsaw puzzle ever put together. In the second half of the course, we will discuss antibiotics, a topic of great relevance as antimicrobial-resistant bacteria like MRSA are on the rise. You know antibiotics as drugs, but on the molecular level they are short mini-proteins that have been engineered by bacteria to kill their enemies. Determining the sequence of amino acids making up one of these antibiotics is an important research problem, and one that is similar to that of sequencing a genome by assembling tiny fragments of DNA. We will see how brute force algorithms that try every possible solution are able to identify naturally occurring antibiotics so that they can be synthesized in a lab. Finally, you will learn how to apply popular bioinformatics software tools to sequence the genome of a deadly Staphylococcus bacterium that has acquired antibiotic resistance.
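One common way to turn assembly into a graph problem is the de Bruijn graph, sketched below under simplifying assumptions: error-free reads, all of the same length k, that admit an Eulerian path. The page does not say which graph construction the course uses, so treat this as an illustration; the function names are hypothetical.

```python
from collections import defaultdict


def de_bruijn(kmers):
    """Map each (k-1)-mer prefix to the (k-1)-mer suffixes it points to."""
    graph = defaultdict(list)
    for kmer in kmers:
        graph[kmer[:-1]].append(kmer[1:])
    return graph


def eulerian_path(graph):
    """Hierholzer's algorithm; assumes an Eulerian path exists."""
    in_deg = defaultdict(int)
    for targets in graph.values():
        for target in targets:
            in_deg[target] += 1
    # Start at the node with one more outgoing than incoming edge,
    # or at an arbitrary node if the graph is a cycle.
    start = next(
        (node for node, targets in graph.items()
         if len(targets) - in_deg[node] == 1),
        next(iter(graph)),
    )
    remaining = {node: list(targets) for node, targets in graph.items()}
    stack, path = [start], []
    while stack:
        node = stack[-1]
        if remaining.get(node):
            stack.append(remaining[node].pop())  # follow an unused edge
        else:
            path.append(stack.pop())  # dead end: emit node
    return path[::-1]


def assemble(kmers):
    """Spell the string traced by an Eulerian path through the graph."""
    path = eulerian_path(de_bruijn(kmers))
    return path[0] + "".join(node[-1] for node in path[1:])


print(assemble(["GTG", "ACG", "TGA", "CGT"]))
```

Each read becomes an edge rather than a node, so assembling the genome means visiting every edge exactly once.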

Skills you'll gain

Category: Bioinformatics

Category: Algorithms

Category: Python Programming

Category: Computational Thinking

Category: Precision Medicine

Category: Brute-force attacks

Category: Molecular Biology

Category: Infectious Diseases

Category: Biotechnology

Category: Microbiology

Category: Biochemistry

Comparing Genes, Proteins, and Genomes (Bioinformatics III)

Course 3, 22 hours

What you'll learn

Once we have sequenced genomes in the previous course, we would like to compare them to determine how species have evolved and what makes them different.

In the first half of the course, we will compare two short biological sequences, such as genes (i.e., short sequences of DNA) or proteins. We will encounter a powerful algorithmic tool called dynamic programming that will help us determine the number of mutations that have separated the two genes/proteins. In the second half of the course, we will "zoom out" to compare entire genomes, where we see large scale mutations called genome rearrangements, seismic events that have heaved around large blocks of DNA over millions of years of evolution. Looking at the human and mouse genomes, we will ask ourselves: just as earthquakes are much more likely to occur along fault lines, are there locations in our genome that are "fragile" and more susceptible to breaking as part of genome rearrangements? We will see how combinatorial algorithms will help us answer this question. Finally, you will learn how to apply popular bioinformatics software tools, including BLAST, to solve problems in sequence alignment.
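To make the dynamic programming idea concrete, here is a minimal sketch that computes the edit distance between two sequences: the smallest number of single-symbol substitutions, insertions, and deletions that turn one into the other. Using unit costs is an assumption made for illustration; real alignment tools typically use scoring matrices and gap penalties.

```python
def edit_distance(a: str, b: str) -> int:
    """Minimum number of substitutions, insertions and deletions from a to b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # match or substitute
            ))
        prev = curr
    return prev[-1]


print(edit_distance("PLEASANTLY", "MEANLY"))  # prints 5
```

Each table entry is built from three smaller subproblems, which is what lets the algorithm avoid enumerating every possible alignment.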

Skills you'll gain

Category: Bioinformatics

Category: Graph Theory

Category: Memory Management

Category: Computational Thinking

Category: Python Programming

Molecular Evolution (Bioinformatics IV)

Course 4, 18 hours

What you'll learn

In the previous course in the Specialization, we learned how to compare genes, proteins, and genomes. One way we can use these methods is in order to construct a "Tree of Life" showing how a large collection of related organisms have evolved over time.

In the first half of the course, we will discuss approaches for evolutionary tree construction that have been the subject of some of the most cited scientific papers of all time, and show how they can resolve quandaries from finding the origin of a deadly virus to locating the birthplace of modern humans. In the second half of the course, we will shift gears and examine the old claim that birds evolved from dinosaurs. How can we prove this? In particular, we will examine a result that claimed that peptides harvested from a T. rex fossil closely matched peptides found in chickens. We will use methods from computational proteomics to ask how we could assess whether this result is valid or due to some form of contamination. Finally, you will learn how to apply popular bioinformatics software tools to reconstruct an evolutionary tree of ebolaviruses and identify the source of the recent Ebola epidemic that made global headlines.

Skills you'll gain

Category: Bioinformatics

Category: Statistical Analysis

Category: Statistical Methods

Category: Molecular Biology

Category: Graph Theory

Category: Taxonomy

Genomic Data Science and Clustering (Bioinformatics V)

Course 5, 10 hours

What you'll learn

How do we infer which genes orchestrate various processes in the cell? How did humans migrate out of Africa and spread around the world? In this class, we will see that these two seemingly different questions can be addressed using similar algorithmic and machine learning techniques arising from the general problem of dividing data points into distinct clusters.

In the first half of the course, we will introduce algorithms for clustering a group of objects into a collection of clusters based on their similarity, a classic problem in data science, and see how these algorithms can be applied to gene expression data. In the second half of the course, we will introduce another classic tool in data science called principal components analysis that can be used to preprocess multidimensional data before clustering in an effort to greatly reduce the number of dimensions without losing much of the "signal" in the data. Finally, you will learn how to apply popular bioinformatics software tools to solve a real problem in clustering.
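As one example of a clustering algorithm, the sketch below implements the Lloyd algorithm for k-means in plain Python. The page does not name the specific algorithms covered, so this choice is an assumption; the function name lloyd_kmeans and the toy data are hypothetical, and the initial centers are supplied by the caller to keep the result deterministic.

```python
def lloyd_kmeans(points, centers, iterations=100):
    """Alternate between assigning points to centers and recomputing centers.

    points and centers are lists of equal-length tuples of numbers.
    """
    centers = [tuple(c) for c in centers]
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(
                range(len(centers)),
                key=lambda i: sum((x - y) ** 2 for x, y in zip(p, centers[i])),
            )
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster.
        new_centers = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
            if cluster else centers[i]  # keep an empty cluster's center
            for i, cluster in enumerate(clusters)
        ]
        if new_centers == centers:
            break  # converged
        centers = new_centers
    return centers, clusters


# Toy "expression profiles": two genes measured under two conditions each.
data = [(0, 0), (0, 1), (10, 10), (10, 11)]
final_centers, groups = lloyd_kmeans(data, centers=[(0, 0), (10, 10)])
print(final_centers)
```

With real gene expression data, each point would have one coordinate per experimental condition, which is why reducing the number of dimensions first can help.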

Skills you'll gain

Category: Unsupervised Learning

Category: Bioinformatics

Category: Anthropology

Category: Machine Learning

Category: Applied Machine Learning

Category: Data Analysis Software

Category: Data Mining

Category: Dimensionality Reduction

Category: Data Preprocessing

Category: Taxonomy

Category: Life Sciences

Category: Machine Learning Algorithms

Category: Statistical Methods

Finding Mutations in DNA and Proteins (Bioinformatics VI)

Course 6, 24 hours

What you'll learn

In previous courses in the Specialization, we have discussed how to sequence and compare genomes. This course will cover advanced topics in finding mutations lurking within DNA and proteins.

In the first half of the course, we would like to ask how an individual's genome differs from the "reference genome" of the species. Our goal is to take small fragments of DNA from the individual and "map" them to the reference genome. We will see that the combinatorial pattern matching algorithms solving this problem are elegant and extremely efficient, requiring a surprisingly small amount of runtime and memory. In the second half of the course, we will learn how to identify the function of a protein even if it has been bombarded by so many mutations compared to similar proteins with known functions that it has become barely recognizable. This is the case, for example, in HIV studies, since the virus often mutates so quickly that researchers can struggle to study it. The approach we will use is based on a powerful machine learning tool called a hidden Markov model. Finally, you will learn how to apply popular bioinformatics software tools based on hidden Markov models to compare a protein against a related family of proteins.
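The read mapping idea can be sketched with a suffix array: sort all suffixes of the reference once, then locate each read by binary search. This is an illustrative assumption about the kind of index involved, not the course's implementation. The naive construction below is far slower and more memory-hungry than the efficient methods the description alludes to, and it finds exact matches only.

```python
def build_suffix_array(text: str) -> list[int]:
    """Starting positions of all suffixes of text, in sorted suffix order.

    Naive construction for illustration; not suitable for real genomes.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, suffix_array: list[int], pattern: str) -> list[int]:
    """Return every position where pattern occurs exactly in text."""
    m = len(pattern)
    # First suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[suffix_array[mid]:suffix_array[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    # First suffix whose length-m prefix is > pattern.
    hi = len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[suffix_array[mid]:suffix_array[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(suffix_array[start:lo])


reference = "AATCGGGTTCAATCGGGGT"
index = build_suffix_array(reference)
print(find_occurrences(reference, index, "ATCG"))  # prints [1, 11]
```

Building the index once and reusing it for every read is what makes mapping millions of short fragments practical.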

Skills you'll gain

Category: Markov Model

Category: Bioinformatics

Category: Memory Management

Category: Algorithms

Category: Data Transformation

Category: Performance Tuning

Category: Machine Learning Methods

Category: Molecular Biology

Bioinformatics Capstone: Big Data in Biology

Course 7, 13 hours

What you'll learn

In this course, you will learn how to use the BaseSpace cloud platform developed by Illumina (our industry partner) to apply several standard bioinformatics software approaches to real biological data.

In particular, in a series of Application Challenges, you will see how genome assembly can be used to track the source of a food poisoning outbreak and how RNA-Sequencing can help us analyze gene expression data on the tissue level, and you will compare the pros and cons of whole genome vs. whole exome sequencing for finding potentially harmful mutations in a human sample. Plus, hacker track students will have the option to build their own genome assembler and apply it to real data!