When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 10 modules in this course
This course introduces data analysis methods used in systems biology, bioinformatics, and systems pharmacology research. The course covers methods to process raw data from genome-wide mRNA expression studies (microarrays and RNA-seq) including data normalization, clustering, dimensionality reduction, differential expression, enrichment analysis, and network construction. The course contains practical tutorials for using several bioinformatics tools and setting up data analysis pipelines, also covering the mathematics behind the methods applied by these tools and workflows. The course is mostly appropriate for beginning graduate students and advanced undergraduates majoring in fields such as biology, statistics, physics, chemistry, computer science, biomedical and electrical engineering. The course should be useful for wet- and dry-lab researchers who encounter large datasets in their own research. The course presents software tools developed by the Ma’ayan Laboratory (http://labs.icahn.mssm.edu/maayanlab/) from the Icahn School of Medicine at Mount Sinai in New York City, but also other freely available data analysis and visualization tools. The overarching goal of the course is to enable students to utilize the methods presented in this course for analyzing their own data for their own projects. For those students that do not work in the field, the course introduces research challenges faced in the fields of computational systems biology and systems pharmacology.
The 'Introduction to Complex Systems' module discusses complex systems and leads to the idea that a cell can be considered a complex system or a complex agent living in a complex environment just like us. The 'Introduction to Biology for Engineers' module provides an introduction to some central topics in cell and molecular biology for those who do not have the background in the field. This is not a comprehensive coverage of cell and molecular biology. The goal is to provide an entry point to motivate those who are interested in this field, coming from other disciplines, to begin studying biology.
What's included
3 videos3 readings3 assignments
Show info about module content
3 videos•Total 52 minutes
Design Principles of Complex Systems•16 minutes
Introduction to Cell Biology•17 minutes
Introduction to Molecular Biology•19 minutes
3 readings•Total 30 minutes
Course Logistics•10 minutes
Grading Policy•10 minutes
Resources and Links to Additional Materials•10 minutes
3 assignments•Total 90 minutes
Introduction to Complex Systems•30 minutes
Introduction to Cell Biology•30 minutes
Introduction to Molecular Biology•30 minutes
Topological and Network Evolution Models
Module 2•3 hours to complete
Module details
In the 'Topological and Network Evolution Models' module, we provide several lectures about a historical perspective of network analysis in systems biology. The focus is on in-silico network evolution models. These are simple computational models that, based of few rules, can create networks that have a similar topology to the molecular networks observed in biological systems.
What's included
4 videos4 assignments
Show info about module content
4 videos•Total 45 minutes
Small-World and Scale-Free Networks•15 minutes
Duplication-Divergence and Network Motifs•9 minutes
Large Size Motifs and Complex Models of Network Evolution•11 minutes
Network Properties of Biological Networks•12 minutes
4 assignments•Total 120 minutes
Rich-Get-Richer•30 minutes
Duplication-Divergence and Network Motifs•30 minutes
Large Size Motifs•30 minutes
Topological Properties of Biological Networks•30 minutes
Types of Biological Networks
Module 3•3 hours to complete
Module details
The 'Types of Biological Networks' module is about the various types of networks that are typically constructed and analyzed in systems biology and systems pharmacology. This lecture ends with the idea of functional association networks (FANs). Following this lecture are lectures that discuss how to construct FANs and how to use these networks for analyzing gene lists.
What's included
4 videos4 assignments
Show info about module content
4 videos•Total 58 minutes
Types of Biological Networks•12 minutes
Genes2Networks and Network Visualization•17 minutes
Sets2Networks - Creating Functional Association Networks•15 minutes
Genes2FANs - Analyzing Gene Lists with Functional Association Networks•14 minutes
4 assignments•Total 120 minutes
Types of Biological Networks•30 minutes
Genes2Networks and Network Visualization•30 minutes
Functional Association Networks with Sets2Networks•30 minutes
Functional Association Networks with Genes2FANs•30 minutes
Data Processing and Identifying Differentially Expressed Genes
Module 4•2 hours to complete
Module details
This set of lectures in the 'Data Processing and Identifying Differentially Expressed Genes' module first discusses data normalization methods, and then several lectures are devoted to explaining the problem of identifying differentially expressed genes with the focus on understanding the inner workings of a new method developed by the Ma'ayan Laboratory called the Characteristic Direction.
What's included
5 videos2 assignments
Show info about module content
5 videos•Total 41 minutes
Data Normalization•9 minutes
Characteristic Direction Method - Part 1•9 minutes
Characteristic Direction Method - Part 2•8 minutes
Characteristic Direction Method - Part 3•11 minutes
Characteristic Direction Method - Part 4•6 minutes
2 assignments•Total 60 minutes
Data Normalization•30 minutes
Characteristic Direction•30 minutes
Gene Set Enrichment and Network Analyses
Module 5•6 hours to complete
Module details
In the 'Gene Set Enrichment and Network Analyses' module the emphasis is on tools developed by the Ma'ayan Laboratory to analyze gene sets. Several tools will be discussed including: Enrichr, GEO2Enrichr, Expression2Kinases and DrugPairSeeker. In addition, one lecture will be devoted to a method we call enrichment vector clustering we developed, and two lectures will describe the popular gene set enrichment analysis (GSEA) method and an improved method we developed called principal angle enrichment analysis (PAEA).
What's included
9 videos1 reading8 assignments
Show info about module content
9 videos•Total 139 minutes
Enrichment Analysis and Enrichr•21 minutes
GEO2Enrichr: A Google Chrome Extension for Gene Set Extraction and Enrichment•8 minutes
Gene Set Enrichment Analysis (GSEA) - Preliminaries•13 minutes
Gene Set Enrichment Analysis (GSEA) - Part 2•8 minutes
Principal Angle Enrichment Analysis (PAEA)•18 minutes
Network2Canvas (N2C) and Enrichment Analysis with N2C•18 minutes
Expression2Kinases: Inferring Pathways from Differentially Expressed Genes•24 minutes
DrugPairSeeker and the New CMAP•17 minutes
Classifying Patients/Tumors from TCGA•11 minutes
1 reading•Total 10 minutes
GATE Desktop Software Tool•10 minutes
8 assignments•Total 240 minutes
The Fisher Exact Test and Enrichr•30 minutes
Gene Set Enrichment Analysis (GSEA) - Part 1•30 minutes
Gene Set Enrichment Analysis (GSEA) - Part 2•30 minutes
Principal Angle Enrichment Analysis (PAEA)•30 minutes
GATE and Network2Canvas•30 minutes
Expression2Kinases•30 minutes
DrugPairSeeker and the New CMAP•30 minutes
Classifying Patients from TCGA•30 minutes
Deep Sequencing Data Processing and Analysis
Module 6•6 hours to complete
Module details
A set of lectures in the 'Deep Sequencing Data Processing and Analysis' module will cover the basic steps and popular pipelines to analyze RNA-seq and ChIP-seq data going from the raw data to gene lists to figures. These lectures also cover UNIX/Linux commands and some programming elements of R, a popular freely available statistical software. Note that since these lectures were developed and recorded during the Fall of 2013, it is possible that there are better tools that should be used now since the field is rapidly advancing.
What's included
7 videos7 assignments
Show info about module content
7 videos•Total 125 minutes
RNA-seq Analysis - Preliminaries•18 minutes
RNA-seq Analysis - Using TopHat and Cufflinks•21 minutes
RNA-seq Analysis - R Basics•23 minutes
RNA-seq Analysis - CummeRbund•23 minutes
STAR: An Ultra-fast RNA-seq Aligner•14 minutes
ChIP-seq Analysis - Part 1•13 minutes
ChIP-seq Analysis - Part 2•12 minutes
7 assignments•Total 210 minutes
RNA-seq and UNIX/Linux Commands•30 minutes
RNA-seq Pipeline•30 minutes
CummeRbund and R Programming•30 minutes
CummeRbund - Demo•30 minutes
RNA-seq STAR•30 minutes
ChIP-seq Analysis - Part 1•30 minutes
ChIP-seq Analysis - Part 2•30 minutes
Principal Component Analysis, Self-Organizing Maps, Network-Based Clustering and Hierarchical Clustering
Module 7•5 hours to complete
Module details
This module is devoted to various method of clustering: principal component analysis, self-organizing maps, network-based clustering and hierarchical clustering. The theory behind these methods of analysis are covered in detail, and this is followed by some practical demonstration of the methods for applications using R and MATLAB.
What's included
6 videos1 reading6 assignments
Show info about module content
6 videos•Total 90 minutes
Principal Component Analysis (PCA) - Part 1•13 minutes
Principal Component Analysis (PCA) - Part 2•8 minutes
Principal Component Analyis (PCA) Plotting in MATLAB•16 minutes
Clustergram in MATLAB•14 minutes
Self-Organizing Maps•14 minutes
Network-Based Clustering•25 minutes
1 reading•Total 10 minutes
MATLAB License•10 minutes
6 assignments•Total 180 minutes
Principal Component Analysis (PCA) - Part 1•30 minutes
Principal Component Analysis (PCA) - Part 2•30 minutes
Principal Component Analysis (PCA) with MATLAB•30 minutes
Hierarchical Clustering (HC) with MATLAB•30 minutes
Self-Organizing Maps•30 minutes
Network-Based Clustering•30 minutes
Resources for Data Integration
Module 8•2 hours to complete
Module details
The lectures in the 'Resources for Data Integration' module are about the various types of networks that are typically constructed and analyzed in systems biology and systems pharmacology. These lectures start with the idea of functional association networks (FANs). Following this lecture are several lectures that discuss how to construct FANs from various resources and how to use these networks for analyzing gene lists as well as to construct a puzzle that can be used to connect genomic data with phenotypic data.
What's included
5 videos2 assignments
Show info about module content
5 videos•Total 49 minutes
Big Data in Biology and Data Integration•6 minutes
Resources for Data Integration - Part 1•10 minutes
Resources for Data Integration - Part 2•12 minutes
Resources for Data Integration - Part 3•9 minutes
Resources for Data Integration - Part 4•11 minutes
2 assignments•Total 60 minutes
Big Data in Biology and Data Integration•30 minutes
Resources for Data Integration•30 minutes
Crowdsourcing: Microtasks and Megatasks
Module 9•1 hour to complete
Module details
The final set of lectures presents the idea of crowdsourcing. MOOCs provide the opportunity to work together on projects that are difficult to complete alone (microtasks) or compete for implementing the best algorithms to solve hard problems (megatasks). You will have the opportunity to participate in various crowdsourcing projects: microtasks and megatasks. These projects are designed specifically for this course.
What's included
2 videos1 assignment
Show info about module content
2 videos•Total 19 minutes
Crowdsourcing in Bioinformatics•16 minutes
Crowdsourcing Tasks for this Course•4 minutes
1 assignment•Total 30 minutes
Crowdsourcing: Microtasks and Megatasks•30 minutes
Final Exam
Module 10•1 hour to complete
Module details
The final exam consists of multiple choice questions from topics covered in all of modules of the course. Some of the questions may require you to perform some of the analysis methods you learned throughout the course on new datasets.
What's included
1 assignment
Show info about module content
1 assignment•Total 30 minutes
Final Exam•30 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Instructor ratings
Instructor ratings
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
The Icahn School of Medicine at Mount Sinai, in New York City is a leader in medical and scientific training and education, biomedical research and patient care.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.