About this Course
10,089 recent views

100% online

Start instantly and learn at your own schedule.

Flexible deadlines

Reset deadlines in accordance to your schedule.

Approx. 12 hours to complete

Suggested: 3 hours/week...

English

Subtitles: English

Skills you will gain

Data AnalysisPython ProgrammingMachine LearningExploratory Data Analysis

100% online

Start instantly and learn at your own schedule.

Flexible deadlines

Reset deadlines in accordance to your schedule.

Approx. 12 hours to complete

Suggested: 3 hours/week...

English

Subtitles: English

Syllabus - What you will learn from this course

Week
1
5 hours to complete

Decision Trees

7 videos (Total 40 min), 15 readings, 1 quiz
7 videos
Machine Learning and the Bias Variance Trade-Off6m
What Is a Decision Tree?5m
What is the Process of Growing a Decision Tree?4m
Building a Decision Tree with SAS9m
Strengths and Weaknesses of Decision Trees in SAS4m
Building a Decision Tree with Python9m
15 readings
Some Guidance for Learners New to the Specialization10m
SAS or Python - Which to Choose?10m
Getting Started with SAS10m
Getting Started with Python10m
Course Codebooks10m
Course Data Sets10m
Uploading Your Own Data to SAS10m
Data Set for Decision Tree Videos (tree_addhealth.csv)10m
SAS Code: Decision Trees10m
CART Paper - Prevention Science10m
Python Code: Decision Trees10m
Installing Graphviz and pydotplus10m
Getting Set up for Assignments10m
Tumblr Instructions10m
Assignment Example10m
Week
2
3 hours to complete

Random Forests

4 videos (Total 25 min), 4 readings, 1 quiz
4 videos
Building a Random Forest with SAS7m
Building a Random Forest with Python6m
Validation and Cross-Validation7m
4 readings
SAS code: Random Forests10m
The HPForest Procedure in SAS10m
Python Code: Random Forests10m
Assignment Example10m
Week
3
3 hours to complete

Lasso Regression

5 videos (Total 32 min), 3 readings, 1 quiz
5 videos
Testing a Lasso Regression with SAS10m
Data Management for Lasso Regression in Python3m
Testing a Lasso Regression Model in Python10m
Lasso Regression Limitations2m
3 readings
SAS Code: Lasso Regression10m
Python Code: Lasso Regression10m
Assignment Example10m
Week
4
3 hours to complete

K-Means Cluster Analysis

6 videos (Total 42 min), 3 readings, 1 quiz
6 videos
Running a k-Means Cluster Analysis in SAS, pt. 18m
Running a k-Means Cluster Analysis in SAS, pt. 26m
Running a k-Means Cluster Analysis in Python, pt. 18m
Running a k-Means Cluster Analysis in Python, pt. 210m
k-Means Cluster Analysis Limitations2m
3 readings
SAS Code: k-Means Cluster Analysis10m
Python Code: k-Means Cluster Analysis10m
Assignment Example10m
4.2
50 ReviewsChevron Right

29%

started a new career after completing these courses

36%

got a tangible career benefit from this course

17%

got a pay increase or promotion

Top reviews from Machine Learning for Data Analysis

By BCOct 5th 2016

Very good course. I recommend to anyone who's interested in data analysis and machine learning.

By EMJun 26th 2016

Good introduction with python example for famous algorithm such as random forest and k-mean

Instructors

Avatar

Jen Rose

Research Professor
Psychology
Avatar

Lisa Dierker

Professor
Psychology

About Wesleyan University

At Wesleyan, distinguished scholar-teachers work closely with students, taking advantage of fluidity among disciplines to explore the world with a variety of tools. The university seeks to build a diverse, energetic community of students, faculty, and staff who think critically and creatively and who value independence of mind and generosity of spirit. ...

About the Data Analysis and Interpretation Specialization

Learn SAS or Python programming, expand your knowledge of analytical methods and applications, and conduct original research to inform complex decisions. The Data Analysis and Interpretation Specialization takes you from data novice to data expert in just four project-based courses. You will apply basic data science tools, including data management and visualization, modeling, and machine learning using your choice of either SAS or Python, including pandas and Scikit-learn. Throughout the Specialization, you will analyze a research question of your choice and summarize your insights. In the Capstone Project, you will use real data to address an important issue in society, and report your findings in a professional-quality report. You will have the opportunity to work with our industry partners, DRIVENDATA and The Connection. Help DRIVENDATA solve some of the world's biggest social challenges by joining one of their competitions, or help The Connection better understand recidivism risk for people on parole in substance use treatment. Regular feedback from peers will provide you a chance to reshape your question. This Specialization is designed to help you whether you are considering a career in data, work in a context where supervisors are looking to you for data insights, or you just have some burning questions you want to explore. No prior experience is required. By the end you will have mastered statistical methods to conduct original research to inform complex decisions....
Data Analysis and Interpretation

Frequently Asked Questions

  • Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

  • When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

More questions? Visit the Learner Help Center.