When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 7 modules in this course
This course focuses on essential data analysis using Excel. Learn to design and implement realistic predictive models to reduce uncertainty for informed business decisions. In a hands-on project, you'll act as a business data analyst, building models to assess credit card applications, minimize default risk and maximize bank profits. You'll master key uncertainty measures like classification error rates, entropy of information, and confidence intervals for linear regression. Assignments use data provided within the course and basic Excel functions, ensuring fluency for future business applications. No prior knowledge of advanced Excel features (Visual Basic, Pivot Tables) is required. The Excel and data analysis skills you will learn will enable you to apply business data analysis methods based on binary classification, information theory and entropy measures, and linear regression, and prepare you for roles such as a business data analyst.
This course will prepare you to design and implement realistic predictive models based on data. In the Final Project (module 6) you will assume the role of a business data analyst for a bank, and develop two different predictive models to determine which applicants for credit cards should be accepted and which rejected. Your first model will focus on minimizing default risk, and your second on maximizing bank profits. The two models should demonstrate to you in a practical, hands-on way the idea that your choice of business metric drives your choice of an optimal model.The second big idea this course seeks to demonstrate is that your data-analysis results cannot and should not aim to eliminate all uncertainty. Your role as a data-analyst is to reduce uncertainty for decision-makers by a financially valuable increment, while quantifying how much uncertainty remains. You will learn to calculate and apply to real-world examples the most important uncertainty measures used in business, including classification error rates, entropy of information, and confidence intervals for linear regression. All the data you need is provided within the course, and all assignments are designed to be done in MS Excel. The course will give you enough practice with Excel to become fluent in its most commonly used business functions, and you’ll be ready to learn any other Excel functionality you might need in future (module 1). The course does not cover Visual Basic or Pivot Tables and you will not need them to complete the assignments. All advanced concepts are demonstrated in individual Excel spreadsheet templates that you can use to answer relevant questions. You will emerge with substantial vocabulary and practical knowledge of how to apply business data analysis methods based on binary classification (module 2), information theory and entropy measures (module 3), and linear regression (module 4 and 5), all using no software tools more complex than Excel.
What's included
2 videos3 readings
Show info about module content
2 videos•Total 11 minutes
About This Specialization•4 minutes
Introduction to Mastering Data Analysis in Excel•6 minutes
3 readings•Total 25 minutes
Specialization Overview•10 minutes
Report a problem with the course•5 minutes
Course Overview•10 minutes
Excel Essentials for Beginners
Module 2•2 hours to complete
Module details
In this module, will explore the essential Excel skills to address typical business situations you may encounter in the future. The Excel vocabulary and functions taught throughout this module make it possible for you to understand the additional explanatory Excel spreadsheets that accompany later videos in this course.
What's included
8 videos1 reading2 assignments
Show info about module content
8 videos•Total 52 minutes
Introduction to Using Excel in this Course•6 minutes
Basic Excel Vocabulary; Intro to Charting•7 minutes
Arithmetic in Excel•2 minutes
Functions on Individual Cells•4 minutes
Functions on a Set of Numbers•10 minutes
Functions on Ordered Pairs of Data•9 minutes
Sorting Data in Excel•5 minutes
Introduction to the Solver Plug-in•9 minutes
1 reading•Total 10 minutes
Tips for Success•10 minutes
2 assignments•Total 60 minutes
Excel Essentials Practice•30 minutes
Excel Essentials•30 minutes
Binary Classification
Module 3•2 hours to complete
Module details
Separating collections into two categories, such as “buy this stock, don’t but that stock” or “target this customer with a special offer, but not that one” is the ultimate goal of most business data-analysis projects. There is a specialized vocabulary of measures for comparing and optimizing the performance of the algorithms used to classify collections into two groups. You will learn how and why to apply these different metrics, including how to calculate the all-important AUC: the area under the Receiver Operating Characteristic (ROC) Curve.
What's included
6 videos1 reading2 assignments
Show info about module content
6 videos•Total 46 minutes
Introduction to Binary Classification•8 minutes
Bombers and Seagulls: Confusion Matrix•9 minutes
Costs Determine Optimal Threshold•5 minutes
Calculating Positive and Negative Predictive Values•6 minutes
How to Calculate the Area Under the ROC Curve•12 minutes
Binary Classification with More than One Input Variable•7 minutes
1 reading•Total 10 minutes
Tips for Success•10 minutes
2 assignments•Total 75 minutes
Binary Classification (practice)•30 minutes
Binary Classification (graded)•45 minutes
Information Measures
Module 4•2 hours to complete
Module details
In this module, you will learn how to calculate and apply the vitally useful uncertainty metric known as “entropy.” In contrast to the more familiar “probability” that represents the uncertainty that a single outcome will occur, “entropy” quantifies the aggregate uncertainty of all possible outcomes.
The entropy measure provides the framework for accountability in data-analytic work. Entropy gives you the power to quantify the uncertainty of future outcomes relevant to your business twice: using the best-available estimates before you begin a project, and then again after you have built a predictive model.
The difference between the two measures is the Information Gain contributed by your work.
What's included
7 videos1 reading2 assignments
Show info about module content
7 videos•Total 42 minutes
Quantifying the Informational Edge•1 minute
Probability and Entropy•7 minutes
Entropy of a Guessing Game•7 minutes
Dependence and Mutual Information•3 minutes
The Monty Hall Problem•9 minutes
Learning from One Coin Toss, Part 1•5 minutes
Learning From One Coin Toss, Part 2•9 minutes
1 reading•Total 10 minutes
Tips for Success•10 minutes
2 assignments•Total 75 minutes
Using the Information Gain Calculator Spreadsheet (practice)•30 minutes
Information Measures (graded)•45 minutes
Linear Regression
Module 5•3 hours to complete
Module details
The Linear Correlation measure is a much richer metric for evaluating associations than is commonly realized. You can use it to quantify how much a linear model reduces uncertainty. When used to forecast future outcomes, it can be converted into a “point estimate” plus a “confidence interval,” or converted into an information gain measure. You will develop a fluent knowledge of these concepts and the many valuable uses to which linear regression is put in business data analysis. This module also teaches how to use the Central Limit Theorem (CLT) to solve practical problems. The two topics are closely related because regression and the CLT both make use of a special family of probability distributions called “Gaussians.” You will learn everything you need to know to work with Gaussians in these and other contexts.
What's included
11 videos1 reading3 assignments
Show info about module content
11 videos•Total 73 minutes
Introducing the Gaussian•1 minute
Introduction to Standardization•5 minutes
Standard Normal Probability Distribution in Excel•7 minutes
Calculating Probabilities from Z-scores•5 minutes
Central Limit Theorem•3 minutes
Algebra with Gaussians•7 minutes
Markowitz Portfolio Optimization•13 minutes
Standardizing x and y Coordinates for Linear Regression•6 minutes
Standardization Simplifies Linear Regression•9 minutes
Modeling Error in Linear Regression•10 minutes
Information Gain from Linear Regression•6 minutes
1 reading•Total 10 minutes
Tips for Success•10 minutes
3 assignments•Total 120 minutes
The Gaussian (practice)•30 minutes
Regression Models and PIG (practice)•45 minutes
Parametric Models for Regression (graded)•45 minutes
Additional Skills for Model Building
Module 6•1 hour to complete
Module details
This module gives you additional valuable concepts and skills related to building high-quality models.
As you know, a “model” is a description of a process applied to available data (inputs) that produces an estimate of a future and as yet unknown outcome as output.
Very often, models for outputs take the form of a probability distribution. This module covers how to estimate probability distributions from data (a “probability histogram”), and how to describe and generate the most useful probability distributions used by data scientists. It also covers in detail how to develop a binary classification model with parameters optimized to maximize the AUC, and how to apply linear regression models when your input consists of multiple types of data for each event.
The module concludes with an explanation of “over-fitting” which is the main reason that apparently good predictive models often fail in real life business settings. We conclude with some tips for how you can avoid over-fitting in you own predictive model for the final project – and in real life.
What's included
4 videos1 reading1 assignment
Show info about module content
4 videos•Total 37 minutes
Describing Histograms and Probability Distributions Functions•9 minutes
Some Important and Frequently Encountered PDFs•8 minutes
Linear Regression with More than One Input Variable•5 minutes
Understanding Why Over-fitting Happens•15 minutes
1 reading•Total 10 minutes
AUC Calculator Explanation and Spreadsheet•10 minutes
1 assignment•Total 30 minutes
Probability, AUC, and Excel Linest Function•30 minutes
Final Course Project
Module 7•10 hours to complete
Module details
The final course project is a comprehensive assessment covering all of the course material, and consists of four quizzes and a peer review assignment. For quiz one and quiz two, there are learning points that explain components of the quiz. These learning points will unlock only after you complete the quiz with a passing grade. Before you start, please read through the final project instructions. From past student experience, the final project which includes all the quizzes and peer assessment, takes anywhere from 10-12 hours.
What's included
2 videos4 readings4 assignments1 peer review
Show info about module content
2 videos•Total 14 minutes
Final Project Information: Part 1•3 minutes
Final Project Information: Part 2•11 minutes
4 readings•Total 50 minutes
Final Project Information•20 minutes
Summary of Learning Points for Final Project: Quiz 1•10 minutes
Summary of Learning Points for Final Project: Quiz 2•10 minutes
Share your learning experience•10 minutes
4 assignments•Total 420 minutes
Part 1: Building your Own Binary Classification Model•60 minutes
Part 2: Should the Bank Buy Third-Party Credit Information?•120 minutes
Part 3: Comparing the Information Gain of Alternative Data and Models•120 minutes
Part 4: Modeling Profitability Instead of Default•120 minutes
1 peer review•Total 120 minutes
Part 5: Modeling Credit Card Default Risk and Customer Profitability•120 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Instructor ratings
Instructor ratings
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
Duke University has about 13,000 undergraduate and graduate students and a world-class faculty helping to expand the frontiers of knowledge. The university has a strong commitment to applying knowledge in service to society, both near its North Carolina campus and around the world.
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Learner reviews
4.2
3,953 reviews
5 stars
56.27%
4 stars
23.96%
3 stars
9.18%
2 stars
5.31%
1 star
5.26%
Showing 3 of 3953
B
BR
5·
Reviewed on Jan 16, 2017
I very much enjoyed this course. I've been able to apply some of the strategies I learned in the specialization and I look forward to using it as a resource moving forward.
P
PS
4·
Reviewed on May 7, 2020
Quite comprehensive on the usage of concepts taught in the course. However, bit of diversion seems to come in assignments and quizzes. Overall, very challenging and fulfilling.
J
JE
5·
Reviewed on Oct 30, 2015
The course deserves a 5-star rating because: (1) content is relevant, (2) the professor is concise and possesses great teaching skills, and (3) the learning modules are applicable to daily problems.
Will I receive a transcript from Duke University for completing this course?
No. Completion of a Coursera course does not earn you academic credit from Duke; therefore, Duke is not able to provide you with a university transcript. However, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.