University of Colorado Boulder
Data Analysis with Python Project
University of Colorado Boulder

Data Analysis with Python Project

Di Wu

Instructor: Di Wu

Access provided by Equatorial Coca-Cola Bottling Company

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Define the scope and direction of a data analysis project, identifying appropriate techniques and methodologies for achieving project objectives.

  • Apply various classification and regression algorithms and implement cross-validation and ensemble techniques to enhance the performance of models.

  • Apply various clustering, dimension reduction association rule mining, and outlier detection algorithms for unsupervised learning models.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

1 assignment

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Data Analysis with Python Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 7 modules in this course

In this first week, you will gain an overview of data analysis, understanding supervised and unsupervised learning directions. You will learn how to define the scope and direction of their data analysis project effectively.

What's included

1 reading

This week focuses on classification techniques, where you will explore Nearest Neighbors, Decision Trees, SVM, Naive Bayes, Logistic Regression, cross-validation, ensemble methods, and evaluation metrics.

What's included

1 reading

This week you will delve into regression techniques, including Simple Linear, Polynomial Linear, Linear with regularization, multivariate regression, cross-validation, ensemble methods, and evaluation metrics.

What's included

1 reading

This week introduces clustering techniques, including partitioning, hierarchical, density-based, and grid-based methods, for unsupervised pattern discovery.

What's included

1 reading

This week will focus on dimension reduction techniques, with a particular emphasis on Principal Component Analysis (PCA).

What's included

1 reading

This week focuses on a comprehensive case study where you will apply association rule mining and outlier detection techniques to solve a real-world problem.

What's included

1 reading

This final week focuses on outlier detection methods, including Zscore, IQR, OneClassSVM, Isolation Forest, DBSCAN, LOF, and contextual outliers.

What's included

2 readings1 assignment1 discussion prompt

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Di Wu
University of Colorado Boulder
21 Courses54,230 learners

Offered by

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Explore more from Data Science