Statistical Data Visualization with Seaborn

4.6
stars
141 ratings
Offered By
Coursera Project Network
6,257 already enrolled
In this Guided Project, you will:

Produce and customize various chart types with Seaborn

Apply feature selection and feature extraction methods with scikit-learn

Build a boosted decision tree classifier with XGBoost

Clock1.5 hours
IntermediateIntermediate
CloudNo download needed
VideoSplit-screen video
Comment DotsEnglish
LaptopDesktop only

Welcome to this project-based course on Statistical Data Visualization with Seaborn. Producing visualizations is an important first step in exploring and analyzing real-world data sets. As such, visualization is an indispensable method in any data scientist's toolbox. It is also a powerful tool to identify problems in analyses and for illustrating results. In this project, we will employ the statistical data visualization library, Seaborn, to discover and explore the relationships in the Breast Cancer Wisconsin (Diagnostic) data set. We will use the results from our exploratory data analysis (EDA) in the previous project, Breast Cancer Diagnosis – Exploratory Data Analysis to: drop correlated features, implement feature selection and feature extraction methods including feature selection with correlation, univariate feature selection, recursive feature elimination, principal component analysis (PCA) and tree based feature selection methods. Lastly, we will build a boosted decision tree classifier with XGBoost to classify tumors as either malignant or benign. This course runs on Coursera's hands-on project platform called Rhyme. On Rhyme, you do projects in a hands-on manner in your browser. You will get instant access to pre-configured cloud desktops containing all of the software and data you need for the project. Everything is already set up directly in your internet browser so you can just focus on learning. For this project, you’ll get instant access to a cloud desktop with Python, Jupyter, and scikit-learn pre-installed. Notes: - You will be able to access the cloud desktop 5 times. However, you will be able to access instructions videos as many times as you want. - This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

Skills you will develop

Data ScienceMachine LearningPython ProgrammingSeabornData Visualization (DataViz)

Learn step-by-step

In a video that plays in a split-screen with your work area, your instructor will walk you through these steps:

  1. Project Overview

  2. Importing Libraries and Data

  3. Dropping Correlated Columns from Feature List

  4. Classification using XGBoost (minimal feature selection)

  5. Univariate Feature Selection

  6. Recursive Feature Elimination with Cross-Validation

  7. Plot CV Scores vs Number of Features Selected

  8. Feature Extraction using Principal Component Analysis

How Guided Projects work

Your workspace is a cloud desktop right in your browser, no download required

In a split-screen video, your instructor guides you step-by-step

Reviews

TOP REVIEWS FROM STATISTICAL DATA VISUALIZATION WITH SEABORN

View all reviews

Frequently asked questions

Frequently Asked Questions

More questions? Visit the Learner Help Center.