91% of learners achieved a positive career outcome
Build your subject-matter expertise
This course is available as part of
When you enroll in this course, you'll also be asked to select a specific program.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 5 modules in this course
Description: This course delves into the world of data analysis with Python. You'll learn how to use libraries like pandas and Matplotlib to manipulate, analyze, and visualize data, extracting valuable insights and communicating findings effectively.
Benefits: Become proficient in data analysis techniques, enabling you to extract meaningful insights from data and present them in compelling visualizations.
By the end of this course, you'll be able to:
• Perform data cleaning, transformation, and manipulation using pandas.
• Create various types of visualizations using Matplotlib.
• Understand the fundamentals of generative AI and its applications in data analysis.
• Implement basic machine learning models for data analysis.
Tools/Software: Python, Jupyter Notebook, pandas, Matplotlib, Scikit-learn
This course is for entry-Level professionals looking to build a foundational understanding and experience with Python, while seeking employment as a Python developer. No prior work experience or degree is required.
This module provides a foundational understanding of data analysis and its role in various industries. Learners will explore the data analysis process, key concepts, and ethical considerations. They will also be introduced to essential Python libraries and tools like Jupyter Notebook, equipping them with the necessary skills to begin their data analysis journey. By the end of this module, learners will be able to define data analysis, differentiate it from data science, explain the data analysis process, identify key data analysis concepts, and set up their data analysis toolkit.
Data ethics and privacy: Navigating the responsible use of data•6 minutes
Data governance•6 minutes
Setting up your environment for data analysis•2 minutes
Jupyter notebook tips and tricks•6 minutes
Demo: Jupyter notebook shortcuts and productivity tips•6 minutes
Use cases for Python libraries•6 minutes
Understanding datasets•2 minutes
Finding and accessing real-world datasets•5 minutes
7 readings•Total 70 minutes
Data analysis and visualization with Python syllabus•10 minutes
Foundations of data analysis•10 minutes
The difference between data analysis and data science•10 minutes
Key concepts in data analysis•10 minutes
Essential Python libraries for data analysis•10 minutes
Common dataset types and sources•10 minutes
Data cleaning 101•10 minutes
5 assignments•Total 90 minutes
Unveiling data analysis•15 minutes
Activity: A simple analysis in Jupyter Notebook•15 minutes
Setting up your data analysis toolkit•15 minutes
Diving into datasets•15 minutes
Introduction to data analysis•30 minutes
1 discussion prompt•Total 5 minutes
What is visualization in data analysis?•5 minutes
Data processing and manipulation
Module 2•4 hours to complete
Module details
This module focuses on equipping learners with practical data processing and manipulation skills. Learners will be introduced to pandas, a powerful Python library, as a core tool for data manipulation. Learners will become proficient in using pandas dataFrames, mastering essential operations such as indexing, slicing, and filtering data. They will gain a thorough understanding of various indexing techniques (loc, iloc, boolean indexing) and their appropriate applications. The module emphasizes the importance of data cleaning for accurate analysis and guides learners through various techniques to identify and handle missing values and outliers. It also covers different data types in Python, enabling learners to make informed choices for their analysis. Learners will practice loading, inspecting, and transforming datasets using pandas functions, applying these skills to real-world scenarios. By the end of this module, learners will confidently leverage pandas to clean, transform, and prepare data for subsequent analysis and visualization, ensuring data integrity and reliability in their data analysis projects.
What's included
13 videos5 readings5 assignments
Show info about module content
13 videos•Total 60 minutes
Manipulating data with pandas•2 minutes
pandas Dataframes: The basics•6 minutes
Demo: Loading and inspecting datasets in pandas•5 minutes
Exploring data transformations•5 minutes
Demo: Transforming data with pandas•5 minutes
Exploratory data analysis (EDA)•2 minutes
The importance of data cleaning•5 minutes
Identifying and handling missing data•5 minutes
Handling duplicate values•6 minutes
Detecting and removing outliers•6 minutes
Data types in Python: Choosing the right fit•2 minutes
Demo: pandas for exploration and cleaning•5 minutes
Taming messy data with pandas•5 minutes
5 readings•Total 50 minutes
pandas indexing explained•10 minutes
pandas cheat sheet•10 minutes
Essential tactics for data manipulation•10 minutes
Common causes of missing data•10 minutes
pandas for essential analysis tasks•10 minutes
5 assignments•Total 95 minutes
Activity: Loading and inspecting datasets in pandas•20 minutes
pandas: Your data manipulation powerhouse•15 minutes
The hero of data analysis: Data cleaning•15 minutes
Using pandas for cleaning and exploration•15 minutes
Data processing and manipulation•30 minutes
Data visualization
Module 3•4 hours to complete
Module details
Module 3 focuses on the essential skill of data visualization. Learners examine a variety of visualization types, such as line charts, bar charts, and scatter plots, learning how to choose the most effective ones for different data and analysis goals. The module provides a comparison of popular visualization libraries, including Matplotlib, Plotly, and Bokeh, highlighting the unique strengths of each to help learners select the right tool. Learners gain practical experience creating visualizations with Matplotlib, mastering the basics of plot customization for clear and informative communication. The module also introduces advanced techniques with Plotly and Bokeh, enabling learners to design interactive and highly customized visualizations. It emphasizes the importance of communicating data insights effectively, teaching learners how to construct narratives with data. Learners are introduced to best practices for data visualization design, ensuring their visuals are clear, informative, and engaging. By the end of this module, learners will be able to transform data into impactful visuals that support effective communication and informed decision-making.
What's included
10 videos8 readings5 assignments
Show info about module content
10 videos•Total 49 minutes
Charting your data visually•2 minutes
Common visualizations•6 minutes
Introduction to Matplotlib•7 minutes
Explore visualization libraries•3 minutes
Interactive plots with Plotly•5 minutes
Customizing visualizations with Bokeh•5 minutes
Use data for storytelling•3 minutes
The art of data storytelling•6 minutes
Presenting data insights•6 minutes
Avoiding bias in conclusions•6 minutes
8 readings•Total 80 minutes
What is data visualization?•10 minutes
Anatomy of a Matplotlib Plot•10 minutes
Matplotlib gallery•10 minutes
Choosing the right visualization library•10 minutes
Plotly interactive dashboards•10 minutes
Strategies for data storytelling•10 minutes
Data visualization best practices•10 minutes
Cognitive load theory and data visualization•10 minutes
5 assignments•Total 95 minutes
Introduction to visualization•15 minutes
Creating visualizations•15 minutes
Interpreting and presenting data insights•15 minutes
Activity: Visualizing trends•20 minutes
Data visualization•30 minutes
Introduction to generative AI
Module 4•3 hours to complete
Module details
This module provides learners with a foundational understanding of generative AI, its applications, and ethical implications, along with practical techniques for leveraging it in data analysis and visualization. Learners will explore the core concepts of generative AI, including transformer models, large language models (LLMs), and natural language processing (NLP). They will delve into the distinctions between generative AI and other AI types, examining real-world applications across various sectors. The module also emphasizes the ethical considerations surrounding generative AI, covering topics such as ownership, authenticity, and responsible use of AI-generated content. Additionally, learners will gain hands-on experience with techniques for generating synthetic data using generative adversarial networks (GANs) and other models, and explore data augmentation methods for enhancing the size and diversity of datasets, ultimately improving the performance of machine learning models.
What's included
8 videos6 readings4 assignments
Show info about module content
8 videos•Total 38 minutes
What is generative AI?•6 minutes
Real-world applications of generative AI•7 minutes
The ethics of AI-Generated content•2 minutes
Filling the gaps in your data•2 minutes
Using Generative Adversarial Networks (GANs)•6 minutes
Data augmentation: Supercharging your dataset•2 minutes
Why augmentation matters•5 minutes
Text augmentation techniques•7 minutes
6 readings•Total 60 minutes
Generative AI vs. other AI•10 minutes
Ethical guidelines for generative AI•10 minutes
Introduction to synthetic data•10 minutes
Synthetic data generation techniques•10 minutes
Image augmentation techniques•10 minutes
Best practices for data augmentation•10 minutes
4 assignments•Total 75 minutes
Basics of generative AI•15 minutes
Generating synthetic data with GenAI•15 minutes
Data augmentation•15 minutes
Introduction to generative AI•30 minutes
Introduction to machine learning
Module 5•7 hours to complete
Module details
This module provides a foundational understanding of machine learning, its applications, and how to build basic models. Learners will explore core concepts like supervised and unsupervised learning, delve into model evaluation techniques using metrics like precision, recall, and F1-score, and gain hands-on experience building linear and logistic regression models with Scikit-learn. Additionally, the module covers the use of synthetic data in machine learning, including ethical considerations and practical applications.
Our goal at Microsoft is to empower every individual and organization on the planet to achieve more.
In this next revolution of digital transformation, growth is being driven by technology. Our integrated cloud approach creates an unmatched platform for digital transformation. We address the real-world needs of customers by seamlessly integrating Microsoft 365, Dynamics 365, LinkedIn, GitHub, Microsoft Power Platform, and Azure to unlock business value for every organization—from large enterprises to family-run businesses. The backbone and foundation of this is Azure.
OK
Why people choose Coursera for their career
Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Learner reviews
4.4
112 reviews
5 stars
63.15%
4 stars
23.68%
3 stars
5.26%
2 stars
2.63%
1 star
5.26%
Showing 3 of 112
T
TG
4·
Reviewed on Sep 24, 2025
This course was challenging. The content didn't flow or connect for me as the previous courses. Maybe dealing with life events, I lost some of my focus.
M
MM
4·
Reviewed on May 14, 2026
Was a great course, a lot of information and knowledge, but with some imbalance related to practice.
S
SB
5·
Reviewed on Nov 18, 2025
4.5 stars. it gets better and hands on towards the end
Frequently asked questions
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.