University of Pittsburgh

Cloud Computing for Data Science Specialization

University of Pittsburgh

Cloud Computing for Data Science Specialization

Build Scalable Cloud Solutions for Data Science. Master distributed computing, APIs, and big data tools for cloud-driven analytics.

Dmitriy Babichenko

Instructor: Dmitriy Babichenko

Access provided by Beko

Get in-depth knowledge of a subject
Intermediate level

Recommended experience

4 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Build toward a degree
Get in-depth knowledge of a subject
Intermediate level

Recommended experience

4 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Build toward a degree

What you'll learn

  • Design and deploy scalable cloud architectures for data-driven applications.

  • Build and integrate RESTful web services within distributed systems.

  • Implement big data workflows using Hadoop and Spark frameworks.

  • Apply containerization and virtualization for efficient cloud deployment.

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English
Recently updated!

February 2026

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Advance your subject-matter expertise

  • Learn in-demand skills from university and industry experts
  • Master a subject or tool with hands-on projects
  • Develop a deep understanding of key concepts
  • Earn a career certificate from University of Pittsburgh

Specialization - 3 course series

Cloud Computing Fundamentals

Cloud Computing Fundamentals

Course 1 16 hours

What you'll learn

  • Configure and deploy virtual machines to simulate cloud environments and understand IaaS, PaaS, and SaaS service models.

  • Differentiate between databases, data warehouses, and data lakes while applying Star and Snowflake schemas for optimal performance.

  • Compare MySQL, MongoDB, and Neo4j database technologies based on ACID and BASE properties to select optimal solutions for use cases.

  • Apply cloud computing principles through hands-on projects using Python, GitHub, and virtualization tools in real-world scenarios.

Skills you'll gain

Category: MongoDB
Category: SQL
Category: MySQL
Category: Data Warehousing
Category: Python Programming
Category: Cloud Computing Architecture
Category: Virtual Machines
Category: Database Management
Category: Cloud Platforms
Category: NoSQL
Category: Cloud Services
Category: Git (Version Control System)
Category: Application Programming Interface (API)
Category: Cloud Computing
Category: Database Design
Category: Data Infrastructure
Category: Scalability
Category: Cloud Infrastructure
Category: Virtualization
Category: Cloud Storage

What you'll learn

  • Analyze key architectural styles in distributed systems and their scalability trade-offs.

  • Design and implement RESTful web services for reliable system communication.

  • Deploy and manage containerized applications using Docker in virtualized environments.

  • Integrate cloud storage and distributed data systems for scalable application design.

Skills you'll gain

Category: JSON
Category: Virtualization
Category: Extensible Markup Language (XML)
Category: Containerization
Category: Restful API
Category: Docker (Software)
Category: Scalability
Category: Software Architecture
Category: Cloud Storage
Category: Distributed Computing
Category: Microservices
Category: Cloud Computing
Category: Cloud Applications
Category: Cloud Computing Architecture
Category: Flask (Web Framework)
Category: Google Cloud Platform
Category: Computer Architecture
Category: Web Services
Category: Cloud Infrastructure
Category: API Design

What you'll learn

  • Explain how Hadoop and Spark enable large-scale data processing.

  • Build and manage distributed data pipelines using Hadoop frameworks.

  • Implement in-memory analytics and real-time processing with Spark.

  • Apply big data tools to design scalable, data-driven applications.

Skills you'll gain

Category: Predictive Modeling
Category: Data Pipelines
Category: Data Science
Category: Data Transformation
Category: Scalability
Category: Distributed Computing
Category: PySpark
Category: Apache Spark
Category: Data Processing
Category: Data Storage Technologies
Category: Scikit Learn (Machine Learning Library)
Category: Apache Hive
Category: Data Analysis
Category: Apache Hadoop
Category: Data Management
Category: Data Storage
Category: Python Programming
Category: Information Technology
Category: Big Data

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Build toward a degree

This Specialization is part of the following degree program(s) offered by University of Pittsburgh. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹

 

Instructor

Dmitriy Babichenko
University of Pittsburgh
4 Courses 1,845 learners

Offered by

Why people choose Coursera for their career

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."