This self-paced IBM course introduces big data. You will become familiar with the characteristics of big data and its application in big data analytics. You will also gain hands-on experience with big data processing tools such as Apache Hadoop and Apache Spark.

Introduction to Big Data with Spark and Hadoop

This course is part of multiple programs.



Instructors: Aije Egwaikhide +2 more
76,048 already enrolled
482 reviews
What you'll learn
Explain the impact of big data, including use cases, tools, and processing methods.
Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.
Apply Spark programming basics, including parallel programming basics for DataFrames, Datasets, and Spark SQL.
Use Spark’s RDDs and Datasets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.
Skills you'll gain
- Open Source Technology
- Distributed Computing
- Big Data
- Data Transformation
- Debugging
- Development Environment
- Scalability
- Data Processing
- Performance Tuning
Tools you'll learn
- IBM Cloud
- PySpark
- Kubernetes
- Apache Spark
- Apache Hadoop
- Docker (Software)
- Apache Hive
Details to know
14 assignments
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 7 modules in this course
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Learner reviews
- 5 stars: 66.39%
- 4 stars: 19.08%
- 3 stars: 8.09%
- 2 stars: 2.90%
- 1 star: 3.52%
Showing 3 of 482
Reviewed on May 1, 2022
The hands-on labs and quizzes at the end of each session were very helpful.
Reviewed on Jan 15, 2024
A great program for exploring AI and Big Data.
Reviewed on Nov 8, 2022
All the things I need to know about Big Data, Spark, Hadoop, and Hive, explained in detail.