Manipulating big data distributed over a cluster using functional concepts is widespread in industry, and is arguably one of the first large-scale industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop and, more recently, Apache Spark, a fast, in-memory distributed collections framework written in Scala. In this course, we'll see how the data-parallel paradigm can be extended to the distributed case, using Spark throughout. We'll cover Spark's programming model in detail, being careful to understand how and when it differs from familiar programming models, like shared-memory parallel collections or sequential Scala collections. Through hands-on examples in Spark and Scala, we'll learn when important issues related to distribution, like latency and network communication, should be considered and how they can be addressed effectively for improved performance.

Big Data Analysis with Scala and Spark (Scala 2 version)

Instructor: Prof. Heather Miller
2,519 already enrolled
Intermediate level: some related experience required
3 weeks to complete at 10 hours a week
Flexible schedule: learn at your own pace
Shareable certificate
Add to your LinkedIn profile
Taught in English
There are 4 modules in this course
Offered by
École Polytechnique Fédérale de Lausanne




