This course will teach you efficient and scalable data labeling for ML and various business processes. The key here is the crowdsourcing approach, based on splitting complex challenges into small tasks and distributing them among a vast cloud of performers.
Offered By
Practical Crowdsourcing for Efficient Machine Learning
YandexAbout this Course
General understanding of ML and AI. Basic knowledge of HTML, JS, and CSS is an advantage
What you will learn
Understand the applicability, benefits and limits of the crowdsourcing approach
Integrate an on-demand workforce directly into your processes and build human-in-the-loop processes
Control the quality and accuracy of data labeling to develop high performing ML models
Design and run a full-cycle crowdsourcing project: from planning to getting labeled data
Skills you will gain
General understanding of ML and AI. Basic knowledge of HTML, JS, and CSS is an advantage
Offered by

Yandex
Yandex is a technology company that builds intelligent products and services powered by machine learning. Our goal is to help consumers and businesses better navigate the online and offline world.
Syllabus - What you will learn from this course
Introduction to crowdsourcing
We will start the course with discussing what crowdsourcing is and how it is applicable to Machine Learning. By showing examples of large-scale data labeling processes we will learn how diverse and powerful crowdsourcing is. We will also go through the steps necessary to prepare a crowdsourcing projects. This basic understanding will be developed in the following weeks, as well as your own crowdsourcing projects. This time you will choose a project most relevant to you and draft its pipeline. Last but not least – you will meet a team of Yandex’s Crowd Solutions Architects. They will give a short introduction to their crowdsourcing projects and share experience on how to design an efficient task pipeline.
Instructions and interfaces
This week we will dive into designing crowdsourcing projects. After a task has been decomposed to smaller pieces, it is time to create interfaces and guidelines. We will go through some tips on performer-friendly interface design and learn how to compose guidelines that will help performers along the way.
Quality control
It’s time to talk about ensuring data quality. This week we will discuss how to select and train performers and learn how to configure quality checks depending on task specifics. Most crowdsourcing platforms offer a vide range of quality control mechanisms, but it is important to choose those that are most applicable to your task.
Smart techniques to enhance quality
This week is an introduction to the research field dealing with crowdsourcing challenges. It is a variety of topics that mostly follow the same goal: get more quality while keeping budget limits.
Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I purchase the Certificate?
Is financial aid available?
More questions? Visit the Learner Help Center.