About this Course
308 ratings
76 reviews

100% online

Start instantly and learn on your own schedule.

Flexible deadlines

Reset deadlines in accordance with your schedule.

Advanced Level

Approx. 35 hours to complete

Suggested: 5 weeks of study, 4-5 hours per week...


Subtitles: English

Skills you will gain

Chatterbot, Tensorflow, Deep Learning, Natural Language Processing

Syllabus - What you will learn from this course

5 hours to complete

Intro and text classification

This module has two parts: first, a broad overview of the NLP area and our course goals, and second, a text classification task. Text classification is probably the most popular task you will deal with in real life: news flow classification, sentiment analysis, spam filtering, etc. You will learn how to go from raw texts to predicted classes with both traditional methods (e.g. linear classifiers) and deep learning techniques (e.g. Convolutional Neural Nets)....
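As a rough illustration of going from raw texts to predicted classes with a linear classifier, here is a minimal sketch using bag-of-words counts and a perceptron. The toy sentences, the function names, and the choice of a perceptron are assumptions made for this example, not material taken from the course.

```python
from collections import Counter

# Toy sentiment data, invented for illustration: +1 = positive, -1 = negative.
TRAIN = [
    ("great movie loved it", 1),
    ("terrible movie hated it", -1),
    ("loved the acting", 1),
    ("hated the plot", -1),
]

def featurize(text):
    """Bag-of-words features: lowercase, split on whitespace, count tokens."""
    return Counter(text.lower().split())

def score(weights, bias, feats):
    """Linear score: bias plus the weighted sum of token counts."""
    return bias + sum(weights[w] * c for w, c in feats.items())

def train_perceptron(examples, epochs=10):
    """Update the weights only when the current prediction is wrong."""
    weights, bias = Counter(), 0
    for _ in range(epochs):
        for text, label in examples:
            feats = featurize(text)
            pred = 1 if score(weights, bias, feats) > 0 else -1
            if pred != label:
                for w, c in feats.items():
                    weights[w] += label * c
                bias += label
    return weights, bias

def predict(weights, bias, text):
    return 1 if score(weights, bias, featurize(text)) > 0 else -1

weights, bias = train_perceptron(TRAIN)
print(predict(weights, bias, "loved it"))
print(predict(weights, bias, "hated it"))
```

On this tiny separable dataset the perceptron converges within a couple of passes; real tasks would need proper tokenization, more data, and a held-out evaluation set.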
11 videos (Total 114 min), 3 readings, 3 quizzes
11 videos
Welcome video (5m)
Main approaches in NLP (7m)
Brief overview of the next weeks (7m)
[Optional] Linguistic knowledge in NLP (10m)
Text preprocessing (14m)
Feature extraction from text (14m)
Linear models for sentiment analysis (10m)
Hashing trick in spam filtering (17m)
Neural networks for words (14m)
Neural networks for characters (8m)
3 readings
Prerequisites check-list (2m)
Hardware for the course (5m)
Getting started with practical assignments (20m)
2 practice exercises
Classical text mining (10m)
Simple neural networks for text (10m)
5 hours to complete

Language modeling and sequence tagging

In this module we will treat texts as sequences of words. You will learn how to predict the next word given some previous words. This task is called language modeling, and it is used for search suggestions, machine translation, chat-bots, etc. You will also learn how to predict a sequence of tags for a sequence of words. This can be used to determine part-of-speech tags, named entities, or any other tags, e.g. ORIG and DEST in the query "flights from Moscow to Zurich". We will cover methods based on probabilistic graphical models and deep learning....
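To make the language modeling task concrete, the following is a minimal sketch of a bigram model with add-one (Laplace) smoothing and a perplexity computation. The three-sentence corpus and all function names are assumptions for illustration; the course may use different smoothing methods and data.

```python
import math
from collections import Counter

# Toy corpus, invented for illustration.
CORPUS = [
    "flights from moscow to zurich",
    "flights from zurich to moscow",
    "trains from moscow to zurich",
]

def pad(sentence):
    """Add start and end markers around the whitespace-split tokens."""
    return ["<s>"] + sentence.split() + ["</s>"]

def train_bigram(sentences):
    """Count contexts and bigrams; return (context_counts, bigram_counts, vocab)."""
    contexts, bigrams, vocab = Counter(), Counter(), set()
    for s in sentences:
        tokens = pad(s)
        vocab.update(tokens)
        for prev, cur in zip(tokens, tokens[1:]):
            contexts[prev] += 1
            bigrams[(prev, cur)] += 1
    return contexts, bigrams, vocab

def prob(model, prev, cur):
    """P(cur | prev) with add-one smoothing over the vocabulary."""
    contexts, bigrams, vocab = model
    return (bigrams[(prev, cur)] + 1) / (contexts[prev] + len(vocab))

def perplexity(model, sentence):
    """exp of the average negative log-probability per predicted token."""
    tokens = pad(sentence)
    log_sum = sum(math.log(prob(model, p, c)) for p, c in zip(tokens, tokens[1:]))
    return math.exp(-log_sum / (len(tokens) - 1))

MODEL = train_bigram(CORPUS)
print(perplexity(MODEL, "flights from moscow to zurich"))
print(perplexity(MODEL, "zurich to from moscow flights"))
```

A sentence seen in training should get a lower perplexity than a scrambled version of it, because the scrambled word pairs are mostly unseen and receive only the smoothed probability mass.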
8 videos (Total 84 min), 2 readings, 3 quizzes
8 videos
Perplexity: is our model surprised with a real text? (8m)
Smoothing: what if we see new n-grams? (7m)
Hidden Markov Models (13m)
Viterbi algorithm: what are the most probable tags? (11m)
MEMMs, CRFs and other sequential models for Named Entity Recognition (11m)
Neural Language Models (9m)
Whether you need to predict a next word or a label - LSTM is here to help! (11m)
2 readings
Perplexity computation (10m)
Probabilities of tag sequences in HMMs (20m)
2 practice exercises
Language modeling (15m)
Sequence tagging with probabilistic models (20m)
5 hours to complete

Vector Space Models of Semantics

This module is devoted to a higher abstraction for texts: we will learn vectors that represent meanings. First, we will discuss traditional models of distributional semantics. They are based on a very intuitive idea: "you shall know the word by the company it keeps". Second, we will cover modern tools for word and sentence embeddings, such as word2vec, FastText, StarSpace, etc. Finally, we will discuss how to embed whole documents with topic models and how these models can be used for search and data exploration....
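The distributional idea can be sketched in a few lines: represent each word by counts of the words that appear near it, then compare words by cosine similarity. The toy sentences, the window size, and the function names below are assumptions for illustration; tools such as word2vec learn dense vectors instead of raw counts.

```python
import math
from collections import Counter, defaultdict

# Toy sentences, invented for illustration.
SENTENCES = [
    "the cat drinks milk",
    "the dog drinks water",
    "the cat chases the dog",
    "stocks fell on monday",
]

def cooccurrence_vectors(sentences, window=2):
    """Map each word to a Counter of the words seen within `window` positions."""
    vectors = defaultdict(Counter)
    for s in sentences:
        tokens = s.lower().split()
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

vecs = cooccurrence_vectors(SENTENCES)
print(cosine(vecs["cat"], vecs["dog"]))
print(cosine(vecs["cat"], vecs["stocks"]))
```

Here "cat" and "dog" share context words such as "the", "drinks", and "chases", so they come out as more similar than "cat" and "stocks", which share none.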
8 videos (Total 83 min), 3 quizzes
8 videos
Explicit and implicit matrix factorization (13m)
Word2vec and doc2vec (and how to evaluate them) (10m)
Word analogies without magic: king – man + woman != queen (11m)
Why words? From character to sentence embeddings (11m)
Topic modeling: a way to navigate through text collections (7m)
How to train PLSA? (6m)
The zoo of topic models (13m)
2 practice exercises
Word and sentence embeddings (15m)
Topic Models (10m)
5 hours to complete

Sequence to sequence tasks

Nearly any task in NLP can be formulated as a sequence to sequence task: machine translation, summarization, question answering, and many more. In this module we will learn a general encoder-decoder-attention architecture that can be used to solve them. We will cover machine translation in more detail, and you will see how the attention technique resembles the word alignment task in the traditional pipeline....
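The link between attention and word alignment can be seen in a small sketch: for each decoder step, attention produces a soft distribution over source positions. The example below uses dot-product scoring, which is only one of several scoring functions; the vectors and function names are assumptions for illustration, not the course's implementation.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def attend(decoder_state, encoder_states):
    """Return (weights, context): a soft alignment over source positions
    and the weighted average of the encoder states."""
    weights = softmax([dot(decoder_state, h) for h in encoder_states])
    dim = len(encoder_states[0])
    context = [
        sum(w * h[i] for w, h in zip(weights, encoder_states))
        for i in range(dim)
    ]
    return weights, context

# Hypothetical 2-dimensional encoder states for three source words.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
decoder_state = [0.0, 3.0]

weights, context = attend(decoder_state, encoder_states)
print(weights)
print(context)
```

The weights sum to one and peak at the source position whose encoder state best matches the decoder state, which plays a role similar to a soft word alignment.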
9 videos (Total 98 min), 4 quizzes
9 videos
Noisy channel: said in English, received in French (6m)
Word Alignment Models (12m)
Encoder-decoder architecture (6m)
Attention mechanism (9m)
How to deal with a vocabulary? (12m)
How to implement a conversational chat-bot? (11m)
Sequence to sequence learning: one-size fits all? (10m)
Get to the point! Summarization with pointer-generator networks (12m)
3 practice exercises
Introduction to machine translation (10m)
Encoder-decoder architectures (20m)
Summarization and simplification (15m)
76 Reviews


started a new career after completing these courses


got a tangible career benefit from this course


got a pay increase or promotion

Top Reviews

By GY, Mar 24th 2018

Great thanks to this amazing course! I learned a lot on state-of-the-art natural language processing techniques! Really like your awesome programming assignments! See you HSE guys in next class!

By MV, Mar 18th 2019

Definitely best course in the Specialization! Lecturers, projects and forum - everything is super organized. Only StarSpace was pain in the ass, but I managed :)



Anna Potapenko

HSE Faculty of Computer Science

Alexey Zobnin

Associate professor
HSE Faculty of Computer Science

Anna Kozlova

Team Lead

Sergey Yudin


Andrei Zimovnov

Senior Lecturer
HSE Faculty of Computer Science

About National Research University Higher School of Economics

National Research University - Higher School of Economics (HSE) is one of the top research universities in Russia. Established in 1992 to promote new research and teaching in economics and related disciplines, it now offers programs at all levels of university education across an extraordinary range of fields of study including business, sociology, cultural studies, philosophy, political science, international relations, law, Asian studies, media and communications, mathematics, engineering, and more. Learn more on www.hse.ru...

About the Advanced Machine Learning Specialization

This specialization gives an introduction to deep learning, reinforcement learning, natural language understanding, computer vision and Bayesian methods. Top Kaggle machine learning practitioners and CERN scientists will share their experience of solving real-world problems and help you to fill the gaps between theory and practice. Upon completion of 7 courses you will be able to apply modern machine learning methods in enterprise and understand the caveats of real-world data and settings....
Advanced Machine Learning

Frequently Asked Questions

  • Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

  • When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

More questions? Visit the Learner Help Center.