Are you curious about how chatbots hold conversations or how ChatGPT generates human-like responses? This course in Natural Language Processing (NLP) is your gateway into the fascinating world where language meets AI. Designed for students and professionals alike, the course blends essential theory with hands-on experience to equip you with the skills needed to build intelligent language systems.
We start by unravelling what makes language so complex—and why teaching machines to understand it is such a challenging task. You’ll explore the inner workings of Natural Language Understanding (NLU) and Generation (NLG), investigate real-world NLP applications, and dive into current trends like large language models (LLMs) and transformer-based systems.
From there, you’ll roll up your sleeves and learn core NLP techniques like tokenization, stemming, lemmatization, and sentence segmentation. You’ll master vector-based approaches like Bag of Words and TF-IDF, then progress to powerful word embeddings like Word2Vec, Skip-gram, and GloVe.
As you advance, you'll build language models, train simple neural networks, and explore cutting-edge tools in POS tagging, syntactic parsing, and semantic analysis. You’ll even touch the future with knowledge graphs and Word Sense Disambiguation. By the end, you’ll be ready to innovate in the fast-evolving NLP landscape.
Graduates of this NLP course can pursue roles such as NLP Engineer, Machine Learning Engineer, or Data Scientist with a focus on language technologies. Opportunities also exist in AI-driven fields like chatbots, voice assistants, sentiment analysis, and information retrieval. Advanced learners may explore careers in research, LLM fine-tuning, or knowledge graph development.
Are you ready to unlock the power of cutting-edge NLP skills? Join us on this exciting journey into the world of language, AI, and intelligent data processing!
This module introduces the fundamental concepts of Natural Language Processing (NLP). It begins with the definition of NLP and explores a variety of real-world applications. You will gain an understanding of Natural Language Understanding (NLU) and Natural Language Generation (NLG). The module also covers key evaluation metrics used to assess NLP systems. Additionally, a hands-on lab session will guide you through the implementation of basic NLP preprocessing techniques.
Inclus
15 vidéos5 lectures12 devoirs
Afficher les informations sur le contenu du module
15 vidéos•Total 82 minutes
Course Introduction•3 minutes
Meet Your Instructor: Prof. Dr. Chetana Gavankar•2 minutes
NLP Definition•3 minutes
NLP Applications•5 minutes
Why NLP is Hard?•10 minutes
Natural Language Understanding •4 minutes
Levels of Language Understanding•5 minutes
Natural Language Generation•4 minutes
Organisation of NLP System•6 minutes
Intrinsic vs. Extrinsic Evaluation•4 minutes
Challenges in Evaluation•4 minutes
NLP Tools Overview•7 minutes
Demo of NLP Tools•6 minutes
Basic NLP Application Development Using NLP Tools•13 minutes
Module Wrap-Up•6 minutes
5 lectures•Total 70 minutes
Course Overview•10 minutes
Recommended Reading: What is NLP?•15 minutes
Recommended Reading: NLP Fundamentals•15 minutes
Recommended Reading: Evaluation of NLP Systems•15 minutes
Basic NLP Application Development Using NLP Tools•3 minutes
Text Preprocessing and Analysis in NLP
Module 2•5 heures à terminer
Détails du module
This module introduces essential NLP preprocessing techniques. It begins with regular expressions for text pattern matching, followed by an overview of words and corpora as foundational data sources. Sentence segmentation and tokenization are then covered through practical demonstrations. Finally, the module explores normalization, lemmatization, and stemming as methods to standardise text, with a demo highlighting their differences and effects.
Inclus
14 vidéos4 lectures14 devoirs
Afficher les informations sur le contenu du module
14 vidéos•Total 79 minutes
Regular Expressions•8 minutes
Words and Corpora•5 minutes
Sentence Segmentation•3 minutes
Code Demo Segmentation•5 minutes
Tokenization•5 minutes
Tokenization Methods•7 minutes
Code Demo Tokenization•14 minutes
Normalization •4 minutes
Code Demo Normalization •4 minutes
Stemming•6 minutes
Code Demo Stemming•5 minutes
Lemmatization •3 minutes
Code Demo Lemmatization•6 minutes
Module Wrap-Up•4 minutes
4 lectures•Total 115 minutes
Recommended Reading: Basic Text Preprocessing•35 minutes
Recommended Reading: Segmentation and Tokenization •30 minutes
Recommended Reading: Normalization•20 minutes
Recommended Reading: Stemming and Lemmatization•30 minutes
14 devoirs•Total 99 minutes
Regular Expressions•3 minutes
Words and Corpora•3 minutes
Sentence Segmentation•3 minutes
Code Demo Segmentation•3 minutes
Tokenization•3 minutes
Tokenization Methods•3 minutes
Code Demo Tokenization•3 minutes
Normalization •3 minutes
Code Demo Normalization•3 minutes
Stemming•3 minutes
Code Demo Stemming•3 minutes
Lemmatization•3 minutes
Code Demo Lemmatization•3 minutes
Graded Quiz: Week 1 and 2•60 minutes
Vector Semantics
Module 3•2 heures à terminer
Détails du module
This module explores lexical and vector semantics, focusing on computational representations of word meaning. It covers word vectors, Bag of Words, and co-occurrence matrices to capture contextual relationships. Techniques such as TF-IDF are introduced to measure word importance, along with methods for computing word similarity. Practical examples and mathematical exercises on TF-IDF help reinforce these core NLP concepts.
Inclus
13 vidéos3 lectures10 devoirs
Afficher les informations sur le contenu du module
13 vidéos•Total 72 minutes
Lexical Semantics •3 minutes
Why Vectors?•7 minutes
Word and Vectors•8 minutes
Bag of Words•4 minutes
Computing Word Similarity•3 minutes
Cosine Similarity•4 minutes
Cosine Similarity Example•7 minutes
Term Frequency•4 minutes
Inverse Document Frequency•11 minutes
TF-IDF•7 minutes
Demo of Words as Vectors•4 minutes
Demo of TF-IDF•8 minutes
Module Wrap-Up•4 minutes
3 lectures•Total 45 minutes
Recommended Reading: Foundations of Lexical and Vector Semantics •15 minutes
Recommended Reading: Representing Text Using Vectors •15 minutes
Recommended Reading: Term and Inverse Document Frequency •15 minutes
10 devoirs•Total 30 minutes
Lexical Semantics •3 minutes
Why Vectors? •3 minutes
Word and Vectors •3 minutes
Bag of Words•3 minutes
Computing Word Similarity •3 minutes
Cosine Similarity •3 minutes
Cosine Similarity Example •3 minutes
Term Frequency •3 minutes
Inverse Document Frequency •3 minutes
TF-IDF •3 minutes
Word Embedding
Module 4•4 heures à terminer
Détails du module
This module introduces Word Embeddings, focusing on the transition from sparse to dense vector representations of words. It covers Word2Vec models, including Skip-gram and CBOW, explained with simple, intuitive examples. The module also explores GloVe embeddings, which capture global word co-occurrence statistics for improved semantic understanding. Learners will visualise word embeddings to gain insights into how words relate in vector space. Finally, the module highlights real-world applications of word embeddings in NLP tasks like sentiment analysis, machine translation, and question answering.
Inclus
13 vidéos3 lectures13 devoirs
Afficher les informations sur le contenu du module
Skip-Gram Negative Training Data Example•3 minutes
SGNS Log Loss Function•3 minutes
Derivative of SGNS Loss Function•3 minutes
SGNS Example Part 1•3 minutes
SGNS Example Part 2•3 minutes
Continuous Bag of Words (CBOW)•3 minutes
Graded Quiz - Week 3 and 4•60 minutes
N-gram Language Modeling
Module 5•3 heures à terminer
Détails du module
This module introduces Language Modeling (LM) and its role in predicting word sequences in natural language. It explores practical applications of LMs and explains N-gram models, including challenges like generalization and handling zero probabilities. Techniques such as smoothing and stupid backoff are covered to improve model robustness. The module concludes with methods for evaluating language models using standard metrics.
Inclus
15 vidéos4 lectures13 devoirs
Afficher les informations sur le contenu du module
15 vidéos•Total 96 minutes
What is Language Modeling?•3 minutes
Language Modelling Applications •3 minutes
How to Build a Language Model •5 minutes
Markov Assumption •2 minutes
N-gram Language Models•4 minutes
Bi-gram Computation•10 minutes
Raw Probabilities•10 minutes
Perils of Overfitting•3 minutes
Laplace Smoothing•14 minutes
Interpolation & Backoff•10 minutes
How Good is the Model?•3 minutes
Extrinsic Evaluation•5 minutes
Perplexity & It's Example•9 minutes
Module Demo•10 minutes
Module Wrap-Up•5 minutes
4 lectures•Total 60 minutes
Recommended Reading: Language Modelling Introduction•15 minutes
Recommended Reading: N-grams •15 minutes
Recommended Reading: Smoothing •15 minutes
Recommended Reading: Language Modelling Evaluation •15 minutes
13 devoirs•Total 39 minutes
What is Language Modeling? •3 minutes
Language Modelling Applications •3 minutes
How to Build a Language Model •3 minutes
Markov Assumption•3 minutes
N-gram Language Models •3 minutes
Bi-gram Computation •3 minutes
Raw Probabilities •3 minutes
Perils of Overfitting •3 minutes
Laplace Smoothing•3 minutes
Interpolation & Backoff•3 minutes
How Good is the Model?•3 minutes
Extrinsic Evaluation •3 minutes
Perplexity & its Example•3 minutes
Neural Networks and Neural Language Models
Module 6•5 heures à terminer
Détails du module
This module explores the use of Neural Networks in Language Modelling, starting with the fundamentals of Feed-Forward Neural Networks and their training process for language tasks. It introduces Neural Language Models, which capture complex patterns in text beyond traditional statistical methods. The module also provides a foundational understanding of Large Language Models (LLMs) and their capabilities. Finally, it introduces Prompt Engineering as a technique to effectively interact with and guide LLMs for various NLP applications.
Inclus
17 vidéos5 lectures16 devoirs
Afficher les informations sur le contenu du module
17 vidéos•Total 98 minutes
Neural Network Unit•3 minutes
Non-Linear Activation Functions•5 minutes
Perceptron with Examples•4 minutes
Multi-Layer Perceptron•8 minutes
Softmax Function with Example•4 minutes
Feed Connected Neural Network•4 minutes
Feedforward Network•5 minutes
Forward Algorithm•4 minutes
Backpropagation Algorithm•5 minutes
Training Neural Network•12 minutes
Neural Language Modeling•6 minutes
Training Neural Language Model•9 minutes
N-gram Versus Neural Language Model•4 minutes
Neural LM Demo•10 minutes
What is LLM?•6 minutes
LLM Use Cases•5 minutes
Module Wrap Up•3 minutes
5 lectures•Total 90 minutes
Recommended Reading: Introduction to Neural Network•15 minutes
Recommended Reading: Training Neural Network •15 minutes
Recommended Reading: Neural Language Models •15 minutes
Recommended Reading: Introduction to Large Language Models •30 minutes
16 devoirs•Total 105 minutes
Neural Network Unit•3 minutes
Non-Linear Activation Functions•3 minutes
Perceptron with Examples•3 minutes
Multi-Layer Perceptron•3 minutes
Softmax Function with Example•3 minutes
Feed Connected Neural Network•3 minutes
Feed Forward Network•3 minutes
Forward Algorithm•3 minutes
Backpropagation Algorithm•3 minutes
Training Neural Network•3 minutes
Neural Language Modeling•3 minutes
Training Neural Language Model•3 minutes
N-gram Versus Neural Language Model•3 minutes
What is LLM?•3 minutes
LLM Use Cases•3 minutes
Graded Quiz - Week 5 and 6•60 minutes
Part of Speech Tagging
Module 7•4 heures à terminer
Détails du module
This module provides an introduction to Part-of-Speech (POS) Tagging, techniques to perform POS Tagging and their applications in NLP. POS tagging is a fundamental task in Natural Language Processing (NLP) that involves assigning grammatical categories (like noun, verb, adjective) to words in text. Starting from basic linguistic foundations and real-world applications, the module dives into the evolution of POS tagging techniques—from statistical models like Hidden Markov Models (HMMs) and Maximum Entropy classifiers, to modern deep learning approaches using Recurrent Neural Networks (RNNs). Learners will gain a strong theoretical understanding and insight into how POS tagging supports downstream tasks like parsing, named entity recognition, and machine translation. The module includes a hands-on coding demonstration for POS tagging.
Inclus
13 vidéos5 lectures11 devoirs
Afficher les informations sur le contenu du module
This module introduces students to the syntactic structure of natural language and its critical role in Natural Language Processing (NLP) applications. Parsing is the task of assigning a structured representation—typically a tree—to a sentence, revealing the grammatical relationships between its components. The module begins by revisiting Context-Free Grammars (CFGs) and how they form the foundation for syntactic parsing. We explore Constituent Parsing, introducing classical parsing techniques such as the CKY (Cocke-Kasami-Younger) algorithm. The module then transitions to modern span-based neural parsing approaches that use neural networks to score and predict parse trees. A significant portion of the module is dedicated to Dependency Parsing, where syntactic structure is represented through direct relationships between words rather than phrases. Students will study both transition-based and graph-based dependency parsers, gaining insight into their strengths, algorithmic designs, and practical performance. Throughout the module, we emphasise real-world NLP applications.
Inclus
18 vidéos4 lectures17 devoirs
Afficher les informations sur le contenu du module
18 vidéos•Total 88 minutes
Outline of the Module •2 minutes
Introduction to Context-Free Grammars (CFGs)•8 minutes
Constituency and Phrase Structure•5 minutes
Ambiguity in Grammar•4 minutes
Chomsky Normal Form (CNF) and Grammar Normalisation•5 minutes
This module explores the semantic dimension of natural language by covering both lexical semantics—including word senses, ambiguity, and disambiguation techniques—and the semantic web—a framework for enabling machine-readable, structured understanding of web data. The module starts with foundational concepts in lexical semantics and WordNet, then proceeds to classical and modern word sense disambiguation (WSD) methods. The second part focuses on Semantic Web technologies, covering ontologies, knowledge graphs, RDF/OWL, and their role in enabling intelligent systems and knowledge-driven NLP applications.
Inclus
17 vidéos5 lectures14 devoirs
Afficher les informations sur le contenu du module
17 vidéos•Total 85 minutes
Outline of the Module•1 minute
What is a Word Sense?•3 minutes
Homonymy vs Polysemy•7 minutes
Sense Relations•7 minutes
Introduction to WordNet and Synsets•7 minutes
Relations in WordNet•5 minutes
Navigating WordNet Hierarchies and Graph Structures•5 minutes
What is Word Sense Disambiguation? •4 minutes
Supervised WSD•8 minutes
Knowledge-Based WSD: Lesk Algorithm•5 minutes
From Syntactic Web to Semantic Web: What's the Problem?•6 minutes
Semantic Web Vision: Data Integration and Automation•3 minutes
Ontologies•4 minutes
Ontology Languages and Their Layers•9 minutes
What is a Knowledge Graph? •3 minutes
Applications in NLP•6 minutes
Module Wrap Up•1 minute
5 lectures•Total 130 minutes
Recommended Reading: Word Senses and Lexical Semantics•30 minutes
Code Document: Querying WordNet in Python (using nltk.corpus.wordnet)•10 minutes
Recommended Reading: WordNet and Semantic Lexicons•30 minutes
Recommended Reading: Word Sense Disambiguation (WSD)•30 minutes
Recommended Reading: Introduction to the Semantic Web and Ontologies•30 minutes
14 devoirs•Total 42 minutes
What is a Word Sense? •3 minutes
Homonymy vs Polysemy•3 minutes
Sense Relations•3 minutes
Introduction to WordNet and Synsets•3 minutes
Relations in WordNet•3 minutes
Navigating WordNet Hierarchies and Graph Structures•3 minutes
What is Word Sense Disambiguation?•3 minutes
Supervised WSD•3 minutes
Knowledge-Based WSD: Lesk Algorithm•3 minutes
Semantic Web Vision: Data Integration and Automation•3 minutes
Ontologies•3 minutes
Ontology Languages and Their Layers•3 minutes
What is a Knowledge Graph? •3 minutes
Applications in NLP•3 minutes
Ethical Implications
Module 10•6 heures à terminer
Détails du module
This module introduces students to the evolution of neural network architectures in NLP, beginning with recurrent models (RNNs), progressing through attention mechanisms, and culminating in Transformer-based models that have revolutionised natural language processing. Through hands-on coding and application-driven lessons, students will explore how Transformers power state-of-the-art systems in sentiment analysis (text classification), machine translation, and question answering. The module emphasises both theoretical foundations and practical implementation using modern deep learning frameworks.
Inclus
16 vidéos6 lectures17 devoirs
Afficher les informations sur le contenu du module
16 vidéos•Total 97 minutes
What RNNs Are and Why They Fall Short•7 minutes
Why Do We Need Attention•5 minutes
The Attention Mechanism Explained•6 minutes
From Attention to Transformer Architecture •6 minutes
High-Level Structure of the Transformer•4 minutes
Self-Attention in Detail•6 minutes
Multi-Head Attention•4 minutes
Positional Encodings•4 minutes
Popular Transformer Variants•5 minutes
What Text Summarisation is and its Uses •2 minutes
Types of Text Summarisation•5 minutes
Natural Text Summarisation •11 minutes
Stages of Text Summarisation •6 minutes
Demo of Text Summarisation •9 minutes
Ethical Issues in NLP •10 minutes
Ethical Design of NLP Applications •6 minutes
6 lectures•Total 140 minutes
Recommended Reading: From RNNs to Attention•30 minutes
Code Document: Transformer Demonstration with Classification•10 minutes
NLP Application - Text Summarisation•30 minutes
Recommended Reading: Ethics in NLP•30 minutes
Course Summary•10 minutes
17 devoirs•Total 108 minutes
What RNNs Are and Why They Fall Short•3 minutes
Why Do We Need Attention•3 minutes
The Attention Mechanism Explained•3 minutes
From Attention to Transformer Architecture •3 minutes
High-Level Structure of the Transformer•3 minutes
Self-Attention in Detail•3 minutes
Multi-Head Attention•3 minutes
Positional Encodings•3 minutes
Popular Transformer Variants•3 minutes
What Text Summarisation is and its Uses •3 minutes
Types of Text Summarisation •3 minutes
Natural Text Summarisation •3 minutes
Stages of Text Summarisation •3 minutes
Demo of Text Summarisation •3 minutes
Ethical Issues in NLP •3 minutes
Ethical Design of NLP Applications •3 minutes
Graded Quiz - Week 9 and 10•60 minutes
Préparer un diplôme
Ce site cours fait partie du (des) programme(s) diplômant(s) suivant(s) proposé(s) par Birla Institute of Technology & Science, Pilani. Si vous êtes admis et que vous vous inscrivez, les cours que vous avez suivis peuvent compter pour l'apprentissage de votre diplôme et vos progrès peuvent être transférés avec vous.¹
Consulter les diplômes éligibles
Préparer un diplôme
Ce site cours fait partie du (des) programme(s) diplômant(s) suivant(s) proposé(s) par Birla Institute of Technology & Science, Pilani. Si vous êtes admis et que vous vous inscrivez, les cours que vous avez suivis peuvent compter pour l'apprentissage de votre diplôme et vos progrès peuvent être transférés avec vous.¹
¹La réussite de la candidature et de l'inscription est requise. Les conditions d'admissibilité s'appliquent. Chaque établissement détermine le nombre de crédits reconnus en complétant ce contenu qui peut compter pour les exigences du diplôme, en tenant compte de tout crédit existant que vous pourriez avoir. Cliquez sur un cours spécifique pour plus d'informations.
Birla Institute of Technology & Science, Pilani (BITS Pilani) is one of only ten private universities in India to be recognised as an Institute of Eminence by the Ministry of Human Resource Development, Government of India. It has been consistently ranked high by both governmental and private ranking agencies for its innovative processes and capabilities that have enabled it to impart quality education and emerge as the best private science and engineering institute in India.
BITS Pilani has four international campuses in Pilani, Goa, Hyderabad, and Dubai, and has been offering bachelor's, master’s, and certificate programmes for over 58 years, helping to launch the careers for over 1,00,000 professionals.
Pour quelles raisons les étudiants sur Coursera nous choisissent-ils pour leur carrière ?
Felipe M.
Étudiant(e) depuis 2018
’Pouvoir suivre des cours à mon rythme à été une expérience extraordinaire. Je peux apprendre chaque fois que mon emploi du temps me le permet et en fonction de mon humeur.’
Jennifer J.
Étudiant(e) depuis 2020
’J'ai directement appliqué les concepts et les compétences que j'ai appris de mes cours à un nouveau projet passionnant au travail.’
Larry W.
Étudiant(e) depuis 2021
’Lorsque j'ai besoin de cours sur des sujets que mon université ne propose pas, Coursera est l'un des meilleurs endroits où se rendre.’
Chaitanya A.
’Apprendre, ce n'est pas seulement s'améliorer dans son travail : c'est bien plus que cela. Coursera me permet d'apprendre sans limites.’
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I purchase the Certificate?
When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.