Applied Natural Language Processing in Engineering Part 1

Instructor: Ramin Mohammadi

7 modules

Gain insight into a topic and learn the fundamentals.

3 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

22 assignments

Taught in English

There are 7 modules in this course

Welcome to this course on applied natural language processing in engineering. This course is designed to provide you with an in-depth understanding of NLP, a pivotal area of artificial intelligence that empowers computers to comprehend, interpret, and generate human language. Throughout this course, you will explore a wide range of topics, from fundamental NLP tasks like text classification and Named Entity Recognition (NER) to advanced techniques in neural machine translation and optimization methods critical for machine learning. We will delve into the complexities of teaching language to machines, addressing challenges like ambiguity, grammar, and cultural nuances. By the end of Part 1, you will have a foundational understanding of how modern NLP systems work, particularly those involving machine learning and deep learning. These topics will equip you to build, analyze, and improve NLP systems across many applications.

This module provides an in-depth exploration of Natural Language Processing (NLP), a crucial area of artificial intelligence that enables computers to understand, interpret, and generate human language. By combining computational linguistics with machine learning, NLP is applied in various technologies, from chatbots and sentiment analysis to machine translation and speech recognition. The module introduces fundamental NLP tasks such as text classification, Named Entity Recognition (NER), and neural machine translation, showcasing how these applications shape real-world interactions with AI. Additionally, it highlights the complexities of teaching language to machines, including handling ambiguity, grammar, and cultural nuances. Through the course, you will gain hands-on experience and knowledge about key techniques like word representation and distributional semantics, preparing you to solve language-related challenges in modern AI systems.

What's included

4 videos, 19 readings, 2 assignments, 1 app item

4 videos (total 6 minutes)

Course Introduction (3 minutes)
Meet Your Faculty (1 minute)
Natural Language Processing (NLP) (1 minute)
Representing the Meaning of a Word (1 minute)

19 readings (total 154 minutes)

Course Introduction (2 minutes)
Syllabus - Applied Natural Language Processing in Engineering Part 1 (10 minutes)
Academic Integrity (1 minute)
Recommended Prior Knowledge (100 minutes)
Week 1 Introduction (2 minutes)
Introduction to NLP (5 minutes)
Example: Chatbots (2 minutes)
Example: Email Filtering (2 minutes)
Example: Sentiment Analysis (3 minutes)
Example: GPT-3 (3 minutes)
Example: ChatGPT Capabilities (5 minutes)
Natural Language Processing (1 minute)
Funny Takes on Language Evolution (2 minutes)
How Do We Represent the Meaning of a Word? (2 minutes)
How Do We Have Usable Meaning in a Computer? (4 minutes)
Words as Discrete Symbols (5 minutes)
Representing Words by Their Context (2 minutes)
Word Vectors (2 minutes)
Final Thoughts on NLP (1 minute)

2 assignments (total 36 minutes)

Assess Your Learning: What is NLP? (18 minutes)
Assess Your Learning: Motivation (18 minutes)

1 app item (total 15 minutes)

Challenges of Teaching Language to AI (15 minutes)

This module focuses on optimization techniques critical for machine learning, particularly in natural language processing (NLP) tasks. It introduces Gradient Descent (GD), a fundamental algorithm used to minimize cost functions by iteratively adjusting model parameters. You’ll explore variants like Stochastic Gradient Descent (SGD) and Mini-Batch Gradient Descent to learn how efficiently they handle large datasets. Advanced methods such as Momentum and Adam are covered to give you insight into how convergence can be sped up by smoothing updates and adapting learning rates. The module also covers second-order techniques like Newton’s Method and Quasi-Newton methods (e.g., BFGS), which leverage curvature information for more direct optimization, although they come with higher computational costs. Overall, this module emphasizes balancing efficiency, accuracy, and computational feasibility in optimizing machine learning models.
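
As a rough illustration of the update rules named above, the following sketch runs plain gradient descent and a momentum variant on a one-dimensional cost. The cost function, learning rate, momentum value, and the function name `gradient_descent` are all assumptions chosen for illustration, not material taken from the course.

```python
def gradient_descent(grad, x0, lr=0.1, momentum=0.0, steps=100):
    """Minimize a 1-D function from its gradient, with optional momentum."""
    x = x0
    velocity = 0.0
    for _ in range(steps):
        # The velocity is a decaying running sum of past gradient steps;
        # with momentum=0.0 this reduces to plain gradient descent.
        velocity = momentum * velocity - lr * grad(x)
        x += velocity
    return x


# Hypothetical cost J(x) = (x - 3)^2, whose gradient is 2 * (x - 3).
def grad_j(x):
    return 2.0 * (x - 3.0)


x_plain = gradient_descent(grad_j, x0=0.0)
x_momentum = gradient_descent(grad_j, x0=0.0, momentum=0.9)
print(x_plain, x_momentum)  # both should end up close to the minimizer x = 3
```

Stochastic and mini-batch variants keep the same loop but estimate the gradient from a random subset of the data at each step.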

What's included

2 videos, 16 readings, 3 assignments

2 videos (total 8 minutes)

Machine Learning and NLP (4 minutes)
Optimization Techniques (4 minutes)

16 readings (total 82 minutes)

Week 2 Overview (2 minutes)
Machine Learning (2 minutes)
Variations of Gradient Descent (2 minutes)
Types of ML in NLP (6 minutes)
What is a Model in NLP and How Does it Learn? (6 minutes)
Understanding Cost Functions (2 minutes)
Minimizing the Cost Function in NLP (10 minutes)
Why Optimization Techniques Matter (1 minute)
Why SGD Works (10 minutes)
Jacobian Matrix & Hessian Matrix (5 minutes)
Momentum (10 minutes)
Newton's Methods (5 minutes)
Quasi-Newton Methods (5 minutes)
Root Mean Square Propagation (RMSProp) (5 minutes)
Adaptive Moment Estimation (Adam) (10 minutes)
Overall Challenges of Second-Order Optimization Techniques (1 minute)

3 assignments (total 81 minutes)

Assess Your Learning: ML in NLP (18 minutes)
Assess Your Learning: Optimization Techniques (18 minutes)
Module 2 Quiz (45 minutes)

This module explores Named Entity Recognition (NER), a core task in Natural Language Processing (NLP) that identifies and classifies entities like people, locations, and organizations in text. We’ll begin by examining how logistic regression can be used to model NER as a binary classification problem, though this approach faces limitations with complexity and context capture. We’ll then transition to more advanced techniques, such as neural networks, which excel at handling the complex patterns and large-scale data that traditional models struggle with. Neural networks' ability to learn hierarchical features makes them ideal for NER tasks, as they can capture contextual information more effectively than simpler models. Throughout the module, we compare these methods and highlight how deep learning approaches such as Recurrent Neural Networks (RNNs) and transformers like BERT improve NER accuracy and scalability.
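
To make the binary-classification framing concrete, here is a minimal sketch of a per-token logistic regression that decides "entity" versus "not entity". The three features, the toy training tokens, and the hyperparameters are hypothetical and chosen only to keep the example small.

```python
import math


def features(token):
    """Hand-picked, hypothetical features for one token (first entry is a bias)."""
    return [1.0, float(token[0].isupper()), float(token.isalpha())]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(weights, token):
    """Probability that the token is part of a named entity."""
    z = sum(w * x for w, x in zip(weights, features(token)))
    return sigmoid(z)


def train(examples, lr=0.5, epochs=200):
    """Fit the weights with per-example gradient steps on the log loss."""
    weights = [0.0, 0.0, 0.0]
    for _ in range(epochs):
        for token, label in examples:
            x = features(token)
            error = predict_proba(weights, token) - label  # dLoss/dz
            weights = [w - lr * error * xi for w, xi in zip(weights, x)]
    return weights


# Toy labels: 1 = token is part of a named entity, 0 = it is not.
toy_data = [("Paris", 1), ("Google", 1), ("Alice", 1),
            ("visited", 0), ("the", 0), ("office", 0), ("2024", 0)]
weights = train(toy_data)
print(predict_proba(weights, "Berlin"), predict_proba(weights, "walked"))
```

Because each token is scored in isolation, a sentence-initial word such as "The" receives the same features as "Paris" here, which illustrates the context limitation that motivates the neural approaches.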

What's included

2 videos, 14 readings, 3 assignments, 1 app item

2 videos (total 4 minutes)

Neural Networks Definitions (4 minutes)
Network Propagation (0 minutes)

14 readings (total 89 minutes)

Week 3 Overview (2 minutes)
Neural Networks (2 minutes)
Named Entity Recognition (NER) (5 minutes)
NER as a Binary Regression Problem (5 minutes)
Neural Network (5 minutes)
Neural Network Structure (5 minutes)
How Does a Neural Network Learn? (10 minutes)
Mathematical Representation (20 minutes)
Steps in Back Propagation Algorithm (5 minutes)
Stochastic Gradient (5 minutes)
Classification Tasks (5 minutes)
Sequence-to-Sequence Tasks (5 minutes)
Sequence Labeling Tasks (5 minutes)
Regression Tasks & Divergence Measures (10 minutes)

3 assignments (total 81 minutes)

Assess Your Learning: NER & Neural Networks (18 minutes)
Assess Your Learning: Cost Functions (18 minutes)
Module 3 Quiz (45 minutes)

1 app item (total 10 minutes)

Some Common Activation Functions (10 minutes)

The Word2Vec and GloVe models are popular word embedding techniques in Natural Language Processing (NLP), each offering unique advantages. Word2Vec, developed by Google, operates via two key models: Continuous Bag of Words (CBOW), which predicts a word from its context, and Skip-gram, which predicts the context from a word. The GloVe model, developed at Stanford, combines count-based and predictive approaches by leveraging word co-occurrence matrices to learn word vectors. Both models represent words in a high-dimensional vector space and capture semantic relationships. Word2Vec focuses on local contexts, learning efficiently from large datasets, while GloVe emphasizes global word co-occurrence patterns across the entire corpus, revealing deeper word associations. These embeddings enable tasks like analogy-solving, semantic similarity, and other linguistic computations, making them central to modern NLP applications.
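
The difference between local context windows and global co-occurrence counts can be sketched with a few lines of data preparation. The helper names `skipgram_pairs` and `cooccurrence_counts`, the toy sentence, and the window size are assumptions for illustration; neither function trains any vectors.

```python
from collections import Counter


def skipgram_pairs(tokens, window=2):
    """(center, context) pairs of the kind a Skip-gram model is trained on."""
    pairs = []
    for i, center in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        pairs.extend((center, tokens[j]) for j in range(lo, hi) if j != i)
    return pairs


def cooccurrence_counts(tokens, window=2):
    """Corpus-wide co-occurrence counts, the raw input to a GloVe-style model."""
    return Counter(skipgram_pairs(tokens, window))


tokens = "the cat sat on the mat".split()
pairs = skipgram_pairs(tokens, window=1)
counts = cooccurrence_counts(tokens, window=1)
print(pairs[:3])
print(counts[("the", "cat")])
```

A Skip-gram model would consume the pairs one at a time, while a GloVe-style model would first aggregate them into the count table and fit vectors to those totals.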

What's included

3 videos, 29 readings, 4 assignments, 1 app item

3 videos (total 11 minutes)

GloVe Training Process (5 minutes)
Word2Vec (4 minutes)
Skip-Gram (2 minutes)

29 readings (total 267 minutes)

Week 4 Overview (2 minutes)
Introduction to GloVe (5 minutes)
Co-occurrence Matrix (5 minutes)
Objective: Ratio of Co-occurrences (5 minutes)
Calculating Probability Ratios (5 minutes)
Symmetry and Linearity in GloVe (5 minutes)
Minimizing the Cost Function and Optimizing Word Vectors (5 minutes)
Optimization Process (10 minutes)
Final Word Vectors (2 minutes)
Implicit Properties in GloVe (5 minutes)
GloVe Introduction (2 minutes)
What is Language Modeling? (5 minutes)
Co-occurrence Matrix (5 minutes)
Vector Representations for Word (3 minutes)
Continuous Bag of Words (CBOW) (5 minutes)
Mathematical Objectives (10 minutes)
Mathematical Objectives 2 (15 minutes)
Limitations of CBOW (1 minute)
Skip-Gram (15 minutes)
Gradient Derivation (15 minutes)
The Challenge of Training Skip-Gram (10 minutes)
Binary Classification Perspective (10 minutes)
Gradient of Negative Sampling Objective (10 minutes)
Connecting Between Skip-Gram, Negative Sampling, and One Sampling (2 minutes)
Skip-Gram with Negative Sampling Across All Words (10 minutes)
Negative Sampling in Skip-Gram Model (10 minutes)
Word2Vec Example (30 minutes)
Word2Vec Worked Example (30 minutes)
Word2Vec Example 2 (30 minutes)

4 assignments (total 99 minutes)

Assess Your Learning: GloVe (18 minutes)
Assess Your Learning: Word2Vec & CBOW (18 minutes)
Assess Your Learning: Skip-Gram & Negative Sampling (18 minutes)
Module 4 Quiz (45 minutes)

1 app item (total 3 minutes)

GloVe Training Process (3 minutes)

This module delves into the evaluation techniques for Natural Language Processing (NLP) models, focusing on both intrinsic and extrinsic evaluation methods. Intrinsic evaluation assesses the model's performance based on internal criteria, such as word embedding quality, parsing accuracy, and language model perplexity. In contrast, extrinsic evaluation measures the model's effectiveness in real-world applications, including tasks like machine translation, sentiment analysis, and named entity recognition. You’ll also learn about the key differences between these evaluation types and the importance of context and application in determining a model's utility. Additionally, you’ll review specific metrics like cross-entropy loss, perplexity, BLEU, and ROUGE scores to build a comprehensive understanding of how to evaluate and improve NLP models.
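
The metrics listed above can be sketched in a few lines. The example below computes cross-entropy and perplexity from the probabilities a model assigned to the correct tokens, plus a unigram overlap that mirrors only the simplest ingredient of BLEU (clipped precision) and ROUGE-1 (recall). The function names and inputs are hypothetical, and full BLEU additionally combines higher-order n-grams with a brevity penalty.

```python
import math
from collections import Counter


def cross_entropy(token_probs):
    """Average negative log-probability (in nats) given to the true tokens."""
    return -sum(math.log(p) for p in token_probs) / len(token_probs)


def perplexity(token_probs):
    """Exponentiated cross-entropy; lower means the model was less surprised."""
    return math.exp(cross_entropy(token_probs))


def unigram_overlap(candidate, reference):
    """Return (clipped unigram precision, unigram recall) for two token lists."""
    cand, ref = Counter(candidate), Counter(reference)
    overlap = sum((cand & ref).values())  # each match counted at most ref times
    return overlap / len(candidate), overlap / len(reference)


# A model that always spreads its probability evenly over four choices.
uniform_probs = [0.25, 0.25, 0.25, 0.25]
print(perplexity(uniform_probs))

precision, recall = unigram_overlap("the cat sat".split(),
                                    "the cat sat on the mat".split())
print(precision, recall)
```

The short candidate scores perfect precision but only half the recall, which is one reason precision-oriented and recall-oriented scores are usually read together.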

What's included

9 readings, 2 assignments, 1 app item

9 readings (total 99 minutes)

Week 5 Overview (2 minutes)
General Concept of Evaluation (in NLP) (15 minutes)
Key Differences Between Intrinsic and Extrinsic Evaluation (2 minutes)
Cross-Entropy Loss - Intrinsic (10 minutes)
Cross-Entropy and Learning from Incorrect Predictions (15 minutes)
Perplexity - Intrinsic (15 minutes)
Bilingual Evaluation Understudy Score (BLEU) - Extrinsic (15 minutes)
Recall and Precision in Text Summarization or Translation (15 minutes)
Recall-Oriented Understudy for Gisting Evaluation (ROUGE) - Extrinsic (10 minutes)

2 assignments (total 63 minutes)

Assess Your Learning: NLP Model Evaluation (18 minutes)
Module 5 Quiz (45 minutes)

1 app item (total 10 minutes)

Evaluation Techniques (10 minutes)

This module explores various techniques for topic modeling in natural language processing (NLP), focusing on methods like Latent Semantic Analysis (LSA), Non-Negative Matrix Factorization (NMF), and Latent Dirichlet Allocation (LDA). It begins with an introduction to matrix factorization and the importance of transforming textual data into numerical representations. You’ll delve into the mechanics of LSA and NMF, paying attention to their use of TF-IDF and Singular Value Decomposition (SVD) to uncover latent semantic structures. Additionally, you’ll review LDA's probabilistic approach to topic modeling, including its reliance on Dirichlet distributions and Bayesian inference to identify hidden topics within a corpus. Through detailed examples and mathematical explanations, the module provides a comprehensive understanding of how these techniques can be applied to extract meaningful topics from large text datasets.
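
The first step described above, turning text into numbers, can be sketched as a small TF-IDF matrix. This uses one common weighting variant (term frequency times log(N / document frequency)); the variant, the function name `tfidf_matrix`, and the toy documents are assumptions, and the factorization step itself (SVD or NMF) is left out.

```python
import math
from collections import Counter


def tfidf_matrix(docs):
    """Return (vocab, rows), where rows[d][t] is the TF-IDF weight of term t in doc d."""
    vocab = sorted({word for doc in docs for word in doc})
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(word for doc in docs for word in set(doc))
    rows = []
    for doc in docs:
        counts = Counter(doc)
        rows.append([
            (counts[word] / len(doc)) * math.log(n_docs / doc_freq[word])
            for word in vocab
        ])
    return vocab, rows


docs = [text.split() for text in
        ["cats chase mice", "dogs chase cats", "stocks rise today"]]
vocab, matrix = tfidf_matrix(docs)
for row in matrix:
    print([round(weight, 3) for weight in row])
```

A matrix like this is what LSA or NMF would then factorize into document-topic and topic-term components.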

What's included

1 video, 16 readings, 4 assignments, 1 app item

1 video (total 5 minutes)

Topic Modeling (5 minutes)

16 readings (total 133 minutes)

Week 6 Overview (2 minutes)
Matrix Factorization (1 minute)
Latent Semantic Analysis (LSA) (15 minutes)
LSA Example (15 minutes)
Topic Modeling Using Latent Semantic Analysis (LSA) (5 minutes)
Dimensions and Applications (5 minutes)
Non-Negative Matrix Factorization (NMF) (5 minutes)
Operationalizing NMF (7 minutes)
Numerical Example of NMF (15 minutes)
Applications of NMF (2 minutes)
Latent Dirichlet Allocation (LDA) (5 minutes)
Defining the Problem and Key Assumptions (1 minute)
Mathematical Model of LDA (10 minutes)
Steps in LDA: Mathematical Explanation (15 minutes)
Maximizing the Posterior Probability in LDA (15 minutes)
Full Example (15 minutes)

4 assignments (total 99 minutes)

Assess Your Learning: Latent Semantic Analysis (18 minutes)
Assess Your Learning: Non-Negative Matrix Factorization (18 minutes)
Assess Your Learning: Latent Dirichlet Allocation (18 minutes)
Module 6 Quiz (45 minutes)

1 app item (total 10 minutes)

Recapping NMF & LDA (10 minutes)

This module delves into the essential techniques of syntactic and semantic parsing in natural language processing (NLP). You’ll begin with an exploration of linguistic structures, focusing on phrase structure and dependency structure, which are fundamental for understanding sentence syntax. Then you’ll review various parsing methods, including transition-based and graph-based dependency parsing, highlighting their respective advantages and challenges. Additionally, you’ll look at neural transition-based parsing, which leverages neural networks for improved accuracy and efficiency. Finally, the module introduces semantic parsing, emphasizing its role in mapping sentences to formal representations of meaning, which is crucial for applications like dialogue systems and information extraction.
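
Transition-based dependency parsing can be pictured as a stack, a buffer, and a sequence of actions. The sketch below applies a hand-written action sequence in the arc-standard style; the function name `parse`, the example sentence, and the action list are assumptions for illustration. In a neural transition-based parser, a classifier would choose each action instead of reading it from a list.

```python
def parse(words, transitions):
    """Apply arc-standard transitions and return the (head, dependent) arcs."""
    stack, buffer, arcs = ["ROOT"], list(words), []
    for action in transitions:
        if action == "SHIFT":
            # Move the next input word onto the stack.
            stack.append(buffer.pop(0))
        elif action == "LEFT-ARC":
            # The second-from-top item becomes a dependent of the top item.
            dependent = stack.pop(-2)
            arcs.append((stack[-1], dependent))
        elif action == "RIGHT-ARC":
            # The top item becomes a dependent of the item below it.
            dependent = stack.pop()
            arcs.append((stack[-1], dependent))
        else:
            raise ValueError(f"unknown transition: {action}")
    return arcs


arcs = parse(
    ["She", "eats", "fish"],
    ["SHIFT", "SHIFT", "LEFT-ARC", "SHIFT", "RIGHT-ARC", "RIGHT-ARC"],
)
print(arcs)
```

Each word is shifted once and attached once, so a sentence of n words is parsed in 2n transitions, which is the efficiency argument usually made for this family of parsers.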