Welcome to this course on applied natural language processing in engineering. This course is designed to provide you with an in-depth understanding of NLP, a pivotal area of artificial intelligence that empowers computers to comprehend, interpret, and generate human language. Throughout this course, you will explore a wide range of topics, from fundamental NLP tasks like text classification and Named Entity Recognition (NER) to advanced techniques in neural machine translation and optimization methods critical for machine learning. We will delve into the complexities of teaching language to machines, addressing challenges like ambiguity, grammar, and cultural nuances. By the end of this part 1 course, you will have a foundational understanding of how modern NLP systems work - particularly those involving machine learning and deep learning. These topics will equip you to build, analyze and improve NLP systems across many applications.

Applied Natural Language Processing in Engineering Part 1

Applied Natural Language Processing in Engineering Part 1

Instructor: Ramin Mohammadi
Access provided by Allegiant Giving Corporation
Details to know

Add to your LinkedIn profile
22 assignments
October 2025
See how employees at top companies are mastering in-demand skills

There are 7 modules in this course
This module provides an in-depth exploration of Natural Language Processing (NLP), a crucial area of artificial intelligence that enables computers to understand, interpret, and generate human language. By combining computational linguistics with machine learning, NLP is applied in various technologies, from chatbots and sentiment analysis to machine translation and speech recognition. The module introduces fundamental NLP tasks such as text classification, Named Entity Recognition (NER), and neural machine translation, showcasing how these applications shape real-world interactions with AI. Additionally, it highlights the complexities of teaching language to machines, including handling ambiguity, grammar, and cultural nuances. Through the course, you will gain hands-on experience and knowledge about key techniques like word representation and distributional semantics, preparing them to solve language-related challenges in modern AI systems.
What's included
4 videos19 readings2 assignments1 app item
4 videos• Total 6 minutes
- Course Introduction• 3 minutes
- Meet Your Faculty• 1 minute
- Natural Language Processing (NLP)• 1 minute
- Representing the Meaning of a Word• 1 minute
19 readings• Total 154 minutes
- Course Introduction• 2 minutes
- Syllabus - Applied Natural Language Processing in Engineering Part 1• 10 minutes
- Academic Integrity• 1 minute
- Recommended Prior Knowledge• 100 minutes
- Week 1 Introduction• 2 minutes
- Introduction to NLP• 5 minutes
- Example: Chatbots• 2 minutes
- Example: Email Filtering• 2 minutes
- Example: Sentiment Analysis• 3 minutes
- Example: GPT - 3• 3 minutes
- Example: ChatGPT Capabilities• 5 minutes
- Natural Language Processing• 1 minute
- Funny Takes on Language Evolution• 2 minutes
- How Do We Represent the Meaning of a Word?• 2 minutes
- How Do We Have Usable Meaning in a Computer?• 4 minutes
- Words as Discrete Symbols• 5 minutes
- Representing Words by Their Context• 2 minutes
- Word Vectors• 2 minutes
- Final Thoughts on NLP• 1 minute
2 assignments• Total 36 minutes
- Assess Your Learning: What is NLP?• 18 minutes
- Assess Your Learning: Motivation• 18 minutes
1 app item• Total 15 minutes
- Challenges of Teaching Language to AI• 15 minutes
This module focuses on optimization techniques critical for machine learning, particularly in natural language processing (NLP) tasks. It introduces Gradient Descent (GD), a fundamental algorithm used to minimize cost functions by iteratively adjusting model parameters. You’ll explore variants like Stochastic Gradient Descent (SGD) and Mini-Batch Gradient Descent to learn more about their efficiency in handling large datasets. Advanced methods such as Momentum and Adam are covered to give you insight on how to enhance convergence speed by smoothing updates and adapting learning rates. The module also covers second-order techniques like Newton’s Method and Quasi-Newton methods (e.g., BFGS), which leverage curvature information for more direct optimization, although they come with higher computational costs. Overall, this module emphasizes balancing efficiency, accuracy, and computational feasibility in optimizing machine learning models.
What's included
2 videos16 readings3 assignments
2 videos• Total 8 minutes
- Machine Learning and NLP• 4 minutes
- Optimization Techniques• 4 minutes
16 readings• Total 82 minutes
- Week 2 Overview• 2 minutes
- Machine Learning• 2 minutes
- Variations of Gradient Descent• 2 minutes
- Types of ML in NLP• 6 minutes
- What is a Model in NLP and How Does it Learn?• 6 minutes
- Understanding Cost Functions• 2 minutes
- Minimizing the Cost Function in NLP• 10 minutes
- Why Optimization Techniques Matter• 1 minute
- Why SGD Works• 10 minutes
- Jacobian Matrix & Hessian Matrix• 5 minutes
- Momentum• 10 minutes
- Newton's Methods• 5 minutes
- Quasi-Newton Methods• 5 minutes
- Root Mean Square Propagation (RMSProp)• 5 minutes
- Adaptive Moment Estimation (Adam)• 10 minutes
- Overall Challenges of Second-Order Optimization Techniques• 1 minute
3 assignments• Total 81 minutes
- Assess Your Learning: ML in NLP• 18 minutes
- Assess Your Learning: Optimization Techniques• 18 minutes
- Module 2 Quiz• 45 minutes
This module explores Named Entity Recognition (NER), a core task in Natural Language Processing (NLP) that identifies and classifies entities like people, locations, and organizations in text. We’ll begin by examining how logistic regression can be used to model NER as a binary classification problem, though this approach faces limitations with complexity and context capture. We’ll then transition to more advanced techniques, such as neural networks, which excel at handling the complex patterns and large-scale data that traditional models struggle with. Neural networks' ability to learn hierarchical features makes them ideal for NER tasks, as they can capture contextual information more effectively than simpler models. Throughout the module, we compare these methods and highlight how deep learning approaches such as Recurrent Neural Networks (RNNs) and transformers like BERT improve NER accuracy and scalability.
What's included
2 videos14 readings3 assignments1 app item
2 videos• Total 4 minutes
- Neural Networks Definitions• 4 minutes
- Network Propagation• 0 minutes
14 readings• Total 89 minutes
- Week 3 Overview• 2 minutes
- Neural Networks• 2 minutes
- Named Entity Recognition (NER)• 5 minutes
- NER as a Binary Regression Problem• 5 minutes
- Neural Network• 5 minutes
- Neural Network Structure• 5 minutes
- How Does a Neural Network Learn?• 10 minutes
- Mathematical Representation• 20 minutes
- Steps in Back Propagation Algorithm• 5 minutes
- Stochastic Gradient• 5 minutes
- Classification Tasks• 5 minutes
- Sequence-to-Sequence Tasks• 5 minutes
- Sequence Labeling Tasks• 5 minutes
- Regression Tasks & Divergence Measures• 10 minutes
3 assignments• Total 81 minutes
- Assess Your Learning: NER & Neural Networks• 18 minutes
- Assess Your Learning: Cost Functions• 18 minutes
- Module 3 Quiz• 45 minutes
1 app item• Total 10 minutes
- Some Common Activation Functions• 10 minutes
The Word2Vec and GloVe models are popular word embedding techniques in Natural Language Processing (NLP), each offering unique advantages. Word2Vec, developed by Google, operates via two key models: Continuous Bag of Words (CBOW) and Skip-gram, focusing on predicting a word based on its context or vice versa (Word2Vec). The GloVe model, on the other hand, created by Stanford, combines count-based and predictive approaches by leveraging word co-occurrence matrices to learn word vectors (GloVe). Both models represent words in a high-dimensional vector space and capture semantic relationships. Word2Vec focuses on local contexts, learning efficiently from large datasets, while GloVe emphasizes global word co-occurrence patterns across the entire corpus, revealing deeper word associations. These embeddings enable tasks like analogy-solving, semantic similarity, and other linguistic computations, making them central to modern NLP applications.
What's included
3 videos29 readings4 assignments1 app item
3 videos• Total 11 minutes
- GLoVe Training Process• 5 minutes
- Word2Vec• 4 minutes
- Skip-Gram• 2 minutes
29 readings• Total 267 minutes
- Week 4 Overview• 2 minutes
- Introduction to GLoVe• 5 minutes
- Co-occurrence Matrix• 5 minutes
- Objective: Ratio of Co-occurrences• 5 minutes
- Calculating Probability Ratios• 5 minutes
- Symmetry and Linearity in GloVe• 5 minutes
- Minimizing the Cost Function and Optimizing Word Vectors• 5 minutes
- Optimization Process• 10 minutes
- Final Word Vectors• 2 minutes
- Implicit Properties in GloVe• 5 minutes
- GLoVe Introduction• 2 minutes
- What is Language Modeling?• 5 minutes
- Co-occurrence Matrix• 5 minutes
- Vector Representations for Word• 3 minutes
- Continuous Bag of Words (CBOW)• 5 minutes
- Mathematical Objectives• 10 minutes
- Mathematical Objectives 2• 15 minutes
- Limitations of CBOW• 1 minute
- Skip-Gram• 15 minutes
- Gradient Derivation• 15 minutes
- The Challenge of Training Skip-Gram• 10 minutes
- Binary Classification Perspective• 10 minutes
- Gradient of Negative Sampling Objective• 10 minutes
- Connecting Between Skip-Gram, Negative Sampling, and One Sampling• 2 minutes
- Skip-Gram with Negative Sampling Across All Words• 10 minutes
- Negative Sampling in Skip-Gram Model• 10 minutes
- Word2Vec Example• 30 minutes
- Word2Vec Worked Example • 30 minutes
- Word2Vec Example 2• 30 minutes
4 assignments• Total 99 minutes
- Assess Your Learning: GLoVe• 18 minutes
- Assess Your Learning: Word2Vec & CBOW• 18 minutes
- Assess Your Learning: Skip-Gram & Negative Sampling• 18 minutes
- Module 4 Quiz• 45 minutes
1 app item• Total 3 minutes
- GloVe Training Process• 3 minutes
This module delves into the evaluation techniques for Natural Language Processing (NLP) models, focusing on both intrinsic and extrinsic evaluation methods. Intrinsic evaluation assesses the model's performance based on internal criteria, such as word embedding quality, parsing accuracy, and language model perplexity. In contrast, extrinsic evaluation measures the model's effectiveness in real-world applications, including tasks like machine translation, sentiment analysis, and named entity recognition. You’ll also learn more about key differences between these evaluation types, and the importance of context and application in determining a model's utility. Additionally, you’ll review specific metrics like cross-entropy loss, perplexity, BLEU, and ROUGE scores, providing a comprehensive understanding of how to evaluate and improve NLP models.
What's included
9 readings2 assignments1 app item
9 readings• Total 99 minutes
- Week 5 Overview• 2 minutes
- General Concept of Evaluation (in NLP)• 15 minutes
- Key Differences Between Intrinsic and Extrinsic Evaluation• 2 minutes
- Cross-Entropy Loss - Intrinsic• 10 minutes
- Cross-Entropy and Learning from Incorrect Predictions• 15 minutes
- Perplexity - Intrinsic• 15 minutes
- Bilingual Evaluation Understudy Score (BLEU) - Extrinsic• 15 minutes
- Recall and Precision in Text Summarization or Translation• 15 minutes
- Recall-Oriented Understudy for Gisting Evaluation (ROUGE) - Extrinsic• 10 minutes
2 assignments• Total 63 minutes
- Assess Your Learning: NLP Model Evaluation• 18 minutes
- Module 5 Quiz• 45 minutes
1 app item• Total 10 minutes
- Evaluation Techniques• 10 minutes
This module explores various techniques for topic modeling in natural language processing (NLP), focusing on methods like Latent Semantic Analysis (LSA), Non-Negative Matrix Factorization (NMF), and Latent Dirichlet Allocation (LDA). It begins with an introduction to matrix factorization and the importance of transforming textual data into numerical representations. You’ll delve into the mechanics of LSA and NMF, paying attention to their use of TF-IDF and Singular Value Decomposition (SVD) to uncover latent semantic structures. Additionally, you’ll review LDA's probabilistic approach to topic modeling, explaining its reliance on Dirichlet distributions and Bayesian inference to identify hidden topics within a corpus. Through detailed examples and mathematical explanations, the module provides a comprehensive understanding of how these techniques can be applied to extract meaningful topics from large text datasets.
What's included
1 video16 readings4 assignments1 app item
1 video• Total 5 minutes
- Topic Modeling• 5 minutes
16 readings• Total 133 minutes
- Week 6 Overview• 2 minutes
- Matrix Factorization• 1 minute
- Latent Semantic Analysis (LSA)• 15 minutes
- LSA Example• 15 minutes
- Topic Modeling Using Latent Semantic Analysis (LSA)• 5 minutes
- Dimensions and Applications• 5 minutes
- Non-Negative Matrix Factorization (NMF)• 5 minutes
- Operationalizing NMF• 7 minutes
- Numerical Example of NMF• 15 minutes
- Applications of NMF• 2 minutes
- Latent Dirichlet Allocation (LDA)• 5 minutes
- Defining the Problem and Key Assumptions• 1 minute
- Mathematical Model of LDA• 10 minutes
- Steps in LDA: Mathematical Explanation• 15 minutes
- Maximizing the Posterior Probability in LDA• 15 minutes
- Full Example• 15 minutes
4 assignments• Total 99 minutes
- Assess Your Learning: Latent Semantic Analysis• 18 minutes
- Assess Your Learning: Non-Negative Matrix Factorization• 18 minutes
- Assess Your Learning: Latent Dirichlet Allocation• 18 minutes
- Module 6 Quiz• 45 minutes
1 app item• Total 10 minutes
- Recapping NMF & LDA• 10 minutes
This module delves into the essential techniques of syntactic and semantic parsing in natural language processing (NLP). You’ll begin with an exploration of linguistic structures, focusing on phrase structure and dependency structure, which are fundamental for understanding sentence syntax. Then you’ll review various parsing methods, including transition-based and graph-based dependency parsing, highlighting their respective advantages and challenges. Additionally, you’ll touch on neural transition-based parsing, which leverages neural networks for improved accuracy and efficiency. Finally, the module touches on semantic parsing, emphasizing its role in mapping sentences to formal representations of meaning, crucial for applications like dialogue systems and information extraction.
What's included
2 videos32 readings4 assignments
2 videos• Total 5 minutes
- Transition-Based and Graph Parsing Examples• 2 minutes
- Neural Advancements in Parsing: Dependency and Semantics• 3 minutes
32 readings• Total 160 minutes
- Week 7 Overview• 2 minutes
- Introduction to Dependency Parsing• 5 minutes
- What is a Constituent?• 5 minutes
- Pseudoclefting & Sentence Fragments• 5 minutes
- Substitution of Well-formed Sentences• 5 minutes
- Naming the Other Phases• 5 minutes
- Immediate Domination, Domination, & Feminine Kinship Relations• 5 minutes
- Syntactic Parsing - Phrase Structure• 10 minutes
- Phrase Structure• 3 minutes
- Phrase Structure Rules & Explanation• 2 minutes
- Building the Tree• 2 minutes
- Summary• 5 minutes
- Dependency Structure• 2 minutes
- Key Concepts & Example• 10 minutes
- Advantages of Dependency Structure• 5 minutes
- What is Transition-based Parsing?• 2 minutes
- The Greedy Transition-Based Method• 10 minutes
- Type 2: Shift-Reduce Parser• 10 minutes
- Introduction• 5 minutes
- Key Concepts & Configuration• 10 minutes
- Advantages, Challenges & Considerations• 5 minutes
- Introduction• 2 minutes
- Key Concepts of Graph-Based Dependency Parsing• 5 minutes
- Worked Example• 10 minutes
- Advantages and Disadvantages• 2 minutes
- Transition-Based vs Graph-Based Parsing• 1 minute
- Semantic Parsing• 5 minutes
- Example 1• 5 minutes
- Example 2• 5 minutes
- Example 3• 5 minutes
- Example 4• 5 minutes
- Congratulations! • 2 minutes
4 assignments• Total 135 minutes
- Assess Your Learning: Constituents & Detection of Sentence Parts• 30 minutes
- Assess Your Learning: Phrase and Dependency Structure• 30 minutes
- Assess Your Learning: Parsing• 30 minutes
- Module 7 Quiz• 45 minutes
Instructor

Offered by

Offered by

Founded in 1898, Northeastern is a global research university with a distinctive, experience-driven approach to education and discovery. The university is a leader in experiential learning, powered by the world’s most far-reaching cooperative education program. The spirit of collaboration guides a use-inspired research enterprise focused on solving global challenges in health, security, and sustainability.
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
Explore more from Data Science
NNortheastern University
Course
NNortheastern University
Course

Specialization

Course