When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 5 modules in this course
Marketing data is often so big that humans cannot read or analyze a representative sample of it to understand what insights might lie within. In this course, learners use unsupervised deep learning to train algorithms to extract topics and insights from text data. Learners walk through a conceptual overview of unsupervised machine learning and dive into real-world datasets through instructor-led tutorials in Python. The course concludes with a major project.
This course uses Jupyter Notebooks and the coding environment Google Colab, a browser-based Jupyter notebook environment. Files are stored in Google Drive.
This course can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science (MS-DS) degree offered on the Coursera platform. The MS-DS is an interdisciplinary degree that brings together faculty from CU Boulder’s departments of Applied Mathematics, Computer Science, Information Science, and others. With performance-based admissions and no application process, the MS-DS is ideal for individuals with a broad range of undergraduate education and/or professional experience in computer science, information science, mathematics, and statistics. Learn more about the MS-DS program at https://www.coursera.org/degrees/master-of-science-data-science-boulder.
In this module, we will cover the fundamental concepts of topic modeling, also known as unsupervised machine learning on unstructured text documents. We will contrast unsupervised methods to supervised ones and survey common applications of topic modeling.
Introduction to Using Google Colab for this Course•10 minutes
Dr. Vargo's Topic Modeling Approach to YikYak Data•10 minutes
1 programming assignment•Total 180 minutes
Homework 1: Segmenting by Sentiment•180 minutes
1 discussion prompt•Total 10 minutes
Introduce Yourself!•10 minutes
The Assumptions of a Topic Model, Bag of Words, and Natural Language Processing
Module 2•4 hours to complete
Module details
In this module, we will go under the hood inside a topic modeling approach and understand what assumptions drive topic model fit. We will also uncover how bag-of-words approaches to topic modeling work, and the natural language processing required to produce meaningful topic modeling features.
Dr. Vargo’s Chapter on How Topic Modeling Compares with Lexicon-based Approaches•10 minutes
1 assignment•Total 30 minutes
Topic Modeling Quiz•30 minutes
1 programming assignment•Total 180 minutes
Homework 2: Build a Topic Model•180 minutes
Prepping Amazon Review Data
Module 3•1 hour to complete
Module details
In this module, we will cover how to parse through JSON-like data and segment it to create a corpus that is ready for the topic modeling process. We will cover how the data for your project is structured and its taxonomy.
What's included
2 videos2 readings1 assignment
Show info about module content
2 videos•Total 27 minutes
Topic Modeling Lecture 4•18 minutes
Topic Modeling Lecture 5•9 minutes
2 readings•Total 20 minutes
Lecture Notebook Links•10 minutes
Coding Lab 1: Segmenting Data•10 minutes
1 assignment•Total 30 minutes
Lab 1 Quiz•30 minutes
Pre-Processing Text and Training a Topic Model
Module 4•2 hours to complete
Module details
In this module, we will take Amazon review data and load it into a corpus to preprocess it. We will cover how to build topic models from the data and also save those topic models.
What's included
2 videos2 readings1 peer review
Show info about module content
2 videos•Total 29 minutes
Topic Modeling Lecture 6•17 minutes
Topic Modeling Lecture 7•12 minutes
2 readings•Total 70 minutes
Lecture Notebook Links•10 minutes
Lab 2: Classification and Visualization•60 minutes
Topic Modeling Evaluation, Classification, and Neural Network Approaches
Module 5•2 hours to complete
Module details
In this module, we will learn how to evaluate the fit of topic models and use the best topic model to classify documents. We will also cover how to build topic models with pre-trained neural networks.
What's included
3 videos3 readings1 peer review
Show info about module content
3 videos•Total 37 minutes
Topic Modeling Lecture 8•15 minutes
Topic Modeling Lecture 9•9 minutes
Topic Modeling Lecture 10•13 minutes
3 readings•Total 80 minutes
Lecture Notebook Links•10 minutes
Papers (1, 2, and 3) on Topic Modeling Fit Statistics•10 minutes
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Build toward a degree
This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹
View eligible degrees
Build toward a degree
This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹
¹Successful application and enrollment are required. Eligibility requirements apply. Each institution determines the number of credits recognized by completing this content that may count towards degree requirements, considering any existing credits you may have. Click on a specific course for more information.
CU Boulder is a dynamic community of scholars and learners on one of the most spectacular college campuses in the country. As one of 34 U.S. public institutions in the prestigious Association of American Universities (AAU), we have a proud tradition of academic excellence, with five Nobel laureates and more than 50 members of prestigious academic academies.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.