Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. Text data are unique in that they are usually generated directly by humans rather than a computer system or sensors, and are thus especially valuable for discovering knowledge about people’s opinions and preferences, in addition to many other kinds of knowledge that we encode in text.



Text Retrieval and Search Engines
This course is part of Data Mining Specialization

Instructor: ChengXiang Zhai
Access provided by Tata Communications
61,215 already enrolled
(960 reviews)
Skills you'll gain
Details to know

Add to your LinkedIn profile
14 assignments
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 7 modules in this course
You will become familiar with the course, your classmates, and our learning environment. The orientation will also help you obtain the technical skills required for the course.
What's included
2 videos6 readings2 assignments1 plugin
During this week's lessons, you will learn of natural language processing techniques, which are the foundation for all kinds of text-processing applications, the concept of a retrieval model, and the basic idea of the vector space model.
What's included
6 videos1 reading2 assignments
In this week's lessons, you will learn how the vector space model works in detail, the major heuristics used in designing a retrieval function for ranking documents with respect to a query, and how to implement an information retrieval system (i.e., a search engine), including how to build an inverted index and how to score documents quickly for a query.
What's included
6 videos1 reading2 assignments
In this week's lessons, you will learn how to evaluate an information retrieval system (a search engine), including the basic measures for evaluating a set of retrieved results and the major measures for evaluating a ranked list, including the average precision (AP) and the normalized discounted cumulative gain (nDCG), and practical issues in evaluation, including statistical significance testing and pooling.
What's included
6 videos2 readings2 assignments1 programming assignment
In this week's lessons, you will learn probabilistic retrieval models and statistical language models, particularly the detail of the query likelihood retrieval function with two specific smoothing methods, and how the query likelihood retrieval function is connected with the retrieval heuristics used in the vector space model.
What's included
7 videos1 reading2 assignments
In this week's lessons, you will learn feedback techniques in information retrieval, including the Rocchio feedback method for the vector space model, and a mixture model for feedback with language models. You will also learn how web search engines work, including web crawling, web indexing, and how links between web pages can be leveraged to score web pages.
What's included
8 videos1 reading2 assignments
In this week's lessons, you will learn how machine learning can be used to combine multiple scoring factors to optimize ranking of documents in web search (i.e., learning to rank), and learn techniques used in recommender systems (also called filtering systems), including content-based recommendation/filtering and collaborative filtering. You will also have a chance to review the entire course.
What's included
10 videos1 reading2 assignments1 programming assignment1 plugin
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Offered by
Why people choose Coursera for their career




Learner reviews
960 reviews
- 5 stars66.07% 
- 4 stars23.51% 
- 3 stars6.65% 
- 2 stars1.66% 
- 1 star2.08% 
Showing 3 of 960
Reviewed on May 18, 2020
A bit difficult to complete as the Quiz questions were tougher. But when you go through all, you might feel good.
Reviewed on Aug 23, 2018
I'd appreciate Prof.Zhai's lesson design with robust framework. BUT I really can not accept that the mistake that homework putted in wrong week and STILL NOT FIXED AFTER TWO YEARS!
Reviewed on Aug 28, 2016
A great overview of text retrieval methods. Good coverage of search engines. A longer course will cover search engine better (remember this is a 6 weeker)
Explore more from Data Science
 - The Hong Kong University of Science and Technology 
 - DeepLearning.AI 
 - University of Illinois Urbana-Champaign 
 - Simplilearn 

