In Retrieval Optimization: Tokenization to Vector Quantization, taught by Kacper Łukawski, Developer Relations Lead at Qdrant, you’ll learn how tokenization works and how to optimize vector search in large-scale, customer-facing RAG applications. You’ll explore the technical details of how vector search works and how to tune it for better performance.



Retrieval Optimization: Tokenization to Vector Quantization

Instructor: Kacper Łukawski
What you'll learn
Learn how tokenization works in large language and embedding models and how the tokenizer can affect the quality of your search.
Explore how different tokenization techniques, including Byte-Pair Encoding, WordPiece, and Unigram, are trained and how they work.
Understand how to measure the quality of your retrieval and how to optimize your search by adjusting HNSW parameters and vector quantization.
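
As a rough illustration of the last objective, the sketch below measures retrieval quality as recall@k: the share of the exact top-k results that an approximate search also returns. The approximation here is a simple 8-bit scalar quantization of the stored vectors, searched by brute force. It does not use HNSW, and it is not taken from the course or from Qdrant. All function names, the random data, and the parameter values are assumptions made for this example.

```python
import random

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def top_k(query, vectors, k):
    """Brute-force search: indices of the k vectors with the highest dot product."""
    scores = [(dot(query, v), i) for i, v in enumerate(vectors)]
    scores.sort(key=lambda s: (-s[0], s[1]))  # best score first, index breaks ties
    return [i for _, i in scores[:k]]

def scalar_quantize(vectors, levels=255):
    """Map each float to an integer code in [0, levels] using one global min/max range."""
    lo = min(min(v) for v in vectors)
    hi = max(max(v) for v in vectors)
    scale = (hi - lo) / levels or 1.0  # fall back to 1.0 if all values are equal
    codes = [[round((x - lo) / scale) for x in v] for v in vectors]
    return codes, lo, scale

def dequantize(codes, lo, scale):
    """Approximate reconstruction of the original floats from integer codes."""
    return [[c * scale + lo for c in v] for v in codes]

def recall_at_k(exact, approx):
    """Fraction of the exact top-k ids that also appear in the approximate top-k."""
    return len(set(exact) & set(approx)) / len(exact)

# Hypothetical toy data: random vectors stand in for real embeddings.
random.seed(0)
dim, n, k = 16, 200, 10
vectors = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(n)]
queries = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(20)]

codes, lo, scale = scalar_quantize(vectors)
restored = dequantize(codes, lo, scale)

# Compare search over the original vectors with search over the quantized ones.
recalls = [
    recall_at_k(top_k(q, vectors, k), top_k(q, restored, k))
    for q in queries
]
mean_recall = sum(recalls) / len(recalls)
print(f"mean recall@{k}: {mean_recall:.3f}")
```

The same recall@k function could be used to compare any approximate index against exact search, for example while varying index parameters, as long as both return lists of ids for the same queries.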

Learn, practice, and apply job-ready skills in less than 2 hours
- Receive training from industry experts
- Gain hands-on experience solving real-world job tasks

How you'll learn
Hands-on, project-based learning
Practice new skills by completing job-related tasks with step-by-step instructions.
No downloads or installation required
Access the tools and resources you need in a cloud environment.
Available only on desktop
This project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.