Learners will gain the skills to serve powerful language models as practical and scalable web APIs. They will learn how to use the llama.cpp example server to expose a large language model through a set of REST API endpoints for tasks like text generation, tokenization, and embedding extraction.

Beginning Llamafile for Local Large Language Models (LLMs)


Beginning Llamafile for Local Large Language Models (LLMs)


Instructors: Noah Gift
Access provided by Inter IKEA
Gain insight into a topic and learn the fundamentals.
Beginner level
Recommended experience
3 hours to complete
Flexible schedule
Learn at your own pace
What you'll learn
Learn how to serve large language models as production-ready web APIs using the llama.cpp framework
Understand the architecture and capabilities of the llama.cpp example server for text generation, tokenization, and embedding extraction
Gain hands-on experience in configuring and customizing the server using command line options and API parameters
Skills you'll gain
Details to know

Shareable certificate
Add to your LinkedIn profile
Assessments
4 assignments
Taught in English
See how employees at top companies are mastering in-demand skills

There is 1 module in this course
This week, you run language models locally. Keep data private. Avoid latency and fees. Use Mixtral model and llamafile.
What's included
8 videos19 readings4 assignments1 discussion prompt4 ungraded labs
Offered by
Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Explore more from Computer Science
Duke University
University of Michigan
Duke University