Optimize TensorFlow Models For Deployment with TensorRT

Optimize TensorFlow Models For Deployment with TensorRT

Instructor: Snehan Kekre

8,401 already enrolled

Included with Coursera Plus

Learn more

Guided Project

Learn, practice, and apply job-ready skills with expert guidance

4.5

(74 reviews)

Intermediate level

Recommended experience

1.5 hours

Learn at your own pace

Hands-on learning

Learn more

Guided Project

Learn, practice, and apply job-ready skills with expert guidance

4.5

(74 reviews)

Intermediate level

Recommended experience

1.5 hours

Learn at your own pace

Hands-on learning

Learn more

What you'll learn

Optimize Tensorflow models using TensorRT (TF-TRT)
Use TF-TRT to optimize several deep learning models at FP32, FP16, and INT8 precision
Observe how tuning TF-TRT parameters affects performance and inference throughput

Skills you'll practice

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

No downloads or installation required

Only available on desktop

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

Learn, practice, and apply job-ready skills in less than 2 hours

Receive training from industry experts
Gain hands-on experience solving real-world job tasks
Build confidence using the latest tools and technologies

About this Guided Project

This is a hands-on, guided project on optimizing your TensorFlow models for inference with NVIDIA's TensorRT. By the end of this 1.5 hour long project, you will be able to optimize Tensorflow models using the TensorFlow integration of NVIDIA's TensorRT (TF-TRT), use TF-TRT to optimize several deep learning models at FP32, FP16, and INT8 precision, and observe how tuning TF-TRT parameters affects performance and inference throughput.

Prerequisites: In order to successfully complete this project, you should be competent in Python programming, understand deep learning and what inference is, and have experience building deep learning models in TensorFlow and its Keras API. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

Learn step-by-step

In a video that plays in a split-screen with your work area, your instructor will walk you through these steps:

Introduction and Project Overview
Setup your TensorFlow and TensorRT Runtime
Load the Data and Pre-trained InceptionV3 Model
Create batched Input
Load the TensorFlow SavedModel
Get Baseline for Prediction Throughput and Accuracy
Convert a TensorFlow saved model into a TF-TRT Float32 Graph
Benchmark TF-TRT Float32
Convert to TF-TRT Float16 and Benchmark
Converting to TF-TRT INT8

Recommended experience

It is assumed that are competent in Python programming and have prior experience with building deep learning models with TensorFlow and its Keras API

7 project images

Instructor

Instructor ratings

4.5 (7 ratings)

Snehan Kekre

Coursera Project Network

11 Courses109,881 learners

Offered by

Coursera Project Network

How you'll learn

Skill-based, hands-on learning
Practice new skills by completing job-related tasks.
Expert guidance
Follow along with pre-recorded videos from experts using a unique side-by-side interface.
No downloads or installation required
Access the tools and resources you need in a pre-configured cloud workspace.
Available only on desktop
This Guided Project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

4.5

74 reviews

5 stars
68.91%
4 stars
21.62%
3 stars
5.40%
2 stars
2.70%
1 star
1.35%

Showing 3 of 74

Reviewed on Jun 14, 2023

good content, but some code is out of date, especially the package installation part.

Reviewed on Mar 14, 2022

The first to introduce such a rare and important topic.

Reviewed on Jun 3, 2021

Great workshop, all the concepts were very well explained.

View more reviews

DeepLearning.AI
Advanced Computer Vision with TensorFlow
Course
Google Cloud
Classify Images of Cats and Dogs using Transfer Learning
Project
Coursera Project Network
Object Localization with TensorFlow
Guided Project
Google Cloud
Intro to TensorFlow 日本語版
Course

New to Machine Learning? Start here.

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Learn more

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Learn more

Frequently asked questions

Because your workspace contains a cloud desktop that is sized for a laptop or desktop computer, Guided Projects are not available on your mobile device.

Guided Project instructors are subject matter experts who have experience in the skill, tool or domain of their project and are passionate about sharing their knowledge to impact millions of learners around the world.

You can download and keep any of your created files from the Guided Project. To do so, you can use the “File Browser” feature while you are accessing your cloud desktop.

Optimize TensorFlow Models For Deployment with TensorRT

What you'll learn

Skills you'll practice

Details to know

See how employees at top companies are mastering in-demand skills

Learn, practice, and apply job-ready skills in less than 2 hours