Optimize TensorFlow Models For Deployment with TensorRT

4.6
stars

63 ratings

Offered By

4,145 already enrolled

In this Free Guided Project, you will:

Optimize Tensorflow models using TensorRT (TF-TRT)

Use TF-TRT to optimize several deep learning models at FP32, FP16, and INT8 precision

Observe how tuning TF-TRT parameters affects performance and inference throughput

Showcase this hands-on experience in an interview

1.5 hours
Intermediate
No download needed
Split-screen video
English
Desktop only

This is a hands-on, guided project on optimizing your TensorFlow models for inference with NVIDIA's TensorRT. By the end of this 1.5 hour long project, you will be able to optimize Tensorflow models using the TensorFlow integration of NVIDIA's TensorRT (TF-TRT), use TF-TRT to optimize several deep learning models at FP32, FP16, and INT8 precision, and observe how tuning TF-TRT parameters affects performance and inference throughput. Prerequisites: In order to successfully complete this project, you should be competent in Python programming, understand deep learning and what inference is, and have experience building deep learning models in TensorFlow and its Keras API. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

Requirements

It is assumed that are competent in Python programming and have prior experience with building deep learning models with TensorFlow and its Keras API

Skills you will develop

  • Deep Learning

  • NVIDIA TensorRT (TF-TRT)

  • Python Programming

  • Tensorflow

  • keras

Learn step-by-step

In a video that plays in a split-screen with your work area, your instructor will walk you through these steps:

  1. Introduction and Project Overview

  2. Setup your TensorFlow and TensorRT Runtime

  3. Load the Data and Pre-trained InceptionV3 Model

  4. Create batched Input

  5. Load the TensorFlow SavedModel

  6. Get Baseline for Prediction Throughput and Accuracy

  7. Convert a TensorFlow saved model into a TF-TRT Float32 Graph

  8. Benchmark TF-TRT Float32

  9. Convert to TF-TRT Float16 and Benchmark

  10. Converting to TF-TRT INT8

How Guided Projects work

Your workspace is a cloud desktop right in your browser, no download required

In a split-screen video, your instructor guides you step-by-step

Reviews

TOP REVIEWS FROM OPTIMIZE TENSORFLOW MODELS FOR DEPLOYMENT WITH TENSORRT

View all reviews

Frequently Asked Questions

Because your workspace contains a cloud desktop that is sized for a laptop or desktop computer, Guided Projects are not available on your mobile device.

Guided Project instructors are subject matter experts who have experience in the skill, tool or domain of their project and are passionate about sharing their knowledge to impact millions of learners around the world.

You can download and keep any of your created files from the Guided Project. To do so, you can use the “File Browser” feature while you are accessing your cloud desktop.

At the top of the page, you can press on the experience level for this Guided Project to view any knowledge prerequisites. For every level of Guided Project, your instructor will walk you through step-by-step.

Yes, everything you need to complete your Guided Project will be available in a cloud desktop that is available in your browser.

You'll learn by doing through completing tasks in a split-screen environment directly in your browser. On the left side of the screen, you'll complete the task in your workspace. On the right side of the screen, you'll watch an instructor walk you through the project, step-by-step.