You're reading from Learn TensorFlow Enterprise Build, manage, and scale machine learning workloads seamlessly using Google's TensorFlow Enterprise

Product type Paperback

Published in Nov 2020

Publisher Packt

ISBN-13 9781800209145

Length 314 pages

Edition 1st Edition

Languages

Python

Tools

TensorFlow

Concepts

Artificial Intelligence

Author (1):

KC Tung

View More author details

Table of Contents (15) Chapters

Preface

1. Section 1 – TensorFlow Enterprise Services and Features

2. Chapter 1: Overview of TensorFlow Enterprise FREE CHAPTER

3. Chapter 2: Running TensorFlow Enterprise in Google AI Platform

4. Section 2 – Data Preprocessing and Modeling

5. Chapter 3: Data Preparation and Manipulation Techniques

6. Chapter 4: Reusable Models and Scalable Data Pipelines

7. Section 3 – Scaling and Tuning ML Works

8. Chapter 5: Training at Scale

9. Chapter 6: Hyperparameter Tuning

10. Section 4 – Model Optimization and Deployment

11. Chapter 7: Model Optimization

12. Chapter 8: Best Practices for Model Training and Performance

13. Chapter 9: Serving a TensorFlow Model

14. Other Books You May Enjoy

Leave a review - let other readers know what you think

Converting a full model to an integer quantization model

This strategy requires TensorFlow 2.3. This quantization strategy is suitable for an environment where compute resources are really constrained, or where the compute node only operates in integer mode, such as edge devices or TPUs. As a result, all parameters are changed to int8 representation. This quantization strategy will try to use int8 representation for all ops or operations as the goal. When this is not possible, the ops are left as the original precision (in other words, float32).

This quantization strategy requires some representative data. This data represents the type of data that the model typically expects in terms of a range of values. In other words, we need to provide either some training or validation data to the integer quantization process. This may be the data already used, such as a subset of the training or validation data. Usually, around 100 samples are recommended. We are going to use 80 samples...