Building and deploying the inferencing application
Before we dive into the inferencing application, let’s look at its components. Our aim is to capture video from a camera, such as the one on your laptop, and send it to the application, which will call your model to check whether a face has been detected.
The video-capturing application (we call it the frontend) will capture the video and send every tenth frame to the server over HTTP as a 256-by-256 image array. The server (or the backend application) will receive the frame and make an inference call to the model. The backend service will also keep a Redis-based counter: when a face is detected, it will increment the face counter in the Redis database. The backend service will also expose a second HTTP endpoint for reading the value of the counter, which the frontend will then display. Conceptually, it looks as in the...
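To make the flow concrete, here is a minimal sketch of the logic described above, written so that it runs without a camera, a model server, or Redis. Everything in it is an assumption made for illustration: the names sample_frames, InMemoryCounter, and Backend are hypothetical, the in-memory counter merely stands in for the Redis counter, the detector is passed in as a plain function in place of the real inference call, and "every tenth frame" is taken to mean the 10th, 20th, 30th frame, and so on.

```python
from typing import Callable, Iterable, Iterator, List

FRAME_INTERVAL = 10  # the frontend sends every tenth frame
FRAME_SIZE = 256     # each frame is a 256-by-256 image array

# Hypothetical stand-in for an image array (rows of pixel values).
Frame = List[List[int]]


def sample_frames(frames: Iterable[Frame],
                  interval: int = FRAME_INTERVAL) -> Iterator[Frame]:
    """Frontend side: yield only every `interval`-th captured frame."""
    for index, frame in enumerate(frames, start=1):
        if index % interval == 0:
            yield frame


class InMemoryCounter:
    """Stand-in for the Redis counter (assumption: increment and read only)."""

    def __init__(self) -> None:
        self._value = 0

    def incr(self) -> int:
        self._value += 1
        return self._value

    def get(self) -> int:
        return self._value


class Backend:
    """Backend side: the work behind the two HTTP endpoints."""

    def __init__(self, detect_face: Callable[[Frame], bool],
                 counter: InMemoryCounter) -> None:
        self._detect_face = detect_face  # stands in for the model call
        self._counter = counter

    def handle_frame(self, frame: Frame) -> bool:
        """Run inference on one frame; count it if a face is found."""
        found = self._detect_face(frame)
        if found:
            self._counter.incr()
        return found

    def read_count(self) -> int:
        """Return the current face count for the frontend to display."""
        return self._counter.get()
```

In the real application, the frontend would post each sampled frame to the backend over HTTP, handle_frame would forward it to the deployed model, and the counter would live in Redis, so only the control flow shown here carries over.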