Assume you want to classify a vehicle image into 12 category name, Accord, Altima, Camry, Corolla, Elantra, Freed, HR-V, Insight, Jazz, Prius, Roadster, S2000. When you input an RGB image with 128 * 128 pixels, the predicted outcome is one dimensional array of 12 elements which is in between 0 to 1. The neural networks are trained so the target values of an output array is below. This is an example of the target value when the input image is 'Jazz'.
Let's input a 'Jazz' image below to a trained deep learning model.
First of all, prepare images for training. The directory structure is like this. All images in the subfolder is RGB images with 128 * 128 pixels.
The model that I trained can be described like below by using this code. It shows how each layers transform input arrays.