Traffic Sign Classification using Residual Networks(ResNet)

Deep Residual Learning to Classify Traffic Signs

Jun 6 ·4min read

bmamIb7.jpg!web

Deep Convolutional Neural Networks(CNNs)are widely used to solve various computer vision tasks in the field of Artificial Intelligence. This article focuses on developing a deep learning model in order to recognize traffic signs. :x::no_entry_sign::no_pedestrians::no_bicycles:

Data Analysis
Create a ResNet Model
Model Training
Model Evaluation
Predictions
References

First of all, we need a dataset to train the deep learning model to recognize traffic signs. Kaggle Datasets is the best platform to find datasets for different tasks. Such as Machine Learning(ML), Deep Learning(DL), and Data Science.

Here is one of the datasets contains nearly 73,139 diverse images of traffic signs of 43 classes.

Traffic Signs Classification

Big Database of Traffic Sign Cropped(+70%)

www.kaggle.com

Data Analysis

In this section, we are going to use a simple way to analyze the dataset.

Here is a simple count plot to analyze the spread of data in classes. The below code is used to plot the graph:

yYVvymb.png!web

Countplot w.r.t to classes

Let’s visualize some of the samples from the dataset. This will help us to understand the data. The below code serves the purpose by plotting 100 images from the dataset.

R7b6viU.png!web

Images from the Dataset

Create a ResNet Model

In this section, we are going to create a deep learning model to recognize traffic signs.

Residual Network(ResNet)

Microsoft introduced the deep residual learning framework to overcome the ‘degradation’ problem which is a hard optimization task. The shortcut connections i.e., skipping one or more layers as shown in the below figure.

QbAfYrq.png!web

Skip Connection in Residual Networks

These shortcut connections perform identity mapping and the outputs are added to the outputs of stacked layers. This has solved many problems such as :

Easy to optimize
It gains accuracy from greatly increased depth, producing results that are better than previous network architectures.

For a better understanding of deep residual learning. Use the research paper entitled ‘Deep Residual Learning for Image Recognition’ which is freely available on arxiv.

Deep Residual Learning for Image Recognition

Deeper neural networks are more difficult to train. We present a residual learning framework to ease the training of…

arxiv.org

We are going to use the TensorFlow applications module which provides different popular deep learning models with pretrained weights to use.

Module: tf.keras.applications | TensorFlow Core v2.2.0

Educational resources to learn the fundamentals of ML with TensorFlow

www.tensorflow.org

We are going to use ResNet50 architecture without pretrained weights. We add the dense layer with softmax activation at the end to predict the classes. Below is used to create the model.

You can see the visualization of the model created using the plot_model method.

Model Training

These are the parameters used during the training process. The batch size as 32, epochs 50, learning rate as 0.001, loss metric ‘Categorical Cross Entropy’, optimizer as ‘Adam’. The callbacks ModelCheckpoint, EarlyStopping, ReduceLROnPlateau, and CSVLogger are used in the training of the ResNet50 model. You can use the below link for understanding the nuts and bolts of callbacks.