Signature Verification with deep learning / transfer learning using Keras and KNIME

In the previous posts we used traditional machine learning methods as well as deep learning in Python and KNIME to detect credit card fraud. In this post we will see how to use a pretrained deep neural network to classify images of offline signatures as genuine or forged. A neural network like this could support experts in fighting cheque fraud. We will use the VGG16 network architecture pretrained on ImageNet. The technique we are going to apply is called transfer learning and allows us to use a deep network even when we have only limited data, as in our case. The idea for this post is based on the paper ‘Offline Signature Verification with Convolutional Neural Networks’ by Gabe Alvarez, Blue Sheffer and Morgan Bryant. We combine native KNIME features with our own extension in Python, using Keras for the transfer learning.

We will use the same data source as the paper for our training set: the signature collection of the ICDAR 2011 Signature Verification Competition (SigComp2011), which contains offline and online signature samples. The offline dataset comprises PNG images scanned at 400 dpi in RGB color. The data can be downloaded from http://www.iapr-tc11.org/mediawiki/index.php/ICDAR_2011_Signature_Verification_Competition_(SigComp2011) .

Data Preparation

The offline signature images all have different sizes, so in the first step we resize them to a common size of 150×150 and divide all pixel values by 255 to scale our features into the [0, 1] range.

With the List Files node we filter all images (PNGs) in our training set folder.

In the next step (Rule node) we mark the first 123 images as forged or fraudulent and the remaining images as genuine signatures.

In the following metanode we read all images from the files using the Image Reader (Table) node and then standardise the features and resize all images.
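
In plain Python, this preprocessing could look roughly like the sketch below (a rough analogue using Pillow and NumPy; the folder path is hypothetical, the 123-forgeries convention comes from the step above):

```python
# Rough Python analogue of the KNIME preprocessing metanode.
import glob
import numpy as np
from PIL import Image

files = sorted(glob.glob('training_set/*.png'))  # hypothetical folder

images = []
for path in files:
    img = Image.open(path).convert('RGB').resize((150, 150))  # unify size
    images.append(np.asarray(img, dtype=np.float32) / 255.0)  # scale to [0, 1]

x = np.stack(images)                                # shape: (n, 150, 150, 3)
y = np.array([1] * 123 + [0] * (len(files) - 123))  # 1 = forged, 0 = genuine
```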

In the next step we perform an 80/20 split into training and test data and apply a data augmentation step (using the Keras ImageDataGenerator to create more training data), which finalises our data preparation. We will cover the data augmentation in one of the coming posts.
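
As a sketch in plain Python (the 80/20 split is from the text; the augmentation parameters below are assumptions, chosen only for illustration):

```python
# 80/20 train/test split plus on-the-fly data augmentation with Keras.
from sklearn.model_selection import train_test_split
from keras.preprocessing.image import ImageDataGenerator

x_train, x_test, y_train, y_test = train_test_split(
    x, y, test_size=0.2, random_state=42)

# ImageDataGenerator yields randomly transformed copies of the training
# images, effectively enlarging the training set.
datagen = ImageDataGenerator(rotation_range=10,
                             width_shift_range=0.1,
                             height_shift_range=0.1,
                             zoom_range=0.1)
train_flow = datagen.flow(x_train, y_train, batch_size=32)
```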

Deep Learning Network

We will load the weights of a VGG16 network pretrained on the ImageNet classification task using Keras Applications, strip off the last layers, replace them with our new dense layers, and then train and fine-tune the parameters on our data. Instead of training the complete network, which would require a lot more data and training time, we use the pretrained weights and leverage the previous experience of the network. We only train the last layers on our classification task.

Step 1: Load the pretrained network

We use the DL Python Network Creator Node and need to write a few lines of Python code to load the network. We are using Keras, which will automatically download the weights.
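
The script inside the node could look roughly like this (`output_network` is the variable name the KNIME node expects; treat the details as a sketch):

```python
# Load VGG16 with ImageNet weights, without the fully connected top layers,
# matching our 150x150 RGB inputs. Keras downloads the weights on first use.
from keras.applications.vgg16 import VGG16

output_network = VGG16(weights='imagenet',
                       include_top=False,
                       input_shape=(150, 150, 3))
```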

Step 2: Replace Top Layers and freeze weights

In the next step we modify the network: we add three dense layers with dropout and ReLU activations, plus a sigmoid activation at the end, and freeze the weights of the original VGG16 network. Again we need to write a few lines of Python code:
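
A minimal sketch of that script (the original code is only visible as a screenshot, so the layer sizes and dropout rates here are assumptions):

```python
# Freeze the VGG16 base and add a new dense classification head.
from keras.models import Model
from keras.layers import Flatten, Dense, Dropout

base = input_network  # the pretrained network handed in by KNIME

for layer in base.layers:
    layer.trainable = False  # freeze the original VGG16 weights

x = Flatten()(base.output)
x = Dense(256, activation='relu')(x)
x = Dropout(0.5)(x)
x = Dense(128, activation='relu')(x)
x = Dropout(0.5)(x)
out = Dense(1, activation='sigmoid')(x)  # genuine vs. forged

output_network = Model(inputs=base.input, outputs=out)
```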

Step 3: Train Top Layers

Then we train the new layers for 5 epochs with the Keras Network Learner Node:
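
The node is configured in the KNIME UI; in plain Keras the equivalent training step would look roughly like this (the optimizer choice is an assumption):

```python
# Train only the new top layers for 5 epochs on the augmented training data.
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
model.fit_generator(train_flow, steps_per_epoch=len(x_train) // 32, epochs=5)
```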

Steps 4 and 5: Unfreeze and fine-tune

We modify the resulting network, unfreezing the last layers of the VGG16 network (3 layers) to fine-tune the pretrained weights, and train the network for another 10 epochs.
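
A sketch of these two steps in plain Keras (which three layers are unfrozen is an assumption; here the three convolutional layers of VGG16's last block, `block5`):

```python
# Unfreeze the last convolutional block of VGG16 for fine-tuning.
from keras.optimizers import Adam

for layer in model.layers:
    if layer.name.startswith('block5_conv'):
        layer.trainable = True

# Recompile with a low learning rate so fine-tuning does not destroy the
# pretrained weights, then train for another 10 epochs.
model.compile(optimizer=Adam(lr=1e-5), loss='binary_crossentropy',
              metrics=['accuracy'])
model.fit_generator(train_flow, steps_per_epoch=len(x_train) // 32, epochs=10)
```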

Apply Network and Test Results

In the last step we apply the network to the test data, convert the predicted probability into a class (p > 0.5 -> forged) and calculate the confusion matrix and the ROC curve with its AUC.
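
In plain Python the scoring step could look like this sketch (using scikit-learn for the metrics):

```python
# Predict, threshold at 0.5 and compute confusion matrix and AUC.
from sklearn.metrics import confusion_matrix, roc_auc_score

probs = model.predict(x_test).ravel()  # predicted probability of 'forged'
preds = (probs > 0.5).astype(int)      # p > 0.5 -> forged (class 1)

print(confusion_matrix(y_test, preds))
print('AUC:', roc_auc_score(y_test, probs))
```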

Confusion Matrix:

ROC Curve (AUC):

The results look very impressive, actually a bit too good. We may be overfitting, since the test data can contain signatures (genuine and forged) from the same reference authors as the training data (there are only 10 reference authors in the complete training set). The authors of the paper found that the performance on signatures of persons whose signatures have been seen during training is very good, but that on unseen authors it is only a little better than a naive baseline. In one of the next posts we will check the network's performance on new, unseen signatures and try to train and test the model on the SigComp2009 data, which contains signatures of 100 authors. We will also look at the data augmentation in detail and perhaps compare this network to other network architectures.

So long…

