Five Popular Data Augmentation Techniques In Deep Learning


Data Augmentation techniques in deep learning

As Alan turing stated

What we wish is a machine that may study from expertise.

The machine will get extra studying expertise from feeding extra information. Specifically for deep studying fashions extra information is the important thing for constructing excessive efficiency fashions.

If we aren’t capable of feed the correct quantity of information the deep studying fashions we construct  face the underfitting difficulty, Someday the information we feed must be extra diversified, else even when we’re feeding excessive quantity information, the mannequin will face the overfitting issue.

So we’re clear now, we’d like massive quantities of information to construct deep studying fashions however not on a regular basis we may have sufficient information, 

So we are going to cease constructing the mannequin in such circumstances.

No proper, We have to discover methods to make use of the out there information, to generate extra information with extra variety. In machine studying to resolve the same sort of downside dealing with restricted information, we use the oversampling method

In the identical method for constructing deep studying fashions we use totally different information augmentation strategies to create extra significant information which can be utilized for constructing deep studying fashions.

So let’s drive additional.

Beneath are the ideas you’ll study on this article.

What’s Knowledge Augmentation?

Knowledge Augmentation is a course of of accelerating the out there restricted information to massive significant and extra variety quantities. In different phrases, we’re artificially growing the dimensions of the dataset by creating totally different models of the present information from our dataset. 

The principle motive for this, as everyone knows the true world information could not at all times be within the right kind. 

For instance, contemplate a automotive in a picture, the automotive is probably not on the middle in all circumstances, generally it may be within the left facet of the picture or proper. The picture could also be clicked on a vibrant sunny day or on a cloudy day. The picture is likely to be the left view of the automotive or the proper view. 

All these components have an effect on the mannequin whereas evaluating a picture. The mannequin must be educated in such a method that it could actually detect the item precisely no matter the above components.  

We will apply information augmentation to various kinds of information, however on this article we’re specializing in the Picture Knowledge Augmentation methods which can be utilized in widespread.

Why do we’d like Knowledge Augmentation?

Common Knowledge Augmentation methods In Deep Studying

Click on to Tweet

A lot of the state-of-the-art fashions comprise plenty of parameters within the order of tens of millions.

With a purpose to practice a mannequin for correct outcomes we have to have extra variety of parameters to study nearly all of the options from the information. To accommodate all these parameters we have to have a great quantity of information. Deep studying fashions usually require extra information which isn’t at all times out there.

“What will we do if we have now much less quantity of information or imbalance information?”

We’d like not dig in google for brand spanking new pictures. We will merely use some methods and generate pictures that are ten instances of our dataset or much more. 

In case of imbalanced data we are able to generate extra pictures for the category which has much less information.

The place will we apply Knowledge Augmentation?

We will apply this method on the time of the information era after preprocessing and earlier than coaching. 

We apply this method just for the coaching dataset. At take a look at time we use the take a look at picture straight with none transformations.

For small datasets we are able to generate the transformations of the pictures and practice the mannequin with all the information directly. For big datasets we are able to generate distinctive remodeled pictures for each batch of an epoch.

Knowledge Augmentation Strategies

Five Popular Data Augmentation techniques

5 Common Knowledge Augmentation methods

Beneath are among the  hottest information augmentation extensively utilized in deep studying.

  1. Random Rotation.
  2. Flip (Horizontal and Vertical).
  3. Zoom
  4. Random Shift
  5. Brightness

To get a greater understanding of those information augmentation methods we’re going to use a cat picture.

First step is to learn it utilizing the matplotlib library

Beneath is the code to learn the picture:

We’re going to match the picture on the ImageDataGenerator class from keras which applies the transformations and returns the information in batches.

The ImageDataGenerator wants the enter within the form of (batch_size, peak, width, channels) however the form of our picture is ( peak, width, channels).

So , let’s reshape our picture into the specified form.

We now have to create an occasion for the ImageDataGenerator and go these transformations as parameters.

Change the above code cell with the respective code cells from the under methods to use the transformations.

Now we have to go the picture to the information generator stream methodology which generates the transformations.

After that Let’s view our picture utilizing matplotlib with none augmentations.

Beneath is the loaded cat picture.

Cat image for data augmentation

Now, let’s dive into the small print of the information augmentation methods and apply them on our picture.

Random Rotation

We will rotate the picture by making use of some angle. Every rotated picture is a novel one to the mannequin. The rotation might be utilized as much as 360 levels based mostly on the item within the picture. 

For the above instance we’re making use of rotation_range = 50, which suggests the ImageGenerator considers it as a spread [-50,50] and applies some random angle from the vary to the picture.

rotation technique

Rotation approach


The picture might be flipped both horizontally or vertically based mostly on the item within the picture. 

For instance, the picture of a automotive can’t be flipped vertically because it leads to the the other way up automotive. Nevertheless, It may be flipped horizontally producing left view and proper view of a automotive.

For some objects we should always not flip it vertically because the picture could change fully. The under flip transformation is only for understanding the idea.

Horizontal Flip

horizontal flip technique

Horizontal flip approach

vertical flip technique

Vertical flip approach


The picture might be zoomed in or out with the zoom Augmentation.

ImageDataGenerator class accepts a single float worth or a listing of two values:

  • If a single worth is given then the zoom vary is [1-value, 1+value]
  • If a listing is given then one worth is taken as decrease restrict and the opposite as higher restrict.

The picture is randomly zoomed in or out throughout the given vary.

zoom technique

Zoom approach

Random Shift

The pixels of the picture might be shifted horizontally or vertically.

ImageDataGenerator class accepts two varieties of values(float and int):

  • If float worth is given then it considers the worth as share of width or peak to shift the picture. 
  • If int worth is given then it shifts the pixels of the peak or width by that worth.

Width Shift

The width_shift_range shifts the pixels horizontally both to the left or to the proper randomly.

Width shift technique

Width shift approach

Peak Shift

The height_shift_range shifts the pixels vertically both to the highest or to the underside randomly.

hight shift technique

Hight shift approach


Brightness is a vital issue when coaching the mannequin. We’re not positive that the pictures are at all times taken in higher lighting. So, our mannequin must establish the item even with the least decision. 

ImageDataGenerator class accepts a spread of values and units the brightness of a picture randomly from that vary.

Brightness technique

Brightness approach

We will apply all these transformations at a time based mostly on the context of our dataset.

Beneath is the entire code for the Knowledge Augmentation.

Full Code


Extra fashions are being educated on a regular basis with some accuracy. However solely the fashions which give correct outcomes are rewarded one of the best. The above Augmentation methods assist in generalizing the mannequin by stopping the overfitting and in flip will increase the accuracy of the mannequin.

These methods might be relevant just for the Laptop Imaginative and prescient issues with picture datasets. There are additionally methods to generate artificial information for different varieties of datasets additionally.

Strive the one which higher fits your downside and procure state-of-the-art accuracy to your fashions.

Beneficial Deep Studying programs

Deep Learning Coursera

Deep Studying Specializations

Tensorflow Course

Deep Studying With TensorFlow

Deep Learning python

Deep Studying A to Z Python Course


Source link

Write a comment