Machine Learning - Kernel SVM (Part 13)
Until now, we had data that we could separate with a straight line.
But what if we get data like this?
Now we can't use any straight line to separate the classes.
There are a few ways to separate such data:
We can map the data to a higher dimension:
Assume that we have some data on this line (1D). We cannot separate the two classes with a single point; we would need at least two.
But what if we define f = x - 5, so that f = 0 at x = 5?
We can now see the red crosses shift to the left, toward the origin.
Then we square it: f = (x-5)^2
This gives us a parabola.
Now, if we map all of our data onto this parabola, the classes end up at different heights and we can separate them with a line. Let's see:
You can see how we took our 1D data into 2D and then managed to find a line that separates the classes.
Remember: in 1D we need a point, in 2D we need a line, and in 3D we need a plane to separate data. In general, the separating boundary is called a hyperplane.
In the same way, we can take this 2D data into 3D and find a plane that separates it.
We would need a mapping function for that.
Then, to bring the decision boundary back from 3D to 2D, we use a projection.
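To make this concrete, here is a minimal Python sketch (the sample points and the shift of 5 are made up for illustration) of how the mapping f = (x-5)^2 turns our 1D problem into a linearly separable 2D one:

import numpy as np

# Hypothetical 1D data: red crosses in the middle, green dots on the outside
x_red = np.array([4.0, 5.0, 6.0])
x_green = np.array([1.0, 2.0, 8.0, 9.0])

# Mapping function: shift by 5, then square
def f(x):
    return (x - 5) ** 2

# In the new (x, f(x)) space, a horizontal line such as y = 2 separates the classes
print(f(x_red))    # [1. 0. 1.]  -> all below 2
print(f(x_green))  # [16.  9.  9. 16.]  -> all above 2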
Issues:
- Mapping to a higher dimension is computationally very expensive, so we would need high-end hardware.
Kernel trick:
This is the Gaussian RBF kernel: K(x, l) = e^(-||x - l||^2 / (2*sigma^2)). Note that we saw kernel='rbf' in the sklearn documentation for SVM.
Here, the red dot is the landmark (l), and for any point x we can measure its distance to the landmark.
So, in the 2D plane, we can use this Gaussian kernel.
Here, the red dot is the landmark, and sigma defines the radius of the circle within which the kernel value is significant.
If sigma increases, the circle gets wider; if it decreases, the circle shrinks.
Again, for data like this, we can use two Gaussian kernels, one landmark per cluster.
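As a rough numpy sketch (the points and landmarks below are made up), here is how the kernel value behaves:

import numpy as np

def gaussian_rbf(x, l, sigma):
    # K(x, l) = exp(-||x - l||^2 / (2 * sigma^2))
    return np.exp(-np.sum((x - l) ** 2) / (2 * sigma ** 2))

x = np.array([1.0, 1.0])
landmark = np.array([0.0, 0.0])

# A larger sigma means a wider circle: the same point gets a higher kernel value
print(gaussian_rbf(x, landmark, sigma=0.5))  # ~0.02 -> outside the circle
print(gaussian_rbf(x, landmark, sigma=2.0))  # ~0.78 -> inside the circle

# For two clusters, we can sum two kernels with different landmarks
l1, l2 = np.array([0.0, 0.0]), np.array([5.0, 5.0])
print(gaussian_rbf(x, l1, sigma=1.0) + gaussian_rbf(x, l2, sigma=1.0))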
Types of kernel functions: common choices include the Gaussian RBF kernel, the sigmoid kernel, and the polynomial kernel.
Non-Linear SVR
Assume we have data in this 2D plane.
We can't separate it with a straight line.
Let's create a box over this 2D plane and keep the plane on the ground, like this.
Now, if we use the RBF kernel, all of the data plotted in 2D gets lifted into the 3D plot on the right. In 3D, we can use a hyperplane to separate the data.
The hyperplane cuts the lifted surface along a non-linear curve; if we look at its projection on the 2D plane, we see something like this.
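Here is a rough matplotlib sketch of that lift (the Gaussian bump and the grid are illustrative, not the exact figure):

import numpy as np
import matplotlib.pyplot as plt

# Grid over the 2D plane
xx, yy = np.meshgrid(np.linspace(-3, 3, 100), np.linspace(-3, 3, 100))

# Lift each 2D point to a height given by a Gaussian bump centered at the origin
zz = np.exp(-(xx ** 2 + yy ** 2) / 2)

fig = plt.figure()
ax = fig.add_subplot(projection='3d')
ax.plot_surface(xx, yy, zz, cmap='viridis')
# A horizontal plane such as z = 0.5 slices the bump in a circle,
# which projects back to a circular (non-linear) boundary in 2D
plt.show()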
You may ask: why are we still calling this Support Vector Regression?
If we create two more parallel planes, we get an epsilon tube, and we can see some support vectors holding up that tube. Isn't this exactly how we got the idea of support vectors?
This image is shared to remind you of the epsilon-tube concept from Support Vector Regression.
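In sklearn, a non-linear SVR is simply the SVR class with the RBF kernel. A minimal sketch with made-up 1D data:

import numpy as np
from sklearn.svm import SVR

# Hypothetical non-linear data: y follows a sine curve
X = np.linspace(0, 5, 50).reshape(-1, 1)
y = np.sin(X).ravel()

# kernel='rbf' gives the non-linear (Gaussian) SVR;
# epsilon controls the width of the epsilon tube
regressor = SVR(kernel='rbf', epsilon=0.1)
regressor.fit(X, y)
print(regressor.predict([[2.5]]))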
So, that's it.
We have successfully created a non-linear SVR.
Now let's code the Kernel SVM.
Problem statement: we are launching a new SUV in the market and want to know which people, by age, are likely to buy it. We have a list of people with their age and salary, along with previous data on whether or not each of them bought an SUV before.
So, import the libraries, load the dataset, and split the data, as sketched below.
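A minimal sketch of those steps, assuming the file is named 'Social_Network_Ads.csv' with the features (age, salary) in all but the last column and the Purchased label (0/1) in the last column:

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.model_selection import train_test_split

dataset = pd.read_csv('Social_Network_Ads.csv')  # assumed filename
X = dataset.iloc[:, :-1].values  # Age and EstimatedSalary
y = dataset.iloc[:, -1].values   # Purchased (0/1)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)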
Feature-scale the X matrix data, as sketched below.
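A sketch using StandardScaler; note that sc is the same scaler object reused later in sc.transform:

from sklearn.preprocessing import StandardScaler

sc = StandardScaler()
X_train = sc.fit_transform(X_train)  # fit on the training set only
X_test = sc.transform(X_test)        # apply the same scaling to the test set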
Training the Kernel SVM model on the Training set
from sklearn.svm import SVC
classifier = SVC(kernel = 'rbf', random_state = 0)
We have multiple options for the kernel, and here we are setting it to 'rbf'.
Check out the documentation
classifier.fit(X_train, y_train)
Predicting a new result
Predicting for a 30-year-old person with a salary of 87,000:
print(classifier.predict(sc.transform([[30,87000]])))
Predicting the Test set results
y_pred = classifier.predict(X_test)
print(np.concatenate((y_pred.reshape(len(y_pred),1), y_test.reshape(len(y_test),1)),1))
Now we can see our predictions and the test labels side by side.
Making the Confusion Matrix & accuracy
from sklearn.metrics import confusion_matrix, accuracy_score
cm = confusion_matrix(y_test, y_pred)
print(cm)
print(accuracy_score(y_test, y_pred))
We can see 93% accuracy here.
Visualizing the Training set results
You can see a non-linear boundary separating the data well.
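These plots are drawn by coloring the classifier's prediction over a fine mesh grid. A minimal sketch of that idea (the grid step and colors are illustrative; the same code with X_test and y_test produces the test-set plot):

import numpy as np
import matplotlib.pyplot as plt
from matplotlib.colors import ListedColormap

X_set, y_set = X_train, y_train
X1, X2 = np.meshgrid(
    np.arange(X_set[:, 0].min() - 1, X_set[:, 0].max() + 1, 0.01),
    np.arange(X_set[:, 1].min() - 1, X_set[:, 1].max() + 1, 0.01))

# Color each grid point by the class the classifier predicts there
plt.contourf(X1, X2,
             classifier.predict(np.array([X1.ravel(), X2.ravel()]).T).reshape(X1.shape),
             alpha=0.75, cmap=ListedColormap(('red', 'green')))

# Overlay the actual points on top of the colored regions
for i, j in enumerate(np.unique(y_set)):
    plt.scatter(X_set[y_set == j, 0], X_set[y_set == j, 1],
                color=ListedColormap(('red', 'green'))(i), label=j)
plt.title('Kernel SVM (Training set)')
plt.xlabel('Age (scaled)')
plt.ylabel('Estimated Salary (scaled)')
plt.legend()
plt.show()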
Visualizing the Test set results
Done!
Here is the code
Here is the dataset
Thank you