An overview - Machine Learning


Machine Learning (ML) is defined as the use of algorithms and computational statistics to learn from data without being explicitly programmed. It is a subsection of the artificial intelligence domain within computer science. The term Machine Learning was first coined in 1959. The recent rise in ML is due to the availability of a large number of datasets, and increased computation power
Traditional Programming
Most of you may already be familiar with traditional programming, where you start with a goal, write logical rules, and refine it through testing until it works the way you want it to. An example of traditional programming is adding two given numbers. traditional programming is also called a Rule-based approach
Machine Learning
But there are certain problems which cannot be solve by traditional programming, for example consider creating an application which classify doodles made by kinder garden kids. Using traditional programming how would you differentiate between a cat and a human? you might say cat have a tail, whiskers, four legs etc.., looks simple and intutive right? now differentiate between a cat and dog? both have four legs, and tail which makes it little difficult to differentiate. Now Imagine having hundreds of such classes how many rules should you check. To tackle such problem we use Machine Learning, ML is all about learning from examples.
Machine Learning is a specific field of Artificial Intelligence, where a system learns to find patterns in examples in order to make predictions.
In Quick Drawgoal is to create a model which classify given doodle into one of 345 predefined categories, Quick Draw Dataset is a collection of 50 million drawings across 345 categories. First we generate a hypthesis then update it by training
Flow
We first understand user requirements
Define an objective (Formulating problem)
Collect data for training and evaluating
learning algorithm generate a hypothesis function
Hypothesis is used to predict validation data if results are not satisfactory we try to enhance our hypothesis
Classification
Supervised : In supervised learning, we have access to examples of correct input and output labels that we can show to the machine during the training phase.
image classification
input is an image
output will be label like cat or dog
Stock price prediction
inputs can be time, date, company etc..,
output can be price of stock at given date
Unsupervised : In Unsupervised learning tasks find patterns in data where labels are not present. Clustering is the task of dividing the population or data points into a number of groups such that data points in the same groups are more similar to other data points in the same group and dissimilar to the data points in other groups.
Google News: In google news, articles describing same incidents by different news channels are clubbed together using clustering.
Gene sequencing
Semi-supervised : In Semi-supervised learning large amount of input data (X) is present and only some of the data is labeled (Y)
- Google News
Reinforcement Learning : Reward system and trial-and-error where goal is to maximize the long-term reward.
- Robots
Regression vs. classification
Regression: A regression model predicts continuous values. For example, regression models make predictions that answer questions like the following:
What is the value of a house in California?
What is the probability that a user will click on this ad?
Classification: A classification model predicts discrete values. For example, classification models make predictions that answer questions like the following:
Is a given email message spam or not spam?
Is this an image of a dog, a cat, or a hamster?
Examples:
Stock price prediction
Breast Cancer detection
Self Driving Cars
Speech Recognition and many more
major concerns
Bias : Bias is when our model failed to generalize, if data is not diverse model might never learn better. Suppose if we create face recognition model for gender classification using only american peoples, our model might not be able to predict gender of asians accurately.
Privacy : As machine learning models are data driven chances of data breech are high. Sensitive information such as bank transaction details and medical reports are prone to this problem.
Can we use machine learning on all problems? Answer is a definate no When to use Machine Learning? when there are no explict rules for solving the problem
References
[1]. http://cs229.stanford.edu/
[2]. https://medium.com/@randylaosat/a-beginners-guide-to-machine-learning-dfadc19f6caf
Subscribe to my newsletter
Read articles from KENNEDY NDUTHA directly inside your inbox. Subscribe to the newsletter, and don't miss out.
Written by

KENNEDY NDUTHA
KENNEDY NDUTHA
Kennedy Ndutha is a junior developer from Nairobi, Kenya, who is interested in the daily and long-term development that affects most enterprises in fulfilling their obligations. Throughout my career, I have experienced numerous development challenges that have really pushed me to the limit, but because to my skills in the development sector, I has been able to solve them with my team. This has truly motivated me to keep going and ensure that my clients are completely satisfied.