Scroll to top of page

Machine learning in a nutshell, part 2: Predicting future behaviour based on past data, with supervised learning

In our first article of this series dedicated to machine learningmachine learningMachine learning is an artificial intelligence system that is based on the learning ability of algorithms. As this learning process relies on the repetition of an action, the accuracy of the results produced by machine learning algorithms improves over time.Learn more, we saw that the principle goal of machine learning is to automate common tasks, with the help of computers. To do so, machine learning algorithmalgorithmAn algorithm is a mathematical process designed to solve a problem or to obtain a result, using a finite number of operations. It can be translated into a computer program thanks to a programming language.Learn mores try to mimic human learning based on a mathematical model (if you’re just joining us, you can catch up here).

Machine learning solutions can fall into 3 categories: supervised learning, unsupervised learning, and reinforcement learning. Let’s address the first one.

An introductory example, or why there’s no magic in machine learning

“Predicting future data” is a concept that underlies supervised learning, often misunderstood due to the word “prediction”. If someone tells you: “in my hand I have a green fruit, rather round, with a diameter of 7.2 cm, a weight of 152 grams, that has seeds and a stem”, you are likely to assume it is an apple. You have just predicted future data, based on your past observation of your parents implicitly teaching you how to identify an apple, in your childhood. Descriptors such as weight, shape, and colour are called “observable variables”, and the fact that this is an apple is the “target”. If you feed a machine learning algorithm these variables and explicitly give it the name of the fruit, this is supervised information. Then, take a fruit that you do not know the name of, gather information and give it to your model. The model will predict the most likely name based on what it has learned. But the model cannot know the unknown: if you only provide it with data about apples and pears, and then show it a banana, it will never guess “banana”! However, if you give it 10 million new apples and pears, it will give you the correct answer in just a second.

Important applications for digital marketing – if the data fits

Supervised learning can solve plenty of critical issues as long as you gather a large enough history of supervised information. This history can come from search engines (showing the best results following a specific query, bidding the right amount on a search query, etc.) and recommender systems (think Netflix recommendations), to pricing or advertising attributionattributionDigital attribution refers to a set of methods whose purpose is to reconstruct the digital journey that has led a client to conversion. This process aims to assess the efficiency of each of the channels used during a marketing campaign.Learn more modeling. For instance, you might want to determine sociodemographic information about users in order to adjust your message for a given target. Based on a large enough quantity of labeled information (real user sociodemographic data), supervised learning can learn rules between target and web navigation in order to infer the value for the rest of the population.

Supervised learning is also often used to predict a user’s action: will she/he buy a product in the near future? What is her/his probability of churn? Looking at a long purchase or churn history, an algorithm could learn the rule that explains the target, but only if meaningful information were available. Indeed, if 90% of users churn because of a hotline service, but the hotline service is not recorded, it is logical that the algorithm will not be able to achieve a good performance.

Thus, before dedicating months of work to a machine learning project, remember that there’s no magic to it: the algorithm learns rules to link variables to a target, just like a human would do. Start thinking about how you could try to explain the target of interest, and check if the variables you would use are available. If not, try to gather this information before spending too much time exploring your machine learning model.

Want to learn more?
Stay tuned, unsupervised learning and reinforcement learning will be addressed in upcoming articles!

Want to learn more? Get in touch!

26-01-2018

close legal

À propos

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec a venenatis dolor, non ornare ligula. Nam ultricies elementum tellus, sed pulvinar libero egestas nec. Fusce facilisis nulla vestibulum, commodo neque eget, dapibus lacus. Aliquam neque felis, sagittis nec consequat sed, commodo ac ipsum. Sed neque tortor, semper quis viverra et, malesuada et eros. Donec at dui ut ligula pharetra aliquet. Etiam dapibus semper orci. Integer efficitur dolor tortor, nec mattis elit placerat vel. Ut nulla enim, lacinia in pharetra id, convallis vitae massa. Donec neque est, tincidunt non ullamcorper commodo, tincidunt non turpis. Pellentesque viverra enim a sapien placerat, ut volutpat mauris condimentum. Proin tincidunt sollicitudin dui, sit amet condimentum ante commodo a. Aenean posuere aliquam purus, sed aliquam magna sagittis finibus. Morbi molestie feugiat feugiat. Phasellus tempus in dolor vel maximus. Cras efficitur sagittis lorem porta iaculis. Maecenas sed hendrerit urna. In mattis posuere purus, sit amet placerat arcu posuere quis. Etiam nec arcu nec magna interdum maximus. Integer sit amet lacus neque. Curabitur interdum molestie magna, in scelerisque tellus iaculis sed. Sed nec metus ut purus efficitur laoreet a quis eros. Proin dui dui, dignissim eget risus sit amet, bibendum condimentum velit. Maecenas in justo eu elit eleifend consectetur. Aenean scelerisque fringilla sollicitudin. Nam sem nibh, pharetra nec lacus non, mollis interdum odio. Aliquam sollicitudin posuere nibh sed eleifend.

Édition

55 SAS, 5 — 7 rue d'Athènes

75009 Paris

+33 1 76 21 91 37

Hébergement

OVH SAS

2, rue Kellermann

59100 Roubaix

+33 8 20 69 87 65

Publication

Lan Anh Vu Hong

Crédits photo

Mats Carduner, Adobe Stock & Unsplash

Vous avez aimé nos nouvelles fraîches sur l'état du marché brandtech ? Inscrivez vous à notre newsletter