{"id":383,"date":"2023-08-30T17:21:58","date_gmt":"2023-08-30T15:21:58","guid":{"rendered":"http:\/\/james@data-cubed.co.uk"},"modified":"2023-08-30T17:21:58","modified_gmt":"2023-08-30T15:21:58","slug":"optimise-your-business-with-machine-learning-2","status":"publish","type":"post","link":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/","title":{"rendered":"Optimise Your Business with Machine Learning: How Random Forest models can help you make predictions"},"content":{"rendered":"\n

Dom Owens is a Data Science Intern at Data Cubed. In this blog post he explains how Random Forest models work, and how machine learning can generate value for your business.<\/em><\/p>\n\n\n\n


\n\n\n\n

Rise of the machines<\/strong><\/h2>\n\n\n\n
<\/div>\n\n\n\n

If you could predict something about your business, what would it be? <\/p>\n\n\n\n

In recent years, there\u2019s been an explosion in computing power. At the same time, there\u2019s been a vast increase in the amount of data being collected. Combined, these trends give businesses the potential to use machine learning techniques to pick up on patterns hidden inside data. These patterns can then help us make predictions, which are valuable information for a business in a rapidly changing industry.<\/p>\n\n\n\n

Think of any piece of information that you\u2019d really like to know but don\u2019t yet: how long it takes for a subscriber to unsubscribe, the value a new customer might bring to your business over their lifetime, or the sales figures of a new offer you\u2019re thinking of launching. All of these are valuable pieces of information when you\u2019re trying to plan for the future.<\/p>\n\n\n\n

Machine learning techniques can not only help you make predictions, but can also tell you how accurate those predictions are likely to be and how the other information you have influences them.<\/p>\n\n\n\n


\n\n\n\n

Exploring<\/strong> the Random Forest<\/h2>\n\n\n\n
<\/div>\n\n\n\n

Suppose you have a large amount of data with lots of measurements and you want to know one of two things: a number (say, the number of units a product will sell) or a class (the newspaper that a customer will buy).<\/p>\n\n\n\n

The Random Forest algorithm, a set of rules for making decisions, can handle both situations elegantly and typically gives accurate, robust predictions.
<\/p>\n\n\n\n

Let\u2019s start from the roots. You\u2019re probably familiar with the concept of a decision tree as a way to find the answer to a problem. We ask a question that has two possible answers \u2014 often yes or no \u2014 and follow the answer to another question. We repeat the process until we get a final answer that we\u2019re satisfied with.<\/p>\n\n\n\n

A Random Forest is built from decision trees like this. To grow each tree, the algorithm asks a question of the data to split the current branch in two. At each split it considers a small, randomly chosen group of questions and picks the one that best separates the outcomes, so that the observations within each resulting branch are as similar to one another as possible.<\/p>\n\n\n\n
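As a rough illustration of how one candidate question can be scored against another, the sketch below uses Gini impurity, a common measure of how mixed the outcomes in a branch are. The post does not say which measure the model uses, so this is an assumption, and the customer fields, thresholds and labels are hypothetical toy values.

```python
from collections import Counter

def gini(labels):
    # Gini impurity: 0.0 when every label agrees, larger when classes are mixed.
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def split_score(rows, labels, feature, threshold):
    # Weighted impurity of the two branches produced by asking
    # 'is this feature at most the threshold?'. Lower is better.
    left = [y for x, y in zip(rows, labels) if x[feature] <= threshold]
    right = [y for x, y in zip(rows, labels) if x[feature] > threshold]
    n = len(labels)
    return len(left) / n * gini(left) + len(right) / n * gini(right)

def best_question(rows, labels, candidates):
    # candidates is the small group of (feature, threshold) questions on offer.
    return min(candidates, key=lambda c: split_score(rows, labels, c[0], c[1]))

# Hypothetical toy data: four customers and whether they stayed.
rows = [
    {'monthly_charge': 30, 'contract_months': 24},
    {'monthly_charge': 85, 'contract_months': 1},
    {'monthly_charge': 90, 'contract_months': 1},
    {'monthly_charge': 40, 'contract_months': 12},
]
labels = ['stays', 'churns', 'churns', 'stays']
candidates = [('monthly_charge', 35), ('monthly_charge', 60), ('contract_months', 6)]

best = best_question(rows, labels, candidates)
print(best)
```

In a real forest the group of candidate questions is drawn at random at every split, which is part of what makes the trees differ from one another.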

This gives us a tool for making predictions. A new set of data can be fed through the decision tree, which outputs a prediction for each new observation. However, a single decision tree can veer far off course if it gets caught on irregularities in the data, leading to less accurate predictions.<\/p>\n\n\n\n

Thankfully, we can remedy this by using the wisdom of crowds. We draw a random sample from the original data (sampling with replacement, known as a bootstrap sample) and grow a tree to fit it. Then, with a new sample each time, we grow many more trees, each capable of making predictions for new data. Since each tree fits different data, and each split considers only a random selection of the available questions, each tree will be different and may give a different prediction for the same observation.<\/p>\n\n\n\n

We then combine the predictions from all the trees: the most popular vote for a class, or the mean for a predicted number. The combined prediction is usually more accurate than that of any single tree.<\/p>\n\n\n\n
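The resampling and combining steps can be sketched in a few lines of plain Python. This is a minimal illustration, not the modelling tool itself: the function names are hypothetical, and the tree outputs are made-up values standing in for what fitted trees would return.

```python
import random
from collections import Counter
from statistics import mean

def bootstrap_sample(data, rng):
    # Draw as many observations as the original data holds, with replacement,
    # so some observations repeat and others are left out.
    return [rng.choice(data) for _ in data]

def combine_classes(predictions):
    # For a class, take the most popular vote across the trees.
    return Counter(predictions).most_common(1)[0][0]

def combine_numbers(predictions):
    # For a number, take the mean of the tree predictions.
    return mean(predictions)

rng = random.Random(0)  # fixed seed so the sketch is repeatable
data = list(range(10))  # stand-in for ten observations
sample = bootstrap_sample(data, rng)

# Made-up outputs from five trees for one new customer.
tree_votes = ['churns', 'stays', 'churns', 'churns', 'stays']
tree_estimates = [120, 135, 128, 140, 127]

print(combine_classes(tree_votes))
print(combine_numbers(tree_estimates))
```

Because every tree is grown on a different sample, their individual mistakes tend to differ, and voting or averaging smooths them out.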

We can even see which factors are important in influencing how the trees make their decisions. Perhaps product sales depend heavily on the weather conditions and your website traffic, but less so on how much is spent on marketing the product.<\/p>\n\n\n\n


\n\n\n\n

Let’s look at an example<\/strong><\/h2>\n\n\n\n
<\/div>\n\n\n\n

Let\u2019s consider a telecoms business using this example dataset<\/a>, and see if we can predict how new customers will behave. The dataset contains information on previous customers, such as their gender, the type of contract they hold, and the charges they\u2019ve incurred, as well as whether they\u2019re no longer a customer. Being able to predict customer churn can be very useful for making decisions about pricing and marketing. <\/p>\n\n\n\n

The Error Plot tells us that, as we fit more trees, the model becomes more accurate. This shows us that taking the average of multiple trees is really valuable.<\/p>\n\n\n\n

The Variable Importance plot shows us how important the information we\u2019re measuring is. Larger readings correspond to more importance. We can see that charges and type of contract are most important, while the customer\u2019s gender and whether they have a partner are least important.<\/p>\n\n\n\n

When we try out the model on new data, it gets predictions right around 80% of the time. It does well at identifying customers who don\u2019t churn, but less well at identifying customers who do. Given that 73% of the customers in the dataset did not churn, this is a useful improvement on randomly guessing whether a customer churns based on population percentages. We might improve on this by collecting more data, recording more information for each customer, or adjusting the settings of the model algorithm.<\/p>\n\n\n\n
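To put the 80% figure in context, the arithmetic below works out two simple baselines for a 73% to 27% split between the two outcomes. The function names are hypothetical, and the calculation assumes guesses are made independently of the true outcome.

```python
def proportional_guess_accuracy(p_majority):
    # Guess each outcome at random, in proportion to how often it occurs.
    # A guess is right when it happens to match the true outcome.
    p_minority = 1.0 - p_majority
    return p_majority ** 2 + p_minority ** 2

def majority_guess_accuracy(p_majority):
    # Always predict the more common outcome.
    return p_majority

p = 0.73
random_baseline = proportional_guess_accuracy(p)
majority_baseline = majority_guess_accuracy(p)
model_accuracy = 0.80  # the figure reported in the post

print(round(random_baseline, 4), majority_baseline, model_accuracy)
```

Guessing in proportion to the population is right about 61% of the time and always predicting the more common outcome is right 73% of the time, so 80% improves on both.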

Many other problems can be solved in a similar way, which brings us back to our original question \u2014 if you could predict something about your business, what would it be?
<\/p>\n\n\n\n


\n\n\n\n

We\u2019ve developed a new predictive modelling tool using Random Forest algorithms, which we\u2019re using to help our clients peer into the future. Our new tool allows us to identify your most valuable clients now, and then lets us go one step further to predict which clients will be the most valuable in the future. We can predict other things too, such as customer churn, or sales figures for a new offer. And we can show you the results within days. <\/p>\n\n\n\n

We\u2019re really excited about this technology. If you are too and would like to know more, email us at <\/strong>hello@data-cubed.co.uk<\/strong><\/a> or call us on 0117 25 10 100.<\/strong>
<\/p>\n","protected":false},"excerpt":{"rendered":"

Dom Owens is a Data Science Intern at Data Cubed. In this blog post he explains how Random Forest models work, and how machine learning can generate value for your business. Rise of the machines If you could predict something about your business, what would it be? In recent years, there\u2019s been an explosion in […]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[11],"tags":[25,80,88,89],"acf":[],"yoast_head":"\nOptimise Your Business with Machine Learning - DATA\u00b3 Central Europe<\/title>\n<meta name=\"description\" content=\"Optimise Your Business with Machine Learning. If you could predict something about your business, what would it be?\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Optimise Your Business with Machine Learning - DATA\u00b3 Central Europe\" \/>\n<meta property=\"og:description\" content=\"Optimise Your Business with Machine Learning. 
If you could predict something about your business, what would it be?\" \/>\n<meta property=\"og:url\" content=\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/\" \/>\n<meta property=\"og:site_name\" content=\"DATA\u00b3 Central Europe\" \/>\n<meta property=\"article:published_time\" content=\"2023-08-30T15:21:58+00:00\" \/>\n<meta name=\"author\" content=\"James\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"James\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/\"},\"author\":{\"name\":\"James\",\"@id\":\"https:\/\/data-cubed.eu\/#\/schema\/person\/81d0c18d92f2845d64d1a83fdae520b3\"},\"headline\":\"Optimise Your Business with Machine Learning: How Random Forest models can help you make predictions\",\"datePublished\":\"2023-08-30T15:21:58+00:00\",\"dateModified\":\"2023-08-30T15:21:58+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/\"},\"wordCount\":982,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/data-cubed.eu\/#organization\"},\"keywords\":[\"Business Optimisation\",\"Machine Learning\",\"Predictive Analytics\",\"Random 
Forest\"],\"articleSection\":[\"Blog\"],\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/\",\"url\":\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/\",\"name\":\"Optimise Your Business with Machine Learning - DATA\u00b3 Central Europe\",\"isPartOf\":{\"@id\":\"https:\/\/data-cubed.eu\/#website\"},\"datePublished\":\"2023-08-30T15:21:58+00:00\",\"dateModified\":\"2023-08-30T15:21:58+00:00\",\"description\":\"Optimise Your Business with Machine Learning. If you could predict something about your business, what would it be?\",\"breadcrumb\":{\"@id\":\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/data-cubed.eu\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Optimise Your Business with Machine Learning: How Random Forest models can help you make predictions\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/data-cubed.eu\/#website\",\"url\":\"https:\/\/data-cubed.eu\/\",\"name\":\"DATA\u00b3 Central Europe\",\"description\":\"Disrupt your market with data\",\"publisher\":{\"@id\":\"https:\/\/data-cubed.eu\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/data-cubed.eu\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/data-cubed.eu\/#organization\",\"name\":\"DATA\u00b3 Central Europe\",\"url\":\"https:\/\/data-cubed.eu\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/data-cubed.eu\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/data-cubed.eu\/wp-content\/uploads\/logo-main.svg\",\"contentUrl\":\"https:\/\/data-cubed.eu\/wp-content\/uploads\/logo-main.svg\",\"width\":346,\"height\":100,\"caption\":\"DATA\u00b3 Central Europe\"},\"image\":{\"@id\":\"https:\/\/data-cubed.eu\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/data-cubed.eu\/#\/schema\/person\/81d0c18d92f2845d64d1a83fdae520b3\",\"name\":\"James\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/data-cubed.eu\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/41863914d04228004a24edb0e51037ae?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/41863914d04228004a24edb0e51037ae?s=96&d=mm&r=g\",\"caption\":\"James\"},\"sameAs\":[\"https:\/\/data-cubed.eu\"],\"url\":\"https:\/\/data-cubed.eu\/author\/james\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Optimise Your Business with Machine Learning - DATA\u00b3 Central Europe","description":"Optimise Your Business with Machine Learning. If you could predict something about your business, what would it be?","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/","og_locale":"en_GB","og_type":"article","og_title":"Optimise Your Business with Machine Learning - DATA\u00b3 Central Europe","og_description":"Optimise Your Business with Machine Learning. 
If you could predict something about your business, what would it be?","og_url":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/","og_site_name":"DATA\u00b3 Central Europe","article_published_time":"2023-08-30T15:21:58+00:00","author":"James","twitter_card":"summary_large_image","twitter_misc":{"Written by":"James","Estimated reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/#article","isPartOf":{"@id":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/"},"author":{"name":"James","@id":"https:\/\/data-cubed.eu\/#\/schema\/person\/81d0c18d92f2845d64d1a83fdae520b3"},"headline":"Optimise Your Business with Machine Learning: How Random Forest models can help you make predictions","datePublished":"2023-08-30T15:21:58+00:00","dateModified":"2023-08-30T15:21:58+00:00","mainEntityOfPage":{"@id":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/"},"wordCount":982,"commentCount":0,"publisher":{"@id":"https:\/\/data-cubed.eu\/#organization"},"keywords":["Business Optimisation","Machine Learning","Predictive Analytics","Random Forest"],"articleSection":["Blog"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/","url":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/","name":"Optimise Your Business with Machine Learning - DATA\u00b3 Central Europe","isPartOf":{"@id":"https:\/\/data-cubed.eu\/#website"},"datePublished":"2023-08-30T15:21:58+00:00","dateModified":"2023-08-30T15:21:58+00:00","description":"Optimise Your Business with Machine Learning. 
If you could predict something about your business, what would it be?","breadcrumb":{"@id":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/data-cubed.eu\/blog\/optimise-your-business-with-machine-learning-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/data-cubed.eu\/"},{"@type":"ListItem","position":2,"name":"Optimise Your Business with Machine Learning: How Random Forest models can help you make predictions"}]},{"@type":"WebSite","@id":"https:\/\/data-cubed.eu\/#website","url":"https:\/\/data-cubed.eu\/","name":"DATA\u00b3 Central Europe","description":"Disrupt your market with data","publisher":{"@id":"https:\/\/data-cubed.eu\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/data-cubed.eu\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-GB"},{"@type":"Organization","@id":"https:\/\/data-cubed.eu\/#organization","name":"DATA\u00b3 Central Europe","url":"https:\/\/data-cubed.eu\/","logo":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/data-cubed.eu\/#\/schema\/logo\/image\/","url":"https:\/\/data-cubed.eu\/wp-content\/uploads\/logo-main.svg","contentUrl":"https:\/\/data-cubed.eu\/wp-content\/uploads\/logo-main.svg","width":346,"height":100,"caption":"DATA\u00b3 Central 
Europe"},"image":{"@id":"https:\/\/data-cubed.eu\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/data-cubed.eu\/#\/schema\/person\/81d0c18d92f2845d64d1a83fdae520b3","name":"James","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/data-cubed.eu\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/41863914d04228004a24edb0e51037ae?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/41863914d04228004a24edb0e51037ae?s=96&d=mm&r=g","caption":"James"},"sameAs":["https:\/\/data-cubed.eu"],"url":"https:\/\/data-cubed.eu\/author\/james\/"}]}},"_links":{"self":[{"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/posts\/383"}],"collection":[{"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/comments?post=383"}],"version-history":[{"count":0,"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/posts\/383\/revisions"}],"wp:attachment":[{"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/media?parent=383"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/categories?post=383"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/data-cubed.eu\/wp-json\/wp\/v2\/tags?post=383"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}