302: How to Engineer Features
In the previous video we discussed what bad data is. In this video we discuss how we can alter the data to improve it. Techniques range from rescaling and transforming data to creating new features from scratch.
In the previous video we discussed what bad data is. In this video we discuss how we can alter the data to improve it. Techniques range from rescaling and transforming data to creating new features from scratch.
In this video we introduce the topic of data engineering. Understanding your data is so vitally important. It is the raw material you use to create results. The common phrase 'garbage in, garbage out' (excuse my American!) summarises the ability to win or ruin data science projects and entire products with good or poor data. We will discuss common pitfalls and introduce steps to overcome them.
This video shows an example of using segmentation on categorical data and we'll also find that we have just derived our first important machine learning algorithm: a decision tree.
In this video we discuss segmentation, a very simple way of generating a predictive classification model given some data. We'll see later how this is the basis for a fundamental but very powerful machine learning algorithm.
Now we have a firm understanding of how business problems map to solutions we need to learn the techniques to deliver the solutions. This section introduces the basic terminology and concepts used in data science.
In this video we will talk about the problems encountered in data science. We'll also discover how it fits into a process, which you can used as a plan. Finally, we'll look at the impacts of a Data Science project which will help you avoid any common pitfalls.
This section introduces Data Science. It explains what it is and why we need it. We discuss some of the reasons for doing Data Science and provides famous examples from around the world.
Case studies and industry analysis from our team. No hype, roughly monthly.