Machine Learning Under the Hood: The Technical Tips, Tricks, and Pitfalls

Introduction

It’s the age of machine learning. Companies are seizing upon the power of this technology to combat risk, boost sales, cut costs, block fraud, streamline manufacturing, conquer spam, toughen crime fighting, and win elections.

This course will prepare you to participate in the deployment of machine learning, whether as an enterprise leader or as a quant. To serve both audiences, it goes further than typical machine learning courses, which cover only the technical foundations and core quantitative techniques. It includes software demos of machine learning in action using SAS products. However, the curriculum is vendor-neutral and universally applicable: the contents and learning objectives apply regardless of which machine learning software tools you end up choosing.

What You’ll Learn

Participate in the application of machine learning, helping select between and evaluate technical approaches
Interpret a predictive model for a manager or executive, explaining how it works and how well it predicts
Circumvent the most common technical pitfalls of machine learning
Screen a predictive model for bias against protected classes – aka AI ethics

What you will learn from this course

Module 1 – The Foundational Underpinnings of Machine Learning

➤ P-hacking: a treacherous pitfall

➤ P-hacking: your predictive insights may be bogus

➤ P-hacking: how to ensure sound discoveries

➤ Avoiding overfitting: the train/test split

➤ Why ice cream is linked to shark attacks

➤ Causation is just a hobby — prediction is your job

➤ The art of induction: why generalizing from data is hard

➤ Learning from mistakes: why negative cases matter

➤ Intro to the hands-on assessment (Excel or Google Sheets)

Module 2 – Standard, Go-To Machine Learning Methods

➤ Business rules rock and decision trees rule

➤ Pruning decision trees to avoid overfitting

➤ DEMO – Comparing decision tree models

➤ Drawing the gains curve for a decision tree

➤ Drawing the profit curve for a decision tree

➤ Naïve Bayes

➤ Linear models and perceptrons

➤ Linear part II: a perceptron in two dimensions

➤ Why probabilities drive better decisions than yes/no outputs

➤ Logistic regression

➤ DEMO – Training a logistic regression model

Module 3 – Advanced Methods, Comparing Methods, & Modeling Software

➤ Neural nets: decision boundaries & a comparison to logistic regression

➤ DEMO – Training a neural network model

➤ Deep learning

➤ Ensemble models and the Netflix Prize

➤ Supercharging prediction: ensembles & the generalization paradox

➤ DEMO – Training an ensemble model

➤ DEMO – Autotuning a machine learning model

➤ Compare and contrast: summary of ML methods

➤ Machine learning software: dos and don’ts for choosing a tool

➤ Machine learning software: how tools vary and how to choose one

➤ Model deployment: out of the software tool and into the field

➤ Uplift modeling I: optimize for influence and persuade by the numbers

➤ Uplift modeling II: modeling over treatment and control groups

➤ Uplift modeling III: how it works – for banks and for Obama

➤ Uplift modeling IV: improving churn modeling, plus other applications

Module 4 – Pitfalls, Bias, and Conclusions

➤ Visualizing why models are inequitable

➤ Justice can’t be colorblind

➤ Explainable ML, model transparency, and the right to explanation

➤ Conclusions on ML ethics: establishing standards as a form of social activism

➤ Pitfalls: the seven deadly sins of machine learning

➤ Conclusions and what’s next – continuing your learning
