Abstract: As customer privacy laws expand, missing data is becoming the norm rather than the exception. Many approaches to handling it are either computationally expensive or throw the baby out with the bathwater, discarding usable data along with the gaps. Some models accept missing data directly, and variants of common models can be adapted to do so. Surprisingly, we can build effective predictive models even when data is missing not at random and missingness rates exceed 80%. This is a hands-on workshop using R or Python.
Types of missingness
Missing data helper functions
Models that accept missingness
Neural networks and missingness
Work on your own with guidance (30-40 minutes)
Discuss your results
Discuss my results
Either R or Python
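As a rough illustration of two of the topics above (helper functions and models that accept missingness), here is a minimal Python sketch using only the standard library. It is not taken from the workshop materials: the function names (is_missing, missing_rate, add_missing_indicator, fit_stump, predict_stump) and the toy data are hypothetical. The stump learns a threshold on one feature and also learns which side of the split the missing values should go to, in the spirit of the default-direction handling used by some gradient-boosting libraries.

```python
import math


def is_missing(value):
    """Treat None and float NaN as missing."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def missing_rate(column):
    """Fraction of entries in a column that are missing."""
    return sum(is_missing(v) for v in column) / len(column)


def add_missing_indicator(column, fill=0.0):
    """Return (filled values, 0/1 indicator) so a model sees both the
    value and the fact that it was absent."""
    indicator = [1 if is_missing(v) else 0 for v in column]
    filled = [fill if is_missing(v) else v for v in column]
    return filled, indicator


def goes_left(value, threshold, missing_left):
    """Route a value: missing values follow the learned default side."""
    if is_missing(value):
        return missing_left
    return value <= threshold


def majority(labels):
    """Majority 0/1 label; ties go to 1, an empty side defaults to 0."""
    return int(sum(labels) * 2 >= len(labels)) if labels else 0


def fit_stump(x, y):
    """Fit a one-split classifier on a single feature with missing values.

    Every observed value is tried as a threshold, with missing rows sent
    left and then right; the combination with the fewest training errors
    wins. Returns (threshold, missing_left, left_label, right_label).
    """
    best = None
    for threshold in sorted({v for v in x if not is_missing(v)}):
        for missing_left in (True, False):
            left, right = [], []
            for xi, yi in zip(x, y):
                side = left if goes_left(xi, threshold, missing_left) else right
                side.append(yi)
            left_label, right_label = majority(left), majority(right)
            errors = (sum(yi != left_label for yi in left)
                      + sum(yi != right_label for yi in right))
            if best is None or errors < best[0]:
                best = (errors, threshold, missing_left, left_label, right_label)
    return best[1:]


def predict_stump(stump, x):
    """Predict 0/1 labels, routing missing values to the learned side."""
    threshold, missing_left, left_label, right_label = stump
    return [left_label if goes_left(v, threshold, missing_left) else right_label
            for v in x]


# Toy data (made up): the feature tends to be missing when the label is 1,
# a simple case of data that is missing not at random.
x_train = [1.0, 2.0, None, None, 8.0, 9.0, None]
y_train = [0, 0, 1, 1, 1, 1, 1]

stump = fit_stump(x_train, y_train)
predictions = predict_stump(stump, [None, 1.5, 10.0])
```

In this toy case the stump can use missingness itself as a signal, because the missing rows are routed to whichever side fits them best rather than being dropped or imputed first.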
Bio: Anne Lifton has ten years of experience in data science and three years in data science management. She has worked across a range of industries, from medical devices to retail to engineering, and specializes in reducing the cycle time for delivering models.