Feature Engineering on the Modern Data Stack

Abstract: 

Feature engineering is more than simply missing value imputation, handling outlier and categorical variables and scaling numerical variables. It is an opportunity to allow a data scientist's creativity to shine and as Andrew Ng’s stated, “Applied machine learning is basically feature engineering.” In this talk, we will show how to aggregate time series data and calculate moving averages in pandas, directly on the data warehouse using SQL and leveraging Rasgo to calculate and publish those features on Snowflake.

Bio: 

Andrew Engel is the Chief Data Scientist at Rasgo. He has been working as a data scientist and leading teams of data scientists for over ten years in a wide variety of domains from fraud prediction to marketing analytics. Andrew received his Ph.D. in Systems and Industrial Engineering with a focus on optimization and stochastic modeling. He has worked for Towson University, SAS Institute, the US Navy, Websense (now ForcePoint), Stics, HP and led DataRobot's efforts in Entertainment, Sports and Gaming before joining Rasgo in August of 2020.

Open Data Science

 

 

 

Open Data Science
One Broadway
Cambridge, MA 02142
info@odsc.com

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Youtube
Consent to display content from Youtube
Vimeo
Consent to display content from Vimeo
Google Maps
Consent to display content from Google