Robust Regression: Solving the Challenges Posed by Dirty Data
Robust Regression: Solving the Challenges Posed by Dirty Data

Abstract: 

Data scientists are rarely presented with clean data. Instead, data is often corrupted by measurement error, bugs in the ETL pipeline, poorly chosen defaults, etc. This can wreak havoc on algorithms not designed with robustness in mind. One such instance is the humble least-squares regression, where a single outlier can have an unbounded effect on the resulting line of best fit. In this talk, we will discuss why this is the case and how fairly simple alternatives can greatly improve robustness.

Bio: 

TBD

Open Data Science

 

 

 

Open Data Science
One Broadway
Cambridge, MA 02142
info@odsc.com

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Youtube
Consent to display content from Youtube
Vimeo
Consent to display content from Vimeo
Google Maps
Consent to display content from Google