
Abstract: Scikit-learn is a machine learning library in Python, that has become a valuable tool for many data science practitioners. This training will cover some advanced topics in using scikit-learn, such as how to perform out-of-core learning with scikit-learn and how to speed up parameter search. We'll also cover how to build your own models or feature extraction methods that are compatible with scikit-learn, which is important for feature extraction in many domains. We will see how we can customize scikit-learn even further, using custom methods for cross-validation or model evaluation.
This workshop assumes familiarity with Jupyter notebooks and basics of pandas, matplotlib and numpy. It also assumes experience using scikit-learn and familiarity with the API.
Bio: Thomas Fan is a Software Developer at Columbia University's Data Science Institute. He collaborates with the scikit-learn community to develop features, review code, and resolve issues. On his free time, Thomas contributes to skorch, a scikit-learn compatible neural network library that wraps PyTorch.

Thomas Fan
Title
Software Developer - Machine Learning | Columbia Data Science Institute
