
Abstract: R and Python are the primary tools of today’s data science workflows. R provides powerful statistics and quick visualizations. Python offers an intuitive syntax, abundant support, and is the choice interface for today’s major AI frameworks.
When building a data science team it is important to document and record work flows in both R and python. A data science department can use installable git controlled packages to form a foundation of code and best practices. In addition, libraries dramatically cut down the time and effort required for a team to bring work to production. Additionally package development is the first step for engineers and scientists aiming to contribute their unique ideas back to the community.
This workshop aims to teach the basics of package development in both R and python in 90 minutes. We will touch upon why a data science team should strive to be fully fluent in both languages. We will show simple R package development and simple Python package development. Finally, we will demonstrate how one can use an open source package to test the interface similarity between R and Python packages designed to support identical workflows.
● Text editor, command line terminal;
● Possibly RStudio/jupyter lab;
https://github.com/tedbakanas/starterkits
https://drive.google.com/open?id=1zdNO4bYIP2831ARS_SflHqAkgxkev1GN
Bio: Theodore is working on the Data Science team at Uptake Technologies. He utilizes machine learning and predictive analytics to help clients optimize their usage of industrial machinery. Prior to Uptake, Theodore worked within PepsiCo's Data and Analytics organization. While contributing to enterprise wide projects at PepsiCo, Theodore also returned to school part time. He graduated from the Masters of Science in Analytics at the University of Chicago in December 2016. He has a strong interest in data analysis and predictive analytics.