
Abstract: In this workshop we cover the basics of modern R. You'll learn how to read data from a CSV using the readr package, manipulate data with dplyr and make compelling visualizations with ggplot2.
Session Outline:
Reading Data from a CSV
Manipulating Data with dplyr
- Selecting columns
- Filtering rows
- Creating new columns
- Modifying columns
- Sorting data
- Computing grouped summary statistics
Plotting with ggplot2
- Scatterplots
- Controlling shapes, sizes and colors of points
- Faceted plots
- Smoothing curves
- Histograms
- Bar charts
Background Knowledge:
Attendees should have R and RStudio installed.
Bio: Jared Lander is the Chief Data Scientist of Lander Analytics, a data science consultancy based in New York City; the Organizer of the New York Open Statistical Programming Meetup and the New York R Conference; and an Adjunct Professor of Statistics at Columbia University. With a master's in statistics from Columbia University and a bachelor's in mathematics from Muhlenberg College, he has experience in both academic research and industry. His work for both large and small organizations ranges from music and fundraising to finance and humanitarian relief efforts. He specializes in data management, multilevel models, machine learning, generalized linear models and statistical computing. He is the author of R for Everyone: Advanced Analytics and Graphics, a book about R programming geared toward data scientists and non-statisticians alike, and is creating a course on glmnet with DataCamp.

Jared Lander
Title: Chief Data Scientist, Author of R for Everyone, Professor | Lander Analytics, Columbia Business School
