Please note that, due to their popularity, we offer two sections of some courses. For these courses, please pay attention to which instructor you wish to take the class with.
Mon/Tue Block: Introduction to R for Data Science > Specify instructor Duggins or Post
Wed Block: Programming in R > Specify instructor Duggins or Post
Thurs/Fri Block: Intro to Data Mining & Machine Learning > Specify instructor Krishnamurthy or Vatsavai

August 1314, 2018
This course provides a basic introduction to the R software environment for the purpose of data science. The course covers importing and exporting data, manipulating data or recoding variables, and visualization and statistical analysis.
This course provides a basic introduction to the R software environment for the purpose of data science. The course covers importing and exporting data, manipulating data or recoding variables, and visualization and statistical analysis.
Participants will learn how to clean and structure data; see how freely and commonly available software can be used to create effective visualizations; and learn basic design principles, so you can go beyond the defaults and create eyecatching and impactful figures and infographics!
This course is an introduction to programming as a skill, a discipline, and a profession for graduate students. We will dive into handson programming from day one and progress to evaluating and using open source libraries and frameworks to manage large and complex datasets. We will focus equally on reading and writing code.
Generalized Linear Models & Multilevel Modeling
This short course covers advanced statistical modelling and computing using R. We will review the multiple linear regression for continuous data and then proceed to cover the logistic regression for binary/binomial data; Poisson regression and negative binomial regression for counts data; longitudinal data analysis and general multilevel modelling. We also cover some topics in statistical computing on data simulations and bootstrapping.

August 15, 2018
This course explains how to analyze textbased data collected from the internet using R. The course will cover how to retrieve data through Application Programming Interfaces (APIs) such as those provided by Twitter and Project Gutenberg, and methods for doing basic natural language processing on the data.
This class provides students with an introduction to basic programming techniques in R, a program with stronger objectoriented programming facilities than most statistical computing languages. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. R’s popularity has increased substantially in recent years.
This class provides an introduction to basic programming techniques in R. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. R’s popularity has increased substantially in recent years.
This course will provide an introduction to network analysis with a focus on data and applications. It will introduce basic concepts and ideas in network science, and cover methods that are practically useful in dealing with network data. At the completion of this course, participants will have a solid understanding of what network analysis does, and be able to run common methods on network data.
The web has become an important and popular tool for communicating research findings, but it carries a layer of complexity not found in other media: user interactivity. This course will teach participants how to engage their audience in immersive presentations of their data. Participants will learn basic user experience (UX) principles and apply them to interactive dashboards in Tableau. Beginner experience with dashboard creation in Tableau is required.

August 1617, 2018
When working with data, one thing is fairly certain: data is rarely in an optimized format. A misplaced space here, or an extra comma there, can mean the difference between two clicks and two hours of work. In this course, we will work with ways to manipulate, interpret, and present data from web pages and text using Python version 2.7 and OpenRefine. This class will also cover regular expressions, various imported libraries to extend Python functionality, and import/export of data in OpenRefine.
The class provides students with a primer on the use of R for the writing of reproducible reports and presentations that easily embed R output using R markdown as well as the creation of interactive and customizable web applets called R Shiny applications.
This course is designed for two audiences: experienced visualization designers looking to apply open data science techniques to their work, and data science professionals who have limited experience with visualization. Participants will develop skills in visualization design using R, a tool commonly used for data science. Basic familiarity with R is required.
This course will introduce participants to a selection of the techniques used in data mining and machine learning in a handson, applicationoriented way. Topics covered will include data exploration, decision trees, clustering, association rules, regression and pattern classification. The computing exercises will be based on the statistical programming language, R. At the end of the two days, you will be able to explore a data set, and determine which analysis method is appropriate for the data, and be able to use R packages to obtain results.
This course will introduce participants to a selection of the techniques used in data mining and machine learning in a handson, applicationoriented way. Topics covered will include data exploration, decision trees, clustering, association rules, regression and pattern classification. The computing exercises will be based on the statistical programming language, R. At the end of the two days, you will be able to explore a data set, and determine which analysis method is appropriate for the data, and be able to use R packages to obtain results.