Course Structure and Topics
This course combines lectures on theory and concepts with significant time practicing statistical tools within the R environment. The course concludes with a week-long project session where participants work independently to conduct a full analysis of a provided dataset and present their results.
Participants should expect to spend between 8-10 hours per week working through the course material and assignments. Each week will include:
- 2-4 hours of recorded presentations (slides with audio) introducing new theory and concepts
- One or more well-commented demonstration code scripts teaching new analytical tools
- At least one analysis assignment, where participants will adapt the demonstration code to complete a novel analysis and answer a series of questions regarding the results
- An optional live Q & A session with the instructor (Wednesday 9:00-10:15 am EST)
- An optional live review session to walk through assignment solutions (Monday 10:30-11:45 am EST)
- Two opportunities for “virtual office hours” with instructor where time blocks can be reserved by individual participants (Tuesday 10:00-11:00 am EST; Thursday 5:00-6:00 pm EST)
This course covers:
- Introduction to probability theory, random variables, and statistical distributions
- Linear models (univariate, multivariate)
- Generalized linear models (Poisson, quasi-Poisson, negative binomial, and binomial)
- Linear mixed models and generalized linear mixed models
- Testing for model assumptions and model fit
- Proper use of data transformations to improve model performance and fulfill assumptions
- Evaluating model performance with diagnostic tools
- Preparing quality graphics and interpreting results
- Dealing with common problems in data including missing data and collinearity
- Tips and tricks of programming in R
- Organizing analyses into data exploration, descriptive statistics, model application, model diagnostics, and discussion of results.
More advanced topics may be covered depending on available time and feedback of participants.
Software and Programs
All exercises and some lectures will be performed in R, through user-friendly interface RStudio. Pre-course work will be suggested for all participants not familiar with the program R and emailed to participants at least 3 weeks prior to the course. All participants should have a basic familiarity working in the R environment by the time the course begins.