SISMID 2026 · Pre-session materials 2
Data Science in Infectious Disease Modeling using R 2
Please complete all materials below before your first synchronous session.

Review the materials on this page before your live lab sessions on June 22 and June 23. Work through the videos, readings, and problem sets at your own pace.

Pre-Course Work 2: Advanced Data Wrangling
Welcome to Advanced Data Wrangling!
Please take the pre-assessment poll before watching each video. Watch the videos in order, from Module 2.1 through Module 2.10.
Post-assessment/coding activity for each module: DSR_adv_data_wrangling_post_assessment_coding_exercises.pdf

Module 2.1 — Overview: Tidyverse, dplyr, and pipes
Video
Overview: Tidyverse, dplyr, and pipes

Module 2.2 — Tidyverse cheat sheets: a primer
Video
Tidyverse cheat sheets: a primer

Slides
Module 2.2 slides

Module 2.3 — Wide vs. long data
Video
Wide vs. long data

Slides
Module 2.3 slides

Module 2.4 — Reshaping data with pivots
Video
Reshaping data with pivots

Slides
Module 2.4 slides

Module 2.5 — Advanced data reshaping
Video
Advanced data reshaping

Slides
Module 2.5 slides

Module 2.6 — Combining data with joins
Video
Combining data with joins

Slides
Module 2.6 slides

Module 2.7 — Complex data joining
Video
Complex data joining

Slides
Module 2.7 slides

Module 2.8 — Strings
Video
Strings

Slides
Module 2.8 slides

Module 2.9 — Categorical data
Video
Categorical data

Slides
Module 2.9 slides

Module 2.10 — Regular expressions
Video
Regular expressions

Slides
Module 2.10 slides



Pre-Course Work 3: Special Considerations for Public Health Data

Lab 3.1 — Loading Data From an API
Video
Getting web datasets fast via APIs: Application Programming Interface

Slides
API slides

Post-Video Exercises
  1. Request your U.S. Census API key at api.census.gov/data/key_signup.html. Use an email address you will be able to access during the synchronous sessions.
Additional API resources:
R packages:

Lab 3.2 — Working with PII Data
Video
Personally Identifiable Information

Slides
PII slides

Additional PII resources
R packages:



Pre-Course Work 4: Advanced Methods

Lab 4.1 — Troubleshooting Functions Using Conditions
Video
Troubleshooting with Conditions

Slides
Troubleshooting Conditions slides

Lab 4.2 — Limits of R
Video
R can’t be used for everything

Slides
Limits of R slides

Lab 4.3 — Parallel Programming
Video
Parallel Computing: What is it and when to use it

Lab 4.4 — Does It Function?
Video
Functions: When and Why

Slides
Does It Function? slides

You’ve completed the asynchronous portion of the course!
We want your feedback on this course! Please share your thoughts.

© 2026 Emory University · Rollins School of Public Health