R for Epidemiology
Welcome
- Acknowledgements
Introduction
About the Authors
- Brad Cannell
- Melvin Livingston
I Foundational Epidemiologic Concepts
1 Using R for Epidemiology
2 Populations and Samples
3 Measures of Occurrence
4 Random Error in Measures
5 Creating Contingency Tables in R
6 Measures of Association
7 Time-to-event Analysis
8 Stratification
9 Standardization
10 Selection Bias
- 10.1 Direction of bias
- 10.2 Summary
11 Systematic Error in Measures
12 Effect-measure Modification
13 Missing Data
II Introduction to Regression Analysis
14 Introduction to Regression Analysis
- 14.1 Generalize linear models
  - 14.1.1 The glm function
- 14.2 Regression intuition
15 Linear Regression
16 Linear Regression
17 Poisson Regression
18 Cox Proportional Hazards Regression
19 Multilevel Models
20 Generalized Estimating Equations
III Predictive Analysis
21 Introduction to Predictive Analysis
IV Introduction to Causal Inference
22 Introduction to Causal Inference
23 Sufficient and Component Cause Diagrams
- 23.1 Summary
24 Introduction to Directed Acyclic Graphs
25 Confounding
26 Deconfounding
27 Mediation
V Study Design
28 Experimental Studies
29 Cohort Studies
30 Case-control Studies
31 Cross-sectional Studies
32 Ecologic Studies
33 Quasi-experimental Studies
34 Meta-analysis
35 Power and Sample Size
VI Getting Started
36 Installing R and RStudio
- 36.1 Download and install on a Mac
- 36.2 Download and install on a PC
37 What is R?
- 37.1 What is data?
- 37.2 What is R?
38 Navigating the RStudio Interface
39 Speaking R’s Language
40 Let’s Get Programming
41 Asking Questions
VII Coding Tools and Best Practices
42 R Scripts
- 42.1 Creating R scripts
43 Quarto Files
44 R Projects
45 Coding Best Practices
46 Using Pipes
VIII Data Transfer
47 Introduction to Data Transfer
48 File Paths
- 48.1 Finding file paths
- 48.2 Relative file paths
49 Importing Plain Text Files
50 Importing Binary Files
51 RStudio’s Data Import Tool
52 Exporting Data
- 52.1 Plain text files
- 52.2 R binary files
IX Descriptive Analysis
53 Introduction to Descriptive Analysis
- 53.1 What is descriptive analysis and why would we do it?
- 53.2 What kind of descriptive analysis should we perform?
54 Numerical Descriptions of Categorical Variables
55 Measures of Central Tendency
56 Measures of Dispersion
- 56.1 Comparing distributions
57 Describing the Relationship Between a Continuous Outcome and a Continuous Predictor
- 57.1 Pearson Correlation Coefficient
  - 57.1.1 Calculating r
  - 57.1.2 Correlation intuition
58 Describing the Relationship Between a Continuous Outcome and a Categorical Predictor
- 58.1 Single predictor and single outcome
- 58.2 Multiple predictors
59 Describing the Relationship Between a Categorical Outcome and a Categorical Predictor
- 59.1 Comparing two variables
X Data Management
60 Introduction to Data Management
- 60.1 Multiple paradigms for data management in R
- 60.2 The dplyr package
61 Creating and Modifying Columns
62 Subsetting Data Frames
63 Working with Dates
64 Working with Character Strings
65 Conditional Operations
66 Working with Multiple Data Frames
- 66.1 Combining data frames vertically: Adding rows
- 66.2 Combining data frames horizontally: Adding columns
  - 66.2.1 Combining data frames horizontally by position
  - 66.2.2 Combining data frames horizontally by key values
67 Restructuring Data frames
XI Repeated Operations
68 Introduction to Repeated Operations
- 68.1 Multiple methods for repeated operations in R
- 68.2 Tidy evaluation
69 Writing Functions
70 Column-wise Operations in dplyr
71 Writing For Loops
72 Using the purrr Package
XII Collaboration
73 Introduction to git and GitHub
74 Using git and GitHub
XIII Presenting Results
75 Creating Tables with R and Microsoft Word
XIV Appendix
Appendix: Alternative table formats
- 75.13 Smaller data frame
- 75.14 Larger data frame
References
Published with bookdown

R for Epidemiology

19 Multilevel Models

This chapter is under heavy development and may still undergo significant changes.