Data Visualization Manual
Preface
1
Project Workflow and Style Guide
1.1
Directory Setup
1.2
Files
1.2.1
R scripts
1.2.2
Rmarkdown
1.3
Code Guidelines
1.3.1
Comments
1.3.2
Packages
1.3.3
Tidyverse
1.3.4
Naming Objects
1.3.5
Spacing and Indentation
1.4
Output
1.4.1
Graphics
1.5
Citing/Documenting Data
1.5.1
Sample Citation
2
STAT 301-2 Final Project
2.1
Milestones
2.2
Submission
3
Lab: Linear Regression
3.1
Libraries
3.2
Simple Linear Regression
3.2.1
Plots for Assessing Linear Models
3.3
Fitting Many Models
3.3.1
Assessing Many Models
3.3.2
Examining One or Fewer Models
3.4
Qualitative Predictors
3.5
Modified Workflow
3.5.1
Exercise 15 (Section 3.7 - pg 126)
4
Lab: Logistic Regression, LDA, QDA, and KNN
4.1
Data Setup
4.2
Logistic Regression
4.3
Linear Discriminant Analysis
4.4
Quadratic Discriminant Analysis
4.5
K-Nearest Neighbors
5
Lab: Cross-Validation and the Bootstrap
5.1
Validation Set Approach
5.2
Leave-One-Out Cross-Validation
5.3
\(k\)-fold Cross-Validation
5.4
The Bootstrap
5.4.1
Estimating the Accuracy of a Statistic of Interest
5.4.2
Estimating the Accuracy of a Linear Regression Model
6
Glossary of Terms
7
Helpful References
Published by Northwestern University Libraries
Data Visualization
2019
Preface