May 2023
to run data analysis, you’re using already “using”
You’ll want to download R first before downloading any additional software
You can download the newest version of R (4.2) here
RStudio is the last software program you’ll need to get started
You can download the newest version of RStudio (Dec 2022) here
You can download all the materials for Day 1 of this workshop here
You want the correlation.qmd, data_clean.qmd, ttest.qmd, and regression.qmd files
You only have to install a package once (one exception)
You must load a library every time you open an R file (.R, .qmd, .rmd, etc.) or restart R/RStudio
Important
A new install of R will remove all installed packages. You must either re-install the packages or save them prior to a new R installation. I’ll cover how to save them at a later date
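As a rough sketch of the install-once, load-every-session pattern: the package names below are only examples from this workshop, and the `install.packages()` and `library()` calls are commented out because they assume an internet connection and an installed package.

```r
# Packages used as examples here; swap in whatever your script needs.
pkgs <- c("dplyr", "ggplot2")

# requireNamespace() returns TRUE when a package is already installed.
installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
not_installed <- pkgs[!installed]
print(not_installed)

# Install once per R installation (uncomment to run; needs internet access).
# install.packages(not_installed)

# Load at the top of every script or session before using the package.
# library(dplyr)
```

Checking first means the install step only runs for packages that are actually missing, which is handy right after a fresh R installation.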
R works best with csv files (smaller in size) but it will take .sav files (SPSS) and other file formats as well (e.g., .tsv)
Oh and obviously it can read Microsoft Excel files
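A minimal sketch of reading data, assuming nothing beyond base R for the CSV part. The tiny file is created on the fly so the example runs anywhere. The SPSS and Excel lines are commented out because they assume the `haven` and `readxl` packages are installed, and the file names in them are hypothetical.

```r
# Create a tiny CSV in a temporary location so the example is self-contained.
csv_path <- tempfile(fileext = ".csv")
writeLines(c("id,score", "1,4.5", "2,3.8"), csv_path)

# Base R reads CSV files without any extra packages.
scores <- read.csv(csv_path)
str(scores)

# Tab-separated files work the same way with read.delim().
# Other formats need add-on packages (assumed installed; paths are hypothetical):
# spss_data  <- haven::read_sav("my_data.sav")
# excel_data <- readxl::read_excel("my_data.xlsx")
```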
TRUE or FALSE value (e.g., Is X > 1?)
dplyr package
Note
For anyone who might work with database data, you can pull data from external databases with R and RStudio. We won’t cover that in this workshop but you can do it
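To make the TRUE/FALSE variable type mentioned before the note concrete, here is a small base R sketch of the "Is X > 1?" question. The vector `x` is made up for illustration.

```r
# A made-up numeric vector.
x <- c(0.5, 1, 2, 3)

# Comparing each value to 1 returns one TRUE/FALSE per element.
is_above_one <- x > 1
print(is_above_one)
print(class(is_above_one))

# TRUE counts as 1 and FALSE as 0, so sum() counts the values above 1.
print(sum(is_above_one))

# A logical vector can also be used to keep only the matching values.
print(x[is_above_one])
```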
stringr package
dplyr package for string variable types
lubridate package
Tip
It won’t auto convert your dates to weird incorrect formats like certain spreadsheet programs might.
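A minimal sketch of date handling, using base R's `as.Date()` so it runs without extra packages. `lubridate::ymd()` parses strings in this year-month-day form as well, if that package is installed. The date strings are invented.

```r
# Dates often arrive as plain text.
raw_dates <- c("2023-05-06", "2023-05-07")

# Convert text to real Date values.
# With lubridate installed, lubridate::ymd(raw_dates) does the same job.
dates <- as.Date(raw_dates)
print(class(dates))

# Dates stay dates: you can do arithmetic on them.
days_between <- as.numeric(dates[2] - dates[1])
print(days_between)

# Nothing is reformatted unless you ask for it explicitly.
print(format(dates, "%d/%m/%Y"))
```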
ggplot2 package
Sometimes you want or need to export data you’ve cleaned to another program. Maybe you want to use a program like JASP
Or you’re not comfortable using R for analyses yet so you want to use SPSS
Or maybe others on your team use a different program
R can export to Excel, SPSS, SAS, and CSV files
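A minimal export sketch. The CSV part uses only base R and writes to a temporary folder. The SPSS, SAS, and Excel lines are commented out because they assume the `haven` and `writexl` packages are installed. The data frame and file names are made up for illustration.

```r
# A small stand-in for data you have already cleaned.
cleaned <- data.frame(id = 1:3, score = c(4.5, 3.8, 5.0))

# CSV export with base R; row.names = FALSE avoids an extra index column.
out_path <- file.path(tempdir(), "cleaned_data.csv")
write.csv(cleaned, out_path, row.names = FALSE)
print(file.exists(out_path))

# Other formats (packages assumed installed; file names are hypothetical):
# haven::write_sav(cleaned, "cleaned_data.sav")      # SPSS
# haven::write_xpt(cleaned, "cleaned_data.xpt")      # SAS transport file
# writexl::write_xlsx(cleaned, "cleaned_data.xlsx")  # Excel
```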
Important
While important to check, outliers and influential cases rarely change results when the sample size is sufficient. It is also difficult to say what “is” and “isn’t” an outlier. Finding an outlier shouldn’t always mean removing it
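One way to act on "flag, don't automatically remove" is sketched below. It uses the 1.5 × IQR rule, which is just one common convention and not necessarily the approach used in the workshop files. The data are invented.

```r
# Made-up scores with one unusually large value.
x <- c(10, 12, 11, 13, 12, 11, 40)

# Fences at 1.5 * IQR beyond the first and third quartiles.
q <- quantile(x, c(0.25, 0.75))
iqr <- q[2] - q[1]
lower <- q[1] - 1.5 * iqr
upper <- q[2] + 1.5 * iqr

# Flag values outside the fences instead of deleting them.
flagged <- x < lower | x > upper
print(x[flagged])

# Compare a result with and without the flagged cases before deciding anything.
print(mean(x))
print(mean(x[!flagged]))
```

Running the analysis both ways shows whether the flagged case actually matters for the conclusion, which is usually a better basis for a decision than the flag itself.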
You can download all the materials for Day 2 of this workshop here
You want the anova.qmd, nonparametric.qmd, intro_qarto.qmd, mlm.qmd, sem.qmd, and factor_analysis.qmd files
Important
First, thank you for your time this weekend. I hope you’ve learned something
Second, this is A LOT. I crammed about 2 years of statistical analysis time and practice into like 2 days. It’s okay and normal if you feel like you’re swimming. I’m here if anyone has questions afterward, including when you’re using R for an actual analysis. People ask me for help all the time, and I’m happy to help