Exploratory data analysis

  • What is exploratory data analysis?

    library(tidyverse)
    library(palmerpenguins)

    Exploratory data analysis (EDA) is often the first step to visualizing and transforming your data.[1] Hadley Wickham defines EDA as an iterative cycle:

      1. Generate questions about your data
      2. Search for answers by visualising, transforming, and modeling your data
      3. Use what you learn to refine your questions and/or generate new questions
      4. Rinse and repeat until you publish a paper

    EDA is fundamentally a creative process; it is not an exact science.
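
    One pass through that cycle might look like the sketch below. It assumes the penguins data frame from palmerpenguins, which the excerpt loads; the question asked and the object names (mass_by_species, mass_plot, mass_by_species_sex) are illustrative choices, not taken from the original page.

```r
library(tidyverse)
library(palmerpenguins)

# 1. Generate a question: does body mass differ across penguin species?

# 2. Search for an answer by transforming the data ...
mass_by_species <- penguins %>%
  filter(!is.na(body_mass_g)) %>%          # drop rows with no recorded mass
  group_by(species) %>%
  summarize(
    n = n(),                               # penguins measured per species
    mean_mass_g = mean(body_mass_g),       # average body mass in grams
    .groups = "drop"
  )

# ... and by visualizing it
mass_plot <- ggplot(penguins, aes(x = species, y = body_mass_g)) +
  geom_boxplot(na.rm = TRUE) +
  labs(x = "Species", y = "Body mass (g)")

# 3. Refine the question: does the pattern hold within each sex?
mass_by_species_sex <- penguins %>%
  filter(!is.na(body_mass_g), !is.na(sex)) %>%
  group_by(species, sex) %>%
  summarize(mean_mass_g = mean(body_mass_g), .groups = "drop")

# 4. Repeat with whatever new question the summaries suggest
```

    Printing mass_by_species or mass_plot in the console shows the intermediate answer; each summary tends to suggest the next question rather than settle the first one.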

  • Practice exploring college education (data)

    library(tidyverse)

    Run the code below in your console to download this exercise as a set of R scripts.

    usethis::use_course("cis-ds/exploratory-data-analysis")

    The Department of Education collects annual statistics on colleges and universities in the United States.