A quick introductory tutorial

  • The concepts of exploring a data set visually were pioneered by John Tukey in the 1960s and 1970s. Tukey consolidated his many ideas on data exploration into a book in the late 1970s, John Tukey, Exploratory Data Analysis, 1977, Addison-Westley.
  • Bill Cleveland documented his seminal work in visualization of complex data sets in his book, William S. Cleveland, Visualizing Data, 1993, Hobart Press.
  • Edward Tufte is well know for his different take on data exploration and visualizaion. His first book outlined many of his key ideas: Edward Tufte, Envisioning Information, Graphics Press, 1990. His more famous book first appeared in 1993, and is still widely available: Edward Tufte, Envisioning Information, 2nd Edition, Graphics Press, 2001.
  • The key concept of exploratory data analysis (EDA) or visual exploration of data is to understand the relationships in the data set. Specifically using visualization when you approach a new data set you can:
    • Explore complex data sets, using visualization to develop understanding of the inherent relationships.
    • Use different chart types to create multiple views of data to highlight different aspects of the inherent relationships.
    • Use plot aesthetics to project multiple dimensions.
    • Apply conditioning or faceting methods to project multiple dimensions.

  • Exploratory Data Analysis Tukey
  • EDA:探索性数据分析 可以使用 DATA ANALYSIS WITH R
  • The Visual Display of Quantitative Information
  • Visual Explanations (豆瓣)
  • Beautiful Evidence (豆瓣) Tufte
  • Envisioning Information