Exploratory Data Analysis

About This Course

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data.

What You’ll Learn

Understand analytic graphics and the base plotting system in R

Use advanced graphing systems such as the Lattice system

Make graphical displays of very high dimensional data

Apply cluster analysis techniques to locate patterns in data

                                                 Skills You’ll Gain

Cluster Analysis

Ggplot2

R Programming

Exploratory Data Analysis

What you will learn from this course

Module 1 – Analytical Graphics & Base plotting system

 Installing R on Windows (3.2.1)

 Installing R on a Mac (3.2.1)

 Installing R Studio (Mac)

 Setting Your Working Directory (Windows)

 Setting Your Working Directory (Mac)

 Principles of Analytic Graphics

 Exploratory Graphs (part 1)

 Exploratory Graphs (part 2)

Module 2 – More advanced graphic systems (17 hours to complete)

Lattice Plotting System (part 2)

ggplot2

Module 3 – Statistical methods for exploratory analysis

Hierarchical Clustering

K-Means Clustering

Dimension Reduction

Working with Color in R Plots

Module 4 – Case study

Clustering case study

Air pollution case study