stats_vizualization/

Version: v0.0.0-...-8a36715
Published: Dec 12, 2016 | License: Apache-2.0

README

Visualizing Distributions

Summary statistics are useful and should be used to describe data, but visualizing a distribution often gives faster intuition about how to proceed in an analysis. Histograms and box plots are two tools that show how the values of a variable are distributed.

Example Histogram

[Image: example histogram]

Example Box Plot

[Image: example box plot]

Notes

  • A histogram is an estimate of the probability distribution of a continuous variable. It is created by "binning" the data: the range of the variable is divided into a chosen number of intervals (bins), and the values that fall into each bin are counted.
  • A box plot also displays the distribution of data, in this case based on a five-number summary: minimum, first quartile, median, third quartile, and maximum.

Stat Trek
Khan Academy - Statistics
Bayesian Statistics
Elements of Statistical Learning

Code Review

github.com/gonum/plot docs
Histogram of a normal distribution
Histograms with real data
Box Plots of various distributions
Box Plots with real data

Exercises

Exercise 1

Part A: Create a box plot of the values in the third column (bmi) of diabetes.csv.

Template | Answer

Part B: By examining the box plot, draw (on paper) what you think a histogram of the bmi values would look like.

Exercise 2

Create a histogram of the values in the third column (bmi) of diabetes.csv. Compare the histogram to the drawing you made in Part B of Exercise 1.

Template | Answer


All material is licensed under the Apache License Version 2.0, January 2004.

Directories

Path Synopsis
Sample program to generate a histogram of a normal distribution.
Sample program to generate a histogram of the iris data variables.
Sample program to generate a box plot of example distributions.
Sample program to generate box plots of the iris data variables.
exercises
  exercise1: Sample program to generate a box plot of diabetes bmi values.
  exercise2: Sample program to generate a histogram of diabetes bmi values.
  template1: Sample program to generate a box plot of diabetes bmi values.
  template2: Sample program to generate a histogram of diabetes bmi values.
