Probability and Distribution




Analyzing data


Consider these examples


Communicative Probabilities


How is probability used?


What does this mean?



This is a binomial distribution...


Distributions




Demo with coin flips
Demo with dice


How do probabilities combine?




Independent events


Probability of two independent events
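
As a small illustration of the multiplication rule for independent events, P(A and B) = P(A) x P(B), the sketch below enumerates every outcome of two fair dice. The dice and the two events are assumed examples chosen here for illustration, not taken from the slides.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes of rolling two fair dice (an assumed example).
outcomes = list(product(range(1, 7), repeat=2))


def prob(event):
    """Exact probability of an event given as a predicate on (die1, die2)."""
    favorable = sum(1 for o in outcomes if event(o))
    return Fraction(favorable, len(outcomes))


def first_even(o):
    return o[0] % 2 == 0


def second_above_four(o):
    return o[1] > 4


p_a = prob(first_even)
p_b = prob(second_above_four)
p_both = prob(lambda o: first_even(o) and second_above_four(o))

# For independent events the joint probability equals the product.
print(p_a, p_b, p_both, p_a * p_b)
```

The events are independent because each depends on a different die, so the joint probability matches the product exactly.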


The conjunction fallacy


The mean of a random variable
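
The mean of a discrete random variable is the sum of each value times its probability. A minimal sketch, assuming a fair six-sided die as the example (the helper name `expected_value` is hypothetical):

```python
from fractions import Fraction


def expected_value(distribution):
    """Mean of a discrete random variable: sum of value * probability."""
    return sum(value * p for value, p in distribution.items())


# Assumed example: a fair die, each face with probability 1/6.
fair_die = {face: Fraction(1, 6) for face in range(1, 7)}
die_mean = expected_value(fair_die)
print(die_mean)  # 7/2
```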


The law of large numbers
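
One way to see the law of large numbers is to simulate it: the running average of many die rolls should settle near the mean of a single roll, 3.5. The sketch below is an assumed illustration with a fixed seed so the run is repeatable.

```python
import random


def running_mean_of_rolls(n_rolls, seed=0):
    """Running average after each of n_rolls simulated fair-die rolls."""
    rng = random.Random(seed)
    total = 0
    means = []
    for i in range(1, n_rolls + 1):
        total += rng.randint(1, 6)
        means.append(total / i)
    return means


means = running_mean_of_rolls(100_000)
for n in (10, 100, 1_000, 10_000, 100_000):
    # The average after n rolls drifts toward 3.5 as n grows.
    print(n, round(means[n - 1], 3))
```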




Probabilities will be important




Here's a tricky test: the Monty Hall problem (demo)
An Explanation
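
A simulation can back up the explanation. The sketch below assumes the standard rules: three doors, one prize, and a host who always opens a non-prize door other than the player's pick. Under those assumptions, switching should win about two thirds of the time.

```python
import random


def play(switch, rng):
    """Play one game; return True if the player ends up with the prize."""
    doors = [0, 1, 2]
    prize = rng.choice(doors)
    pick = rng.choice(doors)
    # The host opens a door that is neither the player's pick nor the prize.
    opened = rng.choice([d for d in doors if d != pick and d != prize])
    if switch:
        # Move to the one remaining closed door.
        pick = next(d for d in doors if d != pick and d != opened)
    return pick == prize


def win_rate(switch, n_games=20_000, seed=1):
    rng = random.Random(seed)
    return sum(play(switch, rng) for _ in range(n_games)) / n_games


stay_rate = win_rate(switch=False)
switch_rate = win_rate(switch=True)
print(f"stay: {stay_rate:.3f}  switch: {switch_rate:.3f}")
```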


Questions


Distributions and Variability


What do we do with data?


What are the entries in this table?


Types of Variables


Variability


Sorting the data can help


Graphics in data analysis





16 | 12344688899
17 | 1122233333333344455677788899
18 | 01234455
19 |
20 | 1



Stemplot


Qualities of distributions to keep in mind.


Aspects of the stemplot


16 | 12344688899
17 | 1122233333333344455677788899
18 | 01234455
19 |
20 | 1



Splitting the Stems


16 | 12344
16 | 688899
17 | 11222333333333444
17 | 55677788899
18 | 012344
18 | 55
19 |
19 |
20 | 1
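
The stemplots above can be rebuilt programmatically. This is a minimal sketch, assuming each stem is the tens part of a whole number and each leaf is the units digit (the slides do not state the units); `stemplot` is a hypothetical helper name.

```python
from collections import defaultdict

# Values read back from the stemplot, assuming stem = tens and leaf = units.
data = (
    [160 + int(c) for c in "12344688899"]
    + [170 + int(c) for c in "1122233333333344455677788899"]
    + [180 + int(c) for c in "01234455"]
    + [201]
)


def stemplot(values, split=False):
    """Return stemplot rows; with split=True each stem gets leaves 0-4, then 5-9."""
    leaves = defaultdict(list)
    for v in sorted(values):
        leaves[v // 10].append(v % 10)
    lines = []
    # Include empty stems so gaps in the data stay visible.
    for stem in range(min(leaves), max(leaves) + 1):
        row = leaves.get(stem, [])
        groups = [row]
        if split:
            groups = [[d for d in row if d < 5], [d for d in row if d >= 5]]
        for group in groups:
            lines.append(f"{stem} | {''.join(map(str, group))}".rstrip())
    return lines


print("\n".join(stemplot(data)))
print()
print("\n".join(stemplot(data, split=True)))
```

Unlike the hand-drawn version, this sketch always prints two rows per stem when splitting, so the last stem gets an extra empty row.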


What if you have too many data points?


A Histogram



Here's an interactive demo that shows how bin size affects histograms.
Here's a related example.


So, what does this mean?


Are the data well-behaved?


Other Ways of Looking at Data



Other Plots


Time Plot (Practice Effect)



Summary


Central Tendency and Standard Deviation - More on Distributions


Sample Size


Sample size and Distributions


Central Tendency and Variability


Central Tendency


The mean


A problem with the mean


The mean and skew


The Median


The median is resistant to outliers
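
A quick check of this point, using made-up numbers: replacing one value with an extreme outlier moves the mean a long way but leaves the median where it was.

```python
from statistics import mean, median

# Assumed toy data; the second list swaps the largest value for an outlier.
clean = [1, 2, 3, 4, 5]
with_outlier = [1, 2, 3, 4, 500]

print(mean(clean), median(clean))                # 3 3
print(mean(with_outlier), median(with_outlier))  # 102 3
```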


The Mode


Demo on Mean, Median, Mode, and Variability


Variability


Percentiles


The Five Number Summary
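
The five number summary is the minimum, first quartile, median, third quartile, and maximum. Quartile conventions differ between textbooks and software; the sketch below assumes the "median of each half" convention, leaving the middle value out of both halves when the count is odd, which may not match the convention used in the course.

```python
from statistics import median


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max); needs at least two values."""
    xs = sorted(values)
    n = len(xs)
    lower = xs[: n // 2]        # values below the median position
    upper = xs[(n + 1) // 2:]   # values above the median position
    return (xs[0], median(lower), median(xs), median(upper), xs[-1])


summary = five_number_summary([1, 2, 3, 4, 5, 6, 7, 8, 9])
print(summary)  # (1, 2.5, 5, 7.5, 9)
```

These are the same five values a boxplot draws: the box spans Q1 to Q3 with a line at the median.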


The Boxplot


The standard deviation


The standard deviation
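
The sample standard deviation can be computed step by step: find the mean, square each deviation from it, sum them, divide by n - 1, and take the square root. The data below are made up for illustration, and the n - 1 divisor is an assumption about which version the course uses.

```python
import math
import statistics


def sample_sd(values):
    """Sample standard deviation with the n - 1 divisor."""
    n = len(values)
    m = sum(values) / n
    sum_of_squares = sum((x - m) ** 2 for x in values)
    return math.sqrt(sum_of_squares / (n - 1))


scores = [2, 4, 4, 4, 5, 5, 7, 9]  # mean is 5, squared deviations sum to 32
sd = sample_sd(scores)
print(round(sd, 3), round(statistics.stdev(scores), 3))
```

The standard library's `statistics.stdev` uses the same n - 1 divisor, so the two printed values should agree.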


The effects of linear transformations
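
For a linear transformation y = a + b x, the mean becomes a + b times the old mean, and the standard deviation is multiplied by |b| (adding a does not change the spread). A small check, assuming a Celsius to Fahrenheit conversion as the example:

```python
from statistics import mean, stdev

# Assumed example: temperatures in Celsius converted to Fahrenheit.
celsius = [10.0, 15.0, 20.0, 25.0, 30.0]
a, b = 32.0, 1.8
fahrenheit = [a + b * c for c in celsius]

print(mean(fahrenheit), a + b * mean(celsius))    # mean shifts and scales
print(stdev(fahrenheit), abs(b) * stdev(celsius))  # sd only scales
```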


Summary


Questions


The Normal Distribution and Sampling Distributions


Mathematical Distributions


A Histogram with a Model



Not as good a fit:



Why a Model?


Using the Normal Distribution


Different Normal Distributions




Demo on mean and standard deviation of the normal distribution


Area under the curve
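
Areas under the normal curve correspond to probabilities. As a sketch, the standard library's `statistics.NormalDist` (Python 3.8 or later) can compute the area within k standard deviations of the mean, which gives the familiar 68-95-99.7 pattern. The helper name `area_within` is hypothetical.

```python
from statistics import NormalDist

standard_normal = NormalDist()  # mean 0, sd 1


def area_within(k):
    """Area under the standard normal curve between -k and +k."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


for k in (1, 2, 3):
    print(k, round(area_within(k), 4))
```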


Demo on area under the normal curve
Another Demo on area under the normal curve


The standard normal distribution


mean=0, sd=1



What is a z-score?
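
A z-score says how many standard deviations a value lies from the mean: z = (x - mean) / sd. The sketch below uses assumed numbers (a score of 130 on a scale with mean 100 and sd 15) and looks up the corresponding area with `statistics.NormalDist`.

```python
from statistics import NormalDist


def z_score(x, mean, sd):
    """Number of standard deviations that x lies above the mean."""
    return (x - mean) / sd


z = z_score(130, 100, 15)
p_below = NormalDist().cdf(z)  # proportion of the distribution below x

print(z)                   # 2.0
print(round(p_below, 4))
```

Standardizing first and using the standard normal gives the same area as working with the original normal distribution directly.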


Practice with z-scores


Demo on z scores and probability


Sampling Distributions


Lots of Means


Increasing Sample Size
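
To see what increasing the sample size does, one can draw many samples, take the mean of each, and look at how much those means vary. The simulation below is an assumed illustration using fair-die rolls; the spread of the sample means should shrink roughly like sd / sqrt(n), where the sd of a single roll is about 1.708.

```python
import random
from statistics import stdev


def sample_means(sample_size, n_samples=2_000, seed=2):
    """Means of n_samples simulated samples of fair-die rolls."""
    rng = random.Random(seed)
    return [
        sum(rng.randint(1, 6) for _ in range(sample_size)) / sample_size
        for _ in range(n_samples)
    ]


spread_4 = stdev(sample_means(4))
spread_100 = stdev(sample_means(100))

# Larger samples give sample means that cluster more tightly around 3.5.
print(round(spread_4, 3), round(spread_100, 3))
```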


Demo on Sampling Distributions and Variance


How big a sample?


Where is this train taking me?