Distributions
So far we have been looking at histograms, which contain the measured data for a collection of objects or observations.
Often we can construct a mathematical model of these histograms that allow us to reason and make predictions.
These mathematical models are called distributions.
Here is an ideal uniform distribution.
This is a representation of the random data from that distribution. Note the density of dots is fairly even across the range.
Here are the same data listed out.
Non-uniform distribution
Here is an ideal normal distribution. Notice that the value of the
density takes on a range of values.
This is a representation of the random data from that distribution. Note that the density of the marks representing the data is greater near the middle as the y-axis of the distribution suggests.
Here are the same data listed out. The density in the middle of the range is still apparent.
Uniform Percentile
source.py:155: MatplotlibDeprecationWarning: The seaborn styles shipped by Matplotlib are deprecated since 3.6, as they no longer correspond to the styles shipped by seaborn. However, they will remain available as 'seaborn-v0_8-<style>'. Alternatively, directly use the seaborn API instead.
plt.style.use('seaborn-pastel')
The magnitude of the shaded area is the probability of a value falling between the shaded regions of the x-axis. The value on the x-axis of the right side of the shaded region is the value of the percentile.
Triangular Percentile
source.py:211: MatplotlibDeprecationWarning: The seaborn styles shipped by Matplotlib are deprecated since 3.6, as they no longer correspond to the styles shipped by seaborn. However, they will remain available as 'seaborn-v0_8-<style>'. Alternatively, directly use the seaborn API instead.
plt.style.use('seaborn-pastel')
Normal Distribution Percentile