Computer Tools
We can compute the statistical quantities we have seen before using a variety of computer tools.
Spreadsheets
Excel and Google Sheets usually have identical functions and syntax
Python
Similar functions are also available in the Pandas python library.
R
Spreadsheet Histograms
Plotting a histogram in Google Sheets
Spreadsheet Normal Distribution
We can plot a normal distribution in Google Sheets using the NORMDIST
function.
For example, we can create a column of values of the data for the x-axis, a cell for the mean, and a cell for the standard deviation. We can then create a column of values of the distribution.
=NORMDIST(x-value, mean, standard devation, FALSE)
Note, the FALSE
is so that you get the distribution
function instead of the cumulative distribution function.
Sending Data to the Function
Most of the functions above require many numbers to calculate the answer so we need to send several numbers to the spreadsheet.
In many computer programs (Excel, Google Sheets, Python, R) you have a function name followed by parentheses and the numbers sent to the function are in the parentheses.
=AVERAGE(range of numbers)
In excel, we specify a range of numbers by the cells where the data
starts and ends. For example A1:A10
is all the numbers
stored between cells A1
and A10
. To select an
entire column, we use the idiom A:A
.
To calculate the average of these numbers, we type
=AVERAGE(A1:A10)
Selecting data from another sheet in the spreadsheet
Often our data is in another sheet in the spreadsheet. Here is a function to calculate the average of that data.
=AVERAGE(datasheet!A1:A10)
=
means do a functionAVERAGE
is the computer commanddatasheet
is the name of the sheet with the dataA1:A10
is the chunk of data,