An important class of distributions or density curves in statistics is the normal distribution.
All normal distributions have the same overall shape that is often referred to as "bell shaped".
These shapes are symmetric about the center, which is the location of the mean, median and mode of the data set.
It appears that the precipitation frequency distribution for Reading is "almost normal" since it
is almost symmetric about the center and has a bell-shaped appearance. If students were to display precipitation
distributions for other cities, many of these would also have the same bell-shaped curve. Why is this
so? The answer lies in the fact that precipitation amounts, like many other natural phenomena, are the result
of many random factors. When this is the case, we expect a normal distribution.
Mentioned above was the fact that the center of a normal distribution is the mean.
The mean annual precipitation for the years 1863 to 1999 was 41.5 inches, which looks about right on the Excel
histogram. It's quite obvious that the center (mean) for the Reading distribution would be quite different
than the center for a much drier city such as Salt Lake or Phoenix. Also, a city whose precipitation
from year to year is more consistent or more varied would have a histogram that is narrower or wider, respectfully.
So even though normal distributions have the same overall shape, they may have different centers (means) and have
different widths (deviations from the mean). The width of a normal distribution is quantified by computing
the standard deviation. For the Reading data, the standard deviation is 6.5 inches. (The formula to compute
this number is not given here, but can be found in any basic statistics text. Also, most calculators, spreadsheets
and statistics packages will compute the standard deviation.)
If we denote the mean by m and the standard deviation by s, then any normal distribution can be described by the 68-95-99.7 rule. This rule states
that: 68% of the data will lie within 1s of the mean m, 95%
of the data will lie within 2s of the mean m, and 99.7% of
the data will lie within 3s of the mean m . We can use the
68-95-99.7 rule as a means to check "how normal" the precipitation data is for Reading. This is
a good student exercise. We can also use properties of the normal distribution to answer a question such
as, "How likely is it that the precipitation for Reading will be between 30 and 31 inches in any given year?".
Details are left for statistics courses.
We obtained precipitation data for the lower 48 states from the United States Historical
Climate Network (USHCN), part of NOAA's National Climate Data Center. The USHCN has historical records for 1221
weather station in the lower 48 states, of which 1182 are currently operational. At the USHCN website one
can download historical data sets that have varying amounts of editing. We downloaded the (gigantic) national precipitation
data, selected the Filnet edited data, then divided the national data set into state-by-state sets.
Note: The Filnet edited precipitation data (1) has removed outliers more that 3 standard deviations
from the period mean (2) has been adjusted to remove time of observation bias, the Maximum/Minimum Temperature
System (MMTS) bias, station moves/changes bias and (3) contains estimated values for missing/outlier data.
Additionally, the USHCN provides weather station information (referred to as the Station Inventory information),
with details about station location, history, measuring devices, personnel, reliability, etc. We edited
this file on a state by state basis to include only latitude, longitude, and elevation data for the latest location
of each station. To download state-by-state files, go to DataSet#050.
Those wishing to obtain a single national or unedited file should go to the USHCN website.
Data source: United States Historical Climate Network (USHCN)
http://www.ncdc.noaa.gov/ol/climate/research/ushcn/ushcn.html