JCE Online Journal of Chemical EducationDivision of Chemical Education, American Chemical SocietyAmerican Chemical Society
 | Subscriptions  | Software Orders  | Support  | Contributors  | Advertisers  | 

JCE Print

JCE Digital Library

JCE Software

Only@JCE Online

About JCE



 Home > JCEDlib> JCE WebWare > Collections > Reviewed >
JCE WebWare
Population versus Sampling Statistics

Ken Overway
Department of Chemistry, Bridgewater College, Bridgewater, VA 22812

Overview
When scientists draw random samples to be measured they expect their results will be accurate, assuming all systematic errors have been removed from the experiment. Unlike systematic errors, random errors cannot be removed from the experiment—only reduced. If several replicates are measured for each sample, random errors are mathematically minimized and are relegated to affecting precision, not accuracy. For some students, the difference between accuracy and precision is not clear enough for this to make sense. The solution is for students to interact with the statistics, which requires the laborious generation of multiple sets of random numbers, numerical comparisons, and graphical presentations of the data. The purpose of the spreadsheet exercise presented here is to remove the hurdle of constructing and generating such an interaction. The spreadsheet provides students with a self-led exercise that reinforces the statistics of sample and population distributions.

The sample distribution is the link between the measured random samples and the hidden probability that governs the process. For any given homogeneous sample, the true value of the target parameter is measurable, but what the analytical chemist must do is to accurately determine this in the most cost-effective way, i.e. the least number of samples. Further, when experimental results do not agree with the theoretical model or with a quality control sample, chemists need a non-arbitrary way of determining whether the mismatch is an artifact of low precision or if there is indeed a systematic difference between the two samples, methods, or instruments. This is done through hypothesis testing, which involves the sample standard deviation, a students t value, and the number of replicates, N,  measured according to the equation

eq1

wherexbaris the sample mean, μ is the expected value, and s is a random variable.

This exercise allows students to set up a numerical example, where the parameters of the population distribution are defined, and measure the proximity of the sample distribution to the population via a statistical analysis. Since no systematic error is present by design, the two distributions come from the same source but often do not appear as such. By directly interacting with the data numerically and visually through graphs, students can see how the number of replicates profoundly shapes the conclusion of the experiment.

Figure 1

Figure 1. Screen shot for the main page of the spreadsheet.

Application
* Spreadsheet
Documentation
* Documentation [DOC]
Viewing Requirements
* Microsoft Excel
More Information
* Keywords Analytical Chemistry; Computer-Based Learning; Mathematics / Symbolic Mathematics; Quantitative Analysis; Upper-Division Undergraduate
* Abstract Overway, Ken. J. Chem. Educ. 2008, 85, 749.
* JCEDLib   
* History Published May 2008
 Home > JCEDlib> JCE WebWare > Collections > Reviewed > Population versus Sampling Statistics


JCE Digital Library
JCE DLib