JCE Online Journal of Chemical EducationDivision of Chemical Education, American Chemical SocietyAmerican Chemical Society
 | Subscriptions  | Software Orders  | Support  | Contributors  | Advertisers  | 

JCE Print

JCE Digital Library

JCE Software

Only@JCE Online

About JCE



 Home > JCEDlib> JCE WebWare > Collections > Reviewed >
JCE WebWare
A Method of Visual Interactive Regression

Michelle S. Kim
Management Science and Engineering, Stanford University, Palo Alto, CA 94309

Maureen Burkart and Myung-Hoon Kim
Department of Science, Georgia Perimeter College–Dunwoody Campus, Dunwoody, GA 30338-4497

Overview
It is often necessary in general chemistry courses for students to find the best-fitting line through a set of scattered data. The method of least squares is routinely used as the statistical tool to accomplish this. However, the process behind the least-squares method, minimizing the sum of the deviations squared, is not clear to beginning students. Blind use of the analytical formulas for a slope and a y-intercept is not pedagogically effective, and it does not satisfy analytically minded students. With this in mind, we have made the process of minimizing the sum visible by allowing the individual to adjust heights in a bar graph, thus making the process more interactive and dynamic. The interactive feature of Excel spreadsheet programs (1) is utilized; use of the spinner bar (2) is particularly helpful.

Figure 1. Visual Interactive Regression. By clicking on the two spinner bars at the bottom of the screen display, a student can minimize the sum of the squares of the deviations, which are shown in the bar graphs at the lower right.

The visualization process requires the following steps. Students prepare a table of X–Y data in an Excel worksheet and introduce another column to generate theoretical Y values (Ycalc) with estimated arbitrary values of slope and intercept. The deviations squared and their sum are calculated in an additional column. The experimental Y and theoretical Y values are plotted against X in an Excel graph. Another graph in a separate chart is prepared to represent each of the deviations squared and their sum (SSQ, the last bar) in bar graph form. Finally, two spinner bars (2) are prepared in order to control the slope and intercept for Ycalc. At this point, students may click on the spinner bar for slope until the height for the sum of the squares (the last bar) no longer decreases. They do the same using the intercept spinner. Then, they alternate back and forth between the slope spinner and intercept spinner to repeat the steps until SSQ is minimized.

This process was developed using data and results from Beer–Lambert law experiments. Sample calculations and graphs are summarized in Figure 1. The first two columns at the top of the figure are for the original X–Y data. The next three columns are for the calculated Y values (with arbitrary slope and intercept), deviations between the Y values, and deviations squared. The last three columns are for the calculated Y values, deviations, and deviations squared after minimization is complete. The last (and highest) bar in each bar graph represents the sum of deviations squared (SSQ); its value is also displayed under the bar graphs. The slope and intercept are incremented and decremented by 0.01 and 0.001, respectively. Since the spinner control for a cell value requires an integer increment, the base value has to be scaled with scaling factors of 0.01 and 0.001 for a fractional increment. In order to allow negative values in the intercept, 50 must be subtracted from the base value for the intercept. Ten trials were made with various starting values of slope and intercept, and the results are either equal to or very close to those calculated from the analytical formulas, 3.99 for slope and 0.014 for intercept.

It should be noted that the results can be slightly different depending upon whether the slope or intercept is minimized first and upon the whims of clicking order on the spinner during the minimization. In seven out of ten trials, the method was able to find the global minimum. Only three trials yielded a local minimum that is close to the global minimum. Overall, the method was successful in yielding identical results to the analytical formulas for the slope and the intercept.

Literature Cited

  1. Coleman, W. F. Home Page (accessed Oct 2006); see Course and Other Web pages for links to a statistics site and interactive spreadsheets.
  2. Coleman, W. F. Building Interactive Spreadsheets Page (accessed Oct 2006) for the usage of a Spinner Bar.
Application
* A Method of Visual Interactive Regression
Documentation
* JCE2006p1884_2WW.doc  [DOC]
Viewing Requirements
* Microsoft Excel, with Analysis TookPak installed
More Information
* Keywords Computer-Based Learning; Demonstrations; First-Year Undergraduate / General; General Public; Hands-On Learning / Manipulatives; Kinetics; Laboratory Computing / Interfacing; Quantitative Analysis; Spectroscopy; Student-Centered Learning; UV-Vis Spectroscopy
* Abstract Kim, Michelle S.; Burkart, Maureen; Kim, Myung-Hoon. J. Chem. Educ. 2006 83 1884.
* JCEDLib  Metadata full record
* History Published November 2006.
 Home > JCEDlib> JCE WebWare > Collections > Reviewed > The "Reaction Rolodex": A Web-Based System for Learning Reactions in Organic Chemistry


JCE Digital Library
JCE DLib