Method: Boxplot

Methods: Boxplot, ANOVA,
Topics: Automotive, Consumer, Energy, Engineering, Environment,
Datafile Name: Auto Pollution Filter Noise

The data are from a statement by Texaco, Inc. to the Air and Water Pollution Subcommittee of the Senate Public Works Committee on June 26, 1973. Mr. John McKinley, President of Texaco, cited the Octel filter, developed by Associated Octel Company, as effective in reducing pollution. However, quest...

Methods: Regression, Scatterplot, Boxplot,
Datafile Name:

American League baseball teams play their games with the designated hitter rule, meaning that pitchers do not bat. The League believes that replacing the pitcher, typically a weak hitter, with another player in the batting order produces more runs and generates more interest among fans. Is there e...

Methods: ANOVA, Boxplot, Transformation,
Topics: Medical,
Datafile Name: Cancer Survival

Patients with advanced cancers of the stomach, bronchus, colon, ovary or breast were treated with ascorbate. The purpose of the study was to determine if patient survival differed with respect to the organ affected by the cancer.

A one-way ANOVA with Organ as the discrete factor and Sur...

Methods: Outlier, Histogram, Mean, Median, Boxplot, Distribution,
Topics: Economics,
Datafile Name: CEO Salaries

Forbes magazine published data on the best small firms in 1993. These were firms with annual sales of more than $5 million and less than $350 million. Firms were ranked by five-year average return on investment. The data extracted are the age and annual salary of the chief executive officer for the fir...

Methods: Two Sample T-Test, Boxplot,
Topics: Medical,
Datafile Name: Cholesterol and Smoking

A study examining the health risks of smoking measured the cholesterol levels of people who had smoked for at least 25 years and people of similar ages who had smoked for no more than 5 years and then stopped.

Methods: Two Sample T-Test, Transformation, Boxplot,
Topics: Environment,
Datafile Name: Clouds

Clouds were randomly seeded or not with silver nitrate. Rainfall amounts were recorded from the clouds. The purpose of the experiment was to determine if cloud seeding increases rainfall.

The rainfall distributions are more nearly symmetric after a log transformation. The log transforma...
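
As a rough illustration of why a log transformation can make a right-skewed variable more nearly symmetric, the sketch below compares a moment-based skewness measure before and after taking logs. The numbers are made up for illustration and are not the Clouds datafile; the helper name `sample_skewness` is likewise an assumption.

```python
import math
import statistics


def sample_skewness(values):
    """Moment-based skewness m3 / m2**1.5 (about 0 for a symmetric sample)."""
    mean = statistics.fmean(values)
    n = len(values)
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Made-up, strongly right-skewed values; NOT the actual rainfall amounts.
rainfall = [1.0, 2.0, 4.0, 8.0, 16.0, 32.0, 64.0, 128.0, 256.0]
log_rainfall = [math.log(v) for v in rainfall]

skew_raw = sample_skewness(rainfall)
skew_log = sample_skewness(log_rainfall)
print(f"skewness before log: {skew_raw:.2f}")
print(f"skewness after log:  {skew_log:.2f}")
```

With the real datafile, the same comparison could be run separately for the seeded and unseeded clouds before applying a two-sample t-test to the logged values.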

Methods: ANOVA, Boxplot, Histogram, Median, Transformation,
Topics: Nutrition,
Datafile Name: Calories

"Let the buyer beware" is a phrase that comes to mind when buying a used car, not when buying food. However, Allison, Heshka, Sepulveda, and Heymsfield (1993) think that this phrase should apply to purchasing "diet" and "health" foods as well. They purchased 40 such ...

Methods: ANOVA, Boxplot,
Topics: Environment, Nature, Biology,
Datafile Name: Cuckoo Egg Lengths

That cuckoo eggs were peculiar to the locality where found was already known in 1892. A study by E.B. Chance in 1940 called The Truth About the Cuckoo demonstrated that cuckoos return year after year to the same territory and lay their eggs in the nests of a particular host species. Further, cuck...

Methods: Outlier, Summary Statistics, Mean, Median, Histogram, Boxplot,
Topics: Miscellaneous,
Datafile Name: Distribution Patterns

This "story" illustrates some of the different distribution patterns that variables may take on. The distribution of Tobin's Q-ratios for firms shows high positive skewness. Sometimes this can be 'remedied' by taking the logarithms of the data. Try this for the Q-ratios. Oth...

Methods: Boxplot, Correlation, Scatterplot, Smoothing,
Topics: Government,
Datafile Name: Draft Lottery

In 1970, Congress instituted a random selection process for the military draft. All 366 possible birth dates were placed in plastic capsules in a rotating drum and were selected one by one. The first date drawn from the drum received draft number one and eligible men born on that date were drafte...

Methods: Experimental Design, Nested Model, General Linear Model, Dotplot, Boxplot,
Topics: Biology,
Datafile Name: Eggs

A single can of dried eggs was stirred well. Samples were drawn and a pair of samples (claimed to be of two "types"), was sent to each of six commercial laboratories to be analyzed for fat content. Each laboratory assigned two technicians, who each analyzed both "types". Since...

Methods: Boxplot, MANOVA, Post Hoc Test,
Topics: Biology,
Datafile Name: Flea Beetles

Data were collected on the genus of flea beetle Chaetocnema, which contains three species: concinna (Con), heikertingeri (Hei), and heptapotamica (Hep). Measurements were made on the width and angle of the aedeagus of each beetle.

Methods: Boxplot, Two Sample T-Test, Pooled T-Test, Transformation,
Topics: Psychology,
Datafile Name: Fusion Time

This dataset contains results from an experiment in visual perception using random dot stereograms, such as that shown below. Both images appear to be composed entirely of random dots. However, they are constructed so that a 3D image (of a diamond) will be seen, if the images are viewed with a ste...

Methods: ANOVA, Boxplot,
Topics: Health,
Datafile Name: Hearing

Hearing aids must be fit individually. A common way to test whether a particular hearing aid is right for a patient is to play a tape on which 25 words are pronounced clearly but at low volume, and ask the patient to repeat the words as heard. Different lists are available that are supposed to be...

Methods: Outlier, Regression, Polynomial Regression, Boxplot,
Topics: Sports,
Datafile Name: Hitters 1920 -1950

The data set is Branch Rickey's set of outstanding hitters in baseball over the period 1920 to 1950 based on the sum of what Rickey defines as on-base average and extra-base power (OBA + EBP). The student should be asked to run the simple regression of EBP on OBA, as well as the second degre...

Methods: T-test, Outlier, Boxplot, Mann Whitney U Test, Summary Statistics,
Topics: Health, Medical,
Datafile Name: Nursing Home Data

Acorn is the acronym for Association of Community Organizations for Reform Now. These data were presented by Acorn to a Joint Congressional Hearing on discrimination in lending. Acorn concluded that "banks generally have exhibited a pervasive pattern of lending...

Methods: Assumptions, Regression, Outlier, T-test, Boxplot, Diagnostics, Multivariate Regression,
Topics: Economics,
Datafile Name: OECD Economic Development

Data on per capita income and the percentages of the labor force employed in agriculture, industry, and service occupations for 20 European OECD countries lend themselves to several kinds of analysis. Univariate analysis of each of the variables is interesting, and the relations between per capit...

Methods: Boxplot, MANOVA, Post Hoc Test,
Topics: Archeology, Science,
Datafile Name: Pottery

Samples of Romano-British pottery were taken at four sites in the United Kingdom. A chemical analysis of the pottery was performed to measure the percentage of five metal oxides present in each sample. The purpose of the analysis was to determine if different sites produced pottery with different...

Methods: Boxplot, Scatterplot, Outlier,
Topics: Education, Economics,
Datafile Name: Faculty Salaries

A faculty salary study was done at The Ohio State University to compare faculty salaries with those at other universities. Data were collected from the Association of American Universities. The overall average salary for OSU was obtained by computing the weighted average of salaries at each facul...

Methods: Outlier, Boxplot, T-test, Summary Statistics,
Topics: Consumer, Economics, Finance,
Datafile Name: Refusals in Mortgage Lending

Acorn is the acronym for Association of Community Organizations for Reform Now. These data were presented by Acorn to a Joint Congressional Hearing on discrimination in lending. Acorn concluded that "banks generally have exhibited a pervasive pattern of lending practices that have the ef...

Methods: Boxplot, Regression,
Topics: Nature, Zoology,
Datafile Name: Wild Horses

Management of the growing mustang population on federal lands has been a controversial issue. A suggested method for controlling overpopulation is to sterilize the dominant male in each group. Eagle, Asa, and Garrott et al. (1993) conducted an experiment evaluating the effectiveness of sterilizin...

Methods: Pooled T-Test, ANOVA, Boxplot,
Topics: Psychology,
Datafile Name: Singers

Each singer in the NY Choral Society in 1979 self-reported his or her height to the nearest inch. Their voice parts in order from highest pitch to lowest pitch are Soprano, Alto, Tenor, Bass. The first two are typically sung by female voices and the last two by male voices.

One can exam...

Methods: Boxplot, Dotplot, Distribution, Summary Statistics,
Topics: Geography, Government,
Datafile Name: New Jersey

This datafile, containing the area in square miles of each county in New Jersey, may be used to illustrate basic descriptive statistics. The file contains only 21 observations, so students may even calculate means, medians, etc. by hand if desired. A boxplot or dotplot is a good graphical summary...
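
The hand calculation suggested above can also be sketched in code. The example below computes the mean and the five-number summary that a boxplot displays. The areas are made-up placeholders, not the New Jersey county values, and the quartile convention (`method="inclusive"`) is an assumption, since textbooks and packages define quartiles in slightly different ways.

```python
import statistics


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max), the values a boxplot is drawn from."""
    ordered = sorted(values)
    # "inclusive" treats the data as the whole population of interest;
    # other quartile conventions can give slightly different Q1 and Q3.
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return ordered[0], q1, median, q3, ordered[-1]


# Made-up areas in square miles; NOT the actual county data.
areas = [50, 120, 130, 220, 230, 330, 470, 500, 800]

summary = five_number_summary(areas)
mean_area = statistics.fmean(areas)
print("min, Q1, median, Q3, max:", summary)
print(f"mean: {mean_area:.1f}")
```

A mean noticeably above the median, as in this made-up sample, is one sign of right skew that a boxplot or dotplot would also show.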

Methods: Boxplot, T-test, ANOVA,
Topics: Physics,
Datafile Name: Michelson

In 1879, A. A. Michelson made 100 determinations of the velocity of light in air using a modification of a method proposed by the French physicist Foucault. These measurements were grouped into five trials of 20 measurements each. The numbers are in km/sec, and have had 299,000 subtracted from th...
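
Because the recorded values are coded by subtracting 299,000, a short sketch of the decoding step may help. The coded values below are made up for illustration and are not taken from the Michelson datafile; the names `OFFSET_KM_PER_SEC` and `decode` are hypothetical.

```python
import statistics

# Constant removed from every measurement, per the description above.
OFFSET_KM_PER_SEC = 299_000


def decode(coded_values):
    """Add the offset back to recover velocities in km/sec."""
    return [value + OFFSET_KM_PER_SEC for value in coded_values]


# Made-up coded measurements; NOT the actual Michelson determinations.
coded = [850, 740, 900, 1070, 930]
decoded = decode(coded)

# Shifting by a constant moves the mean by that constant
# and leaves the spread unchanged.
mean_coded = statistics.mean(coded)
mean_decoded = statistics.mean(decoded)
sd_coded = statistics.stdev(coded)
sd_decoded = statistics.stdev(decoded)
print(mean_coded, mean_decoded)
print(sd_coded, sd_decoded)
```

Since coding only shifts the location, boxplots, t-tests, and ANOVA comparisons across the five trials give the same conclusions on the coded and decoded scales.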

Methods: Assumptions, Regression, Outlier, T-test, Boxplot, Diagnostics, Multivariate Regression,
Topics: Economics,
Datafile Name: OECD Economic Development

The Datafile OECD Economic Development contains per capita income (PCINC) and the percentage of the labor force employed in agriculture (AGR) for 20 European OECD countries in 1960. Below are the same variables for the United States by decades with PCINC adjusted t...

Methods: Boxplot, Paired T-Test,
Topics: Sociology, Economics,
Datafile Name: Labor Force

This dataset contains the labor force participation rate (LFPR) of women in 19 cities in the United States in each of two years (1968 and 1972). The data help to measure the growing presence of women in the labor force over this period.

It may seem reasonable to compare LFPR rates in th...
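
Since the same cities are measured in both years, the paired t-test listed under Methods works with the within-city differences. The sketch below computes the paired t statistic by hand. The rates are made-up placeholders, not the Labor Force datafile, and the function name `paired_t_statistic` is an assumption.

```python
import math
import statistics


def paired_t_statistic(first, second):
    """Return (t, degrees of freedom) for the mean of the paired differences."""
    if len(first) != len(second):
        raise ValueError("paired samples must have the same length")
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = statistics.fmean(diffs)
    # Standard error of the mean difference uses the sample standard deviation.
    std_err = statistics.stdev(diffs) / math.sqrt(n)
    return mean_diff / std_err, n - 1


# Made-up participation rates for five hypothetical cities; NOT the real data.
lfpr_1968 = [0.40, 0.45, 0.50, 0.55, 0.60]
lfpr_1972 = [0.43, 0.47, 0.54, 0.57, 0.64]

t_stat, df = paired_t_statistic(lfpr_1968, lfpr_1972)
print(f"t = {t_stat:.2f} on {df} degrees of freedom")
```

Pairing removes the city-to-city variation from the comparison, which is why the statistic is built from the differences rather than from the two yearly samples treated as independent.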

