Data Sets for Mind on Statistics (Utts and Heckard)

This file is a "printer friendly" version of the data set descriptions. The file readme.html is more useful for browsing the data sets and their descriptions.

beans
Fifteen students each put as many beans into a cup as they were able in 15 seconds. All students did the task twice, once with their dominant hand and once with their non-dominant hand. (Source: The authors)

Column   Name           Description
 C1        Dom             Beans placed with dominant hand
 C2        NonDom      Beans placed with nondominant hand.
 C3        Diff               Difference between two hands

bears-female
Age and physical measurement data for n=19 female wild bears. (Source: Minitab, Inc.).

Column   Name           Description
 C1         Age              age in months
 C2         Neck            neck girth, inches
 C3         Length         length of bear, inches
 C4         Chest           chest girth, inches
 C5         Weight         pounds

Cash5
Data for n = 1,560 plays of the Pennsylvania lottery game called Cash 5. In the game the state randomly picks 5 numbers between 1 and 39. The data are used in an example in Chapter 9. This game is called Fantasy 5 in California and some data for that state are in the dataset Fantasy5.
(Source: Pennsylvania Lottery Commission website, www.palottery.org)

Column   Name            Description
 C1         date              Date of game
 C2         low                Lowest number picked
 C3         second          Second number in low to high order
 C4         median          Median of numbers picked
 C5         fourth            Fourth number in low to high order
 C6         high               Highest number picked

ceodata
Ages and annual salaries for the CEOs of the 60 top ranked small companies in America in 1993. Source: Forbes, Nov. 8, 1993, "America's Best Small Companies," 
also at http://lib.stat.cmu.edu/ DASL/ Datafiles/ ceodat.html)

Column   Name           Description
  C1        Age              age of CEO
  C2        Salary          annual salary in thousands (There's one missing value.)

chap4-heights
Heights (inches) for 55 females and 45 males. The data are used by the Sampling applet in Section 4.2.  (Source: The authors.) 

Column   Name           Description
  C1        Sex              Male of Female
  C2        Height         height in inches 

cholest
Cholesterol levels for 28 heart attack patients measured 2, 4, and 14 days after the heart attack, and also cholesterol levels for 30 other patients who did not have a heart attack. (Source: Minitab, Inc.)

Column   Name           Description
 C1        2-Day           Cholesterol level 2 days after heart attack
 C2        4-Day           Cholesterol level 4 days after heart attack
 C3        14-Day         Cholesterol level 14 days after heart attack (9 missing values)
 C4        control          Cholesterol levels for 30 patients without a heart attack

chugtime
Body weights and the time it takes to chug a 12-ounce beverage for n=13 college students. The data were submitted by a student for a class project. (Source: William Harkness, Pennsylvania State University)

Column   Name            Description
 C1         Weight          body weight, pounds
 C2         ChugTime     time to chug, seconds

deprived
Amount of sleep per night and whether person feels sleep deprived for n=86 college students. (Source: Laura Simon, Pennsylvania State University)

Column   Name            Description
  C1        Deprived      Answer to, "Do you feel that you are sleep deprived?"
  C2        SleepHrs      Self-reported hours of sleep per night

Fantasy5
Data for n = 2,318 plays of the California lottery game called Fantasy 5. In the game, the state randomly picks 5 numbers between 1 and 39. This game is called Cash 5 in Pennsylvania and some data for that state are in the dataset Cash5. (Source: California Lottery website, www.calottery.com)

Column   Name            Description
 C1         Date             Date of game
 C2         Low               Lowest number picked
 C3         Second          Second number in low to high order
 C4         Third             Median of numbers picked
 C5         Fourth           Fourth number in low to high order
 C6         High              Highest number picked

Fantasy5-99
Data for the n = 365 daily plays in 1999 of the California lottery game called Fantasy 5. In the game, the state randomly picks 5 numbers between 1 and 39. The data in this data set are a subset of the data set Fantasy5. (Source: California Lottery website, www.calottery.com) 

Column   Name             Description
 C1         Date              Date of game 
 C2         Low               Lowest number picked 
 C3         Second          Second number in low to high order 
 C4         Third             Median of numbers picked 
 C5         Fourth           Fourth number in low to high order 
 C6         High              Highest number picked 

GSS-93
Data are for eleven variables and n = 1,606 respondents in the 1993 General Social Survey, a national survey done by the National Opinion Research Center at the University of Chicago Some questions are only asked of about two-thirds of the survey participants, so there is quite a bit of missing data. (Source: SDA archive at UC Berkeley website, http://csa.berkeley.edu:7502) 

Column   Name         Description
 C1         sex             Sex of respondent 
 C2         race           Race of respondent (White, African American, Other) 
 C3         degree       Highest educational degree received (Five categories)
 C4         relig           Religious preference (Catholic, Protestant, Jewish, Other) 
 C5         polparty     Does respondent think of self as Democrat, Republican, Indep. or Other? 
 C6         cappun       Does the respondent favor or oppose the death penalty.
 C7         tvhours       Hours of watching television on a typical day
 C8         marijuan     Whether the respondent thinks marijuana should be legalized or not 
 C9         owngun       Whether respondent owns a gun or not (Yes or No) 
 C10       gunlaw         Does respondent favor or oppose a law requiring a permit to buy a gun? 
 C11       age              Age of the respondent

GSS-93samp
n=500 randomly selected observations from the GSS-93 data set. The random selection was done by the authors using Minitab. 

handheight
Heights and stretched handspans for n=167 college students. Each student decided which of their hands to measure. The data are used for examples in Chapters 5 and 14. (Source: The authors)

Column   Name            Description
 C1         Sex               Male or Female
 C2         Height          Self-reported height (inches) 
 C3         HandSpan    Self-measured stretched handspan (cm)

happyface
Tip percents on restaurant bills for male and female servers who either draw a happy face on the bill or do not in a randomized experiment.The data are used for Example 16.6 and an example in Supplemental Topic S4. (Source: Professor Bruce Rind of Temple University)

Column   Name            Description
 C1         Message       HappyFace or None 
 C2         Sex                Sex of server  (Male or Female)
 C3         TipPct           Tip as percent of the bill

heightfoot
Heights and foot lengths for n=33 college students. (Source: William Harkness, Pennsylvania State University)

Column   Name           Description
 C1         height          Self-reported height, inches
 C2         foot              Self-measured foot length, cm

idealwt
Actual weight and ideal weight for n=182 college students. Actual weights were reported by the students The "ideal" weight is the response to, "What is your ideal weight?" The data were collected in the Fall semester of 1999. (Source: William Harkness, Pennsylvania State University)

Column   Name           Description
 C1         sex               Male or Female
 C2         actual           Self-reported weight, pounds
 C3         ideal             Student's ideal weight, pounds
 C4         diff                Difference between actual and ideal weights 

idealwtmen
Actual and ideal weights for the n=63 males in the idealwt data set. (Source: A subset of the idealwt data set.

idealwtwomen
Actual and ideal weights for the n=119 females in the idealwt data set. 

letters
A sample of 63 students wrote as many letters of the alphabet, in order as capital letters, as they could in 15 seconds using their dominant hand, and then repeated this task using their nondominant hand. The variables dom and nondom contain the raw data for the results. (Source: Laura Simon, Pennsylvania State University)

Column   Name           Description
 C1         sex               Male or Female
 C2         dom              Number of letters printed with dominant hand in 15 seconds
 C3         nondom        Number of letters printed with non-dominant hand in 15 seconds 

MusicCDs
Responses by n = 24 college students to the question, "About how many music CDs do you own?" Data are used for Example 2.7. (Source: The authors.)

Column   Name           Description
 C1         CDs              Number of music CDs owned
 C2         Sex               Male or Female 

pennstate1
Data from n=190 college students in a statistical literacy class. The survey is described in Section 2.1 of the text, and the data are the basis for several examples in Chapter 2.  (Source: The authors)

Column   Name             Description
 C1         Sex                 Male or Female
 C2         HrsSleep        Hours of sleep the previous night 
 C3         SQpick           Choice of either S or Q
 C4         Height            Reported height, inches  
 C5         RandNumb    "Random" pick of a number between 1 and 10  
 C6         Fastest            Fastest speed ever driven, mph
 C7         RtSpan            Measured stretched right handspan, cm
 C8         LftSpan           Measured stretched left handspan, cm 
 C9         Form                For SQpick variable, order of listing S and Q on survey form

pennstate1F
Data from the n=103 females in the pennstate1 data set.

pennstate1M
Data from the n=87 males in the pennstate1 data set.

pennstate2
Data from n=205 students in a statistics class for students in the social and behavioral sciences. The survey was done in the Spring semester of 2000. (Source: The authors)

Column   Name           Description
 C1         Sex              Male or Female
 C2         EarPrces     Total ear pierces in two ears
 C3         Tattoo         Does student have a tattoo (Yes or No)     
 C4         CDs             Student's estimate of how many music CDs he/she owns
 C5         Height         Self-reported height in inches
 C6         HtChoice     Height student would choose to be if they could choose any height. 
 C7         Looks          On a scale of 0-9, importance of looks (0) versus personality (9) 
 C8         Friends        Sex it's easiest to make friends with (Opposite, Same, No Diff)

pennstate3
Data from n=227 students in a statistics class for students in the social and behavioral sciences. The survey was done during the Spring semester of 1997. (Source: William Harkness, Pennsylvania State University)

Column   Name           Description
 C1         Sex               Male or Female  
 C2         WtFeel         Whether student feels he/she is Overweight, Underweight, About Right. 
 C3         Friends         Sex it's easiest to make friends with (Opposite, Same, No Diff)
 C4         atfirst            Do you believe in love at first sight? (Yes or No) 
 C5         supernat        Do you believe in the supernatural? (Yes or No)
 C6         xtraterr         Do you think there is extraterrestrial life? (Yes or No)
 C7         grtpers          Would you date someone with a great personality even if   
                                        you did not find them attractive? (Yes or No)

pennstate4
Data from n=75 students in a statistics class for biology and pre-med students The survey was done during the Spring semester of 1999. (Source: Laura Simon, Pennsylvania State University)

Column   Name            Description
 C1         Phone           Minutes spent talking on phone in typical week  
 C2         Exercise       Minutes spent exercising in typical week
 C3         BookCost     Amount spent on textbooks for present semester, dollars
 C4         Sex                Male or Female

pennstate5
Data from n=175 students in a statistics class for liberal arts and social science students The survey was done during the Spring semester of 2000. (Source: The authors.)

Column   Name            Description
 C1         TV                 Hours spent watching television in a typical day. 
 C2         Shower          Minutes spent taking typical shower 
 C3         Sex                Male or Female 

perhouse
Persons per household in the United States in Census years from 1850 to 2000. (Source: The World Almanac and Book of Facts, 1999, p. 383 and the U.S. Census Bureau)

Column   Name           Description
 C1         year             year of Census 
 C2         perhouse     Persons per household in U.S.

physical
Physical measurements for n=55 college students. Measurements were made by the students during a class activity. (Source: William Harkness, Pennsylvania State University.)

Column   Name           Description
 C1         Sex               Male or Female
 C2         Height          Self-reported height, inches
 C3         LeftArm       Length of left forearm, cm
 C4         RtArm          Length of right forearm, cm
 C5         LeftFoot       Length of left foot, cm
 C6         RtFoot          Length of right foot, cm
 C7         LeftHand     Width of left palm, cm
 C8         RtHand        Width of right palm, cm
 C9         HeadCirc      Head circumference, cm
 C10       nose               Nose length, cm

ProfBooks
Number of pages, price, and whether the book is hardcover or softcover for n=15 books on a professor's bookshelf. (Source: The authors)

Column   Name           Description
 C1         Pages            Number of pages in book
 C2         Price              Cost of book, dollars 
 C3         Type              Type of book (Hardcover or Softcover)

pulsemarch
Students in a statistics class measured their pulse rates before and after marching in place for one minute. The sample size is n = 40.   (Source: Laura Simon, Pennsylvania State University)There are three columns of data:

Column   Name            Description
 C1         Sex                Male or Female
 C2         Before           Pulse rate before marching in place
 C3         After              Pulse rate after marching in place for one minute

rainfall
Annual rainfall in inches in Davis, California from 1951 to 1997. The data ae used for Example 2.9.

Column   Name            Description
C1         Year               Ranges from 1951 to 1997.
C2         Rainfall         Annual rainfall in inches

sats98
For the 50 States and the District of Columbia, the statewide mean verbal and math SAT scores in 1998. The data set also includes the estimated percent of high graduates who took the SAT in each state. (Source: The World Almanac and Book of Facts, 1999, p. 245)

Column   Name           Description
 C1         State            State abbreviation 
 C2         Verbal          Mean verbal SAT in state
 C3         Math            Mean math SAT in state
 C4         PctTook       Percent of graduates who took the SAT 

signdist
For n=30 drivers, the driver's age and the maximum distance at which the driver can read a highway sign at nigh. The data are used in examples in Chapters 5 and 14. (Source: Based on data collected by Last Resource, Inc., Bellefonte, PA)

Column   Name           Description
  C1        Age              Driver's age
  C2        Distance      Distance at which sign can be read, feet.    

sleepstudy
Hours of sleep the previous and hours of studying the previous day for n=116 students The data are used for a correlation example in Chapter 5.. (Source: The authors)

Column   Name           Description
 C1         Sleep           Hours of sleep the previous day
 C2         Study           Hours of studying the previous day.                        

speedlimit
Highway death rates (per 100 million vehicle miles of travel) and maximum speed limits in 10 countries. (Source: "Fifty-five mph speed limit is no safety guarantee," D. J. Rivkin, New York Times (letters to the editor), Nov. 25, 1986, p. 26.) 

Column   Name           Description
 C1         country
 C2         death            Highway accident death rate per 100 million miles of travel  
 C3         limit              Maximum speed limit 

temperature
Geographic latitude and mean temperature in January, April, and August for 20 cities in the United States. (Source: The World Almanac and Book of Facts, 1999, p. 220 and p. 456)There are five columns of data:

Column   Name           Description
 C1         city               City name
 C2         latitude         Geographic latitude of city 
 C3         JanTemp      Mean January temperature, Fahrenheit 
 C4         AprTemp      Mean April temperature, Fahrenheit   
 C5         AugTemp     Mean August temperature, Fahrenheit 

turkeypop
Data from an experiment done to examine the "anchoring" effect, which is that a person's estimate of a quantity is influenced by numbers given in prior questions. At a conference for teachers of business statistics, n = 45 participants were randomized to two experimental groups. One group answered the question "Do you think the population of Turkey is more than 10 million?"  The other group answered the question " Do you think the population of Turkey is more than 70 million?" All participants were then asked to estimate the population of Turkey, to the nearest million. If the anchoring effect holds, participants asked if the population is more than 70 million would tend to give higher estimates than those who were asked if it is more than 10 million. (Source: The authors)

Column   Name           Description
 C1         Form            The value given in the first question asked (either 10 or 70).     
 C2         Estimate      Participants estimate of the population of Turkey (millions)

UCDavis1
Data from n=173 college students in two different statistics classes. One class was for students not in the liberal arts (n=148), while the other class was for students in the liberal arts (n=25). The data were collected in the Spring quarter of 2000. (Source: The authors)

Column    Name           Description
 C1         Sex               Male or Female 
 C2         TV                Hours spent watching television each week.                 
 C3         computer      Hours spent at a computer each week. 
 C4         Sleep            Hours of sleep previous night.
 C5         Seat              Typical classroom seat location (Front Middle Back) 
 C6         alcohol          Number of alcoholic beverages consumed each week
 C7         Height           Self-reported height, inches
 C8         momheight    Mother's height, inches  
 C9         dadheight      Father's height, inches
 C10       exercise        Hours spent exercising each week
 C11       GPA              Student's GPA
 C12       class              Liberal Arts or Non Liberal Arts

UCDavis1F
Data for the n=94 females in the UCDavis1 data set.

UCDavis1M
Data for the n=79 males in the UCDavis1 data set.

UCDavis2
Data from n=239 college students. The data were collected in the Fall quarter of 2000. (Source: The authors)

Column    Name           Description
 C1         Sex               Male or Female
 C2         GPA             Student's GPA
 C3         Seat              Typical classroom seat location (Front Middle Back)
 C4         alcohol          Number of alcoholic beverages consumed in typical week
 C5         WtFeel          Does student feel he/she is Oveweight, Underweight, About Right?
 C6         Height           Self-reported height, inches
 C7         IdealHt          Student's choice of ideal height, inches
 C8         momheight     Mother's height, inches 
 C9         dadheight       Father's height, inches
 C10       Hand              Are you   Left-handed or Right-handed ? 
 C11       Looks             On a scale of 1-25 importance of personality (1) versus looks (25)
 C12       Friends          Who is easiest to make friends with? (Opposite sex   Same sex)
 C13       Cheat             Would you tell the instructor if you saw somebody cheating on an exam?
 C14       Smoke            Does student smoke at least one pack of cigarettes per week?

UCDchap5
Self-reported heights and parents' heights for n=18 college students. (Source: A subset of the dataset UCDavis2)

Column   Name              Description
 C1         Sex                 Coded as M or F
 C2         height             Student's height, in inches
 C3         momheight     Mother's height, in inches (as reported by student)
 C4         dadheight       Father's height, in inches (as reported by student)
 C5         midparent       Average of parents' heights, in inches

UCDwomht
Self-reported heights and parents' heights for n=94 college women. (Source: A subset of the dataset UCDavis1)

Column   Name             Description
 C1         height             Student's height, in inches
 C2         momheight     Mother's height, in inches (as reported by student)
 C3         dadheight       Father's height, in inches (as reported by student)
 C4         midparent      Average of parents' heights, in inches

wthghtM
Weight and height data for n=43 college men. The data are used in examples in Chapter 14. (Source: Laura Simon, Pennsylvania State University) 

Column   Name             Description
 C1         Sex                Male for all 43 rows.
 C2         Height            Self-reported height in inches
 C3         Weight           Self-reported weights in pounds
 

YouthRisk
Data from the 2001 Youth Risk Behavior Surveillance System survey of U.S. high school students. These data are for 12th graders, aged 17 and 18 years old, who say they drive.

(Source: http://www.cdc.gov/nccdphp/dash/yrbs/)

Column   Name              Description
 C1         Sex                 Male or Female
 C2         Grades           Grades usually achieved in school 
 C3         DaysCigs       Days student smoked cigarettes in the past 30 days (categorized) 
 C4         DaysDrnk      Days on which student drank alcohol during past 30 days (categorized) 
 C5         SeatBelt         How often student wears seatbelt when driving (categorized)