Ch2 Exam Questions Intermediate Statistical Investigations - Intermediate Statistical Investigations 1st Ed - Exam Bank by Nathan Tintle. DOCX document preview.
Chapter 2
Intermediate Statistical Investigations Test Bank
Question types: FIB = Fill in the blank Calc = Calculation
Ma = Matching MS = Multiple select
MC = Multiple choice TF = True-false
CHAPTER 2 TERMINAL LEARNING OUTCOMES
TLO2-1: Analyze paired data as a way to control observational unit variation appropriately.
TLO2-2: Analyze data on quantitative response from a randomized complete block design using both simulation and theory-based approaches.
TLO2-3: Analyze an observational study with two sources of explained variation understanding adjustment for another variable in the analysis and covariation.
Section 2.1: Paired Data
LO2.1-1: Use pairing to potentially reduce unexplained variation and increase the power of a study.
LO2.1-2: Explain how to analyze paired data appropriately.
Questions 1 through 3: Statisticians recommend randomizing the order of treatments in paired experiments, but is order really important? To investigate, 22 students in a statistics class each completed a memory puzzle twice, recording the amount of time (in seconds) it took to complete the puzzle on each attempt. They also calculate the difference between their two attempts (first – second). The results of the experiment are shown below.
First Attempts (n=22) | Second Attempts (n=22) | Differences (n=22) |
Mean = 83.755 SD = 18.318 | Mean = 75.963 SD = 18.008 | Mean = 7.792 SD = 8.283 |
- What is the value of the t-statistic for this study?
Sol:
- What is the value of the F-statistic for this study?
Sol:
- The 95% confidence interval for the mean difference between first and second attempts is (4.120, 11.465). Does this interval provide strong evidence of an order effect in this context? You may assume 0.05.
- Yes, there is strong evidence of an order effect, because the sample mean (7.792) is inside the interval.
- Yes, there is strong evidence of an order effect, because 0 is outside the interval.
- No, there is not strong evidence of an order effect, because the sample mean (7.792) is inside the interval.
- No, there is not strong evidence of an order effect, because 0 is outside the interval.
Questions 4 and 5: An economist at Vanderbilt University designed a study to compare different types of online auctions. In one experiment, he compared a Dutch auction to a first-price sealed bid auction. In the Dutch auction, the item for sale starts at a very high price and is lowered gradually until someone finds the price low enough to buy. In the first-price sealed bid auction, each bidder submits a single sealed bid, and the highest bid wins.
The researcher auctioned off collectible trading cards from the game Magic: The Gathering. He placed pairs of identical cards up for auction; one would go into the Dutch auction and other to the first-price sealed bid auction. He then looked at the difference in the prices he received on each pair. He repeated this for a total of 88 pairs.
- Which of the following are appropriate ways to state the null hypothesis in this scenario? Select all that apply.
- There is an association between type of auction and selling price.
- There is no association between type of auction and selling price.
- Are the validity conditions met for a one-sample t-test for paired data?
- No, because the distribution of selling prices are skewed right for both types of auctions.
- No, because the spread of the differences is less than half the spread of the original selling prices.
- Yes, because the distribution of the differences is unimodal and approximately symmetric.
- Yes, because the spread of the selling prices is roughly the same for both types of auctions.
Questions 6 through 10: Is there a difference between standing and sitting heart rates? To decide, 52 students measured their heart rates (in beats per minute) in two positions: once while standing and once while sitting. Each student then calculated the difference in their measurements: standing heart rate – sitting heart rate. Data are shown below.
- Which of the following inference methods is appropriate for this scenario? Select all that apply.
- Two-sample t-test
- One-way ANOVA F-test
- t-test for paired data
- In this study, it is helpful to control for person-to-person variability, removing it as a source of unexplained variation. Which features of the graph above indicate that pairing will lead to a more powerful test in this scenario? Select all that apply.
- The connected lines show that those with high standing heart rates also tend to have high sitting heart rates and vice versa.
- On average, standing heart rates tend to be higher than sitting heart rates (based on sample means).
- The variability of the differences is less than the variability within treatment groups (based on sample standard deviations).
- The distribution of difference is unimodal and roughly symmetrical, so it is reasonable to assume the residuals are Normally distributed.
- The following statistical model could be used to predict heart rates.
Predicted heart rate = overall mean + position effect + person effect
One participant had a standing heart of 88 and a sitting heart rate of 82. Using this information and the data shown above, calculate the person effect for this participant.
Sol: Person mean = (88+82)/2 = 85; Overall mean = (77.0+70.0)/2 = 73.5; Person effect = 85–73.5 = 11.5
- Based on the ANOVA table below, what proportion of the variability in heart rates is due to participants (person-to-person variability)? Give your answer as a decimal.
Source | DF | Sum of Squares | Mean Squares | F | p-value |
Position | 1 | 1281.0 | 1281.0 | 43.4243 | <0.001 |
Participant | 51 | 8154.5 | 8154.5 | 5.4201 | <0.001 |
Residuals | 51 | 1504.5 | 29.50 | ||
Total | 103 | 10940.0 |
Sol: 8154.5/10940.0 = 0.74
- As shown in the ANOVA table below, there is strong evidence of a position effect – a difference between standing and sitting heart rates. Suppose you had used only position as a predictor of heart rate, ignoring person-to-person variability in your model. How would the strength of evidence change?
The evidence of a position effect would be ______ (stronger/weaker). Specifically SSError would be ______ (larger/smaller), which would lead to a ______ (larger/smaller) F-statistic for testing the effect of position.
Source | DF | Sum of Squares | Mean Squares | F | p-value |
Position | 1 | 1281.0 | 1281.0 | 43.4243 | <0.001 |
Participant | 51 | 8154.5 | 8154.5 | 5.4201 | <0.001 |
Residuals | 51 | 1504.5 | 29.50 | ||
Total | 103 | 10940.0 |
- In a paired experiment to compare sleep aids, participants all try two different treatments. The order of the treatments is randomized, and sleep quality is measured for each treatment. A partially completed ANOVA table is given below.
Source | DF | Sum of Squares | Mean Squares | F |
Treatment | 1 | 322 | ||
Person | 19 | 561 | ||
Error | 19 | |||
Total | 39 | 1211 |
Calculate the standard error of the residuals.
Sol: SSError = ; SE of residuals =
- Which of the following are examples of paired designs? Select all that apply.
- A teacher gives each of her students two cookies: one made with eggs and butter and one made with a vegan recipe. The students rate the taste of each cookie on a scale of 1-10.
- The managing partner of a consulting firm wants to compare the effectiveness of two computer skills training programs. Employees are randomly assigned to complete one of the two programs. Afterward, both groups take the same computer skills test.
- A college student is trying to decide which of two grocery stores – Publix or Kroger – offers better prices. The student selects 50 items that are available in both stores, and for each of these items, records both the Publix price and the Kroger price.
- Which of the following are benefits of a paired design compared to an independent groups design?
- Paired designs allow us to generalize results to a larger population.
- Paired designs always support cause-and-effect conclusions.
- Paired designs are always practical, whereas independent groups designs do not make sense in some contexts.
- Paired designs often lead to tests with more statistical power to detect an effect.
- In a paired design where each participant is observed under two conditions (treatment and control), why is it inappropriate to conduct a two-sample t-test?
- Because it is not reasonable to consider multiple observations on the same individual as independent
- Because it is not reasonable to consider multiple observations on the same individual as Normally distributed
- Because this design leads to large differences in standard deviations for the treatment and control groups
- Because this design leads to large differences in means for the treatment and control group
- True or False: In a paired design where each participant is observed under two conditions (treatment and control), the sum of the person effects across all participants will be zero.
Section 2.2: Randomized Complete Block Designs
LO2.2-1: Analyze a randomized complete block design using simulation-based methods.
LO2.2-2: Analyze a randomized complete block design using theory-based methods.
- True or False: In a randomized complete block design, each block should be a diverse group of observational units, roughly mirroring the population of interest.
- A matched pairs design is a specific kind of block design. When can the term “matched pairs” be used to describe a block design?
- When there are only two blocks of experimental units
- When there are only two observations in each block
- When there are two independent groups of experimental units
- When the alternative hypothesis in the test for a treatment effect is two-sided
- A blocking variable is a variable that the researcher believes is ________ (related to/independent of) the response variable; this variable ________ (is/is not) manipulated by the researcher.
- The table below shows the means and standard deviations for each block in a randomized block design.
Block | Mean Response | Standard Deviation of Responses |
1 | 500 | 12 |
2 | 501 | 14 |
3 | 498 | 9 |
4 | 497 | 12 |
Was the choice of blocking variable effective in this study?
- Yes, because the mean responses are very similar for the four blocks.
- No, because the mean responses are very similar for the four blocks.
- Yes, because the standard deviations are fairly similar for the four blocks.
- No, because the standard deviations are not similar enough for the four blocks.
Questions 5 through 7: A farmer has a field with three rows, where each row contains nine plants. The farmer suspects that the row may affect plants’ yield, because of variations in sunlight across his property, so he decides to use a randomized block design to test the effectiveness of three different fertilizers.
- Describe an appropriate randomized complete block design.
- Randomly assign the plants to one of the three fertilizers, which results in nine plants per treatment. Record both the yield and the row for each plant.
- Randomly sample one plant from each of the three rows, then randomly assign the three selected plants to the three different fertilizers.
- Randomly assign each of the three rows to one of the three fertilizers, so that all plants in a given row will receive the same treatment.
- For each row, randomly assign the nine plants to the three fertilizers, which results in three plants per treatment in each row.
- The null distribution of F-statistics was produced by randomly re-assigning the plant yields within blocks and calculating the F-statistic for each shuffle. The observed F-statistic for the real study was 0.92. What do you conclude?
- After adjusting for block effects, this data provides strong evidence that all three fertilizers are the same in terms of their effect on yield.
- After adjusting for block effects, this data provides strong evidence that all three fertilizers are different in terms of their effect on yield.
- After adjusting for block effects, this data provides strong evidence that at least one of the fertilizers are different in terms of its effect on yield.
- Even after adjusting for block effects, this data provides only weak evidence of a fertilizer effect.
- The null distribution of F-statistics was produced by randomly re-assigning the plant yields to treatments within blocks and calculating the F-statistic for each shuffle. How would the null distribution change if the plant yields were randomly re-assigned to treatments completely at random ignoring blocks?
- The null distribution of F-statistics would have a larger spread.
- The null distribution of F-statistics would have a smaller spread.
- The null distribution of F-statistics would be skewed left instead of skewed right.
- The null distribution of F-statistics would be approximately Normal.
Questions 8 through 12: Concentration is a one-person memory game in which cards are laid face down on a surface and two cards are flipped face up at a time. The object of the game is to turn over matching pairs of cards.
An online version of this game includes three different sets of cards: one has images of animals on the cards, one has images of babies, and one has images of holiday scenes. Are these three variations equally difficult? To investigate, eight students tried all three versions of the game in random order. They recorded the amount of time (in seconds) it took to complete the game. The overall mean time to complete the game was 79.31 seconds. The graphs below display the raw data and the data after adjusting for block effects.
- To find adjusted data values, it is first necessary to calculate block effects for each student in the sample. One student completed the animal version of the game in 65 seconds, the baby version in 73 seconds, and the holiday version in 59 seconds. Calculate the block effect for this student.
Sol: Student average = (65+73+59)/3 = 65.67; Student effect = 65.67–79.31 = -13.64
- Compare the variability in the adjusted data to the variability in the raw data.
In the adjusted data, the between-groups variability, as measured by SSModel, is ______ (>, <, =) the between-groups variability in the raw data.
In the adjusted data, the within-groups variability, as measured by SSError, is ______ (>, <, =) the within-groups variability in the raw data.
- In a test to decide whether the three versions of the game are equally difficult, which dataset would produce a larger F-statistic?
- The raw dataset (not accounting for blocks) would produce a larger F-statistic.
- The adjusted dataset (accounting for blocks) would produce a larger F-statistic.
- The F-statistics for the two datasets would be equal.
- You want to use simulation to test whether the three versions of the game are equally difficult. Which of the following is an appropriate design for the simulation. Select all that apply.
- Write the times from the raw dataset on cards. Shuffle and deal them into three piles representing the three versions of the game ignoring blocks. Calculate the F-statistic based on a one-variable ANOVA. Repeat.
- Write the times from the raw dataset on cards. Randomly re-assign the times to versions within blocks. Calculate the F-statistic based on a one-variable ANOVA. Repeat.
- Write the times from the adjusted dataset on cards. Shuffle and deal them into three piles representing the three versions of the game ignoring blocks. Calculate the F-statistic based on a one-variable ANOVA. Repeat.
- True or False: The validity conditions for ANOVA with a blocking variable are the same as the validity conditions for ANOVA without a blocking variable, except that the conditions are checked based on the adjusted data not the raw data.
Questions 13 through 15: Sixty-eight allergy sufferers were recruited to participate in a study comparing the effectiveness of different allergy treatments. During allergy season, each participant tried three treatments with active ingredients plus one placebo treatment, with the order of the treatments randomized and adequate time between treatments. Participants rated the severity of their allergy symptoms while taking each treatment. A partially filled in ANOVA table is given below.
Source | DF | Sum of Squares | Mean Squares | F |
Treatment | 2174.4 | |||
Participant | 3913.6 | |||
Error | 201 | |||
Total | 271 | 8697.4 |
- In this study, what are the blocks?
- The 68 allergy sufferers
- The 3 allergy treatments with active ingredients
- The 4 allergy treatments (including the placebo)
- The 272 ratings recorded across participants
- Calculate the F-statistic that could be used to test for a treatment effect.
Sol: ; ;
- What if this analysis had failed to account for person-to-person variability in allergy symptoms? How would the ANOVA table change?
The row for Participant would be removed, and the amount of variation due to individual differences in allergy symptoms (SSParticipants = 3913.6) would be added to _________ (SSTreatment/SSError), which would make the F-statistic _________ (larger/smaller).
Section 2.3: Observational Study with Two Explanatory Variables
LO2.3-1: Analyze an observational study with two sources of explained variation.
LO2.3-2: Understand what it means to adjust for another variable in the analysis.
LO2.3-3: Explore effects of covariation in explanatory variables on the analysis.
Questions 1 through 4: In 2018, a sample of 628 academic faculty from universities across the country were surveyed about their salaries (in US dollars). The results were classified according to each faculty member’s academic rank (instructor, assistant professor, associate professor, and full professor) and gender (male, female).
- Calculate the effect of “female” on wages. Note that the sample sizes for male and female faculty members are not equal.
| Salary | |
Mean | SD | |
Full sample (n = 628) | $114,946 | $51,681 |
Male (n = 428) | $123,953 | $55,281 |
Female (n = 200) | $95,671 | $55,281 |
Sol: ;
- In the sample, male faculty members are paid more than female faculty members. Does this analysis provide evidence that gender causes differences in salary?
- It depends whether the results are statistically significant. If this difference results in a small p-value, then the analysis supports causal conclusions.
- It depends how the sample was selected. If the sample is representative of the population, then the analysis supports causal conclusions.
- No, because this sample is not large enough. There are over a million academic faculty in the U.S., so it is not appropriate to draw conclusions based on only 628.
- No, because data come from an observational study not a randomized experiment, so there may be confounding variables present.
Ans D; LO2.3-1; Difficulty: Medium; Type: MC
- The ANOVA table below summarizes a model that uses both gender and academic rank to predict a faculty member’s salary (in $1000s).
Source | DF | Adj SS | Adj MS | F | p-value |
Gender | 1 | 15,167 | 15,167 | 9.54 | 0.0021 |
Rank | 3 | 575,905 | 191,968 | 120.81 | <0.0001 |
Error | 623 | 989,757 | 1,589 | ||
Total | 627 | 1,674,691 |
Calculate the sum of squares for the covariation - the amount of variation in salaries that is attributed jointly to gender and rank.
- SSCovariation = SSGender + SSRank = 591,072
- SSCovariation = SSGender + SSRank + SSError = 1,580,829
- SSCovariation = SSTotal – SSGender – SSRank = 1,083,619
- SSCovariation = SSTotal – SSGender – SSRank – SSError = 93,862
Ans D; LO2.3-3; Difficulty: Medium; Type: MC
- Statistical software was used to calculate the adjusted effects of gender and rank. The statistical model is given below.
with
According to this model, if we compare two faculty members of the same rank, on average, the male faculty member earns _______ dollars more per year than the female faculty member.
Questions 5 through 8: A real estate agent collected information on 100 recent home sales in their town. In addition to selling prices (in $1000s), the agent recorded information about the home’s location (north side of town or south side of town) and the number of bedrooms (at least 3 or less than 3). A partially filled in ANOVA table for the two-variable model is given below.
Source | DF | Adj SS | Adj MS | F | p-value | R2 |
Location | 1 | 19,843 | 0.0100 | 0.0631 | ||
Bedrooms | 1 | 22,180 | 0.0066 | 0.0705 | ||
Error | 97 | 278,799 | ||||
Total | 99 | 314,433 |
- What does the adjusted sum of squares for Location represent?
The variation in prices that is explained by Location, _________ (excluding/including) the variation explained jointly by Location and Bedrooms.
Ans excluding; LO2.3-2; Difficulty: Easy; Type: FIB
- For a one-variable model that uses only Location as a predictor of price, R2 = 0.09. Is there an association between Location and Bedrooms?
- Yes, there is an association between Location and Bedrooms.
- No, there is not an association between Location and Bedrooms.
- We do not have enough information to decide if there is an association between Location and Bedrooms.
Ans A; LO2.3-3; Difficulty: Medium; Type: MC
- Calculate the F-statistic for Location.
Sol:
- After adjusting for the number of bedrooms, does this data provide strong evidence of a difference in the population mean prices for houses on the north side and houses on the south side? You may assume that the sample is representative of all houses in this town.
- No, because the p-value for Location is small.
- No, because the R2 value for Location is small.
- Yes, because the p-value for Location is small.
- Yes, because the R2 value for Location is small.
Questions 9 and 10: Sixty-eight allergy sufferers were recruited to participate in a study comparing the effectiveness of different allergy medications. During allergy season, all 68 participants tried three medications with active ingredients plus one placebo medication, with the order of the treatments randomized and adequate time in between. Participants rated the severity of their allergy symptoms while taking each treatment.
- True or False: The mean of the four treatment means will be the same as the overall mean rating.
- True or False: The design of this study ensures there is no relationship between the treatments and the blocks; thus there is no covariation.
Questions 11 and 12: In an observational study, two categorical explanatory variables are used to predict a response. You may assume that there is a moderate amount of covariation.
Source | DF | Adj SS | F | p-value |
Variable 1 | ||||
Variable 2 | ||||
Error | ||||
Total |
- How do you calculate SSModel for this study?
- SSModel = SSVariable1 + SSVariable2
- SSModel = SSTotal – SSError
- Both (A) and (B) would produce the correct value for SSModel.
- Neither (A) nor (B) would produce the correct value for SSModel.
- Suppose Variable 2 were removed from the model. Would the adjusted sums of squares and p-value for Variable 1 change?
- Removing Variable 2 would not affect the sum of squares or p-value for Variable 1.
- Removing Variable 2 would not affect the sum of squares for Variable 1, but the p-value for Variable 1 may change.
- Removing Variable 2 would not affect the p-value for Variable 1, but the sum of squares for Variable 1 may change.
- Removing Variable 2 may change both the sum of squares and the p-value for Variable 1.
Questions 13 through 15: Students in a statistics class wanted to know how customers rate one of their favorite coffee shops. The students administered surveys and asked customers to rate their experience in the shop on a scale of 1-10. They also asked whether the survey respondent was a college student and how often they visit coffee shops (at least once a week or less than once a week).
- The students decide to use a two-variable model that predicts ratings based on whether the customer is a student and the frequency of their coffee shop visits. Based on the graph below, do you expect covariation – variation in ratings that is attributed jointly to Student and Frequency?
- Yes, because more than half of the customers in the sample are students.
- Yes, because students tend to give higher ratings than non-students.
- Yes, because students are more likely to visit coffee shops at least once per week.
- No, because the sample is evenly divided between customers who visit coffee shops at least once per week and those who visit less than once per week.
- Consider two possible models for predicting ratings. A one-variable model that predicts ratings based on frequency of coffee shop visits has an R2 value of 0.06. A one-variable model that predicts ratings based on whether the customer is a student has an R2 value of 0.03. What is the R2 value of a two-variable model that uses both Frequency and Student to predict ratings?
- R2 = 0.06 + 0.03 = 0.09
- R2 = 0.06 - 0.03 = 0.03
- R2 = (0.06 + 0.03) 2 = 0.045
- There is not enough information to calculate R2 for the two-variable model.
- Consider two possible models to predict ratings:
Model 1:
, with
Model 2:
with
Compare the adjusted sum of squares for student in each of these two models.
The adjusted sum of squares for student in Model 1 ________ (>, <, =) the adjusted sum of squares for student in Model 2.
Document Information
Connected Book
Intermediate Statistical Investigations 1st Ed - Exam Bank
By Nathan Tintle