- Basic Regression Analysis Chapter 11 1e Full Test Bank - Understanding Business Statistics 1e Test Bank by Ned Freed. DOCX document preview.

- Basic Regression Analysis Chapter 11 1e Full Test Bank

CHAPTER 11

TRUE/FALSE

1. Residual analysis makes use of a plot of residuals (on the y axis) and the independent variable (on the x axis) to evaluate the assumptions about the error term in regression.

2. A procedure used for finding the equation of a straight line that provides the best fit by minimizing the sum of the squared vertical distances of points from the line is called the least squares method.

3. When the slope of a simple linear regression equation is a negative value, the correlation coefficient can be either positive or negative.

4. In residual analysis, if the assumptions about the error term are valid then the plot of the residuals against the corresponding x values should have a funnel shape.

5. A procedure used for finding the equation of a straight line that provides the best fit for the relationship between the independent and dependent variables is called the minimum deviations method.

6. The coefficient of determination r2, a measure of the goodness of fit of the estimated regression equation, is the ratio of the explained variation in y to the total variation in y, and takes on values between 0 and 1.

7. The standard error of estimate (sy.x) in simple linear regression is the square of the Mean Square Error (MSE) which is the sum of squares error (SSE) multiplied by its degrees of freedom n-2.

8. In simple regression, the usual goal is to identify a cause-and-effect relationship between two variables.

9. In multiple regression, a single independent variable is linked to a set of two or more dependent variables.

10. In simple linear regression, the slope and intercept values for the least squares line fit to a sample of data points serve as point estimates of the slope and intercept terms of the least squares line that would be fit to the population of data points.

11. In simple linear regression, the least squares line fit to a sample of data will maximize the number of data points that will fall along that line.

12. In simple linear regression, the least squares line fit to a sample of data points seeks to minimize the maximum distance of points from the line.

13. In simple linear regression, the variable that will be predicted is labeled the dependent variable.

14. In simple linear regression, the difference between a predicted y value and an observed y value is commonly called the residual or error value.

15. In simple linear regression, the r2 value measures the percentage of total variation in the sample data that cannot be explained by the x-to-y relationship that has been identified.

16. In simple linear regression, the r2 value is the ratio of SSE/SST.

17. In simple linear regression, there is an additive relationship between SST, SSR, and SSE: that is, SST = SSR + SSE

18. In a simple linear regression, the least squares line turns out to be y = 120 + 4x. The observed value of y when x = 10 is 182. The residual value for x = 10 must be 22.

19. In a simple linear regression, the least squares line turns out to be y = 120 + 4x. The sum of the residuals (errors) around this line will be 0.

20. In simple linear regression, rejecting the null hypothesis in the key hypothesis test regarding the slope of the “population” line means we haven’t yet found sufficient sample evidence that there’s a useful linear relationship between x and y.

MULTIPLE CHOICE

21. In regression analysis, which of the following is NOT a required assumption about the error term, ε?

a. The expected value of the error term is zero.

b. The error term follows a standard normal distribution.

c. The standard deviation is constant for all values of x

d. The values of the error term are independent.

e. none of the above, all are assumptions

22. A least squares line:

a. assumes a relationship between the slope of x and the intercept of y

b. may be used to predict a value of y if the corresponding x value is given

c. must be linear, upward-sloping and have a positive y-intercept

d. minimizes the sum of the deviations between the observed value of y and

the estimated value of y

e. all of the above

23. A least squares line:

a. implies a cause-effect relationship between x and y

b. may be used to predict a value of y if the corresponding x value is given

c. can only be determined if a good linear relationship exists between x and y

d. minimizes the sum of the deviations between the observed value of y and the estimated value of y

e. all of the above

24. Application of the least squares method to regression analysis results in values of the y intercept and the slope that minimizes the sum of the squared deviations between the:

a. observed values of the dependent variable and the estimated values of the dependent variable

b. observed x values and the estimated y values

c. actual values of the independent variable and the estimated values of the dependent variable

d. actual values of x and the estimated values of x

e. none of the above

25. Which of the following is correct?

a. SSE = SSR + SST

b. SSR = SSE + SST

c. SST = (SSR)2

d. SST = SSR + SSE

e. none of the above

26. A regression analysis linking demand (y in 1000 units) and price (x in $) resulted in the following equation: estimated Y = 9 – 3x. This equation implies that if the price is decreased by $1, demand can be expected to:

a. increase by 9,000 units

b. decrease by 9,000 units

c. decrease by 3,000 units

d. increase by 3,000 units

e. none of the above

27. Application of the least squares method to regression analysis results in values of the y intercept and the slope that minimizes the sum of the squared deviations between the:

a. residuals of x and the residuals of y

b. actual value of the y-intercept and the estimated value of the y-

intercept

c. observed values of the dependent variable and the estimated values

of the dependent variable

d. observed values of the independent variable and the estimated values

of the independent variable

e. none of the above

28. Larger values of r2 imply that the observations are more closely grouped about the:

a. mean value of the independent variables

b. mean value of the dependent variable

c. least squares line

d. origin

e. none of the above

29. In regression analysis, which of the following is NOT a required assumption about the error term ε?

a. expected value of the error term is zero

b. variance of the error term is the same for all values of x

c. values of the error term are positive

d. error term is normally distributed

e. all of the above

30. Application of the least squares criterion to regression analysis results in values of the y intercept and the slope that minimizes the sum of the squared deviations between the:

a. observed values of x and the estimated values of y

b. observed values of y and the estimated values of x

c. observed values of the dependent variable and the estimated values

of the dependent variable

d. observed values of the independent variable and the estimated values

of the independent variable

e. none of the above

31. A regression analysis between demand (y in 1000 units) and price (x in $) resulted in the following equation: estimated Y = 9 – 5x. This equation implies that if the price is increased by $1, the demand is expected to

a. increase by 9000-5000 = 4000 units

b. decrease by 9000 units

c. decrease by 9000 + 5000 = 14,000 units

d. decrease by 5,000 units

e. none of the above

32. Which of the following is NOT true regarding the coefficient of determination, r2?

a. it is the sum of squares error (SSE) divided by the sum of squares total

(SST)

b. it is a measure of the goodness of fit of the estimated regression equation

c. it takes on values between 0 and 1

d. it explains the amount of variation in y due to x

e. none of the above: all are true

33. In regression analysis, which of the following is NOT a required assumption about the error term, ε?

a. expected value of the error term is zero

b. error term has a normal distribution

c. standard deviation of the error term is constant for all values of x

d. values of the error term are independent

e. none of the above, all are required assumptions

34. If the correlation coefficient is a positive value, then the slope of the regression line:

a. must also be positive

b. can be either negative or positive

c. can be zero

d. must be negative

e. none of the above

35. Which of the following best describes the least squares criterion?

a. it identifies the best fitting line as the line that minimizes the sum of the

squared vertical distances of points from the line

b. the best fitting line minimizes the sum of the horizontal distances from the

line

c. it minimizes the sum of the absolute distances from the line

d. it is a measure of the goodness of fit of the estimated regression equation

e. none of the above

36. The purpose of regression analysis is to find a mathematical relationship that allows us to:

a. predict the value of an independent variable based on the value of a dependent variable.

b. predict the value of a dependent variable based on the value of an independent variable.

c. determine whether the average value of one variable, x, differs significantly from the average value of another variable, y.

d. determine whether two events X and Y are statistically dependent or statistically independent.

e. determine whether there is a cause-and effect relationship between two variables.

37. In simple linear regression, the:

a. least squares line is used to predict values of the independent variable.

b. residual plot will reveal information about the basic model assumptions.

c. least square line will not pass through the point (average x, average y).

d. value of r (correlation coefficient) cannot be 0.

e. all of the above

38. In simple linear regression, an r2 value of -.93 suggests that:

a. there is a fairly strong negative relationship between the variables.

b. the residual plot will be imprecise.

c. a computational error has been made.

d. the value of r (correlation coefficient) will be -.964.

e. none of the above

39. In simple linear regression, given SSE = 2348 and SST = 3569,

a. SSR = 5917.

b. r2 = .342.

c. SSD = 1343.

d. r = .364.

e. MSE = 892

a. we haven’t yet found sufficient sample evidence that there’s a useful linear relationship between x and y.

b. we have sufficient sample evidence that there’s no useful linear relationship between x and y.

c. there is convincing sample evidence of a useful linear relationship between x and y.

d. the sum of the residuals is such that no linear relationship can be established between x and y.

e. that the p-value is greater than alpha.

PROBLEMS

41. The following data have been collected for a simple linear regression analysis.

x

y

12

20

20

24

8

8

16

28

The least squares line would predict a y value of _______ for an x value of 15.

a. 16.2

b. 21.4

c. 20.9

d. 19.3

e. 18.1

42. In a simple linear regression analysis attempting to relate sales (y) to price (x), the following data are available:

x

y

Price ($)

Sales(units)

40

120

70

60

50

100

80

80

The least squares line would predict a y value of _______ for an x value of 60.

a. 90

b. 80

c. 75

d. 71

e. 84

43. The following data are available for a simple linear regression analysis attempting to link hours of training (x) to hourly output (y).

x

y

training

output

5

120

4

150

8

240

2

90

The least squares line would predict _______ units of output per hour when training is 7 hours.

a. 168

b. 190

c. 204

d. 218

e. 152

44. You are attempting to link weekly hours of exercise (x) to blood pressure (y) using simple linear regression and the following data:

x

y

Exercise

(hours)

Blood pressure

6

70

4

80

8

40

12

50

The estimated regression equation is y = 90 – 4x. Compute the standard error of estimate.

a. 18.21

b. 14.83

c. 21.45

d. 8.42

e. 26.9

45. You are attempting to link responses to a job announcement (y) to the number of days the announcement was repeated (x):

x

y

No. of days

No. of responses

16

20

24

32

4

12

20

28

The estimated regression equation is y = 7 + 1.0x. Compute the standard error of estimate.

a. 4.61

b. 2.45

c. 3.67

d. 5.49

e. 1.27

46. The following data have been collected for a simple linear regression analysis:

x

y

4

4

10

12

6

10

8

14

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = 1.4 and a = .2. Compute the value of r2 here.

a. .65

b. .23

c. .58

d. .36

e. .70

47. The following data have been collected for a simple linear regression analysis relating sales (y) to price (x):

x

y

Price

($)

Sales

(units)

5

100

7

60

4

120

8

80

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = -12 and a = 162. Compute the value of r2 here.

a. .66

b. .72

c. .54

d. .77

e. .81

48. The following data are available for a simple linear regression analysis attempting to link hours of training (x) to hourly output (y).

x

y

training

output

5

12

4

15

8

24

2

9

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = 2.4 and a = 3.6. Compute the value of r, the correlation coefficient here.

a. .735

b. .776

c. .857

d. .926

e. .857

49. You are attempting to link weekly hours of exercise (x) to blood pressure (y) using simple linear regression and the following data:

x

y

exercise

(hours)

Blood pressure

6

70

4

80

8

40

12

50

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = 90 and a = -4. Produce the 95% confidence interval estimate of the population intercept. Report the upper bound for your interval.

a. 162

b. 154

c. 204

d. 168

e. 177

50. You are using linear regression to link number of responses to a job announcement (y) to the number of days the announcement was repeated (x):

x

y

No. of days

No. of responses

8

10

12

16

2

6

10

14

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = 1.0 and a = 3.5. Produce the 95% confidence interval estimate of the population intercept and report the upper bound for your interval.

a. 4.5

b. 9.7

c. 8.3

d. 6.1

e. 7.2

51. In a simple linear regression analysis attempting to link lottery sales (y) to jackpot amount (x), the following data are available:

x

y

jackpot

($millions)

Sales

(millions)

12

60

14

70

6

40

8

50

The slope (b) of the estimated regression equation here is 3.5. The intercept (a) is 20. Produce the 95% confidence interval estimate of the population slope, β, and report the upper bound for the interval.

a. 5.02

b. 4.66

c. 7.23

d. 3.72

e. 8.05

52. The following data have been collected for a simple linear regression analysis relating sales (y) to price (x):

x

y

Price

($)

Sales

(units)

5

100

7

60

4

120

8

80

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = -12 and a = 162. Produce the 99% confidence interval estimate of the population slope, β. Report the upper bound for your interval.

a. -16.7

b. 40.5

c. 21.4

d. -15.2

e. 26.9

53. The following data are available for a simple linear regression analysis attempting to link hours of training (x) to hourly output (y).

x

y

training

output

5

12

4

15

8

24

2

9

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = 2.4 and a = 3.6. Produce the 90% confidence interval estimate of the population slope, β. Report the upper bound for your interval.

a. 6.86

b. 8.29

c. 4.42

d. 5.71

e. 3.90

54. The following data have been collected for a simple linear regression analysis relating sales (y) to price (x):

x

y

Price

($)

Sales

(units)

4

120

7

60

5

100

8

80

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = -12 and a = 162. You are to conduct a hypothesis test to determine whether you can reject the null hypothesis that the population slope, β, is 0 at the 1% significance level. Report the value of the appropriate sample test statistic, tstat.

a. -2.27

b. -1.23

c. -4.16

d. -3.52

e. -3.01

55. The following data have been collected for a simple linear regression analysis relating sales (y) to price (x):

x

y

Price

($)

Sales

(units)

4

120

7

60

5

100

8

80

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = -12 and a = 162. You are to conduct a hypothesis test to determine whether you can reject the null hypothesis that the population slope, β, is 0 at the 1% significance level. Compute the value of the sample test statistic, tstat, and use it to reach the proper conclusion.

a. Since tstat is inside the critical ±9.925 boundaries, we can’t reject the β = 0 null hypothesis.

b. Since tstat is outside the critical ±9.925 boundaries, we can reject the β = 0 null hypothesis.

c. Since tstat is inside the critical ±7.845 boundaries, we can’t reject the β = 0 null hypothesis.

d. Since tstat is inside the critical ±7.845 boundaries, we can reject the β = 0 null hypothesis.

56. The following data are available for a simple linear regression analysis attempting to link hours of training (x) to hourly output (y).

x

y

training

output

5

12

4

15

8

24

2

9

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = 2.4 and a = 3.6. You are to set up a hypothesis test to determine whether you can reject the hypothesis that the population slope, β, is 0 at the 5% significance level. Report the value of the appropriate sample test statistic, tstat.

a. 3.46

b. 2.12

c. 1.87

d. 2.4

e. 4.42

57. The following data are available for a simple linear regression analysis attempting to link hours of training (x) to hourly output (y).

x

y

training

output

5

12

4

15

8

24

2

9

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = 2.4 and a = 3.6. Set up a hypothesis test to determine whether you can reject the hypothesis that the population slope, β, is 0 at the 5% significance level. Compute the value of the appropriate sample test statistic, tstat, and use it to reach the proper conclusion.

a. Since tstat is outside the critical values of ±3.213, we can reject the β=0 null hypothesis.

b. Since tstat is between the critical values of ±3.213, we can't reject the β=0 null hypothesis.

c. Since tstat is outside the critical values of ±4.403, we can reject the β=0 null hypothesis.

d. Since tstat is between the critical values of ±4.403, we can’t reject the β=0 null hypothesis.

58. You are attempting to link weekly hours of exercise (x) to blood pressure (y) using simple linear regression and the following data:

x

y

exercise

(hours)

Blood pressure

6

70

4

80

8

40

12

50

In applying the least squares criterion, the slope (b) and the intercept (a) for the best-fitting line are b = -4 and a = 90. Produce the 95% confidence interval estimate of expected blood pressure when weekly exercise is 5 hours (that is, for x = 5) and report the upper bound for the interval.

a. 85.6

b. 70.0

c. 111.8

d. 93.2

e. 78.3

59. You are attempting to link responses to a job announcement (y) to the number of days the announcement was repeated (x):

x

y

No. of days

No. of responses

8

10

12

16

2

6

10

14

The slope term (b) in the estimated regression equation was 1.0. The intercept term (a) was 3.5. Show the 99% confidence interval estimate of the expected number of responses when the number of days the announcement is repeated is 11 and report the upper bound for the interval.

a. 15.62

b. 18.90

c. 20.35

d. 26.43

e. 22.29

60. In a simple linear regression analysis attempting to link lottery sales (y) to jackpot amount (x), the following data are available:

x

y

jackpot

($millions)

Sales

(millions)

12

60

14

70

6

40

8

50

The slope term (b) of the estimated regression equation turns out to be 3.5. The intercept term (a) turns out to be 20. Show the 95% prediction interval for sales for an individual case in which the jackpot is $9 (million). Show the upper bound for the interval.

a. 55.6

b. 78.2

c. 62.4

d. 73.7

e. 81.1

61. The following data have been collected for a simple linear regression analysis relating sales (y) to price (x):

x

y

Price

($)

Sales

(units)

5

100

7

60

4

120

8

80

The slope term (b) in the estimated regression equation is -12. The intercept term (a) is 162. Show the 90% prediction interval for sales in a particular case in which price is $6 (that is, for x = 6). Report the upper bound for your interval.

a. 102.3

b. 126.8

c. 113.4

d. 144.6

e. 135.2

62. The following data are available for a simple linear regression analysis attempting to link hours of training (x) to hourly output (y).

x

y

training

output

5

12

4

15

8

24

2

9

The slope term in the estimated regression equation was 2.4. The intercept term (a) was 3.6. Show the 90% prediction interval for hourly output for an individual with 2.5 hours of training. Report the upper bound for the interval.

a. 11.6

b. 20.4

c. 12.1

d. 22.4

e. 26.5

63. A National football League agent is conducting a simple linear regression study linking passes caught in a season (x) to annual salary (y) for wide receivers in the league. The data for five players are shown below.

Catches

Salary ($000)

110

410

80

300

90

420

50

310

120

460

The slope for the least squares line is 2.2. The intercept is 182. Compute the standard error of estimate (sy.x) here.

a. 116.34

b. 67.82

c. 43.51

d. 18.19

e. 21.68

64. Lillian Chernov is trying to identify a linear relationship that can be used to estimate the construction cost for a new sports complex that her construction company is bidding on. Below is a table showing construction costs and floor area for a sample of five similar buildings in the area.

Floor Area

(000 sq. ft.)

Construction

Cost ($000)

40

620

70

600

80

840

100

820

110

920

The slope for the least squares line is 4.4. The intercept is 408. Compute the "explained variation" (SSR) here.

a. 58080

b. 62365

c. 39267

d. 51256

e. 49241

65. Kareem Martin is trying to identify a linear relationship linking the amount of heat (x) applied in the final hardening stage to the heavy duty steel bolts that his company produces for bridge construction and the strength of those bolts (y). Below is a table showing data for a sample of five of the bolts.

Heat (oF)

Strength index

2400

820

1800

600

2000

840

1200

620

2600

920

The slope for the least squares line is .22. The intercept is 320. Compute the "unexplained variation" (SSE) here.

a. 58080

b. 42365

c. 39267

d. 22720

e. 49241

66. A cell phone's SAR (Specific Absorption Rate) is a measure of the amount of radio frequency (RF) energy absorbed by the body when using a cell phone. The rate can vary by the distance of the phone from the ear. Below is a table showing SAR (y) and distance (x) for a sample of five 10 minute calls.

Distance

(cm)

SAR

level

1.0

46

1.6

41

1.8

42

2.2

30

2.4

31

The slope of the least squares line is 59.6. The intercept is -12. Compute the "total variation" (SST).

a. 432

b. 202

c. 316

d. 520

e. 164

67. Vicente Management is trying to identify a linear relationship that can be used to estimate the heating costs for its commercial properties. Below is a table showing heating costs and floor area for a sample of five buildings in the area.

Floor Area

(1000s sq. ft.)

Heating

Cost ($1000s)

50

310

80

300

90

420

110

410

120

460

The slope of the least squares line is 2.2. The intercept is 182. Construct a 95% confidence interval estimate of E(y200), the expected heating cost for the set of all buildings that have 200,000 square feet of floor space. Report the upper bound for the interval.

a. 754.3

b. 836.5

c. 678.1

d. 792.8

e. 906.9

68. Tennis coach Ben Gordon is trying to identify a linear relationship that can be used to link average first serve velocity to the percent of first serve points won for his junior tennis players. Below is a table showing velocity and percent of points won for a sample of five junior players.

Velocity (mph)

% points won

120

92

80

60

90

84

110

82

50

62

The slope of the least squares line is .44. The intercept is 36.4. Determine the value of R, the correlation coefficient, that you would insert in the table below.

R

?

R Square

.719

Adjusted R Square

Standard Error

8.7

Observations

5

a. .719

b. .926

c. .773

d. .848

e. .874

69. Tennis coach Ben Gordon is trying to identify a linear relationship that can be used to link average first serve velocity to the percent of first serve points won for his junior tennis players. Below is a table showing velocity and percent of points won for a sample of five junior players.

Velocity (mph)

% points won

120

92

80

60

90

84

110

82

50

62

The slope of the least squares line is .44. The intercept is 36.4. Determine the value of R2, the coefficient of determination, that you would insert in the table below.

Multiple R

.848

R Square

???

Adjusted R Square

Standard Error

8.7

Observations

5

a. .848

b. .719

c. .659

d. .903

e. .792

70. A National Football League agent is conducting a simple linear regression study linking completion percentage in a season (x) to salary per game(y) for quarterbacks. The data for five players are shown below.

Completion %

Salary ($1000)

55

410

40

300

45

420

25

310

60

460

The slope for the least squares line is 4.4. The intercept is 182. Determine sy.x, the standard error of estimate, that should be entered in the table below.

Multiple R

.848

R Square

.719

Adjusted R Square

Standard Error

???

Observations

5

a. 116.34

b. 67.82

c. 43.51

d. 18.19

e. 21.68

71. The computer output table below shows partial results for a linear regression analysis. Determine SSE, the Error Sum of Squares, that should be entered in the ANOVA table.

ANOVA

df

SS

MS

F

Significance F

Regression

1

31815

4.500

Error (Residual)

3

???

Total

4

Coeffs

Standard Error

Intercept

-11.5

X

4.8

a. 12310

b. 8480

c. 18300

d. 21210

e. 19360

72. The computer output below shows partial results for a linear regression analysis. Determine the standard error (standard deviation) of the sampling distribution of the sample slope that should be entered in the indicated cell of the output table:

ANOVA

df

SS

MS

F

Significance F

Regression

XXX

XXX

XXX

XXX

Error (Residual)

XXX

XXX

Total

XXX

20200

Coeffs

Standard Error

t Stat

Intercept

-38

152.2

X

220

???

2.77

a. 86.2

b. 123.7

c. 79.4

d. 164.6

e. 143.5

73. The computer output table below shows partial results for a linear regression analysis. To test the “no useful linear relationship” hypothesis, determine tstat for the sample slope—a value that would be entered in the indicated cell of the output table as shown.

ANOVA

Df

SS

MS

F

Significance F

Regression

XXX

XXX

XXX

XXX

Error (Residual)

XXX

XXX

Total

XXX

20200

Coeffs

Standard Error

t Stat

P-value

Intercept

-38

152.2

XXX

XXX

X

220

79.4

???

XXX

a. 2.77

b. 1.43

c. 3.68

d. 4.10

e. 1.74

74. Preston Gomez is attempting to identify a linear relationship that will link heating and air conditioning costs to total floor area for his commercial properties. Below is a table showing heating /AC costs and floor area for five buildings in the area.

Floor Area

(000 sq. ft.)

Heating/AC

Cost ($000)

50

310

80

300

90

420

110

410

120

460

The computer output table below shows partial results for the linear regression analysis done here. Determine the upper bound on a 95% confidence interval estimate of the slope in the regression equation—a value that would be entered in the indicated cell of the output table.

ANOVA

Df

SS

MS

F

Significance F

Regression

XXX

XXX

XXX

XXX

Error (Residual)

XXX

XXX

Total

XXX

20200

Coeffs

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Intercept

182

74.1

XXX

XXX

-53.8

417.8

X

2.2

XXX

???

a. 3.56

b. 5.14

c. 4.21

d. 3.19

e. 4.73

75. Economist Joshua Grant is using linear regression to try to establish a link between the unemployment rate (x) and home sales in Lane County (y) during a given month. The following data are available:

x

y

Unemployment

rate

Home Sales

11

150

13

140

15

100

17

110

The slope term (b) in the estimated regression equation is -8. The intercept term (a) is 237. Compute the standard error of estimate (sy.x) here.

a. 13.2

b. 14.5

c. 17.8

d. 15.3

e. 16.9

76. Economist Joshua Grant is using linear regression to try to establish a link between the unemployment rate (x) and monthly home sales in the region (y). The following data are available:

x

y

Unemployment

rate

Home Sales

11

150

13

140

15

100

17

110

The slope term (b) in the estimated regression equation turns out to be -8. The intercept term (a) is 237. Explained variation (SSR) here would be _______.

a. 560

b. 920

c. 1130

d. 1280

e. 840

77. Economist Joshua Grant is using linear regression to try to establish a link between the unemployment rate (x) and monthly home sales in the region (y). The following data are available:

x

y

Unemployment

rate

Home Sales

11

150

13

140

15

100

17

110

The slope term (b) in the estimated regression equation is -8. The intercept term (a) is 237. Construct a 95% confidence interval estimate of E(y5), the expected number of for months in which the unemployment rate is price is 5 percent. Report the upper bound for the interval.

a. 326.4

b. 205.7

c. 289.3

d. 227.5

e. 301.6

78. Karina Burkholtz believes there is a linear connection between the hourly rate (x) that her company charges for its truck rentals and the number of weekly rental hours (y) that the company sells. The following data are available:

Rate

($/hour)

Weekly

Rental Hours

11

150

13

140

15

100

17

110

The slope term (b) in the estimated regression equation is -8. The intercept term (a) is 237. Construct a 95% confidence interval estimate of E(y12), the expected weekly rental hours for an hourly rate of $12. Report the upper bound for the interval.

a. 89.6

b. 166.3

c. 182.8

d. 125.7

e. 137.2

79. Karina Burkholtz believes there is a linear connection between the hourly rate (x) that her company charges for its truck rentals and the number of weekly rental hours (y) that the company sells. The following data are available:

Rate

($/hour)

Weekly

Rental Hours

17

1000

15

1400

13

1500

19

1100

The slope term (b) in the estimated regression equation is -80. The intercept term (a) is 2530. Determine the value of the correlation coefficient R that you should enter in the indicated cell of the table below:

Multiple R

???

R Square

Adjusted R Square

Standard Error

144.9

Observations

4

a. .868

b. .465

c. .719

d. .628

e. .581

80. Karina Burkholtz believes there is a linear connection between the hourly rate (x) that her company charges for its truck rentals and the number of weekly rental hours (y) that the company sells. The following data are available:

Rate

($/hour)

Weekly

Rental Hours

17

1000

15

1400

13

1500

19

1100

Determine the total sum of squares (SST) that should be entered in the indicated cell below.

ANOVA

df

SS

MS

F

Significance F

Regression

1

128000

Residual (Error)

2

Total

3

???

Coeffs

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Intercept

2530

45.9

39.5

434.5

X

-80

3.24

a. 170000

b. 164000

c. 152000

d. 198000

e. 187000

81. Home Products.com believes there is a linear connection between the number of service operators that it has available for its live customer chat option each work shift (x) and the percentage of items purchased by customers that are returned to the company each week (y). The following sample data were used to establish the relationship:

Operators

Returns

11

40

7

80

15

50

9

70

The computer output below shows partial results for the linear regression analysis done here. Determine the value of tstat that should be entered in the indicated cell below:

ANOVA

df

SS

MS

F

Significance F

Regression

1

560

XXX

XXX

XXX

Residual (Error)

2

XXX

Total

3

Coeffs

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Intercept

102

27.3

XXX

XXX

-15.7

219.7

X

-4

2.5

???

XXX

a. -3.22

b. -4.28

c. -1.60

d. -2.47

e. -1.96

82. Home Products.com believes there is a linear connection between the number of service operators that it has available for its live customer chat option each work shift (x) and the percentage of items purchased by customers that are returned to the company each week (y). The following sample data were used to establish the relationship:

Operators

Returns

11

40

7

80

15

50

9

70

The computer output below shows partial results for the linear regression analysis. Determine the upper bound on the appropriate 95% confidence interval estimate of the population slope—a value that would be entered in the indicated cell below.

ANOVA

df

SS

MS

F

Significance F

Regression

1

560

XXX

XXX

XXX

Residual (Error)

2

XXX

Total

3

Coeffs

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Intercept

102

27.3

XXX

XXX

-15.7

219.7

X

-4

2.5

XXX

-14.8

???

a. 3.8

b. 6.8

c. -4.3

d. -2.7

e. 4.6

83. Suppose you have done a simple linear regression analysis using a sample of 50 data points in an attempt to find a linear connection between worker overtime hours (x) and worker productivity (y) for employees of Sky Marketing Enterprises. The correlation coefficient turned out to be -.80.

If total variation in y was 22500, then explained variation (SSR) must be _____.

a. 13600

b. 4300

c. 4840

d. 11900

e. 14400

84. Suppose you have done a simple linear regression analysis using a sample of 50 data points in an attempt to find a linear connection between worker overtime hours (x) and worker productivity (y) for employees of Sky Marketing Enterprises. The correlation coefficient turned out to be -.80.

If total variation in y was 22500, then unexplained variation must be _______.

a. 9430

b. 8100

c. 6780

d. 9250

e. 8710

85. Suppose you have done a simple linear regression analysis using a sample of 50 data points in an attempt to find a linear connection between worker overtime hours (x) and worker productivity (y) for employees of Sky Marketing Enterprises. The correlation coefficient turned out to be -.80.

If total variation in y was 22500, then the standard error of estimate, sy.x, must be ______.

a. 13.0

b. 18.6

c. 14.4

d. 10.3

e. 12.5

86. Needing a simple cost estimator for commercial construction costs in the city, you hope to find a useful linear relationship between cost (y) and floor space (x). You have data from a sample of four recently completed building projects:

x

y

Floor Space

(000s of sq. ft.)

Construction Cost

($000s)

20

48

35

64

25

80

40

96

The estimated regression equation turns out to be y = 24 + 1.6x and sy.x = 17.9. You want to construct an appropriate hypothesis test to determine whether we can use the sample data here to reject a β = 0 null hypothesis. Calculate the value of the proper test statistic, tstat, for the test.

a. 4.2

b. 2.8

c. 5.1

d. 1.4

e. 6.3

87. Needing a simple cost estimator for commercial construction costs in the city, you hope to find a useful linear relationship between cost (y) and floor space (x). You have data from a sample of four recently completed building projects:

x

y

Floor Space

(000s of sq. ft.)

Construction Cost

($000s)

20

48

35

64

25

80

40

96

The estimated regression equation turns out to be y = 24 + 1.6x and sb= 1.13. Construct an appropriate hypothesis test to determine whether we can use the sample data here to reject a β = 0 null hypothesis at the 5% significance level. Report your conclusion.

a. Since tstat is outside the critical values of ±3.182, we can reject the β=0 null hypothesis.

b. Since tstat is outside the critical values of ±4.362, we can reject the β=0 null hypothesis.

c. Since tstat is not outside the critical values of ±4.303, we can't reject the β=0 null hypothesis.

d. Since tstat is not outside the critical values of ±2.182, we can't reject the β=0 null hypothesis.

88. Partial regression results from a sample of 24 observations are shown below. Determine the value for R2, the coefficient of determination here.

Multiple R

R2

???

Adjusted R2

Std Error

Observations

24

ANOVA

df

SS

MS

Regression

1

35800

35800

Error (Residual)

22

16400

745.5

Total

23

a. .547

b. .796

c. .866

d. .686

e. .717

89. Partial regression results from a sample of 24 observations are shown below. Determine the value of R, the correlation coefficient here.

Multiple R

???

R2

Adjusted R2

Std Error

Observations

24

ANOVA

df

SS

MS

Regression

1

35800

Error (Residual)

22

16400

745.5

Total

23

36900

a. .556

b. .745

c. .649

d. .867

e. .895

90. Partial regression results from a sample of 24 observations are shown below. Determine the standard error of estimate here.

Multiple R

R2

Adjusted R2

0.637

Std Error

???

Observations

24

ANOVA

df

SS

MS

Regression

1

568079.9

568079.9

Error (Residual)

22

301269.9

13694.09

Total

23

a. 108

b. 193

c. 157

d. 117

e. 106

91. Partial regression results from a sample of 12 observations are shown below. Determine the standard error (standard deviation) of the sampling distribution of the sample slope, b.

Coeff

Std Error

t Stat

P-value

Lower 95%

Upper 95%

Intercept

110

27.6

2.96

26.62

188.36

X

1.3

???

3.05

XXX

a. .117

b. .239

c. .426

d. .921

e. .352

92. Partial regression results from a sample of 12 observations are shown below. Determine the upper bound on the 95% interval estimate of the population slope, β.

Coeff

Std Error

t Stat

P-value

Lower 95%

Upper 95%

Intercept

-4.06

13.40

2.96

0.77

-33.9

25.8

X

5.38

.56

9.6

XXX

4.13

???

a. 7.29

b. 6.63

c. 5.78

d. 8.14

e. 5.26

93. Partial regression results from a sample of 12 observations are shown below. Can we use the sample results shown here to reject a β = 0 null hypothesis at the 5% significance level?

Coeff

Std Error

t Stat

P-value

Lower 95%

Upper 95%

Intercept

107.49

36.29

2.96

0.01

26.62

188.36

X

1.3

2.927

XXX

a. Yes, given tstat = 2.96 for the sample intercept term is greater than 2.228

b. No, given tstat = 2.927 for the sample slope term is less than 3.414.

c. No, given tstat = 2.96 for the sample intercept term is less than 3.414.

d. Yes, given tstat = 2.927 for the sample slope term is greater than 2.228.

e. The answer here can’t be determined without more information.

94. The regression printout below shows the results of a regression analysis intended to link company profits (x) to CEO salary (y). A sample of 18 CEOs was used. Based on the printout provided, what proportion of the variation in income CANNOT be explained by the profit-to-salary relationship represented by the estimated regression line?

Multiple R

0.976

R Square

0.952

Standard Error

17.63

Observations

18

ANOVA

DF

SS

Regression

1

98,994

Error (Residual)

16

4,974

Total

17

103,968

Coeff

Std Error

t Stat

P value

Lower 95%

Upper 95%

Intercept

7.4

9.143

.8

0.430

-11.98

26.78

profit

4.8

0.267

17.8

0.000

4.20

5.33

a. .082

b. .106

c. .391

d. .048

e. .024

95. The regression printout below shows the results of a regression analysis intended to link company profits (x) to CEO salary (y). A sample of 18 CEOs was used.

Multiple R

0.976

R Square

0.952

Standard Error

17.63

Observations

18

ANOVA

DF

SS

Regression

1

98,994

Error (Residual)

16

4,974

Total

17

103,968

Coeff

Std Error

t Stat

P value

Lower 95%

Upper 95%

Intercept

7.4

9.143

.8

0.430

-11.98

26.78

profit

4.8

0.267

17.8

0.000

4.20

5.33

Can you use the sample results shown here to reject a β = 0 null hypothesis at the 5% significance level? Explain.

a. Yes, since the upper bound on the 95% interval estimate of β is greater than 0, we can reject the β = 0 null hypothesis.

b. No, since the p-value for the intercept term is greater than the significance level of .05, we can't reject the β = 0 null hypothesis.

c. Yes, since the p-value for the coefficient of the profit variable is less than .05, we can reject the β = 0 null hypothesis.

d. No, since tstat for the profit variable is inside ±2.120, we can’t reject the β = 0 null hypothesis.

96. The regression printout below shows the results of a regression analysis intended to link variable x to variable y. A sample of 25 data points was used.

Coeff

Std Error

t Stat

P value

Lower 95%

Upper 95%

Intercept

223.1

182.3

1.22

0.223

-154.1

600.2

x

2.52

.99

2.56

0.018

.48

4.57

Can you use the sample results shown here to reject a β = 0 null hypothesis at the 1% significance level? Explain.

a. Yes, since the upper bound on the 95% interval estimate of β is greater than 0, we can reject the β = 0 null hypothesis.

b. No, since the p-value for the intercept term is greater than the significance level of .01, we can't reject the β = 0 null hypothesis.

c. No, since the p-value for the coefficient of the x variable is greater than .01, we can’t reject the β = 0 null hypothesis.

d. No, since tstat for the x variable is inside ±2.069, we can’t reject the β = 0 null hypothesis.

97. The regression printout below shows the results of a regression analysis intended to link variable x to variable y. A sample of 15 data points was used.

Coeff

Std Error

t Stat

P value

Lower 95%

Upper 95%

Intercept

53.34

428.0

.125

0.903

-871.3

978.0

x

4.25

2.58

1.65

0.124

-1.33

9.83

Can you use the sample results shown here to reject a β = 0 null hypothesis at the 5% significance level? Explain.

a. Yes, since the upper bound on the 95% interval estimate of β is greater than 0, we can reject the β = 0 null hypothesis.

b. Yes, since the p-value for the coefficient of x is greater than the significance level of .05, we can reject the β = 0 null hypothesis.

c. No, since the p-value for the coefficient of the x variable is greater than .05, we can’t reject the β = 0 null hypothesis.

d. No, since tstat for the x variable is outside ±2.160, we can’t reject the β = 0 null hypothesis.

98. Suppose a rental car company uses simple linear regression to develop an equation that predicts the repair costs for each of its vehicles based on the mileage of the car (total miles driven). The company uses the following data on repair costs and miles driven for five of its cars:

x

y

Mileage

(000s of miles driven)

Annual repair cost

($s)

80

1,200

30

400

50

550

15

300

The slope of the least squares regression line is:

a. 0.067

b. 2.89

c. 6.46

d. 13.85

99. Suppose a rental car company uses simple linear regression to develop an equation that predicts the repair costs for each of its vehicles based on the mileage of the car (total miles driven). The company uses the following data on repair costs and miles driven for five of its cars:

x

y

Mileage

(000s of miles driven)

Annual repair cost

($s)

80

1,200

30

400

50

550

15

300

The intercept of the least squares regression line is:

a. 0.067

b. 2.89

c. 6.46

d. 13.85

100. Suppose a rental car company uses simple linear regression to develop an equation that predicts the repair costs for each of its vehicles based on the mileage of the car (total miles driven). The company uses the following data on repair costs and miles driven for five of its cars:

x

y

Mileage

(000s of miles driven)

Annual repair cost

($s)

80

1,200

30

400

50

550

15

300

The least squares line would predict annual repair costs of $______ for a car with 60,000 miles on the odometer (rounded to the nearest whole dollar).

a. 646

b. 837

c. 1385

d. 831,006

101. Suppose a rental car company uses simple linear regression to develop an equation that predicts the repair costs for each of its vehicles based on the mileage of the car (total miles driven). The company uses the following data on repair costs and miles driven for five of its cars:

x

y

Mileage

(000s of miles driven)

Annual repair cost

($s)

80

1,200

30

400

50

550

15

300

The least squares line predicts that if the mileage of a car increases by 10,000 miles, its annual repair costs will increase by an estimated:

a. $13.85

b. $64.64

c. $138.52

d. $144.96

102. A local shopping mall would like to better understand the effectiveness of its security guards in preventing shoplifting. The mall’s operation manager uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed.

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

7

8

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5. The value of r2 here is:

a. 0.52

b. 0.68

c. 0.82

d. 2.15

103. A local shopping mall would like to better understand the effectiveness of its security guards in preventing shoplifting. The mall’s operation manager uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed.

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

7

8

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5. Compute the value of r, the correlation coefficient.

a. -0.82

b. 0.52

c. 0.68

d. 0.82

104. A local shopping mall would like to better understand the effectiveness of its security guards in preventing shoplifting. The mall’s operation manager uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed.

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

7

8

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5. Compute the value of the standard error of the estimate.

a. 0.52

b. 0.68

c. 0.82

d. 2.15

105. The manager of a local shopping mall uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed. The manager uses the following data:

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

7

8

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5. Produce the 95% confidence interval for the estimate of the population slope, β, and use it to complete the following sentence:

With 95% confidence, the slope of the population regression line is between ______ and ______.

a. -3.8, 1.3

b. -2.9, 31.9

c. -2.06, 3.59

d. -1.2, 14.5

106. The manager of a local shopping mall uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed. The manager uses the following data:

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

7

8

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5. Produce the 95% confidence interval for the estimate of the intercept of the population regression line and use it to complete the following sentence:

With 95% confidence, the intercept of the population regression line is between ______ and ______.

a. -3.8, 1.3

b. -2.9, 31.9

c. -2.06, 3.59

d. -1.2, 14.5

107. The manager of a local shopping mall uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed. The manager uses the following data:

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

7

8

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5.

Conduct a hypothesis test to determine whether you can reject the null hypothesis that the population slope, β, is 0 at a 5% significance level. Compute the value of the sample test statistic, tstat. Which of the following is the correct conclusion?

a. Since tstat is outside the critical values of ±3.182, we can reject the β=0 null hypothesis.

b. Since tstat is outside the critical values of ±4.303, we can reject the β=0 null hypothesis.

c. Since tstat is not outside the critical values of ±3.182, we cannot reject the β=0 null hypothesis.

d. Since tstat is not outside the critical values of ±4.303, we cannot reject the β=0 null hypothesis.

108. The manager of a local shopping mall uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed. The manager uses the following data:

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

7

8

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5.

Use the results of the regression analysis to produce a 95% confidence interval for the expected number of shoplifting incidents when there are 5 security guards on duty. Based on the confidence interval, complete the following sentence:

With 95% confidence, the expected number of shoplifting incidents on days when there are 5 security guards on duty is between ______ and ______.

a. -2.7, 19.4

b. 2.3, 14.4

c. 6.24, 10.36

d. 7.7, 8.9

109. The manager of a local shopping mall uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed. The manager uses the following data:

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

7

8

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5.

Use the results of the regression analysis to produce a 95% prediction interval for the expected number of shoplifting incidents on a particular day when there are 5 security guards on duty. Based on the prediction interval, complete the following sentence:

With 95% confidence, the predicted number of shoplifting incidents on a particular day when there are 5 security guards on duty is between ______ and ______.

a. -2.7, 19.4

b. 2.3, 14.4

c. 6.24, 10.36

d. 7.7, 8.9

110. Partial results from a regression analysis based on five observations are given below. Determine the standard error (standard deviation) of the sampling distribution of the estimated slope, b.

Coefficient

Std Error

t Stat

P-value

Intercept

-197.30

180.45

-1.09

0.35

x

15.96

???

4.38

0.02

a. 3.65

b. 4.38

c. 6.82

d. 180.45

111. The manager of a local shopping mall uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed. Below is a partial table showing the data that was used.

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

6

5

.

.

.

.

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5.

Calculate the residual associated with the first observation in the data set, where x = 4 and y = 10.

a. -2.03

b. -2.30

c. 0.30

d. 2.30

112. The manager of a local shopping mall uses simple linear regression to develop an equation that predicts the number of daily shoplifting incidents at the mall’s stores based on the number of security guards employed. Below is a partial table showing the data that was used.

x

y

Security guards

on duty

Daily shoplifting

incidents

4

10

.

.

.

.

9

3

The slope of the regression line is b = -1.2; the intercept of the regression line is a = 14.5.

Calculate the residual associated with the last observation in the data set, where x = 9 and y = 3.

a. -2.11

b. -0.7

c. 2.11

d. 0.7

Document Information

Document Type:
DOCX
Chapter Number:
11
Created Date:
Aug 21, 2025
Chapter Name:
Chapter 11 - Basic Regression Analysis
Author:
Ned Freed

Connected Book

Understanding Business Statistics 1e Test Bank

By Ned Freed

Test Bank General
View Product →

$24.99

100% satisfaction guarantee

Buy Full Test Bank

Benefits

Immediately available after payment
Answers are available after payment
ZIP file includes all related files
Files are in Word format (DOCX)
Check the description to see the contents of each ZIP file
We do not share your information with any third party