Ch8 Complete Test Bank Advanced Methods For Establishing

Predictive Analytics for Business Strategy, 1e (Prince)

Chapter 8 Advanced Methods for Establishing Causal Inference

1) In the context of regression analysis, a variable that allows us to isolate the causal effect of a treatment on an outcome due to its exogenous correlation with the treatment is known as a(n):

A) instrumental variable.

B) control variable.

C) difference in difference estimator.

D) dummy variable.

2) A variable is exogenous if that variable:

A) has no effect on the treatment variable beyond the combined effects of other variables already in the determining function.

B) has no effect on the outcome variable beyond the combined effects of other variables already in the determining function.

C) has a strong correlation with the weather.

D) is uncorrelated with the residuals of a regression estimated by OLS.

3) An instrumental variable is relevant if that variable is:

A) a dichotomous variable.

B) normally distributed.

C) a consistent estimator of the coefficient on the treatment.

D) correlated with the treatment after controlling for other control variables in the determining function.

4) In order for a variable to be a valid instrumental variable it needs to satisfy which two conditions?

A) Exogenous and consistent

B) Efficient and uncorrelated

C) Relevant and exogenous

D) Relevant and efficient

5) In words, why can utilizing an instrumental variable be an effective means toward identifying the causal effect of a treatment on an outcome?

A) It allows you to identify the causal effect of the treatment by controlling for unobservables that may crucially affect the outcome.

B) It allows you to identify the causal effect of the treatment by using only variation in the treatment that is correlated with unobservables that affect the outcome.

C) It allows you to identify the causal effect of the treatment by using only variation in the treatment that isn't correlated with unobservables that affect the outcome.

D) None of the answers is correct.

6) All of the following conditions are necessary for an instrumental variable Z to be a valid instrument for X in the regression: Yi = β0 + β1Xi + Ui except for which condition?

A) Cov(Xi, Zi) ≠ 0

B) Cov(Zi, Ui) = 0

C) Cov(Xi, Yi) ≠ 0

D) None of these choices are correct.

7) In estimating the effect of price on sales (Salesi = α0 + α1Pricei + Ui), you are attempting to find an instrumental variable that will solve the endogeneity problem caused by the confounding factor of number of competitors being within Ui, which is correlated with price. Which of the following statements would suggest that wholesale costs would satisfy the exogenous condition to be a potential instrument variable?

A) Wholesale costs are uncorrelated with price.

B) Wholesale costs are correlated with price.

C) Wholesale costs are uncorrelated with sales.

D) Wholesale costs are uncorrelated with number of competitors.

8) In estimating the effect of price on sales (Salesi = α0 + α1Pricei + Ui), you are attempting to find an instrumental variable that will solve the endogeneity problem caused by the confounding factor of number of competitors being within Ui, which is correlated with price. Which of the following statements would suggest that wholesale costs would satisfy the relevant condition to be a potential instrument variable?

A) Wholesale costs are uncorrelated with price.

B) Wholesale costs are correlated with price.

C) Wholesale costs are uncorrelated with sales.

D) Wholesale costs are uncorrelated with number of competitors.

9) Why is it the case that an instrumental variable is not found directly in the determining function?

A) If an instrument is exogenous, then it has no effect on the outcome beyond those already in the determining function.

B) If an instrument is relevant, then it has no effect on the outcome beyond those already in the determining function.

C) If an instrument is exogenous, then it has an effect on the outcome just not a statistically significant one.

D) If an instrument is relevant, then it has an effect on the outcome just not a statistically significant one.

10) The process of using two regressions to measure the causal effect of a variable while utilizing an instrumental variable is known as:

A) nonlinear least squares.

B) two-stage least squares.

C) difference in difference.

D) probit.

11) In estimating a regression model with an instrumental variable typically one of two methods is used to estimate the mode. The two methods are:

A) two-stage least squares and fixed effects regression.

B) fixed effects regression and panel data methods.

C) two-stage least squares and generalized method of moments.

D) two-stage least squares and within estimator.

12) Two-stage least squares can be executed:

A) only with advanced software.

B) only on small samples.

C) straightforwardly by combining two separate regressions.

D) only with panel data sets.

13) Suppose you are trying to estimate the following regression: Yi = β0 + β1X1i + β2X2i + β3X3i + Ui, with an instrument for Zi for X2i. All of the following variables will be included (on the right- and left-hand side of this regression) in the second stage of two-stage least squares except for which one?

A) X1i

B) X2i

C) X3i

D) Yi

14) Suppose you are trying to estimate the following regression: Yi = β0 + β1X1i + β2X2i + β3X3i + Ui, with an instrument for Zi for X2i. All of the following variables will be included (on the right- and left-hand side of this regression) in the first stage of two-stage least squares except for which one?

A) X1i

B) X2i

C) X3i

D) Yi

15) What object from the first stage regression (of two-stage least squares) is critical to incorporate in your implementation of the second stage regression?

A) The estimated coefficient for the treatment variable from the first stage.

B) The estimated residuals for the treatment variable from the first stage.

C) The estimated predictions for the endogenous variable from the first stage.

D) The estimated residuals for the endogenous variable from the first stage.

16) If you conduct your estimation using two-stage least squares by separately estimating each regression (say in Excel), what condition should you be aware of when interpreting your second stage results?

A) The coefficients will be too low.

B) The coefficients will be too high.

C) The standard errors reported in the second stage are not accurate.

D) The residuals do not sum to zero as they due in the first stage.

17) Which object from the first or second stage reports whether an instrument is relevant?

A) Check if the residuals from the first stage regression are uncorrelated with the instrumental variable.

B) Check if the residuals from the second stage regression are uncorrelated with the instrumental variable.

C) Check if the coefficient on the instrumental variable in the first stage is statistically distinct from zero.

D) Check if the coefficient on the instrumental variable in the second stage is statistically distinct from zero.

18) Why do you use only some of the variation of your treatment variable (the component predicted in the first stage) when using an instrumental variable approach?

A) To get smaller standard errors.

B) To avoid the heteroscedasticity problem.

C) To have the estimate of the treatment effect be based only on variation that is exogenous variation to the other factors affecting the outcome.

D) To have the estimate of the treatment effect be based only on variation of the treatment that is correlated with the outcome.

19) A weak instrument is an instrumental variable:

A) whose distribution is non-normal.

B) whose partial correlation with the outcome is small.

C) whose partial correlation with the (endogenous) treatment variable is small.

D) that suffers from heteroscedasticity.

20) Generally speaking a consequence of using an instrumental variable approach with a weak instrument will be:

A) imprecise coefficient estimates (e.g., large standard errors).

B) low coefficient estimates.

C) high (in absolute magnitude) coefficient estimates.

D) multicollinearity.

21) Typically, the justification for an instrumental variable will come from:

A) some sound theoretical argument and some empirically testable conditions.

B) empirically testable conditions only.

C) "big data" application data sets.

D) panel data sets.

22) The following regression results are from the first stage regression of Price on income and wholesale costs—which is serving as the instrument for a particular grocery product across different markets: Pricei = 3.2(1.0) + 4.3(0.8)WholesaleCostsi + 5.5(2.7)Incomei, where standard errors are reported in parenthesis. What conclusion can be drawn about the instrumental variable?

A) It's unbiased.

B) With a t-stat over 2.5 (4.3/0.8), the instrument is relevant.

C) With a t-stat over 2.5 (4.3/0.8), the instrument is exogenous.

D) It's under reporting the effect of income on price.

23) Which object from the first or second stage reports whether an instrument is exogenous?

A) Check if the residuals from the first stage regression are uncorrelated with the instrumental variable.

B) Check if the residuals from the second stage regression are uncorrelated with the instrumental variable.

C) Check if the instrumental variable is uncorrelated with the outcome variable.

D) None of the answers is correct.

24) All standard regression software should be able to help you in determining if your instrument is:

A) exogenous.

B) relevant.

C) consistent.

D) measured with error.

25) Suppose you are estimating demand relationships, where you are attempting to identify the effect of price on quantity sold (i.e., Qi = α0 + α1Pricei + Ui). It will often be the case that the use of an instrumental variable for price will likely yield a coefficient (for α1) that relates to the use of multiple regression in what way?

A) Is lower, because price and Ui are likely positively correlated.

B) Is lower, because price and Ui are likely negatively correlated.

C) Is higher, because price and Ui are likely negatively correlated.

D) Is higher, because price and Ui are likely positively correlated.

26) Which of the following scenarios might allow you to try and test the exogeneity condition of an instrumental variable empirically?

A) When you know your instrument is relevant beforehand.

B) When you can estimate the model using generalized method of moments.

C) When you have two instrument variables and two endogenous variables.

D) When you have two instrument variables and one endogenous variable.

27) The hardest part of implementing the instrumental variables approach is:

A) deciding between using two-stage least squares or generalized method of moments.

B) reporting the appropriate standard errors.

C) finding and defending the exogeneity condition of your instrumental variable.

D) determining which of your instrumental variables are relevant.

28) Generally speaking, the difference-in-differences is defined to be the:

A) difference in outcomes from random assignment.

B) difference in outcomes from different treatment levels.

C) difference in the temporal change for the outcome between treated and untreated groups.

D) None of the answers is correct.

29) To estimate a difference-in-differences it requires that one has a:

A) panel data set.

B) cross-sectional data set.

C) time series.

D) selected sample.

30) Fortunately, the difference-in-differences estimate of a treatment effect also can be reported/interpreted as:

A) a simple difference in mean outcomes.

B) the coefficient on an interaction term.

C) the ratio of t-stats.

D) the second stage estimates of two-stage least squares.

31) Suppose you have two departments and have daily data on complaints and number of products they oversee for the departments over two years. In the second year, you increased the number of products under the scope of department #2 by 10. To estimate the effect of this increase in the number of products, you estimate the following regression: Complaintsit = α + β1 × Department #2it + β2 × Year2it + β3 × Department #2it × Year2it + Uit. In this regression, what is the "diff-in-diff"?

A) α + β1

B) β1 + β3

C) α + β3

D) β3

32) Suppose you've regressed profits across stores (i) in Indiana and Michigan over two years (t) on an Indiana dummy variable as well as on an interaction between an Indiana dummy variable and Year 2 dummy variable. Thus, your regression equation is: Profitsit = β0 + β1Indianait + β2Year2it Indianait + Ui. What is the marginal effect of a store being in Indiana based off this regression equation?

A) β0 + β1 + β2

B) β1 + β2 × Year2it

C) β0 + β1

D) β2

33) Suppose you have two departments and have daily data on complaints and number of products they oversee for the departments over two years. In the second year, you increased the number of products under the scope of department #2 by 10. To estimate the effect of this increase in the number of products, you estimate the following regression: Complaintsit = α + β1 × Department #2it + β2 × Year2it + β3 × Department #2it × Year2it + Uit. Which coefficient controls for non-time varying effects that make department #1 have more complaints than department #2?

A) β1

B) β2

C) α + β3

D) β3

34) When might it be the case that a difference-in-differences estimator still does not identify a consistent estimate of the causal treatment effect?

A) If the treatment is non-normally distributed.

B) If there are non-time varying effects that generate differences between treated and non-treated groups.

C) If there are time varying effects that generate amongst both treated and non-treated groups symmetrically.

D) If there are time-varying trends that are different amongst treated and non-treated groups.

35) The difference-in-differences approach relaxes some of the required assumptions for establishing causality by leveraging what dimension of the empirical setting?

A) Observing repeated observations of the same cross-sectional units over time.

B) Observing multiple cross-sectional units receiving the treatment.

C) Observing a time series for one series.

D) Being able to construct an instrument from the first stage difference.

36) Suppose the U.S. Federal Reserve raised its interest rate by 1 percentage point between 2014 and 2015, but the Bank of Canada made no change in its interest rate. You estimate the following model in an attempt to assess the effect of the change in interest rate on the unemployment rate:

Unemploymentit = β0 + β1U.S.it + β2Y2015it + β3U.S.it × Y2015it + Uit

Here, U.S.it is a dummy variable equaling one if the observation is in the U.S. and Y2015it is a dummy variable equaling one if the observation is in 2015. Which of the following variables may generate an endogeneity problem when attempting to use the estimate for the diff-in-diff (β3) as the effect of the interest rate change?

A) Changes in the U.S. fiscal policy between 2014 and 2015

B) Changes in overall level of trade in North American between 2014 and 2015

C) The difference in average labor force participation between U.S. and Canada

D) Changes in international immigration laws between 2014 and 2015

37) A fixed effects model is one in which the data generating process includes:

A) controls for cross-sectional groups.

B) controls for heteroscedasticity.

C) some control variables (e.g., Income) that vary over time.

D) a selected sample of cross-sectional units.

38) In the regression model Unemployment Rateit = α + γ1 × Alabamait + γ2 × South Carolinait + γ3 × North Carolinait + β1Incomeit+ φYearit + εit, the coefficients for the fixed effects of the model will be given by:

A) φ.

B) α.

C) Uit.

D) γ1, γ2, γ3.

39) The controls for cross-sectional groups in the data generating process are known as:

A) treatment effects.

B) fixed effects.

C) instrumental variables.

D) difference-in-differences.

40) In lieu of including a separate dummy variable for each different time period of a panel, often times the use of a simple time trend is chosen. This choice will lead to a(n):

A) increase in the r-squared.

B) increase in the adjusted r-squared.

C) increase in the number of parameters to be estimated.

D) decrease in the number of parameters to be estimated.

41) Given the regression results Unemployment Rateit = α + 0.8 (0.3) × Alabamait - 0.4 (0.2) × South Carolinait + 0.2 (0.3) × North Carolinait + 0.4 (0.12)MinimumWageit - 0.1 (0.1)Yeart, where the coefficients are reported with their standard errors in parenthesis, how should we interpret the coefficient for minimum wage?

A) An increase in minimum wage will increase the unemployment rate.

B) An increase in minimum wage will increase the unemployment rate, holding fixed (time constant) differences across states.

C) An increase in minimum wage will increase the unemployment rate, holding fixed the downward trend in unemployment rates occurring across all of these three states.

D) An increase in minimum wage will increase the unemployment rate, holding fixed (time constant) differences across states, and the downward trend in unemployment rates occurring across all of these three states.

42) Given the regression results Unemployment Rateit = α + 0.8 (0.3) × Alabamait - 0.4 (0.2) × South Carolinait + 0.2 (0.3) × North Carolinait + 0.4 (0.12)MinimumWageit - 0.1 (0.1)Yeart, where the coefficients are reported with their standard errors in parenthesis, what might be a fact that would make you concerned about the interpretation of the coefficient on minimum wage as being causal?

A) North Carolina has consistently had a higher share of college educated in the population relative to other states in the analysis.

B) National policies have led to stricter conditions to apply for unemployment insurance.

C) The minimum wage rate is different across many states in the southern region of the U.S.

D) In most instances, when a state raises the minimum wage it is also accompanied by state legislation that changes fiscal policy towards economic activity.

43) In estimating a fixed effects model using panel data, which of the following variables will not be effective controls if you use a full set of (cross sectional) fixed effects for individuals?

A) Year to year changes in income for individuals

B) Yearly (cumulative) education attainment for individuals

C) Income taxes paid

D) Birthplace of the individual

44) Interpreting the coefficients on fixed effects will always be based on what?

A) If the regression was run using STATA or Excel.

B) The sign of the coefficient on the instrumental variable in the first stage regression.

C) The base group omitted from the regression.

D) The standard error on the treatment variable.

45) Given the role of cross sectional fixed effects in the empirical strategy for identifying causal effects, instead of conducting hypothesis tests of any one coefficient on a fixed effect being zero, it is typical to conduct what sort of hypothesis test?

A) A hypothesis test for a fixed effect coefficient being equal to 1 instead of zero.

B) A hypothesis test for a fixed effect coefficient being equal to -1 instead of zero.

C) A joint hypothesis test for all of the fixed effect coefficients being equal to zero.

D) A joint hypothesis test that the fixed effect coefficients are all equal to the treatment effect.

46) Generally speaking, what are the methods available to the econometrician who wants to estimate a linear model with a fixed effects model design (i.e., dummy variable for individual units observed over multiple periods)?

A) Within Estimator and Fixed Effects regression

B) Diff-in-diff and Two-stage least squares

C) Two-stage least squares and Fixed Effects regression

D) Within estimator and instrumental variable regression

47) When one uses within-group differences in variables to estimate parameters in the data generating process, you are using what approach?

A) Two-stage least squares

B) Nonlinear least squares

C) Within estimation

D) Time trends

48) A potential downside of using within estimation is:

A) r-squared is less meaningful.

B) the coefficients will be biased.

C) the estimator will not yield estimates of the fixed effects themselves.

D) the standard errors will be larger.

49) A potential upside of using within estimation besides the reduction in the number of parameters to be estimated is:

A) r-squared is more meaningful.

B) the coefficients will be biased.

C) the estimator will yield estimates of the fixed effects themselves.

D) the standard errors will be larger.

50) All of the following coefficients/statistics will be the same across the dummy variable and within estimator approaches for estimating a fixed effects model except:

A) coefficient on treatment variable.

B) coefficient on time trend.

C) coefficient on additional control variables.

D) r-squared

Document Information

Document Type:

DOCX

Chapter Number:

Created Date:

Aug 21, 2025

Chapter Name:

Chapter 8 Advanced Methods For Establishing Causal Inference

Author:

Jeff Prince

Connected Book

Predictive Analytics 1e Complete Test Bank

By Jeff Prince

Test Bank General

View Product →

Explore recommendations drawn directly from what you're reading

Chapter 6 Correlation Vs. Causality In Regression Analysis

DOCX Ch. 6

Chapter 7 Basic Methods For Establishing Causal Inference

DOCX Ch. 7

Chapter 8 Advanced Methods For Establishing Causal Inference

DOCX Ch. 8 Current

Chapter 9 Prediction For A Dichotomous Variable

DOCX Ch. 9

Chapter 10 Identification And Data Assessment

DOCX Ch. 10

$24.99

100% satisfaction guarantee

Buy Full Test Bank

Quick Navigation

Preview Document Info Connected Book Recommendations

Benefits

Immediately available after payment

Answers are available after payment

ZIP file includes all related files

Files are in Word format (DOCX)

Check the description to see the contents of each ZIP file

We do not share your information with any third party

Ch8 Complete Test Bank Advanced Methods For Establishing

Document Information

Connected Book

Predictive Analytics 1e Complete Test Bank

Explore recommendations drawn directly from what you're reading

$24.99

Quick Navigation

Benefits

Report Unauthorized Use

Added to Cart