Chapter 11. Interval Estimation and Hypothesis Testing
Chapter Purpose
Chapter 10 estimated a simple regression model relating milk prices to package volume. The slope coefficient was 417.0, but any sample estimate contains uncertainty. This chapter introduces standard errors, confidence intervals, and hypothesis tests using the actual Milk Data results.
This is our best estimate of the average relationship between package volume and price.
Confidence Interval
The 95% confidence interval for the volume coefficient is:
\[
333.45 \leq \beta_1 \leq 500.60
\]
Interpretation:
The data suggest that the true effect of a 1,000 ml increase in volume is likely to lie between approximately 333 and 501 price units.
model.conf_int()
0
1
const
320.857636
615.001780
Volume1000
333.451316
500.600835
Standard Errors
A standard error measures the precision of an estimate. Small standard errors imply greater precision. Confidence intervals and hypothesis tests are built from standard errors.
Because the p-value is below 0.05, we reject the null hypothesis.
Confidence Intervals and Significance
The confidence interval does not include zero. This agrees with the p-value result. The relationship between volume and price is statistically significant.
Statistical Significance Versus Economic Significance
Statistical significance asks whether the relationship is likely to be real. Economic significance asks whether the magnitude is large enough to matter.
The Milk Data coefficient of 417.0 is meaningful because a 1,000 ml increase in package volume is associated with a sizeable increase in price.
Warning
A statistically significant coefficient is not automatically a causal effect.
What We Learned From the Milk Data
Estimated coefficient: 417.0
95% confidence interval: [333.45, 500.60]
p-value: < 0.001
The evidence supporting a positive relationship between volume and price is very strong.
Common Mistakes
WarningCommon Mistake 1
Treating p-values as measures of economic importance.
WarningCommon Mistake 2
Ignoring confidence intervals.
WarningCommon Mistake 3
Confusing statistical significance with causality.
Key Takeaways
Regression estimates contain uncertainty.
Confidence intervals provide a range of plausible values.
The estimated volume coefficient is 417.0.
The 95% confidence interval is [333.45, 500.60].
The p-value is below 0.001.
Economists should interpret coefficients before focusing on p-values.