Statistical inference is the process of using data from a sample to make estimates or draw conclusions about an entire population. In VCE Specialist Mathematics, this focuses on estimating the population mean (\(\mu\)) and the population proportion (\(p\)).
KEY TAKEAWAY: A point estimate provides no information about the precision of the estimate; a confidence interval provides a range of plausible values and a level of certainty.
A confidence interval for the population mean is constructed when the population standard deviation (\(\sigma\)) is known, or when the sample size (\(n\)) is large enough (\(n \ge 30\)) to use the sample standard deviation (\(s\)) as an approximation for \(\sigma\).
The \(C\%\) confidence interval for \(\mu\) is given by:
\$\(\bar{x} \pm z \cdot \frac{\sigma}{\sqrt{n}}\)\$
Where:
* \(\bar{x}\) is the sample mean.
* \(z\) is the critical value for the desired confidence level.
* \(\frac{\sigma}{\sqrt{n}}\) is the standard error of the mean.
* \(M = z \cdot \frac{\sigma}{\sqrt{n}}\) is the margin of error.
The value of \(z\) is determined by the standard normal distribution \(Z \sim N(0, 1)\).
| Confidence Level | \(z\) value (approx) | \(z\) value (exact) |
|---|---|---|
| 90% | 1.645 | \(invNorm(0.95, 0, 1)\) |
| 95% | 1.96 | \(invNorm(0.975, 0, 1)\) |
| 99% | 2.576 | \(invNorm(0.995, 0, 1)\) |
Note: For some technology-free questions, VCAA may specify using \(z \approx 2\) for a 95% confidence interval.
EXAM TIP: If a question asks for the “distance between the sample mean and the population mean,” they are asking for the Margin of Error (\(M\)).
The width (\(W\)) of a confidence interval is the distance between the upper and lower bounds.
\$\(W = \text{Upper Bound} - \text{Lower Bound} = 2 \times \text{Margin of Error}\)\$
\$\(W = 2z \frac{\sigma}{\sqrt{n}}\)\$
To find the minimum sample size \(n\) required to achieve a specific margin of error \(M\):
\$\(n = \left( \frac{z \cdot \sigma}{M} \right)^2\)\$
Always round \(n\) up to the nearest whole number to ensure the margin of error is not exceeded.
VCAA FOCUS: Questions often ask how the sample size must change to achieve a certain reduction in width. If the width is reduced by a factor of \(k\), the sample size must increase by a factor of \(\frac{1}{k^2}\). For example, to reduce width by \(\frac{2}{3}\) (to \(\frac{1}{3}\) of the original), \(n\) must increase by \(3^2 = 9\).
When dealing with categorical data (e.g., “Yes/No” responses), we estimate the population proportion \(p\) using the sample proportion \(\hat{p} = \frac{x}{n}\).
For a large sample size, the distribution of \(\hat{P}\) is approximately normal: \(\hat{P} \approx N\left(p, \frac{p(1-p)}{n}\right)\).
The \(C\%\) confidence interval for \(p\) is:
\$\(\hat{p} \pm z \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\)\$
This approximation is generally considered valid if \(n\hat{p} \ge 10\) and \(n(1-\hat{p}) \ge 10\).
COMMON MISTAKE: Using the population proportion \(p\) in the standard error formula when constructing a confidence interval. Since \(p\) is unknown (that’s why we are making the interval!), we must use the sample estimate \(\hat{p}\) to calculate the standard error.
It is a common misconception that a 95% confidence interval has a “95% probability of containing the population mean.”
| Feature | Population Mean (\(\mu\)) | Population Proportion (\(p\)) |
|---|---|---|
| Point Estimate | \(\bar{x}\) | \(\hat{p} = \frac{x}{n}\) |
| Standard Error | \(\frac{\sigma}{\sqrt{n}}\) | \(\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\) |
| Confidence Interval | \(\bar{x} \pm z\frac{\sigma}{\sqrt{n}}\) | \(\hat{p} \pm z\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\) |
STUDY HINT: Practice using your CAS calculator to find confidence intervals quickly. In the Statistics menu, look for
One-Sample Z Intervalfor means andOne-Prop Z Intervalfor proportions. Knowing how to do this manually is essential for “Show that” or technology-free questions.