Confidence Interval

Source: FAD1015 L21-L22 — Estimation of Population Mean, §3–6.

A confidence interval (CI) is an interval estimate for a population parameter with a specified level of confidence. It is one of the two main branches of statistical inference — the other being hypothesis testing.

Structure (Source: §3)

$$\mu = \text{point estimate} \pm \text{margin of error}$$

Lower Confidence Limit (LCL)
Upper Confidence Limit (UCL)
Width = UCL − LCL

Confidence Level

The confidence level is denoted $(1 - \alpha)100%$:

Confidence Level	$1 - \alpha$	$\alpha$	$z_{\alpha/2}$
90%	0.90	0.10	1.645
95%	0.95	0.05	1.96
99%	0.99	0.01	2.576

Interpretation (Source: §3)

Correct: "We are 95% confident that the true population mean $\mu$ lies between [LCL] and [UCL]."

Incorrect: "There is a 95% probability that $\mu$ is in this interval."

The population mean $\mu$ is fixed but unknown. The 95% refers to the procedure: if you repeated the sampling many times, approximately 95% of the constructed intervals would capture $\mu$. From the lecture simulation (20 studies, $n = 100$ each), about 95% of the 95% CIs contain the true mean.

Procedure: Constructing a CI for $\mu$ (Source: §4–7)

Step 1 — Identify $\sigma$ Status

Follow the lecture's decision tree:

graph TD
    Q1["Is sigma known?"] -->|Yes| Z["Use Z-distribution<br/>CI = xbar +/- z_alpha/2 * sigma/sqrt(n)"]
    Q1 -->|No| Q2["Is n >= 30?"]
    Q2 -->|Yes| Z2["Use Z-distribution<br/>approximate sigma with s<br/>CI = xbar +/- z_alpha/2 * s/sqrt(n)"]
    Q2 -->|No| T["Use t-distribution<br/>df = n - 1<br/>CI = xbar +/- t_alpha/2,n-1 * s/sqrt(n)"]

Step 2 — Find Critical Value

$z$: use the table above or standard normal table
$t$: use the $t$-table with $df = n - 1$ at column $\alpha/2$ (Source: §6 — rows = df, columns = upper-tail probability)

Step 3 — Compute Margin of Error

$$E = \text{critical value} \times \frac{\text{standard deviation}}{\sqrt{n}}$$

$\sigma$ known: $E = z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}}$
$\sigma$ unknown: $E = t_{\alpha/2,,n-1} \cdot \frac{s}{\sqrt{n}}$

Step 4 — Construct the Interval

$$\bar{x} - E \le \mu \le \bar{x} + E$$

or equivalently:

$$(\bar{x} - E,; \bar{x} + E)$$

Step 5 — Interpret

State the interval in context using the correct interpretation wording from §3.

Case 1: $\sigma$ Known (Source: §4)

When $\sigma$ is known (rare in practice — usually from historical data):

$$\bar{x} \pm z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}}$$

Assumptions (Source: §4):

Population is normally distributed, OR
$n \ge 30$ (Central Limit Theorem applies)

Example — Circuit Resistance (Source: §4, Example 1)

A sample of 11 circuits from a large normal population has $\bar{x} = 2.20\ \Omega$, $\sigma = 0.35\ \Omega$.

95% CI:

$$2.20 \pm 1.96 \times \frac{0.35}{\sqrt{11}} = 2.20 \pm 0.207$$

$$95%\ \text{CI} = (1.993,; 2.407)$$

99% CI:

$$2.20 \pm 2.576 \times \frac{0.35}{\sqrt{11}} = 2.20 \pm 0.272$$

$$99%\ \text{CI} = (1.928,; 2.472)$$

Observation from the lecture: Higher confidence level produces a wider interval.

Case 2: $\sigma$ Unknown, Small $n$ (Source: §6)

When $\sigma$ is unknown, substitute $s$ (sample SD). This adds uncertainty, so we use the Student's $t$-distribution with $df = n - 1$:

$$\bar{x} \pm t_{\alpha/2,,n-1} \cdot \frac{s}{\sqrt{n}}$$

Student's $t$-Distribution (Source: §6)

Bell-shaped and symmetric, but has fatter tails than $z$
As $n$ increases, $t \to z$ (standard normal = $t$ with $df = \infty$)

How to read the $t$-table (Source: §6):

Rows: degrees of freedom $\nu = n - 1$
Columns: upper-tail probability $\alpha$
For 95% CI with $n = 25$ ($df = 24$), use $\alpha = 0.025$ column → $t = 2.064$

Assumptions (Source: §6):

Population is normally distributed
If not normal, need $n \ge 30$ (fall back to Case 3)

Example — Random Sample (Source: §6, Example 3)

$n = 25$, $\bar{x} = 50$, $s = 8$. 95% CI:

$df = 24$, $t_{0.025,,24} = 2.064$
$50 \pm 2.064 \times \frac{8}{\sqrt{25}} = 50 \pm 3.30$
$95%\ \text{CI} = (46.70,; 53.30)$

Case 3: $\sigma$ Unknown, $n \ge 30$ (Source: §5)

The CLT ensures the sampling distribution of $\bar{x}$ is approximately normal. We approximate $\sigma$ with $s$ and use $z$:

$$\bar{x} \pm z_{\alpha/2} \cdot \frac{s}{\sqrt{n}}$$

Example — Tea Boxes (Source: §5, Example 2)

$n = 200$, $\bar{x} = 101.0$, $s = 2.78$. 99% CI:

$$101.0 \pm 2.576 \times \frac{2.78}{\sqrt{200}} = 101.0 \pm 0.506$$

$$99%\ \text{CI} = (100.494,; 101.506)$$

$z$ vs $t$ Summary (Source: §7)

Condition	Distribution	Formula
$\sigma$ known, normal pop (or $n \ge 30$)	$z$	$\bar{x} \pm z_{\alpha/2} \frac{\sigma}{\sqrt{n}}$
$\sigma$ unknown, normal pop	$t$ ($df = n - 1$)	$\bar{x} \pm t_{\alpha/2,,n-1} \frac{s}{\sqrt{n}}$
$\sigma$ unknown, $n \ge 30$ (CLT)	$z$ (approx with $s$)	$\bar{x} \pm z_{\alpha/2} \frac{s}{\sqrt{n}}$

Standardized forms from the lecture:

$$Z = \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} \sim N(0,1) \quad \text{and} \quad t = \frac{\bar{X} - \mu}{S/\sqrt{n}} \sim t_{n-1}$$

Factors Affecting Interval Width (Source: §4)

Higher confidence level → wider interval (larger $z_{\alpha/2}$ or $t$)
Larger sample size → narrower interval (smaller $\sigma/\sqrt{n}$ or $s/\sqrt{n}$)
More variable population → wider interval (larger $\sigma$ or $s$)

Connection to Hypothesis Testing (Source: FAD1015 L23-L24 — Hypothesis Testing About the Mean, §5.3)

A CI can be used for a two-tailed test at significance level $\alpha$:

$\mu_0$ falls outside the $(1-\alpha)$ CI → reject $H_0$
$\mu_0$ falls inside the $(1-\alpha)$ CI → do not reject $H_0$

Hypothesis Testing — the other branch of inference; uses CIs as one testing method
FAD1015 L21-L22 — Estimation of Population Mean — primary source lecture for this concept
FAD1015 Tutorial 10 — Estimation of Population Mean — tutorial practice problems from the lecture
FAD1015 Tutorial 10 — Estimation of the Population Mean — alternate tutorial source
FAD1015 Statistical Tables — Murdoch & Barnes — $z$ and $t$ tables
FAD1015 - Mathematics III — course page