Summary of last week

Consider the linear regression model $Y = X β + ϵ$, where $E (ϵ) = 0_{n \times 1}$, $Var (ϵ) = σ^{2} I_{n}$, and the matrix $X_{n \times p}$ is fixed with rank $p$.

The least squares estimates are $\hat{β} = (X^{T} X)^{- 1} X^{T} Y$

Furthermore, the least squares estimates are the best linear unbiased estimates (BLUE), with $E (\hat{β}) = β$ and $Var (\hat{β}) = σ^{2} (X^{T} X)^{- 1}$.

We have not used any Normality assumptions to show these properties.
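A minimal simulation sketch of these properties, assuming NumPy is available. The design matrix, coefficient values, sample sizes, and the helper name `ols` are all made up for illustration; the errors are drawn from a uniform distribution on purpose, so nothing here relies on Normality.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 50, 3

# Hypothetical fixed design matrix with an intercept column (rank p)
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
beta = np.array([1.0, 2.0, -0.5])   # made-up "true" coefficients
sigma = 1.5


def ols(X, y):
    """Least squares estimate: solve the normal equations X^T X b = X^T y."""
    return np.linalg.solve(X.T @ X, X.T @ y)


# Uniform(-a, a) has mean 0 and variance a^2 / 3, so a = sigma * sqrt(3)
# gives errors with variance sigma^2 that are clearly not Normal.
half_width = sigma * np.sqrt(3.0)

n_sims = 5000
estimates = np.empty((n_sims, p))
for s in range(n_sims):
    eps = rng.uniform(-half_width, half_width, size=n)
    estimates[s] = ols(X, X @ beta + eps)

mean_beta_hat = estimates.mean(axis=0)            # should be close to beta
cov_beta_hat = np.cov(estimates, rowvar=False)    # empirical Var(beta_hat)
theory_cov = sigma**2 * np.linalg.inv(X.T @ X)    # sigma^2 (X^T X)^{-1}
```

Comparing `mean_beta_hat` with `beta`, and `cov_beta_hat` with `theory_cov`, gives a rough numerical check of the two moment formulas; it is not a proof.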

Today

Verify:

$E ({\hat{σ}}^{2}) = E (\frac{1}{n - p} \sum_{i = 1}^{n} e_{i}^{2}) = σ^{2}$
Add the Normal assumption to get inference on the regression coefficients.

Go over the estimation of $σ$

Strategy: Write each residual $e_{i}$ as a linear combination of the uncorrelated errors $ϵ_{i}$, so that $\sum_{i = 1}^{n} e_{i}^{2}$ becomes a quadratic form in $ϵ$.

Write correlated residuals as combination of uncorrelated errors

Claim:

$| | e | |^{2} = ϵ^{T} (I - H) ϵ$

Your turn at home:

Show $(I - H) ϵ = e$ . Hint: substitute $ϵ = Y - X β$ , expand and use properties of $H$ .
Show $| | e | |^{2} = e^{T} e = ϵ^{T} (I - H) ϵ$ . Hint: substitute in $e = (I - H) ϵ$ from above and use properties of $(I - H)$ .
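Before working through the algebra, the claim can be checked numerically. The sketch below assumes NumPy and takes $H = X (X^{T} X)^{- 1} X^{T}$ to be the hat matrix from earlier lectures; the data and the name `M` for $I - H$ are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
n, p = 20, 4

# Made-up design, coefficients, and errors
X = rng.normal(size=(n, p))
beta = rng.normal(size=p)
eps = rng.normal(size=n)
Y = X @ beta + eps

# Hat matrix H = X (X^T X)^{-1} X^T (assumed definition) and M = I - H
H = X @ np.linalg.solve(X.T @ X, X.T)
M = np.eye(n) - H

e = Y - H @ Y                 # residuals computed from the data
e_from_eps = M @ eps          # residuals written in terms of the errors

rss = e @ e                   # ||e||^2
quad_form = eps @ M @ eps     # eps^T (I - H) eps
```

In practice $ϵ$ is unobserved, so this identity is a tool for derivations rather than for computation; the simulation can only check it because the errors were generated here.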

Find expected value of $| | e | |^{2}$ in terms of $trace (I - H)$

Show $E (ϵ^{T} (I - H) ϵ) = σ^{2} trace (I - H)$

Hint: $x^{T} A x = \sum_{i = 1}^{n} \sum_{j = 1}^{n} x_{i} x_{j} A_{i j}$ where $x = {(x_{1}, x_{2}, \dots, x_{n})}^{T}$ and $A = \begin{pmatrix} A_{11} & A_{12} & \dots \\ A_{21} & A_{22} & \dots \\ \vdots & \vdots & \ddots \end{pmatrix}_{n \times n}$

$E (ϵ^{T} (I - H) ϵ) =$
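One way to complete this step, sketched with the double-sum hint. It uses only that $H$ is fixed (because $X$ is fixed) and that $E (ϵ_{i} ϵ_{j}) = Cov (ϵ_{i}, ϵ_{j})$, which equals $σ^{2}$ when $i = j$ and $0$ otherwise, since $E (ϵ) = 0$ and $Var (ϵ) = σ^{2} I_{n}$. The notation $(I - H)_{i j}$ for the $(i, j)$ entry is introduced here for convenience.

```latex
\begin{aligned}
E\left(ϵ^{T} (I - H) ϵ\right)
  &= E\left(\sum_{i = 1}^{n} \sum_{j = 1}^{n} ϵ_{i} ϵ_{j} (I - H)_{i j}\right) \\
  &= \sum_{i = 1}^{n} \sum_{j = 1}^{n} (I - H)_{i j} \, E (ϵ_{i} ϵ_{j}) \\
  &= \sum_{i = 1}^{n} (I - H)_{i i} \, σ^{2} \\
  &= σ^{2} \, trace (I - H)
\end{aligned}
```

Only the diagonal terms survive because the errors are uncorrelated; no distributional assumption beyond the first two moments is used.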

Find $trace (I - H)$

Show $trace (I - H) = n - p$

Hint: $\begin{aligned} trace (A + B) & = trace (A) + trace (B) \\ trace (A B) & = trace (B A) \end{aligned}$

$trace (I - H) =$
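A possible way to fill this in, assuming the usual definition $H = X (X^{T} X)^{- 1} X^{T}$ from earlier lectures. The first line uses linearity of the trace, the third uses $trace (A B) = trace (B A)$ with $A = X$ and $B = (X^{T} X)^{- 1} X^{T}$.

```latex
\begin{aligned}
trace (I_{n} - H) &= trace (I_{n}) - trace (H) \\
  &= n - trace\left(X (X^{T} X)^{- 1} X^{T}\right) \\
  &= n - trace\left((X^{T} X)^{- 1} X^{T} X\right) \\
  &= n - trace (I_{p}) \\
  &= n - p
\end{aligned}
```

The inverse $(X^{T} X)^{- 1}$ exists because $X$ has rank $p$.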

Put it all together

$E ({\hat{σ}}^{2}) =$
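The pieces above give $E (| | e | |^{2}) = σ^{2} (n - p)$, so dividing the residual sum of squares by $n - p$ gives an unbiased estimate of $σ^{2}$. The Monte Carlo sketch below, assuming NumPy, illustrates this and contrasts it with dividing by $n$. All numerical values are invented, and Normal errors are used only for convenience, since the result needs just the mean and variance assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 30, 5
sigma = 2.0

# Made-up fixed design and coefficients
X = rng.normal(size=(n, p))
beta = np.arange(1.0, p + 1)

H = X @ np.linalg.solve(X.T @ X, X.T)   # hat matrix (assumed definition)
trace_M = np.trace(np.eye(n) - H)       # should equal n - p

n_sims = 20000
eps = rng.normal(scale=sigma, size=(n_sims, n))
Y = X @ beta + eps                      # each row is one simulated response

# H is symmetric, so Y @ H applies H to every row at once
resid = Y - Y @ H
rss = (resid**2).sum(axis=1)

sigma2_hat = rss / (n - p)              # divisor n - p: unbiased
sigma2_naive = rss / n                  # divisor n: biased downward

mean_sigma2_hat = sigma2_hat.mean()
mean_sigma2_naive = sigma2_naive.mean()
```

With these settings `mean_sigma2_hat` should land near $σ^{2}$, while `mean_sigma2_naive` should land near $σ^{2} (n - p) / n$.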

Inference on the regression coefficients

Normality assumption

Assume $ϵ \sim N (0, σ^{2} I)$ .

Important reminders:

Leads to: $Y \sim N (,)$

$\hat{β} \sim N (,)$
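A sketch of how these blanks are usually filled. It relies on the standard fact that a linear transformation of a multivariate Normal vector is again Normal: if $Z \sim N (μ, Σ)$ then $A Z + b \sim N (A μ + b, A Σ A^{T})$. The means and variances are the ones from the summary of last week.

```latex
\begin{aligned}
Y &= X β + ϵ \sim N\left(X β, \; σ^{2} I_{n}\right) \\
\hat{β} &= (X^{T} X)^{- 1} X^{T} Y \sim N\left(β, \; σ^{2} (X^{T} X)^{- 1}\right)
\end{aligned}
```

The Normal assumption adds the distributional shape; the mean and variance of $\hat{β}$ were already available without it.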

Inference on individual parameters

With the addition of the Normal assumption, it can be shown that

$\frac{\hat{β}_{j} - β_{j}}{S E (\hat{β}_{j})} \sim t_{n - p}$

which leads to the usual construction of tests and confidence intervals for single parameters.
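A minimal sketch of this construction, assuming NumPy and SciPy. The notes do not spell out the standard error, so the code assumes the usual choice $S E (\hat{β}_{j}) = \sqrt{{\hat{σ}}^{2} [(X^{T} X)^{- 1}]_{j j}}$. The data, the null hypothesis $β_{j} = 0$, and the 95% level are arbitrary illustrations.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
n, p = 40, 3

# Made-up design with an intercept, coefficients, and Normal errors
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
beta = np.array([1.0, 0.0, 2.0])
Y = X @ beta + rng.normal(scale=1.0, size=n)

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ Y

resid = Y - X @ beta_hat
sigma2_hat = resid @ resid / (n - p)           # unbiased estimate of sigma^2

# Assumed standard error: sqrt(sigma2_hat * j-th diagonal of (X^T X)^{-1})
se = np.sqrt(sigma2_hat * np.diag(XtX_inv))

# Two-sided t-tests of H0: beta_j = 0, one per coefficient
t_stat = beta_hat / se
p_value = 2 * stats.t.sf(np.abs(t_stat), df=n - p)

# 95% confidence intervals from the t distribution with n - p df
t_crit = stats.t.ppf(0.975, df=n - p)
ci_lower = beta_hat - t_crit * se
ci_upper = beta_hat + t_crit * se
```

Each test and interval here concerns a single coefficient; statements about several coefficients at once need different tools.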

Exercises

See handout.

Some details to tidy up Jan 23 2019
